KR20080008205A

KR20080008205A - Method of encoding and decoding video signal

Info

Publication number: KR20080008205A
Application number: KR1020070012315A
Authority: KR
Inventors: 박승욱; 전병문; 박지호
Original assignee: 엘지전자 주식회사
Priority date: 2006-07-19
Filing date: 2007-02-06
Publication date: 2008-01-23

Abstract

본 발명은 영상 신호를 인코딩 하고 디코딩 하는 방법에 관한 것이다. 본 발명에 따른 일 실시예에서, 레이어 간 텍스쳐 예측을 위해 전송되는 베이스 레이어와 퀄러티 레벨에 관한 정보를 이용하여, 베이스 레이어의 퀄러티 베이스뿐만 아니라 베이스 레이어의 FGS 레이어의 모션 정보를 기초로 현재 레이어 퀄러티 베이스 또는 현재 레이어의 FGS 레이어에 대한 레이어 간 모션 예측을 수행한다. 따라서, 레이어 간 모션 예측이 좀더 유연해지고 코딩 효율이 향상된다.The present invention relates to a method for encoding and decoding a video signal. In an embodiment of the present invention, the current layer quality is based on the motion information of the FGS layer of the base layer as well as the quality base of the base layer, using information on the base layer and the quality level transmitted for inter-layer texture prediction. Inter-layer motion prediction is performed on the FGS layer of the base or current layer. Thus, inter-layer motion prediction is more flexible and coding efficiency is improved.

Description

Method for encoding and decoding video signal

도 1은 베이스 레이어의 퀄러티 베이스만을 이용하는 종래의 레이어 간 모션 예측 방법을 개념적으로 나타낸 것이고,1 conceptually illustrates a conventional inter-layer motion prediction method using only a quality base of a base layer,

도 2는 베이스 레이어의 FGS 레이어를 이용하여 현재 레이어의 퀄러티 베이스에 대한 모션을 예측하는 본 발명의 일 실시예를 개념적으로 나타낸 것이고,FIG. 2 conceptually illustrates an embodiment of the present invention for predicting motion for quality base of a current layer using an FGS layer of a base layer,

도 3은 베이스 레이어의 FGS 레이어를 이용하여 현재 레이어의 FGS 레이어에 대한 모션을 예측하는 본 발명의 다른 실시예를 개념적으로 나타낸 것이고,3 conceptually illustrates another embodiment of the present invention for predicting motion for an FGS layer of a current layer using an FGS layer of a base layer,

도 4는 본 발명에 따른 영상 신호의 인코딩 방법이 적용되는 인코딩 장치의 구성을 나타낸 것이고,4 illustrates a configuration of an encoding apparatus to which an encoding method of a video signal according to the present invention is applied.

도 5는 도 4의 장치에 의해 인코딩 된 비트 스트림을 디코딩 하는 장치의 구성을 나타낸 것이다.FIG. 5 shows a configuration of an apparatus for decoding a bit stream encoded by the apparatus of FIG. 4.

<도면의 주요 부분에 대한 부호의 설명><Explanation of symbols for the main parts of the drawings>

110 : BL 인코더 120 : EL 인코더110: BL encoder 120: EL encoder

130 : 먹서 210 : 디먹서130: Eat 210: Demux

220 : BL 디코더 230 : EL 디코더220: BL decoder 230: EL decoder

본 발명은 영상 신호를 인코딩 하고 디코딩 하는 방법에 관한 것이다.The present invention relates to a method for encoding and decoding a video signal.

스케일러블 영상 코덱(SVC : Scalable Video Codec) 방식은 영상 신호를 인코딩 함에 있어, 최고 화질로 인코딩 하되, 그 결과로 생성된 픽처 시퀀스의 부분 시퀀스(시퀀스 전체에서 간헐적으로 선택된 프레임의 시퀀스)를 디코딩 하여 사용해도 어느 정도 화질의 영상 표현이 가능하도록 하는 방식이다.The scalable video codec (SVC) method encodes a video signal, and encodes at the highest quality, but decodes a partial sequence of the resultant picture sequence (a sequence of intermittently selected frames in the entire sequence). It is a method to allow a certain level of image expression even if used.

스케일러블 방식으로 인코딩 된 픽처 시퀀스는 그 부분 시퀀스만을 수신하여 처리함으로써도 어느 정도 화질의 영상 표현이 가능하지만, 비트 레이트(bit rate)가 낮아지는 경우 화질 저하가 크게 나타난다. 이를 해소하기 위해서, 낮은 전송률을 위한 별도의 보조 픽처 시퀀스, 예를 들어 소화면 및/또는 초당 프레임 수 등이 낮은 픽처 시퀀스를 제공할 수도 있다.A picture sequence encoded in a scalable manner can be represented to a certain degree of image quality by receiving and processing only a partial sequence. However, when a bit rate is lowered, image quality deterioration is large. In order to solve this, a separate auxiliary picture sequence for a low data rate, for example, a small picture and / or a low picture sequence per frame may be provided.

보조 픽처 시퀀스를 베이스 레이어(base layer)로, 주 픽처 시퀀스를 인핸스드(enhanced) 레이어 또는 인핸스먼트(enhancement) 레이어라고 부른다. 베이스 레이어와 인핸스드 레이어는 동일한 영상 신호원을 인코딩 한 것으로, 두 레이어의 영상 신호에는 잉여 정보(리던던시(redundancy))가 존재한다. 따라서, 베이스 레이어가 제공되는 경우에는, 코딩 효율을 높이기 위해 베이스 레이어의 모션 정보 및/또는 이미지 데이터인 텍스처(texture) 정보를 이용하여 인핸스드 레이어의 영 상 신호를 예측하여 인코딩 하는 레이어 간 예측 방법(interlayer prediction method)을 사용할 수 있다.The auxiliary picture sequence is called a base layer, and the main picture sequence is called an enhanced layer or an enhancement layer. The base layer and the enhanced layer encode the same video signal source, and redundant information (redundancy) exists in the video signals of the two layers. Therefore, when a base layer is provided, an inter-layer prediction method for predicting and encoding an image signal of an enhanced layer using motion information and / or texture information, which is image data, of the base layer to improve coding efficiency. (interlayer prediction method) can be used.

또한, 베이스 레이어의 SNR(Signal-to-Noise Ratio)을 향상, 즉 화질을 개선하기 위하여 인핸스드 레이어가 사용될 수도 있는데, 이를 SNR 스케일러빌러티, FGS(Fine Granular Scalability), 또는 점증적 리파인먼트(Progressive Refinement)라 한다.In addition, an enhanced layer may be used to improve the signal-to-noise ratio (SNR) of the base layer, that is, to improve image quality, which may be used for SNR scalability, fine granular scalability (GFS), or incremental refinement (FGS). Progressive Refinement).

FGS에 따르면, 각 화소에 대응되는 변환 계수(transform coefficients), 예를 들어 DCT(Discrete Cosine Transform) 계수가 비트 표현상의 해상도에 따라 베이스 레이어(일명 퀄러티 베이스(quality base))와 FGS 레이어로 나누어 인코딩 되고, 전송 환경이 나쁜 경우에는 FGS 레이어의 전송이 생략되어, 디코딩 되는 영상의 화질을 저하시키면서 비트 레이트를 낮출 수 있다. 즉, FGS 레이어는, 퀄러티 베이스의 양자화 과정에서 발생하는 손실을 보상하기 위한 것으로, 전송 또는 디코딩 환경에 대응하여 비트 레이트를 제어할 수 있는 높은 유연성을 제공한다.According to the FGS, transform coefficients corresponding to each pixel, for example, DCT (Discrete Cosine Transform) coefficients, are divided into a base layer (also called a quality base) and an FGS layer according to the resolution of the bit representation. If the transmission environment is bad, transmission of the FGS layer may be omitted, thereby lowering the bit rate while reducing the quality of the decoded video. That is, the FGS layer compensates for the loss occurring during the quantization of the quality base, and provides a high flexibility of controlling the bit rate in response to a transmission or decoding environment.

예를 들어, 변환 계수를 양자화 스텝 사이즈(일명 QP), 예를 들어 QP=32로 양자화하여 베이스 레이어가 생성되었다면, FGS 레이어 레벨 1(일명 FGS 레벨 1(FGS L1))은, 원래의 변환 계수와 베이스 레이어(퀄러티 베이스)의 양자화된 계수를 역양자화하여 구한 변환 계수와의 차이에 대해서 QP 32보다 더 높은 품질에 대응되는 양자화 스텝 사이즈인, 예를 들어 QP=26으로 양자화하여 생성된다. 마찬가지로, FGS 레벨 2(FGS L2)는, 원래의 변환 계수와, 퀄러티 베이스 및 FGS 레벨 1의 양자화된 계수의 합을 역양자화하여 구한 변환 계수와의 차이에 대해서, 예를 들어 QP=20으로 양자화하여 생성된다.For example, if the base layer is generated by quantizing the transform coefficients to a quantization step size (aka QP), for example QP = 32, then FGS layer level 1 (aka FGS level 1 (FGS L1)) is the original transform coefficient. The difference between the transform coefficient obtained by inverse quantization of the quantized coefficient of the base layer (quality base) and the quantization step size corresponding to a higher quality than QP 32 is generated by quantization, for example, QP = 26. Similarly, FGS level 2 (FGS L2) quantizes, for example, QP = 20 for the difference between the original transform coefficient and the transform coefficient obtained by inverse quantization of the sum of the quality base and the quantized coefficient of FGS level 1. Is generated.

한편, 레이어 간 예측 중 레이어 간 모션 예측은, 베이스 레이어의 모션 정보를 이용하여 인핸스드 레이어, 즉 현재 레이어의 영상 신호를 예측하여 인코딩 한다.Meanwhile, inter-layer motion prediction during inter-layer prediction predicts and encodes an image signal of an enhanced layer, that is, a current layer, using motion information of the base layer.

또한, 현재의 JSVM(Joint Scalable Video Model)은, FGS와 관련하여 잉여 정보를 효율적으로 이용할 수 있도록, FGS L1, FGS L2 등의 FGS 레이어에도 모션 정보를 전송할 수 있도록 하고 있다. 퀄러티 베이스에 비해 더 높은 화질을 제공하는 FGS 레이어에 전송되는 모션 정보는 퀄러티 베이스의 모션 정보에 비해 정밀도가 더 뛰어나다.In addition, the current Joint Scalable Video Model (JSVM) allows motion information to be transmitted to FGS layers such as FGS L1 and FGS L2 so that redundant information can be efficiently used in connection with FGS. Motion information transmitted to the FGS layer, which provides higher image quality compared to quality base, is more accurate than motion information based on quality.

하지만, 현재의 레이어 간 모션 예측은, 도 1에 도시한 바와 같이, 베이스 레이어에 모션 정보를 포함하는 하나 이상의 FGS 레이어(레벨)이 있는 경우에도 FGS 레이어의 모션 정보를 이용하지 못하고 정밀도가 떨어지는 퀄러티 베이스의 모션 정보만을 이용하고 있는 실정이다.However, as shown in FIG. 1, the current inter-layer motion prediction does not use the motion information of the FGS layer and has low precision even when there is one or more FGS layers (levels) including the motion information in the base layer. Only the base motion information is used.

본 발명은 이러한 사정을 감안하여 창작된 것으로서, 본 발명의 목적은, 레이어 간 예측에서 코딩 효율을 향상시키기 위한 것으로, 베이스 레이어의 FGS 레이어에 포함된 모션 정보를 이용하는 방법을 제공하는데 있다.The present invention was created in view of the above circumstances, and an object of the present invention is to improve coding efficiency in inter-layer prediction, and to provide a method of using motion information included in an FGS layer of a base layer.

상기한 목적을 달성하기 위해 본 발명의 일 실시예에 따른 영상 신호를 인코딩 하는 방법은, 동일한 영상 신호로부터 인코딩 되는 하위 레이어의 모션 정보를 이용하여 현재 레이어의 비트 스트림을 생성하되, 상기 하위 레이어는 하위 레벨의 인코딩에서 발생하는 손실을 보상하기 위한 하나 이상의 레벨을 갖는 제 1 레이어를 포함하고, 상기 제 1 레이어에서 모션 정보를 포함하는 소정의 레벨이 상기 현재 레이어의 비트 스트림 생성에 이용되는 것을 특징으로 한다.In order to achieve the above object, a method of encoding a video signal according to an embodiment of the present invention generates a bit stream of a current layer by using motion information of a lower layer encoded from the same video signal. A first layer having one or more levels for compensating for losses occurring in lower level encoding, wherein a predetermined level comprising motion information in the first layer is used to generate the bit stream of the current layer It is done.

또한, 본 발명의 다른 실시예에 따른 인코딩 된 영상 비트 스트림을 디코딩 하는 방법은, 동일한 영상 신호에 대한 하위 레이어의 비트 스트림을 디코딩 하고, 상기 디코딩 되는 하위 레이어의 모션 정보를 이용하여 현재 레이어의 비트 스트림을 디코딩 하되, 상기 하위 레이어는 하위 레벨의 인코딩에서 발생하는 손실을 보상하기 위한 하나 이상의 레벨을 갖는 제 1 레이어를 포함하고, 상기 제 1 레이어에서 모션 정보를 포함하는 소정의 레벨이 상기 현재 레이어의 디코딩에 이용되는 것을 특징으로 한다.In addition, the method of decoding an encoded video bit stream according to another embodiment of the present invention, decoding the bit stream of the lower layer for the same video signal, using the motion information of the decoded lower layer bits of the current layer Decode a stream, wherein the lower layer comprises a first layer having one or more levels for compensating for losses incurred in lower level encoding, wherein the predetermined layer containing motion information in the first layer is the current layer; It is characterized in that it is used for decoding.

본 발명의 일 예에서, 상기 현재 레이어의 헤더 영역에는 상기 하위 레이어 및 상기 하위 레이어 내의 레벨을 가리키는 정보가 기록되는데, 상기 정보가 가리키는 레벨이 모션 정보를 포함하지 않는 경우 상기 정보가 가리키는 레벨보다 낮고 모션 정보를 갖는 하위 레벨이 상기 현재 레이어의 비트 스트림 생성에 이용될 수 있다. 상기 모션 정보는 모션 벡터를 포함한다.In an example of the present invention, information indicating the lower layer and the level in the lower layer is recorded in the header area of the current layer, and the level indicated by the information is lower than the level indicated by the information if the level does not include motion information. A lower level with motion information may be used to generate the bit stream of the current layer. The motion information includes a motion vector.

또한, 본 발명의 다른 실시예에 따른 영상 신호를 인코딩 하는 방법은, 동일한 영상 신호로부터 인코딩 되는 하위 레이어를 이용하여 현재 레이어의 비트 스트 림을 생성하되, 상기 현재 레이어와 하위 레이어는 각각 하위 레벨의 인코딩에서 발생하는 손실을 보상하기 위한 하나 이상의 레벨을 갖는 제 1 및 제 2 레이어를 포함하고, 상기 제 1 레이어의 임의의 레벨은 상기 제 1 레이어 내의 하위 레벨의 모션 정보 또는 상기 제 2 레이어 내의 소정의 레벨의 모션 정보를 이용하여 인코딩 되고, 상기 제 1 레이어의 임의의 레벨의 인코딩에 현재 레이어 또는 하위 레이어 중 어떤 레이어의 모션 정보가 사용되는지를 가리키는 정보가 기록되는 것을 특징으로 한다.In addition, according to another embodiment of the present invention, a method of encoding a video signal generates a bit stream of a current layer by using a lower layer encoded from the same video signal, wherein the current layer and the lower layer each have a lower level. First and second layers having one or more levels to compensate for losses arising in the encoding, wherein any level of the first layer is a lower level of motion information in the first layer or a predetermined level in the second layer. It is encoded using the motion information of the level of, characterized in that the information indicating which motion information of the current layer or the lower layer is used for encoding any level of the first layer is characterized in that it is recorded.

또한, 본 발명의 또 다른 실시예에 따른 인코딩 된 영상 비트 스트림을 디코딩 하는 방법은, 동일한 영상 신호에 대한 하위 레이어의 비트 스트림을 디코딩 하고, 상기 디코딩 되는 하위 레이어를 이용하여 현재 레이어의 비트 스트림을 디코딩 하되, 상기 현재 레이어와 하위 레이어는 각각 하위 레벨의 인코딩에서 발생하는 손실을 보상하기 위한 하나 이상의 레벨을 갖는 제 1 및 제 2 레이어를 포함하고, 상기 제 1 레이어의 임의의 레벨의 디코딩에 현재 레이어 또는 하위 레이어 중 어떤 레이어의 모션 정보가 사용되는지를 가리키는 정보가 독출되고, 이를 기초로 상기 제 1 레이어의 임의의 레벨은 상기 제 1 레이어 내의 하위 레벨의 모션 정보 또는 상기 제 2 레이어 내의 소정의 레벨의 모션 정보를 이용하여 디코딩 되는 것을 특징으로 한다.In addition, the method of decoding an encoded video bit stream according to another embodiment of the present invention, decoding the bit stream of the lower layer for the same video signal, and decodes the bit stream of the current layer using the decoded lower layer Wherein the current layer and the lower layer each comprise a first and a second layer having one or more levels to compensate for the loss arising in the lower level of encoding, and are present in decoding any level of the first layer. Information indicating which motion information of a layer or a lower layer is used is read out, and based on this, any level of the first layer is based on motion information of a lower level in the first layer or a predetermined level in the second layer. It is characterized in that the decoding using the level of the motion information.

본 발명의 일 예에서, 상기 정보는 레벨에 대응되는 슬라이스 단위 또는 레벨 내의 영상 블록 단위로 정의될 수 있다. 또한, 상기 정보가 하위 레이어의 모션 정보가 사용되는 것을 가리킬 때 상기 제 1 레이어를 제외한 현재 레이어의 비 트 스트림 생성에 이용되는 하위 레이어와 레벨로부터 상기 임의의 레벨의 인코딩에 이용되는 소정의 레벨이 유도되는데, 상기 제 1 레이어를 제외한 현재 레이어의 비트 스트림 생성에 이용되는 하위 레이어의 제 2 레이어 내의 레벨 n이 모션 정보를 포함하지 않는 경우 상기 레벨 n보다 낮고 모션 정보를 갖는 하위 레벨이 상기 임의의 레벨의 인코딩에 이용될 수 있다.In one example of the present invention, the information may be defined in a slice unit corresponding to a level or in an image block unit in a level. In addition, when the information indicates that motion information of a lower layer is used, a predetermined level used for encoding the arbitrary level is selected from a lower layer and a level used for generating a bit stream of the current layer except for the first layer. It is derived that if the level n in the second layer of the lower layer used to generate the bit stream of the current layer except for the first layer does not include motion information, the lower level lower than the level n and having the motion information is the arbitrary level. Can be used for encoding of levels.

이하, 본 발명의 바람직한 실시예에 대해 첨부 도면을 참조하여 상세히 설명한다.Hereinafter, preferred embodiments of the present invention will be described in detail with reference to the accompanying drawings.

현재의 JSVM에 따르면, 레이어 간 예측 중 레이어 간 텍스쳐 예측에서는, 베이스 레이어의 퀄러티 베이스뿐만 아니라 베이스 레이어의 FGS 레이어를 이용하여 현재 레이어의 영상 신호를 인코딩 하고 있는데, 레이어 간 예측에 이용한 베이스 레이어와 해당 베이스 레이어의 FGS 레이어를 포함하는 정보, 예를 들어 base_id_plus1를 슬라이스(slice) 헤더에 포함시켜 전송한다.According to the current JSVM, in inter-layer texture prediction during inter-layer prediction, the video signal of the current layer is encoded not only by the quality base of the base layer but also by using the FGS layer of the base layer. Information including the FGS layer of the base layer, for example, base_id_plus1 is included in the slice header and transmitted.

디코더는, 비트 스트림의 슬라이스 헤더에 포함된 base_id_plus1로부터 레이어 간 예측에 이용할 베이스 레이어를 결정하고, 또한 상기 결정된 베이스 레이어 내의 퀄러티 베이스 또는 FGS 레이어(퀄러티 레벨)를 구별하기 위한 정보, 예를 들어 QualityBaseLevel을 상기 base_id_plus1로부터 유도하여 레이어 간 예측에 이용할 퀄러티 베이스 또는 FGS 레이어를 결정한다. 하지만, 이러한 정보는 레이어 간 모션 예측에는 사용되지 않고 있었다.The decoder determines the base layer to be used for inter-layer prediction from base_id_plus1 included in the slice header of the bit stream, and also provides information for identifying a quality base or FGS layer (quality level) in the determined base layer, for example, QualityBaseLevel. A quality base or an FGS layer to be used for inter-layer prediction is determined from the base_id_plus1. However, this information has not been used for inter-layer motion prediction.

이와 같이 레이어 간 텍스쳐 예측을 위해 베이스 레이어와 퀄러티 레벨에 관 한 정보가 전송되고 있으므로, 본 발명에서는, 새로운 정보의 추가적인 전송 없이 이러한 정보를 이용하여, 베이스 레이어의 퀄러티 베이스뿐만 아니라 베이스 레이어의 FGS 레이어의 모션 정보를 기초로 하는 레이어 간 모션 예측을 수행할 수 있다.As information about the base layer and the quality level are transmitted for the inter-layer texture prediction as described above, in the present invention, the FGS layer of the base layer as well as the quality base of the base layer can be used by using such information without additional transmission of new information. Inter-layer motion prediction may be performed based on the motion information.

본 발명의 일 실시예에서는, 도 2에 도시한 바와 같이, 현재 레이어(인핸스드 레이어)의 퀄러티 베이스의 모션 예측에 어떤 레이어의 어떤 FGS 레벨의 모션 벡터가 이용되었는지를 슬라이스 헤더에 정의되는 base_id_plus1와 상기 base_id_plus1에 포함된 QualityLevelBase를 기초로 확인한다.In an embodiment of the present invention, as shown in FIG. 2, base_id_plus1 defined in a slice header indicates whether a motion vector of which FGS level of which layer is used for quality based motion prediction of a current layer (enhanced layer). Check based on QualityLevelBase included in base_id_plus1.

상기 base_id_plus1은, 슬라이스 단위로 정의되는데, 해당 슬라이스 내의 매크로블록의 레이어 간 예측에 어떤 베이스 레이어가 이용되었는지 가리키는 소정 비트, 예를 들어 4 비트의 정보, 상기 베이스 레이어에서 어떤 FGS 레이어가 이용되었는지 가리키는 소정 비트, 예를 들어 2 비트의 정보(QualityBaseLevel) 등을 포함하여 구성될 수 있다.The base_id_plus1 is defined in units of slices, and includes predetermined bits indicating which base layer is used for inter-layer prediction of macroblocks in the slice, for example, information of 4 bits, and indicating which FGS layer is used in the base layer. Bits, for example, two bits of information (QualityBaseLevel) and the like.

현재 레이어(인핸스드 레이어) 아래의 베이스 레이어는 각각 1 내지 3의 FGS 레이어(FGS L1 ~ L3)를 더 포함할 수 있고, FGS L0는 해당 베이스 레이어의 퀄러티 베이스에 해당한다.The base layer below the current layer (enhanced layer) may further include 1 to 3 FGS layers FGS L1 to L3, respectively, and FGS L0 corresponds to the quality base of the base layer.

따라서, 슬라이스 헤더 내의 base_id_plus1에 포함된 QualityLevelBase가 0이면, 상기 base_id_plus1가 가리키는 베이스 레이어의 퀄러티 베이스(FGS L0)의 모션 및/또는 텍스쳐 정보가 현재 레이어에 대한 레이어 간 예측에 이용된다. 마찬가지로, 상기 QualityLevelBase가 11이면, 상기 base_id_plus1가 가리키는 베이 스 레이어의 퀄러티 베이스(FGS L3)의 모션 및/또는 텍스쳐 정보가 현재 레이어에 대한 레이어 간 예측에 이용된다.Therefore, if QualityLevelBase included in base_id_plus1 in the slice header is 0, motion and / or texture information of the quality base FGS L0 of the base layer indicated by base_id_plus1 is used for inter-layer prediction for the current layer. Similarly, if the QualityLevelBase is 11, motion and / or texture information of the quality base (FGS L3) of the base layer indicated by the base_id_plus1 is used for inter-layer prediction for the current layer.

이때, 상기 base_id_plus1와 이에 포함된 QualityLevelBase가 가리키는 베이스 레이어의 FGS 레이어에 모션 정보가 포함되지 않는 경우, 상기 base_id_plus1가 가리키는 베이스 레이어의 퀄러티 베이스의 모션 정보가 현재 레이어에 대한 레이어 간 모션 예측에 이용될 수 있고, 또는, 상기 QualityLevelBase가 가리키는 FGS 레이어보다 아래에 있는 모션 정보를 포함하는 다른 FGS 레이어의 모션 정보가 현재 레이어에 대한 레이어 간 모션 예측에 이용될 수도 있다.In this case, when motion information is not included in the FGS layer of the base layer indicated by the base_id_plus1 and QualityLevelBase included therein, the motion information of the quality base of the base layer indicated by the base_id_plus1 may be used for inter-layer motion prediction for the current layer. Alternatively, motion information of another FGS layer including motion information below the FGS layer indicated by the QualityLevelBase may be used for inter-layer motion prediction for the current layer.

본 발명의 다른 실시예에서는, 도 3에 도시한 바와 같이, 베이스 레이어의 FGS 레이어의 모션 정보를 이용하여 현재 레이어(인핸스드 레이어)의 FGS 레이어에 대한 모션을 예측한다.In another embodiment of the present invention, as shown in FIG. 3, the motion information for the FGS layer of the current layer (enhanced layer) is predicted using the motion information of the FGS layer of the base layer.

현재의 JSVM에서, FGS 레이어에 대해서는 해당 레이어(현재 레이어)의 아래 레벨의 FGS 레이어(또는 퀄러티 베이스)만을 이용하여 텍스쳐 및/또는 모션 벡터가 예측되므로, 다른 레이어를 이용한 예측, 즉 레이어 간 예측은 적용되지 않고, 이에 따라 FGS 레이어의 슬라이스에는 base_id_plus1 정보가 전송되지 않는다.In the current JSVM, texture and / or motion vectors are predicted using only the lower level FGS layer (or quality base) of that layer (the current layer) for the FGS layer, so prediction with other layers, i.e. inter-layer prediction In this case, base_id_plus1 information is not transmitted to the slice of the FGS layer.

따라서, 현재 레이어의 FGS 레이어에 대한 레이어 간 모션 예측을 위해서는, 해당 레이어(현재 레이어)의 아래 레벨의 FGS 레이어(또는 퀄러티 베이스)의 모션 정보를 이용할 것인지 다른 레이어(베이스 레이어)의 FGS 레이어를 이용할 것인지를 가리키는 정보, 예를 들어 1 비트의 motion_base_prediction_flag가 전송되어야 하는데, 상기 motion_base_prediction_flag는 매크로블록 단위로 정의되거나 또는 슬라이스 단위로 정의될 수 있다.Therefore, for inter-layer motion prediction for the FGS layer of the current layer, the motion information of the FGS layer (or quality base) of the lower level of the layer (current layer) or the FGS layer of another layer (base layer) is used. Information indicating whether or not, for example, motion_base_prediction_flag of 1 bit should be transmitted. The motion_base_prediction_flag may be defined in macroblock units or slice units.

상기 motion_base_prediction_flag는, 현재 레이어의 아래 레벨의 FGS 레이어(또는 퀄러티 베이스)의 모션 정보를 이용하여 현재 레이어의 FGS 레이어의 모션이 예측되는 경우에는 예를 들어 '0'의 값이 주어지고, 다른 레이어(베이스 레이어)의 FGS 레이어의 모션 정보를 이용하여 현재 레이어의 FGS 레이어의 모션이 예측되는 경우에는 예를 들어 '1'의 값이 주어질 수 있다.The motion_base_prediction_flag is given a value of '0', for example, when the motion of the FGS layer of the current layer is predicted using the motion information of the FGS layer (or the quality base) of the lower level of the current layer. When the motion of the FGS layer of the current layer is predicted using the motion information of the FGS layer of the base layer), for example, a value of '1' may be given.

다른 레이어(베이스 레이어)의 FGS 레이어의 모션 정보가 이용되는 경우, 즉 motion_base_prediction_flag가 '1'인 경우, 현재 레이어의 FGS 레이어의 모션의 예측에 어떤 베이스 레이어와 어떤 FGS 레이어가 이용될지는 현재 레이어의 퀄러티 베이스 슬라이스에 정의되는 base_id_plus1 정보를 기초로 결정될 수 있다.When the motion information of the FGS layer of another layer (base layer) is used, that is, motion_base_prediction_flag is '1', the quality of the current layer to determine which base layer and which FGS layer are used to predict the motion of the FGS layer of the current layer. It may be determined based on base_id_plus1 information defined in the base slice.

도 4는 본 발명에 따른 영상 신호의 인코딩 방법이 적용되는 인코딩 장치의 구성을 도시한 것이다.4 illustrates a configuration of an encoding apparatus to which an encoding method of a video signal according to the present invention is applied.

상기 인코딩 장치는, 베이스 레이어의 데이터 스트림을 기초로 하는 레이어 간 예측을 통해 영상 신호에 대한 인핸스드 레이어를 생성하는 인핸스드 레이어 인코더(EL Encoder)(120) 및 생성되는 인핸스드 레이어의 데이터 스트림 및/또는 베이스 레이어의 데이터 스트림을 기 지정된 포맷으로 인캡슐(encapsulate) 한 다음 기 지정된 전송 포맷으로 상호 먹싱 하여 출력하는 먹서(Muxer)(130)를 포함하여 구성될 수 있다.The encoding apparatus includes an enhanced layer encoder (EL Encoder) 120 for generating an enhanced layer for an image signal through inter-layer prediction based on a data stream of a base layer, and a data stream of the generated enhanced layer; And / or a muxer 130 for encapsulating the data stream of the base layer in a predetermined format and then muxing and outputting each other in a predetermined transmission format.

다른 인코딩 장치에 의해 다른 용도로 이미 생성된 베이스 레이어의 데이터 스트림이 레이어 간 예측을 통한 인핸스드 레이어의 생성을 위해 상기 인코딩 장치에 제공될 수 있다. 또는, 상기 인코딩 장치는 동일한 소오스의 영상 신호에 대한 베이스 레이어를 생성하는 베이스 레이어 인코더(BL Encoder)(110)를 더 포함하여 직접 베이스 레이어를 생성하여 레이어 간 예측에 이용할 수도 있다.A data stream of a base layer already generated for another use by another encoding device may be provided to the encoding device for generation of an enhanced layer through inter-layer prediction. Alternatively, the encoding apparatus may further include a base layer encoder 110 generating a base layer of an image signal of the same source, and may directly generate a base layer to use for inter-layer prediction.

상기 먹서(130)는 베이스 레이어와 인핸스드 레이어를 상호 먹싱 하여 출력하는데, 디코딩 장치가 인핸스드 레이어의 데이터 스트림과는 별개의 다른 경로를 통해 베이스 레이어의 데이터 스트림을 얻을 수 있는 경우를 고려하여, 상기 EL 인코더(120)에서 생성되는 인핸스드 레이어의 데이터 스트림만을 먹싱 할 수도 있다.The muxer 130 muxes and outputs the base layer and the enhanced layer, considering that the decoding apparatus can obtain the data stream of the base layer through a path separate from the data stream of the enhanced layer. Only data streams of the enhanced layer generated by the EL encoder 120 may be muxed.

상기 EL 인코더(120)는, 현재 레이어의 퀄러티 베이스를 인코딩 할 때, 아래 레이어(베이스 레이어)의 퀄러티 베이스뿐만 아니라 FGS 레이어를 이용하여 레이어 간 예측, 즉 레이어 간 텍스쳐 예측 및/또는 레이어 간 모션 예측을 할 수 있는데, 레이어 간 예측에 이용된 베이스 레이어와 해당 베이스 레이어의 FGS 레이어(또는 퀄러티 베이스)를 가리키기 위한 정보(base_id_plus1)를 현재 레이어의 퀄러티 베이스 슬라이스 헤더에 삽입한다.When encoding the quality base of the current layer, the EL encoder 120 uses the FGS layer as well as the quality base of the lower layer (base layer) to perform inter-layer prediction, that is, inter-layer texture prediction and / or inter-layer motion prediction. The base layer used for inter-layer prediction and information (base_id_plus1) indicating the FGS layer (or the quality base) of the base layer are inserted into the quality base slice header of the current layer.

상기 EL 인코더(120)는, 현재 레이어의 퀄러티 베이스에 대한 레이어 간 텍스쳐 예측에서 임의의 베이스 레이어의 임의의 FGS 레벨(레이어)를 사용하였지만 상기 임의의 FGS 레벨이 모션 정보를 포함하지 않는 경우, 현재 레이어의 퀄러티 베이스에 대한 레이어 간 모션 예측에서는 상기 임의의 베이스 레이어의 퀄러티 베이스의 모션 정보나 상기 임의의 FGS 레벨보다 아래에 있는 모션 정보를 포함하는 FGS 레벨의 모션 정보를 이용할 수도 있다. 이 경우, 상기 EL 인코더(120)는, 현재 레이어의 퀄러티 베이스 슬라이스 헤더에는 레이어 간 텍스쳐 예측에서 사용된 베이스 레이어와 FGS 레벨을 가리키는 정보(base_id_plus1)를 삽입한다.The EL encoder 120 uses any FGS level (layer) of any base layer in inter-layer texture prediction for the quality base of the current layer, but if the any FGS level does not include motion information, In inter-layer motion prediction for a quality base of a layer, motion information of an FGS level including motion information of the quality base of the arbitrary base layer or motion information below the arbitrary FGS level may be used. In this case, the EL encoder 120 inserts information (base_id_plus1) indicating the base layer and the FGS level used in the inter-layer texture prediction in the quality base slice header of the current layer.

또한, 상기 EL 인코더(120)는, 현재 레이어의 FGS 레이어에 모션 정보를 제공하면서 인코딩 할 때, 현재 레이어의 아래 레벨의 FGS 레이어(또는 퀄러티 베이스)의 모션 정보를 이용하거나 또는 다른 레이어(베이스 레이어)의 FGS 레이어(또는 퀄러티 베이스)의 모션 정보를 이용하여 레이어 간 모션 예측을 할 수 있는데, 이를 구별하기 위한 1 비트의 정보(motion_base_prediction_flag)를 매크로블록 단위 또는 슬라이스 단위로 정의하여 매크로블록 헤더 또는 FGS 레이어 슬라이스 헤더에 삽입한다.In addition, the EL encoder 120 uses motion information of an FGS layer (or quality base) below the current layer or encodes another layer (base layer) when encoding while providing motion information to the FGS layer of the current layer. Motion prediction between layers can be performed using the motion information of the FGS layer (or quality base) of.) A macroblock header or FGS is defined by defining one bit information (motion_base_prediction_flag) in units of macroblocks or slices. Insert into the layer slice header.

인코딩 된 데이터 스트림은 유선 또는 무선으로 디코딩 장치에 전송되거나 기록 매체를 매개로 하여 전달되며, 디코딩 장치는 이후 설명하는 방법에 따라 원래의 영상 신호를 복원하게 된다.The encoded data stream is wired or wirelessly transmitted to the decoding apparatus or transmitted via a recording medium, and the decoding apparatus restores the original video signal according to the method described later.

도 5는 도 4의 장치에 의해 인코딩 된 비트 스트림을 디코딩 하는 장치의 구성을 도시한 것이다.FIG. 5 illustrates a configuration of an apparatus for decoding a bit stream encoded by the apparatus of FIG. 4.

상기 디코딩 장치는, 전송 포맷으로 수신되는 비트 스트림에서 인핸스드 레이어와 베이스 레이어의 데이터 스트림을 분리하여 출력하는 디먹서(Demuxer)(210), 베이스 레이어의 데이터 스트림으로부터 베이스 레이어의 영상 신호를 복원하는 베이스 레이어 디코더(BL Decoder)(220), 및 상기 디먹서(210)로 부터 출력되는 인핸스드 레이어의 데이터 스트림으로부터 인핸스드 레이어의 영상 신호를 복원하되, 해상도, 프레임 레이트 등을 달리하는 베이스 레이어의 텍스처 및/또는 모션 정보를 기초로 레이어 간 예측을 통해 인핸스드 레이어의 영상 신호를 복원하는 인핸스드 레이어 디코더(EL Decoder)(230)를 포함하여 구성될 수 있다.The decoding apparatus includes a demuxer 210 that separates and outputs an enhanced layer and a data stream of a base layer from a bit stream received in a transmission format, and restores an image signal of the base layer from a data stream of the base layer. The base layer decoder (BL Decoder) 220 and the base layer to restore the image signal of the enhanced layer from the data stream of the enhanced layer output from the demuxer 210, the base layer having a different resolution, frame rate, etc. And an enhanced layer decoder (EL Decoder) 230 which reconstructs an image signal of the enhanced layer through inter-layer prediction based on texture and / or motion information.

상기 베이스 레이어의 데이터 스트림은 인핸스드 레이어의 데이터 스트림과 먹싱 된 상태로 제공될 수도 있고 또는 별도의 경로를 통해 제공될 수도 있다. 예를 들어, 베이스 레이어는 DVD, Blu-ray와 같은 저장 매체를 통해 제공되고, 상기 베이스 레이어와 관련된 인핸스드 레이어는 유선 또는 무선 네트워크를 통해 전송되거나 또는 다른 저장 매체를 통해 제공될 수도 있다.The data stream of the base layer may be provided muxed with the data stream of the enhanced layer or may be provided through a separate path. For example, the base layer may be provided via a storage medium such as DVD, Blu-ray, and the enhanced layer associated with the base layer may be provided via a wired or wireless network or through another storage medium.

상기 EL 디코더(230)는, 레이어 간 모션 예측을 통해 인코딩 된, 인핸스드 레이어(현재 레이어)의 퀄러티 베이스를 디코딩 할 때, 현재 레이어의 퀄러티 베이스 슬라이스 헤더에서 레이어 간 예측에 이용된 아래 레이어(베이스 레이어)와 해당 베이스 레이어의 FGS 레이어(레벨)을 가리키는 정보(base_id_plus1)를 먼저 독출하여 이로부터 베이스 레이어와 해당 FGS 레벨(QualityLevelBase)을 확인하고, 확인된 베이스 레이어의 FGS 레벨의 모션 정보를 이용하여 현재 레이어의 퀄러티 베이스에 대한 레이어 간 모션 예측을 수행한다.When the EL decoder 230 decodes the quality base of the enhanced layer (current layer) encoded through inter-layer motion prediction, the lower layer (base) used for inter-layer prediction in the quality base slice header of the current layer. Layer) and information (base_id_plus1) indicating the FGS layer (level) of the base layer are read first, and the base layer and the corresponding FGS level (QualityLevelBase) are checked therefrom, and the motion information of the FGS level of the checked base layer is used. Inter-layer motion prediction is performed on the quality base of the current layer.

상기 EL 디코더(230)는, 상기 base_id_plus1으로부터 유도된 QualityLevelBase가 예를 들어 0인 경우 상기 base_id_plus1이 가리키는 베이스 레이어의 퀄러티 베이스의 모션 정보를 이용하여 레이어 간 모션 예측을 수행하고, 상기 QualityLevelBase가 예를 들어 3인 경우 상기 base_id_plus1이 가리키는 베이스 레이어의 FGS 레벨 3(FGS L3)의 모션 정보를 이용하여 레이어 간 모션 예측을 수행한다.When the QualityLevelBase derived from the base_id_plus1 is 0, for example, the EL decoder 230 performs inter-layer motion prediction using motion information of the quality base of the base layer indicated by the base_id_plus1, and the QualityLevelBase is, for example. If 3, inter-layer motion prediction is performed using motion information of FGS level 3 (FGS L3) of the base layer indicated by base_id_plus1.

또한, 상기 EL 디코더(230)는, 상기 base_id_plus1와 QualityLevelBase가 가리키는 베이스 레이어의 FGS 레벨이 모션 정보를 포함하지 않는 경우, 상기 베이스 레이어의 퀄러티 베이스의 모션 정보를 이용하거나 또는 상기 QualityLevelBase가 가리키는 FGS 레벨보다 아래에 있는 모션 정보를 포함하는 FGS 레벨의 모션 정보를 이용하여 현재 레이어의 퀄러티 베이스에 대한 레이어 간 모션 예측을 수행할 수도 있다.In addition, the EL decoder 230 uses the quality information of the base layer of the base layer, or the FGS level indicated by the QualityLevelBase, when the FGS level of the base layer indicated by the base_id_plus1 and the QualityLevelBase does not include the motion information. Inter-layer motion prediction may be performed on the quality base of the current layer by using motion information of the FGS level including motion information below.

한편, 상기 EL 디코더(230)는, 현재 레이어의 FGS 레이어를 디코딩 할 때, 현재 레이어의 FGS 레이어 슬라이스 헤더에서 motion_base_prediction_flag 정보를 확인하여, 상기 정보가 0인 경우 현재 레이어의 아래 레벨의 FGS 레이어(또는 퀄러티 베이스)의 모션 정보를 이용하여 현재 레이어의 FGS 레이어에 대한 모션을 예측하고, 반면 상기 정보가 1인 경우 다른 레이어(베이스 레이어)의 FGS 레이어의 모션 정보를 이용하여 현재 레이어의 FGS 레이어에 대한 모션을 예측한다. 상기 정보가 1인 경우에 이용될 베이스 레이어와 FGS 레이어는 현재 레이어의 퀄러티 베이스 슬라이스에 정의되는 base_id_plus1 정보를 기초로 결정될 수 있다.Meanwhile, when decoding the FGS layer of the current layer, the EL decoder 230 checks motion_base_prediction_flag information in the FGS layer slice header of the current layer, and when the information is 0, the FGS layer (or the lower level of the current layer) Motion information of the current layer is used to predict the motion of the FGS layer of the current layer, whereas when the information is 1, motion information of the FGS layer of the other layer (base layer) is used to determine the motion of the FGS layer of the current layer. Predict motion. The base layer and the FGS layer to be used when the information is 1 may be determined based on base_id_plus1 information defined in the quality base slice of the current layer.

또는, 상기 motion_base_prediction_flag 정보는 매크로블록 단위로 정의될 수 있는데, 상기 EL 디코더(230)는, 현재 레이어의 FGS 레이어 내의 임의의 매크로블록을 디코딩 할 때, 상기 임의의 매크로블록의 헤더에서 motion_base_prediction_flag 정보를 확인하여, 그 값에 따라 현재 레이어의 퀄러티 베이스 내의 상기 임의의 매크로블록에 대응되는 블록(대응 블록)의 모션 정보 또는 베이스 레이어의 FGS 레이어 내의 대응 블록의 모션 정보를 이용하여 상기 임의의 매크로블록에 대한 모션을 예측할 수 있다.Alternatively, the motion_base_prediction_flag information may be defined in macroblock units, and when the EL decoder 230 decodes any macroblock in the FGS layer of the current layer, checks the motion_base_prediction_flag information in the header of the arbitrary macroblock. According to the value, using the motion information of the block (corresponding block) corresponding to the arbitrary macroblock in the quality base of the current layer or the motion information of the corresponding block in the FGS layer of the base layer for the arbitrary macroblock You can predict motion.

상기 EL 디코더(230)는, 위와 같은 동작을 통해 현재 레이어의 퀄러티 베이스 및/또는 FGS 레이어의 모션을 베이스 레이어의 FGS 레이어의 모션 정보를 기초로 예측하고, 이를 이용하여 현재 레이어의 퀄러티 베이스 및/또는 FGS 레이어의 영상 신호를 복원한다.The EL decoder 230 predicts the quality base of the current layer and / or the motion of the FGS layer based on the motion information of the FGS layer of the base layer through the above operation, and uses the quality base of the current layer and / or Alternatively, the video signal of the FGS layer is restored.

본 발명에 따른 디코딩 장치는, 전용 재생 단말 또는 통신 기능이 부여된 단말에 실장되거나 또는 기록 매체를 재생하는 장치에 실장될 수 있다.The decoding apparatus according to the present invention may be mounted in a dedicated playback terminal or a terminal to which a communication function has been given, or in a device for reproducing a recording medium.

이상, 전술한 본 발명의 바람직한 실시예는 예시의 목적을 위해 개시된 것으로, 당업자라면 이하 첨부된 특허청구범위에 개시된 본 발명의 기술적 사상과 그 기술적 범위 내에서 또 다른 다양한 실시예들을 개량, 변경, 대체 또는 부가 등이 가능할 것이다.As described above, preferred embodiments of the present invention have been disclosed for the purpose of illustration, and those skilled in the art can improve, change, and further various embodiments within the technical spirit and the technical scope of the present invention disclosed in the appended claims. Replacement or addition may be possible.

따라서, 좀더 유연한 레이어 간 모션 예측 방법을 제공할 수 있게 되어 코딩 효율을 향상시킬 수 있다.Therefore, a more flexible inter-layer motion prediction method can be provided, thereby improving coding efficiency.

Claims

A bit stream of the current layer is generated using motion information of a lower layer encoded from the same video signal.

The lower layer includes a first layer having one or more levels for compensating for losses incurred in lower level encoding, wherein a predetermined level comprising motion information in the first layer is used to generate the bit stream of the current layer. A method for encoding a video signal, characterized in that it is used.

The method of claim 1,

And the information indicating the lower layer and the level in the lower layer is recorded in the header area of the current layer.

The method of claim 2,

And when the level indicated by the information does not include motion information, a lower level lower than the level indicated by the information and having motion information is used to generate a bit stream of the current layer.

The method of claim 1,

And wherein the motion information comprises a motion vector.

A bit stream of the current layer is generated by using a lower layer encoded from the same video signal.

The current layer and the lower layer each include a first layer and a second layer having one or more levels for compensating for losses occurring in the encoding of the lower level,

Any level of the first layer is encoded using motion information of a lower level in the first layer or motion information of a predetermined level in the second layer,

And information indicating which layer of the current layer or the lower layer is used for encoding any level of the first layer.

The method of claim 5,

And the information is defined in a slice unit corresponding to a level or in an image block unit within a level.

The method of claim 6,

When the information indicates that motion information of a lower layer is used, a predetermined level used for encoding the arbitrary level is derived from a lower layer and a level used for generating a bit stream of the current layer except for the first layer. A method of encoding a video signal, characterized in that.

The method of claim 7, wherein

When level n in the second layer of the lower layer used to generate the bit stream of the current layer except for the first layer does not include motion information, a lower level lower than the level n and having motion information may be A method of encoding a video signal, characterized by being used for encoding.

The method of claim 5,

And wherein the motion information comprises a motion vector.

Decode the bit stream of the lower layer for the same video signal,

Decode the bit stream of the current layer by using the motion information of the decoded lower layer,

The lower layer includes a first layer having one or more levels for compensating for losses incurred in lower level encoding, wherein a predetermined level comprising motion information in the first layer is used for decoding the current layer. A method of decoding an encoded video bit stream.

The method of claim 10,

And information indicating the lower layer and the level in the lower layer is read from the header area of the current layer.

The method of claim 11,

And when the level indicated by the information does not include motion information, a lower level lower than the level indicated by the information and having motion information is used for decoding the current layer.

Decode the bit stream of the lower layer for the same video signal,

Decode the bit stream of the current layer by using the decoded lower layer,

Information indicating which of the current layer or the lower layer motion information is used for decoding any level of the first layer is read, and based on this, any level of the first layer is a lower level in the first layer. A method of decoding an encoded video bit stream, characterized in that it is decoded using the motion information of a level or the motion information of a predetermined level in the second layer.

The method of claim 13,

Wherein the information is defined in units of slices corresponding to levels or units of image blocks within the levels.

The method of claim 14,

When the information indicates that motion information of a lower layer is used, a predetermined level used for decoding the arbitrary level is derived from a lower layer and a level used for generating a bit stream of the current layer except for the first layer. A method of decoding an encoded video bit stream.

The method of claim 15,

When level n in the second layer of the lower layer used to generate the bit stream of the current layer except for the first layer does not include motion information, a lower level lower than the level n and having motion information may be A method for decoding an encoded video bit stream, characterized in that it is used for decoding.