KR101178085B1

KR101178085B1 - Weighted prediction based on vectorized entropy coding

Info

Publication number: KR101178085B1
Application number: KR1020117010347A
Authority: KR
Inventors: 마르타 카르체비츠; 라훌 판찰
Original assignee: 퀄컴 인코포레이티드
Priority date: 2008-10-16
Filing date: 2009-10-14
Publication date: 2012-08-30
Also published as: WO2010045380A3; KR20110063865A; CN102187671B; US9819940B2; WO2010045380A2; US20150281695A1; TW201029472A; CN102187671A; JP2012506215A; US20100098156A1; EP2347519A2; JP5415546B2

Abstract

본 개시는 향상 계층 비디오 블록에 대한 벡터화된 엔트로피 코딩의 특징에 기초하여 이러한 향상 계층 비디오 블록에 대한 예측 코딩 기술의 선택들 제어하는 방법을 설명한다. 본 개시에 따르면, 향상 계층 비디오 블록의 예측 기반 비디오 코딩에 이용되는 예측 기술은 이러한 향상 계층 비디오 블록에 이용되는 벡터화된 엔트로피 코딩에 의존한다. 각각의 코딩된 유닛에 있어서, 벡터화된 엔트로피 코딩이 그 코딩된 유닛의 비디오 블록에 대해 단일 벡터를 정의하는지 또는 그 코딩된 유닛의 비디오 블록에 대해 다수의 벡터를 정의하는지 여부에 따라 예측 코딩 기술 (예를 들어, 가중 예측 또는 비가중 예측) 이 선택될 수도 있다.This disclosure describes a method for controlling the selection of prediction coding techniques for such an enhancement layer video block based on the characteristics of the vectorized entropy coding for the enhancement layer video block. According to the present disclosure, the prediction technique used for prediction based video coding of an enhancement layer video block depends on the vectorized entropy coding used for this enhancement layer video block. For each coded unit, the predictive coding technique (depending on whether the vectorized entropy coding defines a single vector for the video block of the coded unit or multiple vectors for the video block of the coded unit) For example, weighted prediction or unweighted prediction) may be selected.

Description

Weighted prediction based on vectorized entropy coding {WEIGHTED PREDICTION BASED ON VECTORIZED ENTROPY CODING}

관련 출원Related application

본 출원은 2008 년 10 월 16 일자로 출원되고 참조로 본 명세서에 통합된 미국 가특허출원 제 61/106,039 호에 대해 우선권을 주장한다.This application claims priority to US Provisional Patent Application 61 / 106,039, filed October 16, 2008, and incorporated herein by reference.

기술분야Technical Field

본 개시는 비디오 데이터를 압축하는데 이용되는 블록 기반 디지털 비디오 코딩에 관한 것이다.The present disclosure relates to block-based digital video coding used to compress video data.

디지털 비디오 능력은, 디지털 텔레비젼, 디지털 다이렉트 브로드캐스트 시스템, 무선 전화 핸드셋과 같은 무선 통신 디바이스, 무선 브로드캐스트 시스템, 개인 휴대 정보 단말기 (PDA), 랩탑 또는 데스크탑 컴퓨터, 디지털 카메라, 디지털 레코딩 디바이스, 비디오 게이밍 디바이스, 비디오 게임 콘솔 등을 포함하는 광범위한 디바이스에 통합될 수 있다. 디지털 비디오 디바이스는, MPEG-2, MPEG-4 또는 H.264/MPEG-4, 파트 10, 어드밴스드 비디오 코딩 (AVC) 과 같은 비디오 압축 기술을 구현하여 디지털 비디오를 더 효율적으로 송신 및 수신한다. 비디오 압축 기술은 공간적 및 시간적 예측을 수행하여, 비디오 신호에 내재하는 리던던시를 감소 또는 제거한다.Digital video capabilities include digital television, digital direct broadcast systems, wireless communication devices such as cordless telephone handsets, wireless broadcast systems, personal digital assistants (PDAs), laptop or desktop computers, digital cameras, digital recording devices, video gaming It can be integrated into a wide range of devices, including devices, video game consoles, and the like. Digital video devices implement video compression techniques such as MPEG-2, MPEG-4 or H.264 / MPEG-4, Part 10, Advanced Video Coding (AVC) to transmit and receive digital video more efficiently. Video compression techniques perform spatial and temporal prediction to reduce or eliminate redundancy inherent in video signals.

블록 기반 비디오 압축 기술은 일반적으로 공간적 예측 및/또는 시간적 예측을 수행한다. 인트라-코딩 (intra-coding) 은 공간적 예측에 의존하여, 비디오 프레임, 비디오 프레임의 슬라이스 등을 포함할 수도 있는 소정의 코딩된 유닛 내의 비디오 블록들 사이에서 공간적 리던던시를 감소 또는 제거한다. 반대로, 인터-코딩 (inter-coding) 은 시간적 예측에 의존하여, 비디오 시퀀스의 연속적으로 코딩된 유닛들의 비디오 블록들 사이에서 시간적 리던던시를 감소 또는 제거한다. 인트라-코딩에 있어서, 비디오 인코더는 공간적 예측을 수행하여, 동일한 코딩된 유닛 내의 다른 데이터에 기초하여 데이터를 압축한다. 인터-코딩에 있어서, 비디오 인코더는 모션 추정 및 모션 보상을 수행하여, 2 개 이상의 인접한 코딩된 유닛들의 대응하는 비디오 블록들의 움직임을 트래킹한다.Block-based video compression techniques generally perform spatial prediction and / or temporal prediction. Intra-coding relies on spatial prediction to reduce or remove spatial redundancy between video blocks within a given coded unit, which may include video frames, slices of video frames, and the like. In contrast, inter-coding relies on temporal prediction to reduce or remove temporal redundancy between video blocks of successively coded units of a video sequence. In intra-coding, the video encoder performs spatial prediction to compress the data based on other data in the same coded unit. In inter-coding, the video encoder performs motion estimation and motion compensation to track the motion of corresponding video blocks of two or more adjacent coded units.

코딩된 비디오 블록은, 예측 블록을 생성 또는 식별하는데 이용될 수 있는 예측 정보, 및 코딩되고 있는 블록과 예측 블록 사이의 차를 나타내는 데이터의 잔차 블록 (residual block) 에 의해 표현될 수도 있다. 인터-코딩의 경우, 데이터의 예측 블록을 식별하는데 하나 이상의 모션 벡터가 이용되는 한편, 인트라-코딩의 경우 예측 블록을 생성하기 위해 예측 모드가 이용될 수 있다. 인트라-코딩 및 인터-코딩 모두는 수개의 상이한 예측 모드들을 정의할 수도 있고, 이 모드들은 코딩에 이용되는 상이한 블록 사이즈 및/또는 예측 기술을 정의할 수도 있다. 코딩 프로세스에 이용된 코딩 기술 또는 파라미터를 제어 또는 정의하기 위해, 인코딩된 비디오 데이터의 일부로서 신택스 엘리먼트 (syntax element) 의 추가적 타입이 또한 포함될 수도 있다.A coded video block may be represented by predictive information that can be used to generate or identify a predictive block, and a residual block of data that indicates a difference between the block being coded and the predictive block. In the case of inter-coding, one or more motion vectors are used to identify the predictive block of data, while in intra-coding, the prediction mode may be used to generate the predictive block. Both intra- and inter-coding may define several different prediction modes, which may define different block sizes and / or prediction techniques used for coding. Additional types of syntax elements may also be included as part of the encoded video data to control or define coding techniques or parameters used in the coding process.

블록 기반 예측 코딩 이후, 비디오 인코더는 변환, 양자화 및 엔트로피 코딩 프로세스를 적용하여, 잔차 블록의 통신과 연관된 비트 레이트를 추가적으로 감소시킬 수도 있다. 변환 기술은 이산 코사인 변환, 또는 웨이블릿 변환, 정수 변환 또는 기타 타입의 변환과 같은 개념적으로 유사한 프로세스를 포함할 수도 있다. 일예로, 이산 코사인 변환 (DCT) 프로세스에서, 변환 프로세스는 픽셀 값들의 세트를 변환 계수들로 변환하고, 그 변환 계수들은 주파수 도메인에서 픽셀 값들의 에너지를 나타낼 수도 있다. 양자화는 변환 계수들에 적용되고, 일반적으로, 임의의 소정의 변환 계수와 연관된 비트의 수를 제한하는 프로세스와 관련된다. 엔트로피 코딩은, 양자화된 변환 계수들의 시퀀스를 포괄적으로 압축하는 하나 이상의 프로세스들을 포함한다.After block-based predictive coding, the video encoder may apply a transform, quantization, and entropy coding process to further reduce the bit rate associated with communication of the residual block. The transformation technique may include conceptually similar processes such as discrete cosine transformation, wavelet transformation, integer transformation, or other type of transformation. In one example, in a discrete cosine transform (DCT) process, the transform process transforms a set of pixel values into transform coefficients, which transform coefficients may represent the energy of the pixel values in the frequency domain. Quantization is applied to transform coefficients and generally involves a process of limiting the number of bits associated with any given transform coefficient. Entropy coding includes one or more processes to comprehensively compress a sequence of quantized transform coefficients.

다양한 경우, 비디오 시퀀스는 기본 계층 (base layer) 및 하나 이상의 향상 계층 (enhancement layer) 으로 코딩될 수도 있다. 이 경우, 기본 계층은 비디오 품질의 기본 레벨을 정의할 수도 있고, 하나 이상의 향상 계층은 디코딩된 비디오 신호의 품질을 향상시킬 수도 있다. 향상 계층은, 예를 들어, 가능하게는 기본 계층 프레임에 공간적 향상을 제공하는 것, 가능하게는 신호 대 잡음 향상을 제공하는 것, 또는 가능하게는 기본 계층 프레임들 사이에 추가적 프레임들을 추가함으로써 디코딩된 비디오에 시간적 향상을 제공하는 것과 같은 가능한 다양한 방식으로 비디오 품질을 개선시킬 수도 있다. 어느 경우든, 인코딩된 비디오는 비디오 디코딩 디바이스로 송신될 수도 있고, 비디오 디코딩 디바이스는 비디오 시퀀스를 재구성하기 위해 비디오 인코더의 상반되는 프로세스 (reciprocal process) 를 수행한다.In various cases, the video sequence may be coded with a base layer and one or more enhancement layers. In this case, the base layer may define a base level of video quality, and one or more enhancement layers may improve the quality of the decoded video signal. The enhancement layer is decoded by, for example, possibly providing a spatial enhancement to the base layer frame, possibly providing a signal to noise enhancement, or possibly adding additional frames between the base layer frames. It is also possible to improve the video quality in a variety of possible ways, such as providing temporal enhancements to the video. In either case, the encoded video may be sent to a video decoding device, which performs the reciprocal process of the video encoder to reconstruct the video sequence.

개요summary

일반적으로, 본 개시는, 이러한 향상 계층 비디오 블록에 대한 벡터화된 엔트로피 코딩의 특성이 기초하여 향상 계층 비디오 블록에 대한 예측 코딩 기술의 선택을 제어하는 방법을 설명한다. 벡터화된 엔트로피 코딩은 비디오 블록과 연관된 벡터의 수를 정의하는 벡터 신택스 엘리먼트에 의존하는 비디오 블록의 엔트로피 코딩을 지칭한다. 벡터 신택스 엘리먼트는, 예를 들어, 각각의 비디오 프레임 또는 각각의 독립적으로 디코딩가능한 슬라이스 또는 일부의 비디오 프레임과 같은 각각의 코딩된 유닛에 대해 정의될 수도 있다. 벡터 신택스 엘리먼트에 의해 정의된 각각의 벡터는 함께 엔트로피 코딩될 비디오 블록의 계수들의 세트를 정의한다. 코딩된 유닛의 비디오 블록에 대해 수개의 벡터들이 정의되면, 계수들의 수개의 개별적 세트들이 비디오 블록 각각에 대해 별도로 엔트로피 코딩될 것이다. 코딩된 유닛의 비디오 블록에 대해 오직 하나의 벡터만 정의되면, 각각의 소정의 비디오 블록에 대한 계수들 모두가 함께 엔트로피 코딩될 것이다.In general, the present disclosure describes a method for controlling the selection of a predictive coding technique for an enhancement layer video block based on the characteristics of the vectorized entropy coding for such enhancement layer video block. Vectorized entropy coding refers to entropy coding of a video block that depends on a vector syntax element that defines the number of vectors associated with the video block. The vector syntax element may be defined for each coded unit, such as, for example, each video frame or each independently decodable slice or some video frame. Each vector defined by the vector syntax element defines a set of coefficients of the video block to be entropy coded together. If several vectors are defined for the video block of the coded unit, several individual sets of coefficients will be entropy coded separately for each video block. If only one vector is defined for the video block of the coded unit, then all of the coefficients for each given video block will be entropy coded together.

본 개시에 따르면, 향상 계층 비디오 블록의 예측 기반 비디오 코딩에 이용되는 예측 기술은 이러한 향상 계층 비디오 블록에 이용된 벡터화된 엔트로피 코딩에 의존한다. 각각의 코딩된 유닛에 있어서, 벡터화된 엔트로피 코딩이 그 코딩된 유닛의 비디오 블록에 대해 단일 벡터를 정의하는지 또는 그 코딩된 유닛의 비디오 블록에 대해 다수의 벡터들을 정의하는지 여부에 따라 (예를 들어, 가중 또는 비가중 예측과 같은) 예측 코딩 기술이 선택될 수도 있다. 더 상세하게는, 가중 예측은, 벡터화된 엔트로피 코딩이 향상 계층 비디오 블록에 대해 2 개 이상의 벡터를 확립하는 경우 선택될 수도 있다.According to the present disclosure, the prediction technique used for prediction based video coding of an enhancement layer video block depends on the vectorized entropy coding used for this enhancement layer video block. For each coded unit, depending on whether the vectorized entropy coding defines a single vector for the video block of the coded unit or multiple vectors for the video block of the coded unit (eg Predictive coding techniques (such as, weighted or unweighted prediction) may be selected. More specifically, weighted prediction may be selected if vectorized entropy coding establishes two or more vectors for the enhancement layer video block.

대안적으로, 순차적 예측과 같은 비가중 예측은, 벡터화된 엔트로피 코딩이 향상 계층 비디오 블록에 대해 단일 벡터를 확립하는 경우 선택될 수도 있다. 본 개시에서, 가중 예측은, 예측된 향상 계층 데이터와 예측된 기본 계층 데이터의 조합을 포함하는 가중된 예측 데이터를 참조하는 예측을 지칭한다. 반대로, 순차적 예측은, 코딩되는 블록과 연관된 동일한 계층의 예측 프레임과 같은 미리 코딩된 데이터를 참조하는 예측을 지칭한다.Alternatively, unweighted prediction, such as sequential prediction, may be selected when vectorized entropy coding establishes a single vector for the enhancement layer video block. In this disclosure, weighted prediction refers to prediction that refers to weighted prediction data that includes a combination of predicted enhancement layer data and predicted base layer data. In contrast, sequential prediction refers to prediction that refers to precoded data, such as prediction frames of the same layer associated with the block to be coded.

일예로, 본 개시는 비디오 시퀀스의 데이터를 코딩하는 방법을 제공한다. 이 방법은, 비디오 시퀀스 내에서 코딩된 유닛의 향상 계층 비디오 블록의 벡터화된 엔트로피 코딩에 대해 하나 이상의 벡터를 정의하는 단계, 그 정의된 벡터화된 엔트로피 코딩에 기초하여 코딩된 유닛의 향상 계층 비디오 블록에 대한 예측 모드를 선택하는 단계로서, 예측 모드의 선택은, 그 정의된 벡터화된 엔트로피 코딩이 향상 계층 비디오 블록에 대해 2 개 이상의 벡터를 확립하는 경우 가중 예측을 선택하는 것을 포함하는, 상기 예측 모드를 선택하는 단계, 및 선택된 예측 모드 및 벡터화된 엔트로피 코딩에 기초하여 향상 계층 비디오 블록을 코딩하는 단계를 포함한다.In one example, the present disclosure provides a method of coding data of a video sequence. The method includes defining one or more vectors for vectorized entropy coding of an enhancement layer video block of a coded unit within a video sequence, the enhancement layer video block of the coded unit based on the defined vectorized entropy coding. Selecting a prediction mode for the prediction mode, wherein the selection of the prediction mode comprises selecting weighted prediction if the defined vectorized entropy coding establishes two or more vectors for the enhancement layer video block. Selecting, and coding an enhancement layer video block based on the selected prediction mode and vectorized entropy coding.

다른 예로, 본 개시는 비디오 시퀀스의 데이터를 코딩하는 장치를 제공한다. 이 장치는, 비디오 시퀀스에서 코딩된 유닛의 향상 계층 비디오 블록의 벡터화된 엔트로피 코딩에 대해 하나 이상의 벡터를 정의하고, 그 정의된 벡터화된 엔트로피 코딩에 기초하여 코딩된 유닛의 향상 계층 비디오 블록에 대한 예측 모드를 선택하는 제어 유닛을 포함하고, 이 제어 유닛은, 그 정의된 벡터화된 엔트로피 코딩이 향상 계층 비디오 블록에 대해 2 개 이상의 벡터를 확립하는 경우 가중 예측을 선택한다. 이 장치는 또한, 선택된 예측 모드에 기초하여 예측 코딩 기술을 수행하는 예측 유닛 및 벡터화된 엔트로피 코딩을 수행하는 엔트로피 코딩 유닛을 포함한다.As another example, the present disclosure provides an apparatus for coding data of a video sequence. The apparatus defines one or more vectors for vectorized entropy coding of an enhancement layer video block of units coded in a video sequence and predicts for the enhancement layer video blocks of the coded unit based on the defined vectorized entropy coding. A control unit that selects a mode, which selects weighted prediction if its defined vectorized entropy coding establishes two or more vectors for the enhancement layer video block. The apparatus also includes a prediction unit for performing predictive coding techniques and an entropy coding unit for performing vectorized entropy coding based on the selected prediction mode.

다른 예로, 본 개시는 비디오 시퀀스의 데이터를 코딩하는 디바이스를 제공하며, 이 디바이스는, 비디오 시퀀스 내에서 코딩된 유닛의 향상 계층 비디오 블록의 벡터화된 엔트로피 코딩에 대해 하나 이상의 벡터를 정의하는 수단, 그 정의된 벡터화된 엔트로피 코딩에 기초하여 코딩된 유닛의 향상 계층 비디오 블록에 대한 예측 모드를 선택하는 수단으로서, 그 정의된 벡터화된 엔트로피 코딩이 향상 계층 비디오 블록에 대해 2 개 이상의 벡터를 확립하는 경우 가중 예측을 선택하는 수단을 포함하는, 상기 예측 모드를 선택하는 수단, 및 선택된 예측 모드 및 벡터화된 엔트로피 코딩에 기초하여 향상 계층 비디오 블록을 코딩하는 수단을 포함한다.In another example, the present disclosure provides a device for coding data of a video sequence, the device comprising: means for defining one or more vectors for vectorized entropy coding of an enhancement layer video block of a unit coded within a video sequence, Means for selecting a prediction mode for an enhancement layer video block of a coded unit based on the defined vectorized entropy coding, wherein the defined vectorized entropy coding establishes two or more vectors for the enhancement layer video block Means for selecting a prediction mode, comprising means for selecting a prediction, and means for coding an enhancement layer video block based on the selected prediction mode and vectorized entropy coding.

다른 예로, 본 개시는, 비디오 시퀀스 내에서 코딩된 유닛의 향상 계층 비디오 블록의 벡터화된 엔트로피 인코딩에 대해 하나 이상의 벡터를 정의하고, 그 정의된 벡터화된 엔트로피 코딩에 기초하여 코딩된 유닛의 향상 계층 비디오 블록에 대한 예측 모드를 선택하는 제어 유닛으로서, 그 정의된 벡터화된 엔트로피 코딩이 향상 계층 비디오 블록에 대해 2 개 이상의 벡터를 확립하는 경우 가중 예측을 선택하는 상기 제어 유닛, 선택된 예측 모드에 기초하여 예측 인코딩 기술을 수행하는 예측 유닛, 벡터화된 엔트로피 인코딩을 수행하여 비트스트림의 적어도 일부를 생성하는 엔트로피 인코딩 유닛, 및 그 비트스트림을 다른 디바이스에 전송하는 무선 송신기를 포함하는 디바이스를 제공한다.As another example, the present disclosure defines one or more vectors for vectorized entropy encoding of an enhancement layer video block of a unit coded within a video sequence and enhances the enhancement layer video of the coded unit based on the defined vectorized entropy coding. A control unit for selecting a prediction mode for a block, wherein the control unit for selecting weighted prediction when the defined vectorized entropy coding establishes two or more vectors for the enhancement layer video block, the prediction based on the selected prediction mode A device includes a prediction unit for performing an encoding technique, an entropy encoding unit for performing at least a portion of a bitstream by performing vectorized entropy encoding, and a wireless transmitter for transmitting the bitstream to another device.

다른 예로, 본 개시는, 비디오 시퀀스 내에서 코딩된 유닛의 향상 계층 비디오 블록의 엔트로피 코딩된 계수 값들을 포함하는 비트스트림을 수신하는 무선 수신기, 비디오 시퀀스 내에서 코딩된 유닛의 향상 계층 비디오 블록의 벡터화된 엔트로피 디코딩에 대한 하나 이상의 벡터를 정의하고, 그 정의된 벡터화된 엔트로피 코딩에 기초하여 코딩된 유닛의 향상 계층 비디오 블록에 대한 예측 모드를 선택하는 제어 유닛으로서, 그 정의된 벡터화된 엔트로피 코딩이 향상 계층 비디오 블록에 대해 2 개 이상의 벡터를 확립하는 경우 가중 예측을 선택하는 상기 제어 유닛, 선택된 예측 모드에 기초하여 예측 디코딩 기술을 수행하는 예측 유닛, 벡터화된 엔트로피 디코딩을 수행하는 엔트로피 디코딩 유닛을 포함하는 디바이스를 제공한다.As another example, the present disclosure provides a wireless receiver for receiving a bitstream including entropy coded coefficient values of an enhancement layer video block of a unit coded within a video sequence, vectorization of an enhancement layer video block of a unit coded within a video sequence. A control unit that defines one or more vectors for defined entropy decoding and selects a prediction mode for an enhancement layer video block of the coded unit based on the defined vectorized entropy coding, wherein the defined vectorized entropy coding is enhanced. The control unit for selecting weighted prediction when establishing two or more vectors for the hierarchical video block, a prediction unit for performing a prediction decoding technique based on the selected prediction mode, and an entropy decoding unit for performing vectorized entropy decoding; Provide a device.

본 개시에서 설명된 기술들은 하드웨어, 소프트웨어, 펌웨어 또는 이들의 임의의 조합으로 구현될 수도 있다. 하드웨어로 구현되면, 장치는 집적 회로, 프로세서, 이산 로직 또는 이들의 임의의 조합으로 실현될 수도 있다. 소프트웨어로 구현되면, 소프트웨어는, 마이크로프로세서, 주문형 집적회로 (ASIC), 필드 프로그래머블 게이트 어레이 (FPGA), 또는 디지털 신호 프로세서 (DSP) 와 같은 하나 이상의 프로세서에서 실행될 수도 있다. 이 기술들을 실행하는 소프트웨어는 초기에 컴퓨터 판독가능 매체에 저장될 수도 있고, 프로세서에서 로딩 및 실행될 수도 있다.The techniques described in this disclosure may be implemented in hardware, software, firmware, or any combination thereof. If implemented in hardware, the device may be implemented in an integrated circuit, a processor, discrete logic, or any combination thereof. If implemented in software, the software may be executed on one or more processors, such as a microprocessor, application specific integrated circuit (ASIC), field programmable gate array (FPGA), or digital signal processor (DSP). Software implementing these techniques may be initially stored on a computer readable medium and loaded and executed on a processor.

따라서, 본 개시는 또한, 비디오 코딩 디바이스에서 실행될 때 그 디바이스로 하여금 비디오 시퀀스의 데이터를 코딩하게 하는 명령들을 포함하는 컴퓨터 판독가능 매체를 고려한다. 더 상세하게는, 이 명령들은 디바이스로 하여금, 비디오 시퀀스 내에서 코딩된 유닛의 향상 계층 비디오 블록의 벡터화된 엔트로피 코딩에 대해 하나 이상의 벡터를 정의하게 하고, 그 정의된 벡터화된 엔트로피 코딩에 기초하여 코딩된 유닛의 향상 계층 비디오 블록에 대한 예측 모드를 선택하게 하게 하고, 그 정의된 벡터화된 엔트로피 코딩이 향상 계층 비디오 블록에 대해 2 개 이상의 벡터를 확립하는 경우 가중 예측을 선택하게 하고, 선택된 예측 블록 및 벡터화된 엔트로피 코딩에 기초하여 향상 계층 비디오 블록을 코딩하게 한다.Thus, the present disclosure also contemplates a computer readable medium containing instructions that when executed in a video coding device cause the device to code data in a video sequence. More specifically, these instructions cause the device to define one or more vectors for vectorized entropy coding of an enhancement layer video block of a coded unit within a video sequence, and code based on the defined vectorized entropy coding. Select a prediction mode for the enhancement layer video block of the given unit, and if the defined vectorized entropy coding establishes two or more vectors for the enhancement layer video block, select weighted prediction, and select the selected prediction block and Code an enhancement layer video block based on vectorized entropy coding.

본 개시의 하나 이상의 양태들의 세부사항을 첨부한 도면 및 다음의 설명에서 기술한다. 본 개시에서 설명된 기술의 다른 특징, 목적 및 이점은 설명 및 도면과 청구항으로부터 명백해질 것이다.The details of one or more aspects of the disclosure are set forth in the accompanying drawings and the description below. Other features, objects, and advantages of the techniques described in this disclosure will be apparent from the description and drawings, and from the claims.

도면의 간단한 설명Brief description of the drawings

도 1 은 비디오 인코딩 및 디코딩 시스템을 도시하는 예시적인 블록도이다.1 is an exemplary block diagram illustrating a video encoding and decoding system.

도 2a 는 순차적 예측을 도시하는 개념도이다.2A is a conceptual diagram illustrating sequential prediction.

도 2b 는 가중 예측을 도시하는 개념도이다.2B is a conceptual diagram illustrating weighted prediction.

도 2c 는 가중 예측을 도시하는 다른 개념도이다.2C is another conceptual diagram illustrating weighted prediction.

도 3a 는 4×4 비디오 블록의 지그재그 스캐닝을 도시하는 개념도이다.3A is a conceptual diagram illustrating zigzag scanning of a 4x4 video block.

도 3b 는 도 3a 의 블록에 대한 지그재그 스캐닝 동안 적용되는 다른 벡터 제어 신호와 연관된 벡터들을 도시하는 도면이다.FIG. 3B is a diagram illustrating vectors associated with another vector control signal applied during zigzag scanning for the block of FIG. 3A.

도 4 는 본 개시에 따른 예시적인 비디오 인코더를 도시하는 블록도이다.4 is a block diagram illustrating an example video encoder in accordance with this disclosure.

도 5 는 본 개시에 따른 예시적인 비디오 디코더를 도시하는 블록도이다.5 is a block diagram illustrating an example video decoder according to this disclosure.

도 6 및 7 은 본 개시에 따른 기술을 도시하는 흐름도이다.6 and 7 are flow diagrams illustrating a technique in accordance with the present disclosure.

상세한 설명details

본 개시는, 향상 계층 비디오 블록에 대한 벡터화된 엔트로피 코딩의 특성에 기초하여 이러한 향상 계층 비디오 블록에 대한 예측 코딩 기술의 선택을 제어하는 방법을 설명한다. 본 개시에 따르면, 향상 계층 비디오 블록의 예측 기반 비디오 코딩에 이용되는 예측 기술들은 이러한 향상 계층 비디오 블록에 이용되는 벡터화된 엔트로피 코딩에 의존한다. 각각의 코딩된 유닛에 있어서, 벡터화된 엔트로피 코딩이 그 코딩된 유닛의 비디오 블록에 대해 단일 벡터를 정의하는지 또는 그 코딩된 유닛의 비디오 블록에 대해 다수의 벡터들을 정의하는지 여부에 따라 (예를 들어, 가중 또는 비가중 예측과 같은) 예측 코딩 기술이 선택될 수도 있다.This disclosure describes a method for controlling the selection of predictive coding techniques for such enhancement layer video blocks based on the characteristics of the vectorized entropy coding for the enhancement layer video blocks. According to the present disclosure, the prediction techniques used for prediction based video coding of an enhancement layer video block depend on the vectorized entropy coding used for such enhancement layer video block. For each coded unit, depending on whether the vectorized entropy coding defines a single vector for the video block of the coded unit or multiple vectors for the video block of the coded unit (eg Predictive coding techniques (such as, weighted or unweighted prediction) may be selected.

벡터화된 엔트로피 코딩은 비디오 블록과 연관된 벡터의 수를 정의하는 벡터 신택스 엘리먼트에 의존하는 비디오 블록의 엔트로피 코딩을 지칭한다. 벡터 신택스 엘리먼트는, 예를 들어, 각각의 비디오 프레임 또는 각각의 독립적으로 디코딩가능한 슬라이스 또는 일부의 비디오 프레임과 같은 각각의 코딩된 유닛에 대해 정의될 수도 있다. 벡터 신택스 엘리먼트에 의해 정의된 각각의 벡터는 함께 엔트로피 코딩될 비디오 블록의 계수들의 세트를 정의한다. 코딩된 유닛의 비디오 블록에 대해 수개의 벡터들이 정의되면, 계수들의 수개의 개별적 세트들이 그 코딩된 유닛에 대해 별도로 엔트로피 코딩될 것이다. 코딩된 유닛의 비디오 블록에 대해 오직 하나의 벡터만 정의되면, 각각의 소정의 비디오 블록에 대한 계수들 모두가 그 코딩된 유닛에 대해 함께 엔트로피 코딩될 것이다.Vectorized entropy coding refers to entropy coding of a video block that depends on a vector syntax element that defines the number of vectors associated with the video block. The vector syntax element may be defined for each coded unit, such as, for example, each video frame or each independently decodable slice or some video frame. Each vector defined by the vector syntax element defines a set of coefficients of the video block to be entropy coded together. If several vectors are defined for a video block of a coded unit, several individual sets of coefficients will be entropy coded separately for that coded unit. If only one vector is defined for a video block of a coded unit, then all of the coefficients for each given video block will be entropy coded together for that coded unit.

본 개시에 따르면, 가중 예측은, 벡터화된 엔트로피 코딩이 향상 계층 비디오 블록에 대해 2 개 이상의 벡터를 확립하는 경우 선택될 수도 있다. 대안적으로, 순차적 예측과 같은 비가중 예측은, 벡터화된 엔트로피 코딩이 향상 계층 비디오 블록에 대해 단일 벡터를 확립하는 경우 선택될 수도 있다. 본 개시에서, 가중 예측은, 예측된 향상 계층 데이터와 예측된 기본 계층 데이터의 조합을 포함하는 가중된 예측 데이터를 참조하는 예측을 지칭한다. 반대로, 순차적 예측은, 코딩되는 블록과 연관된 동일한 계층의 예측 프레임과 같은 미리 코딩된 데이터를 참조하는 예측을 지칭한다.According to this disclosure, weighted prediction may be selected when vectorized entropy coding establishes two or more vectors for an enhancement layer video block. Alternatively, unweighted prediction, such as sequential prediction, may be selected when vectorized entropy coding establishes a single vector for the enhancement layer video block. In this disclosure, weighted prediction refers to prediction that refers to weighted prediction data that includes a combination of predicted enhancement layer data and predicted base layer data. In contrast, sequential prediction refers to prediction that refers to precoded data, such as prediction frames of the same layer associated with the block to be coded.

도 1 은 본 개시의 기술들을 구현할 수도 있는 예시적인 비디오 인코딩 및 디코딩 시스템 (10) 을 도시하는 블록도이다. 도 1 에 도시된 바와 같이, 시스템 (10) 은, 인코딩된 비디오를 통신 채널 (15) 을 통해 수신지 디바이스 (16) 에 송신하는 소스 디바이스 (12) 를 포함한다. 소스 디바이스 (12) 및 수신지 디바이스 (16) 는 임의의 광범위한 디바이스들을 포함할 수도 있다. 몇몇 경우, 소스 디바이스 (12) 및 수신지 디바이스 (16) 는, 소위 셀룰러 또는 위성 무선전화와 같은 무선 통신 디바이스 핸드셋을 포함할 수도 있다. 그러나, 더 일반적으로 예측 코딩 및 엔트로피 코딩에 적용되는 본 개시의 기술은 무선 애플리케이션 또는 세팅에 한정될 필요는 없으며, 비디오 인코딩 및/또는 디코딩 능력을 포함하는 비-무선 디바이스에 적용될 수도 있다.1 is a block diagram illustrating an example video encoding and decoding system 10 that may implement the techniques of this disclosure. As shown in FIG. 1, system 10 includes a source device 12 that transmits encoded video to a destination device 16 over a communication channel 15. Source device 12 and destination device 16 may include any of a wide variety of devices. In some cases, source device 12 and destination device 16 may include a wireless communication device handset, such as a so-called cellular or satellite radiotelephone. However, the techniques of this disclosure that apply more generally to predictive coding and entropy coding need not be limited to wireless applications or settings, and may be applied to non-wireless devices that include video encoding and / or decoding capabilities.

도 1 의 예에서, 소스 디바이스 (12) 는 비디오 소스 (20), 비디오 인코더 (22), 변조기/복조기 (모뎀; 23) 및 송신기 (24) 를 포함할 수도 있다. 수신지 디바이스 (16) 는 수신기 (26), 모뎀 (27), 비디오 디코더 (28) 및 디스플레이 디바이스 (30) 를 포함할 수도 있다. 본 개시에 따르면, 소스 디바이스 (12) 의 비디오 인코더 (22) 는 벡터화된 엔트로피 인코딩, 및 그 벡터화된 엔트로피 인코딩에 기초하여 정의 또는 선택되는 예측 기술을 수행하도록 구성될 수도 있다. 유사하게, 수신지 디바이스 (16) 의 비디오 디코더 (28) 는 벡터화된 엔트로피 디코딩, 및 그 벡터화된 엔트로피 디코딩에 기초하여 정의 또는 선택되는 예측 기술을 수행하도록 구성될 수도 있다. 어느 경우이든, 도 1 의 도시된 시스템 (10) 은 오직 예시적이다. 본 개시의 벡터화된 엔트로피 코딩 기술 및 연관된 예측 기술은 임의의 인코딩 또는 디코딩 디바이스에 의해 수행될 수도 있다. 소스 디바이스 (12) 및 수신지 디바이스 (16) 는 이러한 기술들을 지원할 수 있는 코딩 디바이스들의 오직 예시이다.In the example of FIG. 1, source device 12 may include video source 20, video encoder 22, modulator / demodulator (modem) 23, and transmitter 24. Destination device 16 may include receiver 26, modem 27, video decoder 28, and display device 30. According to this disclosure, video encoder 22 of source device 12 may be configured to perform a vectorized entropy encoding, and a prediction technique defined or selected based on the vectorized entropy encoding. Similarly, video decoder 28 of destination device 16 may be configured to perform vectorized entropy decoding, and a prediction technique defined or selected based on the vectorized entropy decoding. In either case, the illustrated system 10 of FIG. 1 is exemplary only. The vectorized entropy coding technique and associated prediction technique of the present disclosure may be performed by any encoding or decoding device. Source device 12 and destination device 16 are only examples of coding devices that can support these techniques.

소스 디바이스 (12) 의 비디오 인코더 (22) 는 본 개시의 기술들을 이용하여, 비디오 소스 (20) 로부터 수신된 비디오 데이터를 인코딩할 수도 있다. 비디오 소스 (20) 는 비디오 카메라와 같은 비디오 캡쳐 디바이스, 이전에 캡쳐된 비디오를 포함하는 비디오 어카이브, 또는 비디오 컨텐츠 제공자로부터의 비디오 피드를 포함할 수도 있다. 다른 대안예로서, 비디오 소스 (20) 는 소스 비디오로서 컴퓨터 그래픽스 기반 데이터, 또는 라이브 비디오, 보관된 비디오 및 컴퓨터 생성 비디오의 조합을 생성할 수도 있다. 몇몇 경우, 비디오 소스 (20) 가 비디오 카메라이면, 소스 디바이스 (12) 및 수신지 디바이스 (16) 는 소위 카메라폰 또는 비디오폰을 형성할 수도 있다. 각각의 경우, 캡쳐된 비디오, 미리 캡쳐된 비디오 또는 컴퓨터 생성 비디오는 비디오 인코더 (22) 에 의해 인코딩될 수도 있다.Video encoder 22 of source device 12 may encode the video data received from video source 20 using the techniques of this disclosure. Video source 20 may include a video capture device, such as a video camera, a video archive containing previously captured video, or a video feed from a video content provider. As another alternative, video source 20 may generate computer graphics based data as the source video, or a combination of live video, archived video, and computer generated video. In some cases, if video source 20 is a video camera, source device 12 and destination device 16 may form a so-called cameraphone or videophone. In each case, the captured video, pre-captured video or computer-generated video may be encoded by video encoder 22.

비디오 데이터가 비디오 인코더 (22) 에 의해 인코딩되면, 그 인코딩된 비디오 정보는, 예를 들어, 코드 분할 다중 접속 (CDMA) 또는 다른 통신 표준이나 기술과 같은 통신 표준에 따라 모뎀 (23) 에 의해 변조되고, 송신기 (24) 를 통해 수신지 디바이스 (16) 로 송신될 수도 있다. 모뎀 (23) 은 다양한 믹서, 필터, 증폭기, 또는 신호 변조를 위해 설계된 기타 컴포넌트들을 포함할 수도 있다. 송신기 (24) 는 증폭기, 필터 및 하나 이상의 안테나를 포함하여, 데이터를 송신하기 위해 설계된 회로들을 포함할 수도 있다.When video data is encoded by video encoder 22, the encoded video information is modulated by modem 23 in accordance with a communication standard such as, for example, code division multiple access (CDMA) or other communication standard or technology. And to the destination device 16 via the transmitter 24. Modem 23 may include various mixers, filters, amplifiers, or other components designed for signal modulation. Transmitter 24 may include circuits designed to transmit data, including amplifiers, filters, and one or more antennas.

수신지 디바이스 (16) 의 수신기 (26) 는 채널 (15) 을 통해 정보를 수신하고, 모뎀 (27) 은 그 정보를 복조한다. 비디오 디코더 (28) 에 의해 수행된 비디오 디코딩 프로세스는, 여기서 설명되는 바와 같이, 벡터화된 엔트로피 디코딩, 및 그 벡터화된 엔트로피 디코딩에 기초하여 정의 또는 선택되는 예측 기술을 포함할 수도 있다. 디스플레이 디바이스 (28) 는 디코딩된 비디오 데이터를 사용자에게 디스플레이하며, 음극선관 (CRT), 액정 디스플레이 (LCD), 플라즈마 디스플레이, 유기 발광 다이오드 (OLED) 디스플레이 또는 다른 타입의 디스플레이 디바이스와 같은 임의의 다양한 디스플레이 디바이스를 포함할 수도 있다.Receiver 26 of destination device 16 receives information over channel 15, and modem 27 demodulates that information. The video decoding process performed by video decoder 28 may include vectorized entropy decoding, and a prediction technique defined or selected based on the vectorized entropy decoding, as described herein. Display device 28 displays decoded video data to a user, and any various displays such as cathode ray tube (CRT), liquid crystal display (LCD), plasma display, organic light emitting diode (OLED) display or other types of display devices. It may also include a device.

통신 채널 (15) 은 무선 주파수 (RF) 스펙트럼 또는 하나 이상의 물리적 송신 라인과 같은 임의의 무선 또는 유선 송신 라인을 포함할 수도 있고, 유선 및 무선 매체의 임의의 조합을 포함할 수도 있다. 통신 채널 (15) 은, 로컬 영역 네트워크, 광역 네트워크, 또는 인터넷과 같은 글로벌 네트워크와 같은 패킷 기반 네트워크의 일부를 형성할 수도 있다. 통신 채널 (15) 은 일반적으로, 소스 디바이스 (12) 로부터 수신지 디바이스 (16) 로 비디오 데이터를 송신하기 위한 임의의 적절한 통신 매체 또는 서로 다른 통신 매체들의 집합물을 나타낸다.Communication channel 15 may include any wireless or wired transmission line, such as a radio frequency (RF) spectrum or one or more physical transmission lines, and may include any combination of wired and wireless media. Communication channel 15 may form part of a packet-based network, such as a local area network, a wide area network, or a global network such as the Internet. Communication channel 15 generally represents any suitable communication medium or collection of different communication media for transmitting video data from source device 12 to destination device 16.

비디오 인코더 (22) 및 비디오 디코더 (28) 는, 다르게는 MPEG-4, 파트 10, 어드밴스드 비디오 코딩 (AVC) 으로 지칭되는 ITU-T H.264 표준과 같은 비디오 압축 표준에 따라 동작할 수도 있다. 그러나, 본 개시의 기술들은 임의의 다양한 다른 비디오 코딩 표준에 용이하게 적용될 수도 있다. 구체적으로는, 벡터화된 엔트로피 코딩을 허용하는 임의의 표준이 본 개시의 교시로부터 이점이 있을 것이다.Video encoder 22 and video decoder 28 may operate according to a video compression standard, such as the ITU-T H.264 standard, otherwise referred to as MPEG-4, Part 10, Advanced Video Coding (AVC). However, the techniques of this disclosure may be readily applied to any of a variety of other video coding standards. Specifically, any standard that allows vectorized entropy coding will benefit from the teachings of this disclosure.

도 1 에는 도시되지 않았지만, 몇몇 양태에서, 비디오 인코더 (22) 및 비디오 디코더 (28) 는 각각 오디오 인코더 및 디코더와 통합될 수도 있고, 적절한 MUX-DEMUX 유닛 또는 기타 하드웨어 및 소프트웨어를 포함하여, 오디오 및 비디오 모두의 인코딩을 공통의 데이터 스트림 또는 별도의 데이터 스트림에서 조작할 수도 있다. 적용가능하다면, MUX-DEMUX 유닛은 ITU H.223 멀티플렉서 프로토콜, 또는 사용자 데이터그램 프로토콜 (UDP) 과 같은 기타 프로토콜에 부합할 수도 있다.Although not shown in FIG. 1, in some aspects, video encoder 22 and video decoder 28 may each be integrated with an audio encoder and decoder, including appropriate MUX-DEMUX units or other hardware and software, The encoding of both videos may be manipulated in a common data stream or in separate data streams. If applicable, the MUX-DEMUX unit may conform to other protocols such as the ITU H.223 Multiplexer Protocol, or User Datagram Protocol (UDP).

비디오 인코더 (22) 및 비디오 디코더 (28) 각각은, 하나 이상의 마이크로프로세서, 디지털 신호 프로세서 (DSP), 주문형 집적회로 (ASIC), 필드 프로그래머블 게이트 어레이 (FPGA), 이산 로직, 소프트웨어, 하드웨어, 펌웨어 또는 이들의 임의의 조합으로 구현될 수도 있다. 비디오 인코더 (22) 및 비디오 디코더 (28) 각각은 하나 이상의 인코더 또는 디코더에 포함될 수도 있으며, 그 인코더 또는 디코더는, 각각의 이동 디바이스, 가입자 디바이스, 브로드캐스트 디바이스, 서버 등에서 조합된 인코더/디코더 (CODEC) 의 일부로서 통합될 수도 있다.Each of video encoder 22 and video decoder 28 may include one or more microprocessors, digital signal processors (DSPs), application specific integrated circuits (ASICs), field programmable gate arrays (FPGAs), discrete logic, software, hardware, firmware or It may be implemented in any combination of these. Each of video encoder 22 and video decoder 28 may be included in one or more encoders or decoders, which may be combined encoder / decoder (CODECs) in each mobile device, subscriber device, broadcast device, server, or the like. May be incorporated as part of

몇몇 경우, 디바이스들 (12, 16) 은 실질적으로 대칭적 방식으로 동작할 수도 있다. 예를 들어, 각각의 디바이스 (12, 16) 는 비디오 인코딩 및 디코딩 컴포넌트를 포함할 수도 있다. 따라서, 시스템 (10) 은, 예를 들어, 비디오 스트리밍, 비디오 재생, 비디오 브로드캐스팅 또는 비디오 전화를 위해, 비디오 디바이스 (12, 16) 사이에서 일방향 또는 양방향 비디오 송신을 지원할 수도 있다.In some cases, devices 12, 16 may operate in a substantially symmetrical manner. For example, each device 12, 16 may include a video encoding and decoding component. Thus, system 10 may support one-way or two-way video transmission between video devices 12, 16, for example, for video streaming, video playback, video broadcasting, or video telephony.

인코딩 프로세스 동안, 비디오 인코더 (22) 는 다수의 코딩 기술 또는 단계들 실행할 수도 있다. 일반적으로 비디오 인코더 (22) 는 비디오 블록을 인코딩하기 위해 개별적 비디오 프레임들 (또는 슬라이스와 같은 다른 독립적으로 코딩된 유닛들) 내에서의 비디오 블록에 대해 동작한다. 비디오 블록은 고정된 사이즈 또는 가변 사이즈를 가질 수도 있고, 특정한 코딩 표준에 따라 사이즈가 달라질 수도 있다. 몇몇 경우, 각각의 비디오 프레임은 일련의 독립적으로 디코딩가능한 슬라이스를 포함할 수도 있고, 각각의 슬라이스는 매우 작은 블록으로 배열될 수도 있는 일련의 매크로블록을 포함할 수도 있다. 매크로블록은 통상적으로 데이터의 16×16 블록을 지칭한다. ITU-T H.264 표준은, 루마 (luma) 컴포넌트에 대해서는 16×16, 8×8, 4×4, 및 크로마 (chroma) 컴포넌트에 대해서는 8×8 과 같은 다양한 블록 사이즈의 인트라 예측뿐만 아니라, 루마 컴포넌트 및 크로마 컴포넌트에 대한 대응하는 스케일링된 사이즈에 대해 16×16, 16×8, 8×16, 8×8, 8×4, 4×8 및 4×4 와 같은 다양한 블록 사이즈의 인터 예측을 지원한다. 본 개시에서, 용어, 비디오 블록은 임의의 사이즈의 비디오 블록을 지칭한다. 비디오 블록은 픽셀 도메인에서의 비디오 데이터의 블록을 지칭할 수도 있고, 또는 이산 코사인 변환 (DCT) 도메인과 같은 변환 도메인에서의 데이터의 블록을 을 지칭할 수도 있다.During the encoding process, video encoder 22 may execute a number of coding techniques or steps. In general, video encoder 22 operates on a video block within individual video frames (or other independently coded units such as slices) to encode a video block. The video block may have a fixed size or a variable size, and may vary in size depending on a particular coding standard. In some cases, each video frame may comprise a series of independently decodable slices, and each slice may include a series of macroblocks that may be arranged in very small blocks. Macroblocks typically refer to 16 × 16 blocks of data. The ITU-T H.264 standard not only provides intra prediction of various block sizes such as 16 × 16, 8 × 8, 4 × 4, and 8 × 8 for chroma components, Inter prediction of various block sizes such as 16 × 16, 16 × 8, 8 × 16, 8 × 8, 8 × 4, 4 × 8, and 4 × 4 for the corresponding scaled sizes for the luma component and chroma component Support. In this disclosure, the term video block refers to video blocks of any size. A video block may refer to a block of video data in the pixel domain, or may refer to a block of data in a transform domain, such as a discrete cosine transform (DCT) domain.

비디오 인코더 (22) 는, 예측 블록을 식별하기 위해, 코딩되고 있는 비디오 블록이 예측 프레임 (또는 다른 코딩된 유닛) 에 비교되는 예측 코딩을 수행할 수도 있다. 코딩되고 있는 현재의 비디오 블록과 예측 블록 사이의 차가 잔차 블록으로서 코딩되고, 예측 신택스가 그 예측 블록을 식별하는데 이용된다. 잔차 블록은 변환 및 양자화될 수도 있다. 변환 기술은 이산 코사인 변환 (DCT) 또는 개념적으로 유사한 프로세스, 정수 변환, 웨이블릿 변환 또는 다른 타입의 변환을 포함할 수도 있다. 일예로 DCT 프로세스에서, 이 변환 프로세스는 픽셀 값들의 세트를 변환 계수들로 변환하고, 이것은 주파수 도메인에서 그 픽셀 값들의 에너지를 나타낼 수도 있다. 양자화는 변환 계수들에 적용되고, 일반적으로 임의의 소정의 변환 계수와 연관된 비트의 수를 제한하는 프로세스와 관련된다.Video encoder 22 may perform predictive coding in which the video block being coded is compared to the predictive frame (or other coded unit) to identify the predictive block. The difference between the current video block being coded and the prediction block is coded as a residual block, and the prediction syntax is used to identify the prediction block. The residual block may be transformed and quantized. The transformation technique may include discrete cosine transformation (DCT) or a conceptually similar process, integer transformation, wavelet transformation, or other type of transformation. In one example, in a DCT process, this transform process transforms a set of pixel values into transform coefficients, which may represent the energy of those pixel values in the frequency domain. Quantization is applied to transform coefficients and generally involves a process of limiting the number of bits associated with any given transform coefficient.

변환 및 양자화에 후속하여, 그 양자화되고 변환된 잔차 비디오 블록에 대해 엔트로피 코딩이 수행될 수도 있다. 신택스 엘리먼트가 또한 엔트로피 코딩에 포함될 수도 있다. 일반적으로, 엔트로피 코딩은, 양자화된 변환 계수들의 시퀀스를 집합적으로 포함하는 하나 이상의 프로세스를 포함한다. 2 차원 비디오 블록으로부터 계수들의 하나 이상의 직렬화된 1 차원 벡터를 정의하기 위해, 양자화된 변환 계수들에 대해 지그재그 스캐닝 기술과 같은 스캐닝 기술이 수행된다. 그 후, 스캐닝된 계수들은, 예를 들어, 컨텐츠 적응형 가변 길이 코딩 (CAVLC), 콘텍스트 적응형 2 진 산술 코딩 (CABAC) 또는 다른 엔트로피 코딩 프로세스를 통해 엔트로피 코딩된다.Following transform and quantization, entropy coding may be performed on the quantized transformed residual video block. Syntax elements may also be included in entropy coding. In general, entropy coding includes one or more processes that collectively comprise a sequence of quantized transform coefficients. In order to define one or more serialized one-dimensional vectors of coefficients from the two-dimensional video block, a scanning technique, such as a zigzag scanning technique, is performed on the quantized transform coefficients. The scanned coefficients are then entropy coded via, for example, content adaptive variable length coding (CAVLC), context adaptive binary arithmetic coding (CABAC) or other entropy coding process.

벡터화된 엔트로피 코딩은, 비디오 블록과 연관된 벡터의 수를 정의하는 벡터 신택스 엘리먼트에 의존하는 비디오 블록의 엔트로피 코딩을 지칭한다. 벡터 신택스 엘리먼트는 예를 들어, 각각의 비디오 프레임 또는 각각의 독립적으로 디코딩가능한 슬라이스 또는 일부의 비디오 프레임과 같은 각각의 코딩된 유닛에 대해 정의될 수도 있다. 벡터 신택스 엘리먼트에 의해 정의된 각각의 벡터는 함께 엔트로피 코딩될 비디오 블록의 계수들의 세트를 정의한다. 코딩된 유닛의 비디오 블록에 대해 다수의 벡터들이 정의되면, 계수들의 수개의 개별적 세트들이 그 코딩된 유닛의 비디오 블록 각각에 대해 별도로 엔트로피 코딩될 것이다. 코딩된 유닛의 비디오 블록에 대해 오직 하나의 벡터만 정의되면, 각 비디오 블록 각각에 대한 계수들 모두가 그 코딩된 유닛에 대해 함께 엔트로피 코딩될 것이다.Vectorized entropy coding refers to entropy coding of a video block that depends on a vector syntax element that defines the number of vectors associated with the video block. The vector syntax element may be defined for each coded unit, such as, for example, each video frame or each independently decodable slice or some video frame. Each vector defined by the vector syntax element defines a set of coefficients of the video block to be entropy coded together. If multiple vectors are defined for a video block of a coded unit, several individual sets of coefficients will be entropy coded separately for each video block of the coded unit. If only one vector is defined for a video block of a coded unit, then all of the coefficients for each video block will be entropy coded together for that coded unit.

본 개시에 따르면, 코딩된 유닛 (예를 들어, 프레임 또는 슬라이스) 의 비디오 블록의 벡터화된 엔트로피 코딩에 대해 정의되는 벡터들의 수에 따라 상이한 타입의 예측 기술이 이용될 수도 있다. 예를 들어, 비디오 인코더 (22) 는, 벡터화된 엔트로피 코딩이 향상 계층 비디오 블록에 대해 2 개 이상의 벡터를 확립하는 경우, 가중 예측을 선택할 수도 있다. 대안적으로, 비디오 인코더 (22) 는, 벡터화된 엔트로피 코딩이 향상 계층 비디오 블록에 대해 단일 벡터를 확립하는 경우, 순차적 예측과 같은 비가중 예측을 선택할 수도 있다. 본 개시에서, 가중 예측은, 예측된 향상 계층 데이터와 예측된 기본 계층 데이터의 조합을 포함하는 가중된 예측 데이터를 참조하는 예측을 지칭한다. 반대로, 순차적 예측은, 코딩되고 있는 블록과 연관된 동일한 계층 (예를 들어, 기본 계층 또는 향상 계층) 의 예측 프레임과 같은 미리 코딩된 데이터를 참조하는 예측을 지칭한다.According to this disclosure, different types of prediction techniques may be used depending on the number of vectors defined for vectorized entropy coding of a video block of a coded unit (eg, frame or slice). For example, video encoder 22 may select weighted prediction if vectorized entropy coding establishes two or more vectors for the enhancement layer video block. Alternatively, video encoder 22 may select unweighted prediction, such as sequential prediction, when vectorized entropy coding establishes a single vector for the enhancement layer video block. In this disclosure, weighted prediction refers to prediction that refers to weighted prediction data that includes a combination of predicted enhancement layer data and predicted base layer data. In contrast, sequential prediction refers to prediction that refers to precoded data, such as prediction frames of the same layer (eg, base layer or enhancement layer) associated with the block being coded.

도 2a 는 순차적 예측을 도시하는 개념도이다. 도 2b 는 (소위, "적응형 세분화" 예측을 포함할 수도 있는) 가중 예측을 도시하는 개념도이다. 또한, 본 개시에 따르면, 비디오 인코더 (22) 및 비디오 디코더 (28) 는 향상 계층 비디오 블록에 적용되는 벡터화된 엔트로피 코딩에 기초하여 향상 계층 비디오 블록에 대한 예측 기술 (예를 들어, 가중 예측 대 비가중 예측) 을 선택할 수도 있다. 예를 들어, 벡터화된 엔트로피 코딩이, 코딩된 유닛의 각각의 비디오 블록에 대해 2 개 이상의 벡터를 정의하는 경우, 적응형 세분화 예측 기술이 선택될 수도 있다.2A is a conceptual diagram illustrating sequential prediction. 2B is a conceptual diagram illustrating weighted prediction (which may include so-called “adaptive segmentation” prediction). Further, in accordance with the present disclosure, video encoder 22 and video decoder 28 are predictive techniques (e.g., weighted prediction versus ratio) for enhancement layer video blocks based on vectorized entropy coding applied to the enhancement layer video blocks. Middle prediction). For example, if vectorized entropy coding defines two or more vectors for each video block of a coded unit, an adaptive segmentation prediction technique may be selected.

스케일러블 비디오 코딩 (SVC) 은 기본 계층 및 하나 이상의 향상 계층을 이용하는 비디오 코딩을 지칭한다. 이 경우, 기본 계층은 기본 레벨의 비디오 품질을 정의할 수도 있고, 하나 이상의 향상 계층은 디코딩된 비디오 신호의 품질을 향상시킬 수도 있다. 향상 계층은, 예를 들어, 기본 계층 프레임에 공간적 향상을 제공하는 것, 기본 계층 프레임의 픽셀 값으로 추가적 비트 깊이를 추가함으로써 신호 대 잡음 향상을 제공하는 것, 또는 기본 계층 프레임들 사이에 추가적 프레임들을 추가함으로써 디코딩된 비디오에 시간적 향상을 제공하는 것과 같은 가능한 다양한 방식으로 비디오 품질을 개선시킬 수도 있다. 기본 계층에서 코딩된 비디오 블록은 기본 계층 비디오 블록으로 지칭되고, 향상 계층에서 인코딩된 비디오 블록은 향상 계층 비디오 블록으로 지칭된다. 도 2a 및 도 2b 에서, 기본 계층 프레임들은 B1 내지 B5 및 B1' 내지 B5' 로 표기되고, 향상 계층 프레임들은 E1 내지 E14 및 E1' 내지 E14' 로 표기된다. 또한, 슬라이스 또는 프레임의 다른 부분들이 더 작은 디코딩가능한 유닛을 정의할 수도 있지만, 프레임이 디코딩가능한 유닛을 정의할 수도 있다.Scalable video coding (SVC) refers to video coding using a base layer and one or more enhancement layers. In this case, the base layer may define the base level video quality, and one or more enhancement layers may improve the quality of the decoded video signal. The enhancement layer may, for example, provide spatial enhancement to the base layer frame, provide signal-to-noise enhancement by adding additional bit depths to the pixel values of the base layer frame, or additional frames between the base layer frames. By adding them, the video quality may be improved in various possible ways, such as providing temporal enhancement to the decoded video. Video blocks coded in the base layer are referred to as base layer video blocks, and video blocks encoded in the enhancement layer are referred to as enhancement layer video blocks. In FIGS. 2A and 2B, the base layer frames are labeled B1 to B5 and B1 'to B5', and the enhancement layer frames are labeled E1 to E14 and E1 'to E14'. Also, while slices or other portions of a frame may define a smaller decodable unit, a frame may define a decodable unit.

도 2a 는 인트라 코딩 기본 계층 및 향상 계층 비디오 블록에서 이용되는 순차적 예측을 도시한다. 이 경우, 기본 계층 프레임 B1 의 블록들이 기본 계층 프레임 B2 의 블록들에 대한 예측 레퍼런스로서 이용된다. 유사하게, 기본 계층 프레임 B2 의 블록들이 기본 계층 프레임 B3 의 블록들에 대한 예측 레퍼런스로서 이용되고, 기본 계층 프레임 B3 의 블록들이 기본 계층 프레임 B4 의 블록들에 대한 예측 레퍼런스로서 이용되는 등이다. 예측 프레임의 예측 비디오 블록에 대한 현재 프레임에서의 현재 비디오 블록의 변위 (displacement) 를 나타내는 모션 벡터를 정의하기 위해 모션 추정이 이용될 수도 있다. 그 후, 모션 보상이 모션 벡터를 이용하여, 예측 프레임으로부터 예측 비디오 블록을 페치 (fetch) 또는 생성한다.2A illustrates sequential prediction used in an intra coding base layer and an enhancement layer video block. In this case, blocks of base layer frame B1 are used as prediction references for the blocks of base layer frame B2. Similarly, blocks of base layer frame B2 are used as prediction references to blocks of base layer frame B3, blocks of base layer frame B3 are used as prediction references to blocks of base layer frame B4, and so on. Motion estimation may be used to define a motion vector that indicates the displacement of the current video block in the current frame relative to the predictive video block of the predictive frame. The motion compensation then uses the motion vector to fetch or generate the predictive video block from the predictive frame.

향상 계층에서, 향상 계층 프레임 E1 의 블록들이 향상 계층 프레임 E2 의 블록들에 대한 예측 레퍼런스로서 이용된다. 유사하게, 향상 계층 프레임 E2 의 블록들이 향상 계층 프레임 E3 의 블록들에 대한 예측 레퍼런스로서 이용되고, 향상 계층 프레임 E3 의 블록들이 향상 계층 프레임 E4 의 블록들에 대한 예측 레퍼런스로서 이용되고, 향상 계층 프레임 E4 의 블록들이 향상 계층 프레임 E5 의 블록들에 대한 예측 레퍼런스로서 이용되는 등이다. 그러나, 도 2a 에 도시된 순차적 예측 기술들과 관련된 하나의 가능한 문제는 에러 드리프트 (error drift) 에 대한 가능성이다. 이 경우, 각각의 연속적 프레임의 비디오 블록이 이전 프레임의 비디오 블록에 의존하기 때문에, 일 프레임 내의 에러들은 후속 프레임들로 전파될 수도 있다.In the enhancement layer, the blocks of the enhancement layer frame E1 are used as the prediction reference for the blocks of the enhancement layer frame E2. Similarly, blocks of enhancement layer frame E2 are used as prediction references to blocks of enhancement layer frame E3, blocks of enhancement layer frame E3 are used as prediction references to blocks of enhancement layer frame E4, and enhancement layer frames Blocks of E4 are used as prediction references to the blocks of enhancement layer frame E5, and so on. However, one possible problem with the sequential prediction techniques shown in FIG. 2A is the potential for error drift. In this case, because the video block of each successive frame depends on the video block of the previous frame, errors within one frame may be propagated to subsequent frames.

특히 향상 계층에서 이러한 에러 드리프트 문제를 처리하기 위해, 가중 예측 기술들이 개발되고 있다. 이 경우, 향상 계층 비디오 블록은, 이전의 기본 및 향상 계층 프레임들의 가중 평균을 포함하는, 예측 프레임의 예측 블록들로부터 예측될 수도 있다. 예를 들어, 예측 프레임 P1' 은 기본 계층 프레임 B1' 과 향상 계층 프레임 E1' 의 가중 보간 (interpolation) 을 포함할 수도 있다. 향상 게층 프레임 E2 의 블록은 예측 프레임 P1' 의 블록에 기초하여 코딩될 수도 있다. 예측 프레임 P2' 는 기본 계층 프레임 B1' 과 향상 계층 프레임 E2' 의 가중 보간을 포함할 수도 있고, 향상 계층 프레임 E3 은 예측 프레임 P2' 의 블록에 기초하여 코딩될 수도 있다. 예측 프레임 P3' 은 기본 계층 프레임 B1' 과 향상 계층 프레임 E3' 의 가중 보간을 포함할 수도 있고, 향상 계층 프레임 E4 는 예측 프레임 P3' 의 블록에 기초하여 코딩될 수도 있다. 도 2b 에 도시된 점선이 보간을 나타내고, 역방향 화살표가 소정의 프레임을 코딩하는데 이용된 예측 프레임을 나타낸다.In order to address this error drift problem, especially in the enhancement layer, weighted prediction techniques are being developed. In this case, the enhancement layer video block may be predicted from the predictive blocks of the predictive frame, including a weighted average of previous base and enhancement layer frames. For example, prediction frame P1 ′ may include weighted interpolation of base layer frame B1 ′ and enhancement layer frame E1 ′. The block of the enhancement layer frame E2 may be coded based on the block of the prediction frame P1 '. Prediction frame P2 'may include weighted interpolation of base layer frame B1' and enhancement layer frame E2 ', and enhancement layer frame E3 may be coded based on a block of prediction frame P2'. Prediction frame P3 'may include weighted interpolation of base layer frame B1' and enhancement layer frame E3 ', and enhancement layer frame E4 may be coded based on a block of prediction frame P3'. The dotted line shown in FIG. 2B represents interpolation, and the reverse arrow represents the predictive frame used to code the given frame.

도 2b 에 도시된 바와 같은 가중 예측은 에러 드리프트를 회피하는 것을 도울 수도 있다. 예를 들어, 향상 계층 프레임 E2' 에서 에러가 나타나면, 이 에러는 기본 계층 프레임 B1' 에 대한 P2' 의 부분적 의존에 기인하여 예측 프레임 P2' 에서 완화될 수도 있다. 도 2a 에 도시된 바와 같이 순차적 예측은 에러 전파의 단점을 가지면서 시간적 리던던시를 이용하는 이점을 갖는다. 반대로, 기본 계층 프레임에만 기초한 향상 계층 프레임의 예측은 감소된 에러 전파의 이점을 가질 수도 있지만, 순차적 예측뿐만 아니라 (압축을 개선할 수 있는) 시간적 리던던시의 현상을 이용하지 않는다. 도 2b 에 도시된 가중된 예측 방식은 이 이점 및 단점을 밸런싱 (balancing) 하여, (시간적 리던던시를 이용하는 것에 기인한) 높은 압축과 (강인한 기본 계층 프레임에의 의존성에 기인한) 완화된 에러 전파의 바람직한 밸런스를 달성할 수 있다.Weighted prediction as shown in FIG. 2B may help to avoid error drift. For example, if an error appears in enhancement layer frame E2 ', this error may be mitigated in prediction frame P2' due to the partial dependence of P2 'on base layer frame B1'. As shown in FIG. 2A, sequential prediction has the advantage of using temporal redundancy with the disadvantage of error propagation. Conversely, prediction of an enhancement layer frame based only on the base layer frame may have the advantage of reduced error propagation, but does not exploit the phenomenon of temporal redundancy (which can improve compression) as well as sequential prediction. The weighted prediction scheme shown in FIG. 2B balances these advantages and disadvantages to provide high compression (due to using temporal redundancy) and mitigated error propagation (due to dependence on robust base layer frames). Desirable balance can be achieved.

가중 예측은 가중된 예측 프레임을 생성하는데 이용된 향상 계층 및 기본 계층 프레임에 가중치를 할당할 수도 있다. 또한, 이 가중치 팩터들은 시간에 따라 변경되거나 적응될 수도 있다. 가중치 팩터들은 때때로 "누설 팩터 (leaky factor) 들" 로 지칭되고, 다른 용어로 정의될 수도 있다. 어떤 경우든, 본 개시의 기술들은 상이하게 가중된 예측 프레임을 정의하는데 이용되는 가중치 팩터의 타입에 의존하지 않는다.Weighted prediction may assign weights to the enhancement layer and base layer frames used to generate the weighted prediction frames. In addition, these weight factors may be changed or adapted over time. Weight factors are sometimes referred to as "leaky factors" and may be defined in other terms. In any case, the techniques of this disclosure do not depend on the type of weight factor used to define differently weighted prediction frames.

전술한 바와 같이, 상이한 타입의 예측 기술은 코딩된 유닛의 비디오 블록의 벡터화된 엔트로피 코딩에 대해 정의된 벡터의 수에 의존하여 이용될 수도 있다. 예를 들어, 벡터화된 엔트로피 코딩이 향상 계층 비디오 블록에 대해 2 개 이상의 벡터를 확립하는 경우, 도 2b 에 도시된 바와 유사한 가중 예측이 선택될 수도 있다. 대안적으로, 벡터화된 엔트로피 코딩이 향상 계층 비디오 블록에 대해 단일 벡터를 확립하는 경우, 도 2a 에 도시된 바와 유사한 순차적 예측과 같은 비가중 예측이 선택될 수도 있다. 벡터화된 엔트로피 코딩은, 소정의 프레임 또는 다른 코딩된 유닛에 대해 단일 벡터를 정의하거나, 소정의 프레임 또는 다른 코딩된 유닛에 대해 수개의 벡터를 정의하는 능력을 디스에이블시킴으로써 단일 벡터를 확립할 수도 있다.As mentioned above, different types of prediction techniques may be used depending on the number of vectors defined for vectorized entropy coding of the video block of the coded unit. For example, if vectorized entropy coding establishes two or more vectors for an enhancement layer video block, weighted prediction similar to that shown in FIG. 2B may be selected. Alternatively, if vectorized entropy coding establishes a single vector for the enhancement layer video block, unweighted prediction, such as sequential prediction similar to that shown in FIG. 2A, may be selected. Vectorized entropy coding may establish a single vector by either defining a single vector for a given frame or other coded unit, or disabling the ability to define several vectors for a given frame or other coded unit. .

도 2c 는 가중된 예측 프레임 (P1'' 내지 P5'') 에 기초하여 향상 계층 프레임 (E1'' 내지 E5'') 의 예측을 도시하는 다른 개념도이다. 이 경우, 기본 계층 프레임 B2'' 와 향상 계층 프레임 E1'' 의 가중 보간이 예측 프레임 P2'' 를 정의한다. 유사하게, 기본 계층 프레임 B3'' 과 향상 계층 프레임 E2'' 의 가중 보간이 예측 프레임 P3'' 를 정의하는 등이다. 도 2b 에서와 같이, 가중치 팩터들은 시간에 따라 변경되거나 적응될 수도 있다. 어떤 경우든, 도 2c 의 예에서는, (현재의 향상 계층 프레임에 의해 정렬된) 시간적으로 정렬된 기본 계층 프레임 및 이전의 향상 계층 프레임이 예측 프레임을 정의하도록 보간될 수도 있다. 기본 계층과 향상 계층 보간의 다른 가중된 조합이 또한 이용되어 예측 프레임을 정의할 수도 있다.2C is another conceptual diagram illustrating prediction of enhancement layer frames E1 ″ through E5 ″ based on weighted prediction frames P1 ″ through P5 ″. In this case, weighted interpolation between the base layer frame B2 " and the enhancement layer frame E1 " defines the prediction frame P2 ". Similarly, weighted interpolation of base layer frame B3 " and enhancement layer frame E2 " defines prediction frame P3 " As in FIG. 2B, the weight factors may change or adapt over time. In any case, in the example of FIG. 2C, the temporally aligned base layer frame (aligned by the current enhancement layer frame) and the previous enhancement layer frame may be interpolated to define the prediction frame. Other weighted combinations of base layer and enhancement layer interpolation may also be used to define the predictive frame.

도 3a 및 도 3b 는 벡터화된 엔트로피 코딩의 개념을 설명하는 것을 돕는다. 도 3a 는, 예를 들어, 향상 계층과 연관된 데이터의 변환된 잔차 블록과 같은 4×4 비디오 블록의 지그재그 스캐닝을 도시하는 개념도이다. 도 3b 는 도 3a 의 블록의 지그재그 스캐닝 동안 적용되는 상이한 벡터 제어 신호와 연관된 벡터를 도시하는 도면이다.3A and 3B help explain the concept of vectorized entropy coding. 3A is a conceptual diagram illustrating zigzag scanning of a 4x4 video block, such as, for example, a transformed residual block of data associated with an enhancement layer. FIG. 3B is a diagram illustrating vectors associated with different vector control signals applied during zigzag scanning of the block of FIG. 3A.

도 3a 에서, 화살표는 데이터의 2 차원 블록을 데이터의 선형 시퀀스로 직렬화하는데 이용되는 지그재그 패턴을 도시한다. 지그재그 스캐닝은 단지 일예이고, 일반적으로, 스캐닝은 매우 다양한 패턴 또는 스캔 순서에 따를 수도 있다. 그러나, 벡터화된 엔트로피 코딩을 지원하기 위해 스캐닝이 벡터화된다는 점이 중요하다. 더 상세하게는, 벡터 제어 신호 (또는 다른 신택스 엘리먼트) 는, 도 3a 에 도시된 비디오 블록을 스캐닝하는 것으로부터 나타날 1 차원 벡터의 수 및 사이즈를 정의할 수도 있다.In FIG. 3A, the arrows show the zigzag pattern used to serialize a two-dimensional block of data into a linear sequence of data. Zigzag scanning is just one example, and in general, scanning may follow a wide variety of patterns or scan orders. However, it is important that scanning is vectorized to support vectorized entropy coding. More specifically, the vector control signal (or other syntax element) may define the number and size of one-dimensional vectors that will appear from scanning the video block shown in FIG. 3A.

예를 들어, 도 3b 에 도시된 바와 같이, 벡터 제어 신호가 수 16 을 특정하면 (항목 101 참조), 이것은, 도 3a 에 도시된 4×4 계수 비디오 블록의 16 개의 상이한 계수가 단일 벡터에 포함됨을 나타낼 수도 있다. 더 상세하게는, 16 의 벡터 제어 신호는 계수 1 내지 16 을 포함하는 단일 벡터를 생성할 수도 있다. 이 경우, 엔트로피 코딩은 계수 1 내지 16 의 전체 세트에 적용된다. 이 시나리오는 또한, 제어 신호 16 (항목 101 참조) 을 통해 단일 벡터를 정의하기 보다는, 소정의 코딩된 유닛에 대해 수개의 벡터를 정의하는 능력을 디스에이블시킴으로써 벡터화된 엔트로피 코딩에 대해 정의될 수도 있다. 벡터가 디스에이블되는 경우, 이것은, 벡터화된 코딩에 있어서 소정의 코딩된 유닛에 대해 단일 벡터를 정의하는 것과 동일한 효과를 갖는다.For example, as shown in FIG. 3B, if the vector control signal specifies the number 16 (see item 101), this means that 16 different coefficients of the 4 × 4 coefficient video block shown in FIG. 3A are included in a single vector. It may also indicate. More specifically, the vector control signal of 16 may produce a single vector comprising coefficients 1-16. In this case, entropy coding is applied to the entire set of coefficients 1-16. This scenario may also be defined for vectorized entropy coding by disabling the ability to define several vectors for a given coded unit, rather than defining a single vector via control signal 16 (see item 101). . If the vector is disabled, this has the same effect as defining a single vector for a given coded unit in vectorized coding.

반대로, 벡터 제어 신호가 수 3 및 16 을 특정하면 (항목 102 참조), 이것은, 도 3a 에 도시된 비디오 블록의 상이한 계수가, 하나는 계수 1 내지 3 을 갖고 다른 하나는 계수 4 내지 16 을 갖는 2 개의 상이한 벡터에 포함됨을 나타낼 수도 있다. 이 경우, 엔트로피 코딩은 계수 1 내지 3 및 계수 4 내지 16 의 2 개의 상이한 세트에 별도로 적용된다.Conversely, if the vector control signal specifies numbers 3 and 16 (see item 102), this means that different coefficients of the video block shown in FIG. 3A have one coefficients 1 to 3 and the other coefficients 4 to 16 It may be included in two different vectors. In this case, entropy coding is applied separately to two different sets of coefficients 1-3 and coefficients 4-16.

벡터 제어 신호가 수 2, 8 및 16 을 특정하면 (항목 103 참조), 이것은, 도 3a 에 도시된 비디오 블록의 상이한 계수가, 하나는 계수 1 및 2 를 갖고, 하나는 계수 3 내지 8 을 갖고, 하나는 계수 9 내지 16 을 갖는 3 개의 상이한 벡터에 포함됨을 나타낼 수도 있다. 이 경우, 엔트로피 코딩은 계수 1 및 2, 계수 3 내지 8 및 계수 9 내지 16 의 3 개의 상이한 세트에 별도로 적용된다. 벡터 제어 신호가 수 3, 6, 11, 16 을 특정하면 (항목 104 참조), 이것은, 도 3a 에 도시된 비디오 블록의 상이한 계수가, 하나는 계수 1 내지 3 을 갖고, 하나는 계수 4 내지 6 을 갖고, 하나는 계수 7 내지 11 을 갖고, 하나는 계수 12 내지 16 을 갖는 4 개의 상이한 벡터에 포함됨을 나타낼 수도 있다. 이 경우, 엔트로피 코딩은 계수 1 내지 3, 계수 4 내지 6, 계수 7 내지 11 및 계수 12 내지 16 의 4 개의 상이한 세트에 별도로 적용된다.If the vector control signal specifies the numbers 2, 8, and 16 (see item 103), this means that the different coefficients of the video block shown in FIG. 3A, one has coefficients 1 and 2 and one has coefficients 3 to 8 , One may indicate inclusion in three different vectors having coefficients 9-16. In this case, entropy coding is applied separately to three different sets of coefficients 1 and 2, coefficients 3 to 8 and coefficients 9 to 16. If the vector control signal specifies the numbers 3, 6, 11, 16 (see item 104), this means that the different coefficients of the video block shown in FIG. 3A, one has coefficients 1 to 3 and one coefficients 4 to 6 May be included in four different vectors having coefficients 7-11, and one having coefficients 12-16. In this case, entropy coding is applied separately to four different sets of coefficients 1-3, coefficients 4-6, coefficients 7-11 and coefficients 12-16.

상이한 벡터의 수 또는 사이즈를 특정하기 위해 이용되는 실제 신택스는 매우 다양한 구현을 따른다. 따라서, 도 3b 에 도시된 예시적인 신택스는 본 개시의 개념을 오직 예시하는데 이용되고, 제어 신호의 내용 또는 포맷을 한정하는 것으로 고려되어서는 안된다. 코딩된 유닛의 비디오 블록에 대한 벡터를 정의하기 위한 포맷은 광범위하게 변할 수 있다.The actual syntax used to specify the number or size of different vectors follows a wide variety of implementations. Thus, the example syntax shown in FIG. 3B is used only to illustrate the concepts of the present disclosure and should not be considered as limiting the content or format of the control signal. The format for defining a vector for a video block of a coded unit can vary widely.

본 개시의 기술들은, 벡터화된 엔트로피 코딩이 (도 3b 의 항목 101 의 예에서와 같이) 단일 벡터를 특정하는지 또는 (도 3b 의 항목 102, 103 및 104 의 예에서와 같이) 복수의 벡터를 특정하는지 여부에 기초한 예측 (예를 들어, 가중 예측 또는 비가중 예측) 을 정의한다. 더 상세하게는, 엔트로피 코딩을 위해, 벡터화된 엔트로피 코딩이 코딩된 유닛의 비디오 블록에 대해 복수의 벡터를 정의하는 경우에는 언제나 가중 예측이 이용되고, 엔트로피 코딩을 위한 코딩된 유닛의 비디오 블록에 대해 단일 벡터가 정의되는 경우에는 언제나 순차적 예측과 같은 비가중 예측이 이용된다. 16 의 벡터 제어 신호를 선택하거나 (예를 들면, 도 6의 항목 (101) 에 나타낸 것과 같은) 또는 가능하게는 소정의 코딩된 유닛에 대한 벡터 코딩 모드를 완전히 디스에이블시킴으로써, 엔트로피 코딩을 위해 코딩된 유닛의 비디오 블록에 대해 단일 벡터가 정의될 수도 있다. 어떤 방식으로든, 엔트로피 코딩을 위해 코딩된 유닛의 비디오 블록에 대해 단일 벡터가 정의되면, 비가중 예측이 이용될 수도 있다.The techniques of this disclosure specify whether vectorized entropy coding specifies a single vector (as in the example of item 101 of FIG. 3B) or specifies a plurality of vectors (as in the examples of items 102, 103 and 104 of FIG. 3B). Define predictions (eg, weighted predictions or unweighted predictions) based on whether they More specifically, for entropy coding, weighted prediction is always used when vectorized entropy coding defines a plurality of vectors for a video block of a coded unit, and for the video block of the coded unit for entropy coding Whenever a single vector is defined, unweighted prediction such as sequential prediction is used. Coding for entropy coding by selecting 16 vector control signals (eg, as shown in item 101 of FIG. 6) or possibly disabling the vector coding mode for a given coded unit completely A single vector may be defined for the video block of a given unit. In any way, unweighted prediction may be used if a single vector is defined for the video block of the coded unit for entropy coding.

소스 디바이스 (12) 의 인코더 (22) 는, 이용되어야 하는 예측의 타입을 정의하기 위해 제어 신호를 신택스의 일부로서 수신지 디바이스 (18) 의 디코더 (28) 로 통신할 수도 있고, 또는 대안적으로, 인코더 (22) 및 디코더 (28) 는, 벡터화된 엔트로피 코딩이 인에이블인지 여부 및 비디오 블록에 대해 2 개 이상의 벡터가 정의되는지 여부에 기초하여, 이용할 예측의 타입을 자동으로 결정할 수도 있다.Encoder 22 of source device 12 may communicate the control signal to decoder 28 of destination device 18 as part of the syntax to define the type of prediction that should be used, or alternatively Encoder 22 and decoder 28 may automatically determine the type of prediction to use based on whether vectorized entropy coding is enabled and whether two or more vectors are defined for the video block.

도 4 는 본 개시에 따른 비디오 인코더 (50) 를 도시하는 블록도이다. 비디오 인코더 (50) 는 디바이스 (20) 의 비디오 인코더 (22) 또는 다른 디바이스의 비디오 인코더에 대응할 수도 있다. 도 4 에 도시된 바와 같이, 비디오 인코더 (50) 는 제어 유닛 (31), 예측 유닛 (32) 및 레퍼런스 프레임 저장 엘리먼트 (34) 를 포함한다. 비디오 인코더는 또한 변환 유닛 (38) 및 양자화 유닛 (40) 뿐만 아니라 역양자화 유닛 (42), 역변환 유닛 (44) 및 가산기 (48 및 51) 를 포함한다. 마지막으로, 비디오 인코더 (50) 는 또한 벡터 스캔 유닛 (45) 및 엔트로피 코딩 유닛 (46) 을 포함한다.4 is a block diagram illustrating video encoder 50 according to this disclosure. Video encoder 50 may correspond to video encoder 22 of device 20 or video encoder of another device. As shown in FIG. 4, video encoder 50 includes a control unit 31, a prediction unit 32, and a reference frame storage element 34. The video encoder also includes inverse quantization unit 42, inverse transform unit 44, and adders 48 and 51 as well as transform unit 38 and quantization unit 40. Finally, video encoder 50 also includes vector scan unit 45 and entropy coding unit 46.

본 개시에 따르면, 제어 유닛 (31) 은, 코딩되고 있는 비디오 시퀀스 내의 코딩된 유닛의 향상 계층 비디오 블록의 벡터화된 엔트로피 코딩에 대해 하나 이상의 벡터를 정의한다. 제어 유닛 (31) 은 또한, 정의된 벡터화된 엔트로피 코딩에 기초하여 코딩된 유닛의 향상 계층 비디오 블록에 대한 예측 모드를 선택한다. 더 상세하게는, 제어 유닛 (31) 은, 그 정의된 벡터화된 엔트로피 코딩이 향상 계층 비디오 블록에 대해 2 개 이상의 벡터를 확립하는 경우, 가중 예측을 선택한다. 대안적으로, 제어 유닛 (31) 은, 그 정의된 벡터화된 엔트로피 코딩이 향상 계층 비디오 블록에 대해 단일 벡터를 확립하는 경우, 순차적 예측과 같은 비가중 예측을 선택할 수도 있다. 또한, (도 3b 의 항목 101 에 도시된 바와 같이) 16 의 벡터 제어 신호를 선택하거나, 소정의 코딩된 유닛에 대해 벡터 코딩 모드를 완전히 디스에이블시킴으로써, 엔트로피 코딩을 위해 코딩된 유닛의 향상 계층 비디오 블록에 대해 단일 벡터가 정의될 수도 있다. 벡터는 임의의 방식으로 정의될 수도 있고, 상이한 계층에 할당된 데이터의 양을 밸런싱 또는 정의하도록 정의될 수도 있다.According to the present disclosure, control unit 31 defines one or more vectors for vectorized entropy coding of the enhancement layer video block of the coded unit in the video sequence being coded. Control unit 31 also selects a prediction mode for the enhancement layer video block of the coded unit based on the defined vectorized entropy coding. More specifically, control unit 31 selects weighted prediction if its defined vectorized entropy coding establishes two or more vectors for the enhancement layer video block. Alternatively, control unit 31 may select unweighted prediction, such as sequential prediction, when its defined vectorized entropy coding establishes a single vector for the enhancement layer video block. In addition, enhancement layer video of the coded unit for entropy coding, by selecting 16 vector control signals (as shown in item 101 of FIG. 3B) or completely disabling the vector coding mode for a given coded unit. A single vector may be defined for the block. The vector may be defined in any manner, or may be defined to balance or define the amount of data allocated to different layers.

예측 유닛 (32) 은 제어 유닛 (31) 으로부터의 예측 제어 신호에 의해 정의되는 선택된 예측 모드에 기초하여 예측 코딩 기술을 수행한다. 따라서, 예측 유닛 (32) 은 가중 예측 또는 비가중 예측을 지원할 수도 있지만, 제어 유닛 (31) 의 지시에 따라 적절한 예측 기술을 적용한다. 대응하는 벡터 제어 신호가 또한 제어 유닛 (31) 으로부터 벡터 스캔 유닛 (45) 및 엔트로피 코딩 유닛 (46) 에 전송된다. 벡터 스캔 유닛 (45) 은 벡터화된 스캐닝을 수행하고, 엔트로피 코딩 유닛 (46) 은 벡터화된 엔트로피 코딩을 수행한다.Prediction unit 32 performs a prediction coding technique based on the selected prediction mode defined by the prediction control signal from control unit 31. Thus, prediction unit 32 may support weighted prediction or unweighted prediction, but applies an appropriate prediction technique in accordance with the instruction of control unit 31. The corresponding vector control signal is also sent from the control unit 31 to the vector scan unit 45 and the entropy coding unit 46. Vector scan unit 45 performs vectorized scanning, and entropy coding unit 46 performs vectorized entropy coding.

향상 계층 비디오 블록의 인터 코딩에 있어서, 예측 유닛 (32) 은 인코딩될 비디오 블록을 하나 이상의 비디오 레퍼런스 프레임 내의 다양한 블록들과 비교한다. 예측된 데이터가 레퍼런스 프레임 저장부 (34) 로부터 검색될 수도 있고, (도 2a 에 도시된 바와 같이) 이전의 향상 계층 프레임의 비디오 블록 또는 (도 2b 에 도시된 바와 같이) 이전의 향상 계층 프레임과 기본 계층 프레임의 가중 조합을 포함할 수도 있다.In inter coding of an enhancement layer video block, prediction unit 32 compares the video block to be encoded with various blocks in one or more video reference frames. The predicted data may be retrieved from the reference frame store 34, and may include a video block of a previous enhancement layer frame (as shown in FIG. 2A) or a previous enhancement layer frame (as shown in FIG. 2B). It may also include a weighted combination of base layer frames.

예측 유닛 (32) 은 모션 벡터와 같은 예측 신택스를 생성할 수도 있고, 모션 벡터는 현재의 향상 계층 비디오 블록을 코딩하는데 이용된 예측 블록을 식별하는데 이용될 수 있다. 예측 유닛 (32) 은, 예측 블록을 나타내고 모션 벡터에 기초하여 예측 블록을 생성하는 모션 벡터를 식별하는 모션 추정 및 모션 보상 유닛을 포함할 수도 있다. 통상적으로, 모션 추정은 모션 벡터를 생성하는 프로세스로 고려되고, 모션을 추정한다. 예를 들어, 모션 벡터는 현재의 프레임 내에서 코딩되고 있는 현재의 블록에 대한 예측 프레임 내의 예측 블록의 변위를 나타낼 수도 있다. 모션 보상은 통상적으로, 모션 추정에 의해 결정된 모션 벡터에 기초하여 예측 블록을 페치 또는 생성하는 프로세스로 고려된다.Prediction unit 32 may generate a prediction syntax, such as a motion vector, which may be used to identify the predictive block used to code the current enhancement layer video block. Prediction unit 32 may include a motion estimation and motion compensation unit that indicates the predictive block and identifies the motion vector that generates the predictive block based on the motion vector. Typically, motion estimation is considered a process of generating a motion vector and estimates motion. For example, the motion vector may represent the displacement of the prediction block in the prediction frame relative to the current block being coded in the current frame. Motion compensation is typically considered a process of fetching or generating a predictive block based on a motion vector determined by motion estimation.

변환 유닛 (38) 은 이산 코사인 변환 (DCT) 또는 개념적으로 이와 유사한 변환과 같은 변환을 잔차 블록에 적용하여, 잔여 변환 블록 계수를 포함하는 비디오 블록을 생성한다. 블록 변환 유닛 (38) 은, 예를 들어, DCT 와 개념적으로 유사한 H.264 표준에 의해 정의되는 다른 변환들을 수행할 수도 있다. 대안적으로, 웨이블릿 변환 또는 정수 변환이 이용될 수도 있다.Transform unit 38 applies a transform, such as a discrete cosine transform (DCT) or a conceptually similar transform, to the residual block, producing a video block comprising residual transform block coefficients. Block transform unit 38 may, for example, perform other transforms defined by the H.264 standard conceptually similar to DCT. Alternatively, wavelet transform or integer transform may be used.

양자화 유닛 (40) 은 잔여 변환 계수를 양자화하여 비트 레이트를 추가적으로 감소시킨다. 양자화 유닛 (40) 은, 예를 들어, 계수 각각을 코딩하는데 이용된 비트의 수를 제한할 수도 있다. 양자화 이후, 벡터 스캔 유닛 (45) 은 양자화된 계수 블록을 2 차원 표현으로부터 하나 이상의 직렬화된 1 차원 벡터로 스캐닝한다. 또한, 소정의 비디오 블록에 대해 스캐닝된 벡터의 수는 처음에 제어 유닛 (31) 에 의해 정의되고, 제어 유닛 (31) 은 벡터의 수를 선택하고 예측 기술을 선택한다. 제어 유닛 (31) 으로부터 벡터 스캔 유닛 (45) 으로의 벡터 제어 신호는 비디오 블록을 스캐닝하는 방법 및 생성할 벡터의 수를 벡터 스캔 유닛 (45) 에 통지한다. 스캔 순서는 (지그재그 스캐닝과 같이) 미리 프로그래밍될 수도 있고, 이전의 코딩 통계에 기초하여 적응형일 수도 있다.Quantization unit 40 quantizes the residual transform coefficients to further reduce bit rate. Quantization unit 40 may, for example, limit the number of bits used to code each of the coefficients. After quantization, vector scan unit 45 scans the quantized coefficient blocks from the two-dimensional representation into one or more serialized one-dimensional vectors. Also, the number of vectors scanned for a given video block is initially defined by the control unit 31, which selects the number of vectors and selects the prediction technique. The vector control signal from the control unit 31 to the vector scan unit 45 informs the vector scan unit 45 of how to scan the video block and the number of vectors to generate. The scan order may be preprogrammed (such as zigzag scanning) or may be adaptive based on previous coding statistics.

이 스캐닝 프로세스에 후속하여, 엔트로피 인코딩 유닛 (46) 은 CAVLC 또는 CABAC 와 같은 엔트로피 코딩 방법에 따라 양자화된 변환 계수를 인코딩하여 데이터를 추가적으로 압축한다. 더 상세하게는, 엔트로피 인코딩 유닛 (46) 은 제어 유닛 (31) 으로부터 전송된 벡터 제어 신호에 기초하여 벡터화된 코딩을 적용한다. 예를 들어, 엔트로피 코딩 유닛 (46) 은 벡터 스캔 유닛 (45) 에 의해 스캐닝된 상이한 벡터들 각각에 대해 엔트로피 코딩을 별도로 적용할 수도 있다. 비디오 블록에 대해 단일 벡터가 정의되면, 엔트로피 코딩 유닛 (46) 은 각각의 비디오 블록의 계수 모두에 대응하는 계수들의 세트에 엔트로피 코딩을 적용할 수도 있다. 본 개시에서, 엔트로피 코딩은 컨텐츠 적응형 가변 길이 코딩 (CAVLC), 콘텍스트 적응형 2 진 산술 코딩 (CABAC) 또는 다른 엔트로피 코딩 방법과 같은 임의의 광범위한 엔트로피 코딩 방법을 지칭한다.Following this scanning process, entropy encoding unit 46 encodes the quantized transform coefficients according to an entropy coding method such as CAVLC or CABAC to further compress the data. More specifically, entropy encoding unit 46 applies vectorized coding based on the vector control signal sent from control unit 31. For example, entropy coding unit 46 may apply entropy coding separately for each of the different vectors scanned by vector scan unit 45. If a single vector is defined for a video block, entropy coding unit 46 may apply entropy coding to the set of coefficients that correspond to all of the coefficients of each video block. In this disclosure, entropy coding refers to any of a wide range of entropy coding methods, such as content adaptive variable length coding (CAVLC), context adaptive binary arithmetic coding (CABAC), or other entropy coding methods.

CAVLC 는 ITU H.264/MPEG4, AVC 표준에 의해 지원되는 엔트로피 코딩 기술의 일 타입이고, 엔트로피 코딩 유닛 (46) 에 의해 벡터화 기반으로 적용될 수도 있다. CAVLC 는, 변환 계수의 직렬화된 "런 (runs)" 을 효과적으로 압축하는 방식으로 가변 길이 코딩 (VLC) 테이블을 이용한다. 이 경우, 벡터 스캔 유닛 (45) 에 의해 스캐닝된 각각의 개별적 벡터는 CAVLC 에 따라 엔트로피 코딩 유닛 (46) 에 의해 코딩된다. 이 경우, 엔트로피 코딩 유닛 (46) 은 CAVLC 에 따라 벡터 스캔 유닛 (45) 에 의해 스캐닝된 각각의 개별적 벡터를 코딩한다.CAVLC is a type of entropy coding technique supported by the ITU H.264 / MPEG4, AVC standard, and may be applied on a vectorization basis by entropy coding unit 46. CAVLC uses a variable length coding (VLC) table in a manner that effectively compresses the serialized "runs" of the transform coefficients. In this case, each individual vector scanned by vector scan unit 45 is coded by entropy coding unit 46 according to CAVLC. In this case, entropy coding unit 46 codes each individual vector scanned by vector scan unit 45 according to CAVLC.

CABAC 는 ITU H.264/MPEG4, AVC 표준에 의해 지원되는 엔트로피 코딩 기술의 다른 타입이고, 엔트로피 코딩 유닛 (46) 에 의해 벡터화 기반으로 적용될 수도 있다. CABAC 는 2 진화, 콘텍스트 모델 선택, 및 2 진 산술 코딩을 포함하여, 다수의 스테이지에 관련될 수도 있다. 이 경우, 엔트로피 코딩 유닛 (46) 은 CABAC 에 따라 벡터 스캔 유닛 (45) 에 의해 스캐닝된 각각의 개별적 벡터를 코딩한다. 다수의 다른 타입의 엔트로피 코딩 기술이 또한 존재하고, 새로운 엔트로피 코딩 기술들이 장래에 생성할 것이다. 본 개시는 임의의 특정한 엔트로피 코딩 기술에 한정되지 않으며, 예를 들어, 제어 유닛 (31) 으로부터의 벡터화된 제어 신호의 지시에 따라, 벡터화 기반으로 소정의 엔트로피 코딩 기술을 단순히 적용한다.CABAC is another type of entropy coding technique supported by the ITU H.264 / MPEG4, AVC standard, and may be applied on a vectorization basis by entropy coding unit 46. CABAC may be involved in multiple stages, including binary evolution, context model selection, and binary arithmetic coding. In this case, entropy coding unit 46 codes each individual vector scanned by vector scan unit 45 according to CABAC. Many other types of entropy coding techniques also exist, and new entropy coding techniques will be created in the future. The present disclosure is not limited to any particular entropy coding technique, and simply applies certain entropy coding techniques on the vectorization basis, for example, in accordance with the indication of the vectorized control signal from the control unit 31.

엔트로피 인코딩 유닛 (46) 에 의한 엔트로피 코딩에 후속하여, 인코딩된 비디오는 다른 디바이스로 송신될 수도 있고, 또는 추후의 송신 또는 검색을 위해 보관될 수도 있다. 인코딩된 비디오는 엔트로피 코딩된 벡터 및 다양한 신택스를 포함할 수도 있고, 이것은 디코더에 의해 이용되어 디코딩 프로세스를 적절하게 구성할 수 있다. 역양자화 유닛 (42) 및 역변환 유닛 (44) 은 역양자화 및 역변환을 각각 적용하여, 픽셀 도메인에서 잔차 블록을 재구성한다. 합산기 (51) 는 재구성된 잔차 블록을 예측 유닛 (32) 에 의해 생성된 예측 블록에 가산하여, 레퍼런스 프레임 저장부 (34) 에 저장하기 위한 재구성된 비디오 블록을 생성한다. 원한다면, 재구성된 비디오 블록은 또한 레퍼런스 프레임 저장부 (34) 에 저장되기 전에 디블로킹 필터 유닛 (미도시) 을 통과할 수도 있다. 재구성된 비디오 블록은, 후속 비디오 프레임에서 블록을 인터코딩하기 위한 레퍼런스 블록으로서, 또는 후속 비디오 프레임의 블록의 가중된 예측에 이용되는 예측 블록의 가중된 부분으로서 예측 유닛 (32) 에 의해 이용될 수도 있다.Following entropy coding by entropy encoding unit 46, the encoded video may be transmitted to another device or may be archived for later transmission or retrieval. The encoded video may include entropy coded vectors and various syntaxes, which may be used by the decoder to properly configure the decoding process. Inverse quantization unit 42 and inverse transform unit 44 apply inverse quantization and inverse transformation, respectively, to reconstruct the residual block in the pixel domain. Summer 51 adds the reconstructed residual block to the prediction block generated by prediction unit 32 to generate a reconstructed video block for storing in reference frame store 34. If desired, the reconstructed video block may also pass through a deblocking filter unit (not shown) before being stored in reference frame store 34. The reconstructed video block may be used by prediction unit 32 as a reference block for intercoding a block in a subsequent video frame, or as a weighted portion of a prediction block used for weighted prediction of a block of a subsequent video frame. have.

도 5 는, 여기서 설명한 방식으로 인코딩된 비디오 시퀀스를 디코딩하는 비디오 디코더 (60) 의 일예를 도시하는 블록도이다. 수신된 비디오 시퀀스는 이미지 프레임, 영상 그룹 (GOP), 또는 광범위하게 다양한 코딩된 비디오의 인코딩된 세트를 포함하고 이는 인코딩된 비디오 블록 및 신택스를 포함하여 이러한 비디오 블록을 디코딩하는 방법을 정의한다.5 is a block diagram illustrating an example of a video decoder 60 that decodes a video sequence encoded in the manner described herein. The received video sequence includes an encoded set of image frames, groups of pictures (GOPs), or a wide variety of coded videos, which defines how to decode such video blocks, including encoded video blocks and syntax.

비디오 디코더 (60) 는, 여기서 설명하는 방식으로 예측 디코딩 및 벡터화된 엔트로피 디코딩을 제어하는 제어 유닛 (31) 을 포함한다. 더 상세하게는, 제어 유닛 (31) 은 인코딩된 비디오 비트스트림을 수신하고, 벡터화된 엔트로피 코딩이 인에이블인지 여부, 및 벡터의 사이즈 및 수를 식별하는 신택스를 결정하기 위해 그 비트스트림을 파싱한다. 제어 유닛 (31) 은 코딩된 비디오를 엔트로피 디코딩 유닛 (52) 에 포워딩하고, 또한 제어 신호를 예측 유닛 (54), 스캔 유닛 (55) 및 엔트로피 디코딩 유닛 (52) 에 포워딩한다. 제어 신호는, 벡터화된 엔트로피 디코딩에 대해 2 개 이상의 벡터가 정의되는 경우에는 언제나 가중 예측이 이용되고, (예를 들어, 단일 벡터를 정의하거나 소정의 코딩 유닛에 대해 벡터 코딩 모드를 디스에이블시킴으로써) 벡터화된 엔트로피 디코딩에 대해 단일 벡터가 정의되는 경우에는 언제나 비가중 예측이 이용되는 것을 보장한다.Video decoder 60 includes a control unit 31 that controls predictive decoding and vectorized entropy decoding in the manner described herein. More specifically, control unit 31 receives the encoded video bitstream and parses the bitstream to determine whether the vectorized entropy coding is enabled and the syntax identifying the size and number of the vector. . Control unit 31 forwards the coded video to entropy decoding unit 52, and also forwards the control signal to prediction unit 54, scan unit 55, and entropy decoding unit 52. The control signal always uses weighted prediction when two or more vectors are defined for vectorized entropy decoding (eg, by defining a single vector or disabling the vector coding mode for a given coding unit). When a single vector is defined for vectorized entropy decoding, it is ensured that unweighted prediction is always used.

엔트로피 디코딩 유닛 (52) 은 도 4 의 엔트로피 인코딩 유닛에 의해 수행된 인코딩의 상호 디코딩 기능을 수행한다. 더 상세하게는, 엔트로피 디코딩은, CAVLC 및 CABAC 디코딩이 계수의 벡터화된 세트에 대해 동작할 수도 있다는 점에서 벡터화될 수도 있다. 제어 유닛 (31) 은 엔트로피 디코딩 유닛 (52) 에 의해 수행된 벡터화된 엔트로피 디코딩을 정의하는 제어 신호를 전송한다. 비디오 디코더 (60) 는 또한, 도 2 의 스캔 유닛 (45) 에 의해 수행된 스캐닝에 상반 역 스캐닝을 수행하는 스캔 유닛 (55) 을 포함한다. 이 경우, 스캔 유닛 (45) 은 계수의 하나 이상의 1 차원 벡터들을 2 차원 블록 포맷으로 조합할 수도 있다. 벡터의 수 및 사이즈뿐만 아니라 비디오 블록에 대해 정의된 스캔 순서는 2 차원 블록이 재구성되는 방법을 정의한다.Entropy decoding unit 52 performs a mutual decoding function of the encoding performed by the entropy encoding unit of FIG. 4. More specifically, entropy decoding may be vectorized in that CAVLC and CABAC decoding may operate on a vectorized set of coefficients. Control unit 31 transmits a control signal that defines the vectorized entropy decoding performed by entropy decoding unit 52. Video decoder 60 also includes a scan unit 55 that performs reverse anti-scanning on the scanning performed by scan unit 45 of FIG. 2. In this case, scan unit 45 may combine one or more one-dimensional vectors of the coefficients in a two-dimensional block format. The number and size of vectors as well as the scan order defined for the video block define how the two-dimensional block is reconstructed.

비디오 디코더 (60) 는 또한 예측 유닛 (54), 역양자화 유닛 (56), 역변환 유닛 (58), 레퍼런스 프레임 저장부 (62) 및 합산기 (64) 를 포함한다. 선택적으로, 비디오 디코더 (60) 는 또한 합산기 (64) 의 출력을 필터링하는 디블로킹 필터 (미도시) 를 포함할 수도 있다. 예측 유닛 (54) 은 엔트로피 디코딩 유닛 (52) 으로부터 (모션 벡터와 같은) 예측 신택스를 수신한다. 예측 유닛 (54) 은 또한 제어 유닛 (31) 으로부터 제어 신호를 수신하고, 이는 가중 예측이 이용되어야 하는지 비가중 예측이 이용되어야 하는지 여부를 정의한다. 또한, 비디오 블록이 복수의 벡터로 스캐닝된 경우 가중 예측이 정의되고, 엔트로피 코딩은 비디오 블록의 상이한 벡터에 대해 별도로 적용된다.Video decoder 60 also includes prediction unit 54, inverse quantization unit 56, inverse transform unit 58, reference frame store 62, and summer 64. Optionally, video decoder 60 may also include a deblocking filter (not shown) that filters the output of summer 64. Prediction unit 54 receives a prediction syntax (such as a motion vector) from entropy decoding unit 52. Prediction unit 54 also receives a control signal from control unit 31, which defines whether weighted prediction should be used or unweighted prediction should be used. In addition, weighted prediction is defined when a video block is scanned into a plurality of vectors, and entropy coding is applied separately for different vectors of the video block.

역양자화 유닛 (56) 은 영양자화를 수행하고, 역변환 유닛 (58) 은 역변환을 수행하여, 비디오 블록의 계수를 픽셀 도메인으로 다시 변경한다. 합산기 (64) 는 유닛 (54) 으로부터의 예측 블록을 역변환 유닛 (58) 으로부터의 재구성된 잔차 블록과 조합하여, 레퍼런스 프레임 저장부 (62) 에 저장되는 재구성된 블록을 생성한다. 원한다면, 재구성된 비디오 블록은 또한 레퍼런스 프레임 저장부 (62) 에 저장되기 전에 디블로킹 필터 유닛 (미도시) 을 통과할 수도 있다. 디코딩된 비디오가 레퍼런스 프레임 저장부 (62) 로부터 출력되고, 후속 예측에서 이용하기 위해 예측 유닛 (54) 으로 피드백될 수도 있다.Inverse quantization unit 56 performs quantization, and inverse transform unit 58 performs inverse transformation to change the coefficients of the video block back to the pixel domain. Summer 64 combines the predictive block from unit 54 with the reconstructed residual block from inverse transform unit 58 to produce a reconstructed block that is stored in reference frame store 62. If desired, the reconstructed video block may also pass through a deblocking filter unit (not shown) before being stored in the reference frame store 62. Decoded video may be output from reference frame store 62 and fed back to prediction unit 54 for use in subsequent prediction.

도 6 은 본 개시에 따라 향상 계층 비디오 블록을 인코딩하기 위한 코딩 (즉, 인코딩 또는 디코딩) 기술을 도시하는 흐름도이다. 도 6 을 비디오 인코더 (50) 의 관점에서 설명하지만, 유사한 기술이 비디오 디코더 (60) 에 의해 적용될 수도 있다. 즉, 인코더 (50) 및 디코더 (60) 모두가 벡터를 정의하고, 그 정의된 벡터에 기초하여 예측 (예를 들어, 가중 예측 또는 비가중 예측) 을 선택할 수도 있다. 인코더 측에서, 벡터가 정의되어 코딩 효율을 증진시킬 수도 있다. 디코더 측에서, 벡터는, 인코더에 의해 정의되고 인코딩된 비디오 스트림의 일부로서 수신된 신택스에 기초하여 정의될 수도 있다. 물론, 인코더 측에서는, 스캐닝이 2 차원 블록에 기초하여 1 차원 벡터를 정의하는 한편, 디코더 측에서는, 스캐닝이 반대로 동작하여, 1 차원 벡터에 기초하여 2 차원 블록을 정의한다.6 is a flow chart illustrating a coding (ie, encoding or decoding) technique for encoding an enhancement layer video block according to the present disclosure. Although FIG. 6 is described in terms of video encoder 50, similar techniques may be applied by video decoder 60. That is, both encoder 50 and decoder 60 may define a vector and select a prediction (eg, weighted prediction or unweighted prediction) based on the defined vector. At the encoder side, vectors may be defined to enhance coding efficiency. At the decoder side, the vector may be defined based on the syntax defined by the encoder and received as part of the encoded video stream. Of course, on the encoder side, scanning defines a one-dimensional vector based on the two-dimensional block, while on the decoder side, scanning operates in reverse, defining a two-dimensional block based on the one-dimensional vector.

도 6 에 도시된 바와 같이, 비디오 인코더 (50) 의 제어 유닛 (31) 은 향상 계층 비디오 블록의 엔트로피 코딩에 대한 벡터를 정의한다 (81). 그 후, 제어 유닛 (31) 은 정의된 벡터에 기초하여 예측 모드 (예를 들어, 가중 예측 또는 비가중 예측) 를 선택한다 (82). 더 상세하게는, 비디오 블록에 대해 복수의 벡터가 정의되면, 제어 유닛 (31) 은 가중 예측을 선택하는 한편, 비디오 블록에 대해 단일 벡터가 정의되면, 제어 유닛 (31) 은 비가중 예측을 선택한다. 예측 유닛 (32), 및 엔트로피 코딩 유닛 (46) 을 갖는 벡터 스캔 유닛 (45) 은 정의된 벡터 및 선택된 예측 모드에 기초하여 향상 계층 비디오 블록을 코딩한다 (83). 더 상세하게는, 예측 유닛 (32) 은 그 선택된 예측 모드를 예측 코딩에서 이용하고, 벡터 스캔 유닛 (45) 및 엔트로피 코딩 유닛 (46) 은 그 정의된 벡터에 기초하여 비디오 블록을 벡터 스캐닝 및 엔트로피 코딩한다. 이 경우, 벡터 스캔 유닛 (45) 은 2 차원 블록을 하나 이상의 1 차원 벡터로 변환하고, 엔트로피 코딩 유닛 (46) 은 하나 이상의 1 차원 벡터를 엔트로피 코딩한다. 예측 모드 (가중 예측 또는 비가중 예측) 을, 비디오 블록에 대해 단일 벡터가 정의되는지 또는 다수의 벡터가 정의되는지 여부에 의존하게 함으로써, 코딩 프로세스가 개선될 수도 있다.As shown in FIG. 6, control unit 31 of video encoder 50 defines a vector for entropy coding of the enhancement layer video block (81). The control unit 31 then selects a prediction mode (eg, weighted prediction or unweighted prediction) based on the defined vector (82). More specifically, if a plurality of vectors are defined for the video block, the control unit 31 selects weighted prediction, while if a single vector is defined for the video block, the control unit 31 selects the weighted prediction. do. Vector scan unit 45 with prediction unit 32 and entropy coding unit 46 codes the enhancement layer video block based on the defined vector and the selected prediction mode (83). More specifically, prediction unit 32 uses the selected prediction mode in predictive coding, and vector scan unit 45 and entropy coding unit 46 vector scan and entropy the video block based on the defined vector. Coding In this case, vector scan unit 45 transforms the two-dimensional block into one or more one-dimensional vectors, and entropy coding unit 46 entropy codes one or more one-dimensional vectors. By making the prediction mode (weighted prediction or unweighted prediction) dependent on whether a single vector or multiple vectors are defined for the video block, the coding process may be improved.

도 7 은 본 개시에 따른 향상 계층 비디오 블록을 인코딩하는 코딩 (즉, 인코딩 또는 디코딩) 기술을 도시하는 다른 흐름도이다. 도 7 을 비디오 인코더 (50) 의 관점에서 설명하지만, 유사한 기술이 또한 비디오 디코더 (60) 에 의해 적용될 수 있다. 도 7 에 도시된 바와 같이, 비디오 인코더 (50) 의 제어 유닛 (31) 은 향상 계층 비디오 블록의 엔트로피 코딩을 위한 벡터를 정의한다 (101). 그 후, 제어 유닛 (31) 은 블록 당 하나의 벡터가 있는지 (102) 또는 블록 당 복수의 벡터가 있는지 여부를 결정한다. 비디오 블록 당 정의된 하나의 벡터가 있으면 (102 에서 "예"), 제어 유닛 (31) 은 예측 유닛 (32) 으로 하여금, 도 2a 에 도시되고 전술한 바와 같이, 순차적 예측을 수행하게 한다 (103). 그러나, 비디오 블록 당 정의된 복수의 벡터가 있으면 (102 에서 "아니오"), 제어 유닛 (31) 은 예측 유닛 (32) 으로 하여금, 도 2b 에 도시되고 전술한 바와 같이, 가중 예측을 수행하게 한다 (104). 그 후, 제어 유닛 (31) 은 벡터 스캔 유닛 (45) 및 엔트로피 코딩 유닛 (46) 으로 하여금, 비디오 블록에 대한 정의된 벡터에 기초하여 벡터화된 엔트로피 코딩을 수행하게 한다 (106). 이 경우, 스캔 유닛 (45) 은 예측 코딩된 비디오 블록 (예를 들어, 잔차 블록) 을 제어 유닛 (31) 의 벡터화된 판정에 기초하여, 2 차원 포맷으로부터 하나 이상의 1 차원 벡터로 스캐닝한다. 엔트로피 코딩 유닛 (46) 은 비디오 블록에 대해 정의된 각각의 1 차원 벡터의 계수에 대해 별도로 엔트로피 코딩을 수행한다.7 is another flowchart illustrating a coding (ie, encoding or decoding) technique for encoding an enhancement layer video block according to the present disclosure. Although FIG. 7 is described in terms of video encoder 50, similar techniques may also be applied by video decoder 60. As shown in FIG. 7, control unit 31 of video encoder 50 defines a vector for entropy coding of an enhancement layer video block (101). The control unit 31 then determines whether there is one vector per block 102 or a plurality of vectors per block. If there is one vector defined per video block (“YES” at 102), control unit 31 causes prediction unit 32 to perform sequential prediction, as shown in FIG. 2A and described above (103). ). However, if there are a plurality of vectors defined per video block (“No” at 102), control unit 31 causes prediction unit 32 to perform weighted prediction, as shown in FIG. 2B and described above. (104). Control unit 31 then causes vector scan unit 45 and entropy coding unit 46 to perform vectorized entropy coding based on the defined vector for the video block (106). In this case, scan unit 45 scans the predictively coded video block (eg, residual block) from the two-dimensional format into one or more one-dimensional vectors, based on the vectorized decision of control unit 31. Entropy coding unit 46 performs entropy coding separately on the coefficients of each one-dimensional vector defined for the video block.

본 개시의 기술들은: 무선 핸드셋, 및 집적 회로 (IC) 또는 IC 의 세트 (즉, 칩셋) 를 포함하는 매우 다양한 디바이스 또는 장치에서 실현될 수도 있다. 임의의 컴포넌트들, 모듈들 또는 유닛들은 기능적 양태를 강조하기 위해 제공되도록 설명되었지만, 다른 하드웨어 유닛에 의한 실현을 필수적으로 요구하지는 않는다.The techniques of this disclosure may be realized in a wide variety of devices or apparatuses, including: a wireless handset, and an integrated circuit (IC) or set of ICs (ie, a chipset). Although certain components, modules, or units have been described to provide a functional aspect, they do not necessarily require realization by other hardware units.

따라서, 여기서 개시된 기술들은 하드웨어, 소프트웨어, 펌웨어 또는 이들의 임의의 조합으로 구현될 수도 있다. 모듈들 또는 컴포넌트들로 설명된 임의의 특징들은 집적 회로 디바이스에서 함께 구현될 수도 있고, 또는 이산적이지만 상호협력가능한 로직 디바이스들로 별도로 구현될 수도 있다. 소프트웨어로 구현되면, 이 기술들은, 실행되는 경우 전술한 방법들 중 하나 이상을 수행하는 명령들을 포함하는 컴퓨터 판독가능 매체에 의해 적어도 부분적으로 실현될 수도 있다. 컴퓨터 판독가능 데이터 저장 매체는, 패키징 재료를 포함할 수도 있는 컴퓨터 프로그램 제품의 일부를 형성할 수도 있다. 컴퓨터 판독가능 매체는 SDRAM (synchronous dynamic random access memory) 과 같은 RAM, ROM (read-only memory), NVRAM (non-volatile random access memory), EEPROM (electrically erasable programmable read-only memory), FLASH 메모리, 자기 또는 광학 데이터 저장 매체 등을 포함할 수도 있다. 이 기술들은 추가적으로 또는 대안적으로, 컴퓨터에 의해 액세스, 판독 및/또는 실행될 수 있는 명령들 또는 데이터 구조들의 형태로 코드를 반송 또는 통신하는 컴퓨터 판독가능 통신 매체에 의해 적어도 부부분적으로 실현될 수도 있다.Thus, the techniques disclosed herein may be implemented in hardware, software, firmware, or any combination thereof. Any features described as modules or components may be implemented together in an integrated circuit device or may be implemented separately in discrete but cooperative logic devices. If implemented in software, the techniques may be at least partially realized by a computer readable medium containing instructions that, when executed, perform one or more of the above-described methods. The computer readable data storage medium may form part of a computer program product, which may include packaging material. Computer-readable media may include RAM, such as synchronous dynamic random access memory (SDRAM), read-only memory (ROM), non-volatile random access memory (NVRAM), electrically erasable programmable read-only memory (EEPROM), FLASH memory, magnetic Or an optical data storage medium. These techniques may additionally or alternatively be realized at least in part by a computer readable communication medium carrying or communicating code in the form of instructions or data structures that can be accessed, read and / or executed by a computer. .

코드는, 하나 이상의 디지털 신호 프로세서 (DSP), 범용 마이크로프로세서, 주문형 집적회로 (ASIC), 필드 프로그래머블 게이트 어레이 (FPGA), 또는 기타 등가의 집적 또는 이산 로직 회로와 같은 하나 이상의 프로세서에 의해 실행될 수도 있다. 따라서, 여기서 사용되는 용어 "프로세서" 는 임의의 전술한 구조 또는 여기서 설명된 기술들의 구현에 적합한 임의의 다른 구조를 지칭할 수도 있다. 또한, 몇몇 양태에서, 여기서 설명된 기능은 인코딩 및 디코딩을 위해 구성된 전용 소프트웨어 모듈 또는 하드웨어 모듈 내에서 제공될 수도 있고, 조합된 비디오 인코더-디코더 (CODEC) 에 통합될 수도 있다. 또한, 이 기술들은 하나 이상의 회로 또는 로직 엘리먼트에서 완전히 구현될 수도 있다.The code may be executed by one or more processors, such as one or more digital signal processors (DSPs), general purpose microprocessors, application specific integrated circuits (ASICs), field programmable gate arrays (FPGAs), or other equivalent integrated or discrete logic circuits. . Thus, the term “processor” as used herein may refer to any of the foregoing structures or any other structure suitable for the implementation of the techniques described herein. In addition, in some aspects, the functionality described herein may be provided within a dedicated software module or hardware module configured for encoding and decoding, and integrated into a combined video encoder-decoder (CODEC). In addition, these techniques may be fully implemented in one or more circuits or logic elements.

본 개시의 다양한 양태들을 설명하였다. 이 양태들 및 다른 양태들은 다음의 청구항의 범주에 속한다.Various aspects of the disclosure have been described. These and other aspects are within the scope of the following claims.

Claims

A method of coding data of a video sequence,
Defining one or more vectors for vectorized entropy coding of enhancement layer video blocks of a coded unit in the video sequence, wherein the vectorized entropy coding defines a vector syntax that defines the vectors associated with the enhancement layer video blocks. Defining one or more vectors, including entropy coding the enhancement layer video blocks using an element;
Selecting a prediction mode for the enhancement layer video blocks of the coded unit based on the defined vectorized entropy coding, wherein the defined vectorized entropy coding comprises two or more vectors for the enhancement layer video blocks. Selecting weighted prediction, and if the defined vectorized entropy coding defines a single vector for the enhancement layer video blocks, selecting weighted prediction; Wherein the weighted prediction comprises prediction using weighted prediction data comprising a combination of prediction enhancement layer data and prediction base layer data; And
Coding the enhancement layer video blocks based on the selected prediction mode and the vectorized entropy coding.

The method of claim 1,
Wherein the weighted prediction comprises prediction based on prediction blocks formed of weighted combinations of prediction enhancement layer video blocks and prediction base layer video blocks in the video sequence.

delete

The method of claim 1,
Wherein the unweighted prediction comprises sequential prediction.

The method of claim 1,
Disabling a vector coding mode for the coded unit to define a single vector for vectorized entropy coding.

The method of claim 1,
The coded unit comprises a frame of the video sequence or a slice of a frame of the video sequence.

The method according to claim 6,
And the method of coding data of the video sequence is repeated for different coded units of the video sequence.

The method of claim 1,
The vectorized entropy coding includes scanning the enhancement layer video blocks from two-dimensional blocks of transform coefficients into the one or more vectors, and individually entropy coding the one or more vectors,
Wherein the one or more vectors comprise one-dimensional sets of transform coefficients.

The method of claim 1,
The coding comprises encoding,
The method of coding data of the video sequence further comprises transmitting a bitstream comprising encoded video blocks.

The method of claim 1,
The coding comprises decoding,
The method of coding data of the video sequence further comprises receiving the video sequence as a bitstream comprising encoded video blocks.

An apparatus for coding data of a video sequence,
Define one or more vectors for vectorized entropy coding of the enhancement layer video blocks of the coded unit in the video sequence, and predict the enhancement layer video blocks of the coded unit based on the defined vectorized entropy coding A control unit for selecting a mode, wherein the vectorized entropy coding includes entropy coding the enhancement layer video blocks using a vector syntax element defining vectors associated with the enhancement layer video blocks, wherein the control unit comprises: If the defined vectorized entropy coding establishes two or more vectors for the enhancement layer video blocks, then weighted prediction is selected, and the defined vectorized entropy coding is single for the enhancement layer video blocks. Unweighted example if defining a vector To choose from, the weighted prediction, including a prediction to use the weighted prediction data comprising a combination of predictive enhancement layer data and predictive base layer data, the control unit;
A prediction unit for performing prediction coding techniques based on the selected prediction mode; And
And an entropy coding unit to perform the vectorized entropy coding.

The method of claim 11,
And the weighted prediction comprises prediction based on prediction blocks formed of weighted combinations of prediction enhancement layer video blocks and prediction base layer video blocks in the video sequence.

delete

The method of claim 11,
Wherein the unweighted prediction comprises sequential prediction.

The method of claim 11,
And the control unit disables a vector coding mode for the coded unit to define a single vector for vectorized entropy coding.

The method of claim 11,
The coded unit comprises a frame of the video sequence or a slice of a frame of the video sequence.

17. The method of claim 16,
The control unit defines one or more vectors and selects the prediction mode for each of a plurality of different coded units of the video sequence.

The method of claim 11,
Further comprising a scanning unit,
The scanning unit scans the enhancement layer video blocks from the two-dimensional blocks of transform coefficients into the one or more vectors, the entropy coding unit entropy codes the one or more vectors separately,
And the one or more vectors comprise one-dimensional sets of transform coefficients.

The method of claim 11,
The coding comprises encoding,
And the entropy coding unit comprises an entropy encoding unit.

The method of claim 11,
The coding comprises decoding,
And the entropy coding unit comprises an entropy decoding unit.

The method of claim 11,
And the apparatus for coding the data of the video sequence comprises an integrated circuit.

The method of claim 11,
And the apparatus for coding data of the video sequence comprises a microprocessor.

A computer readable medium comprising instructions that, when executed on a video coding device, cause the video coding device to code data of a video sequence, the computer readable medium comprising:
The instructions cause the video coding device to:
Define one or more vectors for vectorized entropy coding of the enhancement layer video blocks of the coded unit in the video sequence;
Select a prediction mode for enhancement layer video blocks of the coded unit based on the defined vectorized entropy coding;
Code the enhancement layer video blocks based on the selected prediction mode and the vectorized entropy coding,
The vectorized entropy coding includes entropy coding the enhancement layer video blocks using a vector syntax element that defines vectors associated with the enhancement layer video blocks,
The instructions cause the video coding device to select weighted prediction when the defined vectorized entropy coding establishes two or more vectors for the enhancement layer video blocks, and the defined vectorized entropy coding. In the case of defining a single vector for these enhancement layer video blocks, unweighted prediction is selected,
Wherein the weighted prediction comprises prediction using weighted prediction data comprising a combination of prediction enhancement layer data and prediction base layer data.

24. The method of claim 23,
The weighted prediction comprises prediction based on prediction blocks formed of weighted combinations of prediction enhancement layer video blocks and prediction base layer video blocks in the video sequence.

delete

24. The method of claim 23,
And the unweighted prediction comprises sequential prediction.

24. The method of claim 23,
The instructions cause the video coding device to disable a vector coding mode for the coded unit to define a single vector for vectorized entropy coding.

24. The method of claim 23,
And the coded unit comprises a frame of the video sequence or a slice of a frame of the video sequence.

29. The method of claim 28,
The instructions cause the video coding device to define one or more vectors and to select the prediction mode for each of a plurality of different coded units of the video sequence.

24. The method of claim 23,
The vectorized entropy coding comprises scanning the enhancement layer video blocks from two-dimensional blocks of transform coefficients into the one or more vectors, and individually entropy coding the one or more vectors,
And the one or more vectors comprise one-dimensional sets of transform coefficients.

24. The method of claim 23,
The coding comprises encoding,
And the instructions cause the video coding device to transmit a bitstream comprising encoded video blocks.

24. The method of claim 23,
The coding comprises decoding,
And the instructions cause the video coding device to receive the video sequence as a bitstream that includes encoded video blocks.

A device for coding data of a video sequence,
Means for defining one or more vectors for vectorized entropy coding of enhancement layer video blocks of a coded unit in the video sequence, wherein the vectorized entropy coding defines a vector syntax that defines the vectors associated with the enhancement layer video blocks. Means for defining the one or more vectors, including entropy coding the enhancement layer video blocks using an element;
Means for selecting a prediction mode for the enhancement layer video blocks of the coded unit based on the defined vectorized entropy coding, wherein the defined vectorized entropy coding comprises two or more vectors for the enhancement layer video blocks. Means for selecting weighted prediction, and if the defined vectorized entropy coding defines a single vector for the enhancement layer video blocks, Means for selecting the prediction mode, wherein the weighted prediction comprises prediction using weighted prediction data comprising a combination of prediction enhancement layer data and prediction base layer data; And
Means for coding the enhancement layer video blocks based on the selected prediction mode and the vectorized entropy coding.

34. The method of claim 33,
Wherein the weighted prediction comprises prediction based on prediction blocks formed of weighted combinations of prediction enhancement layer video blocks and prediction base layer video blocks in the video sequence.

delete

34. The method of claim 33,
And the unweighted prediction comprises sequential prediction.

34. The method of claim 33,
And means for disabling a vector coding mode for the coded unit to define a single vector for vectorized entropy coding.

34. The method of claim 33,
And the coded unit comprises a frame of the video sequence or a slice of a frame of the video sequence.

The method of claim 38,
Wherein the means for defining defines one or more vectors, and the means for selecting the prediction mode selects the prediction mode for each of a plurality of different coded units of the video sequence.

34. The method of claim 33,
The vectorized entropy coding comprises scanning the enhancement layer video blocks from two-dimensional blocks of transform coefficients into the one or more vectors, and individually entropy coding the one or more vectors,
And the one or more vectors comprise one-dimensional sets of transform coefficients.

34. The method of claim 33,
The coding comprises encoding,
And the device for coding data of the video sequence further comprises means for transmitting a bitstream comprising encoded video blocks.

34. The method of claim 33,
The coding comprises decoding,
And the device for coding the data of the video sequence further comprises means for receiving the video sequence as a bitstream comprising encoded video blocks.

As a device,
Define one or more vectors for vectorized entropy encoding of the enhancement layer video blocks of the coded unit in the video sequence, and based on the defined vectorized entropy coding, the prediction mode for the enhancement layer video blocks of the coded unit. Wherein the vectorized entropy coding comprises entropy coding the enhancement layer video blocks using a vector syntax element defining vectors associated with the enhancement layer video blocks, wherein the control unit comprises: If defined vectorized entropy coding establishes two or more vectors for the enhancement layer video blocks, then weighted prediction is selected and the defined vectorized entropy coding is a single vector for the enhancement layer video blocks. If you define a line, And, wherein the weighted prediction is, the control unit including a prediction to use the weighted prediction data comprising a combination of predictive enhancement layer data and predictive base layer data;
A prediction unit for performing prediction encoding techniques based on the selected prediction mode;
An entropy encoding unit for performing at least a portion of the bitstream by performing the vectorized entropy encoding; And
A wireless transmitter for transmitting said bitstream to another device

44. The method of claim 43,
And the device comprises a wireless communication handset.

A wireless receiver receiving a bitstream comprising entropy coded coefficient values of enhancement layer video blocks of a coded unit in a video sequence;
Define one or more vectors for vectorized entropy decoding of the enhancement layer video blocks of the coded unit in the video sequence, and predict the enhancement layer video blocks of the coded unit based on the defined vectorized entropy decoding. A control unit for selecting a mode, wherein the vectorized entropy decoding includes entropy decoding the enhancement layer video blocks using a vector syntax element that defines vectors associated with the enhancement layer video blocks, wherein the control unit comprises: Select weighted prediction if the defined vectorized entropy decoding establishes two or more vectors for the enhancement layer video blocks, and wherein the defined vectorized entropy decoding is single for the enhancement layer video blocks. If you define a vector of Select a non-weighted prediction, wherein the weighted prediction comprises prediction using weighted prediction data comprising a combination of prediction enhancement layer data and prediction base layer data;
A prediction unit for performing prediction decoding techniques based on the selected prediction mode; And
And an entropy decoding unit for performing the vectorized entropy decoding.

46. The method of claim 45,
And the device comprises a wireless communication handset.