HK1213402B - Methods and apparatus for video encoders and decoders - Google Patents

Methods and apparatus for video encoders and decoders Download PDF

Info

Publication number
HK1213402B
HK1213402B HK16101202.4A HK16101202A HK1213402B HK 1213402 B HK1213402 B HK 1213402B HK 16101202 A HK16101202 A HK 16101202A HK 1213402 B HK1213402 B HK 1213402B
Authority
HK
Hong Kong
Prior art keywords
block
intra
sub
blocks
frame
Prior art date
Application number
HK16101202.4A
Other languages
Chinese (zh)
Other versions
HK1213402A1 (en
Inventor
郑云飞
许茜
吕小安
尹鹏
J‧索尔
A‧阿巴斯
Original Assignee
Interdigital Vc Holdings, Inc.
Priority date (The priority date is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the date listed.)
Filing date
Publication date
Application filed by Interdigital Vc Holdings, Inc. filed Critical Interdigital Vc Holdings, Inc.
Publication of HK1213402A1 publication Critical patent/HK1213402A1/en
Publication of HK1213402B publication Critical patent/HK1213402B/en

Links

Description

用于视频编码器和解码器的方法和装置Methods and apparatus for video encoders and decoders

本申请是申请日为2010年6月29日、申请号为201080038907.7、发明名称为“用于视频编码器和解码器的对大块的帧内预测进行信令的方法和装置”的发明专利申请的分案申请。This application is a divisional application of an invention patent application filed on June 29, 2010, with application number 201080038907.7 and invention name “Method and device for signaling intra-frame prediction of large blocks for video encoders and decoders”.

相关申请的交叉引用CROSS-REFERENCE TO RELATED APPLICATIONS

本申请要求2009年7月1日提交的美国临时申请序列号No.61/222,177(代理案号No.PU090082)的权益,通过引用将其内容全部并入于此。This application claims the benefit of U.S. Provisional Application Serial No. 61/222,177 (Attorney Docket No. PU090082), filed July 1, 2009, which is hereby incorporated by reference in its entirety.

技术领域Technical Field

本原理一般地涉及视频编码和解码,并且更具体地涉及用于视频编码器和解码器的对大块的帧内预测进行信令(signal)的方法和装置。The present principles relate generally to video encoding and decoding, and more particularly to methods and apparatus for video encoders and decoders to signal intra prediction of large blocks.

背景技术Background Art

多数现代视频编码标准采用各种编码模式来有效地减少空间域和时间域中的相关度。例如,在国际标准化组织/国际电工委员会(ISO/IEC)运动画面专家组-4(MPEG-4)第10部分高级视频编码(AVC)标准/国际电信联盟电信分部(ITU-T)H.264推荐(下文的“MPEG-4 AVC标准”)中,可以帧内编码或者帧间编码画面。在帧内画面中,以帧内模式编码所有宏块,由此利用画面内的空间相关度。帧内模式可以被归类为以下三种类型:INTRA4×4;INTRA8×8;INTRA16×16。INTRA4×4和INTRA8×8支持9种帧内预测模式,INTRA16×16支持4种帧内预测模式。Most modern video coding standards employ various coding modes to effectively reduce correlation in the spatial and temporal domains. For example, in the International Organization for Standardization/International Electrotechnical Commission (ISO/IEC) Moving Picture Experts Group-4 (MPEG-4) Part 10 Advanced Video Coding (AVC) standard/International Telecommunication Union-Telecom (ITU-T) H.264 Recommendation (hereinafter referred to as the "MPEG-4 AVC standard"), pictures can be intra-coded or inter-coded. In intra-pictures, all macroblocks are coded in intra mode, thereby exploiting spatial correlation within the picture. Intra modes can be categorized into the following three types: INTRA4×4; INTRA8×8; and INTRA16×16. INTRA4×4 and INTRA8×8 support nine intra-prediction modes, while INTRA16×16 supports four intra-prediction modes.

INTRA 4×4和INTRA8×8支持以下9种帧内预测模式:垂直预测、水平预测、DC预测、对角下/左预测、对角下/右预测、垂直-左预测、水平-下预测、垂直-右预测,以及水平-上预测。INTRA16×16支持以下4种帧内预测模式:垂直预测、水平预测、DC预测,以及平面预测。转到图1,由参考标号100总地指示INTRA4×4和INTRA8×8预测模式。在图1中,参考标号0指示垂直预测模式、参考标号1指示水平预测模式、参考标号3指示对角下/左预测模式、参考标号4指示对角下/右预测模式、参考标号5指示垂直-右预测模式、参考标号6指示水平-下预测模式、参考标号7指示垂直-左预测模式、参考标号8指示垂直-上预测模式。未示出作为INTRA4×4和INTRA8×8预测模式一部分的DC模式。转到图2,由参考标号200总地指示INTRA16×16预测模式。在图2中,参考标号0指示垂直预测模式、参考标号1指示水平预测模式、参考标号3指示平面预测模式。未示出作为INTRA16×16预测模式一部分的DC模式。INTRA4×4 and INTRA8×8 support the following nine intra-frame prediction modes: vertical prediction, horizontal prediction, DC prediction, diagonal down/left prediction, diagonal down/right prediction, vertical-left prediction, horizontal-down prediction, vertical-right prediction, and horizontal-up prediction. INTRA16×16 supports the following four intra-frame prediction modes: vertical prediction, horizontal prediction, DC prediction, and planar prediction. Turning to FIG. 1 , the INTRA4×4 and INTRA8×8 prediction modes are generally indicated by reference numeral 100. In FIG. 1 , reference numeral 0 indicates a vertical prediction mode, reference numeral 1 indicates a horizontal prediction mode, reference numeral 3 indicates a diagonal down/left prediction mode, reference numeral 4 indicates a diagonal down/right prediction mode, reference numeral 5 indicates a vertical-right prediction mode, reference numeral 6 indicates a horizontal-down prediction mode, reference numeral 7 indicates a vertical-left prediction mode, and reference numeral 8 indicates a vertical-up prediction mode. The DC mode, which is part of the INTRA4×4 and INTRA8×8 prediction modes, is not shown. 2 , the INTRA16x16 prediction mode is generally indicated by reference numeral 200. In FIG2 , reference numeral 0 indicates vertical prediction mode, reference numeral 1 indicates horizontal prediction mode, and reference numeral 3 indicates planar prediction mode. The DC mode, which is part of the INTRA16x16 prediction mode, is not shown.

INTRA4×4使用4×4离散余弦变换(DCT)。INTRA8×8使用8×8变换。INTRA16×16使用级联的4×4变换。为了进行信令,INTRA4×4和INTRA8×8共享相同的宏块类型(mb_type)0并且通过变换尺寸标志(transform_8×8_size_flag)来区分。然后,通过最可能的模式(如果必要,可能利用其余模式)来对在INTRA4×4或INTRA8×8中帧内预测模式的选取进行信令。对于INTRA16×16,在mb_type中对所有帧内预测模式连同编码块图案(cbp)类型进行信令,其使用1到24的mb_type值。表1示出了用于帧内编码码片(I码片)的宏块类型的详细的信令。如果尺寸大于16×16的更大的块用于帧内预测,则面对如下的若干可能的问题。INTRA4×4 uses a 4×4 discrete cosine transform (DCT). INTRA8×8 uses an 8×8 transform. INTRA16×16 uses a cascaded 4×4 transform. For signaling, INTRA4×4 and INTRA8×8 share the same macroblock type (mb_type) 0 and are distinguished by a transform size flag (transform_8×8_size_flag). The choice of intra prediction mode in INTRA4×4 or INTRA8×8 is then signaled by the most likely mode (and possibly the remaining modes if necessary). For INTRA16×16, all intra prediction modes are signaled in mb_type along with the coded block pattern (cbp) type, which uses mb_type values from 1 to 24. Table 1 shows the detailed signaling of the macroblock types for intra-coded slices (I slices). If larger blocks of size greater than 16×16 are used for intra prediction, several possible problems are faced as follows.

(1)如果通过在MPEG-4 AVC标准中简单地扩展mb_type来增加INTRA32×32或者INTRA64×64预测,则其将对这两种新模式造成太多的开销,并且另外,将不允许帧内预测的分级类型。如下解释帧内预测的分级类型的示例。如果32×32块用作大块并且允许子划分为16×16,则对于每个16×16子划分,应允许INTRA4×4、INTRA8×8、INTRA16×16。(1) If INTRA32×32 or INTRA64×64 prediction were added by simply extending mb_type in the MPEG-4 AVC standard, it would cause too much overhead for the two new modes, and in addition, hierarchical types of intra prediction would not be allowed. An example of hierarchical types of intra prediction is explained as follows. If a 32×32 block is used as a large block and sub-division into 16×16 is allowed, then for each 16×16 sub-division, INTRA4×4, INTRA8×8, and INTRA16×16 should be allowed.

(2)如果更大的变换(诸如16×16变换)而不是级联的变换用于INTRA16×16,则不能应用当前的信令。(2) If a larger transform (such as a 16x16 transform) instead of a concatenated transform is used for INTRA16x16, the current signaling cannot be applied.

(3)应对一个帧内划分类型内部的帧内预测模式给出不同的优先级。(3) Different priorities should be given to intra prediction modes within an intra partition type.

表1Table 1

MPEG-4 AVC标准的扩展中存在与对大的运动(帧间)划分进行信令有关的一些现有技术方法。关于第一种现有技术方法来描述在MPEG-4 AVC标准的扩展中怎样对大的运动(帧间)划分进行信令的一个示例。第一种现有技术方法描述了怎样为使用分级编码结构的32×32块或者64×64块进行信令。There are several prior art methods related to signaling large motion (inter) partitions in the extensions of the MPEG-4 AVC standard. Regarding the first prior art method, an example of how large motion (inter) partitions are signaled in the extensions of the MPEG-4 AVC standard is described. The first prior art method describes how to signal for 32×32 blocks or 64×64 blocks using a hierarchical coding structure.

此外,除了MPEG-4 AVC标准中现有的运动划分尺寸(16×16、16×8、8×16、8×8、8×4、4×8和4×4)之外,也已经提出使用32×32、32×16和16×32划分的用于MPEG-4 AVC标准的扩展的帧间编码。转到图3,通过参考标号300总地指示用于32×32块中的运动划分。划分包括32×32、32×16、16×32和16×16。16×16划分可以进一步被划分为尺寸16×16、16×8、8×16和8×8的划分。此外,8×8划分可以进一步被划分为尺寸8×8、8×4、4×8和4×4的划分。Furthermore, in addition to the existing motion partition sizes (16×16, 16×8, 8×16, 8×8, 8×4, 4×8, and 4×4) in the MPEG-4 AVC standard, extended inter-frame coding for the MPEG-4 AVC standard has been proposed using 32×32, 32×16, and 16×32 partitions. Turning to FIG3 , motion partitions for 32×32 blocks are generally indicated by reference numeral 300. These partitions include 32×32, 32×16, 16×32, and 16×16. A 16×16 partition can be further divided into partitions of sizes 16×16, 16×8, 8×16, and 8×8. Furthermore, an 8×8 partition can be further divided into partitions of sizes 8×8, 8×4, 4×8, and 4×4.

对于每个32×32的块,以对MPEG-4 AVC标准的其它模式执行的方式类似的方式使用mb32_skip_flag来对SKIP模式或者DIRECT模式进行信令。另外,MPEG-4 AVC标准中的M×N(M=8或16而N=8或16)划分的原始mb_type还用于对32×32块中的2M×2N划分进行信令。如果32×32的mb32_type指示使用16×16划分,则通过使用与MPEG-4 AVC标准中的macroblock_layer()相同的语法元素,以光栅扫描顺序来对四个16×16块进行信令。可以进一步以四叉树方式将每个16×16块从尺寸16×16向下划分为尺寸4×4。For each 32x32 block, mb32_skip_flag is used to signal SKIP mode or DIRECT mode in a manner similar to that performed for other modes of the MPEG-4 AVC standard. In addition, the original mb_type for the M×N (M=8 or 16 and N=8 or 16) partition in the MPEG-4 AVC standard is also used to signal the 2M×2N partition in the 32x32 block. If the mb32_type of 32×32 indicates the use of 16×16 partition, four 16×16 blocks are signaled in raster scan order using the same syntax elements as macroblock_layer() in the MPEG-4 AVC standard. Each 16×16 block can be further partitioned down from size 16×16 to size 4×4 in a quadtree manner.

对于宏块尺寸64×64,在32×32块中使用的划分之上添加了以下划分:64×64、64×32和32×64。由此,在块尺寸32×32之上的宏块划分中添加了超过一个分级的层。MPEG-4AVC标准中的M×N(M=8或16而N=8或16)宏块划分的原始的mb_type用于对64×64宏块中的4M×4N宏块划分进行信令。如果32×32宏块划分用于64×64块,则每个32×32块将以与上述相同的方式被处理。For the macroblock size 64×64, the following partitions are added above the partition used in the 32×32 block: 64×64, 64×32, and 32×64. Thus, more than one hierarchical layer is added to the macroblock partitions above the block size 32×32. The original mb_type of the M×N (M=8 or 16 and N=8 or 16) macroblock partition in the MPEG-4 AVC standard is used to signal the 4M×4N macroblock partition in the 64×64 macroblock. If the 32×32 macroblock partition is used for the 64×64 block, each 32×32 block will be processed in the same manner as described above.

然而,现有文献没有解决怎样对大的帧内模式进行信令,其中大的帧内模式被定义为意指涉及具有等于或者大于32×32的尺寸的划分块的帧内预测。However, existing literature does not address how to signal large intra mode, where large intra mode is defined to mean intra prediction involving partition blocks having a size equal to or greater than 32x32.

发明内容Summary of the Invention

通过本原理解决现有技术的这些和其它缺陷和缺点,本原理针对用于视频编码器和解码器的对大块的帧内预测进行信令的方法和装置。These and other deficiencies and shortcomings of the prior art are addressed by the present principles, which are directed to methods and apparatus for video encoders and decoders for signaling intra prediction of large blocks.

根据本原理的一个方面,提供了一种装置。该装置包括视频编码器,所述视频编码器通过对用于画面中的至少一个大块的帧内预测进行信令来编码所述至少一个大块的画面数据。通过选择基本编码单元尺寸并且分配用于基本编码单元尺寸的单个空间帧内划分类型来对帧内预测进行信令。该单个空间帧内划分类型是可从多个空间帧内划分类型中选择的。所述至少一个大块具有比基本编码单元的块尺寸大的大块尺寸。帧内预测是分层级的帧内预测并且通过以下操作中的至少一个而对至少一个大块执行:将大块尺寸拆分为基本编码单元尺寸以及从基本编码单元尺寸合并到大块尺寸。According to one aspect of the present principles, an apparatus is provided. The apparatus includes a video encoder that encodes picture data for at least one large block in a picture by signaling intra prediction for the at least one large block. The intra prediction is signaled by selecting a basic coding unit size and assigning a single spatial intra partition type for the basic coding unit size. The single spatial intra partition type is selectable from a plurality of spatial intra partition types. The at least one large block has a large block size that is larger than a block size of the basic coding unit. The intra prediction is hierarchical intra prediction and is performed on the at least one large block by at least one of: splitting the large block size into the basic coding unit size and merging from the basic coding unit size to the large block size.

根据本原理的另一方面,提供了一种视频编码器中的方法。该方法包括通过对用于画面中的至少一个大块的帧内预测进行信令来编码所述至少一个大块的画面数据。通过选择基本编码单元尺寸并且分配用于基本编码单元尺寸的单个空间帧内划分类型来对帧内预测进行信令。该单个空间帧内划分类型是可从多个空间帧内划分类型中选择的。所述至少一个大块具有比基本编码单元的块尺寸大的大块尺寸。帧内预测是分层级的帧内预测并且通过以下操作中的至少一个而对至少一个大块执行:将大块尺寸拆分为基本编码单元尺寸以及从基本编码单元尺寸合并到大块尺寸。According to another aspect of the present principles, a method in a video encoder is provided. The method includes encoding picture data of at least one large block in the picture by signaling intra prediction for the at least one large block. The intra prediction is signaled by selecting a basic coding unit size and assigning a single spatial intra partition type for the basic coding unit size. The single spatial intra partition type is selectable from a plurality of spatial intra partition types. The at least one large block has a large block size that is larger than a block size of the basic coding unit. The intra prediction is hierarchical intra prediction and is performed on the at least one large block by at least one of: splitting the large block size into the basic coding unit size and merging from the basic coding unit size to the large block size.

根据本原理的又一方面,提供了一种装置。该装置包括视频解码器,所述视频解码器通过确定要为画面中的至少一个大块执行的帧内预测来解码所述至少一个大块的画面数据。通过确定基本编码单元尺寸并且确定用于基本编码单元尺寸的单个空间帧内划分类型来确定帧内预测。该单个空间帧内划分类型是可从多个空间帧内划分类型中确定的。所述至少一个大块具有比基本编码单元的块尺寸大的大块尺寸。帧内预测是分层级的帧内预测并且通过以下操作中的至少一个而对至少一个大块执行:将大块尺寸拆分为基本编码单元尺寸以及从基本编码单元尺寸合并到大块尺寸。According to yet another aspect of the present principles, an apparatus is provided. The apparatus includes a video decoder configured to decode picture data of at least one large block in a picture by determining intra prediction to be performed for the at least one large block. The intra prediction is determined by determining a basic coding unit size and determining a single spatial intra partition type for the basic coding unit size. The single spatial intra partition type is determinable from a plurality of spatial intra partition types. The at least one large block has a large block size that is larger than a block size of the basic coding unit. The intra prediction is hierarchical intra prediction and is performed on the at least one large block by at least one of: splitting the large block size into the basic coding unit size and merging the basic coding unit size into the large block size.

根据本原理的再一方面,提供了一种视频解码器中的方法。该方法包括通过确定要为画面中的至少一个大块执行的帧内预测来解码所述至少一个大块的画面数据。通过确定基本编码单元尺寸并且确定用于基本编码单元尺寸的单个空间帧内划分类型来确定帧内预测。该单个空间帧内划分类型是可从多个空间帧内划分类型中确定的。所述至少一个大块具有比基本编码单元的块尺寸大的大块尺寸。帧内预测是分层级的帧内预测并且通过以下操作中的至少一个而对至少一个大块执行:将大块尺寸拆分为基本编码单元尺寸以及从基本编码单元尺寸合并到大块尺寸。According to yet another aspect of the present principles, a method in a video decoder is provided. The method includes decoding picture data of at least one large block in a picture by determining intra prediction to be performed for the at least one large block. The intra prediction is determined by determining a basic coding unit size and determining a single spatial intra partition type for the basic coding unit size. The single spatial intra partition type is determinable from a plurality of spatial intra partition types. The at least one large block has a large block size that is larger than a block size of the basic coding unit. The intra prediction is hierarchical intra prediction and is performed on the at least one large block by at least one of: splitting the large block size into the basic coding unit size and merging from the basic coding unit size to the large block size.

本原理的这些和其它方面、特征和优点将从示例实施例的以下具体描述中变得明显,将结合附图阅读以下具体描述。These and other aspects, features and advantages of the present principles will become apparent from the following detailed description of example embodiments, which is to be read in conjunction with the accompanying drawings.

附图说明BRIEF DESCRIPTION OF THE DRAWINGS

依据以下示例性图将更好地理解本原理,其中:The present principles will be better understood with reference to the following exemplary drawings, in which:

图1是示出可以应用本原理的INTRA4×4和INTRA8×8预测模式的图;FIG1 is a diagram illustrating INTRA4×4 and INTRA8×8 prediction modes to which the present principles may be applied;

图2是示出可以应用本原理的INTRA16×16预测模式的图;FIG2 is a diagram illustrating an INTRA16×16 prediction mode to which the present principles may be applied;

图3是示出可以应用本原理的用于32×32块的运动划分的图;FIG3 is a diagram illustrating motion partitioning for a 32×32 block to which the present principles may be applied;

图4是依据本原理实施例的可以应用本原理的示例性视频编码器的框图;FIG4 is a block diagram of an exemplary video encoder to which the present principles may be applied, according to an embodiment of the present principles;

图5是依据本原理实施例的可以应用本原理的示例性视频解码器的框图;FIG5 is a block diagram of an exemplary video decoder to which the present principles may be applied, according to an embodiment of the present principles;

图6是依据本原理实施例的可以应用本原理的示例性分级划分的框图;FIG6 is a block diagram of an exemplary hierarchical partitioning to which the present principles may be applied, according to an embodiment of the present principles;

图7A和7B表示依据本原理实施例的通过对用于大块的帧内预测进行信令来编码所述大块的画面数据的示例性方法的流程图;以及7A and 7B are flowcharts illustrating exemplary methods for encoding picture data of a large block by signaling intra-frame prediction for the large block, in accordance with an embodiment of the present principles; and

图8A和8B表示依据本原理实施例的通过确定要被应用到大块的帧内预测来解码所述大块的画面数据的示例性方法的流程图。8A and 8B illustrate flowcharts of exemplary methods for decoding picture data of a large block by determining intra prediction to be applied to the large block, in accordance with an embodiment of the present principles.

具体实施方式DETAILED DESCRIPTION

本原理针对用于视频编码器和解码器的对大块的帧内预测进行信令的方法和装置。The present principles are directed to methods and apparatus for video encoders and decoders to signal intra prediction for large blocks.

本描述例示了本原理。因此,将理解,本领域技术人员将能够开发未在这里明确描述或示出但是体现本原理并且被包括在本原理的精神和范围之内的各种布置。This description illustrates the present principles. Thus, it will be appreciated that those skilled in the art will be able to develop various arrangements not explicitly described or shown herein that embody the present principles and are included within their spirit and scope.

在此叙述的所有示例和条件性语言意欲用于教导的目的以便帮助读者理解本原理以及由(多个)发明人贡献以推动本领域发展的构思,并且应该被解释为不局限于这样具体叙述的示例和条件。All examples and conditional language recited herein are intended for teaching purposes to aid the reader in understanding the present principles and concepts contributed by the inventor(s) to further the art, and should be construed as not being limited to such specifically recited examples and conditions.

另外,在这里叙述本原理的原理、方面和实施例的所有陈述,及其具体示例意欲包括其结构和功能上的等同物。另外,意图是:这样的等同物包括当前已知的等同物以及将来开发的等同物二者,即所开发的执行相同功能的任何元件,而不论其结构如何。In addition, all statements herein reciting principles, aspects, and embodiments of the present principles, and specific examples thereof, are intended to encompass both structural and functional equivalents thereof. Additionally, it is intended that such equivalents include both currently known equivalents and equivalents developed in the future, i.e., any elements developed that perform the same function, regardless of structure.

因此,例如,本领域技术人员将认识到:在此呈现的框图表示体现本原理的例示性电路的概念性视图。类似地,将认识到:任何流程图示(flow chart)、流程图(flowdiagram)、状态转换图、伪代码等表示实质上可以表示在计算机可读介质中并因此由计算机或处理器执行的各种处理,而不管是否明确地示出这样的计算机或处理器。Thus, for example, those skilled in the art will recognize that the block diagrams presented herein represent conceptual views of illustrative circuitry embodying the present principles. Similarly, it will be recognized that any flow charts, flow diagrams, state transition diagrams, pseudo-code, and the like may substantially represent various processes that are embodied in a computer-readable medium and thereby executed by a computer or processor, whether or not such computer or processor is explicitly shown.

可以通过使用专用硬件、以及与适当的软件相关联的能够执行软件的硬件来提供图中示出的各种元件的功能。当利用处理器来提供所述功能时,可以利用单个专用处理器、利用单个共享处理器、或者利用其中一些可被共享的多个独立处理器来提供所述功能。另外,术语“处理器”或“控制器”的明确使用不应该被解释为排他性地指代能够执行软件的硬件,而是可以隐含地不受限制地包括数字信号处理器(“DSP”)硬件、用于存储软件的只读存储器(“ROM”)、随机存取存储器(“RAM”)、和非易失性存储装置。The functions of the various elements shown in the figures can be provided by using dedicated hardware and hardware capable of executing software in association with appropriate software. When a processor is utilized to provide the functions, the functions can be provided by a single dedicated processor, by a single shared processor, or by a plurality of independent processors, some of which can be shared. In addition, the explicit use of the terms "processor" or "controller" should not be interpreted as referring exclusively to hardware capable of executing software, but rather may implicitly include, without limitation, digital signal processor ("DSP") hardware, read-only memory ("ROM") for storing software, random access memory ("RAM"), and non-volatile storage.

还可以包括其它传统的和/或定制的硬件。类似地,图中示出的任何开关只是概念性的。它们的功能可以通过程序逻辑的运行、通过专用逻辑、通过程序控制和专用逻辑的交互、或者甚至手动地来执行,如从上下文更具体地理解的,实施者可选择具体技术。Other conventional and/or custom hardware may also be included. Similarly, any switches shown in the figures are conceptual only. Their functions may be performed by the operation of program logic, by dedicated logic, by the interaction of program control and dedicated logic, or even manually, as will be more specifically understood from the context, the implementer having the choice of the specific technique.

在其权利要求中,被表示为用于执行指定功能的部件的任何元件意欲包含执行那个功能的任何方式,例如包括:a)执行那个功能的电路元件的组合或者b)与适当电路相组合的任何形式的软件,所述软件因此包括固件或微代码等,所述适当电路用于执行该软件以执行所述功能。由这种权利要求限定的本发明在于如下事实,即,以权利要求所要求的方式将由各种所叙述的部件提供的功能组合和集合到一起。因此认为可以提供那些功能的任何部件与在此示出的那些部件等同。In the claims herein, any element referred to as a means for performing a specified function is intended to encompass any means of performing that function, including, for example, a) a combination of circuit elements that perform that function or b) any form of software, thus including firmware or microcode, combined with appropriate circuitry for executing the software to perform the function. The invention defined by such claims resides in the fact that the functionality provided by the various recited components is combined and brought together in the manner required by the claims. Any components that can provide those functions are therefore considered equivalent to those shown herein.

在本说明书中提到的本原理的“一个实施例”或“实施例”及其其它变型意味着:结合所述实施例描述的具体特征、结构、特性等被包括在本原理的至少一个实施例中。因此,在说明书各处出现的短语“在一个实施例中”和“在实施例中”、以及任何其它变型不一定都指代相同的实施例。Reference throughout this specification to "one embodiment" or "an embodiment" of the present principles and other variations thereof means that a particular feature, structure, characteristic, etc. described in connection with the embodiment is included in at least one embodiment of the present principles. Thus, the appearances of the phrases "in one embodiment" and "in an embodiment," as well as any other variations, throughout the specification are not necessarily all referring to the same embodiment.

应当认识到,“/”、“和/或”以及“至少一个”任何一个的使用,例如在“A/B”、“A和/或B”和“A和B中的至少一个”的情况中,意欲包括仅仅对第一个列出的选项(A)的选择、或仅仅对第二个列出的选项(B)的选择、或者对于两个选项(A和B)的选择。作为另一示例,在“A、B和/或C”以及“A、B和C中的至少一个”的情况中,这种措辞意欲包括仅仅对第一个列出的选项(A)的选择、或仅仅对第二个列出的选项(B)的选择、或仅仅对第三个列出的选项(C)的选择、或仅仅对第一个和第二个列出的选项(A和B)的选择、或仅仅对第一个和第三个列出的选项(A和C)的选择、或仅仅对第二个和第三个列出的选项(B和C)的选择、或者对于全部三个选项(A和B和C)的选择。如本领域和相关领域普通技术人员容易认识到的,这可以被扩展用于很多列出的条目。It should be appreciated that the use of any of “/,” “and/or,” and “at least one of,” such as in the case of “A/B,” “A and/or B,” and “at least one of A and B,” is intended to include a selection of only the first listed option (A), or only the second listed option (B), or a selection of both options (A and B). As another example, in the case of “A, B, and/or C,” and “at least one of A, B, and C,” such wording is intended to include a selection of only the first listed option (A), or only the second listed option (B), or only the third listed option (C), or only the first and second listed options (A and B), or only the first and third listed options (A and C), or only the second and third listed options (B and C), or a selection of all three options (A, B, and C). As will be readily appreciated by those of ordinary skill in this and related arts, this can be extended to many listed items.

此外,应理解,尽管这里关于MPEG-4 AVC标准的扩展来描述本原理的一个或多个实施例,但本原理不仅仅限于该扩展或者该标准,并且因此可以关于其它视频编码标准、推荐、及其扩展而被利用,同时保持本原理的精神。Furthermore, it should be understood that although one or more embodiments of the present principles are described herein with respect to an extension to the MPEG-4 AVC standard, the present principles are not limited to only that extension or that standard and, therefore, may be utilized with respect to other video coding standards, recommendations, and extensions thereof while maintaining the spirit of the present principles.

如这里所使用的,“高级语法”指代在分级上高于宏块层驻留的比特流中存在的语法。例如,如这里所使用的,高级语法可以指代但不限于:码片首标级、补充增强信息(SEI)级、画面参数集(PPS)级、序列参数集(SPS)级、和网络抽象层(NAL)单元首标级处的语法。As used herein, "high-level syntax" refers to syntax present in a bitstream that resides hierarchically above the macroblock layer. For example, as used herein, high-level syntax may refer to, but is not limited to, syntax at the slice header level, the supplemental enhancement information (SEI) level, the picture parameter set (PPS) level, the sequence parameter set (SPS) level, and the network abstraction layer (NAL) unit header level.

而且,如这里所使用的,词语“画面”和“图像”被可互换地使用,并且指代静止图像或者来自视频序列的画面。如已知的,画面可以是帧或场。Also, as used herein, the words "picture" and "image" are used interchangeably and refer to a still image or a picture from a video sequence.As is known, a picture can be a frame or a field.

此外,如这里所使用的,词语“信令”指代向对应解码器指示某些内容(something)。例如,编码器可以对被指定用于特定的大块(如在此定义的)的帧内预测进行信令以便使得解码器得知在编码器侧使用了特定的预测类型(例如,帧内或者帧间)。以此方式,可以在编码器侧和解码器侧使用相同的预测类型。由此,例如,编码器可以对特定的大块传送关于在该大块上要执行帧内预测的指示(例如,信号),以便简单地使得解码器知道并且选择用于该大块的相同的预测类型。应理解,可以以多种方式来实现该信令。例如,可以使用一个或多个语法元素、标志等来向对应解码器对信息进行信令。Furthermore, as used herein, the term "signaling" refers to indicating something to a corresponding decoder. For example, an encoder may signal intra-prediction designated for a particular large block (as defined herein) so that the decoder knows that a particular prediction type (e.g., intra or inter) is used on the encoder side. In this way, the same prediction type may be used on both the encoder and decoder sides. Thus, for example, an encoder may transmit an indication (e.g., a signal) for a particular large block that intra-prediction is to be performed on the large block so that the decoder is simply aware of and selects the same prediction type for the large block. It will be appreciated that this signaling may be implemented in a variety of ways. For example, one or more syntax elements, flags, etc. may be used to signal information to the corresponding decoder.

转到图4,由参考标号400总地指示依据本原理实施例的可以应用本原理的示例性视频编码器。4 , an exemplary video encoder to which the present principles may be applied is indicated generally by the reference numeral 400 , in accordance with an embodiment of the present principles.

视频编码器400包括帧排序缓冲器410,该帧排序缓冲器410具有与组合器485的非反相输入端进行信号通信的输出端。组合器485的输出端以信号通信方式与变换器和量化器425的第一输入端连接。变换器和量化器425的输出端以信号通信方式与熵编码器445的第一输入端以及逆变换器和逆量化器450的第一输入端连接。熵编码器445的输出端以信号通信方式与组合器490的第一非反相输入端连接。组合器490的输出端以信号通信方式与输出缓冲器435的第一输入端连接。The video encoder 400 includes a frame ordering buffer 410 having an output in signal communication with a non-inverting input of a combiner 485. The output of the combiner 485 is connected in signal communication with a first input of a transformer and quantizer 425. The output of the transformer and quantizer 425 is connected in signal communication with a first input of an entropy encoder 445 and a first input of an inverse transformer and inverse quantizer 450. The output of the entropy encoder 445 is connected in signal communication with a first non-inverting input of a combiner 490. The output of the combiner 490 is connected in signal communication with a first input of an output buffer 435.

编码器控制器405的第一输出端以信号通信方式与帧排序缓冲器410的第二输入端、逆变换器和逆量化器450的第二输入端、画面类型判定模块415的输入端、宏块类型(MB类型)判定模块420的输入端、超级帧内预测模块460的第二输入端、去块滤波器465的第二输入端、运动补偿器470的第一输入端、运动估计器475的第一输入端、和参考画面缓冲器480的第二输入端连接。A first output of the encoder controller 405 is connected in signal communication with a second input of the frame sorting buffer 410, a second input of the inverse transformer and inverse quantizer 450, an input of the picture type determination module 415, an input of the macroblock type (MB type) determination module 420, a second input of the super intra prediction module 460, a second input of the deblocking filter 465, a first input of the motion compensator 470, a first input of the motion estimator 475, and a second input of the reference picture buffer 480.

编码器控制器405的第二输出端以信号通信方式与补充增强信息(SEI)插入器430的第一输入端、变换器和量化器425的第二输入端、熵编码器445的第二输入端、输出缓冲器435的第二输入端、以及序列参数集(SPS)和画面参数集(PPS)插入器440的输入端连接。A second output of the encoder controller 405 is connected to a first input of a supplemental enhancement information (SEI) inserter 430, a second input of the transformer and quantizer 425, a second input of the entropy encoder 445, a second input of the output buffer 435, and an input of a sequence parameter set (SPS) and picture parameter set (PPS) inserter 440 in a signal communication manner.

画面类型判定模块415的第一输出端以信号通信方式与帧排序缓冲器410的第三输入端连接。画面类型判定模块115的第二输出端以信号通信方式与宏块类型判定模块420的第二输入端连接。A first output of the picture type determination module 415 is connected in signal communication with a third input of the frame sorting buffer 410. A second output of the picture type determination module 115 is connected in signal communication with a second input of the macroblock type determination module 420.

序列参数集(SPS)和画面参数集(PPS)插入器440的输出端以信号通信方式与组合器490的第三非反相输入端连接。An output of the sequence parameter set (SPS) and picture parameter set (PPS) inserter 440 is connected in signal communication with a third non-inverting input of the combiner 490 .

逆量化器和逆变换器450的输出端以信号通信方式与组合器419的第一非反相输入端连接。组合器419的输出端以信号通信方式与超级帧内预测模块460的第一输入端和去块滤波器465的第一输入端连接。去块滤波器465的输出端以信号通信方式与参考画面缓冲器480的第一输入端连接。参考画面缓冲器480的输出端以信号通信方式与运动估计器475的第二输入端连接。运动估计器475的第一输出端以信号通信方式与运动补偿器470的第二输入端连接。运动估计器475的第二输出端以信号通信方式与熵编码器445的第三输入端连接。An output of the inverse quantizer and inverse transformer 450 is connected in signal communication with a first non-inverting input of a combiner 419. An output of the combiner 419 is connected in signal communication with a first input of a super intra prediction module 460 and a first input of a deblocking filter 465. An output of the deblocking filter 465 is connected in signal communication with a first input of a reference picture buffer 480. An output of the reference picture buffer 480 is connected in signal communication with a second input of a motion estimator 475. A first output of the motion estimator 475 is connected in signal communication with a second input of a motion compensator 470. A second output of the motion estimator 475 is connected in signal communication with a third input of the entropy encoder 445.

运动补偿器470的输出端以信号通信方式与开关497的第一输入端连接。超级帧内预测模块460的输出端以信号通信方式与开关497的第二输入端连接。宏块类型判定模块420的输出端以信号通信方式与开关497的第三输入端连接。开关497的第三输入端确定开关的“数据”输入(与控制输入(即,第三输入端)相对)是由运动补偿器470提供还是由超级帧内预测模块460提供。开关497的输出端以信号通信方式与组合器419的第二非反相输入端和组合器485的反向输入端连接。An output of the motion compensator 470 is connected in signal communication with a first input of a switch 497. An output of the super intra prediction module 460 is connected in signal communication with a second input of the switch 497. An output of the macroblock type determination module 420 is connected in signal communication with a third input of the switch 497. The third input of the switch 497 determines whether the "data" input (as opposed to the control input (i.e., the third input)) of the switch is provided by the motion compensator 470 or the super intra prediction module 460. An output of the switch 497 is connected in signal communication with a second non-inverting input of the combiner 419 and an inverting input of the combiner 485.

帧排序缓冲器410和编码器控制器405的输入端可用作编码器400的用于接收输入画面401的输入端。此外,补充增强信息(SEI)插入器430的输入端可用作编码器400的用于接收元数据的输入端。输出缓冲器435的输出端可用作编码器400的用于输出比特流的输出端。The frame sorting buffer 410 and the input of the encoder controller 405 are available as inputs to the encoder 400 for receiving input pictures 401. In addition, the input of the supplemental enhancement information (SEI) inserter 430 is available as input to the encoder 400 for receiving metadata. The output of the output buffer 435 is available as an output of the encoder 400 for outputting a bitstream.

转到图5,通过参考标号500总地指示依据本原理实施例的可以应用本原理的示例性视频解码器。5 , an exemplary video decoder to which the present principles may be applied is indicated generally by the reference numeral 500 , in accordance with an embodiment of the present principles.

视频解码器500包括输入缓冲器510,该输入缓冲器510具有以信号通信方式与熵解码器545的第一输入端连接的输出端。熵解码器545的第一输出端以信号通信方式与逆变换器和逆量化器550的第一输入端连接。逆变换器和逆量化器550的输出端以信号通信方式与组合器525的第二非反相输入端连接。组合器525的输出端以信号通信方式与去块滤波器565的第二输入端和超级帧内预测模块560的第一输入端连接。去块滤波器565的第二输出端以信号通信方式与参考画面缓冲器580的第一输入端连接。参考画面缓冲器580的输出端以信号通信方式与运动补偿器570的第二输入端连接。The video decoder 500 includes an input buffer 510 having an output connected in signal communication with a first input of an entropy decoder 545. The first output of the entropy decoder 545 is connected in signal communication with a first input of an inverse transformer and inverse quantizer 550. The output of the inverse transformer and inverse quantizer 550 is connected in signal communication with a second non-inverting input of a combiner 525. The output of the combiner 525 is connected in signal communication with a second input of a deblocking filter 565 and a first input of a super intra prediction module 560. A second output of the deblocking filter 565 is connected in signal communication with a first input of a reference picture buffer 580. The output of the reference picture buffer 580 is connected in signal communication with a second input of a motion compensator 570.

熵解码器545的第二输出端以信号通信方式与运动补偿器570的第三输入端和去块滤波器565的第一输入端连接。熵解码器545的第三输出端以信号通信方式与解码器控制器505的输入端连接。解码器控制器505的第一输出端以信号通信方式与熵解码器545的第二输入端连接。解码器控制器505的第二输出端以信号通信方式与逆变换器和逆量化器550的第二输入端连接。解码器控制器505的第三输出端以信号通信方式与去块滤波器565的第三输入端连接。解码器控制器505的第四输出端以信号通信方式与超级帧内预测模块560的第二输入端、运动补偿器570的第一输入端、以及参考画面缓冲器580的第二输入端连接。A second output of the entropy decoder 545 is connected in signal communication with a third input of the motion compensator 570 and a first input of the deblocking filter 565. A third output of the entropy decoder 545 is connected in signal communication with an input of the decoder controller 505. A first output of the decoder controller 505 is connected in signal communication with a second input of the entropy decoder 545. A second output of the decoder controller 505 is connected in signal communication with a second input of the inverse transformer and inverse quantizer 550. A third output of the decoder controller 505 is connected in signal communication with a third input of the deblocking filter 565. A fourth output of the decoder controller 505 is connected in signal communication with a second input of the super intra prediction module 560, a first input of the motion compensator 570, and a second input of the reference picture buffer 580.

运动补偿器570的输出端以信号通信方式与开关597的第一输入端连接。超级帧内预测模块560的输出端以信号通信方式与开关597的第二输入端连接。开关597的输出端以信号通信方式与组合器525的第一非反相输入端连接。An output of the motion compensator 570 is connected in signal communication with a first input of a switch 597. An output of the super intra prediction module 560 is connected in signal communication with a second input of the switch 597. An output of the switch 597 is connected in signal communication with a first non-inverting input of the combiner 525.

输入缓冲器510的输入端可用作解码器500的用于接收输入比特流的输入端。去块滤波器565的第一输出端可用作解码器500的用于对输出画面进行输出的输出端。An input of the input buffer 510 is available as an input of the decoder 500 for receiving an input bitstream. A first output of the deblocking filter 565 is available as an output of the decoder 500 for outputting an output picture.

如上注意的,本原理针对用于视频编码器和解码器的对大块的帧内预测进行信令的方法和装置。此外,如上注意的,可以应用本原理的大块被定义为意指具有等于或者大于32×32的尺寸的块。As noted above, the present principles are directed to methods and apparatus for signaling intra prediction of large blocks for video encoders and decoders.Furthermore, as noted above, a large block to which the present principles may apply is defined to mean a block having a size equal to or greater than 32x32.

在实施例中,为了易于标注,把帧内预测的信令拆分为以下两部分:sip_type(空间帧内划分类型,其可以是INTRA4×4、INTRA8×8、INTRA16×16等等);以及每个sip_type内的intra_pred_mode(诸如,例如INTRA4×4和INTRA8×8内的9种帧内预测模式)。关于特定实施例进一步的详述中,提出了用于本原理的以下三个规则:(1)选择基本编码单元;(2)通过从最大的帧内预测类型中拆分或者从基本编码单元中合并而允许分层级的帧内预测;以及(3)对于每个sip_type,向最频繁使用的intra_pred_mode分配较高的优先级。关于规则(1),允许若干sip_type用于基本的编码单元。In an embodiment, for ease of notation, the signaling of intra prediction is split into two parts: sip_type (spatial intra partitioning type, which can be INTRA4×4, INTRA8×8, INTRA16×16, etc.); and intra_pred_mode within each sip_type (such as, for example, 9 intra prediction modes within INTRA4×4 and INTRA8×8). In further detail regarding a specific embodiment, the following three rules are proposed for the present principle: (1) selecting a basic coding unit; (2) allowing hierarchical intra prediction by splitting or merging from the largest intra prediction type; and (3) for each sip_type, assigning a higher priority to the most frequently used intra_pred_mode. With respect to rule (1), several sip_types are allowed to be used for a basic coding unit.

一实施例One embodiment

在一实施例中,将基本编码单元设置为16×16。在该编码单元中,允许sip_type为INTRA4×4、INTRA8×8、INTRA16×16。还允许分层级的帧内预测,如图6所示。In one embodiment, the basic coding unit is set to 16×16. In this coding unit, sip_type is allowed to be INTRA4×4, INTRA8×8, and INTRA16×16. Hierarchical intra prediction is also allowed, as shown in FIG6 .

转到图6,通过参考标号600总地指示可以应用本原理的示例性的分级划分。在该实施例中,如果最大的块尺寸被设置为64×64,则使用“拆分信令”来允许分层级的帧内预测。也就是说,在实施例中,添加了intra64_flag。如果intra64_flag等于1,则使用INTRA64×64。否则,如果intra64_flag等于0,则将64×64块611拆分为四个32×32块621。对于每个32×32块621,添加intra32_flag。如果intra32_flag等于1,则使用INTRA32×32。否则,如果intra32_flag等于0,则在此(例如对于32×32块621)同样允许在16×16基本编码单元中所允许的所有sip_type。对于INTRA16×16中的intra_pred_mode,具有DC模式和定向模式(directional mode),后者通过发送模式信息而允许不同类型的定向预测。由此,32×32帧内预测块621可以被进一步拆分为4个16×16帧内预测块631。4个16×16帧内预测块631中的一个或多个可以进一步被拆分为DC模式(未示出)、16×16模式641、8×8模式651,和4×4模式661。在该实施例中,假设具有以下四个16×16帧内预测模式:DC;水平的(HOR);垂直的(VER);和多定向的(Multi_DIR)。通过考虑每个模式的优先级来对intra_pred_mode进行信令。在INTRA16×16中,由于DC模式比其它模式更经常地使用,所以在INTRA16×16之前在sip_type表中添加INTRA16×16DC。然后,移除用于INTRA16×16的intra_pred_mode中的most_probable_mode指示。作为替代,绝对地(absolutely)指示其它3个模式(16×16、8×8和4×4)。6 , an exemplary hierarchical partitioning to which the present principles may be applied is indicated generally by reference numeral 600 . In this embodiment, if the maximum block size is set to 64×64, “split signaling” is used to allow hierarchical intra prediction. That is, in an embodiment, an intra64_flag is added. If intra64_flag is equal to 1, INTRA64×64 is used. Otherwise, if intra64_flag is equal to 0, the 64×64 block 611 is split into four 32×32 blocks 621 . For each 32×32 block 621 , an intra32_flag is added. If intra32_flag is equal to 1, INTRA32×32 is used. Otherwise, if intra32_flag is equal to 0, all sip_types allowed in a 16×16 basic coding unit are also allowed here (e.g., for the 32×32 block 621). For intra_pred_mode in INTRA16×16, there are DC mode and directional mode. The latter allows different types of directional prediction by transmitting mode information. Thus, the 32×32 intra prediction block 621 can be further split into four 16×16 intra prediction blocks 631. One or more of the four 16×16 intra prediction blocks 631 can be further split into DC mode (not shown), 16×16 mode 641, 8×8 mode 651, and 4×4 mode 661. In this embodiment, the following four 16×16 intra prediction modes are assumed: DC; horizontal (HOR); vertical (VER); and multi-directional (Multi_DIR). Intra_pred_mode is signaled by considering the priority of each mode. In INTRA16×16, since DC mode is used more frequently than other modes, INTRA16×16DC is added to the sip_type table before INTRA16×16. Then, the most_probable_mode indication in intra_pred_mode for INTRA16x16 is removed. Instead, the other three modes (16x16, 8x8 and 4x4) are absolutely indicated.

语法grammar

在表2和表3中例示用于该实施例的语法示例。具体地,表2示出了依据本原理实施例的用于16×16编码单元的sip_type的示例性规范,表3示出了依据本原理实施例的示例性的INTRA16×16预测模式。对于INTRA32×32/INTRA64×64,使用与INTRA16×16相同的模式。对于信令,利用intra32_DC_flag和intra64_DC_flag来替换most_probable_mode指示,这是由于更频繁地使用DC。然后,绝对地编码其它intra_pred_mode。Tables 2 and 3 illustrate examples of syntax used in this embodiment. Specifically, Table 2 shows an exemplary specification of sip_type for a 16×16 coding unit in accordance with an embodiment of the present principles, and Table 3 shows an exemplary INTRA16×16 prediction mode in accordance with an embodiment of the present principles. For INTRA32×32/INTRA64×64, the same modes as INTRA16×16 are used. For signaling, the most_probable_mode indication is replaced with intra32_DC_flag and intra64_DC_flag, due to the more frequent use of DC. Other intra_pred_modes are then coded absolutely.

可以与在MPEG-4 AVC标准中完全相同地执行对于INTRA4×4和INTRA8×8的intra_pred_mode信令,因此在任何表中不列出这些模式。The intra_pred_mode signaling for INTRA4x4 and INTRA8x8 may be performed exactly as in the MPEG-4 AVC standard, so these modes are not listed in any table.

表2Table 2

Sip_typeSip_type 索引index 二进制比特binary bits SIP8×8SIP8×8 00 00 SIP16×16DCSIP16×16DC 11 1010 SIP16×16SIP16×16 22 110110 SIP4×4SIP4×4 33 11101110

表3Table 3

帧内预测模式Intra prediction mode 索引index 二进制比特binary bits VERVER 00 00 HORHOR 11 1010 Multi-DIRMulti-DIR 22 1111

表4示出了依据本原理实施例的示例性的宏块层语法。Table 4 shows an exemplary macroblock layer syntax according to an embodiment of the present principles.

表4Table 4

表4的一些语法元素的语义如下:The semantics of some syntax elements in Table 4 are as follows:

Intra64_flag等于1规定使用INTRA64×64。Intra64_flag等于0规定64×64的大块被进一步拆分为32×32的划分。Intra64_flag equal to 1 specifies that INTRA64x64 is used. Intra64_flag equal to 0 specifies that the 64x64 block is further split into 32x32 partitions.

Intra64_DC_flag等于1规定intra_pred_mode是用于INTRA64×64的DC模式。Intra64_DC_flag等于0规定intra_pred_mode不是用于INTRA64×64的DC模式。Intra64_DC_flag equal to 1 specifies that intra_pred_mode is the DC mode for INTRA64x64. Intra64_DC_flag equal to 0 specifies that intra_pred_mode is not the DC mode for INTRA64x64.

intra_pred_mode_64规定用于INTRA64×64的帧内预测模式(不包括DC模式)。intra_pred_mode_64 specifies the intra prediction mode (excluding DC mode) for INTRA64×64.

intra_multidir_index规定用于INTRA64×64中的Multi_Dir模式的角度的索引。intra_multidir_index specifies the index of the angle used for Multi_Dir mode in INTRA64×64.

Intra32_flag[i]等于1规定对于第i个32×32的大块使用INTRA32×32。intra32_flag[i]等于0规定第i个32×32的大块被进一步拆分为16×16的划分。Intra32_flag[i] equal to 1 specifies that INTRA32x32 is used for the i-th 32x32 chunk. intra32_flag[i] equal to 0 specifies that the i-th 32x32 chunk is further split into 16x16 partitions.

Intra32_DC_flag[i]等于1规定对于第i个32×32块intra_pred_mode是用于INTRA32×32的DC模式。Intra32_DC_flag[i]等于0规定对于第i个32×32块intra_pred_mode不是用于INTRA32×32的DC模式。Intra32_DC_flag[i] equal to 1 specifies that for the i-th 32x32 block intra_pred_mode is the DC mode for INTRA32x32. Intra32_DC_flag[i] equal to 0 specifies that for the i-th 32x32 block intra_pred_mode is not the DC mode for INTRA32x32.

intra_pred_mode_32[i]规定用于第i个32×32大块的INTRA32×32的帧内预测模式(不包括DC模式)。intra_pred_mode_32[i] specifies the INTRA32x32 intra prediction mode (excluding DC mode) for the i-th 32x32 large block.

intra_multidir_index规定用于INTRA32×32中的Multi_Dir模式的角度的索引。intra_multidir_index specifies the index of the angle used for Multi_Dir mode in INTRA32x32.

sip_type[i]规定在第i个16×16块中用于基本块编码单元的空间帧内划分类型。sip_type[i] specifies the spatial intra partitioning type used for the basic block coding unit in the i-th 16×16 block.

intra_pred_mode_16[i]规定用于第i个16×16块的INTRA16×16的帧内预测模式(不包括DC模式)。intra_pred_mode_16[i] specifies the INTRA16x16 intra prediction mode (excluding DC mode) for the i-th 16x16 block.

intra_multidir_index规定用于第i个16×16块的INTRA16×16中的Multi_Dir模式的角度的索引。intra_multidir_index specifies the index of the angle of the Multi_Dir mode in INTRA16x16 for the i-th 16x16 block.

另一实施例Another embodiment

在另一实施例中,自适应地选择大块单元为32×32或者64×64。可以使用一个或多个高级语法元素来对该选择进行信令。在实施例中,如果选择32×32,则仅仅移除与64×64有关的所有语法。In another embodiment, the large block unit is adaptively selected to be 32x32 or 64x64. One or more high-level syntax elements may be used to signal the selection. In an embodiment, if 32x32 is selected, all syntax related to 64x64 is simply removed.

在另一实施例中,分层级的帧内预测可以包含从基本编码单元合并。例如,如果最大的块单元是64×64并且基本编码单元是16×16,则使用一个标志(is_all_16×16_coding)来指示一个64×64块内部的所有16×16块是否都属于16×16编码类型。如果is_all_16×16_coding等于1,则这指示使用16×16编码类型并且停止信令。否则,使用一个标志(is_all_32×32_coding)来指示一个64×64块内部的所有32×32块是否都属于32×32编码类型。如果is_all_32×32_coding等于1,则这指示一个64×64块内部的所有32×32块都属于32×32编码类型。否则,如果is_all_32×32_coding和is_all_16×16_coding等于0,则这指示使用INTRA64×64。In another embodiment, hierarchical intra prediction may include merging from basic coding units. For example, if the largest block unit is 64×64 and the basic coding unit is 16×16, a flag (is_all_16×16_coding) is used to indicate whether all 16×16 blocks within a 64×64 block belong to the 16×16 coding type. If is_all_16×16_coding is equal to 1, this indicates that the 16×16 coding type is used and signaling is stopped. Otherwise, a flag (is_all_32×32_coding) is used to indicate whether all 32×32 blocks within a 64×64 block belong to the 32×32 coding type. If is_all_32×32_coding is equal to 1, this indicates that all 32×32 blocks within a 64×64 block belong to the 32×32 coding type. Otherwise, if is_all_32x32_coding and is_all_16x16_coding are equal to 0, this indicates that INTRA64x64 is used.

在另一实施例中,引入了用于具有不小于16×16尺寸的块单元的SIP类型(large_sip_type)。三种类型如下被称为:large_intra_16×16;large_intra_32×32;和large_intra_64×64。large_intra_16×16意味着一个大块内部的所有16×16块都属于16×16编码类型。large_intra_32×32意味着一个大块内部的所有32×32块都属于32×32编码类型。在一实施例中,large_intra_32×32可以与以上利用intra32_flag描述的实施例进行组合以允许分层级的帧内预测。large_intra_64×64意味着一个大块内部的所有64×64块都被编码为INTRA64×64。In another embodiment, a SIP type (large_sip_type) is introduced for block units with a size of not less than 16×16. The three types are referred to as follows: large_intra_16×16; large_intra_32×32; and large_intra_64×64. large_intra_16×16 means that all 16×16 blocks within a large block belong to the 16×16 coding type. large_intra_32×32 means that all 32×32 blocks within a large block belong to the 32×32 coding type. In one embodiment, large_intra_32×32 can be combined with the embodiment described above using intra32_flag to allow hierarchical intra-frame prediction. large_intra_64×64 means that all 64×64 blocks within a large block are coded as INTRA64×64.

在另一实施例中,可以引入若干sip/mode表。可以将这些表预先存储在编码器和解码器二者中,或者这些表可以是用户指定的并且使用一个或多个高级语法元素进行传送。表5示出了依据本原理实施例的示例性的宏块层语法。In another embodiment, several sip/mode tables may be introduced. These tables may be pre-stored in both the encoder and decoder, or they may be user-specified and transmitted using one or more high-level syntax elements. Table 5 shows an exemplary macroblock layer syntax according to an embodiment of the present principles.

表5Table 5

表5的一些语法元素的语义如下:The semantics of some syntax elements in Table 5 are as follows:

is_all_16×16_coding等于1规定大块内部的所有16×16块都通过16×16编码类型编码。is_all_16×16_coding等于0规定所述大块不通过16×16编码类型进行编码。is_all_16x16_coding equal to 1 specifies that all 16x16 blocks within the large block are coded using the 16x16 coding type. is_all_16x16_coding equal to 0 specifies that the large block is not coded using the 16x16 coding type.

is_all_32×32_coding等于1规定大块内部的所有32×32块都通过32×32编码类型编码。is_all_32×32_coding等于0规定所述大块不通过32×32编码类型进行编码。is_all_32x32_coding equal to 1 specifies that all 32x32 blocks within the large block are coded using the 32x32 coding type. is_all_32x32_coding equal to 0 specifies that the large block is not coded using the 32x32 coding type.

转到图7A和7B,其一起表示由参考标号700总地指示的、通过对用于大块的帧内预测进行信令而编码所述大块的画面数据的示例性方法。方法700包括开始块705,其将控制传递到功能块710。功能块710执行初始化,并且将控制传递到循环限制块715。循环限制块715对64×64块(即,具有64×64块尺寸的块)执行循环(下文也称为循环1),并且将控制传递到功能块785和循环限制块720。7A and 7B , which together illustrate an exemplary method for encoding picture data of a large block by signaling intra prediction for the large block, generally indicated by the reference numeral 700. Method 700 includes a start block 705, which passes control to a function block 710. Function block 710 performs initialization and passes control to a loop limit block 715. Loop limit block 715 performs a loop (hereinafter also referred to as loop 1) on a 64×64 block (i.e., a block having a 64×64 block size) and passes control to a function block 785 and a loop limit block 720.

功能块785执行帧内64×64模式判定,基于RD64(即,从Intra64×64模式判定产生的率失真)设置Intra64_DC_flag,并且将控制传递到判定块770。Function block 785 performs the intra 64×64 mode decision, sets the Intra64_DC_flag based on RD64 (ie, the rate-distortion resulting from the Intra64×64 mode decision), and passes control to decision block 770 .

循环限制块720对四个32×32块(即,具有32×32块尺寸并且从由循环1处理的当前64×64块中获得的四个块)执行循环(下文也称为循环2),并且将控制传递到功能块790和循环限制块725。The loop limit block 720 performs a loop (hereinafter also referred to as loop 2) on four 32×32 blocks (i.e., four blocks having a 32×32 block size and obtained from the current 64×64 block processed by loop 1) and passes control to the function block 790 and the loop limit block 725.

功能块790执行帧内32×32模式判定,基于RD32(即,从Intra32×32模式判定产生的率失真)设置Intra32_DC_flag,并且将控制传递到判定块750。Function block 790 performs the intra 32×32 mode decision, sets the Intra32_DC_flag based on RD32 (ie, the rate-distortion resulting from the Intra32×32 mode decision), and passes control to decision block 750 .

循环限制块725对四个16×16块(即,具有16×16块尺寸并且从由循环2处理的当前32×32块中获得的四个块)执行循环(下文也称为循环3),并且将控制传递到功能块730和功能块735。The loop limit block 725 performs a loop (hereinafter also referred to as loop 3) on four 16×16 blocks (i.e., four blocks having a 16×16 block size and obtained from the current 32×32 block processed by loop 2) and passes control to function blocks 730 and 735.

功能块730评估Intra16×16_DC模式,并且将控制传递到功能块740。功能块735评估其它的16×16模式(即,除了Intra 16×16_DC以外的)和以下的模式(例如,8×8,4×4等等),并且将控制传递到功能块740。Function block 730 evaluates the Intra16x16_DC mode and passes control to function block 740. Function block 735 evaluates other 16x16 modes (i.e., other than Intra16x16_DC) and following modes (e.g., 8x8, 4x4, etc.) and passes control to function block 740.

功能块740基于RD16(即,从Intra16×16模式判定产生的率失真)执行16×16模式判定,然后累计每个16×16块的RD16以获得TotRD16(其指示当由四个16×16块编码时整个32×32块的总的率失真),并且将控制传递到循环限制块745。循环限制块745结束对16×16块的循环(即,循环3),并且将控制传递给判定块750。Function block 740 performs a 16×16 mode decision based on RD16 (i.e., the rate-distortion resulting from the Intra16×16 mode decision), then accumulates the RD16 for each 16×16 block to obtain TotRD16 (which indicates the total rate-distortion for the entire 32×32 block when encoded by four 16×16 blocks), and passes control to loop limit block 745. Loop limit block 745 ends the loop for the 16×16 blocks (i.e., loop 3) and passes control to decision block 750.

判定块750确定RD32是否小于TotRD16(即,当前32×32块的率失真成本是否小于从当前32×32块中获得的四个16×16块的总的率失真成本)。如果是这样,则控制被传递给功能块755。否则,控制被传递给功能块742。Decision block 750 determines whether RD32 is less than TotRD16 (i.e., whether the rate-distortion cost of the current 32×32 block is less than the total rate-distortion cost of the four 16×16 blocks obtained from the current 32×32 block). If so, control is passed to function block 755. Otherwise, control is passed to function block 742.

功能块755将Intra32_flag设置为等于1,并且将控制传递给功能块760。功能块742将Intra32_flag设置为等于0,并且将控制传递给功能块760。Function block 755 sets Intra32_flag equal to 1 and passes control to function block 760. Function block 742 sets Intra32_flag equal to 0 and passes control to function block 760.

功能块760将每个32×32块的RD32的累计设置至TotRD32以便指示在通过四个32×32块进行编码时整个64×64块的总的率失真,并且将控制传递给循环限制块765。循环限制块765结束对32×32块的循环(即,循环2),并且将控制传递给判定块770。Function block 760 sets the accumulation of RD32 for each 32×32 block to TotRD32 to indicate the total rate-distortion of the entire 64×64 block when encoded by four 32×32 blocks, and passes control to loop limit block 765. Loop limit block 765 ends the loop for the 32×32 blocks (i.e., loop 2) and passes control to decision block 770.

判定块770确定RD64是否小于TotRD32(即,当前64×64块的率失真成本是否小于从当前64×64块中获得的四个32×32块的总的率失真成本)。如果是这样,则控制被传递给功能块775。否则,控制被传递给功能块780。Decision block 770 determines whether RD64 is less than TotRD32 (i.e., whether the rate-distortion cost of the current 64×64 block is less than the total rate-distortion cost of the four 32×32 blocks obtained from the current 64×64 block). If so, control is passed to function block 775. Otherwise, control is passed to function block 780.

功能块775将Intra64_flag设置为等于1,并且将控制传递给循环限制块795。功能块780将Intra64_flag设置为等于0,并且将控制传递给循环限制块795。Function block 775 sets Intra64_flag equal to 1 and passes control to loop limit block 795. Function block 780 sets Intra64_flag equal to 0 and passes control to loop limit block 795.

循环限制块795结束对64×64块的循环(即,循环1),并且将控制传递给功能块797。功能块797熵编码标志、intra_pred_mode和残差,并且将控制传递给结束块799。The loop limit block 795 ends the loop on the 64×64 block (ie, loop 1) and passes control to a function block 797 . The function block 797 entropy encodes the flag, intra_pred_mode, and residual, and passes control to an end block 799 .

转到图8A和8B,其一起表示由参考标号800总地指示的、通过确定要被应用于大块的帧内预测而解码所述大块的画面数据的示例性方法。方法800包括开始块805,其将控制传递到功能块808。功能块808初始化解码器并且然后将控制传递到功能块810。功能块810解析比特流,并且将控制传递给循环限制块815。循环限制块815对64×64块执行循环(下文也称为循环1),并且将控制传递判定块820。判定块820确定Intra64_flag是否被设置为等于1。如果是这样,则控制被传递给功能块885。否则,控制被传递给循环限制块825。8A and 8B , which together illustrate an exemplary method, generally designated by the reference numeral 800 , for decoding picture data for a large block by determining intra-frame prediction to be applied to the large block. Method 800 includes a start block 805 , which passes control to a function block 808 . Function block 808 initializes the decoder and then passes control to a function block 810 . Function block 810 parses the bitstream and passes control to a loop limit block 815 . Loop limit block 815 performs a loop on the 64×64 blocks (hereinafter also referred to as loop 1 ) and passes control to a decision block 820 . Decision block 820 determines whether the Intra64_flag is set equal to 1. If so, control is passed to a function block 885 . Otherwise, control is passed to a loop limit block 825 .

功能块885确定Intra64_DC_flag是否被设置为等于1。如果是这样,则控制被传递给功能块887。否则,控制被传递给功能块888。功能块887执行Intra64×64DC预测,并且然后将控制传递到功能块890。功能块888执行除了Intra64×64DC模式以外的Intra 64×64预测,并且然后将控制传递给功能块890。功能块890解码当前64×64块,并且将控制传递给循环限制块880。循环限制块880结束对64×64块的循环(即,循环1),并且将控制传递给结束块899。Function block 885 determines whether Intra64_DC_flag is set equal to 1. If so, control is passed to function block 887. Otherwise, control is passed to function block 888. Function block 887 performs Intra64×64 DC prediction and then passes control to function block 890. Function block 888 performs Intra64×64 prediction except for Intra64×64 DC mode and then passes control to function block 890. Function block 890 decodes the current 64×64 block and passes control to loop limit block 880. Loop limit block 880 ends the loop on the 64×64 block (i.e., loop 1) and passes control to end block 899.

循环限制块825对四个32×32块执行循环(下文也称为循环2),并且将控制传递给判定块830。判定块830确定Intra32_flag是否等于1。如果是这样,则控制被传递给功能块835。否则,控制被传递给循环限制块845。The loop limit block 825 performs a loop on the four 32×32 blocks (hereinafter also referred to as loop 2) and passes control to a decision block 830. The decision block 830 determines whether Intra32_flag is equal to 1. If so, control is passed to a function block 835. Otherwise, control is passed to a loop limit block 845.

功能块835确定Intra32_DC_flag是否等于1。如果是这样,则控制被传递给功能块837。否则,控制被传递给功能块838。功能块837执行Intra32×32DC预测,并且将控制传递到功能块840。功能块838执行除了Intra32×32DC模式以外的帧内预测模式,并且然后将控制传递给功能块840。功能块840解码32×32块,并且将控制传递给循环限制块875。Function block 835 determines whether Intra32_DC_flag is equal to 1. If so, control is passed to function block 837. Otherwise, control is passed to function block 838. Function block 837 performs Intra32×32 DC prediction and passes control to function block 840. Function block 838 performs intra prediction modes other than Intra32×32 DC mode and then passes control to function block 840. Function block 840 decodes the 32×32 block and passes control to loop limit block 875.

循环限制块875结束对32×32块的循环(即,循环2),并且将控制传递给循环限制块880。The loop limit block 875 ends the loop on the 32×32 block (ie, loop 2 ) and passes control to the loop limit block 880 .

循环限制块845对四个16×16块执行循环(下文也称为循环3),并且将控制传递判定块850。判定块850确定是否sip_type=Intra16_DC。如果是这样,则控制被传递给功能块855。否则,控制被传递给功能块860。The loop limit block 845 performs a loop on the four 16×16 blocks (hereinafter also referred to as loop 3) and passes control to the decision block 850. The decision block 850 determines whether sip_type=Intra16_DC. If so, control is passed to the function block 855. Otherwise, control is passed to the function block 860.

功能块855执行Intra16×16_DC模式预测,并且将控制传递到功能块865。功能块860使用其它帧内预测模式(即,除了Intra16×16_DC模式以外的)执行模式预测,并且将控制传递给功能块865。Function block 855 performs Intra16x16_DC mode prediction and passes control to function block 865. Function block 860 performs mode prediction using other intra prediction modes (ie, other than Intra16x16_DC mode) and passes control to function block 865.

功能块865解码16×16块,并且将控制传递给循环限制块870。循环限制块870结束对16×16块的循环(即,循环3),并且将控制传递给循环限制块875。Function block 865 decodes the 16×16 block and passes control to loop limit block 870 . Loop limit block 870 ends the loop on the 16×16 block (ie, loop 3 ) and passes control to loop limit block 875 .

现在将描述本发明的许多伴随优点/特征中的一些,其中的一些已经在上面提及。例如,一个优点/特征是一种具有视频编码器的装置,所述视频编码器通过对用于画面中的至少一个大块的帧内预测进行信令来编码用于所述至少一个大块的画面数据。通过选择基本编码单元尺寸并且分配用于基本编码单元尺寸的单个空间帧内划分类型来对帧内预测进行信令。该单个空间帧内划分类型是可从多个空间帧内划分类型中选择的。所述至少一个大块具有比基本编码单元的块尺寸大的大块尺寸。帧内预测是分层级的帧内预测并且通过以下操作中的至少一个而对至少一个大块执行:将大块尺寸拆分为基本编码单元尺寸以及从基本编码单元尺寸合并到大块尺寸。Some of the many attendant advantages/features of the present invention will now be described, some of which have been mentioned above. For example, one advantage/feature is an apparatus having a video encoder that encodes picture data for at least one large block in a picture by signaling intra-frame prediction for the at least one large block. The intra-frame prediction is signaled by selecting a basic coding unit size and assigning a single spatial intra-frame partitioning type for the basic coding unit size. The single spatial intra-frame partitioning type is selectable from a plurality of spatial intra-frame partitioning types. The at least one large block has a large block size that is larger than the block size of the basic coding unit. The intra-frame prediction is hierarchical intra-frame prediction and is performed on the at least one large block by at least one of the following operations: splitting the large block size into the basic coding unit size and merging from the basic coding unit size to the large block size.

另一优点/特征是具有如上所述的视频编码器的装置,其中对于多个空间帧内划分类型的每一个,向多个可用的帧内预测模式中最频繁使用的特定的帧内预测模式分配较高的优先级。Another advantage/feature is the apparatus having the video encoder as described above, wherein for each of a plurality of spatial intra partition types, a most frequently used particular intra prediction mode among a plurality of available intra prediction modes is assigned a higher priority.

又一优点/特征是具有如上所述的视频编码器的装置,其中自适应地选择大块尺寸。Yet another advantage/feature is the apparatus having the video encoder as described above, wherein the macroblock size is adaptively selected.

再一优点/特征是具有如上所述的视频编码器的装置,其中,使用一个或多个高级语法元素来执行信令。Yet another advantage/feature is the apparatus having the video encoder as described above, wherein signaling is performed using one or more high-level syntax elements.

此外,另一优点/特征是具有如上所述的视频编码器的装置,其中空间帧内划分类型表和帧内预测模式表中的至少一个被视频编码器预先存储并且使用以便编码至少一个大块。该空间帧内划分类型表和帧内预测模式表中的至少一个被安排为被对应的视频解码器预先存储并且使用以便解码该至少一个大块。Furthermore, another advantage/feature is the apparatus having the video encoder as described above, wherein at least one of the spatial intra partition type table and the intra prediction mode table is pre-stored and used by the video encoder to encode at least one large block, and wherein the at least one of the spatial intra partition type table and the intra prediction mode table is pre-stored and used by a corresponding video decoder to decode the at least one large block.

此外,另一优点/特征是具有如上所述的视频编码器的装置,其中空间帧内划分类型表和帧内预测模式表中的至少一个被视频编码器用于编码至少一个大块,并且被视频编码器使用一个或多个高级语法元素来传送。Additionally, another advantage/feature is an apparatus having a video encoder as described above, wherein at least one of the spatial intra partitioning type table and the intra prediction mode table is used by the video encoder to encode at least one large block and is transmitted by the video encoder using one or more high-level syntax elements.

基于这里的教导,本领域普通技术人员可以容易确定本原理的这些和其它特征和优点。应理解,本原理的教导可以以硬件、软件、固件、专用处理器、或其组合的各种形式来实现。Based on the teachings herein, those skilled in the art can easily determine these and other features and advantages of the present principles.It should be understood that the teachings of the present principles can be implemented in various forms of hardware, software, firmware, special purpose processors, or a combination thereof.

更优选地,本原理的教导被实现为硬件与软件的组合。此外,软件可以实现为有形地体现在程序存储单元上的应用程序。应用程序可以被上载到包括任何适当架构的机器并由该机器执行。优选地,在具有诸如一个或多个中央处理单元(“CPU”)、随机存取存储器(“RAM”)、以及输入/输出(“I/O”)接口等的硬件的计算机平台上实现该机器。计算机平台还可以包括操作系统和微指令代码。这里描述的各种处理与功能可以是可能由CPU执行的微指令代码的一部分或是应用程序的一部分、或者是其任何组合。另外,各种其它的诸如附加数据存储单元以及打印单元之类的外设单元可以连接到计算机平台。More preferably, the teaching of the present principles is implemented as a combination of hardware and software. In addition, the software can be implemented as an application program tangibly embodied on a program storage unit. The application program can be uploaded to a machine comprising any appropriate architecture and executed by the machine. Preferably, the machine is implemented on a computer platform having hardware such as one or more central processing units ("CPUs"), random access memories ("RAMs"), and input/output ("I/O") interfaces. The computer platform can also include an operating system and microinstruction code. The various processes and functions described herein can be a part of the microinstruction code that may be executed by the CPU or a part of the application program, or any combination thereof. In addition, various other peripheral units such as additional data storage units and printing units can be connected to the computer platform.

还应理解,由于在附图中描绘的一些组成系统组件和方法优选地以软件实现,因此这些系统组件或处理功能块之间的实际连接可能取决于本原理被编程的方式而有所不同。给出这里的教导,本领域普通技术人员将能够预期本原理的这些和类似的实现方式或配置。It should also be understood that since some of the constituent system components and methods depicted in the accompanying drawings are preferably implemented in software, the actual connections between these system components or processing function blocks may vary depending on how the present principles are programmed. Given the teachings herein, one of ordinary skill in the art will be able to contemplate these and similar implementations or configurations of the present principles.

尽管这里已经参考附图描述了示例实施例,应理解本原理不限于那些确切的实施例,并且本领域普通技术人员可以在其中进行各种改变和修改,而不偏离本原理的范围和精神。所有这些改变和修改意在被包括在所附权利要求阐述的本原理的范围之内。Although exemplary embodiments have been described herein with reference to the accompanying drawings, it should be understood that the present principles are not limited to those precise embodiments, and that various changes and modifications may be made therein by one of ordinary skill in the art without departing from the scope and spirit of the present principles. All such changes and modifications are intended to be included within the scope of the present principles as set forth in the appended claims.

Claims (8)

1.一种视频编码装置中的方法,所述方法包括:1. A method in a video encoding apparatus, the method comprising: 通过确定要为画面中的大块执行帧内预测来编码所述大块的画面数据,其中所述大块具有比基本编码单元尺寸大的块尺寸,所述大块尺寸是32×32和64×64中的一个,且所述基本编码单元尺寸是16×16,The large blocks of frame data are encoded by determining which large blocks in the frame need to be encoded using intra-frame prediction, wherein the large blocks have a block size larger than the basic coding unit size, the large block size being one of 32×32 and 64×64, and the basic coding unit size is 16×16. 其中对于所述大块,通过以下操作信令所述帧内预测:For the large block, the intra-frame prediction is signaled through the following operation: 编码二进制拆分信令语法元素,所述二进制拆分信令语法元素指定是否所述大块被进一步拆分为四个尺寸相等的子块;Encode binary split signaling syntax elements, which specify whether the large block is further split into four equal-sized sub-blocks; 在所述二进制拆分信令语法元素指定所述大块不被进一步拆分的情形下,编码用于所述大块的帧内预测模式;In the case where the binary split signaling syntax element specifies that the block is not to be further split, the intra-frame prediction mode for the block is encoded. 否则在所述二进制拆分信令语法元素指定所述大块被进一步拆分的情形下:Otherwise, in the case where the binary splitting signaling syntax element specifies that the large block is further split: 对于每个子块,在所述子块是32×32的情形下,编码指定是否所述32×32的子块被进一步拆分为四个基本编码单元尺寸相等的块的二进制拆分信令语法元素,以及在所述指定是否所述32×32的子块被进一步拆分为四个基本编码单元尺寸相等的块的二进制拆分信令语法元素指定所述32×32的子块不被进一步拆分的情形下,编码用于所述32×32的子块的帧内预测模式;以及For each sub-block, if the sub-block is 32×32, a binary splitting signaling syntax element is encoded specifying whether the 32×32 sub-block is further split into four blocks of equal basic coding unit size; and if the binary splitting signaling syntax element specifying whether the 32×32 sub-block is further split into four blocks of equal basic coding unit size specifies that the 32×32 sub-block is not further split, an intra-prediction mode for the 32×32 sub-block is encoded; and 对于每个子块,在所述子块是16×16的情形下,编码单个空间帧内划分类型,所述单个空间帧内划分类型是从多个空间帧内划分类型中可确定的。For each sub-block, in the case that the sub-block is 16×16, a single spatial intra-frame partition type is encoded, which can be determined from multiple spatial intra-frame partition types. 2.如权利要求1所述的方法,还包括:2. The method of claim 1, further comprising: 编码至少一个二进制合并信令语法元素,所述至少一个二进制合并信令语法元素指定是否基本编码单元尺寸的块被合并为大尺寸的块。Encode at least one binary merge signaling syntax element, which specifies whether blocks of basic coding unit size are merged into larger blocks. 3.如权利要求1所述的方法,其中空间帧内划分类型表和帧内预测模式表中的至少一个是预先存储的并由所述编码方法使用以编码所述大块。3. The method of claim 1, wherein at least one of the spatial intra-frame partitioning type table and the intra-frame prediction mode table is pre-stored and used by the encoding method to encode the bulk block. 4.如权利要求1所述的方法,其中空间帧内划分类型表和帧内预测模式表中的至少一个是由所述编码方法使用一个或多个高级语法元素编码的,并由所述编码方法使用以编码所述大块。4. The method of claim 1, wherein at least one of the spatial intra-frame partitioning type table and the intra-frame prediction mode table is encoded by the encoding method using one or more high-level syntax elements, and is used by the encoding method to encode the bulk block. 5.一种视频解码装置中的方法,包括:5. A method in a video decoding apparatus, comprising: 通过确定要为画面中的大块执行帧内预测来解码所述大块的画面数据,Decoding the large chunk of frame data involves determining which chunk needs to be intra-frame prediction performed. 其中,所述大块具有比基本编码单元尺寸大的块尺寸,所述大块尺寸是32×32和64×64中的一个,且所述基本编码单元尺寸是16×16,The large block has a block size larger than the basic coding unit size, and the large block size is one of 32×32 and 64×64, while the basic coding unit size is 16×16. 其中对于所述大块,通过以下操作信令所述帧内预测:For the large block, the intra-frame prediction is signaled through the following operation: 解码二进制拆分信令语法元素,所述二进制拆分信令语法元素指定是否所述大块被进一步拆分为四个尺寸相等的子块;Decode the binary split signaling syntax element, which specifies whether the large block is further split into four equal-sized sub-blocks; 在所述二进制拆分信令语法元素指定所述大块不被进一步拆分的情形下,解码用于所述大块的帧内预测模式;In the case where the binary split signaling syntax element specifies that the block is not to be further split, decode the intra-prediction mode for the block; 否则在所述二进制拆分信令语法元素指定所述大块被进一步拆分的情形下:Otherwise, in the case where the binary splitting signaling syntax element specifies that the large block is further split: 对于每个子块,在所述子块是32×32的情形下,解码指定是否所述32×32的子块被进一步拆分为四个基本编码单元尺寸相等的块的二进制拆分信令语法元素,以及在所述指定是否所述32×32的子块被进一步拆分为四个基本编码单元尺寸相等的块的二进制拆分信令语法元素指定所述32×32的子块不被进一步拆分的情形下,解码用于所述32×32的子块的帧内预测模式;以及For each sub-block, if the sub-block is 32×32, decode the binary splitting signaling syntax element specifying whether the 32×32 sub-block is further split into four blocks of equal basic coding unit size; and if the binary splitting signaling syntax element specifying whether the 32×32 sub-block is further split into four blocks of equal basic coding unit size indicates that the 32×32 sub-block is not further split, decode the intra-prediction mode for the 32×32 sub-block; and 对于每个子块,在所述子块是16×16的情形下,解码单个空间帧内划分类型,所述单个空间帧内划分类型是从多个空间帧内划分类型中可确定的。For each sub-block, in the case that the sub-block is 16×16, a single spatial intra-frame partition type is decoded, which can be determined from multiple spatial intra-frame partition types. 6.如权利要求5所述的方法,还包括:6. The method of claim 5, further comprising: 解码至少一个二进制合并信令语法元素,所述至少一个二进制合并信令语法元素指定是否基本编码单元尺寸的块被合并为大尺寸的块。Decode at least one binary merge signaling syntax element, which specifies whether blocks of basic coding unit size are merged into larger blocks. 7.如权利要求5所述的方法,其中空间帧内划分类型表和帧内预测模式表中的至少一个是预先存储的并由所述解码方法使用以解码所述大块。7. The method of claim 5, wherein at least one of the spatial intra-frame partitioning type table and the intra-frame prediction mode table is pre-stored and used by the decoding method to decode the block. 8.如权利要求5所述的方法,其中空间帧内划分类型表和帧内预测模式表中的至少一个是由所述解码方法使用一个或多个高级语法元素接收的,并由所述解码方法使用以解码所述大块。8. The method of claim 5, wherein at least one of the spatial intra-frame partitioning type table and the intra-frame prediction mode table is received by the decoding method using one or more high-level syntax elements and used by the decoding method to decode the block.
HK16101202.4A 2009-07-01 2016-02-02 Methods and apparatus for video encoders and decoders HK1213402B (en)

Applications Claiming Priority (2)

Application Number Priority Date Filing Date Title
US22217709P 2009-07-01 2009-07-01
US61/222,177 2009-07-01

Publications (2)

Publication Number Publication Date
HK1213402A1 HK1213402A1 (en) 2016-06-30
HK1213402B true HK1213402B (en) 2019-08-30

Family

ID=

Similar Documents

Publication Publication Date Title
JP7576073B2 (en) Method and apparatus for signaling intra prediction for large blocks for video encoders and decoders - Patents.com
HK1213402B (en) Methods and apparatus for video encoders and decoders
HK1212841B (en) Methods and apparatus for video encoders and decoders