WO2012065552A1 - Video image encoding method and apparatus - Google Patents

Video image encoding method and apparatus

Info

Publication number
WO2012065552A1
Authority
WO
WIPO (PCT)
Prior art keywords
block
parameter
slice
encoding
video image
Prior art date
Application number
PCT/CN2011/082287
Inventor
左雯
王宁
于培松
Original Assignee
中兴通讯股份有限公司
Priority date (The priority date is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the date listed.)
2010-11-17
Filing date
2011-11-16
Publication date
2012-05-24
Application filed by 中兴通讯股份有限公司
Publication of WO2012065552A1

Classifications

    • H ELECTRICITY
    • H04 ELECTRIC COMMUNICATION TECHNIQUE
    • H04N PICTORIAL COMMUNICATION, e.g. TELEVISION
    • H04N19/00 Methods or arrangements for coding, decoding, compressing or decompressing digital video signals
    • H04N19/10 Methods or arrangements for coding, decoding, compressing or decompressing digital video signals using adaptive coding
    • H04N19/169 Methods or arrangements for coding, decoding, compressing or decompressing digital video signals using adaptive coding characterised by the coding unit, i.e. the structural portion or semantic portion of the video signal being the object or the subject of the adaptive coding
    • H04N19/17 Methods or arrangements for coding, decoding, compressing or decompressing digital video signals using adaptive coding characterised by the coding unit, i.e. the structural portion or semantic portion of the video signal being the object or the subject of the adaptive coding, the unit being an image region, e.g. an object
    • H04N19/174 Methods or arrangements for coding, decoding, compressing or decompressing digital video signals using adaptive coding characterised by the coding unit, i.e. the structural portion or semantic portion of the video signal being the object or the subject of the adaptive coding, the unit being an image region, e.g. an object, the region being a slice, e.g. a line of blocks or a group of blocks
    • H ELECTRICITY
    • H04 ELECTRIC COMMUNICATION TECHNIQUE
    • H04N PICTORIAL COMMUNICATION, e.g. TELEVISION
    • H04N19/00 Methods or arrangements for coding, decoding, compressing or decompressing digital video signals
    • H04N19/90 Methods or arrangements for coding, decoding, compressing or decompressing digital video signals using coding techniques not provided for in groups H04N19/10-H04N19/85, e.g. fractals
    • H04N19/96 Tree coding, e.g. quad-tree coding

Definitions

  • The present invention relates to the field of communications, and in particular to a video image encoding method and apparatus.
  • Background of the invention: in video coding standards before H.264, each frame is first divided into macroblocks (16×16), and the image is then divided into 8×8 or 16×16 blocks according to predetermined decision criteria.
  • Inter-frame or intra-frame prediction is performed on these 8×8 or 16×16 blocks; finally, an 8×8 block discrete cosine transform (DCT) is applied to the residual, and the DCT coefficients are encoded.
  • FIG. 1 is a schematic diagram of the position of the block division module in a video encoder according to the related art.
  • In H.264, block division was greatly improved: for inter-frame prediction, a macroblock can be divided in seven modes, namely 16×16, 16×8, 8×16, 8×8, 8×4, 4×8, and 4×4.
  • The coding mode is selected based on rate distortion.
  • As video coding came to be applied to high-definition images, image resolution kept growing and macroblock-based coding could no longer suit high-resolution video or image coding. Key Technology Areas (KTA), a follow-up research project to the H.264 algorithm, therefore adopted super macroblocks, that is, block partitions of 64×64, 64×32, ..., 4×4.
  • Later, proposals for High Efficiency Video Coding (HEVC, i.e. H.265) introduced quadtree block division, in which a super macroblock of configurable size is divided with a quadtree. FIG. 2 is a schematic diagram of the block division method used in HEVC video coding according to the related art. As shown in FIG. 2, blocks divided this way are only square; the divided blocks are then merged with surrounding blocks according to certain rules to form blocks such as 64×32, 32×64, ..., 8×4, 4×8 for inter-frame prediction and encoding.
  • The drawback of this encoding method is that, although fewer codewords are needed to transmit motion vectors, the encoder must transmit not only the quadtree structure but also which block each block is merged with, which takes up many codewords. No effective solution has yet been proposed for the problem in the related art that video image coding transmits many codewords, resulting in low coding efficiency.
  • The present invention provides a video image encoding method and apparatus to at least solve the problem that video image coding transmits many codewords, resulting in low coding efficiency. According to one aspect of the present invention, a video image encoding method is provided.
  • The video image encoding method includes: configuring the length and width of the largest block of a coding block Slice of a video image to be an integer power of 2, and the length and width of the smallest block of the coding block Slice to both be 2; and encoding the coding block Slice using the block coding order of a binary tree structure.
  • Preferably, encoding the coding block Slice using the block coding order of a binary tree structure includes: determining, according to a predetermined policy, whether the current coding block Slice is divided, and using a first parameter to identify division and non-division; determining, according to the predetermined policy, whether the current coding block is divided horizontally or vertically, and using a second parameter to identify horizontal division and vertical division; and encoding the first parameter and the second parameter of all coding block Slices of the video image in the block coding order of the binary tree structure, where that order is from left to right, from top to bottom.
  • Preferably, determining whether the current coding block Slice is divided according to the predetermined policy, and using the first parameter to identify division and non-division, includes: when the predetermined policy determines that the current coding block Slice is divided, setting the first parameter to 1 to identify division; when the predetermined policy determines that the current coding block Slice is not divided, setting the first parameter to 0 to identify non-division.
  • Preferably, determining whether the current coding block Slice is divided according to the predetermined policy, and using the second parameter to identify horizontal division and vertical division, includes: when the predetermined policy determines that the current coding block Slice is divided horizontally, setting the second parameter to 1 to identify horizontal division; when the predetermined policy determines that the current coding block Slice is divided vertically, setting the second parameter to 0 to identify vertical division.
  • Preferably, the method further includes: configuring, in the picture header of the video image, a third parameter for identifying the encoding configuration of all Slices in the video image, the encoding configuration being the same for all of them; and configuring, on one or more coding block Slices in the video image, a fourth parameter for identifying the encoding configuration on each of those Slices.
  • Preferably, the predetermined policy is the rate-distortion principle.
  • According to another aspect of the present invention, a video image encoding apparatus is provided. The apparatus includes: a configuration module, configured to set the length and width of the largest block of a coding block Slice of a video image to an integer power of 2, and the length and width of the smallest block of the coding block Slice both to 2; and an encoding module, configured to encode the coding block Slice using the block coding order of a binary tree structure.
  • Preferably, the encoding module includes: a first determining module, configured to determine, according to a predetermined policy, whether the current coding block Slice is divided; a first identification module, configured to use the first parameter to identify division and non-division; a second determining module, configured to determine, according to the predetermined policy, whether the current coding block is divided horizontally or vertically; a second identification module, configured to use the second parameter to identify horizontal division and vertical division; and an encoding submodule, configured to encode the first parameter and the second parameter of all coding block Slices of the video image in the block coding order of the binary tree structure.
  • Preferably, the first identification module is configured to set the first parameter to 1 to identify division when the predetermined policy determines that the current coding block Slice is divided, and to set the first parameter to 0 to identify non-division when the predetermined policy determines that the current coding block Slice is not divided.
  • Preferably, the second identifier is used to identify horizontal division with the second parameter set to 1 when the predetermined policy determines that the current coding block Slice is divided horizontally, and to identify vertical division with the second parameter set to 0 when the predetermined policy determines that the current coding block Slice is divided vertically.
  • Preferably, the apparatus further includes: a first configuration module, configured to configure, in the picture header of the video image, a third parameter for identifying the encoding configuration of all Slices in the video image, the encoding configuration being the same for all of them; and a second configuration module, configured to configure, on one or more coding block Slices in the video image, a fourth parameter for identifying the encoding configuration on each of those coding block Slices.
  • With the present invention, the length and width of the largest block of a video image Slice are configured as an integer power of 2 and the smallest block as 2, and the Slice is encoded using the block coding order of a binary tree structure. This addresses the problem that video image coding transmits many codewords, resulting in low coding efficiency; it reduces the transmitted codewords and thereby improves video encoder performance.
  • FIG. 1 is a schematic diagram of the position of the block division module in a video encoder according to the related art.
  • FIG. 2 is a schematic diagram of the block division method used in HEVC video coding according to the related art.
  • FIG. 3 is a flowchart of a video image encoding method according to an embodiment of the present invention.
  • FIG. 4 is a schematic diagram of a block division method according to a preferred embodiment of the present invention.
  • FIG. 5 is a schematic diagram of a binary tree structure and the corresponding output bits according to a preferred embodiment of the present invention.
  • FIG. 6 is a structural block diagram of a video image encoding apparatus according to an embodiment of the present invention.
  • FIG. 7 is a block diagram of a preferred structure of a video image encoding apparatus according to an embodiment of the present invention.
  • Detailed description of the embodiments: the present invention is described in detail below with reference to the accompanying drawings and in conjunction with the embodiments. It should be noted that the embodiments in the present application, and the features in the embodiments, may be combined with each other where they do not conflict.
  • This embodiment provides a video image encoding method.
  • FIG. 3 is a flowchart of a video image encoding method according to an embodiment of the present invention.
  • As shown in FIG. 3, the method includes the following process. Step S302: configure the length and width of the largest block of the video image coding block (Slice) to be an integer power of 2, and the length and width of the smallest block of the Slice to both be 2. Step S304: encode the coding block (Slice) using the block coding order of a binary tree structure.
  • Through these steps, the length and width of the largest and smallest blocks of the Slice are configured, and the Slice is then encoded using the block coding order of a binary tree structure. This overcomes the problem in the related art that video image coding transmits many codewords, resulting in low coding efficiency; it reduces the transmitted codewords and thereby improves video encoder performance.
  • A preferred embodiment of step S304 is as follows: determine, according to a predetermined policy, whether the current coding block Slice is divided, and use a first parameter to identify division and non-division; determine, according to the predetermined policy, whether the current coding block is divided horizontally or vertically, and use a second parameter to identify horizontal division and vertical division; encode the first parameter and the second parameter of all coding block Slices of the video image in the block coding order of the binary tree structure, where that order is from left to right, from top to bottom.
  • In this preferred embodiment, the predetermined policy decides whether the current coding block Slice is divided, and whether it is divided horizontally or vertically, and encoding follows the block coding order of the binary tree structure, which reduces the number of bits transmitted for Slice encoding.
  • Preferably, determining whether the current coding block Slice is divided according to the predetermined policy, and using the first parameter to identify division and non-division, includes: when the predetermined policy determines that the current coding block Slice is divided, setting the first parameter to 1 to identify division; when the predetermined policy determines that the current coding block Slice is not divided, setting the first parameter to 0 to identify non-division.
  • In this preferred embodiment, 1 bit indicates whether the current coding block is divided, which reduces the number of codewords transmitted for video image coding.
  • Preferably, determining whether the current coding block Slice is divided according to the predetermined policy, and using the second parameter to identify horizontal division and vertical division, includes: when the predetermined policy determines that the current video coding block Slice is divided horizontally, setting the second parameter to 1 to identify horizontal division; when the predetermined policy determines that the current video coding block Slice is divided vertically, setting the second parameter to 0 to identify vertical division. In this preferred embodiment, 1 bit indicates whether the current coding block is divided vertically or horizontally, which reduces the number of codewords transmitted for video image coding.
  • Preferably, the method further includes: configuring, in the picture header of the video image, a third parameter for identifying the encoding configuration of all Slices in the video image, the encoding configuration being the same for all of them; and configuring, on one or more coding block Slices in the video image, a fourth parameter for identifying the encoding configuration on each of those Slices.
  • In this preferred embodiment, parameter identifiers represent the encoding configuration of each Slice in the video image, which improves coding efficiency.
  • Preferably, the predetermined policy is the rate-distortion principle. In this preferred embodiment, an existing policy is used for the decision, which reduces development cost.
  • Embodiment 1: in this embodiment, a video image encoding method is provided.
  • The method includes the following steps. Step 1: at the start of encoding, initialize the largest block and the smallest divided block of a Slice. The length and width of the largest block must both be a power of 2, such as 128 or 64; division may continue down to a smallest block of 2 or 4. These parameters are encoded in the Slice header and transmitted to the decoder.
  • Step 2: according to the rate-distortion principle (other principles may also be used), determine whether the current block is divided. If it is divided, output bit 1; otherwise output bit 0. Then determine whether the current block is divided horizontally or vertically: output bit 1 for horizontal division and bit 0 for vertical division.
  • FIG. 4 is a schematic diagram of a block division method according to a preferred embodiment of the present invention.
  • Step 3: encode the motion vector of each block. So that the motion vector of the current block can be predicted from the motion vectors of already encoded blocks, the blocks of the binary tree structure are coded from left to right, from top to bottom, as shown in FIG. 5, which is a schematic diagram of a binary tree structure and the corresponding output bits according to a preferred embodiment of the present invention.
  • Step 4: encode the prediction residual and the auxiliary information using an existing standard.
  • This embodiment provides a video image encoding apparatus.
  • FIG. 6 is a structural block diagram of a video image encoding apparatus according to an embodiment of the present invention.
  • As shown in FIG. 6, the apparatus includes a configuration module 62 and an encoding module 64.
  • The configuration module 62 is configured to set the length and width of the largest block of the video image coding block Slice to an integer power of 2, and the length and width of the smallest block of the Slice both to 2.
  • The encoding module 64 is connected to the configuration module 62 and is configured to encode, using the block coding order of a binary tree structure, the video image coding block Slice configured by the configuration module 62.
  • FIG. 7 is a block diagram of a preferred structure of a video image encoding apparatus according to an embodiment of the present invention.
  • As shown in FIG. 7, the apparatus further includes a first configuration module 72 and a second configuration module 74.
  • The encoding module 64 includes a first determining module 642, a first identification module 644, a second determining module 646, a second identification module 648, and an encoding submodule 649. This structure is described in detail below.
  • The first determining module 642 is configured to determine, according to a predetermined policy, whether the current video coding block Slice is divided.
  • The first identification module 644 is connected to the first determining module 642 and is configured to use the first parameter to identify the division and non-division determined by the second determining module 646. The second determining module 646 is configured to determine, according to the predetermined policy, whether the current video coding block is divided horizontally or vertically. The second identification module 648 is connected to the second determining module 646 and is configured to use the second parameter to identify the horizontal division and vertical division determined by the second determining module 646.
  • The encoding submodule 649 is connected to the first identification module 644 and the second identification module 648 and is configured to encode the first parameter and the second parameter of all coding block Slices of the video image in the block coding order of the binary tree structure.
  • The first configuration module 72 is configured to configure, in the picture header of the video image, a third parameter for identifying the encoding configuration of all Slices in the video image, the encoding configuration being the same for all of them. The second configuration module 74 is configured to configure, on one or more coding block Slices in the video image, a fourth parameter for identifying the encoding configuration on each of those Slices.
  • Preferably, the first identification module 644 is configured to set the first parameter to 1 to identify division when the predetermined policy determines that the current video coding block Slice is divided, and to set the first parameter to 0 to identify non-division when the predetermined policy determines that the current video coding block Slice is not divided.
  • Preferably, the second identification module 648 is configured to set the second parameter to 1 to identify horizontal division when the predetermined policy determines that the current video coding block Slice is divided horizontally, and to set the second parameter to 0 to identify vertical division when the predetermined policy determines that the current video coding block Slice is divided vertically.
  • It should be noted that the video image encoding method provided in this embodiment adapts to video images of different resolutions. In particular, with the development of high-definition and ultra-high-definition video, the H.264/AVC division scheme, whose largest block is 16×16 and smallest block is 4×4, causes the transmitted block modes and motion vectors to occupy too many bits. The method also offers great flexibility (it is configurable), supports more block divisions than existing video standards, uses fewer bits for the coding mode, and can be applied to next-generation video coding standards.
  • Through the above embodiments, a video image encoding method and apparatus are provided.
  • By configuring the length and width of the largest and smallest blocks of the Slice and then encoding the coding block Slice using the block coding order of a binary tree structure, the block division method in video coding is improved, so that the coding mode adapts to image and video coding at different resolutions while keeping block division flexible. This overcomes the problem in the related art that video image coding transmits many codewords, resulting in low coding efficiency.
  • The codewords transmitted by video image coding are reduced, which improves the performance of the video encoder.
  • Obviously, those skilled in the art should understand that the modules or steps of the present invention can be implemented by a general-purpose computing device, and can be concentrated on a single computing device or distributed over a network of multiple computing devices. Optionally, they may be implemented as program code executable by a computing device, so that they can be stored in a storage device and executed by the computing device; in some cases, the steps shown or described may be performed in an order different from the one given here.
  • Alternatively, they may each be fabricated as an individual integrated circuit module, or several of the modules or steps may be fabricated as a single integrated circuit module. Thus, the invention is not limited to any specific combination of hardware and software.
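The division flags described in the steps above can be sketched as follows. This is a minimal illustration under stated assumptions, not the applicant's implementation: the nested-tuple tree, the name `encode_partition`, and the depth-first visiting order are hypothetical. The text itself only fixes the flag values (1 for divided, 0 for not divided; 1 for horizontal, 0 for vertical) and a left-to-right, top-to-bottom order.

```python
from typing import List, Optional

# Hypothetical tree representation (an assumption, not from the source):
#   None                  -> the block is not divided (leaf)
#   ("H", first, second)  -> horizontal division into two sub-blocks
#   ("V", first, second)  -> vertical division into two sub-blocks
Tree = Optional[tuple]


def encode_partition(tree: Tree) -> List[int]:
    """Return the flag bits for one largest block, visiting sub-blocks depth-first."""
    if tree is None:
        return [0]  # first parameter = 0: not divided
    direction, first, second = tree
    if direction not in ("H", "V"):
        raise ValueError("direction must be 'H' or 'V'")
    bits = [1]  # first parameter = 1: divided
    bits.append(1 if direction == "H" else 0)  # second parameter: 1 horizontal, 0 vertical
    bits += encode_partition(first)   # first sub-block (assumed left or top)
    bits += encode_partition(second)  # second sub-block (assumed right or bottom)
    return bits


# A block divided horizontally whose second half is then divided vertically.
example = ("H", None, ("V", None, None))
bits = encode_partition(example)
print(bits)  # [1, 1, 0, 1, 0, 0, 0]
```

In this sketch every divided block costs two bits and every undivided block costs one, so the example above (two divisions, three leaves) takes seven bits. Whether a real encoder would omit the division flag for blocks already at the smallest size is not stated in the text and is not modelled here.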

Landscapes

  • Engineering & Computer Science (AREA)
  • Multimedia (AREA)
  • Signal Processing (AREA)
  • Compression Or Coding Systems Of Tv Signals (AREA)

Abstract

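The present invention discloses a video image encoding method and apparatus. The method includes: configuring the length and width of the largest block of a coding block Slice of a video image to be an integer power of 2, and the length and width of the smallest block of the coding block Slice to both be 2; and encoding the coding block Slice using the block coding order of a binary tree structure. The invention reduces the codewords transmitted by video image encoding and thereby improves video encoder performance.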

Description

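Video image encoding method and apparatus

Technical Field

The present invention relates to the field of communications, and in particular to a video image encoding method and apparatus.

Background

In video coding standards before H.264, video coding of each frame proceeds as follows: the image is first divided into macroblocks (16×16); then, according to predetermined decision criteria, the image is divided into 8×8 or 16×16 blocks for inter-frame or intra-frame prediction; finally, an 8×8 block discrete cosine transform (DCT) is applied to the residual, and the DCT coefficients are encoded. FIG. 1 is a schematic diagram of the position of the block division module in a video encoder according to the related art; that position is shown in FIG. 1. In H.264, block division was greatly improved: for inter-frame prediction, a macroblock can be divided in seven modes, namely 16×16, 16×8, 8×16, 8×8, 8×4, 4×8, and 4×4, and the coding mode is selected based on rate distortion. However, as video coding came to be applied to high-definition images, image resolution kept growing and macroblock-based coding could no longer suit high-resolution video or image coding. Key Technology Areas (KTA), a follow-up research project to the H.264 algorithm, therefore adopted super macroblocks, that is, block partitions of 64×64, 64×32, ..., 4×4. Later, in High Efficiency Video Coding (HEVC, i.e. H.265), proposals introduced quadtree block division, in which a super macroblock (of configurable size) is divided with a quadtree. FIG. 2 is a schematic diagram of the block division method used in HEVC video coding according to the related art. As shown in FIG. 2, blocks divided this way are only square; the divided blocks are then merged with surrounding blocks according to certain rules to form blocks such as 64×32, 32×64, ..., 8×4, 4×8 for inter-frame prediction and encoding. The drawback of this encoding method is that, although fewer codewords are needed to transmit motion vectors, the encoder must transmit not only the quadtree structure but also which block each block is merged with, which takes up many codewords.

No effective solution has yet been proposed for the problem in the related art that video image coding transmits many codewords, resulting in low coding efficiency.

Summary of the Invention

To address the problem in the related art that video image coding transmits many codewords, resulting in low coding efficiency, the present invention provides a video image encoding method and apparatus that at least solve this problem.

According to one aspect of the present invention, a video image encoding method is provided.

The video image encoding method according to the present invention includes: configuring the length and width of the largest block of a coding block Slice of a video image to be an integer power of 2, and the length and width of the smallest block of the coding block Slice to both be 2; and encoding the coding block Slice using the block coding order of a binary tree structure.

Preferably, encoding the coding block Slice using the block coding order of a binary tree structure includes: determining, according to a predetermined policy, whether the current coding block Slice is divided, and using a first parameter to identify division and non-division; determining, according to the predetermined policy, whether the current coding block is divided horizontally or vertically, and using a second parameter to identify horizontal division and vertical division; and encoding the first parameter and the second parameter of all coding block Slices of the video image in the block coding order of the binary tree structure, where that order is from left to right, from top to bottom.

Preferably, determining whether the current coding block Slice is divided according to the predetermined policy, and using the first parameter to identify division and non-division, includes: when the predetermined policy determines that the current coding block Slice is divided, setting the first parameter to 1 to identify division; when the predetermined policy determines that the current coding block Slice is not divided, setting the first parameter to 0 to identify non-division.

Preferably, determining whether the current coding block Slice is divided according to the predetermined policy, and using the second parameter to identify horizontal division and vertical division, includes: when the predetermined policy determines that the current coding block Slice is divided horizontally, setting the second parameter to 1 to identify horizontal division; when the predetermined policy determines that the current coding block Slice is divided vertically, setting the second parameter to 0 to identify vertical division.

Preferably, the method further includes: configuring, in the picture header of the video image, a third parameter for identifying the encoding configuration of all Slices in the video image, the encoding configuration being the same for all of them; and configuring, on one or more coding block Slices in the video image, a fourth parameter for identifying the encoding configuration on each of those Slices.

Preferably, the predetermined policy is the rate-distortion principle.

According to another aspect of the present invention, a video image encoding apparatus is provided.

The video image encoding apparatus according to the present invention includes: a configuration module, configured to set the length and width of the largest block of a coding block Slice of a video image to an integer power of 2, and the length and width of the smallest block of the coding block Slice both to 2; and an encoding module, configured to encode the coding block Slice using the block coding order of a binary tree structure.

Preferably, the encoding module includes: a first determining module, configured to determine, according to a predetermined policy, whether the current coding block Slice is divided; a first identification module, configured to use the first parameter to identify division and non-division; a second determining module, configured to determine, according to the predetermined policy, whether the current coding block is divided horizontally or vertically; a second identification module, configured to use the second parameter to identify horizontal division and vertical division; and an encoding submodule, configured to encode the first parameter and the second parameter of all coding block Slices of the video image in the block coding order of the binary tree structure.

Preferably, the first identification module is configured to set the first parameter to 1 to identify division when the predetermined policy determines that the current coding block Slice is divided, and to set the first parameter to 0 to identify non-division when the predetermined policy determines that the current coding block Slice is not divided.

Preferably, the second identifier is used to identify horizontal division with the second parameter set to 1 when the predetermined policy determines that the current coding block Slice is divided horizontally, and to identify vertical division with the second parameter set to 0 when the predetermined policy determines that the current coding block Slice is divided vertically.

Preferably, the apparatus further includes: a first configuration module, configured to configure, in the picture header of the video image, a third parameter for identifying the encoding configuration of all Slices in the video image, the encoding configuration being the same for all of them; and a second configuration module, configured to configure, on one or more coding block Slices in the video image, a fourth parameter for identifying the encoding configuration on each of those coding block Slices.

With the present invention, the length and width of the largest block of a video image Slice are configured as an integer power of 2 and the smallest block as 2, and the Slice is encoded using the block coding order of a binary tree structure. This solves the problem that video image coding transmits many codewords, resulting in low coding efficiency; it reduces the transmitted codewords and thereby improves video encoder performance.

Brief Description of the Drawings

The drawings described here provide a further understanding of the present invention and form part of this application. The illustrative embodiments of the invention and their description explain the invention and do not unduly limit it. In the drawings:

FIG. 1 is a schematic diagram of the position of the block division module in a video encoder according to the related art;

FIG. 2 is a schematic diagram of the block division method used in HEVC video coding according to the related art;

FIG. 3 is a flowchart of a video image encoding method according to an embodiment of the present invention;

FIG. 4 is a schematic diagram of a block division method according to a preferred embodiment of the present invention;

FIG. 5 is a schematic diagram of a binary tree structure and the corresponding output bits according to a preferred embodiment of the present invention;

FIG. 6 is a structural block diagram of a video image encoding apparatus according to an embodiment of the present invention; and

FIG. 7 is a block diagram of a preferred structure of a video image encoding apparatus according to an embodiment of the present invention.

Detailed Description

The present invention is described in detail below with reference to the accompanying drawings and in conjunction with the embodiments. It should be noted that the embodiments in the present application, and the features in the embodiments, may be combined with each other where they do not conflict.

This embodiment provides a video image encoding method. FIG. 3 is a flowchart of a video image encoding method according to an embodiment of the present invention. As shown in FIG. 3, the method includes the following process:

Step S302: configure the length and width of the largest block of the video image coding block (Slice) to be an integer power of 2, and the length and width of the smallest block of the video image coding block Slice to both be 2.

Step S304: encode the coding block (Slice) using the block coding order of a binary tree structure.

Through these steps, the length and width of the largest and smallest blocks of the Slice are configured, and the coding block Slice is then encoded using the block coding order of a binary tree structure. This overcomes the problem in the related art that video image coding transmits many codewords, resulting in low coding efficiency; it reduces the transmitted codewords and thereby improves video encoder performance.

A preferred embodiment of step S304 is described below. Determine, according to a predetermined policy, whether the current coding block Slice is divided, and use a first parameter to identify division and non-division; determine, according to the predetermined policy, whether the current coding block is divided horizontally or vertically, and use a second parameter to identify horizontal division and vertical division; encode the first parameter and the second parameter of all coding block Slices of the video image in the block coding order of the binary tree structure, where that order is from left to right, from top to bottom. In this preferred embodiment, the predetermined policy decides whether the current coding block Slice is divided, and whether it is divided horizontally or vertically, and encoding follows the block coding order of the binary tree structure, which reduces the number of bits transmitted for Slice encoding.

Preferably, determining whether the current coding block Slice is divided according to the predetermined policy, and using the first parameter to identify division and non-division, includes: when the predetermined policy determines that the current coding block Slice is divided, setting the first parameter to 1 to identify division; when the predetermined policy determines that the current coding block Slice is not divided, setting the first parameter to 0 to identify non-division. In this preferred embodiment, 1 bit indicates whether the current coding block is divided, which reduces the number of codewords transmitted for video image coding.

Preferably, determining whether the current coding block Slice is divided according to the predetermined policy, and using the second parameter to identify horizontal division and vertical division, includes: when the predetermined policy determines that the current video coding block Slice is divided horizontally, setting the second parameter to 1 to identify horizontal division; when the predetermined policy determines that the current video coding block Slice is divided vertically, setting the second parameter to 0 to identify vertical division. In this preferred embodiment, 1 bit indicates whether the current coding block is divided vertically or horizontally, which reduces the number of codewords transmitted for video image coding.

Preferably, the method further includes: configuring, in the picture header of the video image, a third parameter for identifying the encoding configuration of all Slices in the video image, the encoding configuration being the same for all of them; and configuring, on one or more coding block Slices in the video image, a fourth parameter for identifying the encoding configuration on each of those Slices. In this preferred embodiment, parameter identifiers represent the encoding configuration of each Slice in the video image, which improves coding efficiency.

Preferably, the predetermined policy is the rate-distortion principle. In this preferred embodiment, an existing policy is used for the decision, which reduces development cost.

Embodiment 1

In this embodiment, a video image encoding method is provided. The method includes the following steps:

Step 1: at the start of encoding, initialize the largest block and the smallest divided block of a Slice. The length and width of the largest block must both be a power of 2, such as 128 or 64; division may continue down to a smallest block of 2 or 4. These parameters are encoded in the Slice header and transmitted to the decoder.

Step 2: according to the rate-distortion principle (other principles may also be used), determine whether the current block is divided. FIG. 4 is a schematic diagram of a block division method according to a preferred embodiment of the present invention. As shown in FIG. 4, if the block is divided, output bit 1; otherwise output bit 0. Then determine whether the current block is divided horizontally or vertically: if horizontally, output bit 1; if vertically, output bit 0.

Step 3: encode the motion vector of each block. So that the motion vector of the current block can be predicted from the motion vectors of already encoded blocks, the blocks of the binary tree structure are coded from left to right, from top to bottom, as shown in FIG. 5, which is a schematic diagram of a binary tree structure and the corresponding output bits according to a preferred embodiment of the present invention.

Step 4: encode the prediction residual and the auxiliary information using an existing standard.

This embodiment provides a video image encoding apparatus. FIG. 6 is a structural block diagram of a video image encoding apparatus according to an embodiment of the present invention. As shown in FIG. 6, the apparatus includes a configuration module 62 and an encoding module 64, which are described in detail below.

The configuration module 62 is configured to set the length and width of the largest block of the video image coding block Slice to an integer power of 2, and the length and width of the smallest block of the video image coding block Slice both to 2. The encoding module 64 is connected to the configuration module 62 and is configured to encode, using the block coding order of a binary tree structure, the video image coding block Slice configured by the configuration module 62.

FIG. 7 is a block diagram of a preferred structure of a video image encoding apparatus according to an embodiment of the present invention. As shown in FIG. 7, the apparatus further includes a first configuration module 72 and a second configuration module 74; the encoding module 64 includes a first determining module 642, a first identification module 644, a second determining module 646, a second identification module 648, and an encoding submodule 649. This structure is described in detail below.

The encoding module 64 includes: the first determining module 642, configured to determine, according to a predetermined policy, whether the current video coding block Slice is divided; the first identification module 644, connected to the first determining module 642 and configured to use the first parameter to identify the division and non-division determined by the second determining module 646; the second determining module 646, configured to determine, according to the predetermined policy, whether the current video coding block is divided horizontally or vertically; the second identification module 648, connected to the second determining module 646 and configured to use the second parameter to identify the horizontal division and vertical division determined by the second determining module 646; and the encoding submodule 649, connected to the first identification module 644 and the second identification module 648 and configured to encode the first parameter and the second parameter of all coding block Slices of the video image in the block coding order of the binary tree structure.

The apparatus further includes: the first configuration module 72, configured to configure, in the picture header of the video image, a third parameter for identifying the encoding configuration of all Slices in the video image, the encoding configuration being the same for all of them; and the second configuration module 74, configured to configure, on one or more coding block Slices in the video image, a fourth parameter for identifying the encoding configuration on each of those Slices.

Preferably, the first identification module 644 is configured to set the first parameter to 1 to identify division when the predetermined policy determines that the current video coding block Slice is divided; preferably, the first identification module 644 is configured to set the first parameter to 0 to identify non-division when the predetermined policy determines that the current video coding block Slice is not divided.

Preferably, the second identification module 648 is configured to set the second parameter to 1 to identify horizontal division when the predetermined policy determines that the current video coding block Slice is divided horizontally; the second identification module 648 is configured to set the second parameter to 0 to identify vertical division when the predetermined policy determines that the current video coding block Slice is divided vertically.

It should be noted that the video image encoding method provided in this embodiment adapts to video images of different resolutions. In particular, with the development of high-definition and ultra-high-definition video, the H.264/AVC division scheme, whose largest block is 16×16 and smallest block is 4×4, causes the transmitted block modes and motion vectors to occupy too many bits. The method also offers great flexibility (it is configurable), supports more block divisions than existing video standards, uses fewer bits for the coding mode, and can be applied to next-generation video coding standards.

Through the above embodiments, a video image encoding method and apparatus are provided. By configuring the length and width of the largest and smallest blocks of the Slice and then encoding the coding block Slice using the block coding order of a binary tree structure, the block division method in video coding is improved, so that the coding mode adapts to image and video coding at different resolutions while keeping block division flexible. This overcomes the problem in the related art that video image coding transmits many codewords, resulting in low coding efficiency; it reduces the transmitted codewords and thereby improves video encoder performance.

Obviously, those skilled in the art should understand that the modules or steps of the present invention described above can be implemented by a general-purpose computing device, and can be concentrated on a single computing device or distributed over a network of multiple computing devices. Optionally, they may be implemented as program code executable by a computing device, so that they can be stored in a storage device and executed by the computing device; in some cases, the steps shown or described may be performed in an order different from the one given here. Alternatively, they may each be fabricated as an individual integrated circuit module, or several of the modules or steps may be fabricated as a single integrated circuit module. Thus, the invention is not limited to any specific combination of hardware and software.

The above are only preferred embodiments of the present invention and are not intended to limit it; for those skilled in the art, the invention may be modified and varied in many ways. Any modification, equivalent replacement, or improvement made within the spirit and principles of the present invention shall fall within its scope of protection.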
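On the decoding side, the same flags could be read back to recover the block layout. The sketch below is illustrative only and relies on assumptions the description does not state: that a horizontal division yields equal top and bottom halves and a vertical division equal left and right halves, that the largest block is square, and that flags are visited depth-first. The names `decode_partition`, `max_size`, and `min_size` are hypothetical.

```python
from typing import Iterator, List, Tuple

Rect = Tuple[int, int, int, int]  # (x, y, width, height)


def is_power_of_two(n: int) -> bool:
    """True when n is a positive integer power of 2 (including 1)."""
    return n > 0 and (n & (n - 1)) == 0


def decode_partition(bits: List[int], max_size: int, min_size: int = 2) -> List[Rect]:
    """Rebuild the leaf blocks of one largest block from its division flags."""
    if not is_power_of_two(max_size):
        raise ValueError("largest block side must be an integer power of 2")
    stream: Iterator[int] = iter(bits)
    leaves: List[Rect] = []

    def parse(x: int, y: int, w: int, h: int) -> None:
        if next(stream) == 0:  # first parameter = 0: not divided
            leaves.append((x, y, w, h))
            return
        if next(stream) == 1:  # second parameter = 1: horizontal (assumed top/bottom)
            if h // 2 < min_size:
                raise ValueError("division below the smallest block size")
            parse(x, y, w, h // 2)
            parse(x, y + h // 2, w, h // 2)
        else:  # second parameter = 0: vertical (assumed left/right)
            if w // 2 < min_size:
                raise ValueError("division below the smallest block size")
            parse(x, y, w // 2, h)
            parse(x + w // 2, y, w // 2, h)

    parse(0, 0, max_size, max_size)
    return leaves


layout = decode_partition([1, 1, 0, 1, 0, 0, 0], max_size=64)
print(layout)  # [(0, 0, 64, 32), (0, 32, 32, 32), (32, 32, 32, 32)]
```

Because each division halves one side of a power-of-2 block, every sub-block side stays a power of 2, which is one reason the size constraint in Step S302 pairs naturally with a binary tree.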

Claims

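1. A video image encoding method, comprising:
configuring the length and width of the largest block of a coding block Slice of a video image to be an integer power of 2, and the length and width of the smallest block of the coding block Slice of the video image to both be 2;
encoding the coding block Slice using the block coding order of a binary tree structure.
2. The method according to claim 1, wherein encoding the coding block Slice using the block coding order of a binary tree structure comprises:
determining, according to a predetermined policy, whether the current coding block Slice is divided, and using a first parameter to identify division and non-division;
determining, according to the predetermined policy, whether the current coding block is divided horizontally or vertically, and using a second parameter to identify horizontal division and vertical division;
encoding the first parameter and the second parameter of all the coding block Slices of the video image in the block coding order of the binary tree structure, wherein the block coding order of the binary tree structure is from left to right, from top to bottom.
3. The method according to claim 2, wherein determining, according to the predetermined policy, whether the current coding block Slice is divided, and using the first parameter to identify division and non-division comprises:
when the predetermined policy determines that the current coding block Slice is divided, setting the first parameter to 1 to identify division;
when the predetermined policy determines that the current coding block Slice is not divided, setting the first parameter to 0 to identify non-division.
4. The method according to claim 2, wherein determining, according to the predetermined policy, whether the current coding block Slice is divided, and using the second parameter to identify horizontal division and vertical division comprises:
when the predetermined policy determines that the current coding block Slice is divided horizontally, setting the second parameter to 1 to identify horizontal division;
when the predetermined policy determines that the current coding block Slice is divided vertically, setting the second parameter to 0 to identify vertical division.
5. The method according to claim 1, further comprising:
configuring, in the picture header of the video image, a third parameter for identifying the encoding configuration of all Slices in the video image, the encoding configuration being the same;
configuring, on one or more coding block Slices in the video image, a fourth parameter for identifying the encoding configuration on each of the one or more Slices.
6. The method according to any one of claims 1 to 5, wherein the predetermined policy is the rate-distortion principle.
7. A video image encoding apparatus, comprising:
a configuration module, configured to set the length and width of the largest block of a coding block Slice of a video image to an integer power of 2, and the length and width of the smallest block of the coding block Slice of the video image both to 2;
an encoding module, configured to encode the coding block Slice of the video image using the block coding order of a binary tree structure.
8. The apparatus according to claim 7, wherein the encoding module comprises:
a first determining module, configured to determine, according to a predetermined policy, whether the current coding block Slice is divided;
a first identification module, configured to use a first parameter to identify division and non-division;
a second determining module, configured to determine, according to the predetermined policy, whether the current coding block is divided horizontally or vertically;
a second identification module, configured to use a second parameter to identify the horizontal division and the vertical division;
an encoding submodule, configured to encode the first parameter and the second parameter of all the coding block Slices of the video image in the block coding order of the binary tree structure.
9. The apparatus according to claim 8, wherein
the first identification module is configured to set the first parameter to 1 to identify division when the predetermined policy determines that the current coding block Slice is divided;
the first identification module is configured to set the first parameter to 0 to identify non-division when the predetermined policy determines that the current coding block Slice is not divided.
10. The apparatus according to claim 8, wherein
the second identifier is used to identify horizontal division with the second parameter set to 1 when the predetermined policy determines that the current coding block Slice is divided horizontally;
the second identifier is used to identify vertical division with the second parameter set to 0 when the predetermined policy determines that the current coding block Slice is divided vertically.
11. The apparatus according to claim 7, further comprising:
a first configuration module, configured to configure, in the picture header of the video image, a third parameter for identifying the encoding configuration of all Slices in the video image, the encoding configuration being the same;
a second configuration module, configured to configure, on one or more coding block Slices in the video image, a fourth parameter for identifying the encoding configuration on each of the one or more Slices.
PCT/CN2011/082287 2010-11-17 2011-11-16 Video image encoding method and apparatus WO2012065552A1 (zh)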

Applications Claiming Priority (2)

Application Number Priority Date Filing Date Title
CN201010548628.7 2010-11-17
CN2010105486287A CN101990104A (zh) 2010-11-17 2010-11-17 Video image encoding method and apparatus

Publications (1)

Publication Number Publication Date
WO2012065552A1 (zh) 2012-05-24

Family

ID=43746405

Family Applications (1)

Application Number Title Priority Date Filing Date
PCT/CN2011/082287 WO2012065552A1 (zh) 2010-11-17 2011-11-16 Video image encoding method and apparatus

Country Status (2)

Country Link
CN (1) CN101990104A (zh)
WO (1) WO2012065552A1 (zh)

Cited By (2)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
EP3213514A4 (en) * 2014-12-10 2018-06-13 MediaTek Singapore Pte Ltd. Method of video coding using binary tree block partitioning
US10382795B2 (en) 2014-12-10 2019-08-13 Mediatek Singapore Pte. Ltd. Method of video coding using binary tree block partitioning

Families Citing this family (6)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
US9332283B2 (en) * 2011-09-27 2016-05-03 Broadcom Corporation Signaling of prediction size unit in accordance with video coding
CN112954368B (zh) * 2015-03-13 2024-05-24 Lg电子株式会社 Method for processing a video signal and device therefor
WO2017114450A1 (en) * 2015-12-31 2017-07-06 Mediatek Inc. Method and apparatus of prediction binary tree structure for video and image coding
EP3306938A1 (en) 2016-10-05 2018-04-11 Thomson Licensing Method and apparatus for binary-tree split mode coding
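JP6565885B2 (ja) * 2016-12-06 2019-08-28 株式会社Jvcケンウッド Image encoding device, image encoding method, and image encoding program, and image decoding device, image decoding method, and image decoding program
WO2019017673A1 (ko) * 2017-07-19 2019-01-24 삼성전자 주식회사 Encoding method and apparatus therefor, and decoding method and apparatus therefor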

Citations (7)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
US5134478A (en) * 1991-02-19 1992-07-28 Intel Corporation Method and apparatus for compressing and decompressing a digital video signal using predicted and error images
WO1994006099A1 (en) * 1992-09-01 1994-03-17 Apple Computer, Inc. Improved vector quantization
US20020130796A1 (en) * 2001-02-23 2002-09-19 Toshiaki Tsuchido Huffman decoding method and decoder, huffman decoding table, method of preparing the table, and storage media
US20060133680A1 (en) * 2004-12-22 2006-06-22 Frank Bossen Method and apparatus for coding positions of coefficients
EP2065812A1 (en) * 2007-11-30 2009-06-03 Palo Alto Research Center Incorporated Bounded sub-optimal problem solving
CN101507280A (zh) * 2006-08-25 2009-08-12 Thomson Licensing Methods and apparatus for reduced resolution partitioning
WO2010056013A2 (en) * 2008-11-11 2010-05-20 Samsung Electronics Co., Ltd. Moving picture encoding/decoding apparatus and method for processing of moving picture divided in units of slices

Family Cites Families (1)

Publication number Priority date Publication date Assignee Title
US4785349A (en) * 1987-10-05 1988-11-15 Technology Inc. 64 Digital video decompression system

Patent Citations (7)

Publication number Priority date Publication date Assignee Title
US5134478A (en) * 1991-02-19 1992-07-28 Intel Corporation Method and apparatus for compressing and decompressing a digital video signal using predicted and error images
WO1994006099A1 (en) * 1992-09-01 1994-03-17 Apple Computer, Inc. Improved vector quantization
US20020130796A1 (en) * 2001-02-23 2002-09-19 Toshiaki Tsuchido Huffman decoding method and decoder, huffman decoding table, method of preparing the table, and storage media
US20060133680A1 (en) * 2004-12-22 2006-06-22 Frank Bossen Method and apparatus for coding positions of coefficients
CN101507280A (zh) * 2006-08-25 2009-08-12 Thomson Licensing Methods and apparatus for reduced resolution partitioning
EP2065812A1 (en) * 2007-11-30 2009-06-03 Palo Alto Research Center Incorporated Bounded sub-optimal problem solving
WO2010056013A2 (en) * 2008-11-11 2010-05-20 Samsung Electronics Co., Ltd. Moving picture encoding/decoding apparatus and method for processing of moving picture divided in units of slices

Cited By (4)

Publication number Priority date Publication date Assignee Title
EP3213514A4 (en) * 2014-12-10 2018-06-13 MediaTek Singapore Pte Ltd. Method of video coding using binary tree block partitioning
US10375393B2 (en) 2014-12-10 2019-08-06 Mediatek Singapore Pte. Ltd. Method of video coding using binary tree block partitioning
US10382795B2 (en) 2014-12-10 2019-08-13 Mediatek Singapore Pte. Ltd. Method of video coding using binary tree block partitioning
US10506231B2 (en) 2014-12-10 2019-12-10 Mediatek Singapore Pte. Ltd Method of video coding using binary tree block partitioning

Also Published As

Publication number Publication date
CN101990104A (zh) 2011-03-23

Similar Documents

Publication Publication Date Title
WO2012065552A1 (zh) Video image encoding method and device
RU2565365C2 (ru) Coding of transform coefficients for video coding
ES2546514T3 (es) Image encoding apparatus
ES2644002T3 (es) Method and apparatus for encoding and decoding images using large transform units
US9202290B2 (en) Apparatus and method for encoding/decoding images for intra-prediction
ES2767966T3 (es) Intra-prediction coding under planar representations
US20100195715A1 (en) Method and apparatus for adaptive frame prediction
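CN103891280B (zh) Method and system for determining a context model for video data
WO2011096662A2 (ko) Image encoding/decoding method for rate-distortion optimization and apparatus for performing the same
WO2013029560A1 (zh) Method and module for obtaining transform block size
CN102835107A (zh) Super-block for high-performance video coding
CN110115034A (zh) Intra-prediction apparatus for extending a set of predetermined directional intra-prediction modes
CN104995919A (zh) Devices and methods for context reduction in last significant coefficient position coding
WO2013029559A1 (zh) Method and module for acquiring transform block position information
CN103067700B (zh) Encoding device and encoding method
WO2012167539A1 (zh) Method and device for processing intra-prediction modes
WO2012167670A1 (zh) Method and device for determining transform block size
JP6390875B2 (ja) Image encoding/decoding method, device, and system
CN104954787B (zh) HEVC inter-prediction mode selection method and device
CN110958451A (zh) Video encoding method and electronic device
CN101854554A (zh) Video encoding and decoding system based on image inpainting prediction
WO2010083743A1 (zh) Encoding and decoding method and device based on double prediction
KR101739580B1 (ko) Adaptive scanning device and scanning method thereof
CN105791868B (zh) Method and device for video encoding
CN102065299B (zh) Adaptive scan reordering method and system for video coding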

Legal Events

Date Code Title Description
121 EP: The EPO has been informed by WIPO that EP was designated in this application

Ref document number: 11841637

Country of ref document: EP

Kind code of ref document: A1

NENP Non-entry into the national phase

Ref country code: DE

122 EP: PCT application non-entry in European phase

Ref document number: 11841637

Country of ref document: EP

Kind code of ref document: A1