WO2022227622A1 - 一种权值可配置的帧间帧内联合预测编解码的方法及装置 - Google Patents
一种权值可配置的帧间帧内联合预测编解码的方法及装置 Download PDFInfo
- Publication number
- WO2022227622A1 WO2022227622A1 PCT/CN2021/139107 CN2021139107W WO2022227622A1 WO 2022227622 A1 WO2022227622 A1 WO 2022227622A1 CN 2021139107 W CN2021139107 W CN 2021139107W WO 2022227622 A1 WO2022227622 A1 WO 2022227622A1
- Authority
- WO
- WIPO (PCT)
- Prior art keywords
- frame
- intra
- coding
- coding order
- inter
- Prior art date
Links
- 238000000034 method Methods 0.000 title claims abstract description 45
- 238000010276 construction Methods 0.000 claims description 3
- 238000005516 engineering process Methods 0.000 abstract description 16
- 230000006835 compression Effects 0.000 abstract description 8
- 238000007906 compression Methods 0.000 abstract description 8
- 238000013213 extrapolation Methods 0.000 abstract description 3
- 108091026890 Coding region Proteins 0.000 abstract 5
- 230000000903 blocking effect Effects 0.000 abstract 1
- 239000013598 vector Substances 0.000 description 10
- 230000006870 function Effects 0.000 description 5
- 238000012545 processing Methods 0.000 description 5
- 230000002123 temporal effect Effects 0.000 description 5
- 238000004891 communication Methods 0.000 description 4
- 238000004590 computer program Methods 0.000 description 4
- 238000005457 optimization Methods 0.000 description 4
- 238000004364 calculation method Methods 0.000 description 3
- 238000003491 array Methods 0.000 description 2
- 238000004422 calculation algorithm Methods 0.000 description 2
- 238000010586 diagram Methods 0.000 description 2
- 238000006073 displacement reaction Methods 0.000 description 2
- 241000023320 Luma <angiosperm> Species 0.000 description 1
- 241001025261 Neoraja caerulea Species 0.000 description 1
- 230000009286 beneficial effect Effects 0.000 description 1
- 230000005540 biological transmission Effects 0.000 description 1
- 238000013461 design Methods 0.000 description 1
- 235000019800 disodium phosphate Nutrition 0.000 description 1
- 238000001914 filtration Methods 0.000 description 1
- 230000005055 memory storage Effects 0.000 description 1
- OSWPMRLSEDHDFF-UHFFFAOYSA-N methyl salicylate Chemical compound COC(=O)C1=CC=CC=C1O OSWPMRLSEDHDFF-UHFFFAOYSA-N 0.000 description 1
- 238000010295 mobile communication Methods 0.000 description 1
- 230000003287 optical effect Effects 0.000 description 1
- 230000000750 progressive effect Effects 0.000 description 1
- 238000013139 quantization Methods 0.000 description 1
- 239000007787 solid Substances 0.000 description 1
- 230000009466 transformation Effects 0.000 description 1
- 238000013519 translation Methods 0.000 description 1
Images
Classifications
-
- H—ELECTRICITY
- H04—ELECTRIC COMMUNICATION TECHNIQUE
- H04N—PICTORIAL COMMUNICATION, e.g. TELEVISION
- H04N19/00—Methods or arrangements for coding, decoding, compressing or decompressing digital video signals
- H04N19/50—Methods or arrangements for coding, decoding, compressing or decompressing digital video signals using predictive coding
- H04N19/503—Methods or arrangements for coding, decoding, compressing or decompressing digital video signals using predictive coding involving temporal prediction
-
- H—ELECTRICITY
- H04—ELECTRIC COMMUNICATION TECHNIQUE
- H04N—PICTORIAL COMMUNICATION, e.g. TELEVISION
- H04N19/00—Methods or arrangements for coding, decoding, compressing or decompressing digital video signals
- H04N19/10—Methods or arrangements for coding, decoding, compressing or decompressing digital video signals using adaptive coding
- H04N19/102—Methods or arrangements for coding, decoding, compressing or decompressing digital video signals using adaptive coding characterised by the element, parameter or selection affected or controlled by the adaptive coding
- H04N19/129—Scanning of coding units, e.g. zig-zag scan of transform coefficients or flexible macroblock ordering [FMO]
-
- H—ELECTRICITY
- H04—ELECTRIC COMMUNICATION TECHNIQUE
- H04N—PICTORIAL COMMUNICATION, e.g. TELEVISION
- H04N19/00—Methods or arrangements for coding, decoding, compressing or decompressing digital video signals
- H04N19/50—Methods or arrangements for coding, decoding, compressing or decompressing digital video signals using predictive coding
- H04N19/593—Methods or arrangements for coding, decoding, compressing or decompressing digital video signals using predictive coding involving spatial prediction techniques
-
- H—ELECTRICITY
- H04—ELECTRIC COMMUNICATION TECHNIQUE
- H04N—PICTORIAL COMMUNICATION, e.g. TELEVISION
- H04N19/00—Methods or arrangements for coding, decoding, compressing or decompressing digital video signals
- H04N19/90—Methods or arrangements for coding, decoding, compressing or decompressing digital video signals using coding techniques not provided for in groups H04N19/10-H04N19/85, e.g. fractals
- H04N19/96—Tree coding, e.g. quad-tree coding
Definitions
- the present invention relates to the technical field of multimedia processing, and in particular, to a method and device for inter-frame and intra-frame joint prediction coding and decoding with configurable weights.
- Intra-frame prediction and inter-frame prediction technology are commonly used in most of the current video coding and decoding technologies. Among them, inter-frame prediction uses the correlation between the current frame and its reference frame to reduce temporal redundancy, while intra-frame prediction uses the similarity between spatially adjacent pixels to eliminate spatial redundancy.
- a coding block can only choose one of intra-frame prediction or inter-frame prediction.
- the H266/VCC standard has improved this, and introduced CIIP, the intra-frame and inter-frame joint technology, to make the same coding block.
- the intra prediction value of the current prediction block is first calculated, that is, the pixel value of the current block is predicted by the traditional intra prediction mode.
- the inter-frame prediction method is used to predict the inter-frame prediction value of the current block, and finally the intra-frame and inter-frame prediction values are weighted and summed to obtain the final prediction value of the current prediction block.
- CIIP technology is implemented in both the luma block and the chroma block.
- weights in the above scheme is very limited, and is limited by whether the adjacent blocks above or on the left are available, and the temporal correlation and spatial correlation are not fully reduced at the same time, which affects the coding efficiency.
- the purpose of the present invention is to provide a method and device for inter-frame and intra-frame joint prediction coding and decoding with configurable weights, which are used to solve the problem that the choice of weights is very limited, and is limited by whether the upper or left adjacent blocks are available, The temporal and spatial correlations are not simultaneously reduced most adequately, which affects the technical problem of coding efficiency.
- a method for inter-frame and intra-frame joint predictive coding with configurable weights comprising:
- the weight data for the current prediction unit is encoded and/or transmitted.
- the reference coding order indication information is coded and/or transmitted.
- the candidate intra-frame coding order includes, but is not limited to, raster and Z scan order.
- the weight data corresponding to the reference coding order has multiple values, and one of the weight data is selected according to the distortion cost function.
- the weight data indication information is encoded into a video code stream.
- a method for inter-frame and intra-frame joint prediction decoding with configurable weights comprising:
- the reference coding order is determined from at least one candidate intra-frame coding order based on the indication information of the reference coding order, and the weight data of the current prediction unit is determined according to the coding order and the coding order of the current prediction unit.
- the weight data is further determined according to the weight data indication information.
- An apparatus for inter-frame and intra-frame joint predictive coding with configurable weights comprising:
- a candidate prediction module for determining at least one candidate intra-coding order
- An intra-frame coding order coding module configured to select at least one candidate intra-frame coding order as a reference coding order, and at the same time determine a weight corresponding to the reference coding order, determined according to the reference coding order and the coding order of the current prediction unit weight data of the current prediction unit; encoding and/or transmitting the weight data of the current prediction unit.
- the above-mentioned apparatus for inter-frame and intra-frame joint predictive coding with configurable weights wherein the intra-frame coding order coding module includes an intra-frame coding order indication coding module for coding and/or the intra-frame coding order indication information. transmission.
- An apparatus for inter-frame and intra-frame joint prediction decoding with configurable weights comprising:
- a candidate construction module used for decoding the encoded and compressed video stream, and constructing at least one candidate intra-frame coding order
- the decoding module is configured to use the indication information of the reference coding order to determine the reference coding order from at least one candidate intra-frame coding order in the candidate list, and determine the weight data according to the coding order and the coding order of the current prediction unit.
- An electronic device comprising:
- the memory is connected in communication with the one or more processors, the memory stores instructions executable by the one or more processors, and the instructions are executed by the one or more processors , the electronic device is used to implement the method described in any one of the above embodiments.
- a computer-readable storage medium having computer-executable instructions stored thereon, when the computer-executable instructions are executed by a computing device, can be used to implement the method described in any one of the above embodiments.
- a computer program product comprising a computer program stored on a computer-readable storage medium, the computer program comprising program instructions, which, when executed by a computer, can be used to implement any of the above implementations method described in the example.
- the method and device for inter-frame and intra-frame joint prediction coding and decoding with configurable weights of the present invention enable coding blocks to utilize the reference pixels on the lower side and the right side, which can adapt to images with different spatial correlations, and the newly designed intra/
- the extrapolation prediction technology can make full use of the surrounding reference pixels to further improve the prediction accuracy, and is no longer limited to whether the adjacent blocks above or on the left are available, thereby improving the overall compression performance, realizing efficient coding, and further improving the compression of video data. efficiency.
- FIG. 1 is a schematic diagram of an inter-frame and intra-frame joint predictive coding method with configurable weights according to the present invention
- FIG. 2 is a schematic diagram of an inter-frame and intra-frame joint predictive decoding method with configurable weights according to the present invention.
- the image block will be represented as short as possible during video encoding, and the image block will be restored using the short code during decoding.
- the content of adjacent images is very similar, and the background picture changes very little. Therefore, it is not necessary to encode all the information of each image, but only the motion information of the moving objects in the current image needs to be transmitted to the decoding.
- the current image can be restored by using the content of the previous image and the motion information of the current image, which can effectively save the bit rate.
- block-based motion compensation technology is mainly used. The principle is to find a best matching block in the previously encoded image for each pixel block of the current image through the motion estimation algorithm, and express the positional relationship between the two image blocks through the motion vector.
- a coding block generally refers to a relatively independent unit that divides a complete image of a frame in any way to facilitate transformation, inter-frame coding/intra-frame coding.
- the basic unit involved in the motion prediction operation in the coding block or the combination of multiple basic units is generally referred to as a prediction block.
- HEVC High Efficiency Video Coding, high-efficiency video coding
- Merge motion merging
- This inter-frame prediction technology is also used in the next-generation standard VVC (Versatile Video Coding, Universal Video Coding).
- a method for inter-frame and intra-frame joint predictive coding with configurable weights is characterized in that, it includes:
- the reference coding order indication information is encoded and/or transmitted.
- PCIIP ((4-(wIntra))*Pinter+(wIntra)*Pintra+2)>>2
- the above example is improved based on the existing coding order and weight calculation method of the VCC standard. It is expected that new scanning methods, such as horizontal scanning from right to left, vertical scanning from bottom to top, and diagonal scanning from bottom right to top left, may also be used, and it is expected that corresponding to different coding orders - The calculation method of the weights of the intra-frame and inter-frame joint technology can also be adjusted accordingly.
- the candidate intra-frame coding order includes but is not limited to raster and Z scan order.
- intra-frame compression is a hierarchical block coding structure based on quadtree division: video frames are first divided into LCU (largest Coding Unit, the largest coding block). ), the LCU is then divided into CUs (Coding Units, coding blocks) according to the recursive quadtree. And LCU and CU are encoded in raster and Z scan order, respectively.
- Raster Scan RasterScan refers to scanning from left to right, from top to bottom, scanning one line first, and then moving to the starting position of the next line to continue scanning.
- H.264 mainly uses the raster scan order.
- Z is the representation of the image.
- the scanning order is from left to right, and then folds back to the next line.
- the block scan order of H.266/VVC includes diagonal scan, horizontal scan and vertical scan order (take 4x4 block as an example).
- the weight data corresponding to the reference coding order has multiple values, and one of the weight data is selected according to a distortion cost function.
- the weight data indication information is encoded into a video code stream.
- the video coding process usually includes some further optimization means, such as motion estimation, intra-frame prediction, DCT (Discrete Cosine Transform), quantization, entropy coding and filtering, etc.
- the embodiment of the application will further combine the above-mentioned one or more optimization means to complete the complete video coding.
- the embodiments of the present application can be directly implemented by the existing means, so they will not be explained one by one, and the above optimization means should not be regarded as a limitation on the specific implementation of the present application. .
- a method for inter-frame and intra-frame joint prediction decoding with configurable weights comprising:
- the reference coding order is determined from at least one candidate intra-frame coding order based on the indication information of the reference coding order, and the weight data of the current prediction unit is determined according to the coding order and the coding order of the current prediction unit.
- the encoder uses the selected reference coding order as the current prediction unit coding order to determine the weight data, and encodes and/or transmits indication information indicating the reference coding order, so that the decoder can obtain the index information and determine the Refer to the coding order.
- the decoding end also uses the same method to establish the candidate list, so that the encoder only needs to transmit the index of the reference coding order to the decoding end, which greatly reduces the bit consumption of the encoding process.
- the decoder uses the coding order as the current block coding order to determine the weight data.
- the decoder can select one or more items of coding order from the candidate list to determine the weight data to determine the current block coding order to determine the weight data, and can select several items with the highest priority for linear combination as the current block coding order to determine the weights data.
- An apparatus for inter-frame and intra-frame joint predictive coding with configurable weights comprising:
- a candidate prediction module for determining at least one candidate intra-coding order
- An intra-frame coding order coding module configured to select at least one candidate intra-frame coding order as a reference coding order, and at the same time determine a weight corresponding to the reference coding order, determined according to the reference coding order and the coding order of the current prediction unit weight data of the current prediction unit; encoding and/or transmitting the weight data of the current prediction unit.
- the intra-frame coding order coding module includes an intra-frame coding order indication coding module, which is used to The intra-frame coding order indication information is encoded and/or transmitted.
- An apparatus for inter-frame and intra-frame joint prediction decoding with configurable weights comprising:
- a candidate construction module used for decoding the encoded and compressed video stream, and constructing at least one candidate intra-frame coding order
- the decoding module is configured to use the indication information of the reference coding order to determine the reference coding order from at least one candidate intra-frame coding order in the candidate list, and determine the weight data according to the coding order and the coding order of the current prediction unit.
- An electronic device comprising:
- the memory is connected in communication with the one or more processors, the memory stores instructions executable by the one or more processors, and the instructions are executed by the one or more processors , the electronic device is used to implement the method according to any one of the above.
- the processor and the memory may be connected through a bus or in other ways, and the connection through a bus is taken as an example.
- the processor may be a central processing unit (Central Processing Unit, CPU).
- the processor may also be other general-purpose processors, digital signal processors (Digital Signal Processors, DSPs), application specific integrated circuits (Application Specific Integrated Circuits, ASICs), Field-Programmable Gate Arrays (Field-Programmable Gate Arrays, FPGAs) or other Chips such as programmable logic devices, discrete gate or transistor logic devices, discrete hardware components, or a combination of the above types of chips.
- the memory can be used to store non-transitory software programs, non-transitory computer-executable programs, and modules, such as the cascaded progressive network in the embodiments of the present application.
- the processor executes various functional applications and data processing of the processor by running non-transitory software programs/instructions and functional modules stored in the memory.
- the memory may include a storage program area and a storage data area, wherein the storage program area may store an operating system and an application program required by at least one function; the storage data area may store data created by the processor, and the like. Additionally, the memory may include high-speed random access memory, and may also include non-transitory memory, such as at least one magnetic disk storage device, flash memory device, or other non-transitory solid state storage device. In some embodiments, the memory may optionally include memory located remotely from the processor, such remote memory being connectable to the processor through a network, such as through a communication interface. Examples of such networks include, but are not limited to, the Internet, an intranet, a local area network, a mobile communication network, and combinations thereof.
- Computer-readable storage media includes physical volatile and non-volatile, removable and non-removable implemented in any manner or technology for storage of information such as computer-readable instructions, data structures, program modules or other data. medium.
- Computer-readable storage media specifically include, but are not limited to, U disk, removable hard disk, read-only memory (ROM, Read-Only Memory), random access memory (RAM, Random Access Memory), erasable programmable read-only memory Memory (EPROM), Electrically Erasable Programmable Read-Only Memory (EEPROM), Flash memory or other solid-state memory technology, CD-ROM, Digital Versatile Disc (DVD), HD-DVD, Blue-Ray, or other optical storage device, tape, disk storage or other magnetic storage device, or any other medium that can be used to store the desired information and that can be accessed by a computer.
- ROM read-only memory
- RAM random access memory
- EPROM erasable programmable read-only memory Memory
- EEPROM Electrically Erasable Programmable Read-Only Memory
- Flash memory
- program modules include routines, programs, components, data structures, and other types of structures that perform particular tasks or implement particular abstract data types.
- program modules may be located in both local and remote memory storage devices.
- the functions, if implemented in the form of software functional units and sold or used as independent products, may be stored in a computer-readable storage medium.
- the technical solution of the present application can be embodied in the form of a software product in essence, or the part that contributes to the original technology or the part of the technical solution.
- the computer software product is stored in a storage medium, including Several instructions are used to cause a computer device (which may be a personal computer, a server, or a network device, etc.) to execute all or part of the steps of the methods described in the various embodiments of the present application.
- a method and device for inter-frame and intra-frame joint prediction coding and decoding with configurable weights of the present invention can adapt to different spatial correlations by enabling coding blocks to use reference pixels that can use the lower and right side. image, and the newly designed interpolation/extrapolation prediction technology can make full use of the surrounding reference pixels to further improve the accuracy of prediction, no longer limited to whether the upper or left adjacent blocks are available, thereby improving the overall compression performance and realizing efficient coding. , which further improves the compression efficiency of video data.
Landscapes
- Engineering & Computer Science (AREA)
- Multimedia (AREA)
- Signal Processing (AREA)
- Compression Or Coding Systems Of Tv Signals (AREA)
Abstract
本发明涉及一种权值可配置的帧间帧内联合预测编解码的方法及装置,对一个编码单元或分块单元进行帧间帧内联合预测,确定至少一个候选帧内编码次序;选择至少一个候选帧内编码次序作为参考编码次序,根据参考编码次序和当前预测单元的编码次序确定权值数据;将当前预测单元的权值数据进行编码和/或传输。本发明使编码块能够利用下侧和右侧的参考像素,可以适应具有不同空间相关性的图像,且新设计的内/外插预测技术可以充分利用周围的参考像素进一步提高预测的准确性,不再局限于上方或左侧相邻块是否可用,从而提高整体的压缩性能,实现了高效编码,进一步提升了视频数据的压缩效率。
Description
本发明涉及到多媒体处理的技术领域,尤其涉及一种权值可配置的帧间帧内联合预测编解码的方法及装置。
在目前大部分视频编解码技术中均普遍采用了帧内预测和帧间预测技术。其中,帧间预测利用当前帧与其参考帧之间的相关性减小时域冗余,而帧内预测利用空间相邻像素点之间的相似性消除空域冗余。
然而在H266/VCC标准之前的所有视频编解码标准中,一个编码块只能选择帧内预测或帧间预测的其中一种。H266/VCC标准对此作了改进,引入了CIIP即帧内帧间联合技术,使得同一编码块。在CIIP技术中,首先计算当前预测块的帧内预测值,即用传统的帧内预测模式去预测当前块的像素值。然后利用帧间的预测方式去预测当前块的帧间预测值,最后将帧内和帧间的预测值进行加权求和,得到当前预测块的最终预测值。CIIP技术在亮度块的和色度块都要实现。具体的加权公式如下:PCIIP=((4-wIntra)*Pinter+wIntra*Pintra+2)>>2,权值wIntra是由当前CU的上方和左侧相邻块的编码模式决定的,其中帧间预测值的权重大小的为4-wIntra。权重按这种方式确定:如果上方的相邻块可用,且使用的是帧内预测模式则isIntraTop=1,否则isIntraTop=0;如果左侧的相邻块可用,且使用的是帧内预测模式则isIntraLeft=1,否则isIntraLeft=0。确定
a、如果isIntraTop+isIntraLeft=2,则wIntra=3;
b、如果如果isIntraTop+isIntraLeft=1,则wIntra=2;
c、否则wIntra=1
上述方案中权值的选择非常有限,且受限于上方或左侧相邻块是否可用,并没有最充分地同时减少时域相关性和空域相关性,影响了编码效率。
发明内容
本发明的目的在于提供一种权值可配置的帧间帧内联合预测编解码的方法及装置,用于解决权值的选择非常有限,且受限于上方或左侧相邻块是否可用,并没有最充分地同时减少时域相关性和空域相关性,影响了编码效率的技术问题。
一种权值可配置的帧间帧内联合预测编码的方法,包括:
对一个编码单元或分块单元进行帧间帧内联合预测,确定至少一个候选帧内编码次序;
选择至少一个候选帧内编码次序作为参考编码次序,且同时确定所述参考编码次序相对应的权值,根据所述参考编码次序和当前预测单元的编码次序确定当前预测单元的权值数据;
将所述当前预测单元的权值数据进行编码和/或传输。
上述的权值可配置的帧间帧内联合预测编码的方法,将所述参考编码次序指示信息进行编码和/或传输。
上述的权值可配置的帧间帧内联合预测编码的方法,所述候选帧内编码次序包括且不限于按照光栅和Z扫描顺序。
上述的权值可配置的帧间帧内联合预测编码的方法,所述参考编码次序相对应的权值数据存在多个值,根据失真代价函数选择其中一个权值数据。
上述的权值可配置的帧间帧内联合预测编码的方法,将所述权值数据指示信息编码到视频码流。
一种权值可配置的帧间帧内联合预测解码的方法,包括:
对编码压缩后的视频码流解码,确定至少一个候选帧内编码次序包括帧内编码次序指示信息;
基于参考编码次序的指示信息从至少一个候选帧内编码次序确定参考编码次序,根据所述编码次序和当前预测单元的编码次序确定所述当前预测单元的权值数据。
上述的权值可配置的帧间帧内联合预测解码的方法,若所述参考编码次序相对应的权值数据存在多个值,则进一步根据权值数据指示信息确定权值数据。
一种权值可配置的帧间帧内联合预测编码的装置,包括:
候选预测模块,用于确定至少一个候选帧内编码次序;
帧内编码次序编码模块,用于选择至少一个候选帧内编码次序作为参考编码次序,且同时确定所述参考编码次序相对应的权值,根据所述参考编码次序和当前预测单元的编码次序确定当前预测单元的权值数据;将所述当前预测单元的权值数据进行编码和/或传输。
上述的权值可配置的帧间帧内联合预测编码的装置,所述帧内编码次序编码模块包括帧内编码次序指示编码模块,用于将所述帧内编码次序指示信息进行编码和/或传输。
一种权值可配置的帧间帧内联合预测解码的装置,包括:
候选构建模块,用于对编码压缩后的视频码流解码,构建至少一个候选帧内编码次序;
解码模块,用于利用参考编码次序的指示信息从候选列表中的至少一个候选帧内编码次序确定参考编码次序,根据所述编码次序和当前预测单元的编码次序确定权值数据。
一种电子设备,包括:
存储器以及一个或多个处理器;
其中,所述存储器与所述一个或多个处理器通信连接,所述存储器中存储有可被所述一个或多个处理器执行的指令,所述指令被所述一个或多个处理器执行时,所述电子设备用于实现以上任一项实施例所述的方法。
一种计算机可读存储介质,其上存储有计算机可执行指令,当所述计算机可执行指令被计算装置执行时,可用来实现以上任一项实施例所述的方法。
一种计算机程序产品,所述计算机程序产品包括存储在计算机可读存储介质上的计算机程序,所述计算机程序包括程序指令,当所述程序指令被计算机执行时,可用来实现以上任一项实施例所述的方法。
上述技术方案具有如下优点或有益效果:
本发明权值可配置的帧间帧内联合预测编解码的方法及装置使编码块利用能够利用下侧和右侧的参考像素,可以适应具有不同空间相关性的图像,且新设计的内/外插预测技术可以充分利用周围的参考像素进一步提高预测的准确性,不再局限于上方或左侧相邻块是否可用从而提高整体的压缩性能,实现了高效编码,进一步提升了视频数据的压缩效率。
图1是本发明一种权值可配置的帧间帧内联合预测编码的方法的示意图;
图2是本发明一种权值可配置的帧间帧内联合预测解码的方法的示意图。
为使本发明的上述目的、特征和优点能够更为明显易懂,下面结合附图对本发明的具体实施例做详细的说明。
为了充分提高压缩率,视频编码时会尽量把图像块采用尽可能简短的编码表示,解码时再使用简短编码还原该图像块。大多数视频序列中,相邻图像内容非常相似,其背景画面变化极小,因此不需要对每幅图像的全部信息都进行编码,而只需要将将当前图像中运动物体的运动信息传给解码器,利用前一图像内容以及当前图像的运动信息即可恢复当前图像,这样可以有效的节省比特率。目前主要采用基于块的运动补偿技术,原理是通过运动估计算法为当前图像的每个像素块在之前已编码图像中寻找一个最佳匹配块,通过运动矢量表述两个图像块的位置关系,从而建立编码的关联。其中,用于预测的图像称为参考图像,参考块到当前像素块的位移称为运动矢量MV,当前块与参考块的差值称为预测残差。在本发明中,编码块泛指以任何方式对一帧完整的图像进行划分以利于进行变换、帧间编码/帧内编码的相对独立的单元。其中,在编码块内涉及到运动预测运算的基本单元或多个基本单元 的组合泛称为预测块。
除了图像块具有相关性之外,现有技术还发现帧间不同编码块的运动矢量之间也存在空域和时域上的相关性,因而进一步出现了对运动矢量进行预测的技术,比如HEVC(High Efficiency Video Coding,高效率视频编码,已成为H.265国际标准)提出的运动合并(Merge)技术,其利用当前预测单元空时域上相邻已编码块的运动信息(运动矢量和参考帧信息)直接推导得到当前块运动信息,实现了运动矢量的预测。下一代标准VVC(Versatile Video Coding,通用视频编码)中也沿用了这一种帧间预测技术。
但是,由于现有技术的运动矢量仅考虑了物体整体平移的情况,仅根据一个像素点的坐标位移确定,当物体旋转运动时,现有的运动矢量无法有效描述旋转情况,因而也无法采用现有运动估计和Merge技术来进行旋转运动的运动矢量预测。
结合图1、图2所示,一种权值可配置的帧间帧内联合预测编码的方法,其特征在于,包括:
对一个编码单元或分块单元进行帧间帧内联合预测,确定至少一个候选帧内编码次序;
选择至少一个候选帧内编码次序作为参考编码次序,且同时确定所述参考编码次序相对应的权值,根据所述参考编码次序和当前预测单元的编码次序确定当前预测单元的权值数据;
将所述当前预测单元的权值数据进行编码和/或传输;
将所述参考编码次序指示信息进行编码和/或传输。
现有CIIP算法中,加权公式如下:
PCIIP=((4-(wIntra))*Pinter+(wIntra)*Pintra+2)>>2
其中的帧内预测部分权值(wIntra)及帧间预测部分权值wInter=(4-(wIntra))由上方或左侧相邻块是否可用来确定,跟所采取的编码顺序无直接相关。
具体实施例一,预先设定一个编码次序-帧内帧间联合技术权值计算方式对应表:
以上示例基于VCC标准现有编码次序和权值计算方式进行改进。可以预料得到,新的扫描方式,比如从右到左的水平扫描、从下到上的垂直扫描、从右下到左上的对角扫描也有可能被采用,可以预料得到相应地于不同编码次序-帧内帧间联合技术权值计算方式,也可以进行相应调整。
进一步地,本发明一种权值可配置编码的方法的较佳的实施例中,所述候选帧内编码次序包括且不限于按照光栅和Z扫描顺序。
现有技术:视频编码国际标准High Efficiency Video Coding(HEVC)中,帧内压缩是以四叉树划分为基础的分层分块编码结构:视频帧先划分为LCU(largest Coding Unit,最大编码块),LCU再按照递归的四叉树划分为CU(Coding Unit,编码块)。并且LCU和CU分别按照光栅和Z扫描顺序编码。光栅扫描(RasterScan)是指从左往右,由上往下,先扫描完一行,再移至下一行起始位置继续扫描,H.264使用的主要就是光栅扫描顺序。Z字形扫描(Z-Scan)中Z是形象的表示方式,扫描顺序是从左到右,然后折回下一行。H.266/VVC的块扫描顺序包括对角扫描、水平扫描和垂直扫描顺序(以4x4块为例)。
进一步地,本发明一种权值可配置编码的方法的较佳的实施例中,所述参考编码次序相对应的权值数据存在多个值,根据失真代价函数选择其中一个权值数据。
进一步地,本发明一种权值可配置编码的方法的较佳的实施例中,将所述权值数据指示信息编码到视频码流。
若上表所示,在对角扫描时,a、如果isIntraTopOrTopRight+isIntraLeftOrBottomLeft=2,则wIntra=2或3。究竟是2还是3,可以选择其中失真度最小的一个。可以用ssd、sad、mse、psnr都可以度量失真度。选择不同的权值(2或3)对编码块进行编码,计算对应的失真,选择可以选择其中失真度最小的一个,并记录其指示信息。比如,用比特0表示wIntra=2,用比特1表示wIntra=3。
当然,本领域技术人员应当理解,视频编码过程通常还包括一些进一步优化的手段,比如运动估计、帧内预测、DCT(离散余弦变换,Discrete Cosine Transform)、量化、熵编码和滤波等,故本申请的实施例除帧间预测外,还会进一步结合上述一种或多种优化手段来完成完整的视频编码。鉴于现有技术对这些优化手段已有充分的研究,本申请的实施例可直接使用现有手段来实现,故不再一一展开说明,上述优化手段也不应视作对本申请具体实现的限制。
一种权值可配置的帧间帧内联合预测解码的方法,包括:
对编码压缩后的视频码流解码,确定至少一个候选帧内编码次序包括帧内编码次序指示信息;
基于参考编码次序的指示信息从至少一个候选帧内编码次序确定参考编码次序,根据所述编码次序和当前预测单元的编码次序确定所述当前预测单元的权值数据。
其中,编码器将所选择的参考编码次序作为当前预测单元编码次序确定权值数据,并将指示参考编码次序的指示信息进行编码和/或传输,以供解码器获取该索引信息并确定所述参考编码次序。在解码端也使用同样的方法建立该候选列表,如此以来,编码器只需要传输参考编码次序的索引到解码端即可,大大减少了编码过程的比特消耗。其中,解码器确定参考编码次序后,将其编码次序作为当前块编码次序确定权值数据。解码器可以从候选列表中选择其中一项或多项编码次序确定权值数据确定当前块编码次序确定权值数据,可以选择优先级最高的若干项MVP进行线性组合作为当前块编码次序确定权值数据。
进一步地,本发明一种权值可配置的帧间帧内联合预测解码的方法的较佳的实施例中,若所述参考编码次序相对应的权值数据存在多个值,则进一步根据权值数据指示信息确定权值数据。
一种权值可配置的帧间帧内联合预测编码的装置,包括:
候选预测模块,用于确定至少一个候选帧内编码次序;
帧内编码次序编码模块,用于选择至少一个候选帧内编码次序作为参考编码次序,且同时确定所述参考编码次序相对应的权值,根据所述参考编码次序和当前预测单元的编码次序确定当前预测单元的权值数据;将所述当前预测单元的权值数据进行编码和/或传输。
进一步地,本发明一种权值可配置的帧间帧内联合预测编码的装置的较佳的实施例中,所述帧内编码次序编码模块包括帧内编码次序指示编码模块,用于将所述帧内编码次序指示信息进行编码和/或传输。
一种权值可配置的帧间帧内联合预测解码的装置,包括:
候选构建模块,用于对编码压缩后的视频码流解码,构建至少一个候选帧内编码次序;
解码模块,用于利用参考编码次序的指示信息从候选列表中的至少一个候选帧内编码次序确定参考编码次序,根据所述编码次序和当前预测单元的编码次序确定权值数据。
一种电子设备,包括:
存储器以及一个或多个处理器;
其中,所述存储器与所述一个或多个处理器通信连接,所述存储器中存储有可被所述一个或多个处理器执行的指令,所述指令被所述一个或多个处理器执行时,所述电子设备用于实现如以上任一项所述的方法。
具体地,处理器和存储器可以通过总线或者其他方式连接,以通过总线连接为例。处理器可以为中央处理器(Central Processing Unit,CPU)。处理器还可以为其他通用处理器、数字信号处理器(Digital Signal Processor,DSP)、专用集成电路(Application Specific Integrated Circuit,ASIC)、现场可编程门阵列(Field-Programmable Gate Array,FPGA)或者其他可编程逻辑器件、分立门或者晶体管逻辑器件、分立硬件组件等芯片,或者上述各类芯片的组合。
存储器作为一种非暂态计算机可读存储介质,可用于存储非暂态软件程序、非暂态计算机可执行程序以及模块,如本申请实施例中的级联渐进网络等。处理器通过运行存储在存储器中的非暂态软件程序/指令以及功能模块,从而执行处理器的各种功能应用以及数据处理。
存储器可以包括存储程序区和存储数据区,其中,存储程序区可存储操作系统、至少一个功能所需要的应用程序;存储数据区可存储处理器所创建的数据等。此外,存储器可以包括高速随机存取存储器,还可以包括非暂态存储器,例如至少一个磁盘存储器件、闪存 器件、或其他非暂态固态存储器件。在一些实施例中,存储器可选包括相对于处理器远程设置的存储器,这些远程存储器可以通过网络(比如通过通信接口)连接至处理器。上述网络的实例包括但不限于互联网、企业内部网、局域网、移动通信网及其组合。
一种计算机可读存储介质,其上存储有计算机可执行指令,当所述计算机可执行指令被计算装置执行时,可用来实现如以上任一项所述的方法。
前述的计算机可读取存储介质包括以存储如计算机可读指令、数据结构、程序模块或其他数据等信息的任何方式或技术来实现的物理易失性和非易失性、可移动和不可移动介质。计算机可读取存储介质具体包括,但不限于,U盘、移动硬盘、只读存储器(ROM,Read-Only Memory)、随机存取存储器(RAM,Random Access Memory)、可擦除可编程只读存储器(EPROM)、电可擦可编程只读存储器(EEPROM)、闪存或其他固态存储器技术、CD-ROM、数字多功能盘(DVD)、HD-DVD、蓝光(Blue-Ray)或其他光存储设备、磁带、磁盘存储或其他磁性存储设备、或能用于存储所需信息且可以由计算机访问的任何其他介质。
尽管此处所述的主题是在结合操作系统和应用程序在计算机系统上的执行而执行的一般上下文中提供的,但本领域技术人员可以认识到,还可结合其他类型的程序模块来执行其他实现。一般而言,程序模块包括执行特定任务或实现特定抽象数据类型的例程、程序、组件、数据结构和其他类型的结构。本领域技术人员可以理解,此处所述的本主题可以使用其他计算机系统配置来实践,包括手持式设备、多处理器系统、基于微处理器或可编程消费电子产品、小型计算机、大型计算机等,也可使用在其中任务由通过通信网络连接的远程处理设备执行的分布式计算环境中。在分布式计算环境中,程序模块可位于本地和远程存储器存储设备的两者中。
本领域普通技术人员可以意识到,结合本文中所本申请的实施例描述的各示例的单元及方法步骤,能够以电子硬件、或者计算机软件和电子硬件的结合来实现。这些功能究竟以硬件还是软件方式来执行,取决于技术方案的特定应用和设计约束条件。专业技术人员可以对每个特定的应用来使用不同方法来实现所描述的功能,但是这种实现不应认为超出本申请的范围。
所述功能如果以软件功能单元的形式实现并作为独立的产品销售或使用时,可以存储在一个计算机可读取存储介质中。基于这样的理解,本申请的技术方案本质上或者说对原有技术做出贡献的部分或者该技术方案的部分可以以软件产品的形式体现出来,该计算机软件产品存储在一个存储介质中,包括若干指令用以使得一台计算机设备(可以是个人计算 机,服务器,或者网络设备等)执行本申请各个实施例所述方法的全部或部分步骤。
综上所述,本发明一种权值可配置的帧间帧内联合预测编解码的方法及装置通过使编码块利用能够利用下侧和右侧的参考像素,可以适应具有不同空间相关性的图像,且新设计的内/外插预测技术可以充分利用周围的参考像素进一步提高预测的准确性,不再局限于上方或左侧相邻块是否可用从而提高整体的压缩性能,实现了高效编码,进一步提升了视频数据的压缩效率。
以上仅为本发明较佳的实施例,并非因此限制本发明的实施方式及保护范围,对于本领域技术人员而言,应当能够意识到凡运用本发明说明书及图示内容所作出的等同替换和显而易见的变化所得到的方案,均应当包含在本发明的保护范围内。
在本发明的描述中,需要理解的是,术语“上”、“下”、“前”“后”等指示的方位或位置关系为基于附图所示的方位或位置关系,仅是为了便于描述本发明和简化描述,而不是指示或暗示所指的装置或元件必须具有特定的方位、以特定的方位构造和操作,因此不能理解为对本发明的限制。
Claims (10)
- 一种权值可配置的帧间帧内联合预测编码的方法,其特征在于,包括:对一个编码单元或分块单元进行帧间帧内联合预测,确定至少一个候选帧内编码次序;选择至少一个候选帧内编码次序作为参考编码次序,且同时确定所述参考编码次序相对应的权值,根据所述参考编码次序和当前预测单元的编码次序确定当前预测单元的权值数据;将所述当前预测单元的权值数据进行编码和/或传输。
- 如权利要求1所述的权值可配置的帧间帧内联合预测编码的方法,其特征在于,将所述参考编码次序指示信息进行编码和/或传输。
- 如权利要求1所述的权值可配置的帧间帧内联合预测编码的方法,其特征在于,所述候选帧内编码次序包括且不限于按照光栅和Z扫描顺序。
- 如权利要求1或2所述的权值可配置的帧间帧内联合预测编码的方法,其特征在于,所述参考编码次序相对应的权值数据存在多个值,根据失真代价函数选择其中一个权值数据。
- 如权利要求4所述的权值可配置的帧间帧内联合预测编码的方法,其特征在于,将所述权值数据指示信息编码到视频码流。
- 一种权值可配置的帧间帧内联合预测解码的方法,其特征在于,包括:对编码压缩后的视频码流解码,确定至少一个候选帧内编码次序包括帧内编码次序指示信息;基于参考编码次序的指示信息从至少一个候选帧内编码次序确定参考编码次序,根据所述编码次序和当前预测单元的编码次序确定所述当前预测单元的权值数据。
- 如权利要求6所述的权值可配置的帧间帧内联合预测解码的方法,其特征在于,若所述参考编码次序相对应的权值数据存在多个值,则进一步根据权值数据指示信息确定权值数据。
- 一种权值可配置的帧间帧内联合预测编码的装置,其特征在于,包括:候选预测模块,用于确定至少一个候选帧内编码次序;帧内编码次序编码模块,用于选择至少一个候选帧内编码次序作为参考编码次序,且同时确定所述参考编码次序相对应的权值,根据所述参考编码次序和当前预测单元的编码次序确定当前预测单元的权值数据;将所述当前预测单元的权值数据进行编码和/或传输。
- 如权利要求8所述的权值可配置的帧间帧内联合预测编码的装置,其特征在于,所述帧内编码次序编码模块包括帧内编码次序指示编码模块,用于将所述帧内编码次序指示信息进行编码和/或传输。
- 一种权值可配置的帧间帧内联合预测解码的装置,其特征在于,包括: 候选构建模块,用于对编码压缩后的视频码流解码,构建至少一个候选帧内编码次序;解码模块,用于利用参考编码次序的指示信息从候选列表中的至少一个候选帧内编码次序确定参考编码次序,根据所述编码次序和当前预测单元的编码次序确定权值数据。
Applications Claiming Priority (2)
Application Number | Priority Date | Filing Date | Title |
---|---|---|---|
CN202110482985.6 | 2021-04-30 | ||
CN202110482985.6A CN112887732B (zh) | 2021-04-30 | 2021-04-30 | 一种权值可配置的帧间帧内联合预测编解码的方法及装置 |
Publications (1)
Publication Number | Publication Date |
---|---|
WO2022227622A1 true WO2022227622A1 (zh) | 2022-11-03 |
Family
ID=76040270
Family Applications (1)
Application Number | Title | Priority Date | Filing Date |
---|---|---|---|
PCT/CN2021/139107 WO2022227622A1 (zh) | 2021-04-30 | 2021-12-17 | 一种权值可配置的帧间帧内联合预测编解码的方法及装置 |
Country Status (2)
Country | Link |
---|---|
CN (1) | CN112887732B (zh) |
WO (1) | WO2022227622A1 (zh) |
Families Citing this family (2)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
CN112887732B (zh) * | 2021-04-30 | 2021-10-01 | 浙江智慧视频安防创新中心有限公司 | 一种权值可配置的帧间帧内联合预测编解码的方法及装置 |
CN115052160B (zh) * | 2022-04-22 | 2023-07-07 | 江西中烟工业有限责任公司 | 基于云数据自动下载的图像编码方法、装置及电子设备 |
Citations (5)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
CN110519600A (zh) * | 2019-08-21 | 2019-11-29 | 浙江大华技术股份有限公司 | 帧内帧间联合预测方法、装置、编解码器及存储装置 |
CN111010578A (zh) * | 2018-12-28 | 2020-04-14 | 北京达佳互联信息技术有限公司 | 一种帧内帧间联合预测的方法、装置以及存储介质 |
JP2021027464A (ja) * | 2019-08-05 | 2021-02-22 | 日本放送協会 | 予測ブロック生成装置、画像符号化装置、画像復号装置、及びプログラム |
WO2021061018A1 (en) * | 2019-09-24 | 2021-04-01 | Huawei Technologies Co., Ltd. | Method and apparatus of combined intra-inter prediction |
CN112887732A (zh) * | 2021-04-30 | 2021-06-01 | 浙江智慧视频安防创新中心有限公司 | 一种权值可配置的帧间帧内联合预测编解码的方法及装置 |
Family Cites Families (2)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
EP3912350A4 (en) * | 2019-03-21 | 2022-03-30 | Beijing Bytedance Network Technology Co., Ltd. | IMPROVED WEIGHT PROCESSING OF COMBINED INTRA-INTER PREDICTION |
CN112584142B (zh) * | 2019-09-30 | 2022-09-30 | 杭州海康威视数字技术股份有限公司 | 一种编解码方法、装置及其设备 |
-
2021
- 2021-04-30 CN CN202110482985.6A patent/CN112887732B/zh active Active
- 2021-12-17 WO PCT/CN2021/139107 patent/WO2022227622A1/zh active Application Filing
Patent Citations (5)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
CN111010578A (zh) * | 2018-12-28 | 2020-04-14 | 北京达佳互联信息技术有限公司 | 一种帧内帧间联合预测的方法、装置以及存储介质 |
JP2021027464A (ja) * | 2019-08-05 | 2021-02-22 | 日本放送協会 | 予測ブロック生成装置、画像符号化装置、画像復号装置、及びプログラム |
CN110519600A (zh) * | 2019-08-21 | 2019-11-29 | 浙江大华技术股份有限公司 | 帧内帧间联合预测方法、装置、编解码器及存储装置 |
WO2021061018A1 (en) * | 2019-09-24 | 2021-04-01 | Huawei Technologies Co., Ltd. | Method and apparatus of combined intra-inter prediction |
CN112887732A (zh) * | 2021-04-30 | 2021-06-01 | 浙江智慧视频安防创新中心有限公司 | 一种权值可配置的帧间帧内联合预测编解码的方法及装置 |
Also Published As
Publication number | Publication date |
---|---|
CN112887732A (zh) | 2021-06-01 |
CN112887732B (zh) | 2021-10-01 |
Similar Documents
Publication | Publication Date | Title |
---|---|---|
JP7233218B2 (ja) | ビデオコーディングのためにブロックの複数のクラスのためのフィルタをマージすること | |
JP6716611B2 (ja) | スライスレベルのイントラブロックコピーおよび他のビデオコーディングの改善 | |
RU2683495C1 (ru) | Нововведения в предсказание блочных векторов и оценку восстановленных значений отсчетов в области перекрытия | |
US20180376160A1 (en) | Motion-based priority for the construction of candidate lists in video coding | |
WO2022104498A1 (zh) | 帧内预测方法、编码器、解码器以及计算机存储介质 | |
WO2019062544A1 (zh) | 视频图像的帧间预测方法、装置及编解码器 | |
JP7148612B2 (ja) | ビデオデータインター予測の方法、装置、ビデオエンコーダ、ビデオデコーダ及びプログラム | |
JP2024099849A (ja) | ビデオ符号化のための双方向オプティカルフローおよびデコーダ側動きベクトル補正を選択的に適用する方法およびデバイス | |
US20140003518A1 (en) | Image encoding method using a skip mode, and a device using the method | |
WO2022227622A1 (zh) | 一种权值可配置的帧间帧内联合预测编解码的方法及装置 | |
JP2013524730A (ja) | 幾何学的動き区分のための固定小数点実装形態 | |
KR20170093833A (ko) | 인트라 모드들의 코딩 | |
WO2022237168A1 (zh) | 一种顺序执行的帧间帧内联合预测编解码的方法及装置 | |
WO2020247577A1 (en) | Adaptive motion vector resolution for affine mode | |
CN117501694A (zh) | 用于视频编解码中的仿射运动补偿的具有双向匹配的运动细化 | |
KR100856392B1 (ko) | 현재 영상의 복원영역을 참조하는 동영상 부호화/복호화장치 및 그 방법 | |
CN117730535A (zh) | 视频编解码中用于仿射运动补偿预测的几何分割 | |
WO2020132908A1 (zh) | 解码预测方法、装置及计算机存储介质 | |
CN117957838A (zh) | 帧内预测模式信令 | |
CN117730531A (zh) | 用于解码器侧帧内模式推导的方法和设备 | |
Suzuki et al. | Block-based reduced resolution inter frame coding with template matching prediction | |
CN116636218A (zh) | 利用多方向帧内预测的视频编解码 | |
US20180249155A1 (en) | Method and apparatus of prediction offset derived based on neighbouring area in video coding | |
WO2020007093A1 (zh) | 一种图像预测方法及装置 | |
WO2020056767A1 (zh) | 视频图像分量的预测方法、装置及计算机存储介质 |
Legal Events
Date | Code | Title | Description |
---|---|---|---|
121 | Ep: the epo has been informed by wipo that ep was designated in this application |
Ref document number: 21939081 Country of ref document: EP Kind code of ref document: A1 |
|
NENP | Non-entry into the national phase |
Ref country code: DE |
|
122 | Ep: pct application non-entry in european phase |
Ref document number: 21939081 Country of ref document: EP Kind code of ref document: A1 |