WO2022252567A1 - Method and device for determining video encoding and decoding priority order based on correlation comparison - Google Patents

Method and device for determining video encoding and decoding priority order based on correlation comparison

Info

Publication number
WO2022252567A1
Authority
WO
WIPO (PCT)
Prior art keywords
correlation
video
coding
determining
encoding
Prior art date
Application number
PCT/CN2021/139138
Other languages
English (en)
Chinese (zh)
Inventor
洪一帆
王琪
周东东
滕波
Original Assignee
浙江智慧视频安防创新中心有限公司
Priority date (The priority date is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the date listed.)
Filing date
Publication date
Application filed by 浙江智慧视频安防创新中心有限公司
Publication of WO2022252567A1

Classifications

    • H ELECTRICITY
    • H04 ELECTRIC COMMUNICATION TECHNIQUE
    • H04N PICTORIAL COMMUNICATION, e.g. TELEVISION
    • H04N19/00 Methods or arrangements for coding, decoding, compressing or decompressing digital video signals
    • H04N19/10 Methods or arrangements for coding, decoding, compressing or decompressing digital video signals using adaptive coding
    • H04N19/102 Methods or arrangements for coding, decoding, compressing or decompressing digital video signals using adaptive coding characterised by the element, parameter or selection affected or controlled by the adaptive coding
    • H04N19/127 Prioritisation of hardware or computational resources
    • H ELECTRICITY
    • H04 ELECTRIC COMMUNICATION TECHNIQUE
    • H04N PICTORIAL COMMUNICATION, e.g. TELEVISION
    • H04N19/00 Methods or arrangements for coding, decoding, compressing or decompressing digital video signals
    • H04N19/10 Methods or arrangements for coding, decoding, compressing or decompressing digital video signals using adaptive coding
    • H04N19/134 Methods or arrangements for coding, decoding, compressing or decompressing digital video signals using adaptive coding characterised by the element, parameter or criterion affecting or controlling the adaptive coding
    • H04N19/136 Incoming video signal characteristics or properties
    • H ELECTRICITY
    • H04 ELECTRIC COMMUNICATION TECHNIQUE
    • H04N PICTORIAL COMMUNICATION, e.g. TELEVISION
    • H04N19/00 Methods or arrangements for coding, decoding, compressing or decompressing digital video signals
    • H04N19/10 Methods or arrangements for coding, decoding, compressing or decompressing digital video signals using adaptive coding
    • H04N19/169 Methods or arrangements for coding, decoding, compressing or decompressing digital video signals using adaptive coding characterised by the coding unit, i.e. the structural portion or semantic portion of the video signal being the object or the subject of the adaptive coding
    • H04N19/17 Methods or arrangements for coding, decoding, compressing or decompressing digital video signals using adaptive coding characterised by the coding unit, i.e. the structural portion or semantic portion of the video signal being the object or the subject of the adaptive coding, the unit being an image region, e.g. an object
    • H04N19/176 Methods or arrangements for coding, decoding, compressing or decompressing digital video signals using adaptive coding characterised by the coding unit, i.e. the structural portion or semantic portion of the video signal being the object or the subject of the adaptive coding, the unit being an image region, e.g. an object, the region being a block, e.g. a macroblock

Definitions

  • the present invention relates to the technical field of multimedia processing, in particular to a method and a device for determining video encoding and decoding priorities based on correlation degree comparison.
  • Video compression based on video analysis tasks is somewhat different from other video compression scenarios.
  • In scenarios such as film and television video compression, the quality of compressed video is judged mainly by people subjectively, and the impact of compression distortion is mainly its impact on human subjective perception.
  • Video analysis is generally done automatically by a computer, and it is required that the analysis result of the compressed video should be consistent with the analysis result of the uncompressed video, otherwise it will cause erroneous analysis results.
  • a public document (CN111901594A) has proposed the concept of "just identifiable distortion".
  • The just identifiable distortion is the distortion threshold of the coded image at which the visual analysis accuracy falls below the preset accuracy. The coding parameters are determined according to the just identifiable distortion of the image, and the image is coded according to those parameters to obtain the corresponding coded image. At the same bit rate, this improves the performance of visual analysis tasks on encoded images, and it can therefore save bit rate.
  • Existing publications disclose the use of just identifiable distortion prediction models to determine encoding parameters, but do not disclose how to transmit the encoding parameters and their encoding blocks to overcome the problem of limited transmission channels.
  • determining the encoding parameters according to the just identifiable distortion of the image can concentrate most of the bit rate on the encoding blocks that affect the visual analysis task, and they have a higher impact on the video analysis results than other encoding blocks.
  • The determination of the coding order in the standard is related only to video content and has nothing to do with video analysis. Therefore, the encoding order and the transmission order of the encoded data cannot prioritize important data according to the needs of video analysis.
  • the purpose of the present invention is to provide a method and device for determining the priority order of video encoding and decoding based on correlation degree comparison, which is used to solve the above-mentioned technical problem that the encoding order and the transmission order of encoded data cannot prioritize important data according to the needs of video analysis.
  • a method for determining video encoding priority based on correlation comparison comprising the following steps:
  • the encoded block is encoded and/or transmitted.
  • the above-mentioned method for determining video encoding priority based on correlation comparison, said determining the coding block for priority encoding includes:
  • the determination of the coding block for priority coding also includes the coding sequence formed by horizontal scanning, vertical scanning and diagonal scanning.
  • the coding block represents a video data block that is subjected to intra-frame prediction coding.
  • the coding block has a corresponding correlation for each video analysis task.
  • the determination of the correlation degree corresponding to each coding block includes: performing numerical modification on each coding block to determine the correlation degree between each coding block and a video analysis task.
  • a method for determining priority order of video decoding based on correlation comparison comprising the following steps:
  • the video code stream is decoded by the decoder to obtain the corresponding image frames.
  • a device for determining video encoding priority based on correlation comparison comprising:
  • a determination module is used to modify the numerical value of each coding block to determine the correlation between each coding block and the video analysis task;
  • a comparison module configured to preset a reference threshold, and compare the correlation corresponding to each coding block with the threshold
  • An encoding module configured to select encoding blocks whose correlation degrees exceed the reference threshold and encode them preferentially.
  • the above-mentioned device for determining video coding priority based on correlation comparison further includes a block module for dividing the entire image frame into one or more coding blocks, and each coding block is a rectangular area containing an integer number of CTUs.
  • a device for determining video decoding priority based on correlation comparison comprising:
  • a conversion module configured to convert the encoding sequence indication information and video data into a video code stream
  • a decoding module configured to decode the video code stream to obtain the corresponding image frames.
  • An electronic device comprising: a memory, a processor, and a computer program stored on the memory and operable on the processor, wherein the processor, when running the computer program, implements any of the above methods for determining video encoding priority based on correlation comparison.
  • a computer-readable storage medium on which computer-readable instructions are stored, and the computer-readable instructions can be executed by a processor to implement the method for determining video encoding priority based on correlation comparison described in any one of the above.
  • the method and device for determining the priority order of video encoding and decoding based on correlation degree comparison determine the correlation degree between each sub-block and the video analysis task according to the video analysis task, and determine the encoding sequence based on the correlation degree. The encoding sequence and the transmission sequence of the encoded data prioritize important data according to the needs of video analysis; video encoding is performed and the encoded data is converted into a video code stream, which keeps the accuracy of the video analysis results within an acceptable range while ensuring the best data compression rate.
  • FIG. 1 is a schematic diagram of a method for determining video coding priority based on correlation comparison in the present invention
  • Fig. 2 is a schematic diagram of the horizontal scanning coding sequence in the embodiment of the present invention.
  • first and second are used for descriptive purposes only, and cannot be understood as indicating or implying relative importance or implicitly specifying the quantity of indicated technical features. Thus, the features defined as “first” and “second” may explicitly or implicitly include at least one of these features.
  • a method for determining video coding priority based on correlation comparison includes the following steps:
  • the encoded block is encoded and/or transmitted.
  • the coding blocks for determining priority coding also include coding sequences formed by horizontal scanning, vertical scanning and diagonal scanning.
  • Preset a reference threshold and compare the correlation degree of each coding block with it. Coding blocks whose correlation degree is higher than the threshold are considered closely related to the specific video analysis task and have the highest encoding priority;
  • the coding blocks whose correlation degrees exceed the reference threshold are selected for priority coding: within the coding sequence formed by horizontal scanning, vertical scanning, or diagonal scanning, the coding blocks whose correlation degree exceeds the threshold are coded first.
  • the block scan order of H.266/VVC includes diagonal scan, horizontal scan and vertical scan order.
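The three scan orders named above can be illustrated on a small grid of coding blocks. This is a simplified sketch, not the normative H.266/VVC scan derivation: the function names are hypothetical, and the diagonal variant shown here simply walks each anti-diagonal from bottom-left to top-right, which is an assumption made for illustration.

```python
from typing import List, Tuple

Pos = Tuple[int, int]  # (row, column) of a coding block in the grid


def horizontal_scan(rows: int, cols: int) -> List[Pos]:
    """Row by row, left to right (raster order)."""
    return [(r, c) for r in range(rows) for c in range(cols)]


def vertical_scan(rows: int, cols: int) -> List[Pos]:
    """Column by column, top to bottom."""
    return [(r, c) for c in range(cols) for r in range(rows)]


def diagonal_scan(rows: int, cols: int) -> List[Pos]:
    """Anti-diagonal by anti-diagonal (row + column constant),
    each one walked from bottom-left to top-right (illustrative choice)."""
    order: List[Pos] = []
    for d in range(rows + cols - 1):
        for r in range(min(d, rows - 1), -1, -1):
            c = d - r
            if c < cols:
                order.append((r, c))
    return order
```

Each function returns every grid position exactly once, so any of the three can serve as the base order on which priority selection is then applied.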
  • the coding blocks that are preferentially coded are 2, 7, and 8.
  • the horizontal scanning coding order is still maintained in the priority coding block and other coding blocks.
  • the complete coding sequence is: 2, 7, 8, 1, 3, 4, 5, 6, 9, 10, 11, 12, 13, 14, 15, 16.
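The 16-block example can be reproduced with a short sketch. The function name `priority_order`, the correlation values, and the threshold of 0.5 are all hypothetical; the text only states that blocks 2, 7 and 8 exceed the threshold and that the horizontal scan order is kept inside both groups.

```python
from typing import Dict, List


def priority_order(scan_order: List[int],
                   correlation: Dict[int, float],
                   threshold: float) -> List[int]:
    """Blocks above the threshold come first; both groups keep the scan order."""
    prioritized = [b for b in scan_order if correlation[b] > threshold]
    remaining = [b for b in scan_order if correlation[b] <= threshold]
    return prioritized + remaining


# Horizontal scan over 16 blocks numbered 1..16 (illustrative values only).
scan = list(range(1, 17))
corr = {b: 0.1 for b in scan}
corr.update({2: 0.9, 7: 0.8, 8: 0.95})  # assumed correlation degrees

order = priority_order(scan, corr, threshold=0.5)
print(order)  # [2, 7, 8, 1, 3, 4, 5, 6, 9, 10, 11, 12, 13, 14, 15, 16]
```

Filtering the scan order twice, rather than sorting by correlation, is what preserves the original scan order within the prioritized group and within the remaining group.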
  • video analysis tasks include image classification, object detection, and pose estimation.
  • For an original image Pic0, distorted images Pic1, Pic2, Pic3, ...
  • the increase of the degree of distortion will reduce the accuracy of the visual analysis results.
  • the encoding parameters that have a greater impact on the distortion degree include quantization parameters (QP), target code rates, and the like.
  • When the degree of distortion increases to a certain threshold, the prediction of the image by the visual analysis model will completely fail. Therefore, in Pic1, Pic2, Pic3, ... Therefore, by trying different sets of encoding parameters and then analyzing the corresponding distorted image with a specific visual analysis process, it can be judged whether an accurate result is obtained. In theory, an optimal combination of parameters can be found that is just enough to keep the accuracy of the video analysis results within an acceptable range while ensuring the best data compression rate.
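The parameter search described above can be sketched as a sweep over candidate QP values. Everything named here is an assumption for illustration: `encode_decode` stands in for a real codec round trip, `analyze` for a real visual analysis model, and the early exit assumes that distortion grows monotonically with QP.

```python
from typing import Any, Callable, Iterable, Optional


def coarsest_safe_qp(original: Any,
                     encode_decode: Callable[[Any, int], Any],
                     analyze: Callable[[Any], Any],
                     qp_candidates: Iterable[int]) -> Optional[int]:
    """Return the largest QP whose distorted image still yields the same
    analysis result as the original, or None if no candidate does."""
    reference = analyze(original)
    best: Optional[int] = None
    for qp in sorted(qp_candidates):
        distorted = encode_decode(original, qp)
        if analyze(distorted) == reference:
            best = qp
        else:
            # Assumed monotonic: once the analysis fails, coarser QPs fail too.
            break
    return best


# Toy stand-ins: the "image" is one number, coding subtracts the QP from it,
# and the "analysis" is a simple threshold decision.
toy_codec = lambda image, qp: image - qp
toy_analysis = lambda image: image > 60
print(coarsest_safe_qp(100, toy_codec, toy_analysis, range(0, 52)))
```

With a real codec the same loop would be run per coding block or per image, and the monotonicity assumption would need to be checked rather than taken for granted.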
  • the coding block represents a video data block that implements intra-frame prediction coding.
  • the block method of VVC and HEVC is followed.
  • a frame of image is divided into one or more coded block rows and coded block columns, and each coded block is a rectangular area containing an integer number of CTUs.
  • CTUs are divided into CUs according to the quadtree division method, and each CU can be divided into PUs and TUs.
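The division of a frame into coded block rows and columns, each block holding an integer number of CTUs, can be sketched as follows. The function names and the even-split rule are assumptions; the text does not say how the CTU columns and rows are distributed among the blocks.

```python
from typing import List, Tuple

Rect = Tuple[int, int, int, int]  # (x, y, width, height) in luma samples


def split_evenly(total: int, parts: int) -> List[int]:
    """Split `total` CTUs into `parts` groups whose sizes differ by at most one."""
    base, extra = divmod(total, parts)
    return [base + (1 if i < extra else 0) for i in range(parts)]


def partition_frame(width: int, height: int, ctu_size: int,
                    block_cols: int, block_rows: int) -> List[Rect]:
    """Return coding-block rectangles in horizontal scan order."""
    ctus_x = -(-width // ctu_size)   # ceiling division
    ctus_y = -(-height // ctu_size)
    if not (1 <= block_cols <= ctus_x and 1 <= block_rows <= ctus_y):
        raise ValueError("each coding block needs at least one CTU")
    col_widths = split_evenly(ctus_x, block_cols)
    row_heights = split_evenly(ctus_y, block_rows)
    rects: List[Rect] = []
    y = 0
    for h in row_heights:
        x = 0
        for w in col_widths:
            x0, y0 = x * ctu_size, y * ctu_size
            # CTUs on the right and bottom picture edges may be cropped.
            rects.append((x0, y0,
                          min(w * ctu_size, width - x0),
                          min(h * ctu_size, height - y0)))
            x += w
        y += h
    return rects
```

For example, `partition_frame(1920, 1080, 128, 4, 4)` yields 16 rectangles that tile the frame, matching the 16-block layout used in the scanning example.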
  • a coding block, or sub-block, is used to represent a block of video data subjected to intra-frame predictive coding.
  • each of the encoding blocks has one corresponding correlation.
  • the determination of the correlation degree corresponding to each coding block includes: performing numerical modification on each coding block to determine the correlation degree between each coding block and the video analysis task.
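One way to read "numerical modification" is as a perturbation test: change the sample values of one block, rerun the analysis, and take the change in its output as that block's correlation degree. The sketch below follows that reading. The fill value, the absolute-difference measure, and the `score` callable are all assumptions, since the text does not specify the modification or the metric.

```python
from typing import Callable, Dict, List, Tuple

Frame = List[List[float]]         # rows of sample values
Rect = Tuple[int, int, int, int]  # (x, y, width, height)


def block_correlations(frame: Frame,
                       blocks: Dict[int, Rect],
                       score: Callable[[Frame], float],
                       fill: float = 0.0) -> Dict[int, float]:
    """Overwrite each block with `fill` and record how far the analysis
    score moves away from the score on the unmodified frame."""
    baseline = score(frame)
    result: Dict[int, float] = {}
    for block_id, (x, y, w, h) in blocks.items():
        modified = [row[:] for row in frame]  # copy so the input is untouched
        for r in range(y, y + h):
            for c in range(x, x + w):
                modified[r][c] = fill
        result[block_id] = abs(score(modified) - baseline)
    return result


# Toy analysis that only looks at the left half of a 2x4 frame.
toy_frame = [[1.0] * 4 for _ in range(2)]
toy_blocks = {1: (0, 0, 2, 2), 2: (2, 0, 2, 2)}
left_half_sum = lambda f: sum(v for row in f for v in row[:2])
print(block_correlations(toy_frame, toy_blocks, left_half_sum))
```

In this toy case block 1 gets a high correlation degree and block 2 gets zero, because the analysis never reads block 2.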
  • a method for determining priority order of video decoding based on correlation comparison comprising the following steps:
  • the video code stream is decoded by the decoder to obtain the corresponding image frames.
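On the decoder side, blocks arrive in priority order, so the encoding sequence indication information is needed to put each decoded block back in its place in the frame. A minimal sketch, assuming the indication is simply the list of 1-based block numbers in coding order (the actual bitstream syntax is not given in the text, and the function name is hypothetical):

```python
from typing import List, Sequence, TypeVar

T = TypeVar("T")


def restore_raster_order(coding_order: Sequence[int],
                         decoded_blocks: Sequence[T]) -> List[T]:
    """Place the i-th decoded block at the frame position named by the
    i-th entry of the coding order (block numbers start at 1)."""
    n = len(coding_order)
    if n != len(decoded_blocks):
        raise ValueError("coding order and block count differ")
    if sorted(coding_order) != list(range(1, n + 1)):
        raise ValueError("coding order must list every block exactly once")
    frame_blocks: List[T] = [decoded_blocks[0]] * n if n else []
    for block_id, payload in zip(coding_order, decoded_blocks):
        frame_blocks[block_id - 1] = payload
    return frame_blocks
```

For instance, `restore_raster_order([2, 3, 1], ["b", "c", "a"])` returns `["a", "b", "c"]`. A decoder on a limited channel could also start analysis as soon as the first prioritized blocks are in place, before the rest arrive.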
  • Compression coding is performed on all codes in the above sequence, and all possible video compression technologies, existing and future, can be utilized.
  • the framed data is compressed using an intra-frame coding technique, that is, at least including intra-frame prediction, DCT transformation, quantization, and entropy coding processes.
  • a device for determining video encoding priority based on correlation comparison comprising:
  • the block module is used to divide the entire image frame into one or more coding blocks, and each coding block is a rectangular area containing an integer number of CTUs;
  • a determining module configured to modify the values of each coding block to determine the correlation between each coding block and the video analysis task
  • a comparison module configured to preset a reference threshold, and compare the correlation corresponding to each coding block with the threshold
  • An encoding module configured to select encoding blocks whose correlation degrees exceed the reference threshold and encode them preferentially.
  • a device for determining video decoding priority based on correlation comparison comprising:
  • a conversion module configured to convert the encoding sequence indication information and video data into a video code stream
  • a decoding module configured to decode the video code stream to obtain the corresponding image frames.
  • the processor and the memory may be connected through a bus or in other ways, taking connection through a bus as an example.
  • the processor may be a central processing unit (Central Processing Unit, CPU).
  • the processor can also be another general-purpose processor, a digital signal processor (Digital Signal Processor, DSP), an application-specific integrated circuit (Application Specific Integrated Circuit, ASIC), a field-programmable gate array (Field-Programmable Gate Array, FPGA) or another programmable logic device, a discrete gate or transistor logic device, a discrete hardware component, or a combination of the above-mentioned types of chips.
  • the memory can be used to store non-transitory software programs, non-transitory computer-executable programs and modules, such as the cascaded progressive network in the embodiment of the present application.
  • the processor executes various functional applications and data processing of the processor by running non-transitory software programs/instructions and functional modules stored in the memory.
  • the memory may include a program storage area and a data storage area, wherein the program storage area may store an operating system and an application program required by at least one function; the data storage area may store data created by the processor, and the like.
  • the memory may include high-speed random access memory, and may also include non-transitory memory, such as at least one magnetic disk storage device, flash memory device, or other non-transitory solid-state storage devices.
  • the memory may optionally include memory located remotely from the processor, such remote memory may be connected to the processor via a network (eg, via a communication interface). Examples of the aforementioned networks include, but are not limited to, the Internet, intranets, local area networks, mobile communication networks, and combinations thereof.
  • An embodiment of the present application further provides a computer-readable storage medium, where computer-executable instructions are stored, and the computer-executable instructions are executed to perform the methods in the foregoing embodiments of the present application.
  • Computer-readable storage media include physical volatile and non-volatile, removable and non-removable media implemented in any manner or technology for storing information such as computer-readable instructions, data structures, program modules, or other data.
  • Computer-readable storage media specifically include, but are not limited to, U disk, mobile hard disk, read-only memory (ROM, Read-Only Memory), random access memory (RAM, Random Access Memory), erasable programmable read-only memory (EPROM), electrically erasable programmable read-only memory (EEPROM), flash or other solid-state memory technology, CD-ROM, Digital Versatile Disk (DVD), HD-DVD, Blu-ray or other optical storage device, tape, disk storage or other magnetic storage device, or any other medium that can be used to store the desired information and that can be accessed by a computer.
  • program modules include routines, programs, components, data structures, and other types of structures that perform particular tasks or implement particular abstract data types.
  • program modules may be located in both local and remote memory storage devices.
  • If the functions described above are realized in the form of software function units and sold or used as independent products, they can be stored in a computer-readable storage medium.
  • the technical solution of the present application, in essence or in the part that contributes to the prior art, can be embodied in the form of a software product. The computer software product is stored in a storage medium and includes several instructions that cause a computer device (which may be a personal computer, a server, or a network device, etc.) to execute all or part of the steps of the methods described in the various embodiments of the present application.
  • the method and device for determining the priority order of video encoding and decoding based on correlation degree comparison determine the correlation degree between each sub-block and the video analysis task according to the video analysis task, and determine the encoding sequence based on the correlation degree. The encoding sequence and the transmission sequence of the encoded data prioritize important data according to the needs of video analysis; video encoding is performed and the encoded data is converted into a video code stream, which keeps the accuracy of the video analysis results within an acceptable range while ensuring the best data compression rate.

Landscapes

  • Engineering & Computer Science (AREA)
  • Multimedia (AREA)
  • Signal Processing (AREA)
  • Computing Systems (AREA)
  • Theoretical Computer Science (AREA)
  • Compression Or Coding Systems Of Tv Signals (AREA)

Abstract

Disclosed are a method and a device for determining video encoding and decoding priority order based on correlation comparison, the method comprising: determining a corresponding correlation between each coding block and each video analysis task; determining, on the basis of the correlation, a coding block to be coded preferentially; and coding and/or transmitting the coding block. The present invention: determines, according to the requirements of a video analysis task, the correlation between sub-blocks and the video analysis task; determines a correlation-based coding order, the coding order and the transmission order of the coded data prioritizing important data according to the video analysis requirements; performs video coding; and converts the coded data into a video code stream, which makes it possible to guarantee that the accuracy of the video analysis results lies within an acceptable range while guaranteeing an optimal data compression rate.
PCT/CN2021/139138 2021-06-03 2021-12-17 Method and device for determining video encoding and decoding priority order based on correlation comparison WO2022252567A1 (fr)

Applications Claiming Priority (2)

Application Number Priority Date Filing Date Title
CN202110617182.7A CN113079375B (zh) 2021-06-03 2021-06-03 基于相关度比较确定视频编解码优先顺序的方法及装置
CN202110617182.7 2021-06-03

Publications (1)

Publication Number Publication Date
WO2022252567A1 (fr) 2022-12-08

Family

ID=76616963

Family Applications (1)

Application Number Title Priority Date Filing Date
PCT/CN2021/139138 WO2022252567A1 (fr) 2021-06-03 2021-12-17 Method and device for determining video encoding and decoding priority order based on correlation comparison

Country Status (2)

Country Link
CN (1) CN113079375B (fr)
WO (1) WO2022252567A1 (fr)

Families Citing this family (2)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN113079375B (zh) * 2021-06-03 2022-03-08 浙江智慧视频安防创新中心有限公司 基于相关度比较确定视频编解码优先顺序的方法及装置
CN113489983A (zh) * 2021-06-11 2021-10-08 浙江智慧视频安防创新中心有限公司 一种基于相关度比较确定分块编码参数的方法及装置

Citations (6)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
US6233283B1 (en) * 1998-06-23 2001-05-15 Lucent Technologies Inc. Layered video coding using perceptual coding criteria for error resilience in packet networks
CN101102495A (zh) * 2007-07-26 2008-01-09 武汉大学 一种基于区域的视频图像编解码方法和装置
CN101379831A (zh) * 2006-02-24 2009-03-04 三星电子株式会社 图像编码/解码方法和装置
US20100124274A1 (en) * 2008-11-17 2010-05-20 Cheok Lai-Tee Analytics-modulated coding of surveillance video
CN105306945A (zh) * 2014-07-10 2016-02-03 北京创鑫汇智科技发展有限责任公司 一种监控视频的可伸缩浓缩编码方法和装置
CN113079375A (zh) * 2021-06-03 2021-07-06 浙江智慧视频安防创新中心有限公司 基于相关度比较确定视频编解码优先顺序的方法及装置

Family Cites Families (3)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
WO2013181821A1 (fr) * 2012-06-07 2013-12-12 Mediatek Singapore Pte. Ltd. Mode saut de transformation intra amélioré
CN104853196B (zh) * 2014-02-18 2018-10-19 华为技术有限公司 编解码方法和装置
CN108076345A (zh) * 2016-11-09 2018-05-25 阿里巴巴集团控股有限公司 多视角视频帧的编码方法、传输方法、装置、计算机


Also Published As

Publication number Publication date
CN113079375A (zh) 2021-07-06
CN113079375B (zh) 2022-03-08

Similar Documents

Publication Publication Date Title
US11936884B2 (en) Coded-block-flag coding and derivation
US11412229B2 (en) Method and apparatus for video encoding and decoding
WO2022252567A1 (fr) Method and device for determining video encoding and decoding priority order based on correlation comparison
US20190246107A1 (en) Encoding and decoding methods and corresponding devices
US20160323600A1 (en) Methods and Apparatus for Use of Adaptive Prediction Resolution in Video Coding
CN109688407B (zh) 编码单元的参考块选择方法、装置、电子设备及存储介质
CN111316642B (zh) 信令图像编码和解码划分信息的方法和装置
WO2020001325A1 (fr) Procédé d'encodage d'image, procédé de décodage, encodeur, décodeur et support de stockage
WO2021004153A1 (fr) Procédé de prédiction d'image, codeur, décodeur, et support d'informations
WO2019085942A1 (fr) Procédé de prise de décision de mode sao, appareil, dispositif électronique et support de stockage lisible
US20190045185A1 (en) Coding tools for subjective quality improvements in video codecs
WO2021037041A1 (fr) Procédé et appareil de décodage de données, et procédé et appareil de codage de données
CN107454414B (zh) 图像编码/解码方法及相关装置
MX2012004747A (es) Codificacion de graficos integrados: corriente de bits reclasificada para descodificacion paralela.
US11778172B2 (en) Colour component prediction method, encoder, decoder and storage medium
CN110996127B (zh) 图像编解码方法、设备及系统
CN111432213A (zh) 用于视频和图像压缩的自适应贴片数据大小编码
JP7437426B2 (ja) インター予測方法および装置、機器、記憶媒体
US11647196B2 (en) Method and apparatus for encoding image, method and apparatus for decoding image, electronic device, and system
CN108683915B (zh) 一种写入dQP值的方法、装置及电子设备
WO2022166370A1 (fr) Procédé et appareil de codage et de décodage vidéo, produit-programme informatique, support de stockage lisible par ordinateur et dispositif électronique
US10045022B2 (en) Adaptive content dependent intra prediction mode coding
US20210344924A1 (en) Method for processing information, encoder and decoder
CN116982262A (zh) 视频编码中依赖性量化的状态转换
CN109672889A (zh) 约束的序列数据头的方法及装置

Legal Events

Date Code Title Description
121 Ep: the epo has been informed by wipo that ep was designated in this application

Ref document number: 21943919

Country of ref document: EP

Kind code of ref document: A1

NENP Non-entry into the national phase

Ref country code: DE

122 Ep: pct application non-entry in european phase

Ref document number: 21943919

Country of ref document: EP

Kind code of ref document: A1