WO2021068341A1 - Code rate configuration method, system and device for video frame - Google Patents

Code rate configuration method, system and device for video frame Download PDF

Info

Publication number
WO2021068341A1
WO2021068341A1 PCT/CN2019/118056 CN2019118056W WO2021068341A1 WO 2021068341 A1 WO2021068341 A1 WO 2021068341A1 CN 2019118056 W CN2019118056 W CN 2019118056W WO 2021068341 A1 WO2021068341 A1 WO 2021068341A1
Authority
WO
WIPO (PCT)
Prior art keywords
video frame
macroblock
amount
coded data
encoded data
Prior art date
Application number
PCT/CN2019/118056
Other languages
French (fr)
Chinese (zh)
Inventor
林鹏程
黄学辉
Original Assignee
网宿科技股份有限公司
Priority date (The priority date is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the date listed.)
Filing date
Publication date
Application filed by 网宿科技股份有限公司 filed Critical 网宿科技股份有限公司
Publication of WO2021068341A1 publication Critical patent/WO2021068341A1/en

Links

Images

Classifications

    • HELECTRICITY
    • H04ELECTRIC COMMUNICATION TECHNIQUE
    • H04NPICTORIAL COMMUNICATION, e.g. TELEVISION
    • H04N19/00Methods or arrangements for coding, decoding, compressing or decompressing digital video signals
    • H04N19/10Methods or arrangements for coding, decoding, compressing or decompressing digital video signals using adaptive coding
    • H04N19/169Methods or arrangements for coding, decoding, compressing or decompressing digital video signals using adaptive coding characterised by the coding unit, i.e. the structural portion or semantic portion of the video signal being the object or the subject of the adaptive coding
    • H04N19/17Methods or arrangements for coding, decoding, compressing or decompressing digital video signals using adaptive coding characterised by the coding unit, i.e. the structural portion or semantic portion of the video signal being the object or the subject of the adaptive coding the unit being an image region, e.g. an object
    • H04N19/176Methods or arrangements for coding, decoding, compressing or decompressing digital video signals using adaptive coding characterised by the coding unit, i.e. the structural portion or semantic portion of the video signal being the object or the subject of the adaptive coding the unit being an image region, e.g. an object the region being a block, e.g. a macroblock
    • HELECTRICITY
    • H04ELECTRIC COMMUNICATION TECHNIQUE
    • H04NPICTORIAL COMMUNICATION, e.g. TELEVISION
    • H04N19/00Methods or arrangements for coding, decoding, compressing or decompressing digital video signals
    • H04N19/50Methods or arrangements for coding, decoding, compressing or decompressing digital video signals using predictive coding
    • H04N19/587Methods or arrangements for coding, decoding, compressing or decompressing digital video signals using predictive coding involving temporal sub-sampling or interpolation, e.g. decimation or subsequent interpolation of pictures in a video sequence

Definitions

  • the present invention relates to the technical field of image processing, in particular to a method, system and equipment for configuring the code rate of video frames.
  • these coding standards may include JPEG, JVT, MPEG, H.26X, and so on.
  • certain encoding standards can allocate different amounts of encoded data to different video frames based on the complexity of each video frame and the reference weight of each video frame.
  • this encoding method can improve the image quality of some of the video frames, it sacrifices the image quality of other video frames.
  • the image quality will fluctuate. Therefore, such an encoding method cannot achieve a better image quality improvement effect.
  • the purpose of this application is to provide a method, system and device for configuring the bit rate of a video frame, which can have a better image quality improvement effect.
  • one aspect of the present application provides a method for configuring the code rate of a video frame, the video frame is divided into a plurality of macro blocks; the method includes: establishing the encoding data amount of the macro block and the configuration parameters of the macro block The mapping relationship between each of the macroblocks; according to the mapping relationship and the initial configuration parameters of each macroblock in the video frame, determine the actual coded data volume of each macroblock; wherein the actual coded data of each macroblock The amount constitutes the actual amount of encoded data of the video frame; identifying the target area in the video frame, and keeping the actual amount of encoded data of the video frame unchanged, according to the image quality improvement coefficient of the target area And re-determine the coded data amount and configuration parameters of each macro block in the video frame, so as to encode the video frame through the re-determined configuration parameters of each macro block.
  • another aspect of the present application also provides a system for configuring the code rate of a video frame, the video frame is divided into a plurality of macro blocks; the system includes: a mapping relationship establishment unit for establishing a macro block The mapping relationship between the amount of encoded data and the configuration parameters of the macroblock; the encoded data amount determining unit is configured to determine each macroblock according to the mapping relationship and the initial configuration parameter of each macroblock in the video frame The actual amount of encoded data; wherein the actual amount of encoded data of each macroblock constitutes the actual amount of encoded data of the video frame; the encoding unit is used to identify the target area in the video frame and keep the video When the actual coded data volume of the frame remains unchanged, the coded data volume and configuration parameters of each macroblock in the video frame are re-determined according to the image quality improvement coefficient of the target area to pass each macroblock The re-determined configuration parameters encode the video frame.
  • another aspect of the present application also provides a video frame rate configuration device, the device includes a processor and a memory, the memory is used to store a computer program, the computer program is executed by the processor When the above-mentioned video frame rate configuration method is realized.
  • the technical solutions provided by one or more embodiments of the present application can pre-establish the mapping relationship between the amount of encoded data of the macro block and the configuration parameters.
  • the amount of coded data is usually fixed within a limited range, so it is possible to establish an accurate mapping relationship with the configuration parameters for the amount of coded data within this range.
  • the video frame can be divided into multiple different macroblocks.
  • the initial configuration parameters can be substituted into the above mapping relationship, so as to determine the actual coded data amount of each macro block.
  • the image quality is improved, the target area that the human eye pays attention to can be identified in the video frame according to the principle of human vision. The image quality can be improved for the content in the target area, and the image quality can be appropriately reduced for the content outside the target area.
  • the encoded data amount and configuration parameters of each macroblock in the video frame can be re-determined while keeping the actual encoded data amount of the video frame unchanged.
  • each macro block in the video frame can be coded, thereby completing the process of image quality improvement.
  • the image quality can be improved for the target area that the human eye pays attention to, and the image quality can be appropriately reduced for the area that the human eye does not pay attention to.
  • it can ensure that the actual coded data volume of the video frame will not change significantly, on the other hand, it can ensure that the user will always present a higher picture quality to the area of his concern when watching, thus having a higher picture quality improvement effect.
  • FIG. 1 is a step diagram of a method for configuring a bit rate of a video frame in an embodiment of the present invention
  • Fig. 2 is a schematic diagram of a mapping relationship model in an embodiment of the present invention.
  • FIG. 3 is a schematic diagram of a code rate configuration system of a video frame in an embodiment of the present invention.
  • Fig. 4 is a schematic structural diagram of a video frame rate configuration device in an embodiment of the present invention.
  • the code rate configuration method of the video frame can adjust the amount of encoded data of the macroblock (MacroBlock) in the video frame, so as to achieve the effect that the macroblocks in different regions have different amounts of encoded data.
  • the video frame may be divided into multiple macroblocks.
  • the divided macroblocks usually have the same size.
  • the divided macroblocks may also have different sizes according to different regions, which is not limited in this application.
  • each macro block in the video frame can be mathematically transformed based on the residual error with the reference macro block, so as to obtain the residual transform (SATD, Sum of Absolute) of each macro block. Transformed Difference) value.
  • the respective pixel matrices of the current macroblock and the reference macroblock can be extracted, and the two pixel matrices can be subtracted to obtain the residuals of the two macroblocks.
  • the residual can be subjected to hadamad transformation, and the obtained value can be used as the SATD value of the current macroblock.
  • the quantization offset parameter can be used to save the number of bytes after encoding.
  • the quantization offset parameter of each macroblock can be calculated by means of the existing coding standard. For example, the complexity of each macro block can be analyzed by the MB-Tree algorithm that comes with x264, so as to calculate the quantization offset parameter of each macro block.
  • the above-mentioned residual transform value and quantization offset parameter can be used as the configuration parameter of the macro block.
  • a method for configuring a bit rate of a video frame may include the following multiple steps.
  • S1 Establish a mapping relationship between the amount of coded data of the macro block and the configuration parameters of the macro block.
  • the configuration parameters of the macro block may include the above-mentioned residual transformation value and quantization offset parameter.
  • the SATD value and quantization offset parameter of the macro block the actual coded data amount of the macro block can be calculated.
  • the mapping relationship between the amount of coded data, the SATD value, and the quantization bias parameter can be refitted in advance for a scene with a low bit rate through a machine learning method.
  • the video frame samples within the specified bit rate range can be obtained.
  • the specified code rate range may be a range with a lower code rate.
  • the specified code rate range may be a range of 100kbps to 300kpbs.
  • the video frame samples within the specified bit rate range can all be video frames that have been encoded.
  • the coded data amount, SATD value, and quantization offset parameter of each macroblock in the video frame sample can be identified, and subsequently, it can be based on the coded data amount, SATD value and quantization obtained by the recognition.
  • the bias parameter training obtains the prediction model.
  • different methods can be used to obtain the final predictive model. For example, algorithms such as Support Vector Machine (SVM) and logistic regression can be used to use the recognized SATD value and quantization bias parameter as input data, and compare the corresponding amount of encoded data with the output result to determine The error between the actual amount of encoded data and the output result.
  • SVM Support Vector Machine
  • logistic regression can be used to use the recognized SATD value and quantization bias parameter as input data, and compare the corresponding amount of encoded data with the output result to determine The error between the actual amount of encoded data and the output result.
  • the input of the prediction model may be the SATD value and the quantization offset parameter of the macro block, and the output may be the amount of coded data of the macro block obtained by prediction.
  • the mathematical relationship between the input data and the output result of the prediction model can be expressed by a certain formula, and the mathematical relationship can be used as the mapping relationship established in step S1.
  • the mathematical expression of the predictive model can be as follows:
  • B i represents the sample video frame coded data amount of the i-th macroblock
  • S i represents the i-th macroblock SATD value
  • Q i denotes the quantization parameter bias the i-th macroblock
  • ⁇ (S i ,Q i ) represents the kernel function with SATD value and quantization bias parameter as independent variables
  • w represents the linear coefficient of the kernel function
  • b represents the undetermined constant.
  • the residual transformation value of the macroblock and the quantization offset parameter can be used as the independent variables of the kernel function, and the kernel function can also have linear coefficients, and The undetermined constant is also included in the mapping relationship.
  • the above-mentioned expression representing the mapping relationship can be converted into a minimum value problem with constraint conditions, and the expression of the minimum value problem can be as follows:
  • ⁇ i and ⁇ i * can be used as slack variables of the i-th macroblock
  • C can be used as a penalty factor
  • l represents the number of macroblocks contained in the current video frame sample.
  • both the expression characterizing the minimum value problem and the expression characterizing the constraint condition may include slack variables, and the expression characterizing the minimum value problem may also include a penalty factor.
  • Lagrangian multipliers can be introduced into the above-mentioned minimum problem, thereby converting the above-mentioned minimum problem into a dual problem, and by solving the dual problem, the above linear coefficients can be obtained And the assignment of undetermined constants.
  • the above-mentioned minimum problem can be transformed into the following dual form:
  • is the real part of the slack variable
  • y i is the actual amount of coded data of the i-th macroblock.
  • the kernel function can use the radial basis kernel function, and the expression of the radial basis kernel function can be as follows:
  • represents the fitting parameter
  • the specific assignment of the Lagrange multiplier is obtained by solving the above-mentioned dual problem.
  • the specific assignment of the Lagrangian multiplier can be used to deduct the linear coefficient and Assignment of undetermined constants.
  • the mathematical expression of the predictive model can be obtained.
  • the data expression can be used as the established mapping relationship.
  • the mapping relationship can be used as the relationship between the amount of encoded data, the SATD value and the quantization bias parameter at low bit rates. Mapping relations.
  • S3 Determine the actual coded data volume of each macroblock according to the mapping relationship and the initial configuration parameters of each macroblock in the video frame; wherein the actual coded data volume of each macroblock constitutes the The actual amount of encoded data of the video frame.
  • the initial configuration parameters of the macroblock may refer to the initial SATD value and the initial quantization offset parameter.
  • the initial SATD value and the initial quantization offset parameter of each macroblock in the video frame to be encoded can be substituted into the above
  • the mapping relationship the actual coded data amount of each macro block is determined. Since the above mapping relationship is obtained based on training in a low bit rate scenario, the actual coded data amount determined according to the above mapping relationship is more accurate. After the actual coded data volume of each macroblock is obtained, the sum of the actual coded data volume of these macroblocks can be used as the actual coded data volume of the video frame.
  • S5 Identify the target area in the video frame, and re-determine each target area in the video frame according to the image quality improvement coefficient of the target area while keeping the actual amount of encoded data of the video frame unchanged.
  • the coded data amount and configuration parameters of the macroblocks are used to encode the video frame with the configuration parameters newly determined by each macroblock.
  • the image quality of the area that the user pays attention to in the video frame can be enhanced, and for the area that the user does not pay attention to, the image quality can be appropriately reduced, so as to keep the overall amount of encoded data of the video frame unchanged, and improve the user’s The viewing experience.
  • the target area of the user's attention in the video frame can be identified.
  • a fixed area in the video frame may be used as the target area, and the fixed area may be, for example, the central area of the video frame.
  • the target area in the video frame can also be determined according to the focus information of the video frame.
  • the content mainly displayed in the video frame tends to have high definition, and the definition can be expressed by different focus information. Therefore, the main display content and other parts of the content can have different focus information.
  • the target area of the user's attention in the video frame can be determined.
  • the target area in the video frame can also be determined by the target tracking algorithm.
  • feature extraction can be performed first, so as to extract one or more of the gray feature, color feature, texture feature, Haar-like rectangle feature, interest point feature, and super pixel feature in the video frame. Then, the motion state of the target object in multiple adjacent video frames is analyzed through the motion model, so as to determine the candidate area where the target object is located.
  • the motion model may be one of a mean shift (Mean shift), a sliding window (Slide window), a Kalman filter (Kalman Filtering), a particle filter (Particle Filtering) and other models.
  • the Appearance Model can be used to determine whether there is a tracked target object in the candidate area of the current video frame.
  • the candidate area can be used as the target area of the video frame.
  • the candidate area can be used as the target area of the video frame.
  • the above examples in this embodiment are only possible implementations, and it does not mean that the solution of this application can only be applied to the above implementations. .
  • an image quality improvement coefficient may be determined in advance.
  • the image quality improvement coefficient may be determined based on the current coded data amount of the macroblock in the target area and the improved coded data amount. For example, if the current coded data volume is 100kb, and the improved coded data volume needs to reach 200kb, then the image quality improvement factor can be 2. After the image quality improvement coefficient is obtained, the amount of encoded data and the residual transformation value of each of the macroblocks in the video frame may be re-determined while keeping the actual amount of encoded data of the video frame unchanged.
  • the target macroblock located in the target area can be determined in each macroblock of the video frame, and the image quality can be enhanced for this part of the target macroblock. Therefore, the product of the actual coded data amount of the target macroblock and the image quality improvement coefficient may be used as the coded data amount after the target macroblock is re-determined. In this way, the amount of encoded data after the target area is re-determined It can be determined by the following formula:
  • r is the image quality improvement coefficient
  • ROI Region Of Interest, region of interest
  • CS Total represents the actual amount of encoded data of the video frame
  • the difference It can be expressed as the sum of the amount of coded data in areas other than the target area after the image quality is adjusted. In this way, the number of other macroblocks located outside the target area in the video frame can be determined, and the difference The ratio between the determined quantity and the determined quantity is used as the coded data quantity after the other macroblocks are re-determined.
  • the new SATD value of each macro block can be further determined.
  • the product of the initial residual transformation value of the target macroblock and the image quality improvement coefficient may be used as the residual transformation value after the target macroblock is re-determined.
  • the SATD value after the target macroblock is re-determined It can be as follows:
  • S i represents the i-th macroblock (macroblock if the target macroblock is a) an initial SATD value.
  • the calculated difference can be
  • the ratio of the CS NROI to the actual coded data volume of the other macroblocks is used as the image quality adjustment coefficient of the other macroblocks, and the initial residual transformation value of the other macroblocks is compared with the image quality adjustment coefficient
  • the product of is used as the residual transform value after the other macroblocks are re-determined.
  • the image quality adjustment coefficients of other macroblocks can be as follows:
  • S i represents the i-th macroblock (if the macroblock is another macroblock) initial SATD value.
  • the re-determined coded data amount and SATD value can be substituted into the mapping relationship, so as to calculate the re-determined quantization offset parameter of the macroblock.
  • the macroblocks in the video frame can be coded, so as to obtain the coded video frame that meets the expected bit rate.
  • the mapping relationship established in this application can be represented by the model shown in FIG. 2.
  • the SATD value and the quantization offset parameter can be used as input data to predict the amount of coded data of the macroblock.
  • the expected coded data volume of the macroblock is limited, and the SATD value of the macroblock is set, then the actual quantization offset parameter of the macroblock can also be derived from the model. In this way, by performing model training on video frame samples in low bit rate scenarios, the quantization relationship between the amount of encoded data and the improvement of image quality can be established, and the bit rate allocation of the ROI in the video can be dynamically realized.
  • the present application also provides a code rate configuration system for a video frame, where the video frame is divided into multiple macroblocks; the system includes:
  • a mapping relationship establishing unit for establishing a mapping relationship between the amount of coded data of the macro block and the configuration parameters of the macro block;
  • the coded data amount determining unit is configured to determine the actual coded data amount of each macroblock according to the mapping relationship and the initial configuration parameters of each macroblock in the video frame; wherein, the actual amount of each macroblock is The amount of encoded data constitutes the actual amount of encoded data of the video frame;
  • the encoding unit is used to identify the target area in the video frame, and to re-determine the video frame according to the image quality improvement coefficient of the target area while keeping the actual amount of encoded data of the video frame unchanged
  • the coded data amount and configuration parameters of each macroblock in encode the video frame with the configuration parameters newly determined by each macroblock.
  • the configuration parameters include residual transform values and quantization offset parameters;
  • the coding unit includes:
  • a parameter re-determination module configured to re-determine the amount of encoded data and the residual transformation value of each macroblock in the video frame
  • the quantization offset parameter calculation module is used to calculate the adjusted quantization offset parameter of each macro block according to the mapping relationship and the re-determined coded data amount and residual transformation value of each macro block to pass The re-determined residual transform value of each macroblock and the calculated quantization offset parameter encode the video frame.
  • mapping relationship establishment unit includes:
  • the sample processing module is used to obtain video frame samples within a specified code rate range, and divide the video frame samples into multiple macro blocks;
  • the training module is used to identify the amount of encoded data, the residual transformation value and the quantization offset parameter of each macroblock in the video frame sample, and train based on the encoded data amount, the residual transformation value and the quantization offset parameter obtained by the recognition Predictive model
  • the relationship establishment module is used to use the mathematical relationship between the input data and the output result of the prediction model as a mapping between the coded data amount of the macroblock and the residual transformation value and quantization offset parameter of the macroblock. relationship.
  • the parameter re-determination unit includes:
  • the image quality improvement module is used to determine the target macroblock located in the target area in each macroblock of the video frame, and to multiply the actual coded data amount of the target macroblock and the image quality improvement coefficient , As the amount of coded data after the target macroblock is re-determined;
  • a difference calculation module which is used to calculate the sum of the amount of encoded data after each of the target macroblocks is re-determined, and calculate the difference between the actual amount of encoded data of the video frame and the sum of the amount of encoded data;
  • the image quality adjustment module is used to determine the number of other macroblocks located outside the target area in the video frame, and re-determine the ratio between the difference and the determined number as the other macroblocks The amount of encoded data afterwards.
  • the parameter re-determination unit further includes:
  • a residual transformation value improvement module configured to multiply the initial residual transformation value of the target macroblock and the image quality improvement coefficient as the residual transformation value after the target macroblock is re-determined
  • the residual transform value adjustment module is configured to use the calculated ratio of the difference value and the sum of the actual coded data amount of the other macroblocks as the image quality adjustment coefficient of the other macroblocks, and adjust the other The product of the initial residual transformation value of the macroblock and the image quality adjustment coefficient is used as the residual transformation value after the other macroblocks are re-determined.
  • this application also provides a video frame rate configuration device, the device includes a processor and a memory, the memory is used to store a computer program, when the computer program is executed by the processor, The above-mentioned method for configuring the code rate of the video frame is realized.
  • the memory may include a physical device for storing information, which is usually digitized and then stored in a medium using electrical, magnetic, or optical methods.
  • the memory described in this embodiment may also include: a device that uses electrical energy to store information, such as RAM or ROM, etc.; a device that uses magnetic energy to store information, such as hard disk, floppy disk, magnetic tape, magnetic core memory, bubble memory, or U disk ; A device that uses optical means to store information, such as CD or DVD.
  • a device that uses electrical energy to store information such as RAM or ROM, etc.
  • a device that uses magnetic energy to store information such as hard disk, floppy disk, magnetic tape, magnetic core memory, bubble memory, or U disk
  • a device that uses optical means to store information such as CD or DVD.
  • quantum memory or graphene memory there are other types of memory, such as quantum memory or graphene memory.
  • the processor can be implemented in any suitable manner.
  • the processor may take the form of, for example, a microprocessor or a processor, and a computer-readable medium storing computer-readable program codes (for example, software or firmware) executable by the (micro)processor, logic gates, switches, dedicated integrated circuits, etc. Circuit (Application Specific Integrated Circuit, ASIC), programmable logic controller and embedded microcontroller form, etc.
  • ASIC Application Specific Integrated Circuit
  • ASIC Application Specific Integrated Circuit
  • the technical solutions provided by one or more embodiments of the present application can pre-establish the mapping relationship between the amount of encoded data of the macro block and the configuration parameters.
  • the amount of coded data is usually fixed within a limited range, so it is possible to establish an accurate mapping relationship with the configuration parameters for the amount of coded data within this range.
  • the video frame can be divided into multiple different macroblocks.
  • the initial configuration parameters can be substituted into the above mapping relationship, so as to determine the actual coded data amount of each macro block.
  • the image quality is improved, the target area that the human eye pays attention to can be identified in the video frame according to the principle of human vision. The image quality can be improved for the content in the target area, and the image quality can be appropriately reduced for the content outside the target area.
  • the encoded data amount and configuration parameters of each macroblock in the video frame can be re-determined while keeping the actual encoded data amount of the video frame unchanged.
  • each macro block in the video frame can be coded, thereby completing the process of image quality improvement.
  • the image quality can be improved for the target area that the human eye pays attention to, and the image quality can be appropriately reduced for the area that the human eye does not pay attention to.
  • it can ensure that the actual coded data volume of the video frame will not change significantly, on the other hand, it can ensure that the user will always present a higher picture quality to the area of his concern when watching, thus having a higher picture quality improvement effect.
  • the embodiments of the present invention can be provided as a method, a system, or a computer program product. Therefore, the present invention may adopt a form of a complete hardware implementation, a complete software implementation, or a combination of software and hardware implementations. Moreover, the present invention may adopt the form of a computer program product implemented on one or more computer-usable storage media (including but not limited to disk storage, CD-ROM, optical storage, etc.) containing computer-usable program codes.
  • computer-usable storage media including but not limited to disk storage, CD-ROM, optical storage, etc.
  • These computer program instructions can also be stored in a computer-readable memory that can guide a computer or other programmable data processing equipment to work in a specific manner, so that the instructions stored in the computer-readable memory produce an article of manufacture including the instruction device.
  • the device implements the functions specified in one process or multiple processes in the flowchart and/or one block or multiple blocks in the block diagram.
  • These computer program instructions can also be loaded on a computer or other programmable data processing equipment, so that a series of operation steps are executed on the computer or other programmable equipment to produce computer-implemented processing, so as to execute on the computer or other programmable equipment.
  • the instructions provide steps for implementing the functions specified in one process or multiple processes in the flowchart and/or one block or multiple blocks in the block diagram.
  • the computing device includes one or more processors (CPUs), input/output interfaces, network interfaces, and memory.
  • processors CPUs
  • input/output interfaces network interfaces
  • memory volatile and non-volatile memory
  • the memory may include non-permanent memory in a computer readable medium, random access memory (RAM) and/or non-volatile memory, such as read-only memory (ROM) or flash memory (flash RAM). Memory is an example of computer readable media.
  • RAM random access memory
  • ROM read-only memory
  • flash RAM flash memory
  • Computer-readable media include permanent and non-permanent, removable and non-removable media, and information storage can be realized by any method or technology.
  • the information can be computer-readable instructions, data structures, program modules, or other data.
  • Examples of computer storage media include, but are not limited to, phase change memory (PRAM), static random access memory (SRAM), dynamic random access memory (DRAM), other types of random access memory (RAM), read-only memory (ROM), electrically erasable programmable read-only memory (EEPROM), flash memory or other memory technology, CD-ROM, digital versatile disc (DVD) or other optical storage, Magnetic cassettes, magnetic tape magnetic disk storage or other magnetic storage devices or any other non-transmission media can be used to store information that can be accessed by computing devices. According to the definition in this article, computer-readable media does not include transitory media, such as modulated data signals and carrier waves.

Landscapes

  • Engineering & Computer Science (AREA)
  • Multimedia (AREA)
  • Signal Processing (AREA)
  • Compression Or Coding Systems Of Tv Signals (AREA)

Abstract

Disclosed are a code rate configuration method, system and device for a video frame. The method comprises: establishing a mapping relationship between an encoded data volume of a macro block and configuration parameters of the macro block; determining actual encoded data volumes of macro blocks according to the mapping relationship and initial configuration parameters of the macro blocks in a video frame, wherein the actual encoded data volumes of the macro blocks constitute an actual encoded data volume of the video frame; and recognizing a target region in the video frame, and when the actual encoded data volume of the video frame is kept unchanged, redetermining, according to a picture quality improving coefficient of the target region, encoded data volumes and configuration parameters of the macro blocks in the video frame, so as to encode the video frame by means of the redetermined configuration parameters of the macro blocks. The technical solution provided in the present application can achieve a relatively good picture quality improving effect.

Description

一种视频帧的码率配置方法、系统及设备Code rate configuration method, system and equipment of video frame 技术领域Technical field
本发明涉及图像处理技术领域,特别涉及一种视频帧的码率配置方法、系统及设备。The present invention relates to the technical field of image processing, in particular to a method, system and equipment for configuring the code rate of video frames.
背景技术Background technique
随着视频画质的提升,视频的编码数据量也在不断增长。然而,有限的网络带宽,使得视频画质不可能无限提升。因此,在有限的带宽或者有限的编码数据量的前提下,如何尽可能地提高视频的画质成为了一个亟待解决的问题。With the improvement of video quality, the amount of video coded data is also increasing. However, the limited network bandwidth makes it impossible to improve the video quality indefinitely. Therefore, under the premise of limited bandwidth or limited amount of encoded data, how to improve the image quality of the video as much as possible has become an urgent problem to be solved.
随着图像处理技术的发展,出现了多种不同的编码标准。例如,这些编码标准可以包括JPEG、JVT、MPEG、H.26X等。在进行视频编码时,某些编码标准可以针对每个视频帧的复杂度以及每个视频帧的参考权重,从而为不同的视频帧分配不同的编码数据量。这样的编码方式,虽然能够提升其中部分视频帧的画质,但却牺牲了其它视频帧的画质。用户在观看视频时,会产生画质忽高忽低的感觉。因此,这样的编码方式无法达到较好的画质提升效果。With the development of image processing technology, many different coding standards have emerged. For example, these coding standards may include JPEG, JVT, MPEG, H.26X, and so on. During video encoding, certain encoding standards can allocate different amounts of encoded data to different video frames based on the complexity of each video frame and the reference weight of each video frame. Although this encoding method can improve the image quality of some of the video frames, it sacrifices the image quality of other video frames. When the user is watching the video, the image quality will fluctuate. Therefore, such an encoding method cannot achieve a better image quality improvement effect.
发明内容Summary of the invention
本申请的目的在于提供一种视频帧的码率配置方法、系统及设备,能够具备较好的画质提升效果。The purpose of this application is to provide a method, system and device for configuring the bit rate of a video frame, which can have a better image quality improvement effect.
为实现上述目的,本申请一方面提供一种视频帧的码率配置方法,所述视频帧被划分为多个宏块;所述方法包括:建立宏块的编码数据量与宏块的配置参数之间的映射关系;根据所述映射关系和所述视频帧中各个所述宏块的初始配置参数,确定各个所述宏块的实际编码数据量;其中,各个所述宏块的实际编码数据量构成所述视频帧的实际编码数据量;识别所述视频帧中的目标区域,并在保持所述视频帧的实际编码数据量不变的情况下,根据所述目标区域的画质改善系数,重新确定所述视频帧中各个所述宏块的编码数据量和配置参数,以通过各个所述宏块重新确定的配置参数对所述视频帧进行编码。In order to achieve the above objective, one aspect of the present application provides a method for configuring the code rate of a video frame, the video frame is divided into a plurality of macro blocks; the method includes: establishing the encoding data amount of the macro block and the configuration parameters of the macro block The mapping relationship between each of the macroblocks; according to the mapping relationship and the initial configuration parameters of each macroblock in the video frame, determine the actual coded data volume of each macroblock; wherein the actual coded data of each macroblock The amount constitutes the actual amount of encoded data of the video frame; identifying the target area in the video frame, and keeping the actual amount of encoded data of the video frame unchanged, according to the image quality improvement coefficient of the target area And re-determine the coded data amount and configuration parameters of each macro block in the video frame, so as to encode the video frame through the re-determined configuration parameters of each macro block.
为实现上述目的,本申请另一方面还提供一种视频帧的码率配置系统,所述视频帧被划分为多个宏块;所述系统包括:映射关系建立单元,用于建立宏块的编码数据量与宏块的配置参数之间的映射关系;编码数据量确定单元,用于根据所述映射关系和所述视频帧中各个所述宏块的初始配置参数,确定各个所述宏块的实际编码数据量;其中,各个所述宏块的实际编码数据量构成所述视频帧的实际编码数据量;编码单元,用于识别所述视频帧中的目标区域,并在保持所述视频帧的实际编码数据量不变的情况下,根据所述目标区域的画质改善系数,重新确定所述视频帧中各个所述宏块的编码数据量和配置参数,以通过各个所述宏块重新确定的配置参数对所述视频帧进行编码。In order to achieve the above-mentioned objective, another aspect of the present application also provides a system for configuring the code rate of a video frame, the video frame is divided into a plurality of macro blocks; the system includes: a mapping relationship establishment unit for establishing a macro block The mapping relationship between the amount of encoded data and the configuration parameters of the macroblock; the encoded data amount determining unit is configured to determine each macroblock according to the mapping relationship and the initial configuration parameter of each macroblock in the video frame The actual amount of encoded data; wherein the actual amount of encoded data of each macroblock constitutes the actual amount of encoded data of the video frame; the encoding unit is used to identify the target area in the video frame and keep the video When the actual coded data volume of the frame remains unchanged, the coded data volume and configuration parameters of each macroblock in the video frame are re-determined according to the image quality improvement coefficient of the target area to pass each macroblock The re-determined configuration parameters encode the video frame.
为实现上述目的,本申请另一方面还提供一种视频帧的码率配置设备,所述设备包括处理器和存储器,所述存储器用于存储计算机程序,所述计算机程序被所述处理器执行时,实现上述的视频帧的码率配置方法。In order to achieve the above objective, another aspect of the present application also provides a video frame rate configuration device, the device includes a processor and a memory, the memory is used to store a computer program, the computer program is executed by the processor When the above-mentioned video frame rate configuration method is realized.
由上可见,本申请一个或者多个实施方式提供的技术方案,可以预先建立宏块的编码数据量与配置参数之间的映射关系。在实际应用中,编码数据量通常是固定在一个有限的范围内,因此可以针对该范围内的编码数据量,建立与配置参数之间精确的映射关系。在对视频帧进行编码时,可以将视频帧划分为多个不同的宏块。针对每个宏块,可以将初始的配置参数代入上述的映射关系,从而确定出各个宏块的实际编码数据量。在进行画质改善时,可以根据人眼视觉原理,在视频帧中识别出人眼关注的目标区域。针对目标区域内的内容可以进行画质提升,而针对目标区域外的内容,则可以适当降低画质。It can be seen from the above that the technical solutions provided by one or more embodiments of the present application can pre-establish the mapping relationship between the amount of encoded data of the macro block and the configuration parameters. In practical applications, the amount of coded data is usually fixed within a limited range, so it is possible to establish an accurate mapping relationship with the configuration parameters for the amount of coded data within this range. When encoding a video frame, the video frame can be divided into multiple different macroblocks. For each macro block, the initial configuration parameters can be substituted into the above mapping relationship, so as to determine the actual coded data amount of each macro block. When the image quality is improved, the target area that the human eye pays attention to can be identified in the video frame according to the principle of human vision. The image quality can be improved for the content in the target area, and the image quality can be appropriately reduced for the content outside the target area.
具体地,根据画质提升所需的画质改善系数,可以在保持视频帧的实际编码数据量不变的情况下,重新确定视频帧中各个宏块的编码数据量和配置参数。这样,根据重新确定的配置参数,便可以对视频帧中的各个宏块进行编码,从而完成画质改善的过程。Specifically, according to the image quality improvement coefficient required for image quality improvement, the encoded data amount and configuration parameters of each macroblock in the video frame can be re-determined while keeping the actual encoded data amount of the video frame unchanged. In this way, according to the re-determined configuration parameters, each macro block in the video frame can be coded, thereby completing the process of image quality improvement.
可见,通过上述的方案,对于人眼关注的目标区域,可以进行画质提升,而对于人眼不关注的区域,则可以适当降低画质。一方面可以保证视频帧的实际编码数据量不会出现大幅度变化,另一方面可以保证用户在观看时,对于自身关注的区域始终会呈现较高的画质,从而具备较高的画质提升效果。It can be seen that through the above solution, the image quality can be improved for the target area that the human eye pays attention to, and the image quality can be appropriately reduced for the area that the human eye does not pay attention to. On the one hand, it can ensure that the actual coded data volume of the video frame will not change significantly, on the other hand, it can ensure that the user will always present a higher picture quality to the area of his concern when watching, thus having a higher picture quality improvement effect.
附图说明Description of the drawings
为了更清楚地说明本发明实施方式中的技术方案,下面将对实施方式描述中所需要使用的附图作简单地介绍,显而易见地,下面描述中的附图仅仅是本发明的一些实施方式,对于本领域普通技术人员来讲,在不付出创造性劳动的前提下,还可以根据这些附图获得其他的附图。In order to more clearly explain the technical solutions in the embodiments of the present invention, the following will briefly introduce the drawings needed in the description of the embodiments. Obviously, the drawings in the following description are only some embodiments of the present invention. For those of ordinary skill in the art, other drawings can be obtained from these drawings without creative work.
图1是本发明实施方式中视频帧的码率配置方法步骤图;FIG. 1 is a step diagram of a method for configuring a bit rate of a video frame in an embodiment of the present invention;
图2是本发明实施方式中映射关系的模型示意图;Fig. 2 is a schematic diagram of a mapping relationship model in an embodiment of the present invention;
图3是本发明实施方式中视频帧的码率配置系统示意图;FIG. 3 is a schematic diagram of a code rate configuration system of a video frame in an embodiment of the present invention;
图4是本发明实施方式中视频帧的码率配置设备的结构示意图。Fig. 4 is a schematic structural diagram of a video frame rate configuration device in an embodiment of the present invention.
具体实施方式Detailed ways
为使本申请的目的、技术方案和优点更加清楚,下面将结合本申请具体实施方式及相应的附图对本申请技术方案进行清楚、完整地描述。显然,所描述的实施方式仅是本申请一部分实施方式,而不是全部的实施方式。基于本申请中的实施方式,本领域普通技术人员在没有做出创造性劳动前提下所获得的所有其他实施方式,都属于本申请保护的范围。In order to make the purpose, technical solutions, and advantages of the present application clearer, the technical solutions of the present application will be described clearly and completely in conjunction with the specific embodiments of the present application and the corresponding drawings. Obviously, the described implementations are only a part of the implementations of the present application, rather than all of the implementations. Based on the implementation manners in this application, all other implementation manners obtained by those of ordinary skill in the art without creative work shall fall within the protection scope of this application.
本申请提供的视频帧的码率配置方法,可以对视频帧中宏块(MacroBlock)的编码数据量进行调节,从而达到不同区域的宏块具备不同的编码数据量的效果。具体地,在对视频帧编码时,可以将视频帧划分为多个宏块。在实际应用中,划分得到的宏块通常具备相同的尺寸。当然,划分得到的宏块也可以根据所处区域的不同,而具备不同的尺寸,本申请对此并不做限制。The code rate configuration method of the video frame provided by the present application can adjust the amount of encoded data of the macroblock (MacroBlock) in the video frame, so as to achieve the effect that the macroblocks in different regions have different amounts of encoded data. Specifically, when encoding a video frame, the video frame may be divided into multiple macroblocks. In practical applications, the divided macroblocks usually have the same size. Of course, the divided macroblocks may also have different sizes according to different regions, which is not limited in this application.
在本实施方式中,视频帧中的各个宏块,可以根据与参考宏块之间的残差,并对该残差进行数学变换,从而得到各个宏块的残差变换(SATD,Sum of Absolute Transformed Difference)值。具体地,可以提取出当前宏块与参考宏块各自的像素矩阵,并将两个像素矩阵相减,从而得到两个宏块的残差。后续,可以对该残差进行hadamard变换,并将得到的数值作为当前宏块的SATD值。在对视频帧进行编码时,除了需要给各个宏块计算SATD值,还需要为各个宏块设置量化偏置参数,该量化偏置参数可以用于节省编码后的字节数。在实际应用中,可以借助于现有编码标准中的方式来计算各个宏块的量化偏置参数。例如,可以通过x264中自带的MB-Tree算法分析各个宏块的复杂度,从而计算得到各个宏块的量化偏置参数。上述的残差变换值和量化偏置参数,都可以作 为宏块的配置参数。In this embodiment, each macro block in the video frame can be mathematically transformed based on the residual error with the reference macro block, so as to obtain the residual transform (SATD, Sum of Absolute) of each macro block. Transformed Difference) value. Specifically, the respective pixel matrices of the current macroblock and the reference macroblock can be extracted, and the two pixel matrices can be subtracted to obtain the residuals of the two macroblocks. Subsequently, the residual can be subjected to hadamad transformation, and the obtained value can be used as the SATD value of the current macroblock. When encoding a video frame, in addition to calculating the SATD value for each macroblock, it is also necessary to set a quantization offset parameter for each macroblock. The quantization offset parameter can be used to save the number of bytes after encoding. In practical applications, the quantization offset parameter of each macroblock can be calculated by means of the existing coding standard. For example, the complexity of each macro block can be analyzed by the MB-Tree algorithm that comes with x264, so as to calculate the quantization offset parameter of each macro block. The above-mentioned residual transform value and quantization offset parameter can be used as the configuration parameter of the macro block.
在本实施方式中,按照上述方式确定的各个宏块的SATD值和量化偏置参数,可以作为各个宏块的初始SATD值和初始量化偏置参数。后续,可以通过如图1所示的步骤,实现对视频帧的码率配置过程。具体地,请参阅图1,在一个实施方式中,视频帧的码率配置方法可以包括以下多个步骤。In this embodiment, the SATD value and quantization offset parameter of each macroblock determined in the above manner can be used as the initial SATD value and initial quantization offset parameter of each macroblock. Subsequently, the process of configuring the bit rate of the video frame can be implemented through the steps shown in FIG. 1. Specifically, referring to FIG. 1, in one embodiment, a method for configuring a bit rate of a video frame may include the following multiple steps.
S1:建立宏块的编码数据量与宏块的配置参数之间的映射关系。S1: Establish a mapping relationship between the amount of coded data of the macro block and the configuration parameters of the macro block.
在实际应用中,宏块的配置参数可以包括上述的残差变换值和量化偏置参数,根据宏块的SATD值和量化偏置参数,可以计算得到宏块的实际编码数据量。在当前的编码标准中已经存在编码数据量、SATD值以及量化偏置参数之间的映射关系,但是,这种映射关系在码率较低的场景下不够准确。鉴于此,在本实施方式中,可以预先针对低码率的场景,通过机器学习的方法,重新拟合出编码数据量、SATD值、量化偏置参数之间的映射关系。In practical applications, the configuration parameters of the macro block may include the above-mentioned residual transformation value and quantization offset parameter. According to the SATD value and quantization offset parameter of the macro block, the actual coded data amount of the macro block can be calculated. In the current coding standard, there is already a mapping relationship between the amount of coded data, SATD value, and quantization offset parameter, but this mapping relationship is not accurate enough in a scenario with a low code rate. In view of this, in this embodiment, the mapping relationship between the amount of coded data, the SATD value, and the quantization bias parameter can be refitted in advance for a scene with a low bit rate through a machine learning method.
具体地,在重新拟合该映射关系时,首先可以获取指定码率范围内的视频帧样本。该指定码率范围,可以是码率较低的范围。例如,该指定码率范围可以是100kbps至300kpbs的范围。在该指定码率范围内的视频帧样本都可以是已经完成了编码的视频帧,通过分析这些视频帧中宏块的编码数据量与宏块的SATD值和量化偏置参数之间的关系,便可以得到低码率场景下精确的映射关系。Specifically, when refitting the mapping relationship, firstly, the video frame samples within the specified bit rate range can be obtained. The specified code rate range may be a range with a lower code rate. For example, the specified code rate range may be a range of 100kbps to 300kpbs. The video frame samples within the specified bit rate range can all be video frames that have been encoded. By analyzing the relationship between the amount of encoded data of the macroblock in these video frames and the SATD value of the macroblock and the quantization offset parameter, Then, accurate mapping relationships can be obtained in low bit rate scenarios.
在本实施方式中,针对每个视频帧样本,可以识别视频帧样本中各个宏块的编码数据量、SATD值和量化偏置参数,后续,可以基于识别得到的编码数据量、SATD值和量化偏置参数训练得到预估模型。在训练过程中,可以选用不同的方法得到最终的预估模型。例如,可以采用支持向量机(Support Vector Machine,SVM)、逻辑回归等算法,将识别得到的SATD值和量化偏置参数作为输入数据,并将对应的编码数据量与输出结果进行比较,从而确定出实际的编码数据量与输出结果之间的误差。通过该误差,对预估模型进行不断校正,从而使得输出结果与实际的编码数据量之间的误差能够限制在一定的允许范围内。这样,通过大量的视频帧样本进行训练,便可以得到比较精确的预估模型。该预估模型的输入可以是宏块的SATD值和量化偏置参数,输出可以是预测得到的宏块的编码数据量。In this embodiment, for each video frame sample, the coded data amount, SATD value, and quantization offset parameter of each macroblock in the video frame sample can be identified, and subsequently, it can be based on the coded data amount, SATD value and quantization obtained by the recognition. The bias parameter training obtains the prediction model. In the training process, different methods can be used to obtain the final predictive model. For example, algorithms such as Support Vector Machine (SVM) and logistic regression can be used to use the recognized SATD value and quantization bias parameter as input data, and compare the corresponding amount of encoded data with the output result to determine The error between the actual amount of encoded data and the output result. Through this error, the prediction model is continuously corrected, so that the error between the output result and the actual amount of encoded data can be limited within a certain allowable range. In this way, by training a large number of video frame samples, a more accurate prediction model can be obtained. The input of the prediction model may be the SATD value and the quantization offset parameter of the macro block, and the output may be the amount of coded data of the macro block obtained by prediction.
在本实施方式中,预估模型的输入数据和输出结果之间的数学关系,可以通过一定的公式来表达,该数学关系,可以作为步骤S1中建立的映射关系。In this embodiment, the mathematical relationship between the input data and the output result of the prediction model can be expressed by a certain formula, and the mathematical relationship can be used as the mapping relationship established in step S1.
具体地,以下从数学公式的角度来阐述预估模型的训练过程。首先,该预估模型的数学表达式可以如下所示:Specifically, the following describes the training process of the prediction model from the perspective of a mathematical formula. First, the mathematical expression of the predictive model can be as follows:
B i=w·φ(S i,Q i)+b B i =w·φ(S i ,Q i )+b
其中,B i表示视频帧样本中第i个宏块的编码数据量,S i表示该第i个宏块的SATD值,Q i表示该第i个宏块的量化偏置参数,φ(S i,Q i)表示以SATD值和量化偏置参数为自变量的核函数,w表示核函数的线性系数,b表示待定常数。 Wherein, B i represents the sample video frame coded data amount of the i-th macroblock, S i represents the i-th macroblock SATD value, Q i denotes the quantization parameter bias the i-th macroblock, φ (S i ,Q i ) represents the kernel function with SATD value and quantization bias parameter as independent variables, w represents the linear coefficient of the kernel function, and b represents the undetermined constant.
由此可见,在编码数据量与SATD值和量化偏置参数的映射关系中,宏块的残差变换值和量化偏置参数可以作为核函数的自变量,核函数还可以具备线性系数,并且在该映射关系中还包括待定常数。为了确定出该数学表达式,则需要确定其中的线性系数和待定常数。此外,还需要确定核函数中除了SATD值和量化偏置参数之外的其它变量的赋值。It can be seen that in the mapping relationship between the amount of encoded data and the SATD value and the quantization offset parameter, the residual transformation value of the macroblock and the quantization offset parameter can be used as the independent variables of the kernel function, and the kernel function can also have linear coefficients, and The undetermined constant is also included in the mapping relationship. In order to determine the mathematical expression, you need to determine the linear coefficients and undetermined constants. In addition, it is also necessary to determine the assignment of other variables in the kernel function except the SATD value and the quantization bias parameter.
具体地,可以将上述表征映射关系的表达式转换为携带约束条件的最小值问题,该最小值问题的表达式可以如下所示:Specifically, the above-mentioned expression representing the mapping relationship can be converted into a minimum value problem with constraint conditions, and the expression of the minimum value problem can be as follows:
Figure PCTCN2019118056-appb-000001
Figure PCTCN2019118056-appb-000001
其中,ξ i和ξ i *可以作为第i个宏块的松弛变量,C可以作为惩罚因子,l则表示当前的视频帧样本中包含的宏块的数量。 Among them, ξ i and ξ i * can be used as slack variables of the i-th macroblock, C can be used as a penalty factor, and l represents the number of macroblocks contained in the current video frame sample.
可见,表征所述最小值问题的表达式中以及表征所述约束条件的表达式中均可以包含松弛变量,并且表征所述最小值问题的表达式中还可以包含惩罚因子。It can be seen that both the expression characterizing the minimum value problem and the expression characterizing the constraint condition may include slack variables, and the expression characterizing the minimum value problem may also include a penalty factor.
为了求解上述的最小值问题,可以在上述的最小值问题中引入拉格朗日乘子,从而将上述的最小值问题转换为对偶问题,并通过对该对偶问题进行求解,得到上述的线性系数和待定常数的赋值。具体地,引入拉格朗日乘子a i后,上述的最小值问题便可以转换为以下的对偶形式: In order to solve the above-mentioned minimum problem, Lagrangian multipliers can be introduced into the above-mentioned minimum problem, thereby converting the above-mentioned minimum problem into a dual problem, and by solving the dual problem, the above linear coefficients can be obtained And the assignment of undetermined constants. Specifically, after introducing the Lagrangian multiplier a i , the above-mentioned minimum problem can be transformed into the following dual form:
Figure PCTCN2019118056-appb-000002
Figure PCTCN2019118056-appb-000002
其中,a i
Figure PCTCN2019118056-appb-000003
可以是引入的拉格朗日乘子,ε为松弛变量的实部,y i为第i个宏块实际的编码数据量。
Where a i and
Figure PCTCN2019118056-appb-000003
It can be an introduced Lagrangian multiplier, ε is the real part of the slack variable, and y i is the actual amount of coded data of the i-th macroblock.
在求解上述的对偶问题时,核函数可以采用径向基核函数,该径向基核函数的表达式可以如下所示:When solving the above dual problem, the kernel function can use the radial basis kernel function, and the expression of the radial basis kernel function can be as follows:
Figure PCTCN2019118056-appb-000004
Figure PCTCN2019118056-appb-000004
其中,σ表示拟合参数。Among them, σ represents the fitting parameter.
在本实施方式中,通过求解上述的对偶问题,从而得到拉格朗日乘子的具体赋值。通过在将最小值问题转换为对偶问题的过程中,拉格朗日乘子与线性系数和待定常数之间的变换关系,从而可以由拉格朗日乘子的具体赋值,反推出线性系数和待定常数的赋值。最终,便可以得到预估模型的数学表达式,该数据表达式可以作为建立的映射关系,该映射关系可以作为低码率下编码数据量、SATD值和量化偏置参数三者之间的映射关系。In this embodiment, the specific assignment of the Lagrange multiplier is obtained by solving the above-mentioned dual problem. Through the transformation relationship between the Lagrangian multiplier and the linear coefficient and the undetermined constant in the process of converting the minimum problem to the dual problem, the specific assignment of the Lagrangian multiplier can be used to deduct the linear coefficient and Assignment of undetermined constants. Finally, the mathematical expression of the predictive model can be obtained. The data expression can be used as the established mapping relationship. The mapping relationship can be used as the relationship between the amount of encoded data, the SATD value and the quantization bias parameter at low bit rates. Mapping relations.
S3:根据所述映射关系和所述视频帧中各个所述宏块的初始配置参数,确定各个所述宏块的实际编码数据量;其中,各个所述宏块的实际编码数据量构成所述视频帧的实际编码数据量。S3: Determine the actual coded data volume of each macroblock according to the mapping relationship and the initial configuration parameters of each macroblock in the video frame; wherein the actual coded data volume of each macroblock constitutes the The actual amount of encoded data of the video frame.
在本实施方式中,宏块的初始配置参数,可以指初始SATD值和初始量化偏置参数,这样,可以将待编码的视频帧中各个宏块的初始SATD值和初始量化偏置参数代入上述的映射关系中,从而确定出各个宏块的实际编码数据量。由于上述的映射关系是基于低码率场景下训练得到的,因此根据上述的映射关系确定出的实际编码数据量比较准确。在得到各个宏块的实际编码数据量后,这些宏块的实际编码数据量之和,便可以作为视频帧的实际编码数据量。In this embodiment, the initial configuration parameters of the macroblock may refer to the initial SATD value and the initial quantization offset parameter. In this way, the initial SATD value and the initial quantization offset parameter of each macroblock in the video frame to be encoded can be substituted into the above In the mapping relationship, the actual coded data amount of each macro block is determined. Since the above mapping relationship is obtained based on training in a low bit rate scenario, the actual coded data amount determined according to the above mapping relationship is more accurate. After the actual coded data volume of each macroblock is obtained, the sum of the actual coded data volume of these macroblocks can be used as the actual coded data volume of the video frame.
S5:识别所述视频帧中的目标区域,并在保持所述视频帧的实际编码数据量不变的情况下,根据所述目标区域的画质改善系数,重新确定所述视频帧中各个所述宏块的编码数据量和配置参数,以通过各个所述宏块重新确定的配置 参数对所述视频帧进行编码。S5: Identify the target area in the video frame, and re-determine each target area in the video frame according to the image quality improvement coefficient of the target area while keeping the actual amount of encoded data of the video frame unchanged. The coded data amount and configuration parameters of the macroblocks are used to encode the video frame with the configuration parameters newly determined by each macroblock.
在本实施方式中,考虑到用户在观看视频时,通常会关注于画面中的一部分主要内容,而对于其它内容并不会特别在意。鉴于此,可以将视频帧中用户关注的区域进行画质增强,而对于用户不关注的区域,则可以适当降低画质,从而在保持视频帧整体编码数据量不变的情况下,提升用户的观看体验。In this embodiment, it is considered that when a user is watching a video, he usually pays attention to a part of the main content in the screen, but does not pay particular attention to other content. In view of this, the image quality of the area that the user pays attention to in the video frame can be enhanced, and for the area that the user does not pay attention to, the image quality can be appropriately reduced, so as to keep the overall amount of encoded data of the video frame unchanged, and improve the user’s The viewing experience.
具体地,可以识别视频帧中用户关注的目标区域。在实际应用中,识别目标区域的方式也可以多种多样。例如,可以将视频帧中的固定区域作为目标区域,该固定区域例如可以是视频帧的中心区域。此外,还可以根据视频帧的对焦信息来确定视频帧中的目标区域。通常而言,视频帧中主要展示的内容,往往会具备较高的清晰度,而清晰度可以通过不同的对焦信息来表示。因此,主要展示的内容,和其它部分的内容,可以具备不同的对焦信息。通过识别视频帧中的对焦信息,从而可以确定出视频帧中用户关注的目标区域。再者,还可以通过目标跟踪算法,确定出视频帧中的目标区域。具体地,首先可以进行特征提取,从而提取出视频帧中的灰度特征、颜色特征、纹理特征、Haar-like矩形特征、兴趣点特征、超像素特征中的一个或者多个特征。然后,通过运动模型分析相邻的多个视频帧中目标对象的运动状态,从而确定出目标对象所处的候选区域。其中,运动模型可以是均值漂移(Mean shift)、滑动窗口(Slide window)、卡尔曼滤波(Kalman Filtering)、粒子滤波(Particle Filtering)等模型中的一种。最终,针对确定出的候选区域,可以通过外观模型(Appearance Model)在当前的视频帧的候选区域内判断是否存在被跟踪的目标对象。如果存在,则可以将该候选区域作为视频帧的目标区域。当然,在实际应用中还包括更多的方式来确定视频帧中的目标区域,本实施方式中以上例举的只是可能的实现方式,并不表示本申请的方案只能适用于上述的实现方式。Specifically, the target area of the user's attention in the video frame can be identified. In practical applications, there can be many ways to identify the target area. For example, a fixed area in the video frame may be used as the target area, and the fixed area may be, for example, the central area of the video frame. In addition, the target area in the video frame can also be determined according to the focus information of the video frame. Generally speaking, the content mainly displayed in the video frame tends to have high definition, and the definition can be expressed by different focus information. Therefore, the main display content and other parts of the content can have different focus information. By identifying the focus information in the video frame, the target area of the user's attention in the video frame can be determined. Furthermore, the target area in the video frame can also be determined by the target tracking algorithm. Specifically, feature extraction can be performed first, so as to extract one or more of the gray feature, color feature, texture feature, Haar-like rectangle feature, interest point feature, and super pixel feature in the video frame. Then, the motion state of the target object in multiple adjacent video frames is analyzed through the motion model, so as to determine the candidate area where the target object is located. Among them, the motion model may be one of a mean shift (Mean shift), a sliding window (Slide window), a Kalman filter (Kalman Filtering), a particle filter (Particle Filtering) and other models. Finally, for the determined candidate area, the Appearance Model can be used to determine whether there is a tracked target object in the candidate area of the current video frame. If it exists, the candidate area can be used as the target area of the video frame. Of course, there are more ways to determine the target area in the video frame in practical applications. The above examples in this embodiment are only possible implementations, and it does not mean that the solution of this application can only be applied to the above implementations. .
在本实施方式中,在对视频帧进行编码时,可以为目标区域分配多一些的编码数据量,从而提升目标区域的画质。具体地,在提升目标区域的画质时,可以预先确定一个画质改善系数。该画质改善系数可以基于目标区域中宏块当前的编码数据量和改善后的编码数据量来确定。例如,当前的编码数据量为100kb,而改善后的编码数据量需要达到200kb,那么画质改善系数便可以是2。在获取了画质改善系数后,可以在保持视频帧的实际编码数据量不变的情况下,重新确定所述视频帧中各个所述宏块的编码数据量和残差变换值。In this embodiment, when encoding a video frame, a larger amount of encoded data can be allocated to the target area, thereby improving the image quality of the target area. Specifically, when the image quality of the target area is improved, an image quality improvement coefficient may be determined in advance. The image quality improvement coefficient may be determined based on the current coded data amount of the macroblock in the target area and the improved coded data amount. For example, if the current coded data volume is 100kb, and the improved coded data volume needs to reach 200kb, then the image quality improvement factor can be 2. After the image quality improvement coefficient is obtained, the amount of encoded data and the residual transformation value of each of the macroblocks in the video frame may be re-determined while keeping the actual amount of encoded data of the video frame unchanged.
具体地,可以在视频帧的各个宏块中确定位于目标区域内的目标宏块,针对这部分目标宏块可以进行画质增强。因此,可以将所述目标宏块的实际编码数据量与所述画质改善系数的乘积,作为所述目标宏块重新确定后的编码数据量。这样,目标区域重新确定后的编码数据量
Figure PCTCN2019118056-appb-000005
便可以通过以下公式确定:
Specifically, the target macroblock located in the target area can be determined in each macroblock of the video frame, and the image quality can be enhanced for this part of the target macroblock. Therefore, the product of the actual coded data amount of the target macroblock and the image quality improvement coefficient may be used as the coded data amount after the target macroblock is re-determined. In this way, the amount of encoded data after the target area is re-determined
Figure PCTCN2019118056-appb-000005
It can be determined by the following formula:
Figure PCTCN2019118056-appb-000006
Figure PCTCN2019118056-appb-000006
其中,r为所述画质改善系数,ROI(Region Of Interest,感兴趣区域)为上述的目标区域中包含的宏块数量。Wherein, r is the image quality improvement coefficient, and ROI (Region Of Interest, region of interest) is the number of macroblocks included in the aforementioned target region.
在本实施方式中,可以计算各个所述目标宏块重新确定后的编码数据量之和,并计算所述视频帧的实际编码数据量与所述编码数据量之和的差值,该差值
Figure PCTCN2019118056-appb-000007
可以通过以下公式表示:
In this embodiment, it is possible to calculate the sum of the amount of encoded data after each of the target macroblocks is re-determined, and calculate the difference between the actual amount of encoded data of the video frame and the sum of the amount of encoded data.
Figure PCTCN2019118056-appb-000007
It can be expressed by the following formula:
Figure PCTCN2019118056-appb-000008
Figure PCTCN2019118056-appb-000008
其中,CS Total表示视频帧的实际编码数据量,该差值
Figure PCTCN2019118056-appb-000009
可以表示为经过画质调节后,除目标区域以外的其它区域的编码数据量的总和。这样,可以确定所述视频帧中位于所述目标区域外的其它宏块的数量,并将所述差值
Figure PCTCN2019118056-appb-000010
与确定的所述数量之间的比值作为所述其它宏块重新确定后的编码数据量。
Among them, CS Total represents the actual amount of encoded data of the video frame, the difference
Figure PCTCN2019118056-appb-000009
It can be expressed as the sum of the amount of coded data in areas other than the target area after the image quality is adjusted. In this way, the number of other macroblocks located outside the target area in the video frame can be determined, and the difference
Figure PCTCN2019118056-appb-000010
The ratio between the determined quantity and the determined quantity is used as the coded data quantity after the other macroblocks are re-determined.
在本实施方式中,在计算出各个宏块重新确定后的编码数据量后,可以进一步地确定各个宏块新的SATD值。具体地,可以将目标宏块的初始残差变换值与所述画质改善系数的乘积,作为所述目标宏块重新确定后的残差变换值。目标宏块重新确定后的SATD值
Figure PCTCN2019118056-appb-000011
可以如下所示:
In this embodiment, after the re-determined coded data amount of each macro block is calculated, the new SATD value of each macro block can be further determined. Specifically, the product of the initial residual transformation value of the target macroblock and the image quality improvement coefficient may be used as the residual transformation value after the target macroblock is re-determined. The SATD value after the target macroblock is re-determined
Figure PCTCN2019118056-appb-000011
It can be as follows:
Figure PCTCN2019118056-appb-000012
Figure PCTCN2019118056-appb-000012
其中,S i表示第i个宏块(如果该宏块是目标宏块)的初始SATD值。 Where, S i represents the i-th macroblock (macroblock if the target macroblock is a) an initial SATD value.
然后,可以将计算得到的所述差值
Figure PCTCN2019118056-appb-000013
与所述其它宏块的实际编码数据量之和CS NROI的比值,作为所述其它宏块的画质调节系数,并将所述其它宏块的初始残差变换值与所述画质调节系数的乘积,作为所述其它宏块重新确定后的残差变换值。具体地,其它宏块的画质调节系数可以如下所示:
Then, the calculated difference can be
Figure PCTCN2019118056-appb-000013
The ratio of the CS NROI to the actual coded data volume of the other macroblocks is used as the image quality adjustment coefficient of the other macroblocks, and the initial residual transformation value of the other macroblocks is compared with the image quality adjustment coefficient The product of is used as the residual transform value after the other macroblocks are re-determined. Specifically, the image quality adjustment coefficients of other macroblocks can be as follows:
Figure PCTCN2019118056-appb-000014
Figure PCTCN2019118056-appb-000014
那么其它宏块重新确定后的残差变换值可以如下所示:Then the residual transformation values of other macroblocks after re-determination can be as follows:
Figure PCTCN2019118056-appb-000015
Figure PCTCN2019118056-appb-000015
其中,S i表示第i个宏块(如果该宏块是其它宏块)的初始SATD值。 Where, S i represents the i-th macroblock (if the macroblock is another macroblock) initial SATD value.
在本实施方式中,经过上述的方式,可以在保持视频帧的实际编码数据量不变的情况下,通过画质改善系数,重新确定目标区域和其它区域中的各个宏块的编码数据量和SATD值。这样,根据步骤S1确定出的映射关系,可以将重新确定的编码数据量和SATD值代入该映射关系中,从而计算出宏块重新确定的量化偏置参数。这样,通过重新确定的SATD值和量化偏置参数,便可以对视频帧中的宏块进行编码,从而得到符合预期码率的编码后的视频帧。In this embodiment, through the above-mentioned method, it is possible to re-determine the sum of the coded data volume of each macroblock in the target area and other areas through the image quality improvement coefficient while keeping the actual coded data volume of the video frame unchanged. SATD value. In this way, according to the mapping relationship determined in step S1, the re-determined coded data amount and SATD value can be substituted into the mapping relationship, so as to calculate the re-determined quantization offset parameter of the macroblock. In this way, by re-determining the SATD value and the quantization offset parameter, the macroblocks in the video frame can be coded, so as to obtain the coded video frame that meets the expected bit rate.
由上可见,本申请建立的映射关系,可以通过图2所示的模型来表示。在图2中,SATD值和量化偏置参数可以作为输入数据,从而预测出宏块的编码数据量。此外,如果限定了宏块预期的编码数据量,并且设定了宏块的SATD值,那么也可以根据该模型推导出宏块实际的量化偏置参数。这样,通过对低码率场景下的视频帧样本进行模型训练,从而可以建立编码数据量与画质提升之间的量化关系,并且可以动态地实现视频中ROI的码率分配。It can be seen from the above that the mapping relationship established in this application can be represented by the model shown in FIG. 2. In Figure 2, the SATD value and the quantization offset parameter can be used as input data to predict the amount of coded data of the macroblock. In addition, if the expected coded data volume of the macroblock is limited, and the SATD value of the macroblock is set, then the actual quantization offset parameter of the macroblock can also be derived from the model. In this way, by performing model training on video frame samples in low bit rate scenarios, the quantization relationship between the amount of encoded data and the improvement of image quality can be established, and the bit rate allocation of the ROI in the video can be dynamically realized.
请参阅图3,本申请还提供一种视频帧的码率配置系统,所述视频帧被划分为多个宏块;所述系统包括:Referring to FIG. 3, the present application also provides a code rate configuration system for a video frame, where the video frame is divided into multiple macroblocks; the system includes:
映射关系建立单元,用于建立宏块的编码数据量与宏块的配置参数之间的映射关系;A mapping relationship establishing unit for establishing a mapping relationship between the amount of coded data of the macro block and the configuration parameters of the macro block;
编码数据量确定单元,用于根据所述映射关系和所述视频帧中各个所述宏块的初始配置参数,确定各个所述宏块的实际编码数据量;其中,各个所述宏块的实际编码数据量构成所述视频帧的实际编码数据量;The coded data amount determining unit is configured to determine the actual coded data amount of each macroblock according to the mapping relationship and the initial configuration parameters of each macroblock in the video frame; wherein, the actual amount of each macroblock is The amount of encoded data constitutes the actual amount of encoded data of the video frame;
编码单元,用于识别所述视频帧中的目标区域,并在保持所述视频帧的实际编码数据量不变的情况下,根据所述目标区域的画质改善系数,重新确定所述视频帧中各个所述宏块的编码数据量和配置参数,以通过各个所述宏块重新确定的配置参数对所述视频帧进行编码。The encoding unit is used to identify the target area in the video frame, and to re-determine the video frame according to the image quality improvement coefficient of the target area while keeping the actual amount of encoded data of the video frame unchanged The coded data amount and configuration parameters of each macroblock in, encode the video frame with the configuration parameters newly determined by each macroblock.
在一个实施方式中,所述配置参数包括残差变换值和量化偏置参数;所述编码单元包括:In one embodiment, the configuration parameters include residual transform values and quantization offset parameters; the coding unit includes:
参数重新确定模块,用于重新确定所述视频帧中各个所述宏块的编码数据量和残差变换值;A parameter re-determination module, configured to re-determine the amount of encoded data and the residual transformation value of each macroblock in the video frame;
量化偏置参数计算模块,用于根据所述映射关系,以及重新确定的各个所述宏块的编码数据量和残差变换值,计算各个所述宏块调整后的量化偏置参数,以通过各个所述宏块重新确定的残差变换值和计算得到的量化偏置参数对所述 视频帧进行编码。The quantization offset parameter calculation module is used to calculate the adjusted quantization offset parameter of each macro block according to the mapping relationship and the re-determined coded data amount and residual transformation value of each macro block to pass The re-determined residual transform value of each macroblock and the calculated quantization offset parameter encode the video frame.
在一个实施方式中,所述映射关系建立单元包括:In an embodiment, the mapping relationship establishment unit includes:
样本处理模块,用于获取指定码率范围内的视频帧样本,并将所述视频帧样本划分为多个宏块;The sample processing module is used to obtain video frame samples within a specified code rate range, and divide the video frame samples into multiple macro blocks;
训练模块,用于识别所述视频帧样本中各个宏块的编码数据量、残差变换值和量化偏置参数,并基于识别得到的编码数据量、残差变换值和量化偏置参数训练得到预估模型;The training module is used to identify the amount of encoded data, the residual transformation value and the quantization offset parameter of each macroblock in the video frame sample, and train based on the encoded data amount, the residual transformation value and the quantization offset parameter obtained by the recognition Predictive model
关系建立模块,用于将所述预估模型的输入数据和输出结果之间的数学关系,作为建立的宏块的编码数据量与宏块的残差变换值和量化偏置参数之间的映射关系。The relationship establishment module is used to use the mathematical relationship between the input data and the output result of the prediction model as a mapping between the coded data amount of the macroblock and the residual transformation value and quantization offset parameter of the macroblock. relationship.
在一个实施方式中,所述参数重新确定单元包括:In one embodiment, the parameter re-determination unit includes:
画质改善模块,用于在所述视频帧的各个宏块中确定位于所述目标区域内的目标宏块,并将所述目标宏块的实际编码数据量与所述画质改善系数的乘积,作为所述目标宏块重新确定后的编码数据量;The image quality improvement module is used to determine the target macroblock located in the target area in each macroblock of the video frame, and to multiply the actual coded data amount of the target macroblock and the image quality improvement coefficient , As the amount of coded data after the target macroblock is re-determined;
差值计算模块,用于计算各个所述目标宏块重新确定后的编码数据量之和,并计算所述视频帧的实际编码数据量与所述编码数据量之和的差值;A difference calculation module, which is used to calculate the sum of the amount of encoded data after each of the target macroblocks is re-determined, and calculate the difference between the actual amount of encoded data of the video frame and the sum of the amount of encoded data;
画质调整模块,用于确定所述视频帧中位于所述目标区域外的其它宏块的数量,并将所述差值与确定的所述数量之间的比值作为所述其它宏块重新确定后的编码数据量。The image quality adjustment module is used to determine the number of other macroblocks located outside the target area in the video frame, and re-determine the ratio between the difference and the determined number as the other macroblocks The amount of encoded data afterwards.
在一个实施方式中,所述参数重新确定单元还包括:In an embodiment, the parameter re-determination unit further includes:
残差变换值改善模块,用于将所述目标宏块的初始残差变换值与所述画质改善系数的乘积,作为所述目标宏块重新确定后的残差变换值;A residual transformation value improvement module, configured to multiply the initial residual transformation value of the target macroblock and the image quality improvement coefficient as the residual transformation value after the target macroblock is re-determined;
残差变换值调节模块,用于将计算得到的所述差值与所述其它宏块的实际编码数据量之和的比值,作为所述其它宏块的画质调节系数,并将所述其它宏块的初始残差变换值与所述画质调节系数的乘积,作为所述其它宏块重新确定后的残差变换值。The residual transform value adjustment module is configured to use the calculated ratio of the difference value and the sum of the actual coded data amount of the other macroblocks as the image quality adjustment coefficient of the other macroblocks, and adjust the other The product of the initial residual transformation value of the macroblock and the image quality adjustment coefficient is used as the residual transformation value after the other macroblocks are re-determined.
请参阅图4,本申请还提供一种视频帧的码率配置设备,所述设备包括处理器和存储器,所述存储器用于存储计算机程序,所述计算机程序被所述处理器执行时,可以实现上述的视频帧的码率配置方法。4, this application also provides a video frame rate configuration device, the device includes a processor and a memory, the memory is used to store a computer program, when the computer program is executed by the processor, The above-mentioned method for configuring the code rate of the video frame is realized.
在本实施方式中,所述存储器可以包括用于存储信息的物理装置,通常是 将信息数字化后再以利用电、磁或者光学等方法的媒体加以存储。本实施方式所述的存储器又可以包括:利用电能方式存储信息的装置,如RAM或ROM等;利用磁能方式存储信息的装置,如硬盘、软盘、磁带、磁芯存储器、磁泡存储器或U盘;利用光学方式存储信息的装置,如CD或DVD。当然,还有其他方式的存储器,例如量子存储器或石墨烯存储器等等。In this embodiment, the memory may include a physical device for storing information, which is usually digitized and then stored in a medium using electrical, magnetic, or optical methods. The memory described in this embodiment may also include: a device that uses electrical energy to store information, such as RAM or ROM, etc.; a device that uses magnetic energy to store information, such as hard disk, floppy disk, magnetic tape, magnetic core memory, bubble memory, or U disk ; A device that uses optical means to store information, such as CD or DVD. Of course, there are other types of memory, such as quantum memory or graphene memory.
在本实施方式中,所述处理器可以按任何适当的方式实现。例如,所述处理器可以采取例如微处理器或处理器以及存储可由该(微)处理器执行的计算机可读程序代码(例如软件或固件)的计算机可读介质、逻辑门、开关、专用集成电路(Application Specific Integrated Circuit,ASIC)、可编程逻辑控制器和嵌入微控制器的形式等等。In this embodiment, the processor can be implemented in any suitable manner. For example, the processor may take the form of, for example, a microprocessor or a processor, and a computer-readable medium storing computer-readable program codes (for example, software or firmware) executable by the (micro)processor, logic gates, switches, dedicated integrated circuits, etc. Circuit (Application Specific Integrated Circuit, ASIC), programmable logic controller and embedded microcontroller form, etc.
由上可见,本申请一个或者多个实施方式提供的技术方案,可以预先建立宏块的编码数据量与配置参数之间的映射关系。在实际应用中,编码数据量通常是固定在一个有限的范围内,因此可以针对该范围内的编码数据量,建立与配置参数之间精确的映射关系。在对视频帧进行编码时,可以将视频帧划分为多个不同的宏块。针对每个宏块,可以将初始的配置参数代入上述的映射关系,从而确定出各个宏块的实际编码数据量。在进行画质改善时,可以根据人眼视觉原理,在视频帧中识别出人眼关注的目标区域。针对目标区域内的内容可以进行画质提升,而针对目标区域外的内容,则可以适当降低画质。It can be seen from the above that the technical solutions provided by one or more embodiments of the present application can pre-establish the mapping relationship between the amount of encoded data of the macro block and the configuration parameters. In practical applications, the amount of coded data is usually fixed within a limited range, so it is possible to establish an accurate mapping relationship with the configuration parameters for the amount of coded data within this range. When encoding a video frame, the video frame can be divided into multiple different macroblocks. For each macro block, the initial configuration parameters can be substituted into the above mapping relationship, so as to determine the actual coded data amount of each macro block. When the image quality is improved, the target area that the human eye pays attention to can be identified in the video frame according to the principle of human vision. The image quality can be improved for the content in the target area, and the image quality can be appropriately reduced for the content outside the target area.
具体地,根据画质提升所需的画质改善系数,可以在保持视频帧的实际编码数据量不变的情况下,重新确定视频帧中各个宏块的编码数据量和配置参数。这样,根据重新确定的配置参数,便可以对视频帧中的各个宏块进行编码,从而完成画质改善的过程。Specifically, according to the image quality improvement coefficient required for image quality improvement, the encoded data amount and configuration parameters of each macroblock in the video frame can be re-determined while keeping the actual encoded data amount of the video frame unchanged. In this way, according to the re-determined configuration parameters, each macro block in the video frame can be coded, thereby completing the process of image quality improvement.
可见,通过上述的方案,对于人眼关注的目标区域,可以进行画质提升,而对于人眼不关注的区域,则可以适当降低画质。一方面可以保证视频帧的实际编码数据量不会出现大幅度变化,另一方面可以保证用户在观看时,对于自身关注的区域始终会呈现较高的画质,从而具备较高的画质提升效果。It can be seen that through the above solution, the image quality can be improved for the target area that the human eye pays attention to, and the image quality can be appropriately reduced for the area that the human eye does not pay attention to. On the one hand, it can ensure that the actual coded data volume of the video frame will not change significantly, on the other hand, it can ensure that the user will always present a higher picture quality to the area of his concern when watching, thus having a higher picture quality improvement effect.
本说明书中的各个实施方式均采用递进的方式描述,各个实施方式之间相同相似的部分互相参见即可,每个实施方式重点说明的都是与其他实施方式的不同之处。尤其,针对系统和设备的实施方式来说,均可以参照前述方法的实施方式的介绍对照解释。The various embodiments in this specification are described in a progressive manner, and the same or similar parts between the various embodiments can be referred to each other, and each embodiment focuses on the differences from other embodiments. In particular, for the implementation of the system and the device, both can be explained with reference to the introduction of the implementation of the foregoing method.
本领域内的技术人员应明白,本发明的实施方式可提供为方法、系统、或计算机程序产品。因此,本发明可采用完全硬件实施方式、完全软件实施方式、或结合软件和硬件方面的实施方式的形式。而且,本发明可采用在一个或多个其中包含有计算机可用程序代码的计算机可用存储介质(包括但不限于磁盘存储器、CD-ROM、光学存储器等)上实施的计算机程序产品的形式。Those skilled in the art should understand that the embodiments of the present invention can be provided as a method, a system, or a computer program product. Therefore, the present invention may adopt a form of a complete hardware implementation, a complete software implementation, or a combination of software and hardware implementations. Moreover, the present invention may adopt the form of a computer program product implemented on one or more computer-usable storage media (including but not limited to disk storage, CD-ROM, optical storage, etc.) containing computer-usable program codes.
本发明是参照根据本发明实施方式的方法、设备(系统)、和计算机程序产品的流程图和/或方框图来描述的。应理解可由计算机程序指令实现流程图和/或方框图中的每一流程和/或方框、以及流程图和/或方框图中的流程和/或方框的结合。可提供这些计算机程序指令到通用计算机、专用计算机、嵌入式处理机或其他可编程数据处理设备的处理器以产生一个机器,使得通过计算机或其他可编程数据处理设备的处理器执行的指令产生用于实现在流程图一个流程或多个流程和/或方框图一个方框或多个方框中指定的功能的装置。The present invention is described with reference to flowcharts and/or block diagrams of methods, devices (systems), and computer program products according to the embodiments of the present invention. It should be understood that each process and/or block in the flowchart and/or block diagram, and the combination of processes and/or blocks in the flowchart and/or block diagram can be implemented by computer program instructions. These computer program instructions can be provided to the processor of a general-purpose computer, a special-purpose computer, an embedded processor, or other programmable data processing equipment to generate a machine, so that the instructions executed by the processor of the computer or other programmable data processing equipment are generated It is a device that realizes the functions specified in one process or multiple processes in the flowchart and/or one block or multiple blocks in the block diagram.
这些计算机程序指令也可存储在能引导计算机或其他可编程数据处理设备以特定方式工作的计算机可读存储器中,使得存储在该计算机可读存储器中的指令产生包括指令装置的制造品,该指令装置实现在流程图一个流程或多个流程和/或方框图一个方框或多个方框中指定的功能。These computer program instructions can also be stored in a computer-readable memory that can guide a computer or other programmable data processing equipment to work in a specific manner, so that the instructions stored in the computer-readable memory produce an article of manufacture including the instruction device. The device implements the functions specified in one process or multiple processes in the flowchart and/or one block or multiple blocks in the block diagram.
这些计算机程序指令也可装载到计算机或其他可编程数据处理设备上,使得在计算机或其他可编程设备上执行一系列操作步骤以产生计算机实现的处理,从而在计算机或其他可编程设备上执行的指令提供用于实现在流程图一个流程或多个流程和/或方框图一个方框或多个方框中指定的功能的步骤。These computer program instructions can also be loaded on a computer or other programmable data processing equipment, so that a series of operation steps are executed on the computer or other programmable equipment to produce computer-implemented processing, so as to execute on the computer or other programmable equipment. The instructions provide steps for implementing the functions specified in one process or multiple processes in the flowchart and/or one block or multiple blocks in the block diagram.
在一个典型的配置中,计算设备包括一个或多个处理器(CPU)、输入/输出接口、网络接口和内存。In a typical configuration, the computing device includes one or more processors (CPUs), input/output interfaces, network interfaces, and memory.
内存可能包括计算机可读介质中的非永久性存储器,随机存取存储器(RAM)和/或非易失性内存等形式,如只读存储器(ROM)或闪存(flash RAM)。内存是计算机可读介质的示例。The memory may include non-permanent memory in a computer readable medium, random access memory (RAM) and/or non-volatile memory, such as read-only memory (ROM) or flash memory (flash RAM). Memory is an example of computer readable media.
计算机可读介质包括永久性和非永久性、可移动和非可移动媒体可以由任何方法或技术来实现信息存储。信息可以是计算机可读指令、数据结构、程序的模块或其他数据。计算机的存储介质的例子包括,但不限于相变内存(PRAM)、静态随机存取存储器(SRAM)、动态随机存取存储器(DRAM)、其他类型的随机存取存储器(RAM)、只读存储器(ROM)、电可擦除可编程只读存储器 (EEPROM)、快闪记忆体或其他内存技术、只读光盘只读存储器(CD-ROM)、数字多功能光盘(DVD)或其他光学存储、磁盒式磁带,磁带磁磁盘存储或其他磁性存储设备或任何其他非传输介质,可用于存储可以被计算设备访问的信息。按照本文中的界定,计算机可读介质不包括暂存电脑可读媒体(transitory media),如调制的数据信号和载波。Computer-readable media include permanent and non-permanent, removable and non-removable media, and information storage can be realized by any method or technology. The information can be computer-readable instructions, data structures, program modules, or other data. Examples of computer storage media include, but are not limited to, phase change memory (PRAM), static random access memory (SRAM), dynamic random access memory (DRAM), other types of random access memory (RAM), read-only memory (ROM), electrically erasable programmable read-only memory (EEPROM), flash memory or other memory technology, CD-ROM, digital versatile disc (DVD) or other optical storage, Magnetic cassettes, magnetic tape magnetic disk storage or other magnetic storage devices or any other non-transmission media can be used to store information that can be accessed by computing devices. According to the definition in this article, computer-readable media does not include transitory media, such as modulated data signals and carrier waves.
还需要说明的是,术语“包括”、“包含”或者其任何其他变体意在涵盖非排他性的包含,从而使得包括一系列要素的过程、方法、商品或者设备不仅包括那些要素,而且还包括没有明确列出的其他要素,或者是还包括为这种过程、方法、商品或者设备所固有的要素。在没有更多限制的情况下,由语句“包括一个……”限定的要素,并不排除在包括所述要素的过程、方法、商品或者设备中还存在另外的相同要素。It should also be noted that the terms "include", "include" or any other variants thereof are intended to cover non-exclusive inclusion, so that a process, method, commodity or equipment including a series of elements not only includes those elements, but also includes Other elements that are not explicitly listed, or they also include elements inherent to such processes, methods, commodities, or equipment. If there are no more restrictions, the element defined by the sentence "including a..." does not exclude the existence of other identical elements in the process, method, commodity, or equipment that includes the element.
以上所述仅为本申请的实施方式而已,并不用于限制本申请。对于本领域技术人员来说,本申请可以有各种更改和变化。凡在本申请的精神和原理之内所作的任何修改、等同替换、改进等,均应包含在本申请的权利要求范围之内。The above are only the implementation manners of this application, and are not intended to limit this application. For those skilled in the art, this application can have various modifications and changes. Any modification, equivalent replacement, improvement, etc. made within the spirit and principle of this application shall be included in the scope of the claims of this application.

Claims (13)

  1. 一种视频帧的码率配置方法,其特征在于,所述视频帧被划分为多个宏块;所述方法包括:A method for configuring the bit rate of a video frame, wherein the video frame is divided into a plurality of macro blocks; the method includes:
    建立宏块的编码数据量与宏块的配置参数之间的映射关系;Establish a mapping relationship between the amount of coded data of the macro block and the configuration parameters of the macro block;
    根据所述映射关系和所述视频帧中各个所述宏块的初始配置参数,确定各个所述宏块的实际编码数据量;其中,各个所述宏块的实际编码数据量构成所述视频帧的实际编码数据量;According to the mapping relationship and the initial configuration parameters of each macroblock in the video frame, determine the actual coded data volume of each macroblock; wherein the actual coded data volume of each macroblock constitutes the video frame The actual amount of encoded data;
    识别所述视频帧中的目标区域,并在保持所述视频帧的实际编码数据量不变的情况下,根据所述目标区域的画质改善系数,重新确定所述视频帧中各个所述宏块的编码数据量和配置参数,以通过各个所述宏块重新确定的配置参数对所述视频帧进行编码。Identify the target area in the video frame, and re-determine each macro in the video frame according to the image quality improvement coefficient of the target area while keeping the actual amount of encoded data of the video frame unchanged. The amount of coded data and configuration parameters of the block are used to encode the video frame with the configuration parameters newly determined by each of the macroblocks.
  2. 根据权利要求1所述的方法,其特征在于,所述配置参数包括残差变换值和量化偏置参数;重新确定所述视频帧中各个所述宏块的编码数据量和配置参数包括:The method according to claim 1, wherein the configuration parameters include a residual transform value and a quantization offset parameter; and re-determining the amount of coded data and configuration parameters of each of the macroblocks in the video frame comprises:
    重新确定所述视频帧中各个所述宏块的编码数据量和残差变换值;Re-determine the amount of encoded data and the residual transform value of each macroblock in the video frame;
    根据所述映射关系,以及重新确定的各个所述宏块的编码数据量和残差变换值,计算各个所述宏块调整后的量化偏置参数,以通过各个所述宏块重新确定的残差变换值和计算得到的量化偏置参数对所述视频帧进行编码。According to the mapping relationship, and the re-determined coded data amount and residual transformation value of each macro block, the adjusted quantization offset parameter of each macro block is calculated to pass the re-determined residual value of each macro block. The difference transform value and the calculated quantization offset parameter encode the video frame.
  3. 根据权利要求2所述的方法,其特征在于,所述映射关系按照以下方式建立:The method according to claim 2, wherein the mapping relationship is established in the following manner:
    获取指定码率范围内的视频帧样本,并将所述视频帧样本划分为多个宏块;Obtaining video frame samples within a specified bit rate range, and dividing the video frame samples into multiple macro blocks;
    识别所述视频帧样本中各个宏块的编码数据量、残差变换值和量化偏置参数,并基于识别得到的编码数据量、残差变换值和量化偏置参数训练得到预估模型;Identifying the amount of encoded data, residual transformation value, and quantization offset parameter of each macroblock in the video frame sample, and training to obtain a prediction model based on the encoded data amount, residual transformation value, and quantization offset parameter obtained by the identification;
    将所述预估模型的输入数据和输出结果之间的数学关系,作为建立的宏块的编码数据量与宏块的残差变换值和量化偏置参数之间的映射关系。The mathematical relationship between the input data and the output result of the prediction model is used as the established mapping relationship between the amount of encoded data of the macro block and the residual transform value of the macro block and the quantization offset parameter.
  4. 根据权利要求2所述的方法,其特征在于,在所述映射关系中,所述宏块的残差变换值和量化偏置参数作为核函数的自变量,所述核函数具备线性系数,并且在所述映射关系中还包括待定常数。The method according to claim 2, wherein in the mapping relationship, the residual transform value of the macroblock and the quantization bias parameter are used as the independent variables of the kernel function, and the kernel function has linear coefficients, and The mapping relationship also includes undetermined constants.
  5. 根据权利要求4所述的方法,其特征在于,在建立所述映射关系时,所述方法还包括:The method according to claim 4, wherein when establishing the mapping relationship, the method further comprises:
    将表征所述映射关系的表达式转换为携带约束条件的最小值问题,其中,表征所述最小值问题的表达式中以及表征所述约束条件的表达式中均包含松弛变量,并且表征所述最小值问题的表达式中还包含惩罚因子;The expression characterizing the mapping relationship is converted into a minimum value problem with constraint conditions, wherein the expression characterizing the minimum value problem and the expression characterizing the constraint conditions both contain slack variables, and characterize the The expression of the minimum problem also includes a penalty factor;
    在所述最小值问题中引入拉格朗日乘子,以将所述最小值问题转换为对偶问题,并通过对所述对偶问题进行求解,得到所述线性系数和所述待定常数的赋值。A Lagrangian multiplier is introduced into the minimum problem to convert the minimum problem into a dual problem, and by solving the dual problem, the assignment of the linear coefficient and the undetermined constant is obtained.
  6. 根据权利要求2所述的方法,其特征在于,按照以下方式重新确定所述视频帧中各个所述宏块的编码数据量:The method according to claim 2, wherein the amount of coded data of each macroblock in the video frame is re-determined in the following manner:
    在所述视频帧的各个宏块中确定位于所述目标区域内的目标宏块,并将所述目标宏块的实际编码数据量与所述画质改善系数的乘积,作为所述目标宏块重新确定后的编码数据量;Determine a target macroblock located in the target area in each macroblock of the video frame, and use the product of the actual coded data amount of the target macroblock and the image quality improvement coefficient as the target macroblock The amount of encoded data after re-determination;
    计算各个所述目标宏块重新确定后的编码数据量之和,并计算所述视频帧的实际编码数据量与所述编码数据量之和的差值;Calculating the sum of the coded data amount after each of the target macroblocks is re-determined, and calculating the difference between the actual coded data amount of the video frame and the sum of the coded data amount;
    确定所述视频帧中位于所述目标区域外的其它宏块的数量,并将所述差值与确定的所述数量之间的比值作为所述其它宏块重新确定后的编码数据量。The number of other macroblocks located outside the target area in the video frame is determined, and the ratio between the difference and the determined number is used as the re-determined coded data amount of the other macroblocks.
  7. 根据权利要求6所述的方法,其特征在于,按照以下方式重新确定所述视频帧中各个所述宏块的残差变换值:The method according to claim 6, wherein the residual transform value of each macroblock in the video frame is re-determined in the following manner:
    将所述目标宏块的初始残差变换值与所述画质改善系数的乘积,作为所述目标宏块重新确定后的残差变换值;Taking the product of the initial residual transformation value of the target macroblock and the image quality improvement coefficient as the residual transformation value after the target macroblock is re-determined;
    将计算得到的所述差值与所述其它宏块的实际编码数据量之和的比值,作为所述其它宏块的画质调节系数,并将所述其它宏块的初始残差变换值与所述画质调节系数的乘积,作为所述其它宏块重新确定后的残差变换值。The ratio of the calculated difference value to the sum of the actual coded data amount of the other macroblocks is used as the image quality adjustment coefficient of the other macroblocks, and the initial residual transformation value of the other macroblocks is compared with The product of the image quality adjustment coefficient is used as the residual transformation value after the other macroblocks are re-determined.
  8. 一种视频帧的码率配置系统,其特征在于,所述视频帧被划分为多个宏块;所述系统包括:A code rate configuration system for a video frame, wherein the video frame is divided into a plurality of macro blocks; the system includes:
    映射关系建立单元,用于建立宏块的编码数据量与宏块的配置参数之间的映射关系;A mapping relationship establishing unit for establishing a mapping relationship between the amount of coded data of the macro block and the configuration parameters of the macro block;
    编码数据量确定单元,用于根据所述映射关系和所述视频帧中各个所述宏块的初始配置参数,确定各个所述宏块的实际编码数据量;其中,各个所述宏块的实际编码数据量构成所述视频帧的实际编码数据量;The coded data amount determining unit is configured to determine the actual coded data amount of each macroblock according to the mapping relationship and the initial configuration parameters of each macroblock in the video frame; wherein, the actual amount of each macroblock is The amount of encoded data constitutes the actual amount of encoded data of the video frame;
    编码单元,用于识别所述视频帧中的目标区域,并在保持所述视频帧的实际编码数据量不变的情况下,根据所述目标区域的画质改善系数,重新确定所述视频帧中各个所述宏块的编码数据量和配置参数,以通过各个所述宏块重新确定的配置参数对所述视频帧进行编码。The encoding unit is used to identify the target area in the video frame, and to re-determine the video frame according to the image quality improvement coefficient of the target area while keeping the actual amount of encoded data of the video frame unchanged The coded data amount and configuration parameters of each macroblock in, encode the video frame with the configuration parameters newly determined by each macroblock.
  9. 根据权利要求8所述的系统,其特征在于,所述配置参数包括残差变换值和量化偏置参数;所述编码单元包括:The system according to claim 8, wherein the configuration parameter includes a residual transformation value and a quantization offset parameter; and the coding unit includes:
    参数重新确定模块,用于重新确定所述视频帧中各个所述宏块的编码数据量和残差变换值;A parameter re-determination module, configured to re-determine the amount of encoded data and the residual transformation value of each macroblock in the video frame;
    量化偏置参数计算模块,用于根据所述映射关系,以及重新确定的各个所述宏块的编码数据量和残差变换值,计算各个所述宏块调整后的量化偏置参数,以通过各个所述宏块重新确定的残差变换值和计算得到的量化偏置参数对所述视频帧进行编码。The quantization offset parameter calculation module is used to calculate the adjusted quantization offset parameter of each macro block according to the mapping relationship and the re-determined coded data amount and residual transformation value of each macro block to pass The re-determined residual transform value of each macroblock and the calculated quantization offset parameter encode the video frame.
  10. 根据权利要求9所述的系统,其特征在于,所述映射关系建立单元包括:The system according to claim 9, wherein the mapping relationship establishment unit comprises:
    样本处理模块,用于获取指定码率范围内的视频帧样本,并将所述视频帧样本划分为多个宏块;The sample processing module is used to obtain video frame samples within a specified code rate range, and divide the video frame samples into multiple macroblocks;
    训练模块,用于识别所述视频帧样本中各个宏块的编码数据量、残差变换值和量化偏置参数,并基于识别得到的编码数据量、残差变换值和量化偏置参数训练得到预估模型;The training module is used to identify the amount of encoded data, the residual transformation value and the quantization offset parameter of each macroblock in the video frame sample, and train based on the encoded data amount, the residual transformation value and the quantization offset parameter obtained by the recognition Predictive model
    关系建立模块,用于将所述预估模型的输入数据和输出结果之间的数学关 系,作为建立的宏块的编码数据量与宏块的残差变换值和量化偏置参数之间的映射关系。The relationship establishment module is used to use the mathematical relationship between the input data and the output result of the prediction model as a mapping between the coded data amount of the macroblock and the residual transformation value and quantization offset parameter of the macroblock. relationship.
  11. 根据权利要求9所述的系统,其特征在于,所述参数重新确定单元包括:The system according to claim 9, wherein the parameter re-determination unit comprises:
    画质改善模块,用于在所述视频帧的各个宏块中确定位于所述目标区域内的目标宏块,并将所述目标宏块的实际编码数据量与所述画质改善系数的乘积,作为所述目标宏块重新确定后的编码数据量;The image quality improvement module is used to determine the target macroblock located in the target area in each macroblock of the video frame, and to multiply the actual coded data amount of the target macroblock and the image quality improvement coefficient , As the amount of coded data after the target macroblock is re-determined;
    差值计算模块,用于计算各个所述目标宏块重新确定后的编码数据量之和,并计算所述视频帧的实际编码数据量与所述编码数据量之和的差值;A difference calculation module, which is used to calculate the sum of the amount of encoded data after each of the target macroblocks is re-determined, and calculate the difference between the actual amount of encoded data of the video frame and the sum of the amount of encoded data;
    画质调整模块,用于确定所述视频帧中位于所述目标区域外的其它宏块的数量,并将所述差值与确定的所述数量之间的比值作为所述其它宏块重新确定后的编码数据量。The image quality adjustment module is used to determine the number of other macroblocks located outside the target area in the video frame, and re-determine the ratio between the difference and the determined number as the other macroblocks The amount of encoded data afterwards.
  12. 根据权利要求11所述的系统,其特征在于,所述参数重新确定单元还包括:The system according to claim 11, wherein the parameter re-determining unit further comprises:
    残差变换值改善模块,用于将所述目标宏块的初始残差变换值与所述画质改善系数的乘积,作为所述目标宏块重新确定后的残差变换值;A residual transformation value improvement module, configured to multiply the initial residual transformation value of the target macroblock and the image quality improvement coefficient as the residual transformation value after the target macroblock is re-determined;
    残差变换值调节模块,用于将计算得到的所述差值与所述其它宏块的实际编码数据量之和的比值,作为所述其它宏块的画质调节系数,并将所述其它宏块的初始残差变换值与所述画质调节系数的乘积,作为所述其它宏块重新确定后的残差变换值。The residual transform value adjustment module is configured to use the calculated ratio of the difference value and the sum of the actual coded data amount of the other macroblocks as the image quality adjustment coefficient of the other macroblocks, and adjust the other The product of the initial residual transformation value of the macroblock and the image quality adjustment coefficient is used as the residual transformation value after the other macroblocks are re-determined.
  13. 一种视频帧的码率配置设备,其特征在于,所述设备包括处理器和存储器,所述存储器用于存储计算机程序,所述计算机程序被所述处理器执行时,实现如权利要求1至7中任一所述的方法。A video frame rate configuration device, characterized in that the device includes a processor and a memory, the memory is used to store a computer program, and when the computer program is executed by the processor, it implements 7. The method described in any one of 7.
PCT/CN2019/118056 2019-10-10 2019-11-13 Code rate configuration method, system and device for video frame WO2021068341A1 (en)

Applications Claiming Priority (2)

Application Number Priority Date Filing Date Title
CN201910956977.3 2019-10-10
CN201910956977.3A CN110769254B (en) 2019-10-10 2019-10-10 Code rate configuration method, system and equipment for video frame

Publications (1)

Publication Number Publication Date
WO2021068341A1 true WO2021068341A1 (en) 2021-04-15

Family

ID=69331580

Family Applications (1)

Application Number Title Priority Date Filing Date
PCT/CN2019/118056 WO2021068341A1 (en) 2019-10-10 2019-11-13 Code rate configuration method, system and device for video frame

Country Status (2)

Country Link
CN (1) CN110769254B (en)
WO (1) WO2021068341A1 (en)

Cited By (3)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN114745556A (en) * 2022-02-07 2022-07-12 浙江智慧视频安防创新中心有限公司 Encoding method, encoding device, digital video film system, electronic device, and storage medium
CN115174967A (en) * 2022-07-04 2022-10-11 山东衡昊信息技术有限公司 Code rate dynamic allocation method based on bandwidth estimation
CN116760965A (en) * 2023-08-14 2023-09-15 腾讯科技(深圳)有限公司 Panoramic video encoding method, device, computer equipment and storage medium

Families Citing this family (1)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN112235576B (en) * 2020-11-16 2024-04-30 北京世纪好未来教育科技有限公司 Encoding method, encoding device, electronic equipment and storage medium

Citations (4)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN101572810A (en) * 2008-04-29 2009-11-04 合肥坤安电子科技有限公司 Video encoding method based on interested regions
CN101882316A (en) * 2010-06-07 2010-11-10 深圳市融创天下科技发展有限公司 Method, device and system for regional division/coding of image
CN102438142A (en) * 2011-11-08 2012-05-02 北京空间机电研究所 Adaptive image compression method based on deep space background
US20170214915A1 (en) * 2016-01-22 2017-07-27 Oki Electric Industry Co., Ltd. Image encoding device and image encoding method

Family Cites Families (9)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN101534432A (en) * 2009-04-09 2009-09-16 上海广电(集团)有限公司中央研究院 Method for controlling code rate based on human eye sensing model
CN101867799B (en) * 2009-04-17 2011-11-16 北京大学 Video frame processing method and video encoder
TWI491262B (en) * 2010-09-14 2015-07-01 Alpha Imaging Technology Corp Image encoding integrated circuit and image encoding data transmission method thereof
CN102271248B (en) * 2011-08-15 2014-03-05 复旦大学 H.264/AVC (Advanced Video Coding) macroblock-level code rate control algorithm based on weight window models
US9485506B2 (en) * 2012-09-11 2016-11-01 Texas Instruments Incorporated Method and system for constraining slice header processing overhead in video coding
CN106937118B (en) * 2017-03-13 2019-09-13 西安电子科技大学 A kind of bit rate control method combined based on subjective area-of-interest and time-space domain
CN108111857A (en) * 2017-12-29 2018-06-01 广州海昇计算机科技有限公司 A kind of target bit allocation method, system and device for the control of H.265 code check
CN109951706B (en) * 2019-04-22 2021-01-01 湖南国科微电子股份有限公司 Video coding method, device and coder
CN110177277B (en) * 2019-06-28 2022-04-12 广东中星微电子有限公司 Image coding method and device, computer readable storage medium and electronic equipment

Patent Citations (4)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN101572810A (en) * 2008-04-29 2009-11-04 合肥坤安电子科技有限公司 Video encoding method based on interested regions
CN101882316A (en) * 2010-06-07 2010-11-10 深圳市融创天下科技发展有限公司 Method, device and system for regional division/coding of image
CN102438142A (en) * 2011-11-08 2012-05-02 北京空间机电研究所 Adaptive image compression method based on deep space background
US20170214915A1 (en) * 2016-01-22 2017-07-27 Oki Electric Industry Co., Ltd. Image encoding device and image encoding method

Cited By (5)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN114745556A (en) * 2022-02-07 2022-07-12 浙江智慧视频安防创新中心有限公司 Encoding method, encoding device, digital video film system, electronic device, and storage medium
CN114745556B (en) * 2022-02-07 2024-04-02 浙江智慧视频安防创新中心有限公司 Encoding method, encoding device, digital retina system, electronic device, and storage medium
CN115174967A (en) * 2022-07-04 2022-10-11 山东衡昊信息技术有限公司 Code rate dynamic allocation method based on bandwidth estimation
CN116760965A (en) * 2023-08-14 2023-09-15 腾讯科技(深圳)有限公司 Panoramic video encoding method, device, computer equipment and storage medium
CN116760965B (en) * 2023-08-14 2023-12-22 腾讯科技(深圳)有限公司 Panoramic video encoding method, device, computer equipment and storage medium

Also Published As

Publication number Publication date
CN110769254B (en) 2022-04-22
CN110769254A (en) 2020-02-07

Similar Documents

Publication Publication Date Title
WO2021068341A1 (en) Code rate configuration method, system and device for video frame
CN109660800B (en) Motion estimation method, motion estimation device, electronic equipment and computer-readable storage medium
Gao et al. Joint machine learning and game theory for rate control in high efficiency video coding
WO2021093060A1 (en) Video encoding method, system, and apparatus
US20180025474A1 (en) Video processing method and apparatus
US20150023411A1 (en) Encoding an image
CN108737841A (en) Coding unit depth determination method and device
Hou et al. Underwater image dehazing and denoising via curvature variation regularization
CN113597764A (en) Interframe coding for adaptive resolution video coding
US10536696B2 (en) Image encoding device and image encoding method
CN106664404A (en) Block segmentation mode processing method in video coding and relevant apparatus
Kadaikar et al. Sequential block-based disparity map estimation algorithm for stereoscopic image coding
CN112862838A (en) Natural image matting method based on real-time click interaction of user
WO2022127865A1 (en) Video processing method, apparatus, electronic device, and storage medium
JP6090430B2 (en) Encoding apparatus, method, program, computer system, recording medium
Dey et al. Enhanced macroblock features for dynamic background modeling in H. 264/AVC video encoded at low bitrate
CN113099241B (en) Reference frame list updating method, device, equipment and storage medium
CN112243129B (en) Video data processing method and device, computer equipment and storage medium
US10187656B2 (en) Image processing device for adjusting computational complexity of interpolation filter, image interpolation method, and image encoding method
CN106101731B (en) Lossless Image Compression Algorithm method and device
CN110944211B (en) Interpolation filtering method, device, medium and electronic device for intra-frame prediction
WO2021093059A1 (en) Method, system and device for recognizing region of interest
US11265553B2 (en) Obtaining a target representation of a time sample of a signal
Kountchev et al. Hierarchical adaptive KL-based transform: algorithms and applications
CN106937117B (en) Image compression method and device

Legal Events

Date Code Title Description
121 Ep: the epo has been informed by wipo that ep was designated in this application

Ref document number: 19948610

Country of ref document: EP

Kind code of ref document: A1

NENP Non-entry into the national phase

Ref country code: DE

122 Ep: pct application non-entry in european phase

Ref document number: 19948610

Country of ref document: EP

Kind code of ref document: A1