CN110784713B

CN110784713B - Coding and decoding method capable of changing effective image size

Info

Publication number: CN110784713B
Application number: CN201911286945.3A
Authority: CN
Inventors: 万俊青; 谢亚光; 李小强
Original assignee: Hangzhou Arcvideo Technology Co ltd
Current assignee: Hangzhou Arcvideo Technology Co ltd
Priority date: 2019-12-14
Filing date: 2019-12-14
Publication date: 2022-02-22
Anticipated expiration: 2039-12-14
Also published as: CN110784713A

Abstract

The invention discloses a coding and decoding method with variable effective image size. The method specifically comprises the following steps: (1) in the encoder: analyzing the complexity of each frame of the file, segmenting the file according to the complexity, evaluating the size of an effective image suitable for each segment, writing the effective image into a configuration file, determining whether the image is reset or not according to the frame level width setting in the configuration file by a reset module, wherein a coding core is responsible for coding, and a code stream synthesizer packs the video stream from the coding core and the information of an effective area of the image into a final video stream for output; (2) in the decoder: the video stream analyzer analyzes whether the current frame carries effective image size information or not, the decoding kernel is responsible for decoding, and the Resize module determines whether to Resize or not according to the effective image size and the video coding image size. The invention has the beneficial effects that: the size of the coded image is not changed, the viewing effect is guaranteed, the method is suitable for any coder and decoder kernel, the coder and decoder kernel is not affected, and the stability of the coder and decoder kernel is guaranteed.

Description

Coding and decoding method capable of changing effective image size

Technical Field

The present invention relates to the field of video processing technologies, and in particular, to a method for encoding and decoding a variable effective image size.

Background

In practical application, the encoder is set with a fixed image size, a fixed code rate or a maximum code rate, which causes poor quality of a complex section in a video, and a plurality of blocks appear to influence viewing. Viewing may be improved if the complex segment is encoded with a smaller encoded picture size, but the same video, changes in encoded picture size will require the encoder and decoder to delete, reconstruct, which will make the encoder and decoder likely to be unstable and potentially stuttered when played, especially with hardware encoders and decoders (NVidia codec creation sometimes takes more than 2-3 seconds).

Disclosure of Invention

The present invention provides a coding and decoding method with variable effective image size and good stability to overcome the above-mentioned defects in the prior art.

In order to achieve the purpose, the invention adopts the following technical scheme:

a variable effective image size coding and decoding method specifically comprises the following steps:

(1) in the encoder: analyzing the complexity of each frame of the file, segmenting the file according to the complexity, evaluating the size of an effective image suitable for each segment, writing the effective image into a configuration file, determining whether the image is reset or not according to the frame level width setting in the configuration file by a reset module, wherein a coding core is responsible for coding, and a code stream synthesizer packs the video stream from the coding core and the information of an effective area of the image into a final video stream for output;

(2) in the decoder: the video stream analyzer analyzes whether the current frame carries effective image size information or not, the decoding kernel is responsible for decoding, and the Resize module determines whether to Resize or not according to the effective image size and the video coding image size.

The encoder changes the size of an effective image and fills black in an invalid area in a video at a complex section, so that the size of the encoded image is not changed, the viewing effect is ensured, the encoder is suitable for any codec kernel, the encoder and the decoder kernel are not influenced, and the stability of the encoder and the decoder kernel is ensured. The codec can also be used in other applications where it is desirable to change the effective image size, such as network real-time transcoding or real-time communication applications.

Preferably, in step (1), the specific steps of encoding are as follows:

(11) starting coding of a frame, reading a configuration file, judging whether the current frame has information setting of effective image size change according to frame level width setting in the configuration file, and directly entering the next step if the current frame does not have information setting of effective image size change; if yes, updating the effective image size for the current frame, and then entering the next step;

(12) judging whether the effective image size is equal to the coding image size, if so, directly continuing coding in a coding kernel, and then entering the next step; if not, the image frame Resize is adjusted to the effective image size through a Resize module, the upper left corner of a memory of the coded image is placed, other areas of the memory of the coded image are filled with black, then the memory of the coded image is sent to a coding kernel, the coding kernel codes the frame into an IDR frame, codes the frame in an IDR frame type, and then the next step is carried out;

(13) judging whether the effective image size of the current frame is updated or not, and if not, outputting a code stream given by a coding kernel by a code stream synthesizer; and if the video coding rate is updated, the effective image size information is packed according to the user _ data _ unregistered format of sei in the HEVC, and the code stream synthesizer packs the coding core output code stream and the effective image size information into a new video code rate for output.

Preferably, in step (11), the frame level width setting method in the configuration file is as follows:

firstly, transcoding is configured by self-adaption of a fixed Qp scene and IDR frames, and after the transcoding is completed, the average pixel compression ratio bpp of all frames between two IDR frames and the average pixel compression ratio bitrate _ bpp of a given code rate are calculated;

if the bitrate _ bpp is more than or equal to the bpp, the original height and width are adopted;

if bpp > bitrate _ bpp is more than or equal to 0.5 bpp, the height of the effective image is unchanged, and the width adopts a new width:

if bitrate _ bpp <0.5 bpp, the height and width of the effective image all need to be new, new width:

new_width = (old_width/2+15)/16*16

the new height is as follows:

。

preferably, in step (2), the specific steps of decoding are as follows:

(21) when the decoding of a frame starts, a video stream analyzer analyzes the code stream, checks whether the video frame has effective image size information, if so, updates the effective image size information, and enters a decoding kernel; if not, directly entering a decoding kernel;

(22) the decoding kernel carries out decoding operation;

(23) judging whether the effective image size is consistent with the coding image size, if not, resetting the image with the effective image size to the coding image size through a reset module, and outputting a video image; and if the video images are consistent, directly outputting the video images.

The invention has the beneficial effects that: the size of the coded image is not changed, the viewing effect is guaranteed, the method is suitable for any coder and decoder kernel, the coder and decoder kernel is not affected, and the stability of the coder and decoder kernel is guaranteed.

Drawings

FIG. 1 is a diagram of the coding framework of the present invention;

FIG. 2 is a flow chart of the encoding of the present invention;

FIG. 3 is a decoding framework of the present invention;

fig. 4 is a decoding flow diagram of the present invention.

Detailed Description

The invention is further described with reference to the following figures and detailed description.

(1) in the encoder: analyzing the complexity of each frame of the file, segmenting the file according to the complexity, evaluating the size of an effective image suitable for each segment, writing the effective image into a configuration file, determining whether the image is reset or not according to the frame level width setting in the configuration file by a reset module, wherein a coding core is responsible for coding, and a code stream synthesizer packs video streams coming out of the coding core and information of an effective area of the image into final video streams to be output, as shown in FIG. 1;

as shown in fig. 2, the specific steps of encoding are as follows:

the frame level high-width setting method in the configuration file is as follows:

new_width = (old_width/2+15)/16*16

the new height is as follows:

。

(2) In the decoder: the video stream analyzer analyzes whether the current frame carries effective image size information or not, a decoding kernel is responsible for decoding, and a Resize module determines whether Resize or not according to the effective image size and the video coding image size, as shown in fig. 3;

as shown in fig. 4, the specific steps of decoding are as follows:

(22) the decoding kernel carries out decoding operation;

Here, it should be noted that: the effective image size refers to the actual image size, and the encoded image size refers to the actual image size plus the size of other areas filled with black. In step (11), the frame level in the configuration file is set to be higher than the width of the configuration file, and the division by 16 means integer division and decimal division, so that the following must be multiplied by 16.

Claims

1. A coding and decoding method with variable effective image size is characterized by comprising the following steps:

(1) in the encoder: analyzing the complexity of each frame of the file, segmenting the file according to the complexity, evaluating the size of an effective image suitable for each segment, writing the effective image into a configuration file, determining whether the image is reset or not according to the frame level width setting in the configuration file by a reset module, wherein a coding core is responsible for coding, and a code stream synthesizer packs the video stream from the coding core and the information of an effective area of the image into a final video stream for output; the specific steps of encoding are as follows:

(12) judging whether the effective image size is equal to the coding image size, if so, directly continuing coding in a coding kernel, and then entering the next step; if not, the image frame Resize is adjusted to the effective image size through a Resize module, the upper left corner of the size of the coded image is placed, other areas of the size of the coded image are filled with black, then the size of the coded image is sent to a coding kernel, the coding kernel codes the frame into an IDR frame, codes the frame in an IDR frame type, and then the next step is carried out;

(13) judging whether the effective image size of the current frame is updated or not, and if not, outputting a code stream given by a coding kernel by a code stream synthesizer; if the video coding rate is updated, the effective image size information is packaged according to a user _ data _ unregistered format of sei in HEVC, and a code stream synthesizer packages a coding core output code stream and the effective image size information into a new video code rate for output;

2. The method of claim 1, wherein in the step (11), the frame level width in the configuration file is set as follows:

new_width = (old_width/2+15)/16*16

the new height is as follows:

。

3. the method as claimed in claim 1, wherein the decoding in step (2) comprises the following steps:

(22) the decoding kernel carries out decoding operation;