WO2016033725A1

WO2016033725A1 - Block segmentation mode processing method in video coding and relevant apparatus

Info

Publication number: WO2016033725A1
Application number: PCT/CN2014/085681
Authority: WO
Inventors: 杨晓峰; 张园园; 石腾
Original assignee: 华为技术有限公司
Priority date: 2014-09-01
Filing date: 2014-09-01
Publication date: 2016-03-10
Also published as: CN106664404A; CN106664404B

Abstract

A block segmentation processing method in video coding and a relevant apparatus. The block segmentation processing method in video coding comprises: acquiring a subdivided region of a video frame, wherein the subdivided region of the video frame comprises at least one of a region of interest of the video frame and a fringe region of the video frame; determining that a first coding block of the video frame contains pixel points in the subdivided region; and segmenting the first coding block into sub-blocks. The solution provided in the embodiments of the present invention facilitates reduction of computation complexity for judging whether a coding block is segmented into sub-blocks.

Description

Block division method processing method and related device in video coding

Technical field

The present invention relates to the field of video codec technology, and in particular, to a block segmentation processing method and related device in video coding.

Background technique

Since the International Telegraph Union (ITU) introduced the first international video coding standard H.120 in 1984, video coding technology has achieved rapid development and has become a modern information technology. An important part of the deficiencies. With the rapid development of the Internet (internet), wireless communication networks and digital broadcasting networks, people are increasingly demanding access to multimedia information, and video coding technology is one of the key technologies for efficient transmission and storage of video information.

The goal of video coding technology is to achieve better picture quality at the same compression ratio, or to achieve greater compression ratio under the same picture quality. It can be seen that the compression ratio and image quality are important indicators that need to be weighed by a pair of coding techniques. Under certain conditions of coding technology, the improvement of one index usually leads to the decrease of another index.

The evaluation of image quality is generally divided into subjective evaluation criteria and objective evaluation criteria. In the current mainstream video coding technology standards, the objective parameter based on the peak signal-to-noise ratio is used as the evaluation standard for the picture quality. This is an objective evaluation standard. The quality of the image quality needs to be judged by the human eye. The objective evaluation standard is only used to simulate the human eye's perception of the image quality to a certain extent. Although the objective evaluation standard has certain reference significance, the objective image quality and the subjective image quality of the human eye. It is not always consistent. For example, the peak signal to noise ratio (PSNR) of the left and right pictures shown in Figure 1 is the same, but the subjective quality of the right picture is obviously better than the left picture. The reason is that the right picture biases the bit resources. It is allocated to the area where the facial features of the face are located in the picture, and the face part is a sensitive area of the human eye, which greatly enhances the subjective feeling of the human eye on the picture quality. This is also the origin of video coding technology based on visual perception.

Scientific research has found that the human visual system (English: human visual system, abbreviation: HVS) is sensitive to some areas or features in the video picture. According to this characteristic, the researchers proposed video coding based on visual perception, the purpose of which is to eliminate the human eye with a large extent by using the known HVS characteristics. Information that is not or difficult to perceive, it is desirable to provide visually perceived quality video frames with fewer bit resources.

In the traditional video-aware video coding process, any coding block (English: coding unit, abbreviation: CU) for any level (ie, any kind of depth) in a video frame is calculated and compared. The CU performs rate distortion before and after sub-block partitioning to determine whether to continue sub-block partitioning for the CU, which results in a large amount of computing resources.

Summary of the invention

Embodiments of the present invention provide a block segmentation processing method and related apparatus in video coding to reduce computational complexity of determining whether a coded block performs sub-block segmentation.

A first aspect of the embodiments of the present invention provides a block segmentation processing method in video coding, including:

Obtaining a subdivided region of the video frame, wherein the subdivided region of the video frame includes at least one of a region of interest of the video frame and an edge region of the video frame;

Determining that the first coded block of the video frame includes pixel points in the subdivided region;

The first coding block is subjected to sub-block division.

With reference to the first aspect, in a first possible implementation manner of the first aspect, before the performing the sub-block partitioning on the first coding block, the method further includes: determining a current of the first coding block The segmentation depth is less than a first segmentation depth threshold, wherein the first segmentation depth threshold is less than or equal to a maximum allowable segmentation depth of the video frame.

With reference to the first possible implementation manner of the first aspect, in a second possible implementation manner of the first aspect, the subdivision region of the video frame includes a region of interest of the video frame and the video frame An edge region; wherein the determining that the first coding block of the video frame includes pixel points in the subdivided region comprises: determining that the first coding block of the video frame includes the region of interest and the edge The pixel of the overlapping area of the area.

In conjunction with the first possible implementation of the first aspect, in a third possible implementation of the first aspect, the subdivision region of the video frame includes a region of interest of the video frame and the video frame An edge region, wherein the determining that the first coding block of the video frame includes a pixel in the subdivided region comprises: determining that a first coding block of the video frame includes a pixel in the region of interest And not including a pixel in an edge region of the video frame, or determining a first coding block of the video frame Pixel points in an edge region of the video frame and pixels in the region of interest of the video frame are not included;

Before the first coding block is divided into sub-blocks,

The method further includes determining that a rate distortion penalty of the first coding block is greater than a rate distortion penalty of the first coding block for sub-block division.

A second aspect of the embodiments of the present invention provides a block segmentation processing device in video coding, including:

An acquiring unit, configured to acquire a subdivided area of the video frame, where the subdivided area of the video frame includes at least one of a region of interest of the video frame and an edge region of the video frame;

a determining unit, configured to determine that the first coding block of the video frame includes pixel points in the subdivided region;

And a dividing unit, configured to perform sub-block segmentation on the first coding block.

In conjunction with the second aspect, in a first possible implementation of the second aspect,

The determining unit is further configured to: before the sub-block division of the first coding block, determine that a current segmentation depth of the first coding block is smaller than a first segmentation depth threshold, where the first segmentation depth The threshold is less than or equal to the maximum allowed segmentation depth of the video frame.

With reference to the first possible implementation manner of the second aspect, in a second possible implementation manner of the second aspect, the subdivision region of the video frame includes a region of interest of the video frame and the video frame An edge region; wherein, in the determining that the first coding block of the video frame includes a pixel point in the subdivided region, the determining unit is specifically configured to determine that the first coding block of the video frame includes a pixel point of the region of interest and the overlapping region of the edge region.

With reference to the first possible implementation manner of the second aspect, in a third possible implementation manner of the second aspect, the subdivided area of the video frame includes a region of interest of the video frame and the video frame An edge region; wherein, in the determining that the first coding block of the video frame includes a pixel point in the subdivided region, the determining unit is specifically configured to determine that the first coding block of the video frame includes a pixel in the region of interest and not including a pixel in an edge region of the video frame, or determining that a first encoded block of the video frame includes a pixel in an edge region of the video frame and does not include a pixel in a region of interest of the video frame;

The determining unit is further configured to: determine, before the sub-block is divided into the first coding block, determine The rate distortion penalty of the first coding block is greater than the rate distortion cost of the first coding block after sub-block division.

A third aspect of the embodiments of the present invention provides a video encoding apparatus, including:

Processor and memory,

The processor is configured to acquire a subdivided area of a video frame by running an instruction or code stored in the memory, where a subdivided area of the video frame includes a region of interest of the video frame and the video At least one of edge regions of the frame; determining that the first coded block of the video frame includes pixel points in the subdivided region; and sub-blocking the first coded block.

With reference to the third aspect, in a first possible implementation manner of the third aspect, the processor is further configured to determine, by the first coding block, the first coding block before performing the sub-block division The current segmentation depth is less than a first segmentation depth threshold, wherein the first segmentation depth threshold is less than or equal to a maximum allowable segmentation depth of the video frame.

With reference to the first possible implementation manner of the third aspect, in a second possible implementation manner of the third aspect, the subdivision region of the video frame includes a region of interest of the video frame and the video frame An edge region; wherein the processor is configured to determine that a first coded block of the video frame includes pixel points of the region of interest and an overlap region of the edge region.

With reference to the first possible implementation manner of the third aspect, in a third possible implementation manner of the third aspect, the subdivision region of the video frame includes a region of interest of the video frame and the video frame An edge region, wherein the processor is configured to determine that a first coding block of the video frame includes a pixel point in the region of interest and does not include a pixel point in an edge region of the video frame, or determine the A first coded block of a video frame includes pixel points in an edge region of the video frame and does not include pixel points in a region of interest of the video frame;

The processor is further configured to: before performing the sub-block division on the first coding block, determining that a rate distortion cost of the first coding block is greater than a rate distortion of the first coding block after performing sub-block division cost.

It can be seen that, in the technical solution of some embodiments of the present invention, in the video encoding process, after acquiring the subdivided area of the video frame, when determining that the first coding block of the video frame includes the pixel in the subdivided area Pointing, the first coding block is subjected to sub-block division, wherein the subdivision area packet of the video frame Enclosing at least one of an ROI of the video frame and an edge region of the video frame, that is, a relative positional relationship between the first coding block of the video frame and the subdivision region may determine the number to some extent Whether a coded block performs sub-block partitioning at the current segmentation depth, which is a conventional mechanism for determining whether to continue sub-block partitioning of the current coded block by completely calculating and comparing the rate distortions before and after the current coded block partition. The foregoing technical solution of the present invention is advantageous for reducing the computational complexity of determining whether the current coding block performs sub-block segmentation under the current segmentation depth, thereby facilitating reduction of occupation of computing resources.

DRAWINGS

In order to more clearly illustrate the technical solutions of the embodiments of the present invention, the following description of the embodiments will be briefly described. It is obvious that the drawings in the following description are only some embodiments of the present invention. Those skilled in the art can also obtain other drawings based on these drawings without paying any creative work.

FIG. 1 is a schematic flowchart of a block segmentation processing method in video coding according to an embodiment of the present invention;

FIG. 1-b is a schematic diagram of a Sauber operator according to an embodiment of the present invention; FIG.

FIG. 1-c is a schematic diagram of an etching template according to an embodiment of the present invention; FIG.

FIG. 1-d is a schematic diagram of comparison of an etching process according to an embodiment of the present invention; FIG.

2 to FIG. 7 are schematic flowcharts of a method for processing a block division in another video coding according to an embodiment of the present invention;

FIG. 8-a is a schematic flowchart of a video encoding method according to an embodiment of the present invention;

FIG. 8-b is a schematic diagram of generating a saliency map according to an embodiment of the present invention;

FIG. 9 is a schematic diagram of a block segmentation processing apparatus in video coding according to an embodiment of the present disclosure;

FIG. 10 is a schematic diagram of a video encoding apparatus according to an embodiment of the present invention.

detailed description

In order to make the objects, features and advantages of the present invention more obvious and easy to understand, the following will be The technical solutions in the embodiments of the present invention are described in the accompanying drawings in the embodiments of the present invention. It is obvious that the embodiments described below are only a part of the embodiments of the present invention, and not all of the embodiments. All other embodiments obtained by a person of ordinary skill in the art based on the embodiments of the present invention without creative efforts are within the scope of the present invention.

The terms "first", "second", "third", "fourth" and the like in the specification and claims of the present invention and the above drawings are used to distinguish different objects, and are not intended to describe a specific order. . Furthermore, the terms "comprises" and "comprising" and "comprising" are intended to cover a non-exclusive inclusion. For example, a process, method, system, product, or device that comprises a series of steps or units is not limited to the listed steps or units, but optionally also includes steps or units not listed, or alternatively Other steps or units inherent to these processes, methods, products or equipment.

The method for the block division processing in the video coding provided by the embodiment of the present invention is first introduced. The execution body of the block division processing method in the video coding provided by the embodiment of the present invention may be a video coding device, and the video coding device may be any output, A device that stores video, such as a mobile phone, laptop, tablet, or personal computer.

An embodiment of the block segmentation processing method in the video coding of the present invention, wherein the block segmentation processing method in the video coding may include: acquiring a subdivision region of the video frame, where the subdivision region of the video frame includes Determining at least one of a region of interest (ROI) of the video frame and an edge region of the video frame; determining that the first coding block of the video frame includes pixels in the subdivided region Pointing; sub-blocking the first coded block.

Referring to FIG. 1-a, FIG. 1-a is a schematic flowchart diagram of a block segmentation processing method in video coding according to an embodiment of the present invention. As shown in the example of FIG. 1-a, a block segmentation processing method in video coding provided by an embodiment of the present invention may include:

101. Obtain a subdivision area of the video frame.

The subdivided area of the video frame includes at least one of an ROI of the video frame and an edge area of the video frame.

The specific manner of obtaining the subdivision area of the video frame may be various. In some possible implementation manners of the present invention, for example, a region matching algorithm may be used to perform matching processing on a video frame to obtain a subdivided region of the video frame. Specifically, for example, a matching processing example of a video frame based on a region matching algorithm For example, it can be identified which areas in the video frame are subdivision areas and which areas are not subdivision areas.

Or in some possible implementation manners of the present invention, the subdivided area of the video frame may also be obtained according to a configuration instruction in the configuration file. For example, the configuration instructions in the configuration file may specify which areas are subdivision areas and which areas are not subdivision areas. Of course, the subdivision area of the video frame can also be obtained by other means.

102. Determine that a first coding block of the video frame includes a pixel point in the subdivided region.

The first coding block of the video frame is any one of the video frames.

The current segmentation depth of the first coding block may be any segmentation depth that is less than the maximum allowed segmentation depth of the video frame, that is, the size of the first coded block may be any size greater than the allowed minimum coding block size. For example, the size of the first code block may be 64*64, 32*32, 16*16, or other sizes allowed.

103. Perform sub-block partitioning on the first coding block.

For example, the first coding block may be partitioned into 4 sub-coded blocks or other numbers of sub-coded blocks.

The ROI of the video frame generally refers to a relatively sensitive area of the HVS in the video frame or an area that the HVS mainly focuses on. For example, the ROI of the video frame may be determined based on the region matching algorithm, and of course the ROI of the video frame may be determined according to a configuration instruction in the configuration file.

It can be understood that the ROI of different video frames may not be the same. For example, in video such as video conferences or news programs, HVS usually focuses on the face in the video frame, the central area of the video frame, etc., so the area of the face in the video video frame, the central area of the video frame, etc. Look at the ROI of the video frame. For example, for monitoring video frames and the like, HVS usually focuses on moving objects in video frames, so the area of the moving objects in these video frames can be regarded as the ROI of the video frame. For another example, in some game videos, HVS may be more concerned with the player area, so the player area in such video frames can be seen as the ROI of the video frame. Of course, a certain area in the video frame may also be designated as the ROI of the video frame, even if the designated area may not include a face and/or a moving object, for example, in some experimental scenarios or suspect monitoring scenes, it is necessary to focus on the video. Some specific areas in the frame, so these areas may be set to ROI. Of course, in actual applications, there may be cases where the ROI of the video frame is obtained by other means.

It can be understood that the region of interest of a video frame may be a continuous pixel area, or may be packaged. A plurality of non-contiguous pixel sub-regions are included.

The edge region of the video frame refers to an area in the video frame that includes edge pixels.

Among them, one of the methods for detecting edge pixel points in a video frame is to determine whether there is a sharp brightness change around a certain pixel point in the video frame. Wherein, if there is a sharp brightness change around a pixel in the video frame, the pixel is an edge pixel; otherwise, the pixel is not an edge pixel. Wherein, the entire video frame corresponds to a binary map of the same size. Among them, there are many detection algorithms for edge pixels. The main process includes: determining the edge detection operator; using the edge detection operator to filter the video frame, and post-processing the video frame after filtering.

The edge detection operator is a set of rules for calculating the current pixel point and the surrounding pixel points of the current pixel point. For example, the Sauber operator shown in Figure 1-b, the filtered pixel value of the current pixel is multiplied by the three coefficients of the first row and the three pixels below are multiplied by the third. The average of the sum of the three coefficients of the row; wherein, if the filtered value exceeds the threshold t0, the current pixel point is determined to be an edge pixel point, otherwise the current pixel point is determined not to be an edge pixel point.

Further, edge pixels detected in the video frame may be etched to remove isolated, sparse edge pixels in the video frame. Such isolated, sparse edge pixels present in video frames are generally considered to be due to video frame noise. Wherein, the etching treatment can also be performed by etching the template or by filtering. For example, the etching treatment shown in FIG. 1-d can be completed based on the 7*7 etching template (or other etching template) shown in FIG. 1-c. Among them, the left side of Figure 1-d is the state before the corrosion treatment, and the right side is the state after the corrosion treatment. The filtering process in the etching process is also multiplied by the surrounding pixel points of the current pixel point and the corresponding positions in the template. If the sum is greater than 0, the filtered value of the pixel point is taken as 1, indicating that the current pixel point is an edge pixel point; otherwise, the pixel point The filtered value takes 0, that is, it indicates that the current pixel point is not an edge pixel point.

It can be seen that, in the video coding process of this embodiment, after obtaining the subdivision area of the video frame, when determining that the first coding block of the video frame includes the pixel in the subdivision area, the first coding is performed. The block performs sub-block division. The subdivided area of the video frame includes at least one of an ROI of the video frame and an edge area of the video frame. That is to say, the relative positional relationship between the first coding block and the subdivision area of the video frame can determine to some extent whether the first coding block performs sub-block division under the current segmentation depth. Therefore, before and after partitioning before and after the calculation and comparison of the current coding block The above-mentioned technical solution of the present invention is advantageous for reducing the computational complexity of determining whether the current coded block performs sub-block segmentation at the current segmentation depth, thereby facilitating the conventional mechanism of determining whether to perform sub-block partitioning on the current coded block. Reduce the occupation of computing resources.

Optionally, in some possible implementation manners of the present disclosure, before the performing the sub-block partitioning on the first coding block, the method may further include: determining that a current segmentation depth of the first coding block is smaller than a first partition depth threshold, wherein the first partition depth threshold is less than or equal to a maximum allowed partition depth of the video frame. Wherein, the first segmentation depth threshold can be obtained from the configuration file.

Optionally, in some possible implementation manners of the present invention, the subdivided area of the video frame includes a region of interest of the video frame and an edge region of the video frame. The determining that the first coding block of the video frame includes the pixel in the subdivided region may include: determining that the first coding block of the video frame includes the region of interest and the edge region The pixels of the overlapping area. That is, when the first coding block of the video frame includes pixel points of the overlapping area of the region of interest and the edge region, the first coding block may be sub-block divided, in this case It is not necessary to refer to the magnitude relationship of the rate distortion before and after the first coding block division to determine whether to continue the sub-block division for the first coding block.

Optionally, in some possible implementation manners of the present disclosure, the subdivided area of the video frame includes a region of interest of the video frame and an edge region of the video frame, where the determining the video frame The first coding block including the pixel in the subdivided region may include: determining that the first coding block of the video frame includes a pixel in the region of interest and does not include an edge region of the video frame a pixel, or determining that the first encoded block of the video frame includes pixel points in an edge region of the video frame and does not include pixel points in the region of interest of the video frame. The method may further include: determining that a rate distortion cost of the first coding block is greater than or equal to a size of the first coding block after sub-block division. Rate distortion cost. That is, when the first coding block of the video frame includes pixel points of the region of interest and the non-overlapping region of the edge region, but the first coding block does not include the region of interest and the edge region The pixel points of the overlapping area, in this case, may further refer to the magnitude relationship of the rate distortion before and after the first coding block sub-block division to determine whether to continue the sub-block division for the first coding block.

Optionally, in some possible implementation manners of the present disclosure, the performing the first coding block Before the block segmentation, the method may further include: determining that the current segmentation depth of the first coded block is less than a second segmentation depth threshold, wherein the second segmentation depth threshold is less than or equal to the first segmentation depth threshold. That is, for example, when the first coding block of the video frame includes pixel points of the region of interest and the non-overlapping region of the edge region, but the first coding block does not include the region of interest and a pixel point of the overlapping area of the edge region, in this case, further determining whether the first coding block continues to perform sub-block division by referring to a size relationship between the current segmentation depth of the first coding block and the second segmentation depth threshold . For example, the segmentation depth of the first coded block may be limited to within the second segmentation depth threshold. Wherein, the second segmentation depth threshold can be obtained from the configuration file. The second segmentation depth threshold may be less than or equal to the first segmentation depth threshold, and the size of the second segmentation depth threshold may be set according to specific needs.

Optionally, in some possible implementation manners of the disclosure, the subdivided area of the video frame includes a region of interest of the video frame and an edge region of the video frame. The method may further include: determining that the second coded block of the video frame includes a pixel point in the region of interest and does not include a pixel point in an edge region of the video frame, or determining a number of the video frame The second coding block includes pixel points in an edge region of the video frame and does not include pixel points in a region of interest of the video frame; determining that a rate distortion cost of the second coding block is less than or equal to the second encoding The block performs rate distortion cost after sub-block division; and determines that the second coding block does not perform sub-block division. That is, when the second coding block of the video frame includes pixel points of the region of interest and the non-overlapping region of the edge region, but the second coding block does not include the region of interest and the edge region The pixel points of the overlapping area, in this case, may further refer to the magnitude relationship of the rate distortion before and after the second coding block sub-block division to determine whether to continue the sub-block division for the second coding block.

Wherein the second coding block of the video frame may refer to any one of the video frames.

The current segmentation depth of the second coding block may be an arbitrary segmentation depth smaller than the maximum allowed segmentation depth of the video frame, that is, the size of the second coding block may be any size larger than the allowed minimum coding block size. For example, the size of the second code block may be 64*64, 32*32, 16*16, or other sizes allowed.

Optionally, in some possible implementation manners of the present disclosure, the method may further include: determining that a third coding block of the video frame does not include a pixel in the subdivided region; determining the third encoding Piece Sub-block splitting is not performed. That is, when the third coding block of the video frame does not include the pixel in the subdivided region, it may be considered that the third coding block does not perform sub-block division. In this case, the third reference may be omitted. A condition such as the magnitude relationship of the rate distortion before and after the block sub-block division is determined to determine whether to continue sub-block division for the third coded block.

Further, optionally, in some possible implementation manners of the present disclosure, before the determining that the third coding block does not perform sub-block division, the method further includes: determining that a current segmentation depth of the third coding block is greater than or Is equal to a third partition depth threshold, wherein the third partition depth threshold is less than or equal to a maximum allowed partition depth of the coding block. That is, when the third coding block does not include the pixel points in the subdivision area, the division depth of the third coding block may be limited to the third division depth threshold. A third split depth threshold can be obtained from the configuration file. The third segmentation depth threshold may be less than or equal to the first segmentation depth threshold. The third segmentation depth threshold may be, for example, less than or equal to the second segmentation depth threshold. The size of the third segmentation depth threshold can be set according to specific needs.

Still further optionally, in some possible implementation manners of the present disclosure, before the determining that the third coding block does not perform sub-block segmentation, the method may further include: determining a rate distortion cost of the third coding block. a less than or equal to a rate distortion cost of the third coding block after the sub-block division; determining that the current segmentation depth of the third coding block is smaller than a third segmentation depth threshold, wherein the third segmentation depth threshold is less than or equal to the coding The maximum allowable split depth of the block. That is, when the third coding block does not include the pixel in the subdivided region, in this case, the third coding may be determined by referring to conditions such as the relationship between the third coding block before and after the division. The block continues with sub-block partitioning.

Wherein the third coding block of the video frame may refer to any one of the video frames.

The current segmentation depth of the third coding block may be an arbitrary segmentation depth smaller than the maximum allowed segmentation depth of the video frame, that is, the size of the third coding block may be any size larger than the allowed minimum coding block size. For example, the size of the third code block may be 64*64, 32*32, 16*16, or other sizes allowed.

Optionally, in some possible implementation manners of the present disclosure, the method may further include: determining that a fourth coding block of the video frame does not include a pixel in the subdivided region; determining the fourth coding block The rate distortion penalty is greater than the rate distortion cost of the fourth code block after sub-block segmentation. The third segmentation depth threshold is less than or equal to a maximum allowable segmentation depth of the video frame; determining the fourth series The code block performs sub-block division. Further, before determining that the fourth coding block performs sub-block division, the current segmentation depth of the fourth coding block may be determined to be smaller than a third segmentation depth threshold. Wherein, the third segmentation depth threshold can be obtained from the configuration file. The third segmentation depth threshold may be less than or equal to the first segmentation depth threshold. The third segmentation depth threshold may be, for example, less than or equal to the second segmentation depth threshold. The size of the third segmentation depth threshold can be set according to specific needs.

Wherein the fourth coding block of the video frame may refer to any one of the video frames.

The current segmentation depth of the fourth coding block may be an arbitrary segmentation depth smaller than the maximum allowed segmentation depth of the video frame, that is, the size of the fourth coding block may be any size larger than the allowed minimum coding block size. For example, the size of the fourth coding block may be 64*64, 32*32, 16*16, or other sizes allowed.

It can be understood that the maximum allowed segmentation depth of the video frame refers to the segmentation depth corresponding to the minimum allowable coding block. That is to say, the maximum allowable segmentation depth of a video frame refers to the number of divisions required to divide the allowable maximum size coded block into the allowable minimum size coded block. The current segmentation depth refers to the number of segments required to divide the maximum allowable coding block into the current coded block.

Assuming that the maximum size allowed by the coding block is 64*64, and the minimum size allowed for the coding block is 8*8, if the current coding block size is 64*64, the current coding depth of the current coding block is 0; if the current coding block is The size is 32*32, the current segmentation depth of the current coding block is 1; if the current coding block size is 16*16, the current segmentation depth of the current coding block is 2; if the current coding block size is 8*8, then The current segmentation depth of the current coded block is 3. In this scenario, the maximum allowable segmentation depth of the video frame in this scenario is 3 because the coded block of size 64*64 is divided into 8×8 coded blocks.

Assuming that the maximum size allowed by the coding block is 64*64, and the minimum size allowed for the coding block is 2*2, if the current coding block size is 64*64, the current coding depth of the current coding block is 0; if the current coding block is If the size of the current coding block is 8*8, the current coding block has a current partition depth of 1; if the current coding block has a size of 16*16, the current coding block has a current division depth of 2; The current segmentation depth of the current coding block is 3; similarly, if the current coding block size is 4*4, the current coding block has a current segmentation depth of 4; if the current coding block has a size of 2*2, the current coding The current split depth of the block is 5. In this scenario, the coded block of size 64*64 is divided into 2*2 sizes. The code block needs to be split 5 times. Therefore, the maximum allowable segmentation depth of the video frame in this scenario is 5. It can be understood that the maximum size and the minimum size allowed by the coding block may also be other values. In the corresponding scenario, the maximum allowable segmentation depth and the current segmentation depth may be determined in a similar manner.

When the coding block sub-block is divided, the current coding block may be divided into four equal-sized sub-coded blocks. For example, a coding block of size 64*64 may be divided into four coding blocks of size 32*32, for example, The coded block of size 16*16 is divided into four coded blocks of size 8*8.

It can be understood that, since the first segmentation depth threshold is less than or equal to the maximum allowable segmentation depth of the video frame, if the maximum allowable segmentation depth is 5, the first segmentation depth threshold may be 5, 4, 3, or 2, and the like; If the maximum allowable segmentation depth is 3, the first segmentation depth threshold may be 3, 2, or 1, and the like, and other similar scenarios may be deduced. The first segmentation depth threshold is greater than or equal to the second segmentation depth threshold. The first segmentation depth threshold is greater than or equal to the third segmentation depth threshold. Therefore, after the first segmentation depth threshold is determined, the second segmentation depth threshold or the third segment may be selected within a range of values less than or equal to the first segmentation depth threshold. The specific value of the depth threshold. For example, when the first segmentation depth threshold is 3, the specific value of the second segmentation depth threshold may be 2, and the specific value of the third segmentation depth threshold may be 2 or 1. For example, when the first segmentation depth threshold is 5, the specific value of the second segmentation depth threshold may be 4 or 3. The specific value of the third segmentation depth threshold may be 3 or 2 or 1.

It can be understood that, in some scenarios, when the third segmentation depth threshold is equal to the second segmentation depth threshold, the third segmentation depth threshold and the second segmentation depth threshold may be regarded as the same segmentation depth threshold. Similarly, when the second segmentation depth threshold, the first segmentation depth threshold, and the third segmentation depth threshold are equal, the second segmentation depth threshold, the first segmentation depth threshold, and the third segmentation depth threshold may also be regarded as the same segmentation depth. Threshold. That is to say, when some segmentation depth thresholds are equal, the equal segmentation depth thresholds can be regarded as the same segmentation depth threshold.

To facilitate a better understanding and implementation of the foregoing solution of the embodiments of the present invention, the following is exemplified by some specific application scenarios.

Referring to FIG. 2, FIG. 2 is a schematic flowchart diagram of another method for processing a block division in video coding according to another embodiment of the present invention. In the embodiment corresponding to FIG. 2, the relative positional relationship between the first coding block and the subdivided region, the first segmentation depth threshold, the second segmentation depth threshold, and the third segmentation depth threshold are mainly referenced to determine the coding block. Split processing method.

The block division processing method in another video coding provided by another embodiment of the present invention may include:

201. Obtain a subdivision area of the video frame.

The subdivided area of the video frame includes a region of interest of the video frame and an edge region of the video frame.

The specific manner of obtaining the subdivided area of the image may be various. For details, refer to the description of step 101. This embodiment is not described in detail herein.

202. Determine whether a current segmentation depth of the first coding block of the video frame is smaller than a maximum allowed segmentation depth of the video frame.

If yes, go to step 203.

If no, step 208 is performed.

The first coding block of the video frame may refer to any one of the video frames.

The current segmentation depth of the first coding block may be any segmentation depth less than or equal to the maximum allowed segmentation depth of the video frame, that is, the size of the first coding block may be any greater than or equal to the minimum coding block size allowed. size. For example, the size of the first code block may be 64*64, 32*32, 16*16 or other sizes allowed.

203. Determine whether a first coding block of the video frame includes a pixel point in the subdivided region.

If yes, go to step 204.

If no, step 207 is performed.

204. Determine whether a first coding block of the video frame includes a pixel in an overlap region of a region of interest and an edge region of the video frame.

If yes, step 205 is performed.

If no, step 206 is performed.

205. Determine whether a current segmentation depth of the first coding block of the video frame is smaller than a first segmentation depth threshold.

Here, the first segmentation depth threshold is smaller than the maximum allowable segmentation depth of the video frame, and therefore, the specific value of the first segmentation depth threshold may be selected within a range of values smaller than the maximum allowable segmentation depth of the video frame. For example, when the maximum allowable segmentation depth of the video frame is 3, the specificity of the first segmentation depth threshold is specific. The value can be 2. For another example, when the maximum allowable segmentation depth is 5, the specific value of the first segmentation depth threshold may be 4 or 3 or 2 or the like.

If yes, go to step 209.

If no, step 208 is performed.

206. Determine whether a current segmentation depth of the first coding block is smaller than a second segmentation depth threshold.

If no, step 210 is performed.

If yes, step 208 is performed.

The second segmentation depth threshold is mainly used to limit pixel points in the video frame that do not include the region of interest and the edge region of the video frame, but includes the region of interest and the edge of the video frame. The depth of division of the coded block of the pixel in the non-overlapping region of the region.

The second segmentation depth threshold is smaller than the maximum allowable segmentation depth of the video frame. Therefore, the specific value of the second segmentation depth threshold may be selected within a range of values smaller than the maximum allowable segmentation depth of the video frame. For example, when the maximum allowable segmentation depth of the video frame is 3, the specific value of the second segmentation depth threshold may be 2 or 1. For another example, when the first segmentation depth threshold is 5, the specific value of the second segmentation depth threshold may be 4 or 3 or 2 or 1 or the like.

207. Determine whether a current segmentation depth of the first coding block is smaller than a third segmentation depth threshold.

If no, step 210 is performed.

If yes, step 208 is performed.

The third segmentation depth threshold is mainly used to limit the segmentation depth of the coding block of the pixel in the video frame that does not include the subdivision region.

The third segmentation depth threshold is smaller than the maximum allowable segmentation depth of the video frame. Therefore, the specific value of the third segmentation depth threshold may be selected within a range of values smaller than the maximum allowable segmentation depth of the video frame. For example, when the maximum allowable segmentation depth of the video frame is 3, the specific value of the third segmentation depth threshold may be 2 or 1. For another example, when the first segmentation depth threshold is 5, the specific value of the third segmentation depth threshold may be 4 or 3 or 2 or 1 or the like. The third segmentation depth threshold is less than or equal to the second segmentation depth.

208. Determine whether a rate distortion cost of the first coding block is greater than a rate distortion cost of the first coding block after sub-block division.

If yes, go to step 209.

If no, step 210 is performed.

209. Perform sub-block partitioning on the first coding block.

210. Determine to not perform sub-block partitioning on the first coding block.

It can be understood that for each coding block in the video frame, the sub-block division processing can be performed by way of example in FIG. For example, the coding block CU-1 in the video frame may be subjected to sub-block division processing according to the example of FIG. 2, and it is assumed that the coding block CU-1 is divided into four coding blocks, which are CU-11, CU-12, CU-, respectively. 13 and CU-14, then for each coding block in CU-11, CU-12, CU-13, and CU-14, sub-block division processing may also be performed according to the example of FIG. 2, for example, CU-11 may be used. Cut into 4 coded blocks, and CU-12 may no longer perform sub-block splitting, and so on.

It can be seen that, in the video encoding process of this embodiment, after acquiring the subdivided area of the video frame, when determining that the first coding block of the video frame includes the pixel in the overlapping area of the region of interest and the edge region of the video frame, and The current segmentation depth of a coding block is smaller than the first segmentation depth threshold of the video frame, and the first coding block is subjected to sub-block segmentation. That is to say, the relative positional relationship between the first coding block of the video frame and the overlapping area of the region of interest and the edge region can determine to some extent whether the first coding block performs sub-block segmentation under the current segmentation depth. Therefore, the above technical solution of the present embodiment is advantageous in reducing the conventional mechanism of determining whether to continue sub-block division for the current coding block by calculating and comparing the current coding block before and after the division. Determining whether the current coded block performs the computational complexity of the sub-block segmentation under the current segmentation depth, thereby facilitating the reduction of the occupation of the relevant computing resources.

Referring to FIG. 3, FIG. 3 is a schematic flowchart diagram of another method for processing a block division in video coding according to another embodiment of the present invention. In the embodiment corresponding to FIG. 3, the relative positional relationship between the first coding block and the subdivided region and the first segmentation depth threshold are mainly referred to (the first segmentation depth threshold is equal to the maximum allowable segmentation depth of the video frame in this embodiment). For example, the third segmentation depth threshold or the like is used to determine the segmentation processing mode of the coding block.

The block segmentation processing method in another video coding provided by another embodiment of the present invention may include:

301. Obtain a subdivision area of the video frame.

The subdivided area of the video frame includes a region of interest of the video frame and the video frame The edge area.

302. Determine whether a current segmentation depth of the first coding block of the video frame is smaller than a maximum allowed segmentation depth of the video frame.

If yes, go to step 303.

If no, step 308 is performed.

303. Determine whether a first coding block of the video frame includes a pixel point in the subdivided region.

If yes, go to step 304.

If no, step 305 is performed.

304. Determine whether a first coding block of the video frame includes a pixel in an overlap region of a region of interest and an edge region of the video frame.

If yes, go to step 307.

If no, step 306 is performed.

305. Determine whether a current segmentation depth of the first coding block is smaller than a third segmentation depth threshold.

If no, step 308 is performed.

If yes, go to step 306.

The third segmentation depth threshold is smaller than the maximum allowable segmentation depth of the video frame. Therefore, the specific value of the third segmentation depth threshold may be selected within a range of values smaller than the maximum allowable segmentation depth of the video frame. For example, when the maximum allowable segmentation depth of the video frame is 3, the specific value of the third segmentation depth threshold may be 2 or 1. For another example, when the first segmentation depth threshold is 5, the specific value of the third segmentation depth threshold may be 4 or 3 or 2 or 1 or the like.

306. Determine whether a rate distortion cost of the first coding block is greater than the first coding block. Rate distortion cost after block segmentation.

If yes, go to step 307.

If no, step 308 is performed.

307. Perform sub-block partitioning on the first coding block.

308. Determine to not perform sub-block splitting on the first coding block.

It can be understood that for each coding block in the video frame, the sub-block division processing can be performed by way of example in FIG.

It can be seen that, in the video encoding process of this embodiment, after acquiring the subdivided area of the video frame, when determining that the first coding block of the video frame includes the pixel in the overlapping area of the region of interest and the edge region of the video frame, and The current segmentation depth of a coding block is smaller than the maximum allowed segmentation depth of the video frame, and the first coding block is subjected to sub-block segmentation, that is, between the first coding block of the video frame and the overlapping region of the region of interest and the edge region. The relative positional relationship may determine whether the first coding block performs sub-block segmentation at the current segmentation depth to some extent, compared to completely calculating and comparing the rate distortion of the current coding block before and after the division. The foregoing technical solution of the present embodiment is advantageous for reducing the computational complexity of determining whether the current coding block performs sub-block segmentation at the current segmentation depth, thereby facilitating reduction of correlation. Calculate the occupation of resources.

Referring to FIG. 4, FIG. 4 is a schematic flowchart diagram of a block segmentation processing method in video coding according to another embodiment of the present invention. In the embodiment corresponding to FIG. 4, the relative positional relationship between the first coding block and the subdivided region and the first segmentation depth threshold are mainly referred to (in the embodiment, the first segmentation depth threshold is equal to the maximum allowable segmentation depth of the video frame). For example, the third segmentation depth threshold or the like is used to determine the segmentation processing mode of the coding block. For example, as shown in FIG. 4, a block segmentation processing method in video coding provided by another embodiment of the present invention may include:

401. Obtain a subdivision area of the video frame.

The subdivided area of the video frame includes at least one of a region of interest of the video frame and an edge region of the video frame.

The specific manner of obtaining the subdivided area of the video frame may be various. For details, refer to the description of step 101, which is not detailed herein.

402. Determine whether a current segmentation depth of the first coding block of the video frame is smaller than a maximum allowed segmentation depth of the video frame.

If yes, go to step 403.

If no, step 407 is performed.

403. Determine whether a first coding block of the video frame includes a pixel point in the subdivided region.

If no, step 404 is performed.

If yes, step 406 is performed.

The first coding block of the video frame may refer to one of the coding blocks or any one of the coding blocks. The size of the first code block may be the maximum size allowed (for example, 64*64), or the next largest size allowed (for example, 32*32), or the next large size allowed (for example, 16*16), or Other sizes allowed.

404. Determine whether a current segmentation depth of the first coding block is smaller than a third segmentation depth threshold.

If yes, go to step 405.

If no, step 407 is performed.

The third segmentation depth threshold is smaller than the maximum allowable segmentation depth of the video frame. Therefore, the specific value of the third segmentation depth threshold may be selected within a range of values less than or equal to the maximum allowable segmentation depth of the video frame. For example, when the maximum allowable segmentation depth of the video frame is 3, the specific value of the third segmentation depth threshold may be 2 or 1. For another example, when the first segmentation depth threshold is 5, the specific value of the third segmentation depth threshold may be 4 or 3 or 2 or 1 or the like.

405. Determine whether a rate distortion cost of the first coding block is greater than a rate distortion cost of the first coding block after performing sub-block division.

If yes, step 406 is performed.

If no, step 407 is performed.

406. Perform sub-block partitioning on the first coding block.

407. Determine to not perform sub-block splitting on the first coding block.

It can be seen that, after acquiring the subdivided area of the video frame in the video encoding process of this embodiment, when determining that the first coding block of the video frame includes the pixel in the subdivided area, and the first coding block is currently segmented The depth is smaller than the maximum allowed segmentation depth of the video frame, and the first coded block is subjected to sub-block segmentation. That is to say, the relative positional relationship between the first coding block and the subdivided region of the video frame can determine to some extent whether the first coding block performs sub-block segmentation under the current segmentation depth, so The above technical solution of the present embodiment is advantageous for reducing the current coding block at the current state by determining the conventional mechanism for determining whether to continue the sub-block division for the current coding block before and after the current coding block partitioning and before and after the division. Whether the computational complexity of sub-block segmentation is performed under the segmentation depth is beneficial to reduce the occupation of related computing resources.

Referring to FIG. 5, FIG. 5 is a schematic flowchart diagram of a block segmentation processing method in video coding according to another embodiment of the present invention. In the embodiment corresponding to FIG. 5, the relative positional relationship between the first coding block and the subdivision area and the first segmentation depth threshold are mainly referred to. In this embodiment, the first segmentation depth threshold is equal to the maximum allowable segmentation depth of the video frame. For example, the rate distortion cost, etc., to determine the segmentation processing mode of the coding block. For example, as shown in FIG. 5, a block segmentation processing method in video coding provided by another embodiment of the present invention may include:

501. Obtain a subdivision area of the video frame.

502. Determine whether a current segmentation depth of the first coding block of the video frame is smaller than a maximum allowed segmentation depth of the video frame.

If yes, go to step 503.

If no, step 506 is performed.

503. Determine whether a first coding block of the video frame includes a pixel in a region of interest of the video frame and an overlap region of the edge region.

If yes, go to step 505.

If no, step 504 is performed.

504. Determine whether a rate distortion cost of the first coding block is greater than a rate distortion cost of the first coding block after sub-block division.

If yes, go to step 505.

If no, step 506 is performed.

505. Perform sub-block partitioning on the first coding block.

506. Determine to not perform sub-block partitioning on the first coding block.

It can be understood that for each coding block in the video frame, the sub-block division processing can be performed as exemplified in FIG. 5.

It can be seen that, after acquiring the subdivided area of the video frame in the video encoding process of this embodiment, when determining that the first coding block of the video frame includes the pixel in the overlapping area of the region of interest and the edge region of the video frame, Point, and the current coding depth of the first coding block is smaller than the maximum allowed division depth of the video frame, and the first coding block is subjected to sub-block division, and when the first coding block does not include the region of interest and the edge of the video frame For the pixel points in the overlapping region of the region, the size relationship of the rate distortion cost before and after the first coding block segmentation may be referenced to determine whether the first coding block continues to perform the sub-block segmentation. That is to say, the relative positional relationship between the first coding block of the video frame and the overlapping area of the region of interest and the edge region can determine to some extent whether the first coding block performs sub-block segmentation under the current segmentation depth. The above technical solution of the present embodiment is advantageous for reducing the current determination, compared to the conventional mechanism for determining whether to continue sub-block division for the current coding block by calculating and comparing the current coding block before and after the division. Whether the coding block performs sub-block division under the current segmentation depth Calculating the complexity, which in turn helps to reduce the occupation of related computing resources.

Referring to FIG. 6, FIG. 6 is a schematic flowchart diagram of a block segmentation processing method in video coding according to another embodiment of the present invention. In the embodiment corresponding to FIG. 6 , the relative positional relationship between the first coding block and the subdivision area and the first segmentation depth threshold are mainly referred to (the first segmentation depth threshold is equal to the maximum allowable segmentation depth of the video frame in this embodiment). For example, the rate distortion cost, etc., to determine the segmentation processing mode of the coding block. For example, as shown in FIG. 6 , a block segmentation processing method in video coding provided by another embodiment of the present invention may include:

601. Obtain a subdivision area of the video frame.

The subdivided area of the video frame includes an edge area or a region of interest of the video frame.

602. Determine whether a current segmentation depth of the first coding block of the video frame is smaller than a maximum allowed segmentation depth of the video frame.

If yes, step 603 is performed.

If no, step 606 is performed.

603. Determine whether a first coding block of the video frame includes a pixel point in a subdivided region of the video frame.

If yes, step 605 is performed.

If no, step 604 is performed.

604. Determine whether a rate distortion cost of the first coding block is greater than a rate distortion cost of the first coding block after sub-block division.

If yes, step 605 is performed.

If no, step 606 is performed.

605. Perform sub-block partitioning on the first coding block.

606. Determine to not perform sub-block splitting on the first coding block.

It can be understood that for each coding block in the video frame, the sub-block division processing can be performed as exemplified in FIG. 6.

It can be seen that, after obtaining the edge region or the region of interest of the video frame in the video encoding process of this embodiment, when determining that the first coding block of the video frame includes an edge region of the video frame or a pixel in the region of interest a point, and the current coding depth of the first coding block is smaller than a maximum allowed division depth of the video frame, the first coding block may be subjected to sub-block division, when the first coding block does not include an edge region of the video frame or is interested For the pixel points in the region, the size relationship of the rate distortion cost before and after the first coding block segmentation may be referenced to determine whether the first code block continues to perform the sub-block segmentation. That is to say, the relative positional relationship between the first coding block and the edge region or the region of interest may determine to some extent whether the first coded block performs sub-block segmentation at the current segmentation depth, which is compared to complete The technical solution of the present embodiment is beneficial to reduce the current coding block at the current segmentation depth by calculating and comparing the current rate of the current coding block before and after the division to determine whether to continue the sub-block division of the current coding block. Whether the computational complexity of sub-block segmentation is performed, thereby reducing the occupation of related computing resources.

Referring to FIG. 7, FIG. 7 is a schematic flowchart diagram of another method for processing a block division in video coding according to another embodiment of the present invention. In the embodiment corresponding to FIG. 7 , the relative positional relationship between the first coding block and the subdivision area and the first segmentation depth threshold are mainly referred to. In this embodiment, the first segmentation depth threshold is equal to the maximum allowable segmentation depth of the video frame. For example, the third segmentation depth threshold or the like is used to determine the segmentation processing mode of the coding block. The block segmentation processing method in another video coding provided by another embodiment of the present invention may include:

701. Obtain a subdivision area of the video frame.

702. Determine whether a current segmentation depth of the first coding block of the video frame is smaller than a video frame. Large allow for segmentation depth.

If yes, go to step 703.

If no, step 708 is performed.

703. Determine whether a first coding block of the video frame includes a pixel point in the subdivided region.

If yes, step 704 is performed.

If no, step 705 is performed.

704. Determine whether a first coding block of the video frame includes a pixel point in an edge region of the video frame.

If yes, go to step 707.

If no, step 706 is performed.

705. Determine whether a current segmentation depth of the first coding block is smaller than a third segmentation depth threshold.

If no, step 708 is performed.

If yes, step 706 is performed.

706. Determine whether a rate distortion cost of the first coding block is greater than a rate distortion cost of the first coding block after sub-block division.

If yes, go to step 707.

If no, step 708 is performed.

707. Perform sub-block partitioning on the first coding block.

708. Determine to not perform sub-block partitioning on the first coding block.

It can be understood that for each coding block in the video frame, the sub-block division processing can be performed as exemplified in FIG.

It can be seen that, in the video coding process of the embodiment, after acquiring the subdivided area of the video frame, when determining that the first coding block of the video frame includes a pixel in an edge region of the video frame, and the first coding If the current segmentation depth of the block is smaller than the maximum allowable segmentation depth of the video frame, the first coded block may be divided into sub-blocks, and when the first coded block does not include pixels in the edge region of the video frame, reference may be made. The condition of the rate distortion cost before and after the first coding block is divided to determine whether the first coding block continues to perform sub-block division. That is to say, the relative positional relationship between the first coding block and the edge region of the video frame can determine to some extent whether the first coding block performs sub-block segmentation under the current segmentation depth, which is compared to completely The foregoing technical solution of the present embodiment helps to reduce the current coding block at the current segmentation depth, by calculating and comparing the current rate of the current coding block before and after the division of the coding block to determine whether to continue the sub-block division of the current coding block. Whether the computational complexity of sub-block segmentation is performed, thereby reducing the occupation of related computing resources.

Referring to FIG. 8-a, FIG. 8-a is a schematic flowchart of another method for processing a block division in video coding according to another embodiment of the present invention. For example, as shown in FIG. 8-a, another method for processing a block division in video coding according to another embodiment of the present invention may include:

801. Obtain a subdivision area of the first video frame.

The subdivided area of the first video frame includes a region of interest of the first video frame and an edge region of the first video frame. The first image group may be any one of the image groups in the code stream. The first video frame belongs to the first image group (English, group of pictures, abbreviated: GOP).

The specific manner of obtaining the subdivision area of the first video frame may be various. In some possible implementation manners of the present invention, the first video frame may be matched by using a region matching algorithm to obtain a subdivided region of the first video frame, for example, matching the first video frame based on the region matching algorithm. The process, for example, can identify which regions in the first video frame are subdivided regions and which regions are not subdivided regions. Alternatively, the subdivision area of the first video frame may be obtained according to the configuration instruction in the configuration file. For example, the configuration instruction in the configuration file may specifically specify which areas of the first video frame are subdivision areas, and the first video frame. Which areas are not subdivisions. Of course, other The method obtains a subdivision area of the first video frame.

802. Generate a saliency map corresponding to the first video frame.

The algorithm for generating the Saliency map is intended to output a block diagram such as the one shown in the right figure in FIG. 8-b. The left side of FIG. 8-b is the original picture, and the right side of FIG. 8-b is the left picture. Saliency map. Each color in the right image in Figure 8-b represents a weight value. For example, the pure white coding block in the saliency map has the highest weight value, for example, 5, and the pure black coding block in the saliency map has the lowest weight value, for example, 1.

In the H.264 standard, the size of a macroblock is determined (16×16), and each macroblock in the saliency map corresponds to a weight value; in the H.265 standard, the size of the CU is variable. Each of the smallest CU-covered pixel regions (eg, 8×8) in the significance map corresponds to one weight value, and the weight value corresponding to the pixel region covered by the CU may be weighted by the weight value corresponding to any CU of the determined size. The value is calculated. For example, the weight value of a 16×16-sized CU can be averaged by the weight values corresponding to the four 8×8 pixel regions covered by it.

The method for determining the weight value of each coding block on the significance map may be various. For example, the detected person area may be defined as an ROI. For example, the weight value of the coded block including the pixel points in the ROI may be set to 2, and the coded block weight value of the pixel point not including the ROI may be set to 1.

Alternatively, another way may be used to determine the weight value of each code block on the significance map. Specifically, for example, according to prior knowledge of the position of the eyes, nose, and mouth in the face, combined with the area information of the face, the coded block including the pixels of the eyes, nose, and mouth is obtained as the highest weight, and the pixel points including the edge of the face are included. The weight of the coded block is second, and the other face flat areas are again, and other areas outside the face can be assigned the lowest weight.

803. Allocate a target number of bits for each video frame in the first image group.

First, the number of bits T _G used by the first image group is calculated by the formula (1) according to the code rate, the frame rate, and the number of video frames in the first image group.

In the formula (1), S represents a code rate, N represents the number of frames in the GOP, and fps represents a frame rate.

Then, the number of bits T _f allocated to each video frame in the first image group is calculated according to T _G by equation (2), where w _I is the weight corresponding to the I frame in the first image group, w _B Is the weight corresponding to the B frame in the first image group, w _P is the weight corresponding to the P frame in the first image group, N _I is the number of I frames in the first image group, and N _B is the first image The number of B frames in the group, N _P is the number of P frames in the first image group.

T _f =T _G /(w _I ×N _I +w _P ×N _P +w _B ×N _B )×w _I (2)

804. Sub-block partitioning is performed on each coding block in the first video frame.

The sub-block division may be performed on each coding block in the first video frame according to the block division processing method in any one of the video coding examples exemplified in the foregoing method embodiments. Of course, the manner in which each coding block of another video frame in the first picture group performs sub-block division may be similar.

Among them, the parameter RDCost may be used in the process of sub-block segmentation. The calculation of RDCost may use the quantization parameter corresponding to the first video frame (English: quantization parameter, QP). For example, the QP of the first video frame that can be specified, for example, specifies QP=30. The QP of the first video frame may also be predicted according to the context. Specifically, for example, the quantization parameter QP(j) of the jth video frame in one GOP may be calculated according to formula (3).

Wherein, MAD _pred (j) in the formula (3) represents the maximum absolute difference before and after encoding of the jth video frame in the predicted GOP. Among them, because the jth video frame is not currently encoded, it is difficult to obtain an accurate mean absolute difference (English: mean absolute difference, abbreviation: MAD), which can predict the MAD. The MAD prediction generally uses the MAD of the previous coded frame (ie, the j-1th frame). MAD represents the complexity of the video frame. In the formula (3), T(j) represents the target bit rate. Under the condition that the target bit rate T(j) is constant, the larger the MAD is, the larger the QP is, which means that the more complicated the video frame is, The worse the details of the encoded video frame. Npixel(j) represents the total number of pixels included in the jth video frame, α and β are adjustment parameters, and α and β are usually set to 1. QP(j) represents the QP of the jth video frame.

805. Calculate a QP of each CU according to a target bit number corresponding to each CU in the first video frame and a weight value of each CU calculated by the saliency map.

For example, the QP of each CU can be calculated based on the formula (4).

Where k in equation (4) represents the kth CU, w _i represents the weight value of the i th CU, and T _i (j) represents the remaining target code rate of the jth video frame when encoding the i th CU, where , N represents the number of uncoded CUs of the jth video frame, and N _pixel,i (j) represents the number of pixels of the i th CU. Where QP _k (j) represents the _kth QP in the jth video frame. MAD _pred,k (j) represents the maximum absolute difference before and after the kth CU encoding of the jth video frame in the predicted GOP.

806. Encode the CU according to the QP of each CU of the first video frame.

The CU coding mainly includes quantization of residuals and entropy coding of coding modes, parameters, and quantized residuals.

It can be seen that, in the video coding process of this embodiment, after acquiring the subdivided area of the video frame, determining whether the current coding block continues to perform the sub-block according to the relative positional relationship between the current coding block and the edge area of the video frame, and the like. segmentation. That is to say, the relative positional relationship between the current coding block and the edge region of the video frame can determine to some extent whether the current coding block performs sub-block segmentation at the current segmentation depth, which is compared to completely calculating and comparing the current The foregoing technical solution of the present embodiment helps to determine whether to determine whether the current coding block performs sub-blocks at the current segmentation depth, and the conventional mechanism for determining whether to perform the sub-block division on the current coding block before and after the block division. The computational complexity of the segmentation, which in turn helps to reduce the occupation of related computing resources.

In order to facilitate the implementation of the above solution of the embodiments of the present invention, some related devices for implementing the above solutions are also provided below.

Referring to FIG. 9, an embodiment of the present invention provides a block segmentation processing apparatus 900 in video coding, which may include: an obtaining unit 910, a determining unit 920, and a dividing unit 930.

The obtaining unit 910 is configured to acquire a subdivision area of the video frame,

The manner in which the obtaining unit 910 obtains the subdivided area of the video frame may be any one of the foregoing embodiments.

The determining unit 920 is configured to determine that the first coding block of the video frame includes pixel points in the subdivided region.

The dividing unit 930 is configured to perform sub-block segmentation on the first coding block.

In some possible implementation manners of the present disclosure, the determining unit 920 is further configured to: before the performing the sub-block partitioning on the first coding block, determining that a current segmentation depth of the first coding block is smaller than a first segmentation depth a threshold, wherein the first partition depth threshold is less than or equal to a maximum allowed partition depth of the video frame.

In some possible implementations of the present invention, the subdivided region of the video frame includes a region of interest of the video frame and an edge region of the video frame; wherein, in the determining the first of the video frame The encoding block includes an aspect of the pixel in the subdivided region, and the determining unit 920 is specifically configured to determine that the first encoding block of the video frame includes the pixel of the overlapping region of the region of interest and the edge region .

In some possible implementations of the present invention, the subdivided region of the video frame includes a region of interest of the video frame and an edge region of the video frame; wherein the determining the first encoding of the video frame The block includes an aspect of the pixel in the subdivided region, and the determining unit 920 is specifically configured to: determine that the first coding block of the video frame includes a pixel in the region of interest and does not include the video frame a pixel point in the edge region, or determining that the first coded block of the video frame includes a pixel point in an edge region of the video frame and does not include a pixel point in the region of interest of the video frame;

The determining unit 920 is further configured to: before performing the sub-block partitioning on the first coding block, determining that a rate distortion cost of the first coding block is greater than a rate distortion of the first coding block after sub-block division cost.

In some possible implementation manners of the present disclosure, the determining unit 920 is further configured to: before performing the sub-block partitioning on the first coding block, determining that a current segmentation depth of the first coding block is smaller than a second segmentation depth threshold, The second segmentation depth threshold is smaller than the first segmentation depth threshold.

In some possible implementations of the present invention, the subdivided area of the video frame includes an area of interest of the video frame and an edge area of the video frame; the determining unit 920 is further configured to determine the video frame. a second coding block comprising pixel points in the region of interest and not including pixel points in an edge region of the video frame, or determining that a second coding block of the video frame includes an edge of the video frame a pixel in the edge region and not including a pixel in the region of interest of the video frame; determining that the rate distortion cost of the second encoding block is less than or equal to the rate distortion of the second encoding block after sub-block segmentation a cost; determining that the second coded block does not perform sub-block splitting.

In some possible implementation manners of the present invention, the determining unit 920 is further configured to: determine that the third coding block of the video frame does not include the pixel point in the subdivision area; and determine that the third coding block does not perform sub Block splitting.

In some possible implementation manners of the present disclosure, the determining unit 920 is further configured to: before determining that the third coding block does not perform sub-block segmentation, determine that a current segmentation depth of the third coded block is equal to a third segmentation depth threshold. Wherein the third partition depth threshold is less than a maximum allowed partition depth of the video frame.

In some possible implementation manners of the present invention, the determining unit 920 is further configured to: before determining that the third coding block does not perform sub-block segmentation, determine that a rate distortion cost of the third coding block is less than or equal to the first The rate-distortion cost of the sub-block partitioning by the three-coded block.

In some possible implementation manners of the present invention, the determining unit 920 is further configured to: determine that a fourth coding block of the video frame does not include a pixel point in the subdivided region; determine a rate distortion of the fourth coding block The cost is greater than the rate distortion cost of the fourth coding block after sub-block division;

The dividing unit 930 is further configured to perform sub-block segmentation on the fourth coding block.

In some possible implementation manners of the present invention, the determining unit 920 is further configured to: before the sub-block division of the fourth coding block, determine that a current segmentation depth of the fourth coding block is smaller than a third segmentation depth. a threshold; wherein the third partition depth threshold is less than a maximum allowed partition depth of the video frame.

It can be understood that the functions of the function modules of the block segmentation processing device 900 in the video coding of the embodiment may be specifically implemented according to the method in the foregoing method embodiment, and the specific implementation process may refer to the related description of the foregoing method embodiments. I will not repeat them here. The block division processing device 900 in video coding can be integrated in the video encoding device. The video encoding device can be any device that needs to collect, store, or transmit audio signals, such as a mobile phone, a tablet computer, a personal computer, a notebook computer, and the like.

It can be seen that the block segmentation processing apparatus 900 in the video coding of this embodiment is acquiring the fineness of the video frame. After the sub-region, when it is determined that the first coding block of the video frame includes pixel points in the subdivided region, the first coding block is subjected to sub-block division. The subdivided area of the video frame includes at least one of an ROI of the video frame and an edge area of the video frame. That is to say, the relative positional relationship between the first coding block and the subdivision area of the video frame can determine to some extent whether the first coding block performs sub-block division under the current segmentation depth. Therefore, the above technical solution of the present invention is advantageous for reducing the determination of the current coding block, compared to the conventional mechanism of determining whether to continue sub-block division for the current coding block by calculating and comparing the current coding block before and before the division. Whether the computational complexity of sub-block segmentation is performed under the current segmentation depth, thereby facilitating the reduction of the occupation of computing resources.

Referring to FIG. 10, FIG. 10 is a structural block diagram of a video encoding apparatus according to another embodiment of the present invention.

The video encoding apparatus 1000 may include at least one processor 1001, a memory 1005, and at least one communication bus 1002. Communication bus 1002 is used to implement connection communication between these components.

Optionally, the video encoding apparatus 1000 may further include: at least one network interface 1004, a user interface 1003, and the like. Optionally, the user interface 1003 includes a display (such as a touch screen, a liquid crystal display or a holographic image (English: Holographic) or a projection (English: Projector), etc.), and a click device (for example, a mouse, a trackball (English: trackball) touch) Board or touch screen, etc.), camera and / or pickup device.

The memory 1005 can include read only memory and random access memory and provides instructions and data to the processor 1001. A portion of the memory 1005 may also include a non-volatile random access memory.

In some possible implementations, the memory 1005 stores elements, executable modules or data structures, or a subset thereof, or their extended set: acquisition unit 910, determination unit 920, and partition unit 930.

In an embodiment of the present invention, the processor 1001 is configured to acquire a subdivided region of a video frame by executing a code or an instruction in the memory 1005, where the subdivided region of the video frame includes a region of interest of the video frame and At least one of edge regions of the video frame; determining that the first coding block of the video frame includes pixel points in the subdivision region; and performing sub-block segmentation on the first coding block.

In some possible implementation manners of the present invention, the processor 1001 is further configured to: before the sub-block division of the first coding block, determine that a current segmentation depth of the first coding block is smaller than a first segmentation depth threshold. Where the first partition depth threshold is less than or equal to a maximum allowable score of the video frame Cut depth.

In some possible implementations of the present invention, the subdivided area of the video frame includes a region of interest of the video frame and an edge region of the video frame; wherein the processor 1001 is configured to determine the video frame The first coding block includes pixel points of the region of interest and the overlapping region of the edge region.

In some possible implementations of the present invention, the subdivided area of the video frame includes a region of interest of the video frame and an edge region of the video frame; wherein the processor 1001 is configured to determine the video frame The first coded block includes pixel points in the region of interest and does not include pixel points in an edge region of the video frame, or determines that a first coded block of the video frame includes an edge region of the video frame Pixels and do not include pixels in the region of interest of the video frame;

The processor 1001 is further configured to: before performing the sub-block partitioning on the first coding block, determining that a rate distortion cost of the first coding block is greater than a rate after the first coding block performs sub-block division Distortion cost.

In some possible implementations of the present invention, the processor 1001 is further configured to: before performing the sub-block partitioning on the first coding block, determining that a current segmentation depth of the first coding block is smaller than a second segmentation depth threshold, The second segmentation depth threshold is less than the first segmentation depth threshold.

In some possible implementations of the present invention, the subdivided area of the video frame includes a region of interest of the video frame and an edge region of the video frame; wherein the processor 1001 is further configured to determine the A second coded block of the video frame includes a pixel in the region of interest and does not include a pixel in an edge region of the video frame, or determines that a second encoded block of the video frame includes an edge of the video frame a pixel in the region and not including a pixel in the region of interest of the video frame; determining that a rate distortion penalty of the second encoded block is less than or equal to a rate distortion penalty of the second encoded block for sub-block segmentation Determining that the second coding block does not perform sub-block division.

In some possible implementation manners of the present invention, the processor 1001 is further configured to: determine that a third coding block of the video frame does not include a pixel point in the subdivided region; and determine that the third coding block does not perform a sub- Block splitting.

In some possible implementation manners of the present invention, the processor 1001 is further configured to: before determining that the third coding block does not perform sub-block segmentation, determining that a current segmentation depth of the third coded block is equal to a third segmentation depth a threshold, wherein the third partition depth threshold is smaller than a maximum allowed segmentation of the video frame depth.

In some possible implementations of the present invention, the processor 1001 is further configured to: before determining that the third coding block does not perform sub-block segmentation, determining that a rate distortion cost of the third coding block is less than or equal to the The third coding block performs rate distortion cost after sub-block division.

In some possible implementation manners of the present invention, the processor 1001 is further configured to: determine that a fourth coding block of the video frame does not include a pixel point in the subdivision region; and determine a rate distortion cost of the fourth coding block. And a rate distortion cost after the sub-block division is performed by the fourth coding block; and the fourth coding block is subjected to sub-block division.

In some possible implementations of the present invention, the processor 1001 is further configured to: before the sub-block division of the fourth coding block, determine that a current segmentation depth of the fourth coding block is smaller than a third segmentation depth threshold. Wherein the third partition depth threshold is less than a maximum allowed partition depth of the video frame.

It is to be understood that the functions of the functional modules of the video encoding apparatus 1000 of the present embodiment may be specifically implemented according to the method in the foregoing method embodiments, and the specific implementation process may refer to the related description of the foregoing method embodiments, and details are not described herein again. . The video encoding device 1000 can be, for example, any device that needs to collect, store, or transmit audio signals, such as a mobile phone, a tablet computer, a personal computer, a notebook computer, and the like.

It can be seen that, after acquiring the subdivided area of the video frame, the first coding block of the video frame includes the pixel points in the subdivided area, and the first coding block is determined. Perform sub-block splitting. The subdivided area of the video frame includes at least one of an ROI of the video frame and an edge area of the video frame. That is to say, the relative positional relationship between the first coding block and the subdivision area of the video frame can determine to some extent whether the first coding block performs sub-block division under the current segmentation depth. Therefore, the above technical solution of the present invention is advantageous for reducing the determination of the current coding block, compared to the conventional mechanism of determining whether to continue sub-block division for the current coding block by calculating and comparing the current coding block before and before the division. Whether the computational complexity of sub-block segmentation is performed under the current segmentation depth, thereby facilitating the reduction of the occupation of computing resources.

The embodiment of the present invention further provides a computer storage medium, wherein the computer storage medium may store a program, where the program includes some or all of the block segmentation processing method in any one of the video encodings described in the foregoing method embodiments. step.

The embodiment of the present invention further provides a computer storage medium, wherein the computer storage medium can store a program, and the program includes some or all of the steps of any one of the video encoding methods described in the foregoing method embodiments.

In the several embodiments provided herein, it should be understood that the disclosed apparatus may be implemented in other ways. For example, the device embodiments described above are merely illustrative. For example, the division of the unit is only a logical function division. In actual implementation, there may be another division manner, for example, multiple units or components may be combined or may be Integrate into another system, or some features can be ignored or not executed. In addition, the mutual coupling or direct coupling or communication connection shown or discussed may be an indirect coupling or communication connection through some interface, device or unit, and may be electrical or otherwise.

The units described as separate components may or may not be physically separated, and the components displayed as units may or may not be physical units, that is, may be located in one place, or may be distributed to multiple network units. Some or all of the units may be selected according to actual needs to achieve the purpose of the solution of the embodiment.

In addition, each functional unit in each embodiment of the present invention may be integrated into one processing unit, or each unit may exist physically separately, or two or more units may be integrated into one unit. The above integrated unit can be implemented in the form of hardware or in the form of a software functional unit.

The integrated unit, if implemented in the form of a software functional unit and sold or used as a standalone product, may be stored in a computer readable storage medium. Based on such understanding, the technical solution of the present invention, which is essential or contributes to the prior art, or all or part of the technical solution, may be embodied in the form of a software product stored in a storage medium. The instructions include a plurality of instructions for causing a computer device (which may be a personal computer, server or network device, etc., and in particular a processor in a computer device) to perform all or part of the steps of the methods of the various embodiments of the present invention. Wherein, the foregoing storage medium may include: a U disk, a mobile hard disk, a magnetic disk, an optical disk, a read only memory (English: read-only memory, abbreviation: ROM) or a random access memory (English: random access memory, abbreviation: RAM) and other media that can store program code.

The above embodiments are only used to illustrate the technical solutions of the present invention, and are not intended to be limiting; although the present invention has been described in detail with reference to the foregoing embodiments, those skilled in the art will understand that The technical solutions described in the embodiments are modified, or the equivalents of the technical features are replaced by the equivalents of the technical solutions of the embodiments of the present invention.

Claims

A block segmentation processing method in video coding, comprising:

Obtaining a subdivided region of the video frame, wherein the subdivided region of the video frame includes at least one of a region of interest of the video frame and an edge region of the video frame;

Determining that the first coded block of the video frame includes pixel points in the subdivided region;

The first coding block is subjected to sub-block division.
The method according to claim 1, wherein before the sub-block segmentation is performed on the first coding block, the method further comprises: determining that a current segmentation depth of the first coding block is smaller than a first segmentation a depth threshold, wherein the first partition depth threshold is less than or equal to a maximum allowed partition depth of the video frame.
The method according to claim 2, wherein the subdivided area of the video frame comprises a region of interest of the video frame and an edge region of the video frame; wherein the determining the video frame is A code block includes pixel points in the subdivided region, including: determining that a first coded block of the video frame includes pixel points of an overlap region of the region of interest and the edge region.
The method according to claim 2, wherein the subdivided area of the video frame comprises a region of interest of the video frame and an edge region of the video frame; wherein the determining the video frame is A code block includes pixel points in the subdivided region, including: determining that a first coded block of the video frame includes a pixel point in the region of interest and does not include a pixel point in an edge region of the video frame Or determining that the first coded block of the video frame includes a pixel point in an edge region of the video frame and does not include a pixel point in a region of interest of the video frame;

Before the first coding block is divided into sub-blocks,

The method further includes determining that a rate distortion penalty of the first coding block is greater than a rate distortion penalty of the first coding block for sub-block division.
The method of claim 4 wherein:

The method further includes: determining that a current segmentation depth of the first coding block is smaller than a second segmentation depth threshold, where the second segmentation depth threshold is smaller than the first A split depth threshold.
The method according to any one of claims 1 to 5, characterized in that the video frame is fine a subregion including a region of interest of the video frame and an edge region of the video frame;

The method further includes:

Determining that a second coded block of the video frame includes a pixel in the region of interest and does not include a pixel in an edge region of the video frame, or determining that a second encoded block of the video frame includes the video a pixel point in an edge region of the frame and not including a pixel point in the region of interest of the video frame; determining that a rate distortion cost of the second coded block is less than or equal to the second code block for sub-block segmentation Rate distortion cost; determining that the second coded block does not perform sub-block segmentation.
A method according to any one of claims 1 to 6, wherein

The method further includes:

Determining that the third coding block of the video frame does not include pixel points in the subdivided region;

It is determined that the third coding block does not perform sub-block division.
The method according to claim 7, wherein before the determining that the third coding block does not perform sub-block segmentation, the method further comprises: determining that a current segmentation depth of the third coding block is equal to a third segmentation a depth threshold, wherein the third partition depth threshold is less than a maximum allowed partition depth of the video frame.
Method according to claim 7 or 8, characterized in that

Before the determining that the third coding block does not perform sub-block division, the method further includes: determining that a rate distortion cost of the third coding block is less than or equal to a rate distortion of the third coding block after sub-block division cost.
A method according to any one of claims 1 to 9, wherein

The method further includes: determining that a fourth coding block of the video frame does not include a pixel point in the subdivided region; determining that a rate distortion cost of the fourth coding block is greater than the fourth coding block for sub-block division The subsequent rate distortion penalty; the fourth coding block is subjected to sub-block division.
The method according to claim 10, wherein before the sub-block segmentation is performed on the fourth coding block, the method further comprises: determining that a current segmentation depth of the fourth coding block is smaller than a third segmentation depth a threshold; wherein the third partition depth threshold is less than a maximum allowed partition depth of the video frame.
A block segmentation processing device for video coding, comprising:

An acquiring unit, configured to acquire a subdivided area of the video frame, where the subdivided area of the video frame includes at least one of a region of interest of the video frame and an edge region of the video frame;

a determining unit, configured to determine that the first coding block of the video frame includes pixel points in the subdivided region;

And a dividing unit, configured to perform sub-block segmentation on the first coding block.
The device according to claim 12, characterized in that

The determining unit is further configured to: before the sub-block division of the first coding block, determine that a current segmentation depth of the first coding block is smaller than a first segmentation depth threshold, where the first segmentation depth The threshold is less than or equal to the maximum allowed segmentation depth of the video frame.
The apparatus according to claim 13, wherein the subdivided area of the video frame comprises a region of interest of the video frame and an edge region of the video frame; wherein, in the determining the video frame The first coding block includes an aspect of the pixel in the subdivision region, and the determining unit is specifically configured to: determine that the first coding block of the video frame includes an overlap region of the region of interest and the edge region pixel.
The apparatus according to claim 13, wherein the subdivided area of the video frame comprises a region of interest of the video frame and an edge region of the video frame; wherein, in the determining the video frame The first coding block includes an aspect of the pixel in the subdivided region, and the determining unit is specifically configured to: determine that the first coding block of the video frame includes a pixel in the region of interest and does not include the a pixel in an edge region of the video frame, or determining that the first encoded block of the video frame includes pixel points in an edge region of the video frame and does not include pixel points in the region of interest of the video frame;

The determining unit is further configured to: before performing the sub-block partitioning on the first coding block, determining that a rate distortion cost of the first coding block is greater than a rate distortion cost of the first coding block after performing sub-block division .
The device of claim 15 wherein:

The determining unit is further configured to: before the first coding block is divided into sub-blocks, determine that a current segmentation depth of the first coding block is smaller than a second segmentation depth threshold, where the second segmentation depth threshold is smaller than the first A split depth threshold.
The apparatus according to any one of claims 12 to 16, wherein the subdivided area of the video frame comprises a region of interest of the video frame and an edge region of the video frame;

The determining unit is further configured to: determine that the second coding block of the video frame includes a pixel point in the region of interest and does not include a pixel point in an edge region of the video frame, or determine the video frame The second coding block includes pixel points in an edge region of the video frame and does not include pixel points in the region of interest of the video frame; determining that the rate distortion cost of the second coding block is less than or equal to the second The coding block performs rate distortion cost after sub-block division; determining that the second coding block does not perform sub-block division.
The apparatus according to any one of claims 12 to 17, wherein the determining unit is further configured to: determine that a third coding block of the video frame does not include a pixel in the subdivision area; The third coding block does not perform sub-block division.
The apparatus according to claim 18, wherein the determining unit is further configured to: before determining that the third coding block does not perform sub-block segmentation, determining that a current segmentation depth of the third coding block is equal to a third segmentation a depth threshold, wherein the third partition depth threshold is less than a maximum allowed partition depth of the video frame.
Device according to claim 18 or 19, characterized in that

The determining unit is further configured to: before determining that the third coding block does not perform sub-block division, determining that a rate distortion cost of the third coding block is less than or equal to a rate of the third coding block after performing sub-block division Distortion cost.
Apparatus according to any one of claims 12 to 20, wherein

The determining unit is further configured to: determine that a fourth coding block of the video frame does not include a pixel point in the subdivided region; and determine that a rate distortion cost of the fourth coding block is greater than a fourth coding block. Rate distortion cost after block segmentation;

The dividing unit is further configured to perform sub-block segmentation on the fourth coding block.
The device according to claim 21, wherein

The determining unit is further configured to: before the sub-block division of the fourth coding block, determine that a current segmentation depth of the fourth coding block is smaller than a third segmentation depth threshold; wherein the third segmentation depth The threshold is less than the maximum allowed segmentation depth of the video frame.
A video encoding apparatus, comprising:

Processor and memory,

The processor is configured to acquire a subdivided area of a video frame by running an instruction or code stored in the memory, where a subdivided area of the video frame includes a region of interest of the video frame and the video At least one of edge regions of the frame; determining that the first coded block of the video frame includes pixel points in the subdivided region; and sub-blocking the first coded block.
The video encoding apparatus according to claim 23, wherein the processor is further configured to: before the sub-block division of the first coding block, determine that a current segmentation depth of the first coding block is smaller than a first partition depth threshold, wherein the first partition depth threshold is less than or equal to a maximum allowed partition depth of the video frame.
The video encoding apparatus according to claim 24, wherein the subdivided area of the video frame comprises a region of interest of the video frame and an edge region of the video frame; wherein the processor is configured to determine The first coded block of the video frame includes pixel points of the region of interest and the overlap region of the edge region.
The video encoding apparatus according to claim 24, wherein the subdivided area of the video frame comprises a region of interest of the video frame and an edge region of the video frame; wherein the processor is configured to determine a first coding block of the video frame includes a pixel in the region of interest and does not include a pixel in an edge region of the video frame, or determines that a first coding block of the video frame includes the video frame a pixel in the edge region and does not include a pixel in the region of interest of the video frame;

The processor is further configured to: before performing the sub-block division on the first coding block, determining that a rate distortion cost of the first coding block is greater than a rate distortion of the first coding block after performing sub-block division cost.
The video encoding apparatus according to claim 26, wherein

The processor is further configured to: before performing the sub-block division on the first coding block, determining that a current segmentation depth of the first coding block is smaller than a second segmentation depth threshold, where the second segmentation depth threshold is smaller than the The first split depth threshold.
A video encoding apparatus according to any one of claims 23 to 27, wherein said said A subdivided region of the video frame includes a region of interest of the video frame and an edge region of the video frame;

The processor is further configured to: determine that a second coding block of the video frame includes a pixel in the region of interest and does not include a pixel in an edge region of the video frame, or determine the video The second coded block of the frame includes pixel points in an edge region of the video frame and does not include pixel points in the region of interest of the video frame; determining that the rate distortion cost of the second coded block is less than or equal to The second coding block performs a rate-distortion penalty after the sub-block division; determining that the second coding block does not perform sub-block division.
A video encoding apparatus according to any one of claims 23 to 28, characterized in that

The processor is further configured to: determine that the third coding block of the video frame does not include a pixel in the subdivided region; and determine that the third coding block does not perform sub-block segmentation.
The video encoding apparatus according to claim 29, wherein the processor is further configured to determine a current segmentation depth of the third encoding block before determining that the third encoding block does not perform sub-block segmentation. Is equal to a third partition depth threshold, wherein the third partition depth threshold is smaller than a maximum allowed partition depth of the video frame.
A video encoding apparatus according to claim 29 or 30, wherein

The processor is further configured to: before determining that the third coding block does not perform sub-block division, determining that a rate distortion cost of the third coding block is less than or equal to the third coding block after sub-block division Rate distortion cost.
A video encoding apparatus according to any one of claims 23 to 31, wherein

The processor is further configured to: determine that a fourth coding block of the video frame does not include a pixel point in the subdivided region; determine that a rate distortion cost of the fourth coding block is greater than a fourth block to perform a subblock a rate-distortion penalty after segmentation; sub-block segmentation is performed on the fourth coded block.
The video encoding apparatus according to claim 32, wherein the processor is further configured to: before the sub-block division of the fourth coding block, determine that a current segmentation depth of the fourth coding block is smaller than a third partition depth threshold; wherein the third partition depth threshold is smaller than a maximum allowed partition depth of the video frame.