WO2016033725A1 - Procédé de traitement de mode de segmentation de blocs lors d'un codage vidéo et appareil associé - Google Patents

Procédé de traitement de mode de segmentation de blocs lors d'un codage vidéo et appareil associé Download PDF

Info

Publication number
WO2016033725A1
WO2016033725A1 PCT/CN2014/085681 CN2014085681W WO2016033725A1 WO 2016033725 A1 WO2016033725 A1 WO 2016033725A1 CN 2014085681 W CN2014085681 W CN 2014085681W WO 2016033725 A1 WO2016033725 A1 WO 2016033725A1
Authority
WO
WIPO (PCT)
Prior art keywords
video frame
block
coding block
region
segmentation
Prior art date
Application number
PCT/CN2014/085681
Other languages
English (en)
Chinese (zh)
Inventor
杨晓峰
张园园
石腾
Original Assignee
华为技术有限公司
Priority date (The priority date is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the date listed.)
Filing date
Publication date
Application filed by 华为技术有限公司 filed Critical 华为技术有限公司
Priority to PCT/CN2014/085681 priority Critical patent/WO2016033725A1/fr
Priority to CN201480080086.1A priority patent/CN106664404B/zh
Publication of WO2016033725A1 publication Critical patent/WO2016033725A1/fr

Links

Images

Classifications

    • HELECTRICITY
    • H04ELECTRIC COMMUNICATION TECHNIQUE
    • H04NPICTORIAL COMMUNICATION, e.g. TELEVISION
    • H04N19/00Methods or arrangements for coding, decoding, compressing or decompressing digital video signals
    • H04N19/10Methods or arrangements for coding, decoding, compressing or decompressing digital video signals using adaptive coding
    • H04N19/102Methods or arrangements for coding, decoding, compressing or decompressing digital video signals using adaptive coding characterised by the element, parameter or selection affected or controlled by the adaptive coding
    • H04N19/119Adaptive subdivision aspects, e.g. subdivision of a picture into rectangular or non-rectangular coding blocks
    • HELECTRICITY
    • H04ELECTRIC COMMUNICATION TECHNIQUE
    • H04NPICTORIAL COMMUNICATION, e.g. TELEVISION
    • H04N19/00Methods or arrangements for coding, decoding, compressing or decompressing digital video signals
    • H04N19/10Methods or arrangements for coding, decoding, compressing or decompressing digital video signals using adaptive coding
    • H04N19/134Methods or arrangements for coding, decoding, compressing or decompressing digital video signals using adaptive coding characterised by the element, parameter or criterion affecting or controlling the adaptive coding
    • H04N19/167Position within a video image, e.g. region of interest [ROI]
    • HELECTRICITY
    • H04ELECTRIC COMMUNICATION TECHNIQUE
    • H04NPICTORIAL COMMUNICATION, e.g. TELEVISION
    • H04N19/00Methods or arrangements for coding, decoding, compressing or decompressing digital video signals
    • H04N19/10Methods or arrangements for coding, decoding, compressing or decompressing digital video signals using adaptive coding
    • H04N19/169Methods or arrangements for coding, decoding, compressing or decompressing digital video signals using adaptive coding characterised by the coding unit, i.e. the structural portion or semantic portion of the video signal being the object or the subject of the adaptive coding
    • H04N19/17Methods or arrangements for coding, decoding, compressing or decompressing digital video signals using adaptive coding characterised by the coding unit, i.e. the structural portion or semantic portion of the video signal being the object or the subject of the adaptive coding the unit being an image region, e.g. an object
    • H04N19/176Methods or arrangements for coding, decoding, compressing or decompressing digital video signals using adaptive coding characterised by the coding unit, i.e. the structural portion or semantic portion of the video signal being the object or the subject of the adaptive coding the unit being an image region, e.g. an object the region being a block, e.g. a macroblock
    • HELECTRICITY
    • H04ELECTRIC COMMUNICATION TECHNIQUE
    • H04NPICTORIAL COMMUNICATION, e.g. TELEVISION
    • H04N19/00Methods or arrangements for coding, decoding, compressing or decompressing digital video signals
    • H04N19/90Methods or arrangements for coding, decoding, compressing or decompressing digital video signals using coding techniques not provided for in groups H04N19/10-H04N19/85, e.g. fractals
    • H04N19/96Tree coding, e.g. quad-tree coding

Definitions

  • the present invention relates to the field of video codec technology, and in particular, to a block segmentation processing method and related device in video coding.
  • the goal of video coding technology is to achieve better picture quality at the same compression ratio, or to achieve greater compression ratio under the same picture quality. It can be seen that the compression ratio and image quality are important indicators that need to be weighed by a pair of coding techniques. Under certain conditions of coding technology, the improvement of one index usually leads to the decrease of another index.
  • the evaluation of image quality is generally divided into subjective evaluation criteria and objective evaluation criteria.
  • the objective parameter based on the peak signal-to-noise ratio is used as the evaluation standard for the picture quality.
  • This is an objective evaluation standard.
  • the quality of the image quality needs to be judged by the human eye.
  • the objective evaluation standard is only used to simulate the human eye's perception of the image quality to a certain extent.
  • the objective evaluation standard has certain reference significance, the objective image quality and the subjective image quality of the human eye. It is not always consistent.
  • the peak signal to noise ratio (PSNR) of the left and right pictures shown in Figure 1 is the same, but the subjective quality of the right picture is obviously better than the left picture. The reason is that the right picture biases the bit resources.
  • PSNR peak signal to noise ratio
  • the face part is a sensitive area of the human eye, which greatly enhances the subjective feeling of the human eye on the picture quality. This is also the origin of video coding technology based on visual perception.
  • HVS human visual system
  • any coding block (English: coding unit, abbreviation: CU) for any level (ie, any kind of depth) in a video frame is calculated and compared.
  • the CU performs rate distortion before and after sub-block partitioning to determine whether to continue sub-block partitioning for the CU, which results in a large amount of computing resources.
  • Embodiments of the present invention provide a block segmentation processing method and related apparatus in video coding to reduce computational complexity of determining whether a coded block performs sub-block segmentation.
  • a first aspect of the embodiments of the present invention provides a block segmentation processing method in video coding, including:
  • the subdivided region of the video frame includes at least one of a region of interest of the video frame and an edge region of the video frame;
  • the first coding block is subjected to sub-block division.
  • the method before the performing the sub-block partitioning on the first coding block, the method further includes: determining a current of the first coding block The segmentation depth is less than a first segmentation depth threshold, wherein the first segmentation depth threshold is less than or equal to a maximum allowable segmentation depth of the video frame.
  • the subdivision region of the video frame includes a region of interest of the video frame and the video frame An edge region; wherein the determining that the first coding block of the video frame includes pixel points in the subdivided region comprises: determining that the first coding block of the video frame includes the region of interest and the edge The pixel of the overlapping area of the area.
  • the subdivision region of the video frame includes a region of interest of the video frame and the video frame An edge region
  • the determining that the first coding block of the video frame includes a pixel in the subdivided region comprises: determining that a first coding block of the video frame includes a pixel in the region of interest And not including a pixel in an edge region of the video frame, or determining a first coding block of the video frame Pixel points in an edge region of the video frame and pixels in the region of interest of the video frame are not included;
  • the method further includes determining that a rate distortion penalty of the first coding block is greater than a rate distortion penalty of the first coding block for sub-block division.
  • a second aspect of the embodiments of the present invention provides a block segmentation processing device in video coding, including:
  • An acquiring unit configured to acquire a subdivided area of the video frame, where the subdivided area of the video frame includes at least one of a region of interest of the video frame and an edge region of the video frame;
  • a determining unit configured to determine that the first coding block of the video frame includes pixel points in the subdivided region
  • a dividing unit configured to perform sub-block segmentation on the first coding block.
  • the determining unit is further configured to: before the sub-block division of the first coding block, determine that a current segmentation depth of the first coding block is smaller than a first segmentation depth threshold, where the first segmentation depth The threshold is less than or equal to the maximum allowed segmentation depth of the video frame.
  • the subdivision region of the video frame includes a region of interest of the video frame and the video frame An edge region; wherein, in the determining that the first coding block of the video frame includes a pixel point in the subdivided region, the determining unit is specifically configured to determine that the first coding block of the video frame includes a pixel point of the region of interest and the overlapping region of the edge region.
  • the subdivided area of the video frame includes a region of interest of the video frame and the video frame An edge region; wherein, in the determining that the first coding block of the video frame includes a pixel point in the subdivided region, the determining unit is specifically configured to determine that the first coding block of the video frame includes a pixel in the region of interest and not including a pixel in an edge region of the video frame, or determining that a first encoded block of the video frame includes a pixel in an edge region of the video frame and does not include a pixel in a region of interest of the video frame;
  • the determining unit is further configured to: determine, before the sub-block is divided into the first coding block, determine The rate distortion penalty of the first coding block is greater than the rate distortion cost of the first coding block after sub-block division.
  • a third aspect of the embodiments of the present invention provides a video encoding apparatus, including:
  • the processor is configured to acquire a subdivided area of a video frame by running an instruction or code stored in the memory, where a subdivided area of the video frame includes a region of interest of the video frame and the video At least one of edge regions of the frame; determining that the first coded block of the video frame includes pixel points in the subdivided region; and sub-blocking the first coded block.
  • the processor is further configured to determine, by the first coding block, the first coding block before performing the sub-block division The current segmentation depth is less than a first segmentation depth threshold, wherein the first segmentation depth threshold is less than or equal to a maximum allowable segmentation depth of the video frame.
  • the subdivision region of the video frame includes a region of interest of the video frame and the video frame An edge region; wherein the processor is configured to determine that a first coded block of the video frame includes pixel points of the region of interest and an overlap region of the edge region.
  • the subdivision region of the video frame includes a region of interest of the video frame and the video frame An edge region
  • the processor is configured to determine that a first coding block of the video frame includes a pixel point in the region of interest and does not include a pixel point in an edge region of the video frame, or determine the A first coded block of a video frame includes pixel points in an edge region of the video frame and does not include pixel points in a region of interest of the video frame;
  • the processor is further configured to: before performing the sub-block division on the first coding block, determining that a rate distortion cost of the first coding block is greater than a rate distortion of the first coding block after performing sub-block division cost.
  • the first coding block is subjected to sub-block division, wherein the subdivision area packet of the video frame Enclosing at least one of an ROI of the video frame and an edge region of the video frame, that is, a relative positional relationship between the first coding block of the video frame and the subdivision region may determine the number to some extent Whether a coded block performs sub-block partitioning at the current segmentation depth, which is a conventional mechanism for determining whether to continue sub-block partitioning of the current coded block by completely calculating and comparing the rate distortions before and after the current coded block partition.
  • the foregoing technical solution of the present invention is advantageous for reducing the computational complexity of determining whether the current coding block performs sub-block segmentation under the current segmentation depth, thereby
  • FIG. 1 is a schematic flowchart of a block segmentation processing method in video coding according to an embodiment of the present invention
  • FIG. 1-b is a schematic diagram of a Sauber operator according to an embodiment of the present invention.
  • FIG. 1-c is a schematic diagram of an etching template according to an embodiment of the present invention.
  • FIG. 1-d is a schematic diagram of comparison of an etching process according to an embodiment of the present invention.
  • FIG. 2 to FIG. 7 are schematic flowcharts of a method for processing a block division in another video coding according to an embodiment of the present invention
  • FIG. 8-a is a schematic flowchart of a video encoding method according to an embodiment of the present invention.
  • FIG. 8-b is a schematic diagram of generating a saliency map according to an embodiment of the present invention.
  • FIG. 9 is a schematic diagram of a block segmentation processing apparatus in video coding according to an embodiment of the present disclosure.
  • FIG. 10 is a schematic diagram of a video encoding apparatus according to an embodiment of the present invention.
  • Embodiments of the present invention provide a block segmentation processing method and related apparatus in video coding to reduce computational complexity of determining whether a coded block performs sub-block segmentation.
  • the method for the block division processing in the video coding provided by the embodiment of the present invention is first introduced.
  • the execution body of the block division processing method in the video coding provided by the embodiment of the present invention may be a video coding device, and the video coding device may be any output, A device that stores video, such as a mobile phone, laptop, tablet, or personal computer.
  • An embodiment of the block segmentation processing method in the video coding of the present invention wherein the block segmentation processing method in the video coding may include: acquiring a subdivision region of the video frame, where the subdivision region of the video frame includes Determining at least one of a region of interest (ROI) of the video frame and an edge region of the video frame; determining that the first coding block of the video frame includes pixels in the subdivided region Pointing; sub-blocking the first coded block.
  • ROI region of interest
  • FIG. 1-a is a schematic flowchart diagram of a block segmentation processing method in video coding according to an embodiment of the present invention.
  • a block segmentation processing method in video coding provided by an embodiment of the present invention may include:
  • the subdivided area of the video frame includes at least one of an ROI of the video frame and an edge area of the video frame.
  • the specific manner of obtaining the subdivision area of the video frame may be various.
  • a region matching algorithm may be used to perform matching processing on a video frame to obtain a subdivided region of the video frame.
  • a matching processing example of a video frame based on a region matching algorithm For example, it can be identified which areas in the video frame are subdivision areas and which areas are not subdivision areas.
  • the subdivided area of the video frame may also be obtained according to a configuration instruction in the configuration file.
  • the configuration instructions in the configuration file may specify which areas are subdivision areas and which areas are not subdivision areas.
  • the subdivision area of the video frame can also be obtained by other means.
  • the first coding block of the video frame is any one of the video frames.
  • the current segmentation depth of the first coding block may be any segmentation depth that is less than the maximum allowed segmentation depth of the video frame, that is, the size of the first coded block may be any size greater than the allowed minimum coding block size.
  • the size of the first code block may be 64*64, 32*32, 16*16, or other sizes allowed.
  • the first coding block may be partitioned into 4 sub-coded blocks or other numbers of sub-coded blocks.
  • the ROI of the video frame generally refers to a relatively sensitive area of the HVS in the video frame or an area that the HVS mainly focuses on.
  • the ROI of the video frame may be determined based on the region matching algorithm, and of course the ROI of the video frame may be determined according to a configuration instruction in the configuration file.
  • the ROI of different video frames may not be the same.
  • HVS usually focuses on the face in the video frame, the central area of the video frame, etc., so the area of the face in the video video frame, the central area of the video frame, etc. Look at the ROI of the video frame.
  • HVS usually focuses on moving objects in video frames, so the area of the moving objects in these video frames can be regarded as the ROI of the video frame.
  • HVS may be more concerned with the player area, so the player area in such video frames can be seen as the ROI of the video frame.
  • a certain area in the video frame may also be designated as the ROI of the video frame, even if the designated area may not include a face and/or a moving object, for example, in some experimental scenarios or suspect monitoring scenes, it is necessary to focus on the video.
  • Some specific areas in the frame so these areas may be set to ROI.
  • the ROI of the video frame is obtained by other means.
  • the region of interest of a video frame may be a continuous pixel area, or may be packaged. A plurality of non-contiguous pixel sub-regions are included.
  • the edge region of the video frame refers to an area in the video frame that includes edge pixels.
  • one of the methods for detecting edge pixel points in a video frame is to determine whether there is a sharp brightness change around a certain pixel point in the video frame. Wherein, if there is a sharp brightness change around a pixel in the video frame, the pixel is an edge pixel; otherwise, the pixel is not an edge pixel. Wherein, the entire video frame corresponds to a binary map of the same size.
  • detection algorithms for edge pixels The main process includes: determining the edge detection operator; using the edge detection operator to filter the video frame, and post-processing the video frame after filtering.
  • the edge detection operator is a set of rules for calculating the current pixel point and the surrounding pixel points of the current pixel point. For example, the Sauber operator shown in Figure 1-b, the filtered pixel value of the current pixel is multiplied by the three coefficients of the first row and the three pixels below are multiplied by the third. The average of the sum of the three coefficients of the row; wherein, if the filtered value exceeds the threshold t0, the current pixel point is determined to be an edge pixel point, otherwise the current pixel point is determined not to be an edge pixel point.
  • edge pixels detected in the video frame may be etched to remove isolated, sparse edge pixels in the video frame.
  • Such isolated, sparse edge pixels present in video frames are generally considered to be due to video frame noise.
  • the etching treatment can also be performed by etching the template or by filtering.
  • the etching treatment shown in FIG. 1-d can be completed based on the 7*7 etching template (or other etching template) shown in FIG. 1-c.
  • the left side of Figure 1-d is the state before the corrosion treatment
  • the right side is the state after the corrosion treatment.
  • the filtering process in the etching process is also multiplied by the surrounding pixel points of the current pixel point and the corresponding positions in the template.
  • the filtered value of the pixel point is taken as 1, indicating that the current pixel point is an edge pixel point; otherwise, the pixel point The filtered value takes 0, that is, it indicates that the current pixel point is not an edge pixel point.
  • the first coding is performed.
  • the block performs sub-block division.
  • the subdivided area of the video frame includes at least one of an ROI of the video frame and an edge area of the video frame. That is to say, the relative positional relationship between the first coding block and the subdivision area of the video frame can determine to some extent whether the first coding block performs sub-block division under the current segmentation depth.
  • the above-mentioned technical solution of the present invention is advantageous for reducing the computational complexity of determining whether the current coded block performs sub-block segmentation at the current segmentation depth, thereby facilitating the conventional mechanism of determining whether to perform sub-block partitioning on the current coded block. Reduce the occupation of computing resources.
  • the method may further include: determining that a current segmentation depth of the first coding block is smaller than a first partition depth threshold, wherein the first partition depth threshold is less than or equal to a maximum allowed partition depth of the video frame.
  • the first segmentation depth threshold can be obtained from the configuration file.
  • the subdivided area of the video frame includes a region of interest of the video frame and an edge region of the video frame.
  • the determining that the first coding block of the video frame includes the pixel in the subdivided region may include: determining that the first coding block of the video frame includes the region of interest and the edge region The pixels of the overlapping area. That is, when the first coding block of the video frame includes pixel points of the overlapping area of the region of interest and the edge region, the first coding block may be sub-block divided, in this case It is not necessary to refer to the magnitude relationship of the rate distortion before and after the first coding block division to determine whether to continue the sub-block division for the first coding block.
  • the subdivided area of the video frame includes a region of interest of the video frame and an edge region of the video frame
  • the first coding block including the pixel in the subdivided region may include: determining that the first coding block of the video frame includes a pixel in the region of interest and does not include an edge region of the video frame a pixel, or determining that the first encoded block of the video frame includes pixel points in an edge region of the video frame and does not include pixel points in the region of interest of the video frame.
  • the method may further include: determining that a rate distortion cost of the first coding block is greater than or equal to a size of the first coding block after sub-block division.
  • Rate distortion cost when the first coding block of the video frame includes pixel points of the region of interest and the non-overlapping region of the edge region, but the first coding block does not include the region of interest and the edge region
  • the pixel points of the overlapping area in this case, may further refer to the magnitude relationship of the rate distortion before and after the first coding block sub-block division to determine whether to continue the sub-block division for the first coding block.
  • the method may further include: determining that the current segmentation depth of the first coded block is less than a second segmentation depth threshold, wherein the second segmentation depth threshold is less than or equal to the first segmentation depth threshold.
  • the first coding block of the video frame includes pixel points of the region of interest and the non-overlapping region of the edge region, but the first coding block does not include the region of interest and a pixel point of the overlapping area of the edge region, in this case, further determining whether the first coding block continues to perform sub-block division by referring to a size relationship between the current segmentation depth of the first coding block and the second segmentation depth threshold .
  • the segmentation depth of the first coded block may be limited to within the second segmentation depth threshold.
  • the second segmentation depth threshold can be obtained from the configuration file.
  • the second segmentation depth threshold may be less than or equal to the first segmentation depth threshold, and the size of the second segmentation depth threshold may be set according to specific needs.
  • the subdivided area of the video frame includes a region of interest of the video frame and an edge region of the video frame.
  • the method may further include: determining that the second coded block of the video frame includes a pixel point in the region of interest and does not include a pixel point in an edge region of the video frame, or determining a number of the video frame
  • the second coding block includes pixel points in an edge region of the video frame and does not include pixel points in a region of interest of the video frame; determining that a rate distortion cost of the second coding block is less than or equal to the second encoding
  • the block performs rate distortion cost after sub-block division; and determines that the second coding block does not perform sub-block division.
  • the second coding block of the video frame includes pixel points of the region of interest and the non-overlapping region of the edge region, but the second coding block does not include the region of interest and the edge region
  • the pixel points of the overlapping area may further refer to the magnitude relationship of the rate distortion before and after the second coding block sub-block division to determine whether to continue the sub-block division for the second coding block.
  • the second coding block of the video frame may refer to any one of the video frames.
  • the current segmentation depth of the second coding block may be an arbitrary segmentation depth smaller than the maximum allowed segmentation depth of the video frame, that is, the size of the second coding block may be any size larger than the allowed minimum coding block size.
  • the size of the second code block may be 64*64, 32*32, 16*16, or other sizes allowed.
  • the method may further include: determining that a third coding block of the video frame does not include a pixel in the subdivided region; determining the third encoding Piece Sub-block splitting is not performed. That is, when the third coding block of the video frame does not include the pixel in the subdivided region, it may be considered that the third coding block does not perform sub-block division. In this case, the third reference may be omitted. A condition such as the magnitude relationship of the rate distortion before and after the block sub-block division is determined to determine whether to continue sub-block division for the third coded block.
  • the method before the determining that the third coding block does not perform sub-block division, the method further includes: determining that a current segmentation depth of the third coding block is greater than or Is equal to a third partition depth threshold, wherein the third partition depth threshold is less than or equal to a maximum allowed partition depth of the coding block. That is, when the third coding block does not include the pixel points in the subdivision area, the division depth of the third coding block may be limited to the third division depth threshold.
  • a third split depth threshold can be obtained from the configuration file.
  • the third segmentation depth threshold may be less than or equal to the first segmentation depth threshold.
  • the third segmentation depth threshold may be, for example, less than or equal to the second segmentation depth threshold.
  • the size of the third segmentation depth threshold can be set according to specific needs.
  • the method may further include: determining a rate distortion cost of the third coding block. a less than or equal to a rate distortion cost of the third coding block after the sub-block division; determining that the current segmentation depth of the third coding block is smaller than a third segmentation depth threshold, wherein the third segmentation depth threshold is less than or equal to the coding
  • the maximum allowable split depth of the block may be determined by referring to conditions such as the relationship between the third coding block before and after the division. The block continues with sub-block partitioning.
  • the third coding block of the video frame may refer to any one of the video frames.
  • the current segmentation depth of the third coding block may be an arbitrary segmentation depth smaller than the maximum allowed segmentation depth of the video frame, that is, the size of the third coding block may be any size larger than the allowed minimum coding block size.
  • the size of the third code block may be 64*64, 32*32, 16*16, or other sizes allowed.
  • the method may further include: determining that a fourth coding block of the video frame does not include a pixel in the subdivided region; determining the fourth coding block The rate distortion penalty is greater than the rate distortion cost of the fourth code block after sub-block segmentation.
  • the third segmentation depth threshold is less than or equal to a maximum allowable segmentation depth of the video frame; determining the fourth series The code block performs sub-block division. Further, before determining that the fourth coding block performs sub-block division, the current segmentation depth of the fourth coding block may be determined to be smaller than a third segmentation depth threshold.
  • the third segmentation depth threshold can be obtained from the configuration file.
  • the third segmentation depth threshold may be less than or equal to the first segmentation depth threshold.
  • the third segmentation depth threshold may be, for example, less than or equal to the second segmentation depth threshold.
  • the size of the third segmentation depth threshold can be set according to specific needs.
  • the fourth coding block of the video frame may refer to any one of the video frames.
  • the current segmentation depth of the fourth coding block may be an arbitrary segmentation depth smaller than the maximum allowed segmentation depth of the video frame, that is, the size of the fourth coding block may be any size larger than the allowed minimum coding block size.
  • the size of the fourth coding block may be 64*64, 32*32, 16*16, or other sizes allowed.
  • the maximum allowed segmentation depth of the video frame refers to the segmentation depth corresponding to the minimum allowable coding block. That is to say, the maximum allowable segmentation depth of a video frame refers to the number of divisions required to divide the allowable maximum size coded block into the allowable minimum size coded block.
  • the current segmentation depth refers to the number of segments required to divide the maximum allowable coding block into the current coded block.
  • the maximum size allowed by the coding block is 64*64, and the minimum size allowed for the coding block is 8*8, if the current coding block size is 64*64, the current coding depth of the current coding block is 0; if the current coding block is The size is 32*32, the current segmentation depth of the current coding block is 1; if the current coding block size is 16*16, the current segmentation depth of the current coding block is 2; if the current coding block size is 8*8, then The current segmentation depth of the current coded block is 3.
  • the maximum allowable segmentation depth of the video frame in this scenario is 3 because the coded block of size 64*64 is divided into 8 ⁇ 8 coded blocks.
  • the current coding block size is 64*64, and the minimum size allowed for the coding block is 2*2, if the current coding block size is 64*64, the current coding depth of the current coding block is 0; if the current coding block is If the size of the current coding block is 8*8, the current coding block has a current partition depth of 1; if the current coding block has a size of 16*16, the current coding block has a current division depth of 2; The current segmentation depth of the current coding block is 3; similarly, if the current coding block size is 4*4, the current coding block has a current segmentation depth of 4; if the current coding block has a size of 2*2, the current coding The current split depth of the block is 5.
  • the coded block of size 64*64 is divided into 2*2 sizes.
  • the code block needs to be split 5 times. Therefore, the maximum allowable segmentation depth of the video frame in this scenario is 5. It can be understood that the maximum size and the minimum size allowed by the coding block may also be other values. In the corresponding scenario, the maximum allowable segmentation depth and the current segmentation depth may be determined in a similar manner.
  • the current coding block may be divided into four equal-sized sub-coded blocks.
  • a coding block of size 64*64 may be divided into four coding blocks of size 32*32, for example,
  • the coded block of size 16*16 is divided into four coded blocks of size 8*8.
  • the first segmentation depth threshold is less than or equal to the maximum allowable segmentation depth of the video frame, if the maximum allowable segmentation depth is 5, the first segmentation depth threshold may be 5, 4, 3, or 2, and the like; If the maximum allowable segmentation depth is 3, the first segmentation depth threshold may be 3, 2, or 1, and the like, and other similar scenarios may be deduced.
  • the first segmentation depth threshold is greater than or equal to the second segmentation depth threshold.
  • the first segmentation depth threshold is greater than or equal to the third segmentation depth threshold. Therefore, after the first segmentation depth threshold is determined, the second segmentation depth threshold or the third segment may be selected within a range of values less than or equal to the first segmentation depth threshold. The specific value of the depth threshold.
  • the specific value of the second segmentation depth threshold may be 2 or 1.
  • the specific value of the third segmentation depth threshold may be 2 or 1.
  • the specific value of the second segmentation depth threshold may be 4 or 3.
  • the specific value of the third segmentation depth threshold may be 3 or 2 or 1.
  • the third segmentation depth threshold and the second segmentation depth threshold may be regarded as the same segmentation depth threshold.
  • the second segmentation depth threshold, the first segmentation depth threshold, and the third segmentation depth threshold may also be regarded as the same segmentation depth.
  • Threshold That is to say, when some segmentation depth thresholds are equal, the equal segmentation depth thresholds can be regarded as the same segmentation depth threshold.
  • FIG. 2 is a schematic flowchart diagram of another method for processing a block division in video coding according to another embodiment of the present invention.
  • the relative positional relationship between the first coding block and the subdivided region, the first segmentation depth threshold, the second segmentation depth threshold, and the third segmentation depth threshold are mainly referenced to determine the coding block.
  • the subdivided area of the video frame includes a region of interest of the video frame and an edge region of the video frame.
  • step 101 The specific manner of obtaining the subdivided area of the image may be various. For details, refer to the description of step 101. This embodiment is not described in detail herein.
  • step 203 If yes, go to step 203.
  • step 208 is performed.
  • the first coding block of the video frame may refer to any one of the video frames.
  • the current segmentation depth of the first coding block may be any segmentation depth less than or equal to the maximum allowed segmentation depth of the video frame, that is, the size of the first coding block may be any greater than or equal to the minimum coding block size allowed. size.
  • the size of the first code block may be 64*64, 32*32, 16*16 or other sizes allowed.
  • step 204 If yes, go to step 204.
  • step 207 is performed.
  • a first coding block of the video frame includes a pixel in an overlap region of a region of interest and an edge region of the video frame.
  • step 205 is performed.
  • step 206 is performed.
  • the first segmentation depth threshold is smaller than the maximum allowable segmentation depth of the video frame, and therefore, the specific value of the first segmentation depth threshold may be selected within a range of values smaller than the maximum allowable segmentation depth of the video frame. For example, when the maximum allowable segmentation depth of the video frame is 3, the specificity of the first segmentation depth threshold is specific. The value can be 2. For another example, when the maximum allowable segmentation depth is 5, the specific value of the first segmentation depth threshold may be 4 or 3 or 2 or the like.
  • step 208 is performed.
  • step 210 is performed.
  • step 208 is performed.
  • the second segmentation depth threshold is mainly used to limit pixel points in the video frame that do not include the region of interest and the edge region of the video frame, but includes the region of interest and the edge of the video frame.
  • the depth of division of the coded block of the pixel in the non-overlapping region of the region is mainly used to limit pixel points in the video frame that do not include the region of interest and the edge region of the video frame, but includes the region of interest and the edge of the video frame.
  • the second segmentation depth threshold is smaller than the maximum allowable segmentation depth of the video frame. Therefore, the specific value of the second segmentation depth threshold may be selected within a range of values smaller than the maximum allowable segmentation depth of the video frame. For example, when the maximum allowable segmentation depth of the video frame is 3, the specific value of the second segmentation depth threshold may be 2 or 1. For another example, when the first segmentation depth threshold is 5, the specific value of the second segmentation depth threshold may be 4 or 3 or 2 or 1 or the like.
  • step 210 is performed.
  • step 208 is performed.
  • the third segmentation depth threshold is mainly used to limit the segmentation depth of the coding block of the pixel in the video frame that does not include the subdivision region.
  • the third segmentation depth threshold is smaller than the maximum allowable segmentation depth of the video frame. Therefore, the specific value of the third segmentation depth threshold may be selected within a range of values smaller than the maximum allowable segmentation depth of the video frame. For example, when the maximum allowable segmentation depth of the video frame is 3, the specific value of the third segmentation depth threshold may be 2 or 1. For another example, when the first segmentation depth threshold is 5, the specific value of the third segmentation depth threshold may be 4 or 3 or 2 or 1 or the like. The third segmentation depth threshold is less than or equal to the second segmentation depth.
  • step 210 is performed.
  • the sub-block division processing can be performed by way of example in FIG.
  • the coding block CU-1 in the video frame may be subjected to sub-block division processing according to the example of FIG. 2, and it is assumed that the coding block CU-1 is divided into four coding blocks, which are CU-11, CU-12, CU-, respectively. 13 and CU-14, then for each coding block in CU-11, CU-12, CU-13, and CU-14, sub-block division processing may also be performed according to the example of FIG. 2, for example, CU-11 may be used. Cut into 4 coded blocks, and CU-12 may no longer perform sub-block splitting, and so on.
  • the first coding block of the video frame after acquiring the subdivided area of the video frame, when determining that the first coding block of the video frame includes the pixel in the overlapping area of the region of interest and the edge region of the video frame, and The current segmentation depth of a coding block is smaller than the first segmentation depth threshold of the video frame, and the first coding block is subjected to sub-block segmentation. That is to say, the relative positional relationship between the first coding block of the video frame and the overlapping area of the region of interest and the edge region can determine to some extent whether the first coding block performs sub-block segmentation under the current segmentation depth.
  • the above technical solution of the present embodiment is advantageous in reducing the conventional mechanism of determining whether to continue sub-block division for the current coding block by calculating and comparing the current coding block before and after the division. Determining whether the current coded block performs the computational complexity of the sub-block segmentation under the current segmentation depth, thereby facilitating the reduction of the occupation of the relevant computing resources.
  • FIG. 3 is a schematic flowchart diagram of another method for processing a block division in video coding according to another embodiment of the present invention.
  • the relative positional relationship between the first coding block and the subdivided region and the first segmentation depth threshold are mainly referred to (the first segmentation depth threshold is equal to the maximum allowable segmentation depth of the video frame in this embodiment).
  • the third segmentation depth threshold or the like is used to determine the segmentation processing mode of the coding block.
  • the subdivided area of the video frame includes a region of interest of the video frame and the video frame The edge area.
  • step 101 The specific manner of obtaining the subdivided area of the image may be various. For details, refer to the description of step 101. This embodiment is not described in detail herein.
  • step 303 If yes, go to step 303.
  • step 308 is performed.
  • the first coding block of the video frame may refer to any one of the video frames.
  • the current segmentation depth of the first coding block may be any segmentation depth less than or equal to the maximum allowed segmentation depth of the video frame, that is, the size of the first coding block may be any greater than or equal to the minimum coding block size allowed. size.
  • the size of the first code block may be 64*64, 32*32, 16*16 or other sizes allowed.
  • step 304 If yes, go to step 304.
  • step 305 is performed.
  • a first coding block of the video frame includes a pixel in an overlap region of a region of interest and an edge region of the video frame.
  • step 306 is performed.
  • step 308 is performed.
  • the third segmentation depth threshold is smaller than the maximum allowable segmentation depth of the video frame. Therefore, the specific value of the third segmentation depth threshold may be selected within a range of values smaller than the maximum allowable segmentation depth of the video frame. For example, when the maximum allowable segmentation depth of the video frame is 3, the specific value of the third segmentation depth threshold may be 2 or 1. For another example, when the first segmentation depth threshold is 5, the specific value of the third segmentation depth threshold may be 4 or 3 or 2 or 1 or the like.
  • Rate distortion cost after block segmentation Determine whether a rate distortion cost of the first coding block is greater than the first coding block. Rate distortion cost after block segmentation.
  • step 308 is performed.
  • the sub-block division processing can be performed by way of example in FIG.
  • the first coding block of the video frame after acquiring the subdivided area of the video frame, when determining that the first coding block of the video frame includes the pixel in the overlapping area of the region of interest and the edge region of the video frame, and The current segmentation depth of a coding block is smaller than the maximum allowed segmentation depth of the video frame, and the first coding block is subjected to sub-block segmentation, that is, between the first coding block of the video frame and the overlapping region of the region of interest and the edge region.
  • the relative positional relationship may determine whether the first coding block performs sub-block segmentation at the current segmentation depth to some extent, compared to completely calculating and comparing the rate distortion of the current coding block before and after the division.
  • the foregoing technical solution of the present embodiment is advantageous for reducing the computational complexity of determining whether the current coding block performs sub-block segmentation at the current segmentation depth, thereby facilitating reduction of correlation. Calculate the occupation of resources.
  • FIG. 4 is a schematic flowchart diagram of a block segmentation processing method in video coding according to another embodiment of the present invention.
  • the relative positional relationship between the first coding block and the subdivided region and the first segmentation depth threshold are mainly referred to (in the embodiment, the first segmentation depth threshold is equal to the maximum allowable segmentation depth of the video frame).
  • the third segmentation depth threshold or the like is used to determine the segmentation processing mode of the coding block.
  • a block segmentation processing method in video coding provided by another embodiment of the present invention may include:
  • the subdivided area of the video frame includes at least one of a region of interest of the video frame and an edge region of the video frame.
  • step 101 The specific manner of obtaining the subdivided area of the video frame may be various. For details, refer to the description of step 101, which is not detailed herein.
  • step 407 is performed.
  • the first coding block of the video frame may refer to any one of the video frames.
  • the current segmentation depth of the first coding block may be any segmentation depth less than or equal to the maximum allowed segmentation depth of the video frame, that is, the size of the first coding block may be any greater than or equal to the minimum coding block size allowed. size.
  • the size of the first code block may be 64*64, 32*32, 16*16 or other sizes allowed.
  • step 404 is performed.
  • step 406 is performed.
  • the first coding block of the video frame may refer to one of the coding blocks or any one of the coding blocks.
  • the size of the first code block may be the maximum size allowed (for example, 64*64), or the next largest size allowed (for example, 32*32), or the next large size allowed (for example, 16*16), or Other sizes allowed.
  • step 407 is performed.
  • the third segmentation depth threshold is smaller than the maximum allowable segmentation depth of the video frame. Therefore, the specific value of the third segmentation depth threshold may be selected within a range of values less than or equal to the maximum allowable segmentation depth of the video frame. For example, when the maximum allowable segmentation depth of the video frame is 3, the specific value of the third segmentation depth threshold may be 2 or 1. For another example, when the first segmentation depth threshold is 5, the specific value of the third segmentation depth threshold may be 4 or 3 or 2 or 1 or the like.
  • step 406 is performed.
  • step 407 is performed.
  • the sub-block division processing can be performed by way of example in FIG.
  • the relative positional relationship between the first coding block and the subdivided region of the video frame can determine to some extent whether the first coding block performs sub-block segmentation under the current segmentation depth, so
  • the above technical solution of the present embodiment is advantageous for reducing the current coding block at the current state by determining the conventional mechanism for determining whether to continue the sub-block division for the current coding block before and after the current coding block partitioning and before and after the division. Whether the computational complexity of sub-block segmentation is performed under the segmentation depth is beneficial to reduce the occupation of related computing resources.
  • FIG. 5 is a schematic flowchart diagram of a block segmentation processing method in video coding according to another embodiment of the present invention.
  • the relative positional relationship between the first coding block and the subdivision area and the first segmentation depth threshold are mainly referred to.
  • the first segmentation depth threshold is equal to the maximum allowable segmentation depth of the video frame.
  • the rate distortion cost, etc. to determine the segmentation processing mode of the coding block.
  • a block segmentation processing method in video coding provided by another embodiment of the present invention may include:
  • the subdivided area of the video frame includes a region of interest of the video frame and an edge region of the video frame.
  • step 101 The specific manner of obtaining the subdivided area of the video frame may be various. For details, refer to the description of step 101, which is not detailed herein.
  • step 506 is performed.
  • the first coding block of the video frame may refer to any one of the video frames.
  • the current segmentation depth of the first coding block may be any segmentation depth less than or equal to the maximum allowed segmentation depth of the video frame, that is, the size of the first coding block may be any greater than or equal to the minimum coding block size allowed. size.
  • the size of the first code block may be 64*64, 32*32, 16*16 or other sizes allowed.
  • a first coding block of the video frame includes a pixel in a region of interest of the video frame and an overlap region of the edge region.
  • step 504 is performed.
  • step 506 is performed.
  • the relative positional relationship between the first coding block of the video frame and the overlapping area of the region of interest and the edge region can determine to some extent whether the first coding block performs sub-block segmentation under the current segmentation depth.
  • the above technical solution of the present embodiment is advantageous for reducing the current determination, compared to the conventional mechanism for determining whether to continue sub-block division for the current coding block by calculating and comparing the current coding block before and after the division. Whether the coding block performs sub-block division under the current segmentation depth Calculating the complexity, which in turn helps to reduce the occupation of related computing resources.
  • FIG. 6 is a schematic flowchart diagram of a block segmentation processing method in video coding according to another embodiment of the present invention.
  • the relative positional relationship between the first coding block and the subdivision area and the first segmentation depth threshold are mainly referred to (the first segmentation depth threshold is equal to the maximum allowable segmentation depth of the video frame in this embodiment).
  • the rate distortion cost, etc. to determine the segmentation processing mode of the coding block.
  • a block segmentation processing method in video coding provided by another embodiment of the present invention may include:
  • the subdivided area of the video frame includes an edge area or a region of interest of the video frame.
  • step 101 The specific manner of obtaining the subdivided area of the video frame may be various. For details, refer to the description of step 101, which is not detailed herein.
  • step 603 is performed.
  • step 606 is performed.
  • the first coding block of the video frame may refer to any one of the video frames.
  • the current segmentation depth of the first coding block may be any segmentation depth less than or equal to the maximum allowed segmentation depth of the video frame, that is, the size of the first coding block may be any greater than or equal to the minimum coding block size allowed. size.
  • the size of the first code block may be 64*64, 32*32, 16*16 or other sizes allowed.
  • step 605 is performed.
  • step 604 is performed.
  • step 605 is performed.
  • step 606 is performed.
  • the first coding block when determining that the first coding block of the video frame includes an edge region of the video frame or a pixel in the region of interest a point, and the current coding depth of the first coding block is smaller than a maximum allowed division depth of the video frame, the first coding block may be subjected to sub-block division, when the first coding block does not include an edge region of the video frame or is interested For the pixel points in the region, the size relationship of the rate distortion cost before and after the first coding block segmentation may be referenced to determine whether the first code block continues to perform the sub-block segmentation.
  • the relative positional relationship between the first coding block and the edge region or the region of interest may determine to some extent whether the first coded block performs sub-block segmentation at the current segmentation depth, which is compared to complete
  • the technical solution of the present embodiment is beneficial to reduce the current coding block at the current segmentation depth by calculating and comparing the current rate of the current coding block before and after the division to determine whether to continue the sub-block division of the current coding block. Whether the computational complexity of sub-block segmentation is performed, thereby reducing the occupation of related computing resources.
  • FIG. 7 is a schematic flowchart diagram of another method for processing a block division in video coding according to another embodiment of the present invention.
  • the relative positional relationship between the first coding block and the subdivision area and the first segmentation depth threshold are mainly referred to.
  • the first segmentation depth threshold is equal to the maximum allowable segmentation depth of the video frame.
  • the third segmentation depth threshold or the like is used to determine the segmentation processing mode of the coding block.
  • the block segmentation processing method in another video coding provided by another embodiment of the present invention may include:
  • the subdivided area of the video frame includes a region of interest of the video frame and an edge region of the video frame.
  • step 101 The specific manner of obtaining the subdivided area of the video frame may be various. For details, refer to the description of step 101, which is not detailed herein.
  • step 708 is performed.
  • the first coding block of the video frame may refer to any one of the video frames.
  • the current segmentation depth of the first coding block may be any segmentation depth less than or equal to the maximum allowed segmentation depth of the video frame, that is, the size of the first coding block may be any greater than or equal to the minimum coding block size allowed. size.
  • the size of the first code block may be 64*64, 32*32, 16*16 or other sizes allowed.
  • step 704 is performed.
  • step 705 is performed.
  • step 706 is performed.
  • step 708 is performed.
  • step 706 is performed.
  • the third segmentation depth threshold is smaller than the maximum allowable segmentation depth of the video frame. Therefore, the specific value of the third segmentation depth threshold may be selected within a range of values less than or equal to the maximum allowable segmentation depth of the video frame. For example, when the maximum allowable segmentation depth of the video frame is 3, the specific value of the third segmentation depth threshold may be 2 or 1. For another example, when the first segmentation depth threshold is 5, the specific value of the third segmentation depth threshold may be 4 or 3 or 2 or 1 or the like.
  • step 708 is performed.
  • the first coded block may be divided into sub-blocks, and when the first coded block does not include pixels in the edge region of the video frame, reference may be made.
  • the condition of the rate distortion cost before and after the first coding block is divided to determine whether the first coding block continues to perform sub-block division.
  • the relative positional relationship between the first coding block and the edge region of the video frame can determine to some extent whether the first coding block performs sub-block segmentation under the current segmentation depth, which is compared to completely
  • the foregoing technical solution of the present embodiment helps to reduce the current coding block at the current segmentation depth, by calculating and comparing the current rate of the current coding block before and after the division of the coding block to determine whether to continue the sub-block division of the current coding block. Whether the computational complexity of sub-block segmentation is performed, thereby reducing the occupation of related computing resources.
  • FIG. 8-a is a schematic flowchart of another method for processing a block division in video coding according to another embodiment of the present invention.
  • another method for processing a block division in video coding according to another embodiment of the present invention may include:
  • the subdivided area of the first video frame includes a region of interest of the first video frame and an edge region of the first video frame.
  • the first image group may be any one of the image groups in the code stream.
  • the first video frame belongs to the first image group (English, group of pictures, abbreviated: GOP).
  • the specific manner of obtaining the subdivision area of the first video frame may be various.
  • the first video frame may be matched by using a region matching algorithm to obtain a subdivided region of the first video frame, for example, matching the first video frame based on the region matching algorithm.
  • the process for example, can identify which regions in the first video frame are subdivided regions and which regions are not subdivided regions.
  • the subdivision area of the first video frame may be obtained according to the configuration instruction in the configuration file.
  • the configuration instruction in the configuration file may specifically specify which areas of the first video frame are subdivision areas, and the first video frame. Which areas are not subdivisions.
  • the method obtains a subdivision area of the first video frame.
  • the algorithm for generating the Saliency map is intended to output a block diagram such as the one shown in the right figure in FIG. 8-b.
  • the left side of FIG. 8-b is the original picture, and the right side of FIG. 8-b is the left picture.
  • Each color in the right image in Figure 8-b represents a weight value.
  • the pure white coding block in the saliency map has the highest weight value, for example, 5, and the pure black coding block in the saliency map has the lowest weight value, for example, 1.
  • the size of a macroblock is determined (16 ⁇ 16), and each macroblock in the saliency map corresponds to a weight value; in the H.265 standard, the size of the CU is variable.
  • Each of the smallest CU-covered pixel regions (eg, 8 ⁇ 8) in the significance map corresponds to one weight value, and the weight value corresponding to the pixel region covered by the CU may be weighted by the weight value corresponding to any CU of the determined size. The value is calculated. For example, the weight value of a 16 ⁇ 16-sized CU can be averaged by the weight values corresponding to the four 8 ⁇ 8 pixel regions covered by it.
  • the method for determining the weight value of each coding block on the significance map may be various.
  • the detected person area may be defined as an ROI.
  • the weight value of the coded block including the pixel points in the ROI may be set to 2
  • the coded block weight value of the pixel point not including the ROI may be set to 1.
  • the coded block including the pixels of the eyes, nose, and mouth is obtained as the highest weight, and the pixel points including the edge of the face are included.
  • the weight of the coded block is second, and the other face flat areas are again, and other areas outside the face can be assigned the lowest weight.
  • the number of bits T G used by the first image group is calculated by the formula (1) according to the code rate, the frame rate, and the number of video frames in the first image group.
  • S represents a code rate
  • N represents the number of frames in the GOP
  • fps represents a frame rate
  • the number of bits T f allocated to each video frame in the first image group is calculated according to T G by equation (2), where w I is the weight corresponding to the I frame in the first image group, w B Is the weight corresponding to the B frame in the first image group, w P is the weight corresponding to the P frame in the first image group, N I is the number of I frames in the first image group, and N B is the first image The number of B frames in the group, N P is the number of P frames in the first image group.
  • T f T G /(w I ⁇ N I +w P ⁇ N P +w B ⁇ N B ) ⁇ w I (2)
  • Sub-block partitioning is performed on each coding block in the first video frame.
  • the sub-block division may be performed on each coding block in the first video frame according to the block division processing method in any one of the video coding examples exemplified in the foregoing method embodiments.
  • the manner in which each coding block of another video frame in the first picture group performs sub-block division may be similar.
  • the parameter RDCost may be used in the process of sub-block segmentation.
  • the calculation of RDCost may use the quantization parameter corresponding to the first video frame (English: quantization parameter, QP).
  • QP quantization parameter
  • the QP of the first video frame may also be predicted according to the context.
  • the quantization parameter QP(j) of the jth video frame in one GOP may be calculated according to formula (3).
  • MAD pred (j) in the formula (3) represents the maximum absolute difference before and after encoding of the jth video frame in the predicted GOP.
  • the MAD prediction generally uses the MAD of the previous coded frame (ie, the j-1th frame).
  • MAD represents the complexity of the video frame.
  • T(j) represents the target bit rate. Under the condition that the target bit rate T(j) is constant, the larger the MAD is, the larger the QP is, which means that the more complicated the video frame is, The worse the details of the encoded video frame.
  • Npixel(j) represents the total number of pixels included in the jth video frame
  • ⁇ and ⁇ are adjustment parameters
  • ⁇ and ⁇ are usually set to 1.
  • QP(j) represents the QP of the jth video frame.
  • the QP of each CU can be calculated based on the formula (4).
  • k in equation (4) represents the kth CU
  • w i represents the weight value of the i th CU
  • T i (j) represents the remaining target code rate of the jth video frame when encoding the i th CU
  • N represents the number of uncoded CUs of the jth video frame
  • N pixel,i (j) represents the number of pixels of the i th CU.
  • QP k (j) represents the kth QP in the jth video frame.
  • MAD pred,k (j) represents the maximum absolute difference before and after the kth CU encoding of the jth video frame in the predicted GOP.
  • the CU coding mainly includes quantization of residuals and entropy coding of coding modes, parameters, and quantized residuals.
  • the relative positional relationship between the current coding block and the edge region of the video frame can determine to some extent whether the current coding block performs sub-block segmentation at the current segmentation depth, which is compared to completely calculating and comparing the current
  • the foregoing technical solution of the present embodiment helps to determine whether to determine whether the current coding block performs sub-blocks at the current segmentation depth, and the conventional mechanism for determining whether to perform the sub-block division on the current coding block before and after the block division.
  • the computational complexity of the segmentation which in turn helps to reduce the occupation of related computing resources.
  • an embodiment of the present invention provides a block segmentation processing apparatus 900 in video coding, which may include: an obtaining unit 910, a determining unit 920, and a dividing unit 930.
  • the obtaining unit 910 is configured to acquire a subdivision area of the video frame
  • the subdivided area of the video frame includes at least one of a region of interest of the video frame and an edge region of the video frame.
  • the manner in which the obtaining unit 910 obtains the subdivided area of the video frame may be any one of the foregoing embodiments.
  • the determining unit 920 is configured to determine that the first coding block of the video frame includes pixel points in the subdivided region.
  • the dividing unit 930 is configured to perform sub-block segmentation on the first coding block.
  • the determining unit 920 is further configured to: before the performing the sub-block partitioning on the first coding block, determining that a current segmentation depth of the first coding block is smaller than a first segmentation depth a threshold, wherein the first partition depth threshold is less than or equal to a maximum allowed partition depth of the video frame.
  • the subdivided region of the video frame includes a region of interest of the video frame and an edge region of the video frame; wherein, in the determining the first of the video frame
  • the encoding block includes an aspect of the pixel in the subdivided region, and the determining unit 920 is specifically configured to determine that the first encoding block of the video frame includes the pixel of the overlapping region of the region of interest and the edge region .
  • the subdivided region of the video frame includes a region of interest of the video frame and an edge region of the video frame; wherein the determining the first encoding of the video frame The block includes an aspect of the pixel in the subdivided region, and the determining unit 920 is specifically configured to: determine that the first coding block of the video frame includes a pixel in the region of interest and does not include the video frame a pixel point in the edge region, or determining that the first coded block of the video frame includes a pixel point in an edge region of the video frame and does not include a pixel point in the region of interest of the video frame;
  • the determining unit 920 is further configured to: before performing the sub-block partitioning on the first coding block, determining that a rate distortion cost of the first coding block is greater than a rate distortion of the first coding block after sub-block division cost.
  • the determining unit 920 is further configured to: before performing the sub-block partitioning on the first coding block, determining that a current segmentation depth of the first coding block is smaller than a second segmentation depth threshold, The second segmentation depth threshold is smaller than the first segmentation depth threshold.
  • the subdivided area of the video frame includes an area of interest of the video frame and an edge area of the video frame; the determining unit 920 is further configured to determine the video frame.
  • a second coding block comprising pixel points in the region of interest and not including pixel points in an edge region of the video frame, or determining that a second coding block of the video frame includes an edge of the video frame a pixel in the edge region and not including a pixel in the region of interest of the video frame; determining that the rate distortion cost of the second encoding block is less than or equal to the rate distortion of the second encoding block after sub-block segmentation a cost; determining that the second coded block does not perform sub-block splitting.
  • the determining unit 920 is further configured to: determine that the third coding block of the video frame does not include the pixel point in the subdivision area; and determine that the third coding block does not perform sub Block splitting.
  • the determining unit 920 is further configured to: before determining that the third coding block does not perform sub-block segmentation, determine that a current segmentation depth of the third coded block is equal to a third segmentation depth threshold. Wherein the third partition depth threshold is less than a maximum allowed partition depth of the video frame.
  • the determining unit 920 is further configured to: before determining that the third coding block does not perform sub-block segmentation, determine that a rate distortion cost of the third coding block is less than or equal to the first The rate-distortion cost of the sub-block partitioning by the three-coded block.
  • the determining unit 920 is further configured to: determine that a fourth coding block of the video frame does not include a pixel point in the subdivided region; determine a rate distortion of the fourth coding block The cost is greater than the rate distortion cost of the fourth coding block after sub-block division;
  • the dividing unit 930 is further configured to perform sub-block segmentation on the fourth coding block.
  • the determining unit 920 is further configured to: before the sub-block division of the fourth coding block, determine that a current segmentation depth of the fourth coding block is smaller than a third segmentation depth. a threshold; wherein the third partition depth threshold is less than a maximum allowed partition depth of the video frame.
  • the block division processing device 900 in video coding can be integrated in the video encoding device.
  • the video encoding device can be any device that needs to collect, store, or transmit audio signals, such as a mobile phone, a tablet computer, a personal computer, a notebook computer, and the like.
  • the block segmentation processing apparatus 900 in the video coding of this embodiment is acquiring the fineness of the video frame. After the sub-region, when it is determined that the first coding block of the video frame includes pixel points in the subdivided region, the first coding block is subjected to sub-block division.
  • the subdivided area of the video frame includes at least one of an ROI of the video frame and an edge area of the video frame. That is to say, the relative positional relationship between the first coding block and the subdivision area of the video frame can determine to some extent whether the first coding block performs sub-block division under the current segmentation depth.
  • the above technical solution of the present invention is advantageous for reducing the determination of the current coding block, compared to the conventional mechanism of determining whether to continue sub-block division for the current coding block by calculating and comparing the current coding block before and before the division. Whether the computational complexity of sub-block segmentation is performed under the current segmentation depth, thereby facilitating the reduction of the occupation of computing resources.
  • FIG. 10 is a structural block diagram of a video encoding apparatus according to another embodiment of the present invention.
  • the video encoding apparatus 1000 may include at least one processor 1001, a memory 1005, and at least one communication bus 1002.
  • Communication bus 1002 is used to implement connection communication between these components.
  • the video encoding apparatus 1000 may further include: at least one network interface 1004, a user interface 1003, and the like.
  • the user interface 1003 includes a display (such as a touch screen, a liquid crystal display or a holographic image (English: Holographic) or a projection (English: Projector), etc.), and a click device (for example, a mouse, a trackball (English: trackball) touch) Board or touch screen, etc.), camera and / or pickup device.
  • the memory 1005 can include read only memory and random access memory and provides instructions and data to the processor 1001.
  • a portion of the memory 1005 may also include a non-volatile random access memory.
  • the memory 1005 stores elements, executable modules or data structures, or a subset thereof, or their extended set: acquisition unit 910, determination unit 920, and partition unit 930.
  • the processor 1001 is configured to acquire a subdivided region of a video frame by executing a code or an instruction in the memory 1005, where the subdivided region of the video frame includes a region of interest of the video frame and At least one of edge regions of the video frame; determining that the first coding block of the video frame includes pixel points in the subdivision region; and performing sub-block segmentation on the first coding block.
  • the processor 1001 is further configured to: before the sub-block division of the first coding block, determine that a current segmentation depth of the first coding block is smaller than a first segmentation depth threshold. Where the first partition depth threshold is less than or equal to a maximum allowable score of the video frame Cut depth.
  • the subdivided area of the video frame includes a region of interest of the video frame and an edge region of the video frame; wherein the processor 1001 is configured to determine the video frame
  • the first coding block includes pixel points of the region of interest and the overlapping region of the edge region.
  • the subdivided area of the video frame includes a region of interest of the video frame and an edge region of the video frame; wherein the processor 1001 is configured to determine the video frame
  • the first coded block includes pixel points in the region of interest and does not include pixel points in an edge region of the video frame, or determines that a first coded block of the video frame includes an edge region of the video frame Pixels and do not include pixels in the region of interest of the video frame;
  • the processor 1001 is further configured to: before performing the sub-block partitioning on the first coding block, determining that a rate distortion cost of the first coding block is greater than a rate after the first coding block performs sub-block division Distortion cost.
  • the processor 1001 is further configured to: before performing the sub-block partitioning on the first coding block, determining that a current segmentation depth of the first coding block is smaller than a second segmentation depth threshold, The second segmentation depth threshold is less than the first segmentation depth threshold.
  • the subdivided area of the video frame includes a region of interest of the video frame and an edge region of the video frame; wherein the processor 1001 is further configured to determine the A second coded block of the video frame includes a pixel in the region of interest and does not include a pixel in an edge region of the video frame, or determines that a second encoded block of the video frame includes an edge of the video frame a pixel in the region and not including a pixel in the region of interest of the video frame; determining that a rate distortion penalty of the second encoded block is less than or equal to a rate distortion penalty of the second encoded block for sub-block segmentation Determining that the second coding block does not perform sub-block division.
  • the processor 1001 is further configured to: determine that a third coding block of the video frame does not include a pixel point in the subdivided region; and determine that the third coding block does not perform a sub- Block splitting.
  • the processor 1001 is further configured to: before determining that the third coding block does not perform sub-block segmentation, determining that a current segmentation depth of the third coded block is equal to a third segmentation depth a threshold, wherein the third partition depth threshold is smaller than a maximum allowed segmentation of the video frame depth.
  • the processor 1001 is further configured to: before determining that the third coding block does not perform sub-block segmentation, determining that a rate distortion cost of the third coding block is less than or equal to the The third coding block performs rate distortion cost after sub-block division.
  • the processor 1001 is further configured to: determine that a fourth coding block of the video frame does not include a pixel point in the subdivision region; and determine a rate distortion cost of the fourth coding block. And a rate distortion cost after the sub-block division is performed by the fourth coding block; and the fourth coding block is subjected to sub-block division.
  • the processor 1001 is further configured to: before the sub-block division of the fourth coding block, determine that a current segmentation depth of the fourth coding block is smaller than a third segmentation depth threshold. Wherein the third partition depth threshold is less than a maximum allowed partition depth of the video frame.
  • the video encoding device 1000 can be, for example, any device that needs to collect, store, or transmit audio signals, such as a mobile phone, a tablet computer, a personal computer, a notebook computer, and the like.
  • the first coding block of the video frame includes the pixel points in the subdivided area, and the first coding block is determined.
  • the subdivided area of the video frame includes at least one of an ROI of the video frame and an edge area of the video frame. That is to say, the relative positional relationship between the first coding block and the subdivision area of the video frame can determine to some extent whether the first coding block performs sub-block division under the current segmentation depth.
  • the above technical solution of the present invention is advantageous for reducing the determination of the current coding block, compared to the conventional mechanism of determining whether to continue sub-block division for the current coding block by calculating and comparing the current coding block before and before the division. Whether the computational complexity of sub-block segmentation is performed under the current segmentation depth, thereby facilitating the reduction of the occupation of computing resources.
  • the embodiment of the present invention further provides a computer storage medium, wherein the computer storage medium may store a program, where the program includes some or all of the block segmentation processing method in any one of the video encodings described in the foregoing method embodiments. step.
  • the embodiment of the present invention further provides a computer storage medium, wherein the computer storage medium can store a program, and the program includes some or all of the steps of any one of the video encoding methods described in the foregoing method embodiments.
  • the disclosed apparatus may be implemented in other ways.
  • the device embodiments described above are merely illustrative.
  • the division of the unit is only a logical function division.
  • there may be another division manner for example, multiple units or components may be combined or may be Integrate into another system, or some features can be ignored or not executed.
  • the mutual coupling or direct coupling or communication connection shown or discussed may be an indirect coupling or communication connection through some interface, device or unit, and may be electrical or otherwise.
  • the units described as separate components may or may not be physically separated, and the components displayed as units may or may not be physical units, that is, may be located in one place, or may be distributed to multiple network units. Some or all of the units may be selected according to actual needs to achieve the purpose of the solution of the embodiment.
  • each functional unit in each embodiment of the present invention may be integrated into one processing unit, or each unit may exist physically separately, or two or more units may be integrated into one unit.
  • the above integrated unit can be implemented in the form of hardware or in the form of a software functional unit.
  • the integrated unit if implemented in the form of a software functional unit and sold or used as a standalone product, may be stored in a computer readable storage medium.
  • the technical solution of the present invention which is essential or contributes to the prior art, or all or part of the technical solution, may be embodied in the form of a software product stored in a storage medium.
  • the instructions include a plurality of instructions for causing a computer device (which may be a personal computer, server or network device, etc., and in particular a processor in a computer device) to perform all or part of the steps of the methods of the various embodiments of the present invention.
  • the foregoing storage medium may include: a U disk, a mobile hard disk, a magnetic disk, an optical disk, a read only memory (English: read-only memory, abbreviation: ROM) or a random access memory (English: random access memory, abbreviation: RAM) and other media that can store program code.

Abstract

La présente invention concerne un procédé de traitement de segmentation de blocs lors d'un codage vidéo et un appareil associé. Le procédé de traitement de segmentation de blocs lors d'un codage vidéo consiste à : acquérir une région subdivisée d'une trame vidéo, la région subdivisée de la trame vidéo comprenant une région d'intérêt de la trame vidéo et/ou une région limitrophe de la trame vidéo; déterminer qu'un premier bloc de codage de la trame vidéo contient des points de pixel dans la région subdivisée; et segmenter le premier bloc de codage en sous-blocs. La solution selon les modes de réalisation de la présente invention facilite la réduction de la complexité de calcul permettant d'évaluer si un bloc de codage est segmenté en sous-blocs.
PCT/CN2014/085681 2014-09-01 2014-09-01 Procédé de traitement de mode de segmentation de blocs lors d'un codage vidéo et appareil associé WO2016033725A1 (fr)

Priority Applications (2)

Application Number Priority Date Filing Date Title
PCT/CN2014/085681 WO2016033725A1 (fr) 2014-09-01 2014-09-01 Procédé de traitement de mode de segmentation de blocs lors d'un codage vidéo et appareil associé
CN201480080086.1A CN106664404B (zh) 2014-09-01 2014-09-01 视频编码中的块分割方式处理方法和相关装置

Applications Claiming Priority (1)

Application Number Priority Date Filing Date Title
PCT/CN2014/085681 WO2016033725A1 (fr) 2014-09-01 2014-09-01 Procédé de traitement de mode de segmentation de blocs lors d'un codage vidéo et appareil associé

Publications (1)

Publication Number Publication Date
WO2016033725A1 true WO2016033725A1 (fr) 2016-03-10

Family

ID=55438975

Family Applications (1)

Application Number Title Priority Date Filing Date
PCT/CN2014/085681 WO2016033725A1 (fr) 2014-09-01 2014-09-01 Procédé de traitement de mode de segmentation de blocs lors d'un codage vidéo et appareil associé

Country Status (2)

Country Link
CN (1) CN106664404B (fr)
WO (1) WO2016033725A1 (fr)

Cited By (3)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN107566834A (zh) * 2017-10-10 2018-01-09 司马大大(北京)智能系统有限公司 帧内预测单元划分方法、装置及电子设备
CN110662048A (zh) * 2018-06-28 2020-01-07 华为技术有限公司 一种图像编码方法以及设备
CN112153317A (zh) * 2020-09-25 2020-12-29 杭州涂鸦信息技术有限公司 一种图像质量控制方法及其系统和设备

Families Citing this family (3)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
EP3920535A1 (fr) * 2016-11-21 2021-12-08 Panasonic Intellectual Property Corporation of America Paramètres de division dans un codeur d'image et une méthode de codage d'images
CN110225355A (zh) * 2019-06-22 2019-09-10 衢州光明电力投资集团有限公司赋腾科技分公司 基于感兴趣区域的高性能视频编码帧内预测优化方法
CN112750127B (zh) * 2021-02-04 2022-08-26 深圳市泽峰光电科技有限公司 一种用于原木端面测量的图像处理方法

Citations (5)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
US20080165861A1 (en) * 2006-12-19 2008-07-10 Ortiva Wireless Intelligent Video Signal Encoding Utilizing Regions of Interest Information
US20110194613A1 (en) * 2010-02-11 2011-08-11 Qualcomm Incorporated Video coding with large macroblocks
CN102792691A (zh) * 2010-01-12 2012-11-21 Lg电子株式会社 视频信号的处理方法和设备
CN103299634A (zh) * 2010-11-22 2013-09-11 联发科技(新加坡)私人有限公司 高效率视频编码约束分区大小的方法和装置
WO2013157820A1 (fr) * 2012-04-16 2013-10-24 삼성전자 주식회사 Procédé et dispositif de codage de vidéo utilisant une détection de contours à haute vitesse, et procédé et dispositif de décodage de vidéo associés

Patent Citations (5)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
US20080165861A1 (en) * 2006-12-19 2008-07-10 Ortiva Wireless Intelligent Video Signal Encoding Utilizing Regions of Interest Information
CN102792691A (zh) * 2010-01-12 2012-11-21 Lg电子株式会社 视频信号的处理方法和设备
US20110194613A1 (en) * 2010-02-11 2011-08-11 Qualcomm Incorporated Video coding with large macroblocks
CN103299634A (zh) * 2010-11-22 2013-09-11 联发科技(新加坡)私人有限公司 高效率视频编码约束分区大小的方法和装置
WO2013157820A1 (fr) * 2012-04-16 2013-10-24 삼성전자 주식회사 Procédé et dispositif de codage de vidéo utilisant une détection de contours à haute vitesse, et procédé et dispositif de décodage de vidéo associés

Cited By (3)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN107566834A (zh) * 2017-10-10 2018-01-09 司马大大(北京)智能系统有限公司 帧内预测单元划分方法、装置及电子设备
CN110662048A (zh) * 2018-06-28 2020-01-07 华为技术有限公司 一种图像编码方法以及设备
CN112153317A (zh) * 2020-09-25 2020-12-29 杭州涂鸦信息技术有限公司 一种图像质量控制方法及其系统和设备

Also Published As

Publication number Publication date
CN106664404B (zh) 2019-10-01
CN106664404A (zh) 2017-05-10

Similar Documents

Publication Publication Date Title
Min et al. Unified blind quality assessment of compressed natural, graphic, and screen content images
WO2016033725A1 (fr) Procédé de traitement de mode de segmentation de blocs lors d'un codage vidéo et appareil associé
US20140321552A1 (en) Optimization of Deblocking Filter Parameters
CN107454413B (zh) 一种保留特征的视频编码方法
Choi et al. Video quality assessment accounting for temporal visual masking of local flicker
Yuan et al. Low bit-rate compression of underwater image based on human visual system
Nami et al. BL-JUNIPER: A CNN-assisted framework for perceptual video coding leveraging block-level JND
Tandon et al. CAMBI: Contrast-aware multiscale banding index
CN110740316A (zh) 数据编码方法及装置
Wang et al. Perceptual video coding based on saliency and just noticeable distortion for H. 265/HEVC
CN113906762B (zh) 用于视频压缩的预处理
Nur Yilmaz A no reference depth perception assessment metric for 3D video
Zhao et al. Fast CU partition decision strategy based on human visual system perceptual quality
CN107509074B (zh) 基于压缩感知的自适应3d视频压缩编解码方法
Farah et al. Full-reference and reduced-reference quality metrics based on SIFT
CN114173131A (zh) 一种基于帧间相关性的视频压缩方法及系统
Khatoonabadi et al. Compressed-domain correlates of fixations in video
KR101358576B1 (ko) 주관적/객관적 동영상 비교를 통해 사용자가 인식하는 화질 저하를 최소화 하는 동영상 트랜스코딩 최적화 방법
TWI226189B (en) Method for automatically detecting region of interest in the image
Wang et al. PVC-STIM: Perceptual video coding based on spatio-temporal influence map
Himawan et al. Impact of automatic region-of-interest coding on perceived quality in mobile video
Fang et al. Review of existing objective QoE methodologies
US10848772B2 (en) Histogram-based edge/text detection
KR101362654B1 (ko) 컬러 및 명암 분석 기술을 활용한 동영상 트랜스코딩 최적화 방법
Amor et al. A block artifact distortion measure for no reference video quality evaluation

Legal Events

Date Code Title Description
121 Ep: the epo has been informed by wipo that ep was designated in this application

Ref document number: 14901226

Country of ref document: EP

Kind code of ref document: A1

NENP Non-entry into the national phase

Ref country code: DE

122 Ep: pct application non-entry in european phase

Ref document number: 14901226

Country of ref document: EP

Kind code of ref document: A1