WO2011153869A1 - Method, device and system for partition/encoding image region - Google Patents

Method, device and system for partition/encoding image region Download PDF

Info

Publication number
WO2011153869A1
WO2011153869A1 PCT/CN2011/072787 CN2011072787W WO2011153869A1 WO 2011153869 A1 WO2011153869 A1 WO 2011153869A1 CN 2011072787 W CN2011072787 W CN 2011072787W WO 2011153869 A1 WO2011153869 A1 WO 2011153869A1
Authority
WO
WIPO (PCT)
Prior art keywords
macroblock
region
image
interest
motion vector
Prior art date
Application number
PCT/CN2011/072787
Other languages
French (fr)
Chinese (zh)
Inventor
张智雄
Original Assignee
深圳市融创天下科技股份有限公司
Priority date (The priority date is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the date listed.)
Filing date
Publication date
Application filed by 深圳市融创天下科技股份有限公司 filed Critical 深圳市融创天下科技股份有限公司
Publication of WO2011153869A1 publication Critical patent/WO2011153869A1/en

Links

Classifications

    • HELECTRICITY
    • H04ELECTRIC COMMUNICATION TECHNIQUE
    • H04NPICTORIAL COMMUNICATION, e.g. TELEVISION
    • H04N19/00Methods or arrangements for coding, decoding, compressing or decompressing digital video signals
    • H04N19/10Methods or arrangements for coding, decoding, compressing or decompressing digital video signals using adaptive coding
    • H04N19/169Methods or arrangements for coding, decoding, compressing or decompressing digital video signals using adaptive coding characterised by the coding unit, i.e. the structural portion or semantic portion of the video signal being the object or the subject of the adaptive coding
    • H04N19/17Methods or arrangements for coding, decoding, compressing or decompressing digital video signals using adaptive coding characterised by the coding unit, i.e. the structural portion or semantic portion of the video signal being the object or the subject of the adaptive coding the unit being an image region, e.g. an object
    • GPHYSICS
    • G06COMPUTING; CALCULATING OR COUNTING
    • G06TIMAGE DATA PROCESSING OR GENERATION, IN GENERAL
    • G06T7/00Image analysis
    • G06T7/10Segmentation; Edge detection
    • G06T7/194Segmentation; Edge detection involving foreground-background segmentation
    • GPHYSICS
    • G06COMPUTING; CALCULATING OR COUNTING
    • G06TIMAGE DATA PROCESSING OR GENERATION, IN GENERAL
    • G06T7/00Image analysis
    • G06T7/20Analysis of motion
    • G06T7/215Motion-based segmentation
    • HELECTRICITY
    • H04ELECTRIC COMMUNICATION TECHNIQUE
    • H04NPICTORIAL COMMUNICATION, e.g. TELEVISION
    • H04N19/00Methods or arrangements for coding, decoding, compressing or decompressing digital video signals
    • H04N19/10Methods or arrangements for coding, decoding, compressing or decompressing digital video signals using adaptive coding
    • H04N19/102Methods or arrangements for coding, decoding, compressing or decompressing digital video signals using adaptive coding characterised by the element, parameter or selection affected or controlled by the adaptive coding
    • H04N19/124Quantisation
    • HELECTRICITY
    • H04ELECTRIC COMMUNICATION TECHNIQUE
    • H04NPICTORIAL COMMUNICATION, e.g. TELEVISION
    • H04N19/00Methods or arrangements for coding, decoding, compressing or decompressing digital video signals
    • H04N19/10Methods or arrangements for coding, decoding, compressing or decompressing digital video signals using adaptive coding
    • H04N19/134Methods or arrangements for coding, decoding, compressing or decompressing digital video signals using adaptive coding characterised by the element, parameter or criterion affecting or controlling the adaptive coding
    • H04N19/136Incoming video signal characteristics or properties
    • H04N19/137Motion inside a coding unit, e.g. average field, frame or block difference
    • H04N19/139Analysis of motion vectors, e.g. their magnitude, direction, variance or reliability
    • HELECTRICITY
    • H04ELECTRIC COMMUNICATION TECHNIQUE
    • H04NPICTORIAL COMMUNICATION, e.g. TELEVISION
    • H04N19/00Methods or arrangements for coding, decoding, compressing or decompressing digital video signals
    • H04N19/10Methods or arrangements for coding, decoding, compressing or decompressing digital video signals using adaptive coding
    • H04N19/134Methods or arrangements for coding, decoding, compressing or decompressing digital video signals using adaptive coding characterised by the element, parameter or criterion affecting or controlling the adaptive coding
    • H04N19/136Incoming video signal characteristics or properties
    • H04N19/14Coding unit complexity, e.g. amount of activity or edge presence estimation
    • GPHYSICS
    • G06COMPUTING; CALCULATING OR COUNTING
    • G06TIMAGE DATA PROCESSING OR GENERATION, IN GENERAL
    • G06T2207/00Indexing scheme for image analysis or image enhancement
    • G06T2207/10Image acquisition modality
    • G06T2207/10016Video; Image sequence
    • GPHYSICS
    • G06COMPUTING; CALCULATING OR COUNTING
    • G06TIMAGE DATA PROCESSING OR GENERATION, IN GENERAL
    • G06T2207/00Indexing scheme for image analysis or image enhancement
    • G06T2207/20Special algorithmic details
    • G06T2207/20021Dividing image into blocks, subimages or windows

Definitions

  • Another object of the embodiments of the present invention is to provide an image area dividing apparatus, and the apparatus includes:
  • a macroblock motion vector extraction module configured to extract motion directions of each macroblock of each frame of the video sequence
  • a macroblock motion vector statistics module for counting motion vectors of a current frame and calculating the same
  • the macroblock motion vector complex region processing module is used for labeling and extracting a relatively complex region of the macroblock motion vector in the current frame, and further correcting the image after the region according to the spatial correlation and time correlation of the macroblock, and dividing the same Regions of interest and areas of non-interest.
  • Another object of the embodiments of the present invention is to provide an image region encoding method, the method comprising the following steps:
  • the region of interest or non-interest region is determined according to the complexity of the motion vector of the image macroblock, the region with high motion vector complexity is the region of interest, and the region with low motion vector complexity is the non-interest region;
  • the coding quantization parameter is reduced for the image region of interest to improve the image quality of the region, and the coding quantization parameter is increased for the non-interest region of the image to keep the overall coding bits unchanged.
  • Another object of the embodiments of the present invention is to provide an image area coding system, the system comprising:
  • a macroblock motion vector extraction module configured to extract a motion-to-macroblock motion vector statistics module of each macroblock of each frame of the video sequence, for performing statistics on the motion vector of the current frame, and calculating the same
  • the macroblock motion vector complex region processing module is used for labeling and extracting a relatively complex region of the macroblock motion vector in the current frame, and further correcting the image after the region according to the spatial correlation and time correlation of the macroblock, and dividing the same Regions of interest and areas of non-interest.
  • a video encoding module configured to perform video encoding of different qualities according to the region of interest and the non-interest region divided by the image region dividing device.
  • Figure lc shows an image with a PSNR of 28. 3db in the prior art
  • FIG. 2 is a flowchart of an image area dividing method according to an embodiment of the present invention.
  • Figure 3a is an image of a frame in a tennis match of an embodiment of the present invention
  • Figure 3b is an image of the motion vector of each macroblock labeled in Figure 3a.
  • Figure 4 is a diagram of the image in Figures 3a and 3b divided into foreground and background regions;
  • FIG. 5 is a view showing further optimization of the image division area in FIG. 4;
  • FIG. 6 is a structural diagram of an image area dividing apparatus according to an embodiment of the present invention.
  • the motion vector of each macroblock in each frame of the video sequence is extracted, the motion vector of the current frame is counted, the number of the same motion vector is calculated, and the image is determined as the region of interest according to the complexity of the motion vector.
  • Non-interest regions thereby dividing the region of interest and the region of non-interest region of the image.
  • Smaller quantization parameters are used for the region of interest to improve the quality of the video.
  • higher quantization parameters are used for the non-interest region to balance the overall bit consumption and finally reach Improve the subjective quality of video.
  • x, i, j, m, n, v, and h are all natural numbers.
  • the specific method is as follows: spatially analyzing the macroblock of the image, and calculating the motion vector (MVX, MVY) of the current macroblock according to the absolute difference and the calculated difference of the Sum of Absolute Transform Difference (STD), and Store it.
  • 5102 Perform statistics on the motion vector of the current frame, and calculate the number of the same motion vector.
  • FIG. 3a is an image of a frame in a tennis match in an embodiment of the present invention
  • FIG. 3b is a diagram showing motion vectors of macroblocks in the image shown in FIG. 3a.
  • the matrix M in step S102 is sorted in a small to large manner and stored in the array M - ra " ⁇ , as shown in the following equation (3):
  • the area is considered to be the foreground
  • L_fore i - level ⁇ . - ⁇ - ⁇ (level ⁇ . + level i+l . + level ⁇ - x + level ⁇
  • is the line where the current macroblock is located
  • is the current macroblock
  • the serial number of the frame, "is the time correlation coefficient, and the value is [Q , 1] .
  • the comprehensive probability that the current macroblock is foreground is as shown in the following equation (7). Where ⁇ is the row in which the current macroblock is located, ⁇ "is the sequence number of the frame in which the current macroblock is located, and A is the time correlation coefficient, and the value is [Q , 1] .
  • FIG. 5 is a diagram showing further optimization of the image division area in FIG. 4.
  • the motion vector of each macroblock in each frame of the video sequence is extracted, the motion vector of the current frame is counted, the number of the same motion vector is calculated, and the image is determined as the region of interest according to the complexity of the motion vector.
  • the non-interest region, and the image after the segmentation is further accurately judged according to the spatial correlation and temporal correlation of the macroblock, thereby accurately dividing the foreground and background regions of the image.
  • FIG. 6 is a structural diagram of an image area dividing apparatus according to an embodiment of the present invention.
  • the device comprises the following parts:
  • a macroblock motion vector extraction module configured to extract a motion-to-macroblock motion vector statistics module of each macroblock of each frame of the video sequence, for performing statistics on the motion vector of the current frame, and calculating the number of the same MV;
  • the macroblock motion vector complex region processing module is used for labeling and extracting a relatively complex region of the macroblock motion vector in the current frame, and further correcting the image of the partitioned region according to the spatial correlation and temporal correlation of the macroblock, and dividing the same Regions of interest and areas of non-interest.
  • the macroblock motion vector complex region processing module includes a preliminary processing module and an optimization processing module, and the preliminary processing module is configured to label and extract a relatively complex region of a macroblock motion vector in a current frame; the optimization processing module is configured to use the current
  • the spatial correlation and temporal correlation of the macroblocks further accurately determine the images after the sub-regions, and divide the regions of interest and non-interest regions.
  • the motion vector of the image in the video sequence is extracted, and the image is determined as the region of interest or the non-interest region according to the complexity of the motion vector, and the spatial correlation and temporal correlation of the image after the region are determined according to the macroblock. Make further precise judgments to accurately segment the foreground and background regions of the image.
  • the motion vector of the image in the video sequence is extracted, and the image is determined as the region of interest or the non-interest region according to the complexity of the motion vector, and the spatial correlation and temporal correlation of the image after the region are determined according to the macroblock.
  • make further precise judgments to make foreground and background areas of the image The domain is precisely divided. Smaller quantization parameters are used for the region of interest to improve the quality of the video, and higher quantization parameters are used for the non-interest region to balance the overall bit consumption, and finally achieve the effect of improving the subjective quality of the video.
  • An embodiment of the present invention further provides an image area coding system, where the system includes:
  • a macroblock motion vector extraction module configured to extract a motion-to-macroblock motion vector statistics module of each macroblock of each frame of the video sequence, for performing statistics on the motion vector of the current frame, and calculating the same

Abstract

A method, a device and a system for partition/encoding image regions are provided, wherein, the method comprises: extracting motion vectors of an image in a video sequence, partitioning the image into regions of interest or regions of no interest according to the complexities of the motion vectors, and carrying out a further precise judgement on the region-partitioned image according to the spatial and temporal correlations of macro blocks, thereby implementing precise foreground and background regions partition on the image. In encoding, a smaller quantized parameter is adopted on the regions of interest to improve the video quality, and a larger quantized parameter is adopted on the regions of no interest, in order to keep the total bit consumption unchanged, and eventually achieve the effect of improving the subjective quality of the video.

Description

一种图像区域划分 /编码方法、 装置及系统 技术领域  Image area division/coding method, device and system
本发明涉及视频编码领域, 尤其涉及一种图像区域划分 /编码方法、 装置 及系统。 背景技术  The present invention relates to the field of video coding, and in particular, to an image area division/coding method, apparatus and system. Background technique
面对一个复杂的场景, 人类视觉注意系统 (HVS , human visual system) 能够迅速将注意力集中在少数几个显著的视觉对象上,对其进行优先处理, 该 过程被称为视觉注意,显著的视觉对象被称为感兴趣区域(region of interest, ROD o 在该机制的作用下, HVS对有限的信息加工资源进行了合理分配, 使视 觉感知过程具备了选择能力。 由此可见, 并非整个图像的所有区域在图像的主 观质量都有同等重要的地位,图像的主观质量是更多的由图像中的感兴趣区域 的质量决定的。 R0I检测对众多图像分析都具有较大的应用价值, 其中较为突 出的几个应用方向包括: 图像质量评估、 图像压缩与编码、 图像检索、 场景渲 染、 目标检测。 目前, 最常用的衡量图像、视频质量的标准是峰值信号与噪声之比(PSNR, PowerSignal - to - NoiseRatio, 信噪功率比), 用如下式 ( 1 ) 所示:  Faced with a complex scene, the human visual attention system (HVS) can quickly focus on a few significant visual objects and prioritize them. This process is called visual attention, significant. The visual object is called the region of interest (ROD o). Under the action of this mechanism, HVS allocates the limited information processing resources reasonably, which makes the visual perception process have the ability to select. Thus, it is not the entire image. All areas of the image have equal importance in the subjective quality of the image, and the subjective quality of the image is more determined by the quality of the region of interest in the image. R0I detection has great application value for many image analysis, among which Some of the more prominent application directions include: image quality assessment, image compression and encoding, image retrieval, scene rendering, target detection. Currently, the most commonly used standard for measuring image and video quality is the peak signal to noise ratio (PSNR, PowerSignal). - to - NoiseRatio, signal to noise power ratio), using the following equation (1) :
PSNRdB = 101og10 (2" - \f lMSE , 其中, MSE 为原始图像和编码后图像之间的均方误差, (2" _ 2为图像中最 大可能的信号值平方, n为表示每个象素的比特数。 PSNR dB = 101og 10 (2" - \f lMSE , where MSE is the mean square error between the original image and the encoded image, ( 2 " _ 2 is the square of the largest possible signal value in the image, n is for each The number of bits in the pixel.
PSNR是人们衡量两幅图像相似程度的最常用的指标, 该值越高, 我们通常 认为两幅图像越相似, 然而 PSNR并不能完全反应图像、 视频的主观质量。 如 图 la-图 Id所示, 其中图 la为原始图像, 图 lb的 PSNR = 30. 6db , 图 lc 的 PSNR = 28. 3db, 图 Id的 PSNR = 27. 7db。 虽然图 lb的 PSNR值最高, 但 人们往往会认为图 Id和原图图 la更相似, 图像质量更高。 这是因为在图 Id 中视觉感兴趣区域(如人脸、 眼镜等区域) 比图 lb和图 lc更为清晰, 即使该 图 Id在视觉不感兴趣的区域 (女孩身后的地板、 小提琴) 显得比图 lb和图 lc模糊。 以至即使图 Id整体的 PSNR比图 lb低将近 3个 db (即图像整体客观 质量相差较大), 我们依然会主观的人为图 Id中的图像更清晰。 由此可见, 图像中不同区域在人们的主观评价中具有不同的重要性, 图像 的主观质量是更多的由图像中的感兴趣区域的质量决定的。现有技术的图像编 码压缩方法没有对前景和背景进行精确划分,在码率一定的情况下, 不能将有 限的比特更合理的有机的分配到人类视觉感兴趣的区域上,不能提供人类主观 感觉更清晰的图像效果。 PSNR is the most commonly used indicator to measure the similarity between two images. The higher the value, the more similar we think the two images are, but the PSNR does not fully reflect the subjective quality of images and video. As shown in Figure la-Id, where la is the original image, Figure lb has PSNR = 30. 6db, Figure lc The PSNR = 28. 3db, Figure Id's PSNR = 27. 7db. Although the PSNR value of Figure lb is the highest, people tend to think that the picture Id is more similar to the original picture, and the picture quality is higher. This is because the visual interest area (such as face, glasses, etc.) in Figure Id is clearer than Figure lb and Figure lc, even if the figure Id is in the area of visually uninteresting (the floor behind the girl, the violin) Figure lb and Figure lc are blurred. Even if the overall PSNR of the graph Id is nearly 3 db lower than the graph lb (that is, the overall objective quality of the image is quite different), we will still be more subjective in the image of the artificial graph Id. It can be seen that different regions in the image have different importance in people's subjective evaluation, and the subjective quality of the image is more determined by the quality of the region of interest in the image. The prior art image coding compression method does not accurately divide the foreground and the background. In the case of a certain code rate, the limited bits cannot be organically allocated to the region of human visual interest, and the human subjective feeling cannot be provided. A clearer image effect.
发明内容 Summary of the invention
本发明实施例的目的在于提出一种图像区域划分方法,旨在解决现有技术 不能精确划分图形的前景和背景问题。  The object of the embodiments of the present invention is to provide an image region dividing method, which aims to solve the problem that the prior art cannot accurately divide the foreground and background of the graphics.
本发明实施例是这样实现的,一种图像区域划分方法,所述方法包括以下 步骤:  The embodiment of the present invention is implemented by the method for dividing an image region, and the method includes the following steps:
提取视频序列中每帧图像各宏块的运动向量;  Extracting a motion vector of each macroblock of each frame of the video sequence;
对当前帧的运动向量进行统计, 计算相同的运动向量的数量;  Counting the motion vectors of the current frame, and calculating the number of the same motion vectors;
标注并提取当前帧中宏块运动向量相对复杂的区域,划分感兴趣和非感兴 趣区域。  Label and extract the relatively complex areas of the macroblock motion vector in the current frame, and divide the interesting and non-interesting areas.
本发明实施例的另一目的在于提出一种图像区域划分装置, 所述装置包 括:  Another object of the embodiments of the present invention is to provide an image area dividing apparatus, and the apparatus includes:
宏块运动向量提取模块, 用于提取视频序列中每帧图像各宏块的运动向 宏块运动向量统计模块,用于对当前帧的运动向量进行统计,计算相同的a macroblock motion vector extraction module, configured to extract motion directions of each macroblock of each frame of the video sequence A macroblock motion vector statistics module for counting motion vectors of a current frame and calculating the same
MV的数量; The number of MVs;
宏块运动向量复杂区域处理模块,用于标注并提取当前帧中宏块运动向量 相对复杂的区域,对划分区域后的图像根据宏块的空间相关性及时间相关性进 行进一步精确判断, 划分其感兴趣区和非感兴趣区。  The macroblock motion vector complex region processing module is used for labeling and extracting a relatively complex region of the macroblock motion vector in the current frame, and further correcting the image after the region according to the spatial correlation and time correlation of the macroblock, and dividing the same Regions of interest and areas of non-interest.
本发明实施例的另一目的在于提出一种图像区域编码方法,, 所述方法包 括以下步骤:  Another object of the embodiments of the present invention is to provide an image region encoding method, the method comprising the following steps:
根据图像宏块的运动向量的复杂度判断划分感兴趣区域或非感兴趣区域, 运动向量复杂度高的区域为感兴趣区域,运动向量复杂度低的区域为非感兴趣 区域;  The region of interest or non-interest region is determined according to the complexity of the motion vector of the image macroblock, the region with high motion vector complexity is the region of interest, and the region with low motion vector complexity is the non-interest region;
对于图像感兴趣区域降低编码量化参数以提高该区域的图像质量,对图像 非感兴趣区域则提高编码量化参数以保持整体的编码比特不变。 本发明实施例的另一目的在于提出一种图像区域编码系统, 所述系统包 括:  The coding quantization parameter is reduced for the image region of interest to improve the image quality of the region, and the coding quantization parameter is increased for the non-interest region of the image to keep the overall coding bits unchanged. Another object of the embodiments of the present invention is to provide an image area coding system, the system comprising:
宏块运动向量提取模块, 用于提取视频序列中每帧图像各宏块的运动向 宏块运动向量统计模块,用于对当前帧的运动向量进行统计,计算相同的 a macroblock motion vector extraction module, configured to extract a motion-to-macroblock motion vector statistics module of each macroblock of each frame of the video sequence, for performing statistics on the motion vector of the current frame, and calculating the same
MV的数量; The number of MVs;
宏块运动向量复杂区域处理模块,用于标注并提取当前帧中宏块运动向量 相对复杂的区域,对划分区域后的图像根据宏块的空间相关性及时间相关性进 行进一步精确判断, 划分其感兴趣区和非感兴趣区。 视频编码模块,用于根据所述图像区域划分装置划分出的感兴趣区域和非 感兴趣区域, 进行不同质量的视频编码。 本发明的有益效果:本发明根据图像中宏块运动向量的复杂度判断图像为 感兴趣区域或非感兴趣区域, 从而对图像进行感兴趣区域和非感兴趣区域划 分。对感兴趣区域采用较小的量化参数以提高视频的质」 相应的, 对非感兴 趣区域采用较高的量化参数, 以平衡总体的比特消耗不 最终达到提高视频 主观质量的效果。 The macroblock motion vector complex region processing module is used for labeling and extracting a relatively complex region of the macroblock motion vector in the current frame, and further correcting the image after the region according to the spatial correlation and time correlation of the macroblock, and dividing the same Regions of interest and areas of non-interest. And a video encoding module, configured to perform video encoding of different qualities according to the region of interest and the non-interest region divided by the image region dividing device. Advantageous Effects of the Invention: According to the complexity of a macroblock motion vector in an image, the present invention determines an image as a region of interest or a region of non-interest, thereby performing a region of interest and a region of non-interest for the image. Minute. Smaller quantization parameters are used for the region of interest to improve the quality of the video. Correspondingly, higher quantization parameters are used for non-interest regions to balance the overall bit consumption without ultimately achieving the effect of improving the subjective quality of the video.
附图说明 DRAWINGS
图 la为现有技术中的原始图像;  Figure la is an original image in the prior art;
图 lb的现有技术中 PSNR为 30. 6db的图像 ;  The image of PSNR of 30. 6db in the prior art of FIG.
图 lc的 现有技术中 PSNR为 28. 3db的图像;  Figure lc shows an image with a PSNR of 28. 3db in the prior art;
图 Id的现有技术中 PSNR为 27. 7db的图像;  The prior art of Figure Id has an image with a PSNR of 27. 7 db;
图 2为本发明实施例的图像区域划分方法流程图;  2 is a flowchart of an image area dividing method according to an embodiment of the present invention;
图 3a为本发明实施例网球比赛中某一帧的图像;  Figure 3a is an image of a frame in a tennis match of an embodiment of the present invention;
图 3b为图 3a中标注了起中各宏块运动向量的图像  Figure 3b is an image of the motion vector of each macroblock labeled in Figure 3a.
图 4为图 3a和图 3b中的图像划分成前景和背景区域后的图;  Figure 4 is a diagram of the image in Figures 3a and 3b divided into foreground and background regions;
图 5为对图 4中的图像划分区域进行进一步优化后的图;  FIG. 5 is a view showing further optimization of the image division area in FIG. 4; FIG.
图 6为本发明实施例的一种图像区域划分装置结构图;  6 is a structural diagram of an image area dividing apparatus according to an embodiment of the present invention;
具体实施方式 detailed description
为了使本发明的目的、技术方案及优点更加清楚明白, 以下结合附图和实 施例, 对本发明进行进一步详细说明, 为了便于说明, 仅示出了与本发明实施 例相关的部分。 应当理解, 此处所描写的具体实施例, 仅仅用于解释本发明, 并不用以限制本发明。  The present invention will be further described in detail below with reference to the accompanying drawings and embodiments. For the purpose of explanation, only the parts related to the embodiments of the present invention are shown. It is understood that the specific embodiments described herein are merely illustrative of the invention and are not intended to limit the invention.
本发明实施例通过提取视频序列中每一帧图像各宏块的运动向量,对当前 帧的运动向量进行统计, 计算相同的运动向量的数量, 根据运动向量的复杂度 判断图像为感兴趣区域或非感兴趣区域,从而对图像进行感兴趣区域和非感兴 趣区域划分。 对感兴趣区域采用较小的量化参数以提高视频的质量, 相应的, 对非感兴趣区域采用较高的量化参数, 以平衡总体的比特消耗不变, 最终达到 提高视频主观质量的效果。 下述各实施例中, x、 i、 j、 m、 n、 v、 h均为自然 数。 In the embodiment of the present invention, the motion vector of each macroblock in each frame of the video sequence is extracted, the motion vector of the current frame is counted, the number of the same motion vector is calculated, and the image is determined as the region of interest according to the complexity of the motion vector. Non-interest regions, thereby dividing the region of interest and the region of non-interest region of the image. Smaller quantization parameters are used for the region of interest to improve the quality of the video. Correspondingly, higher quantization parameters are used for the non-interest region to balance the overall bit consumption and finally reach Improve the subjective quality of video. In each of the following embodiments, x, i, j, m, n, v, and h are all natural numbers.
5101, 提取视频序列中每帧图像各宏块的运动向量。 5101. Extract a motion vector of each macroblock of each frame of the video sequence.
具体方法如下: 对图像的宏块进行空间上连续性的分析, 根据运动预测 SATD (Sum of Absolute Transform Difference) 变换后绝对差值和计算得出 当前宏块的运动向量 (MVX, MVY), 并存储下来。  The specific method is as follows: spatially analyzing the macroblock of the image, and calculating the motion vector (MVX, MVY) of the current macroblock according to the absolute difference and the calculated difference of the Sum of Absolute Transform Difference (STD), and Store it.
5102, 对当前帧的运动向量进行统计, 计算相同的运动向量的数量。 5102: Perform statistics on the motion vector of the current frame, and calculate the number of the same motion vector.
具体方法如下: 将步骤 S101中 X轴方向的运动搜索范围为 [^», Y轴方 向的运动搜索范围为 [_vv)。 设矩阵 M为 x2v的运动向量统计矩阵, 如下式 (2) 所示: The specific method is as follows: The motion search range in the X-axis direction in step S101 is [^», and the motion search range in the Y-axis direction is [ _ v , v ). Let the matrix M be a motion vector statistical matrix of x 2 v, as shown in the following equation (2):
M j = X {MYXm n , MVYm n ) = (ί -h,j- ν) M j = X {MYX mn , MVY mn ) = (ί -h,j- ν)
(2), 其中 (^^ '^^,„)表示第 m行第 n列的宏块的运动向: ,. ,· (2), where (^^ '^^, „) indicates the motion of the macroblock of the mth row and the nth column: , .
表示当前 帧中运动向量 (MV^," ' MVY - )等于 d J _ 的宏块数目。 Indicates the number of macroblocks in the current frame where the motion vector (MV ^,"' MVY - ) is equal to d J _ .
S103,标注并提取当前帧中宏块运动向量相对复杂的区域,划分感兴趣和 非感兴趣区域。 S103, labeling and extracting a relatively complex region of the macroblock motion vector in the current frame, and dividing the region of interest and non-interest.
一般情况下, 每帧图像的背景的运动趋势都是相对近似的, 而前景区域的 运动向量则相对复杂。如图 3a所示为本发明实施例网球比赛中某一帧的图像, 图 3b为标注了图 3a所示图像中各宏块运动向量的图。  In general, the motion trend of the background of each frame of image is relatively similar, while the motion vector of the foreground region is relatively complex. 3a is an image of a frame in a tennis match in an embodiment of the present invention, and FIG. 3b is a diagram showing motion vectors of macroblocks in the image shown in FIG. 3a.
由此可见, 将运动向量相对复杂的区域加以区分, 即可将图像的前景, 即 人们感兴趣的区域提取出来。 具体方法如下: It can be seen that by distinguishing the relatively complex regions of the motion vector, the foreground of the image can be Areas of interest are extracted. The specific method is as follows:
将步骤 S102中的矩阵 M按从小到大的方式排序, 存入数组 M-ra"^中, 如 下式 (3 ) 所示: The matrix M in step S102 is sorted in a small to large manner and stored in the array M - ra "^, as shown in the following equation (3):
M _ count n = ^ i ;. == n M _ count n = ^ i ; . == n
i=0J=0 , (3), 其中 M -^ 的值表示具有 "个相同运动向量的宏块的数目。在该数组中,i = 0J = 0, (3), where the value of M -^ represents the number of macroblocks having "one motion vector. In this array,
"值越少的元素代表着越罕见的宏块运动, 同时亦意味着这些宏块越有可能是 前景。 因此, 当 Μ -""^«的累加值在域值 D以下时, 认为这些宏块为前景, 反 之为背景。 方法原理如下: "The fewer values represent the rarer macroblock motion, and it also means that these macroblocks are more likely to be foreground. Therefore, when the accumulated value of Μ -""^« is below the domain value D, these macros are considered The block is foreground, and vice versa. The principle of the method is as follows:
r , 该区域被认为是前景 r , the area is considered to be the foreground
ntx≤D , 该区域被认为是背景 (4 )
Figure imgf000008_0001
Nt x ≤ D , the area is considered to be the background ( 4 )
Figure imgf000008_0001
其中, 域值 D = w^^/16^ fe/16 ; w^ 指视频序列的宽度; ^'gfe指视 频序列的高度; w^/16^^/16指一帧图像中所含宏块的总量; 为比例系数, 表明在运动向量较为罕见宏块数的与全图总宏块数之比为 此处, 的值设 定为 0. 384, else表示除了 _ χ · Μ— COimtx≤/)之外的其他区域。 Wherein, the domain value D = w^^/16^ fe/16 ; w^ refers to the width of the video sequence; ^'gfe refers to the height of the video sequence; w ^ /16 ^^ /16 refers to the macroblock contained in one frame of the image total; the proportional coefficient, indicating the ratio of the total full FIG macro-block motion vector of rare macroblock number here, the value is set to 0. 384, else except represents _ χ · Μ- COi mt Other areas than x ≤ /).
1  1
为确定上式 (4)中比例系数 的值, 此处引用了黄金分割的法则: 黄金分割 又称黄金律, 是指事物各部分间一定的数学比例关系, 即将整体一分为二, 较大部分与较小部分之比等于整体与较大部分之比, 其比值为 1 : 0. 618 或 0. 618: 0. 384, 即长 段为全段的 0. 618。 0. 618被公认为最具有审美意义的比 例数字。上述比例是最能引起人的美感的比例, 因此被称为黄金分割。根据人 类的视觉特点,将总和为 1的图像全景分割为 0. 384的背景和 0. 618的背景更 能符合人类的审美观。 因此, 我们将 的值设定为 0. 384。  In order to determine the value of the proportional coefficient in the above formula (4), the rule of golden section is quoted here: The golden section, also known as the golden rule, refers to a certain mathematical proportional relationship between the various parts of the thing, that is, the whole is divided into two, larger The ratio of the portion to the smaller portion is equal to the ratio of the whole to the larger portion, and the ratio is 1: 0. 618 or 0. 618: 0. 384, that is, the long segment is 0. 618 of the whole segment. 0. 618 is recognized as the most aesthetically significant proportion. The above ratio is the ratio that most causes people's beauty, so it is called the golden section. According to the visual characteristics of humans, the image with a total of 1 is divided into a background of 0. 384 and a background of 0. 618 is more in line with human aesthetics. Therefore, we set the value to 0.384.
根据上述法则, 能将图 3a和图 3b中的图像划分成前景及背景区域, 图 4 所示为对图 3中的图像划分成前景和背景区域后的图。 According to the above rules, the images in Figures 3a and 3b can be divided into foreground and background regions, Figure 4 The figure after dividing the image in Fig. 3 into foreground and background areas is shown.
S104,对视频序列中每帧图像的前景作进一步判断。  S104, further determining the foreground of each frame of the video sequence.
虽然图 4已将所述帧图像的前景背景做出了初步分离,但图中仍有部分被 误判的区域。 由于视频序列在空间上和时间上都有较强的连续性, 因此在判断 当前宏块是否图像的前景时, 可将空间上和时间上临近的区域也作为考虑因 素, 加强判断的准确性。视频序列空间上区域的连续性考虑如下: 例如当前宏 块被判断为前景, 但与当前宏块接邻的上、 下、 左、 右宏块都被判断为背景, 那当前宏块是前景的可能性就大为降低了; 同理, 若当前宏块被判断为背景, 但与当前宏块接邻的上、 下、 左、 右宏块都被判断为前景, 那当前宏块是背景 的可能性也很低。  Although Figure 4 has made a preliminary separation of the foreground background of the frame image, there are still some areas that are misjudged in the figure. Since the video sequence has strong continuity in space and time, when judging whether the current macroblock is the foreground of the image, the spatially and temporally adjacent regions can also be considered as factors to enhance the accuracy of the judgment. The continuity of the area in the video sequence space is considered as follows: For example, the current macroblock is judged as the foreground, but the upper, lower, left, and right macroblocks adjacent to the current macroblock are judged as the background, and the current macroblock is the foreground. The possibility is greatly reduced; similarly, if the current macroblock is judged as the background, but the upper, lower, left, and right macroblocks adjacent to the current macroblock are judged as foreground, then the current macroblock is background. The possibility is also very low.
视频序列时间上的连续性考虑如下:若当前宏块所在区域在前一帧图像中 为前景, 则当前帧中该区域也是前景的几率就会大大增加; 反之, 若当前宏块 所在区域在前一帧图像中为背景, 则前帧中该区域则更可能是背景而非前景。  The continuity of the video sequence in time is considered as follows: if the area of the current macroblock is foreground in the previous frame image, the probability that the area is also foreground in the current frame is greatly increased; conversely, if the current macroblock is in the front area The background in a frame of image is more likely to be the background than the foreground in the previous frame.
为使对图像前景背景区域判断更准确, 可对步骤 S103中区分出来的区域 进行进一步的筛选、 优化, 使结果更准确。 其具体方法如下:  In order to make the judgment of the foreground background area of the image more accurate, the area distinguished in step S103 can be further filtered and optimized to make the result more accurate. The specific method is as follows:
1) 将步骤 S103中前景背景的判定方法按下述式 (5) 进行优化。 ^x-M _countx≤DIA ,level = 1) The determination method of the foreground background in step S103 is optimized by the following formula (5). ^xM _count x ≤DIA ,level =
1  1
DI <^x-M _countx≤DI2 ,level = 2 (5) DI <^xM _count x ≤DI2 ,level = 2 (5)
1  1
D/2<^x-M _countx≤D ,level=l D/2<^xM _count x ≤ D , level=l
1  1
else evel = 0  Else evel = 0
其中, 表示当前宏块以运动的稀有性来判断得出是前景的可能性。 该值越高表明当前宏块越可能是前景。 2) 根据当前宏块的空间相关性及时间相关性确定当前宏块是否为前景。 设当前宏块的^^值为 ZeveZ' , 结合当前宏块的空间相关性, 得到当前宏 块为前景的空间可能性为 0 ,《, 如下式 (6) 所示。 Among them, it indicates that the current macroblock judges the possibility of foreground by the rarity of motion. The higher the value, the more likely the current macroblock is to be foreground. 2) Determine whether the current macroblock is foreground based on the spatial correlation and temporal correlation of the current macroblock. Let the ^^ value of the current macroblock be ZeveZ ', and combine with the spatial correlation of the current macroblock to obtain the space probability that the current macroblock is foreground, as shown in the following equation (6).
L_forei - = level{ . -\- · (level^ . + leveli+l . + level{ - x + level{ 其中 ^为当前宏块所处的行、 歹 ^, "为当前宏块所在的帧的序号, "为时 间相关性系数, 取值为 [Q1]。 对处于边界区域的宏块, 令临近的宏块超出边界 部分的 值取 0,例:若" <ι, 即当前宏块位于图像最左边,则令 ZeveZ'w=Q 结合当前宏块的时间相关性, 得到当前宏块为前景的综合可能性为 如下式 (7) 所示。
Figure imgf000010_0001
其中 ^为当前宏块所处的行、 歹 "为当前宏块所在的帧的序号, A为时 间相关性系数, 取值为 [Q1]
L_fore i - = level { . -\- · (level^ . + level i+l . + level { - x + level { where ^ is the line where the current macroblock is located, 歹^, "is the current macroblock The serial number of the frame, "is the time correlation coefficient, and the value is [Q , 1] . For the macroblock in the boundary area, let the value of the adjacent macroblock beyond the boundary part take 0, for example: if "<ι, ie current The macroblock is located at the far left of the image, which makes ZeveZ 'w =Q combined with the temporal correlation of the current macroblock. The comprehensive probability that the current macroblock is foreground is as shown in the following equation (7).
Figure imgf000010_0001
Where ^ is the row in which the current macroblock is located, 歹 "is the sequence number of the frame in which the current macroblock is located, and A is the time correlation coefficient, and the value is [Q , 1] .
3) 选择域值, 判定当前宏块是否为前景。  3) Select the field value to determine if the current macroblock is foreground.
方法如下式 (8) 所示:
Figure imgf000010_0002
The method is as shown in the following equation (8):
Figure imgf000010_0002
对式(6) 中 "取 0.6, 对式(7)中 A取.05, 对式(8)中 i^y/^W取 10, 对图 3a和图 3b进行前景区域提取,可得如图 5所示为对图 4中的图像划分区 域进行进一步优化后的图。  For equation (6), take 0.6, for equation (7), take A.05, for equation (8), for i^y/^W, take 10, and for Fig. 3a and 3b, for foreground region extraction, you can get FIG. 5 is a diagram showing further optimization of the image division area in FIG. 4.
本发明实施例通过提取视频序列中每一帧图像各宏块的运动向量,对当前 帧的运动向量进行统计, 计算相同的运动向量的数量,根据运动向量的复杂度 判断图像为感兴趣区域或非感兴趣区域,并对划分区域后的图像根据宏块的空 间相关性及时间相关性进行进一步精确判断,从而对图像进行前景和背景区域 的精确划分。 实施例二 In the embodiment of the present invention, the motion vector of each macroblock in each frame of the video sequence is extracted, the motion vector of the current frame is counted, the number of the same motion vector is calculated, and the image is determined as the region of interest according to the complexity of the motion vector. The non-interest region, and the image after the segmentation is further accurately judged according to the spatial correlation and temporal correlation of the macroblock, thereby accurately dividing the foreground and background regions of the image. Embodiment 2
如图 6所示为本发明实施例的一种图像区域划分装置结构图。所述装置包 括以下部分:  FIG. 6 is a structural diagram of an image area dividing apparatus according to an embodiment of the present invention. The device comprises the following parts:
宏块运动向量提取模块, 用于提取视频序列中每帧图像各宏块的运动向 宏块运动向量统计模块,用于对当前帧的运动向量进行统计,计算相同的 MV的数量;  a macroblock motion vector extraction module, configured to extract a motion-to-macroblock motion vector statistics module of each macroblock of each frame of the video sequence, for performing statistics on the motion vector of the current frame, and calculating the number of the same MV;
宏块运动向量复杂区域处理模块,用于标注并提取当前帧中宏块运动向量 相对复杂的区域,对划分区域后的图像根据宏块的空间相关性及时间相关性进 行进一步精确判断,划分其感兴趣区和非感兴趣区。所述宏块运动向量复杂区 域处理模块包括初步处理模块和优化处理模块,所述初步处理模块用于标注并 提取当前帧中宏块运动向量相对复杂的区域;所述优化处理模块用于根据当前 宏块的空间相关性及时间相关性对分区域后的图像进行进一步精确判断,划分 其感兴趣区和非感兴趣区。  The macroblock motion vector complex region processing module is used for labeling and extracting a relatively complex region of the macroblock motion vector in the current frame, and further correcting the image of the partitioned region according to the spatial correlation and temporal correlation of the macroblock, and dividing the same Regions of interest and areas of non-interest. The macroblock motion vector complex region processing module includes a preliminary processing module and an optimization processing module, and the preliminary processing module is configured to label and extract a relatively complex region of a macroblock motion vector in a current frame; the optimization processing module is configured to use the current The spatial correlation and temporal correlation of the macroblocks further accurately determine the images after the sub-regions, and divide the regions of interest and non-interest regions.
本发明实施例通过提取视频序列中图像的运动向量,根据运动向量的复杂 度判断图像为感兴趣区域或非感兴趣区域,并对划分区域后的图像根据宏块的 空间相关性及时间相关性进行进一步精确判断,从而对图像进行前景和背景区 域精确划分。  In the embodiment of the present invention, the motion vector of the image in the video sequence is extracted, and the image is determined as the region of interest or the non-interest region according to the complexity of the motion vector, and the spatial correlation and temporal correlation of the image after the region are determined according to the macroblock. Make further precise judgments to accurately segment the foreground and background regions of the image.
实施例三  Embodiment 3
本发明实施例还提供一种图像区域编码方法,所述方法包括实施例一的方 法包括的步骤外, 还包括步骤: 根据最终划分出的前景和背景, 进行不同质量 的视频编码,对感兴趣区域采用较小的量化参数编码以提高视频的质量,对非 感兴趣区域采用较高的量化参数编码, 以平衡总体的比特消耗不变。  An embodiment of the present invention further provides an image region encoding method, where the method includes the steps included in the method of the first embodiment, and further includes the steps of: performing video encoding of different qualities according to the final divided foreground and background. The region uses smaller quantization parameter coding to improve the quality of the video, and uses higher quantization parameter coding for the non-interest region to balance the overall bit consumption.
本发明实施例通过提取视频序列中图像的运动向量,根据运动向量的复杂 度判断图像为感兴趣区域或非感兴趣区域,并对划分区域后的图像根据宏块的 空间相关性及时间相关性进行进一步精确判断,从而对图像进行前景和背景区 域精确划分。对感兴趣区域采用较小的量化参数以提高视频的质量,对非感兴 趣区域采用较高的量化参数, 以平衡总体的比特消耗不变, 最终达到提高视频 主观质量的效果。 In the embodiment of the present invention, the motion vector of the image in the video sequence is extracted, and the image is determined as the region of interest or the non-interest region according to the complexity of the motion vector, and the spatial correlation and temporal correlation of the image after the region are determined according to the macroblock. Make further precise judgments to make foreground and background areas of the image The domain is precisely divided. Smaller quantization parameters are used for the region of interest to improve the quality of the video, and higher quantization parameters are used for the non-interest region to balance the overall bit consumption, and finally achieve the effect of improving the subjective quality of the video.
实施例四  Embodiment 4
本发明实施例还提供一种图像区域编码系统, 所述系统包括:  An embodiment of the present invention further provides an image area coding system, where the system includes:
宏块运动向量提取模块, 用于提取视频序列中每帧图像各宏块的运动向 宏块运动向量统计模块, 用于对当前帧的运动向量进行统计, 计算相同的 a macroblock motion vector extraction module, configured to extract a motion-to-macroblock motion vector statistics module of each macroblock of each frame of the video sequence, for performing statistics on the motion vector of the current frame, and calculating the same
MV的数量; The number of MVs;
宏块运动向量复杂区域处理模块,用于标注并提取当前帧中宏块运动向量 相对复杂的区域,对划分区域后的图像根据宏块的空间相关性及时间相关性进 行进一步精确判断,划分其感兴趣区和非感兴趣区。所述宏块运动向量复杂区 域处理模块包括初步处理模块和优化处理模块,所述初步处理模块用于标注并 提取当前帧中宏块运动向量相对复杂的区域;所述优化处理模块用于根据当前 宏块的空间相关性及时间相关性对分区域后的图像进行进一步精确判断,划分 其感兴趣区和非感兴趣区;  The macroblock motion vector complex region processing module is used for labeling and extracting a relatively complex region of the macroblock motion vector in the current frame, and further correcting the image of the partitioned region according to the spatial correlation and temporal correlation of the macroblock, and dividing the same Regions of interest and areas of non-interest. The macroblock motion vector complex region processing module includes a preliminary processing module and an optimization processing module, and the preliminary processing module is configured to label and extract a relatively complex region of a macroblock motion vector in a current frame; the optimization processing module is configured to use the current The spatial correlation and time correlation of the macroblocks further accurately judge the images after the subregions, and divide the regions of interest and non-interest regions;
视频编码模块, 用于根据最终划分出的感兴趣区域和非感兴趣区域,进行 不同质量的视频编码。  The video coding module is configured to perform video coding of different qualities according to the finally divided region of interest and non-interest region.
本发明实施例通过提取视频序列中图像的运动向量,根据运动向量的复杂 度判断图像为感兴趣区域或非感兴趣区域,并对划分区域后的图像根据宏块的 空间相关性及时间相关性进行进一步精确判断,从而对图像进行前景和背景区 域精确划分。对感兴趣区域采用较小的量化参数以提高视频的质量,对非感兴 趣区域采用较高的量化参数, 以平衡总体的比特消耗不变, 最终达到提高视频 主观质量的效果。 本领域的普通技术人员可以理解,实现上述实施例方法中的全部或部分步 骤是可以通过程序指令相关硬件来完成的,所述的程序可以存储于一计算机可 读取存储介质中, 所述的存储介质可以为 R0M、 RAM, 磁盘、 光盘等。 以上所述仅为本发明的较佳实施例而已, 并不用以限制本发明, 凡在本发 明的精神和原则之内所作的任何修改、等同替换和改进等,均应包含在本发明 的保护范围之内。 In the embodiment of the present invention, the motion vector of the image in the video sequence is extracted, and the image is determined as the region of interest or the non-interest region according to the complexity of the motion vector, and the spatial correlation and temporal correlation of the image after the region are determined according to the macroblock. Make further precise judgments to accurately segment the foreground and background regions of the image. Smaller quantization parameters are used for the region of interest to improve the quality of the video, and higher quantization parameters are used for the non-interest region to balance the overall bit consumption, and finally achieve the effect of improving the subjective quality of the video. One of ordinary skill in the art will appreciate that all or part of the steps of the above embodiments are implemented. The program can be completed by a program instruction related hardware, and the program can be stored in a computer readable storage medium, and the storage medium can be a ROM, a RAM, a magnetic disk, an optical disk, or the like. The above is only the preferred embodiment of the present invention, and is not intended to limit the present invention. Any modifications, equivalent substitutions and improvements made within the spirit and principles of the present invention should be included in the protection of the present invention. Within the scope.

Claims

权 利 要 求 书 Claims
提取视频序列中每帧图像各宏块的运动向量; Extracting a motion vector of each macroblock of each frame of the video sequence;
对当前帧的运动向量进行统计, 计算相同的运动向量的数量;  Counting the motion vectors of the current frame, and calculating the number of the same motion vectors;
标注并提取当前帧中宏块运动向量相对复杂的区域,划分感兴趣和非感兴 趣区域。  Label and extract the relatively complex areas of the macroblock motion vector in the current frame, and divide the interesting and non-interesting areas.
2、 如权利要求 1所述的图像区域划分方法, 其特征在于, 所述方法还包 括以下步骤:  2. The image region dividing method according to claim 1, wherein the method further comprises the following steps:
对划分感兴趣和非感兴趣区域后的图像根据宏块空间和时间的相关性进 行进一步判断。  The image after dividing the region of interest and non-interest is further judged based on the correlation of the macroblock space and time.
3、 如权利要求 1所述的图像区域划分方法, 其特征在于, 所述提取视频 序列中每帧图像各宏块的运动向量包括:  The image region dividing method according to claim 1, wherein the extracting the motion vector of each macroblock of each frame image in the video sequence comprises:
对图像的宏块进行空间上连续性的分析,根据运动预测变换后绝对差值和 计算得出当前宏块的运动向量 (MVX, MVY) , 并存储。  The spatial continuity of the macroblock of the image is analyzed, and the motion vector (MVX, MVY) of the current macroblock is calculated according to the motion prediction transformed absolute difference and calculated.
4、 如权利要求 1所述的图像区域划分方法, 其特征在于, 所述对当前帧 的运动向量进行统计, 计算相同的运动向量的数量包括: The image region dividing method according to claim 1, wherein the calculating the motion vector of the current frame, and calculating the number of the same motion vector comprises:
将所述宏块的运动向量中 X轴方向的运动搜索范围为卜 ^), Y轴方向的 运动搜索范围为[_^ , 设矩阵 M为 2/z x 2v的运动向量统计矩阵:
Figure imgf000014_0001
The motion search range in the X-axis direction of the motion vector of the macroblock is b), the motion search range in the Y-axis direction is [_ ^, and the motion vector statistical matrix of the matrix M is 2/zx 2 v:
Figure imgf000014_0001
(2), 其中, 上述公式 (2 ) 中的 (MVX 表示第 m行第 n列的宏块的 运动向量, 表示当前帧中运动向量 (MVX 等于 ^ v)的宏块 数目, 所述 i、 j、 m、 n、 v、 h均为自然数。 (2), wherein, in the above formula (2), ( MVX represents a motion vector of a macroblock of the mth row and the nth column, and represents a number of macroblocks of a motion vector ( MVX equal to ^v) in the current frame, the i, j, m, n, v, h are all natural numbers.
5、 如权利要求 1所述的图像区域划分方法, 其特征在于, 所述标注并提 取当前帧中宏块运动向量相对复杂的区域, 划分感兴趣和非感兴趣区域包括: 将所述矩阵 M按从小到大的方式排序,存入下述公式(3)的数组 The image region dividing method according to claim 1, wherein the labeling and extracting a relatively complex region of a macroblock motion vector in a current frame, and dividing the region of interest and non-interest region comprises: Sort by small to large, and store the array of the following formula (3)
中: Medium:
i=2h-l =2v-l  i=2h-l =2v-l
M count" = V . , == η  M count" = V . , == η
i=0' 0 (3), 其中, 上述公式 (3) 中的 M-raiWi"的值表示具有"个相同运动向量的宏 块的数目, "值越少的元素代表着越罕见的宏块运动, 其为前景的可能性就越 大,当 M_a^t„的累加值在域值0以下时,认为这些宏块为前景,反之为背景; 方法原理如下公式 (4): i = 0 ' 0 (3), where the value of M - raiWi " in the above formula (3) represents the number of macroblocks having "one motion vector," the less the value represents the rarer macroblock Motion, the more likely it is for the foreground, when the accumulated value of M_a^t„ is below the domain value of 0 , the macroblocks are considered to be foreground, and vice versa; the principle of the method is as follows: (4):
}x-M_countx≤D , 该区域被认为是前景 }x-M_count x ≤D , the area is considered to be the foreground
.1 else , 该区域被认为是背景 (4), 其中, 域值 D =
Figure imgf000015_0001
指视频序列的宽度; 指 视频序列的高度; ^^^16'^^^16指一帧图像中所含宏块的总量; 为比例 系数, 表明在运动向量较为罕见宏块数的与全图总宏块数之比为 , 此处, k 的值设定为 0.384; 所述 else表示除了 1χ·Μ— coimtx≤Ζ)之外的其他区域,所
1 else , the region is considered to be the background ( 4 ), where the domain value D =
Figure imgf000015_0001
Refers to the width of the video sequence; refers to the height of the video sequence; ^^^ 16 '^^^ 16 refers to the total number of macroblocks contained in a frame of image; is the proportional coefficient, indicating that the number of macroblocks in the motion vector is relatively rare The ratio of the total number of macroblocks in the graph is, where the value of k is set to 0.384; the else represents a region other than 1χ·Μ—coimt x ≤Ζ,
1  1
述 x、 i、 j、 n、 v、 h均为自然数。 x, i, j, n, v, h are all natural numbers.
6、 如权利要求 5所述的图像区域划分方法, 其特征在于, 所述对划分感 兴趣和非感兴趣区域后的图像根据宏块空间和时间的相关性进行进一步判断 还包括以下步骤:  The image region dividing method according to claim 5, wherein the further determining the correlation between the partitioning interest and the non-interest region image according to the correlation between the macroblock space and the time further comprises the following steps:
A)将所述公式 (4) 中前景背景的判定方法采用下述公式 (5) 进行优化, jx-M _countx <DI ,level=4 A) The method for determining the foreground background in the formula (4) is optimized by the following formula (5), jx-M _count x <DI ,level=4
1  1
D/4<jx-M_countx≤D/2 , level = 2 D/4<jx-M_count x ≤D/2 , level = 2
D/2< j x- M _countx≤D , level = 1 D/2< j x- M _count x ≤ D , level = 1
其它区域 'level = 0 其中, 表示当前宏块以运动的稀有性来判断得出是前景的可能性, 该值越高表明当前宏块越可能是前景;  The other area 'level = 0, which indicates that the current macroblock is determined by the rarity of motion as the foreground. The higher the value indicates that the current macroblock is more likely to be the foreground;
B)根据当前宏块的空间相关性及时间相关性确定当前宏块是否为前景, 设当前宏块的^^值为 Zev ", 结合当前宏块的空间相关性, 得到当前宏 块为前景的空间可能性为 - ,^, 用如下公式 (6) 表示, B) determining whether the current macroblock is foreground according to the spatial correlation and temporal correlation of the current macroblock, and setting the ^^ value of the current macroblock to Zev ", combining the spatial correlation of the current macroblock to obtain the current macroblock as foreground The space possibility is - , ^, expressed by the following formula (6),
L_ forei n - lev el + ί· (level^ + leveli+l + lev el t + lev el t j+l ) ( g ) 其中 为当前宏块所处的行、 歹 ^, "为当前宏块所在的帧的序号, "为时 间相关性系数, 取值为 [Q1], 对处于边界区域的宏块, 令临近的宏块超出边界 部分的^^值取 0, L_ fore in - lev el + ί· (level^ + level i+l + lev el t + lev el t j+l ) ( g ) where is the row of the current macroblock, 歹^, "is the current macroblock The sequence number of the frame in which it is located, "is the time correlation coefficient, and takes the value [Q , 1] . For the macroblock in the boundary region, the value of the ^^ of the adjacent macroblock beyond the boundary portion is 0.
结合当前宏块的时间相关性, 得到当前宏块为前景的综合可能性为 Combined with the temporal correlation of the current macroblock, the comprehensive probability that the current macroblock is foreground is
Lv Lv
11 , 用如下公式 (7) 表示, Small 11 , expressed by the following formula (7),
Lvi ,n =L- f°rei ,n + λ■L_foreijn_l Lv i , n = L - f° re i , n + λ ■ L_fore ijn _ l
(7), 其中 Α为时间相关性系数, 取值为 [Q1]; (7), where Α is the time correlation coefficient and the value is [Q , 1] ;
C)选择域值, 判定当前宏块是否为前景, 判断方法如下:  C) Select the domain value and determine whether the current macroblock is foreground. The judgment method is as follows:
Lvi j n > threshold ,该宏块为前景 Lv ijn > threshold , the macroblock is the foreground
(8),  (8),
其它区域 该宏块为背景  Other areas, the macro block is the background
对式(6) 中"取 0.6, 对式(8)中 A取.05, 对式(9)中 ^?/^/6?取 10, 对图像前景区域提取, 对所述图像进行进一步优化划分; For equation (6), "take 0.6, for equation (8), take A.05, and for equation (9), ^?/^/6? take 10 , extract the foreground region of the image, and further optimize the image. Division
所述 i、 j、 m、 n、 v、 h均为自然数。 The i, j, m, n, v, h are all natural numbers.
7、 一种图像区域划分装置, 其特征在于, 所述装置包括: 7. An image area dividing apparatus, wherein the apparatus comprises:
宏块运动向量提取模块, 用于提取视频序列中每帧图像各宏块的运动向 宏块运动向量统计模块, 用于对当前帧的运动向量进行统计, 计算相同的 a macroblock motion vector extraction module, configured to extract a motion-to-macroblock motion vector statistics module of each macroblock of each frame of the video sequence, for performing statistics on the motion vector of the current frame, and calculating the same
MV的数量; The number of MVs;
宏块运动向量复杂区域处理模块,用于标注并提取当前帧中宏块运动向量 相对复杂的区域,对划分区域后的图像根据宏块的空间相关性及时间相关性进 行进一步精确判断, 划分其感兴趣区和非感兴趣区。  The macroblock motion vector complex region processing module is used for labeling and extracting a relatively complex region of the macroblock motion vector in the current frame, and further correcting the image after the region according to the spatial correlation and time correlation of the macroblock, and dividing the same Regions of interest and areas of non-interest.
8、 如权利要求 7所述的图像区域划分装置, 其特征在于, 所述宏块运动 向量复杂区域处理模块包括: 初步处理模块, 用于标注并提取当前帧中宏块运动向量相对复杂的区域; 优化处理模块,用于根据当前宏块的空间相关性及时间相关性对分区域后 的图像进行进一步精确判断, 划分其感兴趣区和非感兴趣区。  The image region dividing device according to claim 7, wherein the macroblock motion vector complex region processing module comprises: a preliminary processing module, configured to label and extract a relatively complex region of a macroblock motion vector in a current frame. The optimization processing module is configured to further accurately determine the image after the sub-region according to the spatial correlation and temporal correlation of the current macroblock, and divide the region of interest and the non-region of interest.
9、 一种图像区域编码方法, 其特征在于, 所述方法包括以下步骤: 根据图像宏块的运动向量的复杂度判断划分感兴趣区域或非感兴趣区域, 运动向量复杂度高的区域为感兴趣区域,运动向量复杂度低的区域为非感兴趣 区域;  9. An image region encoding method, the method comprising the steps of: determining, according to the complexity of a motion vector of an image macroblock, a region of interest or a region of non-interest, a region with a high complexity of motion vectors The region of interest, where the motion vector complexity is low, is a non-interest region;
对于图像感兴趣区域降低编码量化参数以提高该区域的图像质量,对图像 非感兴趣区域则提高编码量化参数以保持整体的编码比特不变。  The coding quantization parameter is reduced for the image region of interest to improve the image quality of the region, and the coding quantization parameter is increased for the non-interest region of the image to keep the overall coding bits unchanged.
10、 一种图像区域编码系统, 其特征在于, 所述系统包括: 10. An image area coding system, wherein the system comprises:
如权利要求 7所述的图像区域划分装置;  The image area dividing device according to claim 7;
视频编码模块,用于根据所述图像区域划分装置划分出的感兴趣区域和非 感兴趣区域, 进行不同质量的视频编码。  And a video encoding module, configured to perform video encoding of different qualities according to the region of interest and the region of non-interest that are divided by the image region dividing device.
PCT/CN2011/072787 2010-06-07 2011-04-14 Method, device and system for partition/encoding image region WO2011153869A1 (en)

Applications Claiming Priority (2)

Application Number Priority Date Filing Date Title
CN201010193795.4 2010-06-07
CN2010101937954A CN101882316A (en) 2010-06-07 2010-06-07 Method, device and system for regional division/coding of image

Publications (1)

Publication Number Publication Date
WO2011153869A1 true WO2011153869A1 (en) 2011-12-15

Family

ID=43054327

Family Applications (1)

Application Number Title Priority Date Filing Date
PCT/CN2011/072787 WO2011153869A1 (en) 2010-06-07 2011-04-14 Method, device and system for partition/encoding image region

Country Status (2)

Country Link
CN (1) CN101882316A (en)
WO (1) WO2011153869A1 (en)

Cited By (1)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN112292864A (en) * 2018-06-14 2021-01-29 瑞典爱立信有限公司 Tile selection and bandwidth optimization for providing 360 degree immersive video

Families Citing this family (26)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN101882316A (en) * 2010-06-07 2010-11-10 深圳市融创天下科技发展有限公司 Method, device and system for regional division/coding of image
CN101945281B (en) * 2010-09-10 2014-09-10 中兴通讯股份有限公司 Method and device for filtering video codes
CN102036074A (en) * 2010-12-21 2011-04-27 汉王科技股份有限公司 Embedded equipment-based code rate control method and device
EP2472868A1 (en) * 2010-12-29 2012-07-04 British Telecommunications Public Limited Company Video coding
CN102129682B (en) * 2011-03-09 2015-10-07 深圳市云宙多媒体技术有限公司 A kind of prospect background region partitioning method, system
TWI519131B (en) * 2012-06-19 2016-01-21 廣達電腦股份有限公司 Video transmission system and transmitting device and receiving device thereof
CN103002280B (en) * 2012-10-08 2016-09-28 中国矿业大学 Distributed decoding method based on HVS&ROI and system
EP2965288A4 (en) * 2013-03-08 2016-07-27 Intel Corp Techniques for image encoding based on region of interest
CN104427337B (en) * 2013-08-21 2018-03-27 杭州海康威视数字技术股份有限公司 Interested area video coding method and its device based on target detection
CN103561261B (en) * 2013-10-12 2016-10-26 重庆邮电大学 The panoramic locatable video coded method that view-based access control model notes
CN103702118A (en) * 2014-01-02 2014-04-02 华为技术有限公司 Image processing method and equipment
CN104980740A (en) * 2014-04-08 2015-10-14 富士通株式会社 Image processing method, image processing device and electronic equipment
CN104079934B (en) * 2014-07-14 2017-02-15 武汉大学 Method for extracting regions of interest in real-time video communication
CN106162177B (en) * 2016-07-08 2018-11-09 腾讯科技(深圳)有限公司 Method for video coding and device
CN106303333B (en) * 2016-08-30 2019-01-04 百味迹忆(厦门)网络科技有限公司 Audio-video recording method and device
CN107369165A (en) * 2017-07-10 2017-11-21 Tcl移动通信科技(宁波)有限公司 A kind of video selection picture optimization method and storage medium, intelligent terminal
US10609398B2 (en) * 2017-07-28 2020-03-31 Black Sesame International Holding Limited Ultra-low bitrate coding based on 3D map reconstruction and decimated sub-pictures
JP6961443B2 (en) * 2017-09-29 2021-11-05 キヤノン株式会社 Image processing equipment, image processing methods, and programs
CN109862356B (en) * 2019-01-17 2020-11-10 中国科学院计算技术研究所 Video coding method and system based on region of interest
CN110602495A (en) * 2019-08-20 2019-12-20 深圳市盛世生物医疗科技有限公司 Medical image coding method and device
CN110677670A (en) * 2019-08-21 2020-01-10 咪咕视讯科技有限公司 Image compression method, image decompression method, image compression device, image decompression device, electronic equipment and image decompression system
CN110769254B (en) * 2019-10-10 2022-04-22 网宿科技股份有限公司 Code rate configuration method, system and equipment for video frame
CN110689014B (en) * 2019-10-14 2022-02-11 腾讯科技(深圳)有限公司 Method and device for detecting region of interest, electronic equipment and readable storage medium
CN111131825A (en) * 2019-10-28 2020-05-08 华为技术有限公司 Video processing method and related device
US11164339B2 (en) * 2019-11-12 2021-11-02 Sony Interactive Entertainment Inc. Fast region of interest coding using multi-segment temporal resampling
CN111866504B (en) * 2020-07-17 2022-06-21 Oppo广东移动通信有限公司 Encoding method, encoder and computer readable storage medium

Citations (5)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
JP2003032688A (en) * 2001-07-18 2003-01-31 Nippon Telegr & Teleph Corp <Ntt> Separation method of foreground and background regions for moving image, and moving image coding method by conditional pixel replenishment by using this method
US20070025444A1 (en) * 2005-07-28 2007-02-01 Shigeyuki Okada Coding Method
CN101436301A (en) * 2008-12-04 2009-05-20 上海大学 Method for detecting characteristic movement region of video encode
CN101534444A (en) * 2009-04-20 2009-09-16 杭州华三通信技术有限公司 Image processing method, system and device
CN101882316A (en) * 2010-06-07 2010-11-10 深圳市融创天下科技发展有限公司 Method, device and system for regional division/coding of image

Family Cites Families (4)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
JP4596201B2 (en) * 2001-02-01 2010-12-08 ソニー株式会社 Image processing apparatus and method, and recording medium
JP4375452B2 (en) * 2007-07-18 2009-12-02 ソニー株式会社 Image processing apparatus, image processing method, program, and display apparatus
CN101102495B (en) * 2007-07-26 2010-04-07 武汉大学 A video image decoding and encoding method and device based on area
JP2009077382A (en) * 2007-08-27 2009-04-09 Sony Corp Image processing apparatus, method thereof, and program

Patent Citations (5)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
JP2003032688A (en) * 2001-07-18 2003-01-31 Nippon Telegr & Teleph Corp <Ntt> Separation method of foreground and background regions for moving image, and moving image coding method by conditional pixel replenishment by using this method
US20070025444A1 (en) * 2005-07-28 2007-02-01 Shigeyuki Okada Coding Method
CN101436301A (en) * 2008-12-04 2009-05-20 上海大学 Method for detecting characteristic movement region of video encode
CN101534444A (en) * 2009-04-20 2009-09-16 杭州华三通信技术有限公司 Image processing method, system and device
CN101882316A (en) * 2010-06-07 2010-11-10 深圳市融创天下科技发展有限公司 Method, device and system for regional division/coding of image

Cited By (2)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN112292864A (en) * 2018-06-14 2021-01-29 瑞典爱立信有限公司 Tile selection and bandwidth optimization for providing 360 degree immersive video
CN112292864B (en) * 2018-06-14 2023-03-21 瑞典爱立信有限公司 Tile selection and bandwidth optimization for providing 360 degree immersive video

Also Published As

Publication number Publication date
CN101882316A (en) 2010-11-10

Similar Documents

Publication Publication Date Title
WO2011153869A1 (en) Method, device and system for partition/encoding image region
US8285045B2 (en) Image analysis method, medium and apparatus and moving image segmentation system
JP5555221B2 (en) Method and apparatus for adaptive color model parameter estimation of object of interest
KR100670003B1 (en) The apparatus for detecting the homogeneous region in the image using the adaptive threshold value
KR101761928B1 (en) Blur measurement in a block-based compressed image
US11093788B2 (en) Scene change detection
JP2000217117A (en) Processing method for digital image expression video data in compression form
JP2009512246A (en) Method and apparatus for determining shot type of an image
KR20120114263A (en) Object-aware video encoding strategies
JP5285159B2 (en) Block noise and fidelity in watermarking
CN101352029A (en) Randomly sub-sampled partition voting(RSVP) algorithm for scene change detection
Zhang et al. Low-complexity and high-efficiency background modeling for surveillance video coding
GB2459671A (en) Scene Change Detection For Use With Bit-Rate Control Of A Video Compression System
Feng et al. Saliency based objective quality assessment of decoded video affected by packet losses
CN109194955B (en) Scene switching detection method and system
CN106664404A (en) Block segmentation mode processing method in video coding and relevant apparatus
US8611423B2 (en) Determination of optimal frame types in video encoding
Uddin et al. Double compression detection in HEVC-coded video with the same coding parameters using picture partitioning information
JP2009212605A (en) Information processing method, information processor, and program
JP2007512750A (en) Detection of local image space-temporal details in video signals
KR100963701B1 (en) Video identification device
CN105631419B (en) Face identification method and device
CN104469361B (en) A kind of video with Motion Adaptive deletes frame evidence collecting method
Tu et al. Video saliency detection incorporating temporal information in compressed domain
KR101358576B1 (en) Video transcoding optimization method using correlation of subjective and objective video quality assessment

Legal Events

Date Code Title Description
121 Ep: the epo has been informed by wipo that ep was designated in this application

Ref document number: 11791860

Country of ref document: EP

Kind code of ref document: A1

NENP Non-entry into the national phase

Ref country code: DE

122 Ep: pct application non-entry in european phase

Ref document number: 11791860

Country of ref document: EP

Kind code of ref document: A1