CN115514964A - HEVC compression noise standard deviation estimation method based on convolutional neural network - Google Patents

HEVC compression noise standard deviation estimation method based on convolutional neural network Download PDF

Info

Publication number
CN115514964A
CN115514964A CN202110699673.0A CN202110699673A CN115514964A CN 115514964 A CN115514964 A CN 115514964A CN 202110699673 A CN202110699673 A CN 202110699673A CN 115514964 A CN115514964 A CN 115514964A
Authority
CN
China
Prior art keywords
standard deviation
hevc
block
noise standard
video
Prior art date
Legal status (The legal status is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the status listed.)
Pending
Application number
CN202110699673.0A
Other languages
Chinese (zh)
Inventor
何小海
孙伟恒
熊淑华
徐艺菲
陈洪刚
滕奇志
任超
Current Assignee (The listed assignees may be inaccurate. Google has not performed a legal analysis and makes no representation or warranty as to the accuracy of the list.)
Sichuan University
Original Assignee
Sichuan University
Priority date (The priority date is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the date listed.)
Filing date
Publication date
Application filed by Sichuan University filed Critical Sichuan University
Priority to CN202110699673.0A priority Critical patent/CN115514964A/en
Publication of CN115514964A publication Critical patent/CN115514964A/en
Pending legal-status Critical Current

Links

Images

Classifications

    • HELECTRICITY
    • H04ELECTRIC COMMUNICATION TECHNIQUE
    • H04NPICTORIAL COMMUNICATION, e.g. TELEVISION
    • H04N19/00Methods or arrangements for coding, decoding, compressing or decompressing digital video signals
    • H04N19/10Methods or arrangements for coding, decoding, compressing or decompressing digital video signals using adaptive coding
    • H04N19/169Methods or arrangements for coding, decoding, compressing or decompressing digital video signals using adaptive coding characterised by the coding unit, i.e. the structural portion or semantic portion of the video signal being the object or the subject of the adaptive coding
    • H04N19/17Methods or arrangements for coding, decoding, compressing or decompressing digital video signals using adaptive coding characterised by the coding unit, i.e. the structural portion or semantic portion of the video signal being the object or the subject of the adaptive coding the unit being an image region, e.g. an object
    • H04N19/176Methods or arrangements for coding, decoding, compressing or decompressing digital video signals using adaptive coding characterised by the coding unit, i.e. the structural portion or semantic portion of the video signal being the object or the subject of the adaptive coding the unit being an image region, e.g. an object the region being a block, e.g. a macroblock
    • HELECTRICITY
    • H04ELECTRIC COMMUNICATION TECHNIQUE
    • H04NPICTORIAL COMMUNICATION, e.g. TELEVISION
    • H04N19/00Methods or arrangements for coding, decoding, compressing or decompressing digital video signals
    • H04N19/42Methods or arrangements for coding, decoding, compressing or decompressing digital video signals characterised by implementation details or hardware specially adapted for video compression or decompression, e.g. dedicated software implementation
    • HELECTRICITY
    • H04ELECTRIC COMMUNICATION TECHNIQUE
    • H04NPICTORIAL COMMUNICATION, e.g. TELEVISION
    • H04N19/00Methods or arrangements for coding, decoding, compressing or decompressing digital video signals
    • H04N19/60Methods or arrangements for coding, decoding, compressing or decompressing digital video signals using transform coding
    • H04N19/61Methods or arrangements for coding, decoding, compressing or decompressing digital video signals using transform coding in combination with predictive coding
    • HELECTRICITY
    • H04ELECTRIC COMMUNICATION TECHNIQUE
    • H04NPICTORIAL COMMUNICATION, e.g. TELEVISION
    • H04N19/00Methods or arrangements for coding, decoding, compressing or decompressing digital video signals
    • H04N19/90Methods or arrangements for coding, decoding, compressing or decompressing digital video signals using coding techniques not provided for in groups H04N19/10-H04N19/85, e.g. fractals
    • H04N19/96Tree coding, e.g. quad-tree coding

Abstract

The invention discloses an HEVC compression noise standard deviation estimation method based on a convolutional neural network, which mainly comprises the following steps of: the method comprises the steps of dividing a video frame subjected to HEVC coding into blocks, extracting blocking information from a code stream to determine whether a current block belongs to a high texture block or a low texture block, obtaining a real noise standard deviation of a small block by subtracting an original video and a compressed video, and using the real noise standard deviation as a label of the small block to construct a data set. The invention constructs a compression noise estimation network, and trains high and low texture data sets respectively to obtain a trained model. In the testing stage, videos are tested according to HEVC standards with different resolutions, and the standard deviation level of compression noise of the videos is estimated by taking a block as a unit under different QPs and compared with real noise. Experimental results show that the method can effectively estimate the noise standard deviation of the compressed video.

Description

HEVC compression noise standard deviation estimation method based on convolutional neural network
Technical Field
The invention relates to a convolutional neural network and a video coding algorithm, in particular to an HEVC compression noise standard deviation estimation method based on the convolutional neural network, and belongs to the field of image communication.
Background
In recent years, video applications have grown rapidly, but at the same time, challenges have been presented to the transmission and storage of large amounts of video data. In this context, the International Telecommunications Union (ITU) has promulgated a High Efficiency Video Coding standard (HEVC/h.265) in cooperation with the International Organization for standardization (ISO). Compared with advanced video Coding (AVC/h.264), the HEVC standard can save about 50% of code rate under the same Coding quality. Inevitably, however, HEVC compressed video also contains many compression effects, especially in low bitrate segments, and the presence of these compression effects seriously affects the viewing experience of the user. The HEVC standard employs a block-based hybrid coding strategy, and after the predictive coding is finished, the prediction residual is transformed and quantized in units of blocks. The pixels at the block boundary are continuous, and the inconsistency of the processing parameters of the pixels distributed into different blocks causes the occurrence of blocking effect in a compressed video; quantization rounding and high-frequency zero setting cause the compressed video to lose a lot of detail information; inaccuracies in motion estimation and motion compensation further introduce compression noise. Many scholars refer to the following: accurate evaluation of the compression noise level can help the subsequent decompression effect algorithm, and setting an incorrect noise level may reduce the effect of denoising. The distribution of the compression noise is related to the compression process, so that the modeling difficulty is high, the mainstream research still regards the compression noise as additive gaussian noise at present, and the estimation of the compression noise is the estimation of the standard deviation of the gaussian noise. Convolutional neural networks have achieved significant success in the field of computer vision and image processing. Compared with the traditional image prior constructed by summarizing the image/video signal distribution, the convolutional neural network can learn the deep image characteristics in the image/video, which is also an important reason for obtaining excellent effects on various image/video processing tasks. Inspired by great success of a classification network based on CNN, the invention takes HEVC compression noise standard deviation estimation as a classification task for images of different styles.
Disclosure of Invention
The invention aims to estimate the standard deviation of HEVC compressed video compression noise.
The invention provides an HEVC compression noise standard deviation estimation method based on a convolutional neural network, which mainly comprises the following operation steps of:
(1) A training data set is constructed.
We performed 32 frame coding of 200 training videos in the data set LDV at QP =22, 27, 32, 37, the coding configuration being LDP. Then, the picture is divided into 64 × 64 small blocks, the difference is made between the small blocks and the original video to obtain a residual block of each image block, the standard deviation of the residual block is calculated, and the picture is classified according to the standard deviation, as shown in fig. 1. In general, pictures with low level of compression noise contain less texture detail, and pictures with high level of compression noise contain more texture. But we have also found special cases where some pictures with high noise levels also contain less texture, because the entire picture becomes smoother due to the coarse quantization of HEVC and the high frequency quality of the transform coefficients. For the image blocks which have less texture, the compressed image blocks represent the smooth blurring and have less compression noise, while the image blocks which have rich dense texture represent the smooth blurring and have larger compression noise. To solve this problem, we extract the blocking information from the encoded bitstream to determine whether a compressed smooth block is a high texture block or a low texture block before compression. As shown in fig. 2, when HEVC is compressed, quad-tree-based block division is adopted, dense block division is performed on a region with complex texture, sparse block division is performed on a region with simple texture, CU block information of HEVC is extracted from a code stream, an image is divided into a high texture block and a low texture block according to the depth of the CU block, the high texture block is determined when the depth is greater than 2, and the low texture block is determined when the depth is less than or equal to 2. A data set is constructed for each of the high texture blocks and the low texture blocks. The high texture blocks are divided into 15 classes, the standard deviation difference of each class is 1, from 0, the value width of each class is 0.5, if the standard deviation value interval of the first class is [0,0.5], the standard deviation value interval of the second class is [1,1.5], and the value space of the later class is analogized according to the form. For the low texture blocks, the low texture blocks are divided into 10 classes, the standard deviation difference of each class is 1, and the value width of each class is 0.7 from 0.
(2) And constructing a compression noise estimation network based on the convolutional neural network.
We propose an HEVC compression noise classification network whose network backbone is the classical classification network DENSE-NET, as shown in fig. 3, we make some modifications to the network to meet our task requirements. Since our block edge length is 64, we add a transposed convolutional layer before the first Dense block to further increase the field of view to enlarge the resolution of the feature map, and use 3 Dense blocks to achieve the balance of complexity and performance.
(3) And (3) carrying out network model training according to the data set in (1) and the network proposed in (2), and estimating the compression noise standard deviation for the HEVC compression test video block by using the trained model.
Drawings
Fig. 1 is a schematic diagram of classifying pictures based on different standard deviations of compression noise according to the present invention.
Fig. 2 is a high and low texture block distribution diagram of the present invention.
Fig. 3 is a block diagram of an HEVC compression noise standard deviation estimation network according to the present invention.
Detailed Description
In order to better illustrate the effectiveness of the present invention, videos with different resolution sizes in a standard HEVC test video are selected, noise estimation is performed on different QPs, and table one shows the difference between the estimation result of the present invention and the real noise. From the table, it can be seen that the present invention can obtain a more accurate estimation result.
Table one standard deviation difference between the invention and true noise
Figure BDA0003129298050000031

Claims (1)

1. An HEVC compression noise standard deviation estimation method based on a convolutional neural network is characterized by comprising the following steps of:
the method comprises the following steps: firstly, constructing a training data set, partitioning a video compressed by HEVC, extracting partitioning information from code stream information, dividing small blocks into high texture blocks and low texture blocks, calculating a real noise standard deviation by calculating a difference value between an original video and the compressed video, taking the real noise standard deviation as a label of each small block, and finally constructing two training data sets;
step two: constructing a network, namely constructing the network on the basis of DENSE-NET, adopting 3 Dense blocks, and increasing a transposed convolution layer to enlarge the resolution of a characteristic diagram;
step three: and (3) training the data set obtained in the step one through the network provided in the step two to obtain two models aiming at the high and low texture blocks, wherein the obtained models can effectively estimate the value of the standard deviation of the HEVC compressed video.
CN202110699673.0A 2021-06-23 2021-06-23 HEVC compression noise standard deviation estimation method based on convolutional neural network Pending CN115514964A (en)

Priority Applications (1)

Application Number Priority Date Filing Date Title
CN202110699673.0A CN115514964A (en) 2021-06-23 2021-06-23 HEVC compression noise standard deviation estimation method based on convolutional neural network

Applications Claiming Priority (1)

Application Number Priority Date Filing Date Title
CN202110699673.0A CN115514964A (en) 2021-06-23 2021-06-23 HEVC compression noise standard deviation estimation method based on convolutional neural network

Publications (1)

Publication Number Publication Date
CN115514964A true CN115514964A (en) 2022-12-23

Family

ID=84499159

Family Applications (1)

Application Number Title Priority Date Filing Date
CN202110699673.0A Pending CN115514964A (en) 2021-06-23 2021-06-23 HEVC compression noise standard deviation estimation method based on convolutional neural network

Country Status (1)

Country Link
CN (1) CN115514964A (en)

Citations (2)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN107133948A (en) * 2017-05-09 2017-09-05 电子科技大学 Image blurring and noise evaluating method based on multitask convolutional neural networks
CN109658348A (en) * 2018-11-16 2019-04-19 天津大学 The estimation of joint noise and image de-noising method based on deep learning

Patent Citations (2)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN107133948A (en) * 2017-05-09 2017-09-05 电子科技大学 Image blurring and noise evaluating method based on multitask convolutional neural networks
CN109658348A (en) * 2018-11-16 2019-04-19 天津大学 The estimation of joint noise and image de-noising method based on deep learning

Non-Patent Citations (2)

* Cited by examiner, † Cited by third party
Title
于海雯;易昕炜;徐少平;刘婷云;李崇禧;: "一种基于卷积神经网络的快速噪声水平估计算法", 南昌大学学报(理科版), no. 05, 25 October 2019 (2019-10-25) *
肖进胜;朱力;赵博强;雷俊锋;王莉;: "基于主成分分析的分块视频噪声估计", 自动化学报, no. 09, 11 December 2017 (2017-12-11) *

Similar Documents

Publication Publication Date Title
Zhang et al. Low-rank-based nonlocal adaptive loop filter for high-efficiency video compression
Balle et al. Models for static and dynamic texture synthesis in image and video compression
CN110944200B (en) Method for evaluating immersive video transcoding scheme
WO2005055618A1 (en) Method and system for video quality measurements
CN110852964A (en) Image bit enhancement method based on deep learning
CN105933711B (en) Neighborhood optimum probability video steganalysis method and system based on segmentation
US9268791B2 (en) Method and apparatus for image processing and computer readable medium
WO2017004889A1 (en) Jnd factor-based super-pixel gaussian filter pre-processing method
CN107295217B (en) Video noise estimation method based on principal component analysis
CN112584153B (en) Video compression method and device based on just noticeable distortion model
CN111726638A (en) HEVC (high efficiency video coding) optimization method combining decompression effect and super-resolution
CN115442613A (en) Interframe information-based noise removal method using GAN
CN102523454B (en) Method for utilizing 3D (three dimensional) dictionary to eliminate block effect in 3D display system
CN115514964A (en) HEVC compression noise standard deviation estimation method based on convolutional neural network
Huang et al. Beyond VVC: Towards perceptual quality optimized video compression using multi-scale hybrid approaches
CN105141967A (en) Fast adaptive loop filter algorithm based on just noticeable distortion (JND) model
CN112381744B (en) Adaptive pretreatment method for AV1 synthetic film grain
CN112954350B (en) Video post-processing optimization method and device based on frame classification
CN114189695B (en) HEVC compressed video visual perception improving method based on GAN
Leszczuk et al. Freely available large-scale video quality assessment database in full-hd resolution with h. 264 coding
Qi et al. Non-Reference Image Quality Assessment Based on Super-Pixel Segmentation and Information Entropy
KR100628839B1 (en) Method for detecting and compensating corner outlier
CN104219526B (en) HEVC rate distortion optimization algorithm based on just-noticeable perception quality judging criterion
CN115510271B (en) Content-oriented animation video non-reference quality evaluation method
KR20030078633A (en) Method of processing digital images for low-bit rate applications

Legal Events

Date Code Title Description
PB01 Publication
PB01 Publication
SE01 Entry into force of request for substantive examination
SE01 Entry into force of request for substantive examination