CN115514964A - HEVC compression noise standard deviation estimation method based on convolutional neural network - Google Patents
HEVC compression noise standard deviation estimation method based on convolutional neural network Download PDFInfo
- Publication number
- CN115514964A CN115514964A CN202110699673.0A CN202110699673A CN115514964A CN 115514964 A CN115514964 A CN 115514964A CN 202110699673 A CN202110699673 A CN 202110699673A CN 115514964 A CN115514964 A CN 115514964A
- Authority
- CN
- China
- Prior art keywords
- standard deviation
- hevc
- block
- noise standard
- video
- Prior art date
- Legal status (The legal status is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the status listed.)
- Pending
Links
- 238000007906 compression Methods 0.000 title claims abstract description 28
- 230000006835 compression Effects 0.000 title claims abstract description 27
- 238000013527 convolutional neural network Methods 0.000 title claims abstract description 11
- 238000000034 method Methods 0.000 title claims abstract description 9
- 238000012549 training Methods 0.000 claims description 6
- 238000010586 diagram Methods 0.000 claims description 4
- 238000000638 solvent extraction Methods 0.000 claims 2
- 230000000903 blocking effect Effects 0.000 abstract description 3
- 238000012360 testing method Methods 0.000 abstract description 2
- 230000000694 effects Effects 0.000 description 5
- 238000012545 processing Methods 0.000 description 3
- 208000037170 Delayed Emergence from Anesthesia Diseases 0.000 description 2
- 238000013139 quantization Methods 0.000 description 2
- 239000000654 additive Substances 0.000 description 1
- 230000000996 additive effect Effects 0.000 description 1
- 230000005540 biological transmission Effects 0.000 description 1
- 238000004891 communication Methods 0.000 description 1
- 238000012669 compression test Methods 0.000 description 1
- 230000006837 decompression Effects 0.000 description 1
- 238000011156 evaluation Methods 0.000 description 1
- 238000012986 modification Methods 0.000 description 1
- 230000004048 modification Effects 0.000 description 1
- 230000008520 organization Effects 0.000 description 1
- 238000011160 research Methods 0.000 description 1
Images
Classifications
-
- H—ELECTRICITY
- H04—ELECTRIC COMMUNICATION TECHNIQUE
- H04N—PICTORIAL COMMUNICATION, e.g. TELEVISION
- H04N19/00—Methods or arrangements for coding, decoding, compressing or decompressing digital video signals
- H04N19/10—Methods or arrangements for coding, decoding, compressing or decompressing digital video signals using adaptive coding
- H04N19/169—Methods or arrangements for coding, decoding, compressing or decompressing digital video signals using adaptive coding characterised by the coding unit, i.e. the structural portion or semantic portion of the video signal being the object or the subject of the adaptive coding
- H04N19/17—Methods or arrangements for coding, decoding, compressing or decompressing digital video signals using adaptive coding characterised by the coding unit, i.e. the structural portion or semantic portion of the video signal being the object or the subject of the adaptive coding the unit being an image region, e.g. an object
- H04N19/176—Methods or arrangements for coding, decoding, compressing or decompressing digital video signals using adaptive coding characterised by the coding unit, i.e. the structural portion or semantic portion of the video signal being the object or the subject of the adaptive coding the unit being an image region, e.g. an object the region being a block, e.g. a macroblock
-
- H—ELECTRICITY
- H04—ELECTRIC COMMUNICATION TECHNIQUE
- H04N—PICTORIAL COMMUNICATION, e.g. TELEVISION
- H04N19/00—Methods or arrangements for coding, decoding, compressing or decompressing digital video signals
- H04N19/42—Methods or arrangements for coding, decoding, compressing or decompressing digital video signals characterised by implementation details or hardware specially adapted for video compression or decompression, e.g. dedicated software implementation
-
- H—ELECTRICITY
- H04—ELECTRIC COMMUNICATION TECHNIQUE
- H04N—PICTORIAL COMMUNICATION, e.g. TELEVISION
- H04N19/00—Methods or arrangements for coding, decoding, compressing or decompressing digital video signals
- H04N19/60—Methods or arrangements for coding, decoding, compressing or decompressing digital video signals using transform coding
- H04N19/61—Methods or arrangements for coding, decoding, compressing or decompressing digital video signals using transform coding in combination with predictive coding
-
- H—ELECTRICITY
- H04—ELECTRIC COMMUNICATION TECHNIQUE
- H04N—PICTORIAL COMMUNICATION, e.g. TELEVISION
- H04N19/00—Methods or arrangements for coding, decoding, compressing or decompressing digital video signals
- H04N19/90—Methods or arrangements for coding, decoding, compressing or decompressing digital video signals using coding techniques not provided for in groups H04N19/10-H04N19/85, e.g. fractals
- H04N19/96—Tree coding, e.g. quad-tree coding
Abstract
The invention discloses an HEVC compression noise standard deviation estimation method based on a convolutional neural network, which mainly comprises the following steps of: the method comprises the steps of dividing a video frame subjected to HEVC coding into blocks, extracting blocking information from a code stream to determine whether a current block belongs to a high texture block or a low texture block, obtaining a real noise standard deviation of a small block by subtracting an original video and a compressed video, and using the real noise standard deviation as a label of the small block to construct a data set. The invention constructs a compression noise estimation network, and trains high and low texture data sets respectively to obtain a trained model. In the testing stage, videos are tested according to HEVC standards with different resolutions, and the standard deviation level of compression noise of the videos is estimated by taking a block as a unit under different QPs and compared with real noise. Experimental results show that the method can effectively estimate the noise standard deviation of the compressed video.
Description
Technical Field
The invention relates to a convolutional neural network and a video coding algorithm, in particular to an HEVC compression noise standard deviation estimation method based on the convolutional neural network, and belongs to the field of image communication.
Background
In recent years, video applications have grown rapidly, but at the same time, challenges have been presented to the transmission and storage of large amounts of video data. In this context, the International Telecommunications Union (ITU) has promulgated a High Efficiency Video Coding standard (HEVC/h.265) in cooperation with the International Organization for standardization (ISO). Compared with advanced video Coding (AVC/h.264), the HEVC standard can save about 50% of code rate under the same Coding quality. Inevitably, however, HEVC compressed video also contains many compression effects, especially in low bitrate segments, and the presence of these compression effects seriously affects the viewing experience of the user. The HEVC standard employs a block-based hybrid coding strategy, and after the predictive coding is finished, the prediction residual is transformed and quantized in units of blocks. The pixels at the block boundary are continuous, and the inconsistency of the processing parameters of the pixels distributed into different blocks causes the occurrence of blocking effect in a compressed video; quantization rounding and high-frequency zero setting cause the compressed video to lose a lot of detail information; inaccuracies in motion estimation and motion compensation further introduce compression noise. Many scholars refer to the following: accurate evaluation of the compression noise level can help the subsequent decompression effect algorithm, and setting an incorrect noise level may reduce the effect of denoising. The distribution of the compression noise is related to the compression process, so that the modeling difficulty is high, the mainstream research still regards the compression noise as additive gaussian noise at present, and the estimation of the compression noise is the estimation of the standard deviation of the gaussian noise. Convolutional neural networks have achieved significant success in the field of computer vision and image processing. Compared with the traditional image prior constructed by summarizing the image/video signal distribution, the convolutional neural network can learn the deep image characteristics in the image/video, which is also an important reason for obtaining excellent effects on various image/video processing tasks. Inspired by great success of a classification network based on CNN, the invention takes HEVC compression noise standard deviation estimation as a classification task for images of different styles.
Disclosure of Invention
The invention aims to estimate the standard deviation of HEVC compressed video compression noise.
The invention provides an HEVC compression noise standard deviation estimation method based on a convolutional neural network, which mainly comprises the following operation steps of:
(1) A training data set is constructed.
We performed 32 frame coding of 200 training videos in the data set LDV at QP =22, 27, 32, 37, the coding configuration being LDP. Then, the picture is divided into 64 × 64 small blocks, the difference is made between the small blocks and the original video to obtain a residual block of each image block, the standard deviation of the residual block is calculated, and the picture is classified according to the standard deviation, as shown in fig. 1. In general, pictures with low level of compression noise contain less texture detail, and pictures with high level of compression noise contain more texture. But we have also found special cases where some pictures with high noise levels also contain less texture, because the entire picture becomes smoother due to the coarse quantization of HEVC and the high frequency quality of the transform coefficients. For the image blocks which have less texture, the compressed image blocks represent the smooth blurring and have less compression noise, while the image blocks which have rich dense texture represent the smooth blurring and have larger compression noise. To solve this problem, we extract the blocking information from the encoded bitstream to determine whether a compressed smooth block is a high texture block or a low texture block before compression. As shown in fig. 2, when HEVC is compressed, quad-tree-based block division is adopted, dense block division is performed on a region with complex texture, sparse block division is performed on a region with simple texture, CU block information of HEVC is extracted from a code stream, an image is divided into a high texture block and a low texture block according to the depth of the CU block, the high texture block is determined when the depth is greater than 2, and the low texture block is determined when the depth is less than or equal to 2. A data set is constructed for each of the high texture blocks and the low texture blocks. The high texture blocks are divided into 15 classes, the standard deviation difference of each class is 1, from 0, the value width of each class is 0.5, if the standard deviation value interval of the first class is [0,0.5], the standard deviation value interval of the second class is [1,1.5], and the value space of the later class is analogized according to the form. For the low texture blocks, the low texture blocks are divided into 10 classes, the standard deviation difference of each class is 1, and the value width of each class is 0.7 from 0.
(2) And constructing a compression noise estimation network based on the convolutional neural network.
We propose an HEVC compression noise classification network whose network backbone is the classical classification network DENSE-NET, as shown in fig. 3, we make some modifications to the network to meet our task requirements. Since our block edge length is 64, we add a transposed convolutional layer before the first Dense block to further increase the field of view to enlarge the resolution of the feature map, and use 3 Dense blocks to achieve the balance of complexity and performance.
(3) And (3) carrying out network model training according to the data set in (1) and the network proposed in (2), and estimating the compression noise standard deviation for the HEVC compression test video block by using the trained model.
Drawings
Fig. 1 is a schematic diagram of classifying pictures based on different standard deviations of compression noise according to the present invention.
Fig. 2 is a high and low texture block distribution diagram of the present invention.
Fig. 3 is a block diagram of an HEVC compression noise standard deviation estimation network according to the present invention.
Detailed Description
In order to better illustrate the effectiveness of the present invention, videos with different resolution sizes in a standard HEVC test video are selected, noise estimation is performed on different QPs, and table one shows the difference between the estimation result of the present invention and the real noise. From the table, it can be seen that the present invention can obtain a more accurate estimation result.
Table one standard deviation difference between the invention and true noise
Claims (1)
1. An HEVC compression noise standard deviation estimation method based on a convolutional neural network is characterized by comprising the following steps of:
the method comprises the following steps: firstly, constructing a training data set, partitioning a video compressed by HEVC, extracting partitioning information from code stream information, dividing small blocks into high texture blocks and low texture blocks, calculating a real noise standard deviation by calculating a difference value between an original video and the compressed video, taking the real noise standard deviation as a label of each small block, and finally constructing two training data sets;
step two: constructing a network, namely constructing the network on the basis of DENSE-NET, adopting 3 Dense blocks, and increasing a transposed convolution layer to enlarge the resolution of a characteristic diagram;
step three: and (3) training the data set obtained in the step one through the network provided in the step two to obtain two models aiming at the high and low texture blocks, wherein the obtained models can effectively estimate the value of the standard deviation of the HEVC compressed video.
Priority Applications (1)
Application Number | Priority Date | Filing Date | Title |
---|---|---|---|
CN202110699673.0A CN115514964A (en) | 2021-06-23 | 2021-06-23 | HEVC compression noise standard deviation estimation method based on convolutional neural network |
Applications Claiming Priority (1)
Application Number | Priority Date | Filing Date | Title |
---|---|---|---|
CN202110699673.0A CN115514964A (en) | 2021-06-23 | 2021-06-23 | HEVC compression noise standard deviation estimation method based on convolutional neural network |
Publications (1)
Publication Number | Publication Date |
---|---|
CN115514964A true CN115514964A (en) | 2022-12-23 |
Family
ID=84499159
Family Applications (1)
Application Number | Title | Priority Date | Filing Date |
---|---|---|---|
CN202110699673.0A Pending CN115514964A (en) | 2021-06-23 | 2021-06-23 | HEVC compression noise standard deviation estimation method based on convolutional neural network |
Country Status (1)
Country | Link |
---|---|
CN (1) | CN115514964A (en) |
Citations (2)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
CN107133948A (en) * | 2017-05-09 | 2017-09-05 | 电子科技大学 | Image blurring and noise evaluating method based on multitask convolutional neural networks |
CN109658348A (en) * | 2018-11-16 | 2019-04-19 | 天津大学 | The estimation of joint noise and image de-noising method based on deep learning |
-
2021
- 2021-06-23 CN CN202110699673.0A patent/CN115514964A/en active Pending
Patent Citations (2)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
CN107133948A (en) * | 2017-05-09 | 2017-09-05 | 电子科技大学 | Image blurring and noise evaluating method based on multitask convolutional neural networks |
CN109658348A (en) * | 2018-11-16 | 2019-04-19 | 天津大学 | The estimation of joint noise and image de-noising method based on deep learning |
Non-Patent Citations (2)
Title |
---|
于海雯;易昕炜;徐少平;刘婷云;李崇禧;: "一种基于卷积神经网络的快速噪声水平估计算法", 南昌大学学报(理科版), no. 05, 25 October 2019 (2019-10-25) * |
肖进胜;朱力;赵博强;雷俊锋;王莉;: "基于主成分分析的分块视频噪声估计", 自动化学报, no. 09, 11 December 2017 (2017-12-11) * |
Similar Documents
Publication | Publication Date | Title |
---|---|---|
Zhang et al. | Low-rank-based nonlocal adaptive loop filter for high-efficiency video compression | |
Balle et al. | Models for static and dynamic texture synthesis in image and video compression | |
CN110944200B (en) | Method for evaluating immersive video transcoding scheme | |
WO2005055618A1 (en) | Method and system for video quality measurements | |
CN110852964A (en) | Image bit enhancement method based on deep learning | |
CN105933711B (en) | Neighborhood optimum probability video steganalysis method and system based on segmentation | |
US9268791B2 (en) | Method and apparatus for image processing and computer readable medium | |
WO2017004889A1 (en) | Jnd factor-based super-pixel gaussian filter pre-processing method | |
CN107295217B (en) | Video noise estimation method based on principal component analysis | |
CN112584153B (en) | Video compression method and device based on just noticeable distortion model | |
CN111726638A (en) | HEVC (high efficiency video coding) optimization method combining decompression effect and super-resolution | |
CN115442613A (en) | Interframe information-based noise removal method using GAN | |
CN102523454B (en) | Method for utilizing 3D (three dimensional) dictionary to eliminate block effect in 3D display system | |
CN115514964A (en) | HEVC compression noise standard deviation estimation method based on convolutional neural network | |
Huang et al. | Beyond VVC: Towards perceptual quality optimized video compression using multi-scale hybrid approaches | |
CN105141967A (en) | Fast adaptive loop filter algorithm based on just noticeable distortion (JND) model | |
CN112381744B (en) | Adaptive pretreatment method for AV1 synthetic film grain | |
CN112954350B (en) | Video post-processing optimization method and device based on frame classification | |
CN114189695B (en) | HEVC compressed video visual perception improving method based on GAN | |
Leszczuk et al. | Freely available large-scale video quality assessment database in full-hd resolution with h. 264 coding | |
Qi et al. | Non-Reference Image Quality Assessment Based on Super-Pixel Segmentation and Information Entropy | |
KR100628839B1 (en) | Method for detecting and compensating corner outlier | |
CN104219526B (en) | HEVC rate distortion optimization algorithm based on just-noticeable perception quality judging criterion | |
CN115510271B (en) | Content-oriented animation video non-reference quality evaluation method | |
KR20030078633A (en) | Method of processing digital images for low-bit rate applications |
Legal Events
Date | Code | Title | Description |
---|---|---|---|
PB01 | Publication | ||
PB01 | Publication | ||
SE01 | Entry into force of request for substantive examination | ||
SE01 | Entry into force of request for substantive examination |