CN109922339A

CN109922339A - In conjunction with the image coding framework of multi-sampling rate down-sampling and super-resolution rebuilding technology

Info

Publication number: CN109922339A
Application number: CN201711314711.6A
Authority: CN
Inventors: 何小海; 李兴龙; 卿粼波; 任超; 占文枢; 滕奇志; 吴小强
Original assignee: Sichuan University
Current assignee: Sichuan University
Priority date: 2017-12-12
Filing date: 2017-12-12
Publication date: 2019-06-21

Abstract

The invention discloses the image coding frameworks of a kind of combination multi-sampling rate down-sampling and super-resolution rebuilding technology.Mainly comprise the steps that image to be compressed is divided into 32 × 32 block, then to each piece of progress down-sampling under plurality of sampling rates；Rough JPEG encoding and decoding are carried out to every kind of down-sampling block, and up-sample decoded low resolution block；Each piece of optimal sample rate and quantization parameter is selected by rate-distortion optimization；According to selected parameter, using JPEG to each piece of progress encoding and decoding, and decoding block is rebuild using the super-resolution technique based on deep learning；Reconstructed block is combined into image.Higher compression ratio can may be implemented and under identical decoded image quality in the distortion performance that all-key rate section promotes JPEG in the present invention.This method can be adapted for a variety of mainstream image encoders, realize the efficient storage and transmission of image data.

Description

In conjunction with the image coding framework of multi-sampling rate down-sampling and super-resolution rebuilding technology

Technical field

The present invention relates to image super-resolution rebuildings and Image Compression, and in particular to adopts under a kind of combination multi-sampling rate The image coding framework of sample and super-resolution rebuilding technology, belongs to field of picture communication.

Background technique

Compression of images is a kind of to reduce the image procossing skill of data volume by reducing the redundancy between raw image data Art, the problems such as can solve the memory space inadequate and limited transmission bandwidth of vision facilities.JPEG is encoded as the image of mainstream One of standard, since it has many advantages, such as lower computation complexity, so that it is in the lossy compressions neck such as network and wireless communication Domain is widely used.However, the compression performance of JPEG will be greatly reduced in the case where limited bit rate, so that decoding image There are serious pinch effects, to reduce the visual quality of compressed images.

Universal with high definition and ultra high-definition equipment, demand of the people to the resolution ratio of image and video is higher and higher.So And it is limited to higher equipment cost and different use environments etc., so that the image and video quality that obtain, which are not able to satisfy still, to be needed It asks.Super-resolution rebuilding is a kind of technology for carrying out increase resolution to known low-resolution image by way of software, can To get higher-quality image under the premise of not needing to update hardware device, there is very high practicability.In recent years, With the fast development of machine learning, the super resolution ratio reconstruction method based on deep learning enters the public visual field.Compared to Traditional super-resolution rebuilding technology, the super-resolution rebuilding based on deep learning can obtain higher-quality image, simultaneously In phase of regeneration, has and rebuild speed faster.

Down-sampling is carried out to image to be compressed in coding side, while obtaining original point by super-resolution technique in decoding end The scheme of resolution image solves the problems, such as that existing compression standard is ineffective in low bit- rate section to a certain extent.But it should Scheme is only applicable to compared with low bit- rate section, and with the promotion of code rate, compression performance, which has, significantly to be declined, therefore is actually being answered There is significant limitation in.

Summary of the invention

The image coding framework of combination multi-sampling rate down-sampling proposed by the present invention and super-resolution rebuilding technology, for mentioning JPEG coding standard is risen in the distortion performance of all-key rate section.The frame proposed simultaneously can also be applied to other third parties and compile Code device is to promote its coding efficiency.

The image coding framework of combination multi-sampling rate down-sampling proposed by the present invention and super-resolution rebuilding technology, it is main to wrap Include following operating procedure:

(1) original image is subjected to piecemeal by 32 × 32 sizes, the down-sampling of plurality of sampling rates is then carried out to each piece；

(2) the pre- encoding and decoding of more quantization parameters are carried out to the fritter after down-sampling, and after statistic quantification DCT coefficient non-zero The number of value；

(3) decoded image fritter is interpolated into original resolution with interpolation method, and calculates reconstructed block and original block Between mean square error；

(4) rate-distortion optimization algorithm is used, optimal sample rate and its corresponding down-sampling mode and quantization parameter are selected；

(5) according to the optional sampling mode and quantization parameter of acquisition, using JPEG to original picture block carry out down-sampling with Encoding and decoding；

(6) model gone out by using the super-resolution rebuilding technique drill based on deep learning, to decoded image Fritter carries out super-resolution rebuilding；

(7) it combines the image block of reconstruction according to corresponding mode, forms final decoding image.

Detailed description of the invention

Fig. 1 is the block diagram for the image coding framework that the present invention combines multi-sampling rate down-sampling and super-resolution rebuilding technology

Fig. 2 is the specific aim model training process of the super resolution ratio reconstruction method based on deep learning and its block diagram of reconstruction

Fig. 3 is that the distortion performance of the present invention and JPEG compression algorithm to ' Bike ' test image in all-key rate section compares

Fig. 4 is that the distortion performance of the present invention and JPEG compression algorithm to ' Lena ' test image in all-key rate section compares

Fig. 5 is the vision of ' Lena ' decoding image of the present invention and JPEG in low bit- rate section and when code rate is all 0.2bpp Effect compares: figure (a) is that JPEG decodes image, and figure (b) is present invention decoding image

Fig. 6 is the view of ' Bike ' decoding image of the present invention and JPEG in middle high code rate section and when code rate is all 1.06bpp Feel that effect compares: figure (a) is that JPEG decodes image, and figure (b) is present invention decoding image

Specific embodiment

The present invention will be further explained below with reference to the attached drawings:

In Fig. 1, in conjunction with the image coding framework of multi-sampling rate down-sampling and super-resolution rebuilding technology, can specifically it be divided into Seven steps below:

(6) the multiple models gone out by using the super-resolution rebuilding technique drill based on deep learning, to decoded Image fritter carries out super-resolution rebuilding；

Specifically, in the step (1), by original image segmentation to be compressed at the image for 32 × 32 sizes not overlapped Block；Then it is directed to each image block, devises 9 kinds of different sample rates (the down-sampling modes different corresponding to 16 kinds) table 1 In give plurality of sampling rates and its corresponding down-sampling mode.Down-sampling is all made of the optimal down-sampling side based on least square Case, whole process can be formulated as

Wherein, H^h,vFor low resolution blockCorresponding up-sampling matrix, h and v respectively indicate horizontal and vertical direction On sample rate, Y is original picture block.Optimal down-sampling block according to least-squares algorithm, under available current sample rate For

When current procedures carry out down-sampling, need to find out the up-sampling matrix under all sample rates, and use above-mentioned public affairs Formula obtains the optimal down-sampling block under different sample rates.

1 sample rate of table and its sampling configuration

In the step (2), 8 different quantization parameters are given, to distribute more bits to each down-sampling block, To weaken the loss of down-sampling process bring high-frequency information, standard JPEG then is carried out to the image block of 16 kinds of different resolutions Encoding and decoding.In an encoding process, the number (NZC) for counting nonzero value of each image fritter DCT coefficient after quantization, is used for Code rate in step (4) in rate-distortion optimization is to reduce calculation amount.Meanwhile it is worth noting that pre- encoding and decoding in current procedures It does not need to carry out Huffman encoding and decoding, need to only carry out the positive and negative transform and quantization of DCT and inverse quantization processes.It is used in the present invention Quantization parameter QP is

Wherein QF is that present image carries out the quality factor used when standard JPEG coding.

In the step (3), decoded down-sampled images fritter is up-sampled by simple interpolation method, and The mean square error (MSE) between image block and original picture block after calculating up-sampling, as in rate-distortion optimization in step (4) Distortion value.

In the step (4), in conjunction under the different sample rates and quantization parameter being calculated in step (2) and step (3) NZC and MSE, by Rate-distortion optimization method obtain current image block optimal sample rate and quantization parameter, rate distortion it is excellent Change process is defined as

Min (J), wtihJ=MSE+ λ NZC

The smallest cost function value under plurality of sampling rates and quantization parameter is calculated, the corresponding sample rate of the smallest J is selected Under sampling configuration and quantization parameter, as optimal sampling configuration and quantization parameter.Simultaneously in optimization process, MSE and amount It is directly proportional to change factor Q F, and NZC and quantizing factor QF are inversely proportional, is fitted by mass data and obtains its functional relation, and led to Crossing the value for asking local derviation to obtain Lagrange multiplier λ is

In the step (5), the down-sampling rate selected using step (4) adopt to each 32 × 32 image block Then sample carries out encoding and decoding under selected quantization parameter to the image fritter after down-sampling using JPEG, in encoding-decoding process In, use the quantization parameter of step (4) acquisition as the quality factor of JPEG.

In the step (6), using the super-resolution rebuilding technology based on deep learning to decoded image fritter into Row super-resolution rebuilding obtains the final image block of corresponding 32 × 32 size.Fig. 2 gives super resolution ratio reconstruction method instruction Experienced and reconstruction process block diagram.

In the training stage, the training image of input is subjected to down-sampling under same sample rate, this process does not consider more Then sample rate mode carries out JPEG to the training image after down-sampling at different Q P to simplify the network model trained Encoding and decoding, then by interpolation method up-sample decoding image, obtain with an equal amount of degraded image of training image, finally will be former Degraded image after beginning training image and up-sampling carries out dictionary respectively as the high-resolution and low-resolution image during model training Training uses VDSR (Super Resolution Using Very Deep Convolutional in the training stage Networks) algorithm is trained at each QP.Each decoded image fritter is quantified by it in phase of regeneration Parameter QP selects corresponding network model to carry out super-resolution rebuilding.

In the step (7), the image block rebuild in step (6) is carried out accordingly according to the partitioned mode in step (1) Combination, form final decoding image.

Two width gray level images ' Bike ' are selected at random from test picture library, ' Lena ' is tested with above-mentioned steps, and with Joint Photographic Experts Group compares distortion performance and decodes the visual quality of image.Fig. 3 and Fig. 4 gives the rate distortion of two width pictures Can, wherein horizontal axis is code rate, and unit is bpp；The longitudinal axis is Y-PSNR (PSNR), and unit is dB.When code rate is identical, PSNR It is higher, illustrate that distortion performance is better.Fig. 5 is when code rate is 0.2bpp, and standard JPEG and the present invention compress ' Lena ' and tie The subjective vision effect contrast figure of fruit.Fig. 6 is when code rate is 1.06bpp, and standard JPEG and the present invention compress ' Bike ' and tie The subjective vision effect contrast figure of fruit.Experimental result has universality to other test images.

Claims

1. combining the image coding framework of multi-sampling rate down-sampling and super-resolution rebuilding technology, it is characterised in that including following step It is rapid:

Step 1: original image is subjected to piecemeal by 32 × 32 sizes, the down-sampling of plurality of sampling rates is then carried out to each piece；

Step 2: carrying out the pre- encoding and decoding of more quantization parameters to the fritter after down-sampling, and after statistic quantification DCT coefficient non-zero The number of value；

Step 3: decoded image fritter is interpolated into original resolution with interpolation method, and calculates reconstructed block and original block Between mean square error；

Step 4: rate-distortion optimization algorithm is used, optimal sample rate and its corresponding down-sampling mode and quantization parameter are selected；

Step 5: according to the optional sampling mode and quantization parameter of acquisition, using JPEG to original picture block carry out down-sampling with Encoding and decoding；

Step 6: the multiple models gone out by using the super-resolution rebuilding technique drill based on deep learning, to decoded Image fritter carries out super-resolution rebuilding；

Step 7: combining the image block of reconstruction according to corresponding mode, forms final decoding image.

2. the image coding framework of combination multi-sampling rate down-sampling according to claim 1 and super-resolution rebuilding technology, It is characterized in that described in step 1 carry out each 32 × 32 image block using least square method under plurality of sampling rates Optimal down-sampling,

In formula, Y is original picture block,For the low resolution block after down-sampling, h and v respectively indicate horizontal and vertical direction On sample rate, this method acquires the corresponding up-sampling matrix H under every kind of sample rate^h,v, then solved using least square method Up-sampling block and original block in above formula minimize the error problem,

Known optimal down-sampling block of the original picture block under different sample rates can be obtained by this method.

3. the image coding framework of combination multi-sampling rate down-sampling according to claim 1 and super-resolution rebuilding technology, It is characterized in that walking pre- encoding-decoding process constructed in two and step 3, compiled in the pre- encoding-decoding process without Huffman Code only carries out the positive and negative transform and quantization of DCT and inverse quantization processes.

4. the image coding framework of combination multi-sampling rate down-sampling according to claim 1 and super-resolution rebuilding technology, It is characterized in that being that rate distortion used in each image block selection optional sampling rate and quantization parameter is excellent described in step 4 Change algorithm, the purpose of rate-distortion optimization is to find so that the smallest mode of majorized function J,

Min (J), wtih J=MSE+ λ NZC

In formula, MSE and NZC be respectively the non-zero quantised DCT coefficient obtained in step 2 and step 3 under each pattern with And the mean square error between reconstructed block and original block；Simultaneously in optimization process, MSE is directly proportional to quantizing factor QF, and NZC with Quantizing factor QF is inversely proportional, and is fitted by mass data and obtains its functional relation, and by asking local derviation to obtain Lagrange multiplier λ Value be

5. the image coding framework of combination multi-sampling rate down-sampling according to claim 1 and super-resolution rebuilding technology, It is characterized in that use described in step 6 is based on the super-resolution rebuilding technology of deep learning to decoded down-sampling block Super-resolution rebuilding: training stage is carried out, down-sampling is carried out to training image under same sample rate, then uses standard JPEG Then encoding and decoding use VDSR by this process come the encoding-decoding process under sample rates different in phantom frame and quantization parameter (Super Resolution Using Very Deep Convolutional Networks) algorithm trains decoded drop Mapping relations between matter image and original image generate convolutional network model；When rebuilding, joined according to the quantization of current decoding block Number, selects corresponding network model to be rebuild.