CN110047038B - Single-image super-resolution reconstruction method based on hierarchical progressive network - Google Patents
- Publication number
- CN110047038B (application CN201910146330.4A)
- Authority
- CN
- China
- Prior art keywords
- layer
- resolution
- super
- convolution
- image
- Legal status: Active
Classifications
- G06N3/045 — Computing arrangements based on biological models; neural networks; architecture: combinations of networks
- G06T3/4007 — Scaling of whole images or parts thereof based on interpolation, e.g. bilinear interpolation
- G06T3/4053 — Scaling of whole images or parts thereof based on super-resolution, i.e. the output image resolution being higher than the sensor resolution
Abstract
The invention provides a single-image super-resolution reconstruction method based on a hierarchical progressive network, whose main components are a feature extraction branch, an image reconstruction branch, a hierarchical progressive network structure, and a loss function. The method decomposes a high-magnification super-resolution task into several subtasks, each of which can be completed independently by one super-resolution unit network; the unit networks are cascaded to form the whole network. The same trained model can thus perform super-resolution reconstruction of images at multiple magnification factors.
Description
Technical Field
The invention relates to the technical field of image super-resolution, and in particular to a single-image super-resolution reconstruction method based on a hierarchical progressive network.
Background
Image super-resolution is a technique that, without changing the existing hardware, restores from a single low-resolution image (or a sequence of low-resolution images with sub-pixel offsets) a corresponding high-resolution image containing more detail. The restored image can reveal latent details and hidden structures, improving the visual quality of the image. Image super-resolution plays an important role in fields such as medical imaging, security surveillance, audio-video entertainment, and satellite remote sensing.
The most widely applied single-image super-resolution reconstruction techniques at present are learning-based methods, including methods based on over-complete sparse dictionary learning and methods based on deep convolutional neural networks. These methods achieve good reconstruction quality in specific scenes and for low-magnification super-resolution tasks. However, they still fail to produce satisfactory results in generalized settings, especially for high-magnification super-resolution tasks. Moreover, for multi-scale super-resolution, most existing methods learn a separate mapping from low-resolution to high-resolution images for each scale, which greatly increases the learning cost.
Disclosure of Invention
The invention aims to provide a single-image super-resolution reconstruction method that addresses two problems of existing methods: poor reconstruction quality on high-magnification super-resolution tasks, and the inability to complete a multi-scale super-resolution task in a single pass.
The technical solution realizing the purpose of the invention is as follows: a single-image super-resolution reconstruction method based on a hierarchical progressive network, comprising:
(I) a feature extraction branch, comprising a feature extraction convolutional layer, a nonlinear mapping module, and an upsampling layer, wherein the convolution kernel size is 3×3, the number of output feature maps is 160, and the convolution stride is 1; the upsampling layer is a transposed convolutional layer with a 3×3 kernel and an upsampling factor of 2;
(II) an image reconstruction branch, comprising a local residual structure, N recursive block structures, and a residual prediction convolutional layer, wherein the input-to-output mapping of the local residual structure can be expressed by equation (1):

y = F(x, {W_i}) + x

where x is the input, y is the desired output, and F(x, {W_i}) represents the potential mapping to be learned;
the recursive block structure comprises a dense connection block and a transition layer; each layer in the dense connection block takes its input from all preceding layers and passes its output to all subsequent layers; the transition layer has a 1×1 convolution kernel, 160 output feature maps, and a convolution stride of 1;
the residual prediction convolutional layer has a 3×3 convolution kernel, 160 output feature maps, and a convolution stride of 1;
(III) hierarchical progressive network structure;
(IV) a loss function, given by:

L(θ) = (1/N) · Σ_{i=1}^{N} ρ((I_H − U(I_L)) − I_r)

where I_r is the residual image obtained through the mapping F, I_SR is the super-resolution reconstructed image, θ denotes the network parameters, N is the number of images in a training batch, U(I_L) is an interpolated upsampling of the low-resolution image, and I_H − U(I_L) is the ground-truth residual; ρ is the Charbonnier function, defined as ρ(x) = √(x² + ε²), where ε takes the empirical value 10⁻³.
Furthermore, the single-layer network structure within the dense connection block comprises convolutional layer I, a batch normalization layer, a nonlinear activation function, and convolutional layer II.
Convolutional layer I has a 1×1 convolution kernel, a stride of 1, and padding of 1; the batch normalization layer is given by:

y = (x − E(x)) / √(Var(x) + ε)

where E(·) and Var(·) denote the mean and variance operations, respectively;
the nonlinear activation function is the ReLU function, given by:
f(z) = max(0, z);
the convolution kernel size of convolution layer II is 1 × 1, the convolution step size is 1, and padding is 1.
Further, the up-sampling layer performs interpolation up-sampling on the input low-resolution image using a bicubic interpolation method.
Furthermore, super-resolution tasks of different magnification factors in the hierarchical progressive network structure reuse the same feature extraction branch structure.
Furthermore, under the 4× and 8× super-resolution tasks of the hierarchical progressive network structure, the input of the feature extraction branch is the upsampled feature map output by the previous level.
Beneficial effects: the invention provides a hierarchical progressive network structure for single-image super-resolution reconstruction that decomposes a high-magnification super-resolution task into several subtasks, each completed independently by one super-resolution unit network; the unit networks are cascaded to form the whole network. The invention adopts structures with skip connections, such as local residuals and dense connections, to improve the efficiency of information flow and avoid vanishing gradients.
Drawings
FIG. 1 is a diagram of the hierarchical progressive network architecture for single-image super-resolution according to the present invention;
FIG. 2 is a detailed structure diagram of the 2× image super-resolution task in the network structure of the present invention;
FIG. 3 is a diagram of the recursive block in the network architecture of the present invention;
FIG. 4 is a diagram of a single layer of the dense connection structure of the present invention.
Detailed Description
The invention is further illustrated by the following examples in conjunction with the accompanying drawings.
Fig. 1 illustrates the hierarchical progressive network structure for single-image super-resolution according to the invention. The proposed hierarchical progressive neural network can perform super-resolution processing on a single image at a reconstruction factor of s, where s = 2, 4, or 8. The network consists of a group of cascaded upsampling units, each of which performs 2× super-resolution on the image. The upsampling unit at each level mainly comprises a feature extraction branch and an image reconstruction branch.
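The cascade of 2× units described above can be sketched as follows. This is a minimal PyTorch illustration, not the patent's exact network: the internal layer composition of each unit is simplified (the real nonlinear mapping module with its recursive blocks appears later), and the `SRUnit` and `progressive_sr` names are hypothetical; only the 3×3/160-map convolutions, the factor-2 transposed convolution, and the bicubic skip follow the text.

```python
import torch
import torch.nn as nn
import torch.nn.functional as F


class SRUnit(nn.Module):
    """One 2x super-resolution unit: feature extraction, a placeholder
    mapping, transposed-conv upsampling, and a bicubic-upsampled skip."""

    def __init__(self, channels=160):
        super().__init__()
        self.extract = nn.Conv2d(3, channels, 3, stride=1, padding=1)   # 3x3, 160 maps, stride 1
        self.mapping = nn.Conv2d(channels, channels, 3, padding=1)      # stand-in for the mapping module
        self.up = nn.ConvTranspose2d(channels, channels, 3, stride=2,   # transposed conv, factor 2
                                     padding=1, output_padding=1)
        self.to_residual = nn.Conv2d(channels, 3, 3, padding=1)

    def forward(self, x):
        feat = self.up(torch.relu(self.mapping(self.extract(x))))
        residual = self.to_residual(feat)                               # predicted residual image
        base = F.interpolate(x, scale_factor=2, mode="bicubic",
                             align_corners=False)                       # interpolation branch
        return base + residual                                          # pixel-wise fusion


def progressive_sr(x, units):
    """Cascade the 2x units: after k units the image is 2**k times larger."""
    for unit in units:
        x = unit(x)
    return x
```

Calling `progressive_sr` with one, two, or three units yields the 2×, 4×, and 8× reconstructions from a single model, which is the point of the hierarchical cascade.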
The feature extraction branch of the upsampling unit at each level is shown in Fig. 2. It comprises a feature extraction convolutional layer, a nonlinear mapping module, and an upsampling layer.
The feature extraction layer is as follows: for the 2× super-resolution task, it is a convolutional layer with a 3×3 kernel, 160 output feature maps, and a stride of 1; for the 4× and 8× super-resolution tasks, the feature extraction layer takes the output of the previous super-resolution unit.
The nonlinear mapping module comprises a local residual structure, N recursive block structures, and a residual prediction convolutional layer.
The local residual structure is as follows: with x as input and y as the desired output, the input-to-output mapping can be expressed by equation (1):

y = F(x, {W_i}) + x    (1)

where F(x, {W_i}) represents the complex potential mapping to be learned.
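Equation (1) is the standard identity-skip formulation, and can be sketched directly. The `LocalResidual` wrapper below and the example body's layer widths are illustrative, not taken from the patent; only the y = F(x, {W_i}) + x structure is.

```python
import torch
import torch.nn as nn


class LocalResidual(nn.Module):
    """Computes y = F(x, {W_i}) + x: the body learns only the residual,
    while the identity skip carries the input through unchanged."""

    def __init__(self, body: nn.Module):
        super().__init__()
        self.body = body  # the potential mapping F to be learned

    def forward(self, x):
        return self.body(x) + x


# example body F: two 3x3 convolutions with a ReLU between them
block = LocalResidual(nn.Sequential(
    nn.Conv2d(160, 160, 3, padding=1),
    nn.ReLU(),
    nn.Conv2d(160, 160, 3, padding=1)))
```

The skip connection means that if F outputs zero, the block reduces to the identity, which is what makes deep stacks of such blocks easy to optimize.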
The recursive block structure is shown in fig. 3, and includes a dense connection block and a transition layer, specifically:
In the dense connection block, each layer takes its input from all preceding layers and passes its output to all subsequent layers; that is, the input of each layer is the concatenation of the information streams output by all previous layers. The single-layer network structure is shown in Fig. 4:
(1) Convolutional layer I: the convolution kernel size is 1×1, the stride is 1, and the padding is 1;
(2) Batch Normalization (BN) layer, which can be expressed by equation (2):

y = (x − E(x)) / √(Var(x) + ε)    (2)

where E(·) and Var(·) denote the mean and variance operations, respectively;
(3) Nonlinear activation function: the ReLU activation function is used, given by equation (3):
f(z) = max(0, z)    (3)
(4) Convolutional layer II: the convolution kernel size is 1×1, the stride is 1, and the padding is 1.
The transition layer is a convolutional layer with a 1×1 kernel, 160 output feature maps, and a stride of 1.
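The dense connection block and its transition layer can be sketched as follows. This is a hedged illustration: the growth rate and the number of layers per block are hypothetical (the patent does not state them), the 1×1 convolutions here use no padding so the spatial size is preserved, and BN/ReLU follow the single-layer structure described above.

```python
import torch
import torch.nn as nn


class DenseLayer(nn.Module):
    """Single layer of the dense block: conv I -> BN -> ReLU -> conv II."""

    def __init__(self, in_ch, growth=32):
        super().__init__()
        self.net = nn.Sequential(
            nn.Conv2d(in_ch, growth, 1, stride=1),   # convolutional layer I (1x1)
            nn.BatchNorm2d(growth),                  # batch normalization
            nn.ReLU(inplace=True),                   # nonlinear activation
            nn.Conv2d(growth, growth, 1, stride=1))  # convolutional layer II (1x1)

    def forward(self, x):
        return self.net(x)


class DenseBlock(nn.Module):
    """Each layer receives the concatenation of all earlier outputs and its
    output is forwarded to every later layer; a 1x1 transition convolution
    restores the channel count to 160 feature maps."""

    def __init__(self, in_ch=160, growth=32, n_layers=4):
        super().__init__()
        self.layers = nn.ModuleList(
            DenseLayer(in_ch + i * growth, growth) for i in range(n_layers))
        self.transition = nn.Conv2d(in_ch + n_layers * growth, 160, 1, stride=1)

    def forward(self, x):
        feats = [x]
        for layer in self.layers:
            feats.append(layer(torch.cat(feats, dim=1)))  # input from all earlier layers
        return self.transition(torch.cat(feats, dim=1))
```

Because every layer sees all earlier feature maps directly, gradients also flow back through these short paths, which is the mechanism the text credits with avoiding vanishing gradients.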
The residual prediction convolutional layer integrates the feature maps extracted by the dense connection structure into a residual map for subsequent addition to the input data. Specifically, it has a 3×3 convolution kernel, 160 output feature maps, and a stride of 1.
The upsampling layer is a transposed convolutional layer with a 3×3 kernel and an upsampling factor of 2.
The image reconstruction branch performs interpolated upsampling on the input low-resolution image using bicubic interpolation, with the upsampling ratio matched to the reconstruction factor of each task. Each super-resolution task takes the original low-resolution image as input and produces, through 2×, 4×, and 8× bicubic interpolation respectively, an interpolated image of the same size as the predicted residual map. Finally, the images of the two branches are added pixel by pixel to obtain the final super-resolution reconstructed image.
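The two-branch fusion at each scale can be sketched as a small helper. The `reconstruct` function name and the dict of per-scale residuals are hypothetical conveniences; the operation itself (bicubic upsampling of the original low-resolution image, then pixel-wise addition of the predicted residual) follows the text.

```python
import torch
import torch.nn.functional as F


def reconstruct(low_res, residuals):
    """Fuse the two branches at each scale: bicubic-upsample the original
    low-resolution image by the scale factor (2, 4, or 8) and add the
    predicted residual pixel by pixel, i.e. I_SR = U(I_L) + I_r.

    residuals: dict mapping scale factor -> predicted residual tensor.
    """
    outputs = {}
    for scale, residual in residuals.items():
        base = F.interpolate(low_res, scale_factor=scale, mode="bicubic",
                             align_corners=False)
        outputs[scale] = base + residual
    return outputs
```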
The goal of image super-resolution is to find a mapping function F such that the super-resolution image F(I_L) obtained by mapping the low-resolution image I_L agrees as closely as possible with the original high-resolution image I_H. Let the residual image obtained through the mapping F be I_r, the super-resolution reconstructed image be I_SR, and the network parameters be θ. The loss function used in the invention can be expressed by equation (4):

L(θ) = (1/N) · Σ_{i=1}^{N} ρ((I_H − U(I_L)) − I_r)    (4)

where N is the number of images in a training batch, U(I_L) is an interpolated upsampling of the low-resolution image, and I_H − U(I_L) is the ground-truth residual. ρ is the Charbonnier function, defined as ρ(x) = √(x² + ε²), where ε takes the empirical value 10⁻³.
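Equation (4) can be written out directly in NumPy. The function names below are illustrative; the math follows the definitions above: the Charbonnier penalty ρ(x) = √(x² + ε²) with ε = 10⁻³, applied to the difference between the ground-truth residual I_H − U(I_L) and the predicted residual I_r, then averaged.

```python
import numpy as np


def charbonnier(x, eps=1e-3):
    """Charbonnier penalty rho(x) = sqrt(x^2 + eps^2), a smooth variant
    of the L1 loss; eps = 1e-3 is the empirical value from the text."""
    return np.sqrt(x * x + eps * eps)


def reconstruction_loss(i_high, u_i_low, i_res):
    """Mean Charbonnier loss over a batch: the predicted residual i_res is
    compared against the ground-truth residual i_high - u_i_low."""
    return float(np.mean(charbonnier((i_high - u_i_low) - i_res)))
```

Note that even a perfect prediction gives a loss of ε rather than 0, since ρ(0) = ε; the smooth minimum is what makes the Charbonnier penalty easier to optimize than plain L1.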
Compared with conventional deep-neural-network algorithms for image super-resolution, the hierarchical progressive structure proposed here can generate multi-scale predicted images in a single forward pass. It also integrates information flow using local residuals and dense connections, which avoids vanishing gradients and allows a deeper network. This embodiment evaluates the image super-resolution method of the invention using peak signal-to-noise ratio (PSNR) and structural similarity (SSIM); higher PSNR and SSIM values generally indicate better image quality. When tested on the Set5, Set14, and BSD100 datasets, the super-resolution reconstruction results of the invention achieve PSNR values 0.25 dB to 2.35 dB higher and SSIM values 0.003 to 0.053 higher on each test set.
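The PSNR metric used in the evaluation can be computed as below. This is the standard definition (not specific to this patent), assuming 8-bit images with a peak value of 255; SSIM is omitted here as it involves windowed statistics beyond a short sketch.

```python
import numpy as np


def psnr(ref, test, peak=255.0):
    """Peak signal-to-noise ratio in dB between a reference image and a
    reconstruction; higher values indicate a closer match."""
    ref = np.asarray(ref, dtype=np.float64)
    test = np.asarray(test, dtype=np.float64)
    mse = np.mean((ref - test) ** 2)
    if mse == 0:
        return float("inf")  # identical images
    return 10.0 * np.log10(peak * peak / mse)
```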
The invention provides a hierarchical progressive network structure for single-image super-resolution reconstruction. The structure decomposes a high-magnification super-resolution task into several subtasks, each completed independently by one super-resolution unit network; the unit networks are cascaded to form the whole network. In the training stage, the 2×, 4×, and 8× upsampling factors can be trained simultaneously, and in the testing stage the same trained model can perform super-resolution reconstruction at all three factors. The invention also adopts structures with skip connections, such as local residuals and dense connections, to improve the efficiency of information flow and avoid vanishing gradients.
Claims (3)
1. A single-image super-resolution reconstruction method based on a hierarchical progressive network, characterized by comprising:
(I) a feature extraction branch, comprising a feature extraction convolutional layer, a nonlinear mapping module, and an upsampling layer, wherein the convolution kernel size is 3×3, the number of output feature maps is 160, and the convolution stride is 1; the upsampling layer is a transposed convolutional layer with a 3×3 kernel and an upsampling factor of 2;
(II) an image reconstruction branch, comprising a local residual structure, N recursive block structures, and a residual prediction convolutional layer, wherein the input-to-output mapping of the local residual structure is given by:

y = F(x, {W_i}) + x

where x is the input, y is the desired output, and F(x, {W_i}) represents the potential mapping to be learned;
the recursive block structure comprises a dense connection block and a transition layer; each layer in the dense connection block takes its input from all preceding layers and passes its output to all subsequent layers; the transition layer has a 1×1 convolution kernel, 160 output feature maps, and a convolution stride of 1;
the residual prediction convolutional layer has a 3×3 convolution kernel, 160 output feature maps, and a convolution stride of 1;
(III) hierarchical progressive network structure;
(IV) a loss function, given by:

L(θ) = (1/N) · Σ_{i=1}^{N} ρ((I_H − U(I_L)) − I_r)

where I_r is the residual image obtained through the mapping F, I_SR is the super-resolution reconstructed image, θ denotes the network parameters, N is the number of images in a training batch, U(I_L) is an interpolated upsampling of the low-resolution image, and I_H − U(I_L) is the ground-truth residual; ρ is the Charbonnier function, defined as ρ(x) = √(x² + ε²), where ε takes the empirical value 10⁻³;
wherein super-resolution tasks of different magnification factors in the hierarchical progressive network structure reuse the same feature extraction branch structure; and
under the 4× and 8× super-resolution tasks of the hierarchical progressive network structure, the input of the feature extraction branch is the upsampled feature map output by the previous level.
2. The single-image super-resolution reconstruction method based on a hierarchical progressive network according to claim 1, wherein the single-layer network structure within the dense connection block comprises convolutional layer I, a batch normalization layer, a nonlinear activation function, and convolutional layer II;
convolutional layer I has a 1×1 convolution kernel, a stride of 1, and padding of 1; the batch normalization layer is given by:

y = (x − E(x)) / √(Var(x) + ε)

where E(·) and Var(·) denote the mean and variance operations, respectively;
the nonlinear activation function is the ReLU function, given by:
f(z) = max(0, z);
the convolution kernel size of convolution layer II is 1 × 1, the convolution step size is 1, and padding is 1.
3. The single-image super-resolution reconstruction method based on a hierarchical progressive network according to claim 1, wherein the upsampling layer performs interpolated upsampling on the input low-resolution image using bicubic interpolation.
Priority Applications (1)

| Application Number | Priority Date | Filing Date | Title |
|---|---|---|---|
| CN201910146330.4A | 2019-02-27 | 2019-02-27 | Single-image super-resolution reconstruction method based on hierarchical progressive network |
Publications (2)

| Publication Number | Publication Date |
|---|---|
| CN110047038A | 2019-07-23 |
| CN110047038B | 2022-11-04 |
Family
ID=67274284

Family Applications (1)

| Application Number | Title | Priority Date | Filing Date |
|---|---|---|---|
| CN201910146330.4A | Single-image super-resolution reconstruction method based on hierarchical progressive network | 2019-02-27 | 2019-02-27 |

Country Status (1)

| Country | Link |
|---|---|
| CN | CN110047038B |
Families Citing this family (3)

| Publication number | Priority date | Publication date | Assignee | Title |
|---|---|---|---|---|
| CN111260552B | 2020-01-09 | 2023-05-30 | 复旦大学 | Progressive learning-based image super-resolution method |
| CN113538307B | 2021-06-21 | 2023-06-20 | 陕西师范大学 | Synthetic aperture imaging method based on multi-view super-resolution depth network |
| CN113610706A | 2021-07-19 | 2021-11-05 | 河南大学 | Fuzzy monitoring image super-resolution reconstruction method based on convolutional neural network |
Citations (2)

| Publication number | Priority date | Publication date | Assignee | Title |
|---|---|---|---|---|
| CN107657586A | 2017-10-13 | 2018-02-02 | 深圳市唯特视科技有限公司 | Single-photo super-resolution enhancement method based on deep residual network |
| CN109118432A | 2018-09-26 | 2019-01-01 | 福建帝视信息科技有限公司 | Image super-resolution reconstruction method based on fast recurrent convolutional network |
Legal Events

| Code | Title |
|---|---|
| PB01 | Publication |
| SE01 | Entry into force of request for substantive examination |
| GR01 | Patent grant |