CN112991173B - Single-frame image super-resolution reconstruction method based on dual-channel feature migration network - Google Patents

Single-frame image super-resolution reconstruction method based on dual-channel feature migration network

Info

Publication number
CN112991173B
Authority
CN
China
Prior art keywords
feature
correction
fusion
features
resolution
Prior art date
Legal status
Active
Application number
CN202110268450.9A
Other languages
Chinese (zh)
Other versions
CN112991173A (en)
Inventor
秦翰林
乐阳
延翔
冯冬竹
姚迪
梁毅
李莹
张嘉伟
杨硕闻
马琳
周慧鑫
Current Assignee
Xidian University
Original Assignee
Xidian University
Priority date
Filing date
Publication date
Application filed by Xidian University filed Critical Xidian University
Priority to CN202110268450.9A priority Critical patent/CN112991173B/en
Publication of CN112991173A publication Critical patent/CN112991173A/en
Application granted granted Critical
Publication of CN112991173B publication Critical patent/CN112991173B/en

Classifications

    • G PHYSICS
    • G06 COMPUTING; CALCULATING OR COUNTING
    • G06T IMAGE DATA PROCESSING OR GENERATION, IN GENERAL
    • G06T3/00 Geometric image transformations in the plane of the image
    • G06T3/40 Scaling of whole images or parts thereof, e.g. expanding or contracting
    • G06T3/4053 Scaling of whole images or parts thereof, e.g. expanding or contracting based on super-resolution, i.e. the output image resolution being higher than the sensor resolution
    • G PHYSICS
    • G06 COMPUTING; CALCULATING OR COUNTING
    • G06F ELECTRIC DIGITAL DATA PROCESSING
    • G06F17/00 Digital computing or data processing equipment or methods, specially adapted for specific functions
    • G06F17/10 Complex mathematical operations
    • G06F17/15 Correlation function computation including computation of convolution operations
    • G06F17/153 Multidimensional correlation or convolution
    • G PHYSICS
    • G06 COMPUTING; CALCULATING OR COUNTING
    • G06F ELECTRIC DIGITAL DATA PROCESSING
    • G06F18/00 Pattern recognition
    • G06F18/20 Analysing
    • G06F18/25 Fusion techniques
    • G06F18/253 Fusion techniques of extracted features
    • G PHYSICS
    • G06 COMPUTING; CALCULATING OR COUNTING
    • G06N COMPUTING ARRANGEMENTS BASED ON SPECIFIC COMPUTATIONAL MODELS
    • G06N3/00 Computing arrangements based on biological models
    • G06N3/02 Neural networks
    • G06N3/04 Architecture, e.g. interconnection topology
    • G06N3/047 Probabilistic or stochastic networks
    • G PHYSICS
    • G06 COMPUTING; CALCULATING OR COUNTING
    • G06N COMPUTING ARRANGEMENTS BASED ON SPECIFIC COMPUTATIONAL MODELS
    • G06N3/00 Computing arrangements based on biological models
    • G06N3/02 Neural networks
    • G06N3/08 Learning methods

Landscapes

  • Engineering & Computer Science (AREA)
  • Physics & Mathematics (AREA)
  • Theoretical Computer Science (AREA)
  • General Physics & Mathematics (AREA)
  • Data Mining & Analysis (AREA)
  • General Engineering & Computer Science (AREA)
  • Mathematical Physics (AREA)
  • Evolutionary Computation (AREA)
  • Software Systems (AREA)
  • Artificial Intelligence (AREA)
  • Life Sciences & Earth Sciences (AREA)
  • Computing Systems (AREA)
  • Biomedical Technology (AREA)
  • General Health & Medical Sciences (AREA)
  • Mathematical Analysis (AREA)
  • Health & Medical Sciences (AREA)
  • Mathematical Optimization (AREA)
  • Molecular Biology (AREA)
  • Pure & Applied Mathematics (AREA)
  • Biophysics (AREA)
  • Computational Linguistics (AREA)
  • Computational Mathematics (AREA)
  • Algebra (AREA)
  • Databases & Information Systems (AREA)
  • Probability & Statistics with Applications (AREA)
  • Bioinformatics & Cheminformatics (AREA)
  • Bioinformatics & Computational Biology (AREA)
  • Computer Vision & Pattern Recognition (AREA)
  • Evolutionary Biology (AREA)
  • Image Processing (AREA)
  • Image Analysis (AREA)

Abstract

The invention discloses a single-frame image super-resolution reconstruction method based on a dual-channel feature migration network. It addresses two problems of deep super-resolution image reconstruction networks: detail information is lost as the network depth increases, and local receptive fields do not fully exploit global-local texture similarity. A network model is designed in which the whole network is based on a residual mechanism and a post-upsampling module upsamples the image on the spatial scale to output the super-resolution reconstruction result y. The invention preserves the effective distribution of spatially repeated texture features, reuses features that would otherwise fall out of use through residual structures, effectively prevents detail features from vanishing during forward propagation through the network, and significantly improves the super-resolution reconstruction quality of single-frame images.

Description

Single-frame image super-resolution reconstruction method based on dual-channel feature migration network
Technical Field
The invention belongs to the field of super-resolution reconstruction of single-frame images, and particularly relates to a single-frame image super-resolution reconstruction method based on a dual-channel feature migration network.
Background
The purpose of single-frame image super-resolution reconstruction (SISR) is to obtain a high-resolution image from a low-resolution input, recovering the detail and texture information of the image as far as possible while increasing its spatial resolution. From the viewpoint of signal entropy this is an entropy-increasing process: without reference images or prior knowledge, reconstructing the original real signal from a lossy, degraded signal is an ill-posed problem. From early interpolation-based methods to current learning-based methods, because the human eye is insensitive to high-frequency detail, these algorithms optimize the reconstructed image only from the viewpoint of visual features, neglect the rich textures and high-frequency details of the real image, and ignore the non-local similarity of spatial textures; SISR methods therefore often suffer from low reconstruction accuracy, insufficient use of data features, and loss of texture detail. A high-accuracy single-frame image super-resolution reconstruction method has great application value in biomedicine, security monitoring, image processing, pattern recognition, computer vision and other fields.
To address the low reconstruction accuracy, insufficient use of feature information and loss of texture detail of existing reconstruction algorithms, and with the continuous development of deep learning in recent years, deep neural networks have been introduced into the SISR problem in order to solve it with high quality, reduce the computational complexity of reconstruction, and improve the scene adaptability of the algorithm. However, a plain neural network only achieves good reconstruction performance at extremely large network depth, and convolution-based methods act only on local receptive fields of the image, so simply deepening the network can reconstruct only results optimized for human visual perception, which often perform poorly on advanced pattern recognition tasks. Reducing the network depth while fully exploiting the feature information in the low-resolution image, under a reasonable assumption of global image autocorrelation, therefore promises a major breakthrough for the SISR problem and advances its deployment in production and daily life.
Disclosure of Invention
Therefore, the main objective of the present invention is to provide a single-frame image super-resolution reconstruction method based on a dual-channel feature migration network.
In order to achieve the above purpose, the technical scheme of the invention is realized as follows:
the invention discloses a single-frame image super-resolution reconstruction method based on a dual-channel feature migration network, which comprises the following steps:
a convolution operation sf(·) is applied to the input original low-resolution image x to obtain the shallow feature F1 of the image;
the shallow feature F1 is passed through a non-local correlation correction module h(·) for feature channel correction to obtain the correction feature F2;
the correction feature F2 is passed through the dual-channel feature migration module RSCAB to output an intermediate feature map, whose signals are summed with the correction feature F2 at corresponding spatial positions to output the intermediate feature F3;
the intermediate feature F3 is fused through one convolution layer Conv(·) to output the fused feature F4;
the fused feature F4 is passed through a non-local correlation correction module T(·) to output the corrected fusion feature F5;
the corrected fusion feature F5 yields the deep feature F6, whose signals are summed with the shallow feature F1 at corresponding spatial positions through a residual connection to obtain the feature map to be reconstructed F7;
the feature map to be reconstructed F7 is upsampled by the reconstruction module to output the super-resolution reconstruction result y.
In the above scheme, the convolution operation sf(·) is applied to the input original low-resolution image x to obtain the shallow feature F1 of the image, specifically: 128 convolution kernels of size 3×3 with stride 1 are used, and the edges of the convolution map are zero-padded so that it keeps the same spatial size as the original image. This process can be expressed as equation (1):
F1 = sf(x) = ReLU(Conv3×3(x))   (1)
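A minimal PyTorch sketch of this shallow feature extraction step may help make the shapes concrete (the class and variable names are illustrative, not from the patent; only the kernel count, size, stride and padding follow the text):

```python
import torch
import torch.nn as nn

class ShallowFeatureExtractor(nn.Module):
    """sf(.): 128 kernels of size 3x3, stride 1, zero padding, followed by ReLU."""
    def __init__(self, in_channels: int = 3, channels: int = 128):
        super().__init__()
        self.conv = nn.Conv2d(in_channels, channels, kernel_size=3, stride=1, padding=1)
        self.relu = nn.ReLU(inplace=True)

    def forward(self, x: torch.Tensor) -> torch.Tensor:
        # F1 = ReLU(Conv3x3(x)); padding=1 keeps the spatial size of x
        return self.relu(self.conv(x))

# usage: x is a low-resolution batch [N, 3, H, W] -> F1 is [N, 128, H, W]
f1 = ShallowFeatureExtractor()(torch.randn(1, 3, 32, 32))
```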
in the above scheme, the non-local correlation correction module h (·) corrects the shallow feature F 1 Performing characteristic channel correction to obtain correction characteristic F 2 The method specifically comprises the following steps: for the original uncorrected shallow features F 1 The spatial shape of which is represented by [ C, H, W ]]Respectively represent the channel number C, the longitudinal signal number H and the transverse signal number W, and for the channel number H and the transverse signal number W]The 2X 2 area in the dimension takes out the corresponding signals and reconstructs 4 feature subgraphs, and the 4 recombined shapes are [ C, H/2, W/2 ]]Respectively inputting the feature subgraphs of the (4) sub-graphs into an N-L module to obtain 4 output subgraphs, and recombining the 4 subgraphs in space dimension according to the arrangement before subgraph division and then combining with the original feature graph F 1 The signal quantity at the corresponding space position is summed up to obtain a corrected characteristic diagram F 2
In the above scheme, the N-L module within the non-local correlation correction module h(·) corrects the features as follows: for an input feature map F1 of shape [C, H, W], 3 convolution kernels of shape [C, C/2, 1, 1] are used to convolve it, yielding 3 feature maps of shape [C/2, H×W]; the elements of each feature map are rearranged to obtain the two-dimensional feature matrices M1, M2, M3, as shown in equation (2):
Mi = reshape(Wi ∗ F1), i = 1, 2, 3   (2)
The feature matrix M2 is transposed and matrix-multiplied with the feature matrix M1 to obtain the correlation matrix M_rel = M2ᵀ · M1 of shape [H×W, H×W]; the correlation matrix is then activated with a Softmax activation function, mapping M_rel to M'_rel, as shown in equation (3), where (i, j) denotes the signal value of M'_rel at position (i, j):
M'_rel = Softmax(M_rel)   (3)
The matrix M3 is transposed and matrix-multiplied with M'_rel to obtain the spatial attention correction matrix:
M_attention-fix = M'_rel · M3ᵀ   (4)
A convolution kernel of shape [C/2, C, 1, 1] raises the dimension of the spatial attention correction matrix, and the raised matrix is point-multiplied (⊙) with the original input feature map at corresponding spatial positions to obtain the corrected feature map:
F2 = F1 ⊙ ReLU(Conv1×1(M_attention-fix))   (5)
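Read literally from the stated shapes, the N-L module is a standard non-local attention block; a sketch under those assumptions (the softmax axis and the multiplication order in equation (4) are inferred from the matrix shapes, and the names are illustrative):

```python
import torch
import torch.nn as nn

class NLModule(nn.Module):
    """Non-local spatial similarity correction, eqs. (2)-(5)."""
    def __init__(self, channels: int = 128):
        super().__init__()
        half = channels // 2
        self.to_m1 = nn.Conv2d(channels, half, kernel_size=1)
        self.to_m2 = nn.Conv2d(channels, half, kernel_size=1)
        self.to_m3 = nn.Conv2d(channels, half, kernel_size=1)
        self.expand = nn.Conv2d(half, channels, kernel_size=1)  # dimension raising C/2 -> C
        self.relu = nn.ReLU(inplace=True)

    def forward(self, x: torch.Tensor) -> torch.Tensor:
        n, c, h, w = x.shape
        m1 = self.to_m1(x).flatten(2)                         # M1: [N, C/2, H*W]
        m2 = self.to_m2(x).flatten(2)                         # M2: [N, C/2, H*W]
        m3 = self.to_m3(x).flatten(2)                         # M3: [N, C/2, H*W]
        rel = torch.softmax(m2.transpose(1, 2) @ m1, dim=-1)  # M'_rel: [N, H*W, H*W]
        fix = (rel @ m3.transpose(1, 2)).transpose(1, 2)      # M_attention-fix: [N, C/2, H*W]
        fix = fix.view(n, c // 2, h, w)
        return x * self.relu(self.expand(fix))                # eq. (5): x (.) ReLU(Conv1x1(.))
```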
In the above scheme, the correction feature F2 is passed through the dual-channel feature migration module RSCAB to output an intermediate feature map, whose signals are summed with the correction feature F2 at corresponding spatial positions to output the intermediate feature F3, specifically: for the input correction feature F2 of shape [C, H, W], two convolution kernels of shape [C, C, 1, 1] are used to obtain two feature maps of shape [C, H, W], expressed as: Sf = Conv1×1(F2), Cf = Conv1×1(F2);
Sf is transformed to obtain the spatial feature transformation map S'f of shape [C, H, W];
Cf is processed by global average pooling to obtain the average statistical vector Vc of the channel dimension; Vc is nonlinearly activated with the Softmax activation function and mapped with a three-layer fully connected network to obtain the correction vector V'c, which can be expressed as: V'c = fc2(Softmax(fc1(Softmax(Vc)))); the input feature map Cf is then corrected with V'c as weight to obtain C'f = Cf · V'c;
S'f and C'f are fused using a convolution kernel of shape [2C, C, 1, 1], where [S'f, C'f] denotes their channel concatenation, and the input feature F2 is fused through a residual connection, as shown in equation (6), to obtain the high-frequency correction feature:
F3 = F2 + Conv1×1([S'f, C'f]) = F2 + RSCAB(F2)   (6)
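A sketch of the RSCAB block under the shapes given above (the exact spatial-branch transform producing S'f, the concatenation feeding the [2C, C, 1, 1] fusion convolution, and the width of the fully connected layers are assumptions):

```python
import torch
import torch.nn as nn

class RSCAB(nn.Module):
    """Dual-channel (spatial + channel) attention block, eqs. (6)-(8)."""
    def __init__(self, channels: int = 128, reduction: int = 4):
        super().__init__()
        self.spatial = nn.Conv2d(channels, channels, kernel_size=1)   # Sf branch
        self.channel = nn.Conv2d(channels, channels, kernel_size=1)   # Cf branch
        self.pool = nn.AdaptiveAvgPool2d(1)                           # global average pool -> Vc
        self.fc1 = nn.Linear(channels, channels // reduction)
        self.fc2 = nn.Linear(channels // reduction, channels)
        self.fuse = nn.Conv2d(2 * channels, channels, kernel_size=1)  # [2C, C, 1, 1] fusion

    def forward(self, x: torch.Tensor) -> torch.Tensor:
        n, c, _, _ = x.shape
        s = self.spatial(x)                                  # S'f (transform kept as 1x1 conv)
        cf = self.channel(x)                                 # Cf
        v = self.pool(cf).view(n, c)                         # Vc
        v = self.fc2(torch.softmax(self.fc1(torch.softmax(v, dim=1)), dim=1))  # V'c, eq. (7)
        cfix = cf * v.view(n, c, 1, 1)                       # C'f = Cf * V'c
        return x + self.fuse(torch.cat([s, cfix], dim=1))    # eq. (6)/(8): F + RSCAB(F)
```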
In the above scheme, the intermediate feature F3 is fused through one convolution layer Conv(·) to output the fused feature F4, specifically: for the input intermediate feature F3 of shape [C, H, W], two convolution kernels of shape [C, C, 1, 1] are used to obtain two feature maps of shape [C, H, W], expressed as: Sf = Conv1×1(F3), Cf = Conv1×1(F3);
Sf is transformed to obtain the spatial feature transformation map S'f of shape [C, H, W];
Cf is processed by global average pooling to obtain the average statistical vector Vc of the channel dimension; Vc is nonlinearly activated with the Softmax activation function and mapped with a three-layer fully connected network to obtain the correction vector V'c, as shown in equation (7):
V'c = fc2(Softmax(fc1(Softmax(Vc))))   (7)
The input feature map Cf is corrected with V'c as weight to obtain C'f = Cf · V'c;
S'f and C'f are fused using a convolution kernel of shape [2C, C, 1, 1], and the input feature F3 is fused through a residual connection to obtain the fused feature F4, calculated as shown in equation (8):
F4 = F3 + ReLU(Conv1×1([S'f, C'f])) = F3 + RSCAB(F3)   (8)
In the above scheme, the fused feature F4 is passed through a non-local correlation correction module T(·) to output the corrected fusion feature F5, specifically: the fused feature F4, which fuses the upper-layer feature F3 through a residual connection, is nonlinearly mapped by a single-layer convolution to obtain F5; 128 convolution kernels of size 3×3 with stride 1 are used, and the edges of the convolution map are zero-padded so that it keeps the same spatial size as the upper-layer input feature. The process is shown in equation (9):
F5 = ReLU(Conv3×3(F4))   (9)
In the above scheme, the corrected fusion feature F5 is subjected to spatial feature similarity detection to obtain the deep feature F6, which is fused with the shallow feature F1 through a residual connection to obtain the feature map to be reconstructed F7. The process is shown in equation (10):
F7 = F1 + F6 = F1 + F5 ⊙ ReLU(Conv1×1(M_attention-fix))   (10)
In the above scheme, the feature map to be reconstructed F7 is upsampled by the reconstruction module to output the super-resolution reconstruction result y, specifically: the spatial scale of the feature map to be reconstructed F7 is transformed by a reconstruction module consisting of an upsampling module and a convolution layer; the upsampling module H↑, of shape [C, C/(s×s), 3, 3], enlarges the spatial scale of the input feature map by s×s times, and a convolution of shape [C/(s×s), 3, 3, 3] performs fusion optimization on the upsampled feature map to obtain the super-resolution reconstructed image. The process is shown in equation (11):
y = Conv3×3(H↑(F7))   (11)
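The stated shapes ([C, C/(s×s), 3, 3] for the upsampling module, then a convolution down to 3 image channels) are consistent with a pixel-shuffle realization; a sketch under that assumption:

```python
import torch
import torch.nn as nn

class ReconstructionModule(nn.Module):
    """Eq. (11): upsampling module H↑ (C -> C/(s*s) channels, s x s larger spatially),
    then a 3x3 convolution producing the 3-channel super-resolved image."""
    def __init__(self, channels: int = 128, scale: int = 4):
        super().__init__()
        assert channels % (scale * scale) == 0
        self.upsample = nn.PixelShuffle(scale)  # [C, H, W] -> [C/s^2, sH, sW]
        self.conv = nn.Conv2d(channels // (scale * scale), 3, kernel_size=3, padding=1)

    def forward(self, f7: torch.Tensor) -> torch.Tensor:
        return self.conv(self.upsample(f7))     # y = Conv3x3(H↑(F7))

# usage: F7 [1, 128, 32, 32] -> y [1, 3, 128, 128] for scale factor s = 4
y = ReconstructionModule()(torch.randn(1, 128, 32, 32))
```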
The model is trained with the loss function shown in equation (12), where y denotes the image data in the real training data set and α and δ are adaptive adjustment parameters set according to the training convergence state; this process optimizes the network parameters in an end-to-end manner.
Compared with the prior art, the present invention designs the network model around two problems of deep super-resolution image reconstruction networks: detail information is lost as network depth increases, and local receptive fields do not fully exploit global-local texture similarity. The whole network is based on a residual mechanism, and a post-upsampling module upsamples the image on the spatial scale. Non-local correlation correction modules are embedded at the front and rear ends of the network, which markedly reduces the number of convolution layers needed for shallow feature extraction and high-level feature fusion while clearly correcting the spatial non-local correlation features. The intermediate high-frequency feature reconstruction part is based on a spatial and channel dual-attention module: for the three-dimensional spatial features of the intermediate feature layers, a learner structure is designed to purposefully re-weight the channel features according to their importance, and non-local similarity is computed over similar texture features in the spatial dimension, preserving the effective distribution of spatially repeated texture features. Residual structures reuse features that would otherwise fall out of use, raising the context-consistency level of the network, effectively preventing detail features from vanishing during forward propagation, and significantly improving the super-resolution reconstruction quality of single-frame images.
Drawings
FIG. 1 is a model diagram of the single-frame image super-resolution reconstruction network based on the dual-channel feature migration network;
FIG. 2 is a network model diagram of the non-local correlation correction module unit of the present invention;
FIG. 3 is a network model diagram of the spatial similarity feature detection module (N-L module) unit within the non-local correlation correction module of the present invention;
FIG. 4 is a network model diagram of the RSCAB module unit of the present invention;
FIG. 5 is a graph of the image super-resolution reconstruction result at a downsampling rate of 0.0625 (1/4²);
FIG. 6 is an original infrared image of scene one and the infrared super-resolution reconstructed image processed by the present invention;
FIG. 7 is an original infrared image of scene two and the infrared super-resolution reconstructed image processed by the method.
Detailed Description
The present invention will be described in further detail with reference to the drawings and examples, in order to make the objects, technical solutions and advantages of the present invention more apparent. It should be understood that the specific embodiments described herein are for purposes of illustration only and are not intended to limit the scope of the invention.
The embodiment of the invention provides a single-frame image super-resolution reconstruction method based on a dual-channel feature migration network, which is realized by the following steps:
Step 1: a convolution operation sf(·) is applied to the input original low-resolution image x to obtain the shallow feature F1 of the image.
Specifically, the shallow feature F1 of the image is obtained by convolving the input original low-resolution image x with sf(·); 128 convolution kernels of size 3×3 with stride 1 are used, and the edges of the convolution map are zero-padded so that it keeps the same spatial size as the original image. This process can be expressed as equation (1):
F1 = sf(x) = ReLU(Conv3×3(x))   (1)
Step 2: the shallow feature F1 is passed through the non-local correlation correction module h(·) for feature channel correction to obtain the correction feature F2.
Specifically, the extracted shallow feature F1 is passed through the non-local correlation correction module to obtain the corrected correction feature F2.
The original uncorrected feature map F1 has spatial shape [C, H, W], where C, H and W denote the number of channels, the number of vertical signals and the number of horizontal signals respectively; for each 2×2 region in the [H, W] dimensions, the corresponding signals are taken out and recombined into 4 feature subgraphs, as shown in FIG. 2.
The 4 recombined feature subgraphs of shape [C, H/2, W/2] are fed into the N-L module separately to obtain 4 output subgraphs; the 4 output subgraphs are recombined in the spatial dimension according to their arrangement before division and then summed with the original feature map F1 at corresponding spatial positions to obtain the corrected feature map F2.
The N-L module detects the spatially similar features of a feature subgraph and corrects the similar features according to the optimization target, specifically:
For an input feature subgraph x1 of shape [C, H, W], 3 convolution kernels Wi (i = 1, 2, 3) of shape [C, C/2, 1, 1] are used to convolve it, yielding three feature maps of shape [C/2, H×W]; the elements of each feature map are rearranged to obtain the two-dimensional feature matrices M1, M2, M3.
The feature matrix M2 is transposed and matrix-multiplied with the feature matrix M1 to obtain the correlation matrix M_rel = M2ᵀ · M1 of shape [H×W, H×W]; the correlation matrix is then activated with a Softmax activation function, mapping M_rel to M'_rel = Softmax(M_rel), calculated as M'_rel[i, j] = exp(M_rel[i, j]) / Σ_j exp(M_rel[i, j]), where [i, j] denotes the value of M'_rel at position [i, j].
The matrix M3 is transposed and matrix-multiplied with M'_rel to obtain the spatial attention correction matrix M_attention-fix = M'_rel · M3ᵀ.
A convolution kernel of shape [C/2, C, 1, 1] raises the dimension of the spatial attention correction matrix, and the raised matrix is point-multiplied with the original input feature map at corresponding spatial positions to obtain the corrected feature map x'1 = x1 ⊙ Conv1×1(M_attention-fix).
For the input feature map F1 of shape [C, H, W], 3 convolution kernels of shape [C, C/2, 1, 1] are used to obtain 3 feature maps of shape [C/2, H×W]; the elements of each feature map are rearranged to obtain the two-dimensional feature matrices M1, M2, M3, as shown in equation (2):
Mi = reshape(Wi ∗ F1), i = 1, 2, 3   (2)
The feature matrix M2 is transposed and matrix-multiplied with the feature matrix M1 to obtain the correlation matrix M_rel of shape [H×W, H×W]; the correlation matrix is then activated with a Softmax activation function, mapping M_rel to M'_rel, as shown in equation (3), where (i, j) denotes the signal value of M'_rel at position (i, j):
M'_rel = Softmax(M_rel)   (3)
The matrix M3 is transposed and matrix-multiplied with M'_rel to obtain the spatial attention correction matrix:
M_attention-fix = M'_rel · M3ᵀ   (4)
A convolution kernel of shape [C/2, C, 1, 1] raises the dimension of the spatial attention correction matrix, and the raised matrix is point-multiplied with the original input feature map at corresponding spatial positions to obtain the corrected feature map:
F2 = F1 ⊙ ReLU(Conv1×1(M_attention-fix))   (5)
Step 3: the correction feature F2 is passed through the dual-channel feature migration module RSCAB to output an intermediate feature map, whose signals are summed with the correction feature F2 at corresponding spatial positions to output the intermediate feature F3.
Specifically, a one-layer convolution of shape [C, C, 3, 3] performs feature fusion on the correction feature F2 corrected by the non-local correlation correction module, and the result is handed to the RSCAB module for processing to obtain the intermediate feature F3, with the correction feature F2 delivered through a residual connection.
The RSCAB module optimizes the spatial high-frequency information of the correction feature F2 and outputs the high-frequency correction feature F3 with optimized high-frequency detail, specifically:
For the input correction feature F2 of shape [C, H, W], two convolution kernels of shape [C, C, 1, 1] are used to obtain two feature maps of shape [C, H, W], expressed as: Sf = Conv1×1(F2), Cf = Conv1×1(F2);
Sf is transformed to obtain the spatial feature transformation map S'f of shape [C, H, W];
Cf is processed by global average pooling to obtain the average statistical vector Vc of the channel dimension; Vc is nonlinearly activated with the Softmax activation function and mapped with a three-layer fully connected network to obtain the correction vector V'c, which can be expressed as: V'c = fc2(Softmax(fc1(Softmax(Vc)))); the input feature map Cf is corrected with V'c as weight to obtain C'f = Cf · V'c;
S'f and C'f are fused using a convolution kernel of shape [2C, C, 1, 1], and the input feature F2 is fused through a residual connection, as shown in equation (6), to obtain the high-frequency correction feature:
F3 = F2 + Conv1×1([S'f, C'f]) = F2 + RSCAB(F2)   (6)
Step 4: the intermediate feature F3 is fused through one convolution layer Conv(·) to output the fused feature F4.
Specifically, the spatial-channel dual-attention module extracts spatial and scale features from the intermediate feature F3 and, after spatial-channel correction and fusion, outputs the fused feature F4.
For the input feature F3 of shape [C, H, W], two convolution kernels of shape [C, C, 1, 1] are used to obtain two feature maps of shape [C, H, W], expressed as: Sf = Conv1×1(F3), Cf = Conv1×1(F3);
Sf is transformed to obtain the spatial feature transformation map S'f of shape [C, H, W];
Cf is first processed by global average pooling to obtain the average statistical vector Vc of the channel dimension; Vc is nonlinearly activated with the Softmax activation function and mapped with a three-layer fully connected network to obtain the correction vector V'c, as shown in equation (7):
V'c = fc2(Softmax(fc1(Softmax(Vc))))   (7)
The input feature map Cf is corrected with V'c as weight to obtain C'f = Cf · V'c;
S'f and C'f are fused using a convolution kernel of shape [2C, C, 1, 1], and the intermediate feature F3 is fused through a residual connection to obtain the fused feature F4, calculated as shown in equation (8):
F4 = F3 + ReLU(Conv1×1([S'f, C'f])) = F3 + RSCAB(F3)   (8)
Step 5: the fused feature F4 is passed through a non-local correlation correction module T(·) to output the corrected fusion feature F5.
Specifically, the fused feature F4, which fuses the intermediate feature F3 through a residual connection, is nonlinearly mapped by a single-layer convolution to obtain the fusion feature F5; 128 convolution kernels of size 3×3 with stride 1 are used, and the edges of the convolution map are zero-padded so that it keeps the same spatial size as the upper-layer input feature. The process is shown in equation (9):
F5 = ReLU(Conv3×3(F4))   (9)
Step 6: the corrected fusion feature F5 yields the deep feature F6, whose signals are summed with the shallow feature F1 at corresponding spatial positions through a residual connection to obtain the feature map to be reconstructed F7.
Specifically, the non-local correlation correction module of step 2 performs spatial feature similarity detection on the fusion feature F5 to obtain the deep feature F6, which is fused with the shallow feature F1 through a residual connection to obtain the feature map to be reconstructed F7. The process is shown in equation (10):
F7 = F1 + F6 = F1 + F5 ⊙ ReLU(Conv1×1(M_attention-fix))   (10)
Step 7: the feature map to be reconstructed F7 is upsampled by the reconstruction module to output the super-resolution reconstruction result y.
Specifically, the spatial scale of the feature map to be reconstructed F7 is transformed by a reconstruction module consisting of an upsampling module and a convolution layer. The upsampling module H↑, of shape [C, C/(s×s), 3, 3], enlarges the spatial scale of the input feature map by s×s times, and a convolution of shape [C/(s×s), 3, 3, 3] performs fusion optimization on the upsampled feature map to obtain the super-resolution reconstructed image. The process is shown in equation (11):
y = Conv3×3(H↑(F7))   (11)
The model is trained with the loss function shown in equation (12), where y denotes the image data in the real training data set and α and δ are adaptive adjustment parameters set according to the training convergence state; this process optimizes the network parameters in an end-to-end manner.
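Putting the pieces together, a minimal end-to-end sketch of the forward pass of steps 1 to 7, reusing the module sketches given earlier; equation (12) is not reproduced in the text, so a plain L1 loss stands in for illustration:

```python
import torch
import torch.nn as nn

class DualChannelSRNet(nn.Module):
    """Wiring of F1..F7 -> y as described in steps 1-7 (a sketch, not the patent's exact model)."""
    def __init__(self, channels: int = 128, scale: int = 4):
        super().__init__()
        self.sf = ShallowFeatureExtractor(3, channels)               # step 1
        self.nl = NLModule(channels)                                 # used in steps 2 and 6 (weight sharing is a sketch simplification)
        self.fuse_in = nn.Conv2d(channels, channels, 3, padding=1)   # step 3 [C, C, 3, 3] conv
        self.rscab1 = RSCAB(channels)                                # step 3
        self.rscab2 = RSCAB(channels)                                # step 4
        self.conv5 = nn.Sequential(nn.Conv2d(channels, channels, 3, padding=1), nn.ReLU(True))
        self.rebuild = ReconstructionModule(channels, scale)         # step 7

    def forward(self, x: torch.Tensor) -> torch.Tensor:
        f1 = self.sf(x)                            # F1
        f2 = nonlocal_correction(f1, self.nl)      # F2 (step 2, residual inside)
        f3 = self.rscab1(self.fuse_in(f2))         # F3 (step 3, residual inside RSCAB)
        f4 = self.rscab2(f3)                       # F4 (step 4)
        f5 = self.conv5(f4)                        # F5 (step 5), eq. (9)
        f7 = f1 + self.nl(f5)                      # F7 = F1 + F6, eq. (10) (subgraph split omitted for brevity)
        return self.rebuild(f7)                    # y, eq. (11)

# illustrative training step; the adaptive alpha/delta loss of eq. (12) is assumed away
net = DualChannelSRNet()
lr, hr = torch.randn(2, 3, 32, 32), torch.randn(2, 3, 128, 128)
nn.functional.l1_loss(net(lr), hr).backward()
```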
To demonstrate the effectiveness of the method, super-resolution reconstruction experiments were carried out on visible-light, infrared and terahertz images; FIGS. 5, 6 and 7 respectively show the super-resolution reconstruction results of the method at a 4×4 downsampling rate.
FIG. 5 shows the super-resolution reconstruction result of the method of the invention on a terahertz image; the overall quality of the super-resolution result image is improved compared with the original downsampled image, the vein profile of the leaf is clearer, and the local texture detail of the epidermis is more pronounced.
FIG. 6 shows the super-resolution reconstruction result of the method of the present invention on an infrared image; the overall quality of the super-resolution image is improved compared with the original downsampled image, and the contour and detail features are effectively enhanced.
FIG. 7 shows the super-resolution reconstruction result of the method on a visible-light image; the overall look and quality of the super-resolution result image are clearly improved compared with the original downsampled image, the contours and detail information of buildings, vehicles and ground facilities in the figure are restored, and the characters are clearer and more discernible.
The same or similar reference numerals in the drawings of this embodiment correspond to the same or similar components. In the description of the present invention, it should be understood that the orientations or positional relationships indicated by terms such as "upper", "lower", "left", "right", "inner" and "outer" are based on the orientations or positional relationships shown in the drawings, are merely for convenience of describing the present invention and simplifying the description, and do not indicate or imply that the devices or elements referred to must have specific orientations or be constructed and operated in specific orientations; the terms describing positional relationships in the drawings are therefore for illustration only, are not to be construed as limiting this patent, and their specific meanings can be understood by those of ordinary skill in the art according to the specific circumstances.
It should be noted that, in this document, the terms "comprises", "comprising" or any other variation thereof are intended to cover a non-exclusive inclusion, such that a process, article or apparatus that comprises a list of elements includes not only those elements but may also include other elements not expressly listed or inherent to such process, article or apparatus. Without further limitation, an element defined by the phrase "comprising a … …" does not exclude the presence of other like elements in a process, article or apparatus that comprises the element.
The foregoing description is only of the preferred embodiments of the present invention, and is not intended to limit the scope of the present invention.

Claims (8)

1. A single-frame image super-resolution reconstruction method based on a dual-channel feature migration network, characterized by comprising the following steps:
a convolution operation sf(·) is applied to the input original low-resolution image x to obtain the shallow feature F1 of the image;
the shallow feature F1 is passed through a non-local correlation correction module h(·) for feature channel correction to obtain the correction feature F2;
the correction feature F2 is passed through the dual-channel feature migration module RSCAB to output an intermediate feature map, whose signals are summed with the correction feature F2 at corresponding spatial positions to output the intermediate feature F3;
the intermediate feature F3 is fused through one convolution layer Conv(·) to output the fused feature F4;
the fused feature F4 is passed through a non-local correlation correction module T(·) to output the corrected fusion feature F5;
the corrected fusion feature F5 yields the deep feature F6, whose signals are summed with the shallow feature F1 at corresponding spatial positions through a residual connection to obtain the feature map to be reconstructed F7;
the feature map to be reconstructed F7 is upsampled by the reconstruction module to output the super-resolution reconstruction result y;
wherein passing the correction feature F2 through the dual-channel feature migration module RSCAB to output an intermediate feature map, summing its signals with the correction feature F2 at corresponding spatial positions, and outputting the intermediate feature F3 specifically comprises: for the input correction feature F2 of shape [C, H, W], two convolution kernels of shape [C, C, 1, 1] are used to obtain two feature maps of shape [C, H, W], expressed as: Sf = Conv1×1(F2), Cf = Conv1×1(F2);
Sf is transformed to obtain the spatial feature transformation map S'f of shape [C, H, W];
Cf is processed by global average pooling to obtain the average statistical vector Vc of the channel dimension; Vc is nonlinearly activated with the Softmax activation function and mapped with a three-layer fully connected network to obtain the correction vector V'c, which can be expressed as: V'c = fc2(Softmax(fc1(Softmax(Vc)))); the input feature map Cf is corrected with V'c as weight to obtain C'f = Cf · V'c;
S'f and C'f are fused using a convolution kernel of shape [2C, C, 1, 1], and the input feature F2 is fused through a residual connection, as shown in equation (6), to obtain the high-frequency correction feature:
F3 = F2 + Conv1×1([S'f, C'f]) = F2 + RSCAB(F2)   (6)
2. The single-frame image super-resolution reconstruction method based on a dual-channel feature migration network according to claim 1, characterized in that the shallow feature F1 of the image is obtained by applying the convolution operation sf(·) to the input original low-resolution image x, specifically: 128 convolution kernels of size 3×3 with stride 1 are used, and the edges of the convolution map are zero-padded so that it keeps the same spatial size as the original image; this process can be expressed as equation (1):
F1 = sf(x) = ReLU(Conv3×3(x))   (1)
3. The single-frame image super-resolution reconstruction method based on a dual-channel feature migration network according to claim 1 or 2, characterized in that passing the shallow feature F1 through the non-local correlation correction module h(·) for feature channel correction to obtain the correction feature F2 specifically comprises: the original uncorrected shallow feature F1 has spatial shape [C, H, W], where C, H and W denote the number of channels, the number of vertical signals and the number of horizontal signals respectively; for each 2×2 region in the [H, W] dimensions the corresponding signals are taken out and recombined into 4 feature subgraphs of shape [C, H/2, W/2]; the 4 feature subgraphs are fed into the N-L module separately to obtain 4 output subgraphs, which are recombined in the spatial dimension according to their arrangement before division and then summed with the original feature map F1 at corresponding spatial positions to obtain the corrected feature map F2.
4. The single-frame image super-resolution reconstruction method based on a dual-channel feature migration network according to claim 3, characterized in that passing the shallow feature F1 through the non-local correlation correction module h(·) for feature channel correction to obtain the correction feature F2 specifically comprises: for the input feature map F1 of shape [C, H, W], 3 convolution kernels of shape [C, C/2, 1, 1] are used to obtain 3 feature maps of shape [C/2, H×W]; the elements of each feature map are rearranged to obtain the two-dimensional feature matrices M1, M2, M3, as shown in equation (2):
Mi = reshape(Wi ∗ F1), i = 1, 2, 3   (2)
the feature matrix M2 is transposed and matrix-multiplied with the feature matrix M1 to obtain the correlation matrix M_rel = M2ᵀ · M1 of shape [H×W, H×W]; the correlation matrix is then activated with a Softmax activation function, mapping M_rel to M'_rel, as shown in equation (3), where (i, j) denotes the signal value of M'_rel at position (i, j):
M'_rel = Softmax(M_rel)   (3)
the matrix M3 is transposed and matrix-multiplied with M'_rel to obtain the spatial attention correction matrix:
M_attention-fix = M'_rel · M3ᵀ   (4)
a convolution kernel of shape [C/2, C, 1, 1] raises the dimension of the spatial attention correction matrix, and the raised matrix is point-multiplied with the original input feature map at corresponding spatial positions to obtain the corrected feature map:
F2 = F1 ⊙ ReLU(Conv1×1(M_attention-fix))   (5)
5. The single-frame image super-resolution reconstruction method based on a dual-channel feature migration network according to claim 4, characterized in that fusing the intermediate feature F3 through one convolution layer Conv(·) to output the fused feature F4 specifically comprises: for the input intermediate feature F3 of shape [C, H, W], two convolution kernels of shape [C, C, 1, 1] are used to obtain two feature maps of shape [C, H, W], expressed as: Sf = Conv1×1(F3), Cf = Conv1×1(F3);
Sf is transformed to obtain the spatial feature transformation map S'f of shape [C, H, W];
Cf is processed by global average pooling to obtain the average statistical vector Vc of the channel dimension; Vc is nonlinearly activated with the Softmax activation function and mapped with a three-layer fully connected network to obtain the correction vector V'c, as shown in equation (7):
V'c = fc2(Softmax(fc1(Softmax(Vc))))   (7)
the input feature map Cf is corrected with V'c as weight to obtain C'f = Cf · V'c;
S'f and C'f are fused using a convolution kernel of shape [2C, C, 1, 1], and the input feature F3 is fused through a residual connection to obtain the fused feature F4, calculated as shown in equation (8):
F4 = F3 + ReLU(Conv1×1([S'f, C'f])) = F3 + RSCAB(F3)   (8)
6. The single-frame image super-resolution reconstruction method based on a dual-channel feature migration network according to claim 5, characterized in that passing the fused feature F4 through a non-local correlation correction module T(·) to output the corrected fusion feature F5 specifically comprises: the fused feature F4, which fuses the upper-layer feature F3 through a residual connection, is nonlinearly mapped by a single-layer convolution to obtain F5; 128 convolution kernels of size 3×3 with stride 1 are used, and the edges of the convolution map are zero-padded so that it keeps the same spatial size as the upper-layer input feature, as shown in equation (9):
F5 = ReLU(Conv3×3(F4))   (9)
7. The single-frame image super-resolution reconstruction method based on a dual-channel feature migration network according to claim 6, characterized in that obtaining the deep feature F6 from the corrected fusion feature F5 and summing its signals with the shallow feature F1 at corresponding spatial positions through a residual connection to obtain the feature map to be reconstructed F7 specifically comprises: the fusion feature F5 is subjected to spatial feature similarity detection to obtain the deep feature F6, which is fused with the shallow feature F1 through a residual connection to obtain the feature map to be reconstructed F7, as shown in equation (10):
F7 = F1 + F6 = F1 + F5 ⊙ ReLU(Conv1×1(M_attention-fix))   (10)
8. The single-frame image super-resolution reconstruction method based on a dual-channel feature migration network according to claim 7, characterized in that upsampling the feature map to be reconstructed F7 by the reconstruction module to output the super-resolution reconstruction result y specifically comprises: the spatial scale of the feature map to be reconstructed F7 is transformed by a reconstruction module consisting of an upsampling module and a convolution layer; the upsampling module H↑, of shape [C, C/(s×s), 3, 3], enlarges the spatial scale of the input feature map by s×s times, and a convolution of shape [C/(s×s), 3, 3, 3] performs fusion optimization on the upsampled feature map to obtain the super-resolution reconstructed image, as shown in equation (11):
y = Conv3×3(H↑(F7))   (11)
the loss function is shown in equation (12), where y denotes the image data in the real training data set and α and δ are adaptive adjustment parameters set according to the training convergence state; the process optimizes the network parameters in an end-to-end manner.
CN202110268450.9A 2021-03-12 2021-03-12 Single-frame image super-resolution reconstruction method based on dual-channel feature migration network Active CN112991173B (en)

Priority Applications (1)

Application Number Priority Date Filing Date Title
CN202110268450.9A CN112991173B (en) 2021-03-12 2021-03-12 Single-frame image super-resolution reconstruction method based on dual-channel feature migration network

Applications Claiming Priority (1)

Application Number Priority Date Filing Date Title
CN202110268450.9A CN112991173B (en) 2021-03-12 2021-03-12 Single-frame image super-resolution reconstruction method based on dual-channel feature migration network

Publications (2)

Publication Number Publication Date
CN112991173A CN112991173A (en) 2021-06-18
CN112991173B (en) 2024-04-16

Family

ID=76335040

Family Applications (1)

Application Number Title Priority Date Filing Date
CN202110268450.9A Active CN112991173B (en) 2021-03-12 2021-03-12 Single-frame image super-resolution reconstruction method based on dual-channel feature migration network

Country Status (1)

Country Link
CN (1) CN112991173B (en)

Families Citing this family (9)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
TWI788171B (en) * 2021-09-01 2022-12-21 鴻海精密工業股份有限公司 Image processing device and super resolution processing method
CN115861045A (en) * 2021-09-22 2023-03-28 深圳市中兴微电子技术有限公司 Image super-resolution method, device, computer equipment and readable medium
CN113963176B (en) * 2021-10-28 2023-07-07 北京百度网讯科技有限公司 Model distillation method and device, electronic equipment and storage medium
CN114092330B (en) * 2021-11-19 2024-04-30 长春理工大学 Light-weight multi-scale infrared image super-resolution reconstruction method
CN114219836B (en) * 2021-12-15 2022-06-03 北京建筑大学 Unmanned aerial vehicle video vehicle tracking method based on space-time information assistance
CN114612309B (en) * 2022-05-12 2022-10-14 电子科技大学 Full-on-chip dynamic reconfigurable super-resolution device
CN115186774B (en) * 2022-09-13 2022-12-16 徐州飞宇机械科技有限公司 Intelligent cable stripping equipment and method thereof
CN115857614B (en) * 2022-11-17 2023-12-29 弘正储能(上海)能源科技有限公司 Multi-path photovoltaic MPPT staggered BOOST control method and system
CN117788477B (en) * 2024-02-27 2024-05-24 贵州健易测科技有限公司 Image reconstruction method and device for automatically quantifying tea leaf curl


Patent Citations (2)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
WO2020238558A1 (en) * 2019-05-24 2020-12-03 鹏城实验室 Image super-resolution method and system
CN111242846A (en) * 2020-01-07 2020-06-05 福州大学 Fine-grained scale image super-resolution method based on non-local enhancement network

Non-Patent Citations (1)

* Cited by examiner, † Cited by third party
Title
贡荣麟; 施俊; 王骏. Hybrid-supervised dual-channel feedback U-Net for breast ultrasound image segmentation. Journal of Image and Graphics (中国图象图形学报), 2020, (10). *

Also Published As

Publication number Publication date
CN112991173A (en) 2021-06-18

Similar Documents

Publication Publication Date Title
CN112991173B (en) Single-frame image super-resolution reconstruction method based on dual-channel feature migration network
CN110992270A (en) Multi-scale residual attention network image super-resolution reconstruction method based on attention
CN111784602B (en) Method for generating countermeasure network for image restoration
CN109035142B (en) Satellite image super-resolution method combining countermeasure network with aerial image prior
CN114119444B (en) Multi-source remote sensing image fusion method based on deep neural network
CN111444896A (en) Method for positioning human meridian key points through far infrared thermal imaging
Wang et al. Laplacian pyramid adversarial network for face completion
Qu et al. A dual-branch detail extraction network for hyperspectral pansharpening
CN112507997A (en) Face super-resolution system based on multi-scale convolution and receptive field feature fusion
CN104835130A (en) Multi-exposure image fusion method
CN113763299B (en) Panchromatic and multispectral image fusion method and device and application thereof
CN109102485A (en) Image interfusion method and device based on NSST and adaptive binary channels PCNN
CN110246084A (en) A kind of super-resolution image reconstruction method and its system, device, storage medium
CN117252761A (en) Cross-sensor remote sensing image super-resolution enhancement method
CN114862731B (en) Multi-hyperspectral image fusion method guided by low-rank priori and spatial spectrum information
CN113112583B (en) 3D human body reconstruction method based on infrared thermal imaging
CN113139585B (en) Infrared and visible light image fusion method based on unified multi-scale dense connection network
CN115359372A (en) Unmanned aerial vehicle video moving object detection method based on optical flow network
CN114926343A (en) Image super-resolution method based on pyramid fusion attention network
CN114897694A (en) Image super-resolution reconstruction method based on mixed attention and double-layer supervision
CN112767243A (en) Hyperspectral image super-resolution implementation method and system
CN117788296B (en) Infrared remote sensing image super-resolution reconstruction method based on heterogeneous combined depth network
Gao et al. Bayesian image super-resolution with deep modeling of image statistics
CN116468605A (en) Video super-resolution reconstruction method based on time-space layered mask attention fusion
CN111428809B (en) Crowd counting method based on spatial information fusion and convolutional neural network

Legal Events

Date Code Title Description
PB01 Publication
SE01 Entry into force of request for substantive examination
GR01 Patent grant