CN109389556B - Multi-scale cavity convolutional neural network super-resolution reconstruction method and device - Google Patents


Info

Publication number
CN109389556B
CN109389556B (application CN201811113028.0A)
Authority
CN
China
Prior art keywords
image
resolution
convolution
super
network
Prior art date
Legal status (The legal status is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the status listed.)
Active
Application number
CN201811113028.0A
Other languages
Chinese (zh)
Other versions
CN109389556A (en)
Inventor
徐颖
应自炉
商丽娟
翟懿奎
王天雷
甘俊英
曾军英
秦传波
曹鹤
邓文博
Current Assignee (The listed assignees may be inaccurate. Google has not performed a legal analysis and makes no representation or warranty as to the accuracy of the list.)
Wuyi University
Original Assignee
Wuyi University
Priority date (The priority date is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the date listed.)
Filing date
Publication date
Application filed by Wuyi University filed Critical Wuyi University
Priority to CN201811113028.0A (CN109389556B)
Priority to PCT/CN2018/107916 (WO2020056791A1)
Publication of CN109389556A
Application granted
Publication of CN109389556B

Classifications

    • G: Physics; G06: Computing, calculating or counting; G06T: Image data processing or generation, in general
    • G06T 3/4053: Scaling of whole images or parts thereof based on super-resolution, i.e. the output image resolution being higher than the sensor resolution
    • G06T 3/4046: Scaling of whole images or parts thereof using neural networks
    • G06T 5/77: Retouching; Inpainting; Scratch removal
    • G06T 2207/10004: Still image; Photographic image
    • G06T 2207/10024: Color image
    • G06T 2207/20024: Filtering details
    • G06T 2207/20081: Training; Learning
    • G06T 2207/20084: Artificial neural networks [ANN]

Landscapes

  • Engineering & Computer Science (AREA)
  • Physics & Mathematics (AREA)
  • General Physics & Mathematics (AREA)
  • Theoretical Computer Science (AREA)
  • Artificial Intelligence (AREA)
  • Evolutionary Computation (AREA)
  • Image Processing (AREA)
  • Image Analysis (AREA)

Abstract

The invention discloses a multi-scale cavity (dilated) convolutional neural network super-resolution reconstruction method and device. By constructing a cavity convolution super-resolution network, the problem of the small receptive field in the prior art is effectively solved and the network's receptive field is enlarged. Meanwhile, so that the cavity convolution super-resolution network suits super-resolution reconstruction at different scales, data enhancement at different scales is applied to the original image, enabling the network to process images at different magnification factors and thus to generalize to multi-scale image super-resolution.

Description

Multi-scale cavity convolutional neural network super-resolution reconstruction method and device
Technical Field
The invention relates to the technical field of image processing, in particular to a method and a device for reconstructing super-resolution of a multi-scale cavity convolutional neural network.
Background
At present, deep convolutional neural networks are widely used in computer vision and show excellent performance in image classification, object detection and other problems. Single-image super-resolution is a typical computer vision problem. For the reconstruction of low-resolution images, Dong et al. proposed an algorithm that reconstructs super-resolution images with a convolutional neural network (SRCNN); the idea is to use the strong learning ability of the network model to learn an end-to-end mapping from low resolution to high resolution and thereby reconstruct a super-resolution image. Later, Dong et al. improved on the SRCNN method by taking the low-resolution image directly as network input and reducing the feature-map dimensions and convolution-kernel size, proposing a fast super-resolution method with fewer network training parameters. Shi et al. proposed a super-resolution method that uses a sub-pixel convolution layer to rearrange the final feature maps into a high-resolution image, greatly increasing computation speed. However, all three of these deep-learning methods use shallow convolutional network structures, whose receptive fields are small and whose extracted features are very local, so their super-resolution performance has certain limitations.
Disclosure of Invention
In order to overcome the defects of the prior art, the invention aims to provide a method and a device for reconstructing the super-resolution of the multi-scale cavity convolutional neural network, which can increase the network receptive field, extract more global characteristics and are suitable for reconstructing the super-resolution of multi-scale images.
The technical scheme adopted by the invention for solving the problems is as follows:
a multi-scale cavity convolutional neural network super-resolution reconstruction method comprises the following steps:
carrying out multi-scale data enhancement on the original image to obtain an enhanced image;
carrying out data preprocessing on the enhanced image to obtain a preprocessed image;
constructing a cavity convolution super-resolution network, and performing super-resolution reconstruction on the preprocessed image with the cavity convolution super-resolution network, wherein the parameters that must be continuously updated in the cavity convolution super-resolution network are Θ = {W_1, W_2, W_3, B_1, B_2, B_3}, in which W_1 and B_1 are the weight and bias variables of the cavity convolution kernels, W_2 comprises n_2 convolution kernels of size n_1 × f_2 × f_2, B_2 is an n_2-dimensional vector, W_3 corresponds to c convolution kernels of size n_2 × f_3 × f_3, and B_3 is a c-dimensional vector; the reconstructed image is F(Y; Θ) and the corresponding original image is X; using the mean square error as the loss function:
L(Θ) = (1/n) Σ_{i=1}^{n} ||F(Y_i; Θ) - X_i||^2
wherein n is the number of training samples;
the weight value of the loss function is updated by a random gradient descent method, and the formula is as follows:
W_{i+1}^l = W_i^l - η · ∂L/∂W_i^l
where l ∈ {1, 2, 3} indexes the layer, i indexes the iteration, and η is the learning rate.
Further, performing multi-scale data enhancement on the original image to obtain an enhanced image, including:
taking an original image as an initial training set;
scaling each image in the initial training set in different scales respectively, and forming an intermediate training set by combining the original images;
and respectively rotating each image in the intermediate training set by different angles, and forming a processing training set by combining each image in the intermediate training set, wherein the images in the processing training set are the enhanced images.
Further, the data preprocessing is performed on the enhanced image to obtain a preprocessed image, and the data preprocessing comprises: extracting a brightness component of the enhanced image, cutting the brightness component, and downsampling the cut image according to different scale factors;
up-sampling the down-sampled image according to the corresponding scale factor to generate an initial high-resolution image with the resolution higher than that of the original image;
and cutting the initial high-resolution image and the original image into a high-resolution image block and an original image block with the size of N x N according to the step length M, wherein both M and N are constants.
Further, constructing a cavity convolution super-resolution network comprises the following steps:
replacing the pooling layer in the convolutional neural network and the convolutional layer behind the pooling layer with a hollow convolutional layer.
Further, performing super-resolution reconstruction on the preprocessed image by using the cavity convolution super-resolution network, wherein the super-resolution reconstruction comprises the following steps:
extracting image characteristics of the original image block and the corresponding high-resolution image block;
establishing a single-layer neural network and learning a nonlinear mapping relation between two image features by using the mapping characteristics of a convolution kernel for the obtained image features of the original image block and the image features of the high-resolution image block;
and carrying out aggregation reconstruction on the mapped high-resolution image blocks to generate a high-resolution image.
An apparatus storing the multi-scale cavity convolutional neural network super-resolution reconstruction method comprises a control module and a storage medium storing control instructions; the control module reads the control instructions from the storage medium and performs the following operations:
carrying out multi-scale data enhancement on the original image to obtain an enhanced image;
carrying out data preprocessing on the enhanced image to obtain a preprocessed image;
constructing a cavity convolution super-resolution network, and performing super-resolution reconstruction on the preprocessed image with the cavity convolution super-resolution network, wherein the parameters that must be continuously updated in the cavity convolution super-resolution network are Θ = {W_1, W_2, W_3, B_1, B_2, B_3}, in which W_1 and B_1 are the weight and bias variables of the cavity convolution kernels, W_2 comprises n_2 convolution kernels of size n_1 × f_2 × f_2, B_2 is an n_2-dimensional vector, W_3 corresponds to c convolution kernels of size n_2 × f_3 × f_3, and B_3 is a c-dimensional vector; the reconstructed image is F(Y; Θ) and the corresponding original image is X; using the mean square error as the loss function:
L(Θ) = (1/n) Σ_{i=1}^{n} ||F(Y_i; Θ) - X_i||^2
wherein n is the number of training samples;
the weight value of the loss function is updated by a random gradient descent method, and the formula is as follows:
W_{i+1}^l = W_i^l - η · ∂L/∂W_i^l
where l ∈ {1, 2, 3} indexes the layer, i indexes the iteration, and η is the learning rate.
Further, the control module executes operation to perform multi-scale data enhancement on the original image, and when the enhanced image is obtained, the control module comprises the following operations:
taking an original image as an initial training set;
scaling each image in the initial training set in different scales respectively, and forming an intermediate training set by combining the original images;
and respectively rotating each image in the intermediate training set by different angles, and forming a processing training set by combining each image in the intermediate training set, wherein the images in the processing training set are the enhanced images.
Further, the control module executes operation to perform data preprocessing on the enhanced image, and when the preprocessed image is obtained, the control module comprises the following operations:
extracting a brightness component of the enhanced image, cutting the brightness component, and downsampling the cut image according to different scale factors;
up-sampling the down-sampled image according to the corresponding scale factor to generate an initial high-resolution image with the resolution higher than that of the original image;
and cutting the initial high-resolution image and the original image into a high-resolution image block and an original image block with the size of N x N according to the step length M, wherein both M and N are constants.
Further, when the control module executes the operation to construct the cavity convolution super-resolution network, the method comprises the following operations:
replacing the pooling layer in the convolutional neural network and the convolutional layer behind the pooling layer with a hollow convolutional layer.
Further, when the control module executes the operation to perform the super-resolution reconstruction on the preprocessed image by using the cavity convolution super-resolution network, the method comprises the following operations:
extracting image characteristics of the original image block and the corresponding high-resolution image block;
establishing a single-layer neural network and learning a nonlinear mapping relation between two image features by using the mapping characteristics of a convolution kernel for the obtained image features of the original image block and the image features of the high-resolution image block;
and performing aggregation reconstruction on the mapped high-resolution image blocks to generate a high-resolution image.
The invention has the beneficial effects that: a method and a device for reconstructing a super-resolution of a multi-scale cavity convolution neural network are provided, wherein a cavity convolution super-resolution network is constructed, so that the problem of small reception field in the prior art can be effectively solved, the network reception field is increased, and meanwhile, in order to enable the cavity convolution super-resolution network to be suitable for reconstructing super-resolution of different scales, different-scale data enhancement is carried out on an original image, so that the cavity convolution super-resolution network can process images of different size multiples, and the cavity convolution super-resolution network can generalize the multi-scale image super-resolution.
Drawings
The invention is further illustrated with reference to the following figures and examples.
FIG. 1 is a flow chart of a super-resolution reconstruction method of the present invention;
FIG. 2 is a further detailed flow chart of the multiscale data enhancement step of FIG. 1;
FIG. 3 is a flow chart further detailing the steps of data preprocessing of FIG. 1;
FIG. 4 is a flowchart further detailing the step of performing super resolution reconstruction in FIG. 1.
Detailed Description
Referring to fig. 1, an embodiment of the present invention provides a multi-scale cavity convolutional neural network super-resolution reconstruction method, including but not limited to the following steps:
s100, performing multi-scale data enhancement on an original image to obtain an enhanced image;
step S200, carrying out data preprocessing on the enhanced image to obtain a preprocessed image;
and S300, constructing a cavity convolution super-resolution network, and performing super-resolution reconstruction on the preprocessed image by using the cavity convolution super-resolution network.
In this embodiment, by constructing the cavity convolution super-resolution network, the problem of small reception field in the prior art can be effectively solved, the network reception field is increased, and meanwhile, in order to enable the cavity convolution super-resolution network to be suitable for reconstruction of super-resolution images of different scales, data enhancement of different scales is performed on an original image, so that the cavity convolution super-resolution network can process images of different multiples, and thus the cavity convolution super-resolution network can generalize multi-scale image super-resolution.
Further, referring to fig. 2, in step S100 of this embodiment, performing multi-scale data enhancement on the original image to obtain an enhanced image, including but not limited to the following steps:
step S110, taking the original image as an initial training set;
step S120, zooming each image in the initial training set in different scales respectively, and combining the original images to form an intermediate training set;
step S130, respectively rotating each image in the intermediate training set by different angles, and forming a processing training set by combining each image in the intermediate training set, where the images in the processing training set are the enhanced images.
In this embodiment, 91 images in the Timofte data set in the prior art are used as original images, and the 91 original images are used as an initial training set, and then the initial training set is respectively processed by adopting two ways of scaling and rotating. For example, 91 original images in the initial training set are first down-sampled and scaled to 0.6 times, 0.7 times, 0.8 times and 0.9 times of the original size, and 91 original images are combined to form an intermediate training set, where the number of images in the intermediate training set is 5 times that in the initial training set; and then, respectively rotating each image in the intermediate training set by 90 degrees, 180 degrees and 270 degrees, and combining the images in the original intermediate training set to form a processing training set, wherein the images in the processing training set are the enhanced images. At this time, the number of enhanced images is increased to 20 times the number of images in the initial training set.
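The enhancement in this embodiment (four scale factors plus three rotations, giving 20 times the initial 91 images) can be sketched in NumPy. This is an illustrative sketch, not the patent's code: the function names are ours, and nearest-neighbour rescaling stands in for whatever interpolation the embodiment actually uses.

```python
import numpy as np

def scale_nearest(img, factor):
    """Nearest-neighbour rescale (our stand-in for the embodiment's resampling)."""
    h, w = img.shape[:2]
    nh, nw = max(1, int(round(h * factor))), max(1, int(round(w * factor)))
    rows = (np.arange(nh) / factor).astype(int).clip(0, h - 1)
    cols = (np.arange(nw) / factor).astype(int).clip(0, w - 1)
    return img[rows][:, cols]

def augment(initial_set):
    """Scales 0.6-0.9 combined with the originals, then rotations 90/180/270."""
    intermediate = list(initial_set)           # intermediate set = 5x initial
    for img in initial_set:
        for f in (0.6, 0.7, 0.8, 0.9):
            intermediate.append(scale_nearest(img, f))
    processed = list(intermediate)             # processing set = 4x intermediate
    for img in intermediate:
        for k in (1, 2, 3):                    # 90, 180, 270 degrees
            processed.append(np.rot90(img, k))
    return processed

imgs = [np.zeros((40, 40)) for _ in range(91)]  # toy stand-ins for the 91 images
out = augment(imgs)
print(len(out))   # 91 * 5 * 4 = 1820, i.e. 20x the initial set
```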
Further, referring to fig. 3, in step S200 of this embodiment, the data preprocessing is performed on the enhanced image to obtain a preprocessed image, which includes, but is not limited to, the following steps:
step S210, extracting the brightness component of the enhanced image, cutting the brightness component, and down-sampling the cut image according to different scale factors;
step S220, up-sampling the down-sampled image according to the corresponding scale factor to generate an initial high-resolution image with the resolution higher than that of the original image;
and step S230, cutting the initial high-resolution image and the original image into a high-resolution image block and an original image block with the size of N × N according to the step length M, wherein both M and N are constants.
In this embodiment, after the enhanced image is obtained, it is first converted into YCbCr color space and the nonlinear luminance component Y is extracted. The luminance component is then cropped, and the cropped image is downsampled by scale factors of 2, 3 and 4 respectively to obtain the corresponding low-resolution images. Next, each low-resolution image is upsampled by the same scale factor to generate an initial high-resolution image. In addition, to match the network input, the initial high-resolution image and the original image are cut into high-resolution image blocks and original image blocks of size 41 × 41 with a step length of 14, and these blocks serve as the input data of the cavity convolution super-resolution network.
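The preprocessing steps above can likewise be sketched. The luma weights (a BT.601-style approximation) and nearest-neighbour down/upsampling are our assumptions, standing in for the YCbCr conversion and the resampling (typically bicubic) such methods use; the 41 × 41 block size and step length 14 follow the embodiment.

```python
import numpy as np

def rgb_to_y(img):
    # BT.601-style luma approximation of the YCbCr Y component (our assumption)
    return 0.299 * img[..., 0] + 0.587 * img[..., 1] + 0.114 * img[..., 2]

def down_up_nearest(y, s):
    """Downsample then upsample by integer scale factor s (nearest-neighbour
    stand-in for bicubic interpolation)."""
    low = y[::s, ::s]
    up = np.repeat(np.repeat(low, s, axis=0), s, axis=1)
    return up[:y.shape[0], :y.shape[1]]

def to_patches(img, n=41, m=14):
    """Cut into n x n blocks with step length m, as in the embodiment."""
    h, w = img.shape
    return np.array([img[r:r + n, c:c + n]
                     for r in range(0, h - n + 1, m)
                     for c in range(0, w - n + 1, m)])

rgb = np.random.rand(96, 96, 3)            # toy stand-in for an enhanced image
y = rgb_to_y(rgb)
degraded = down_up_nearest(y, 3)           # scale factor 3
lr_patches = to_patches(degraded)          # network input blocks
hr_patches = to_patches(y)                 # target blocks
print(lr_patches.shape)  # (16, 41, 41)
```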
Further, in step S300 of this embodiment, the cavity convolution super-resolution network is constructed mainly by replacing the pooling layer in a convolutional neural network, and the convolutional layer after it, with a cavity convolutional layer. The downsampling operation of the pooling layer in a conventional convolutional neural network causes information loss; the main function of the cavity convolutional layer is to remove the pooling downsampling operation without shrinking the receptive field of the network, so that the super-resolution image can be reconstructed. For the operation of reconstructing the super-resolution image, therefore, the cavity convolution super-resolution network with cavity convolutional layers performs better than a conventional convolutional neural network.
Further, referring to fig. 4, in step S300 of this embodiment, performing super-resolution reconstruction on the preprocessed image by using a hole convolution super-resolution network, including, but not limited to, the following steps:
step S310, extracting image features of the original image block and the corresponding high-resolution image block.
When a convolutional neural network is applied to super-resolution reconstruction, the extraction and representation of image blocks is a convolution process. The convolution operation in the cavity convolution super-resolution network is the same as in a conventional convolutional neural network, but the convolution kernel differs from the original one. To enlarge the network's receptive field and let the network learn more global features, the invention performs the convolution with cavity (dilated) convolution kernels, thereby extracting the network features. In this step, 9 × 9 convolution kernels with a cavity factor of 2 extract the image features of the original image blocks and the corresponding high-resolution image blocks, and the number of feature maps output by the network is set to 64, i.e., 64 output channels. Specifically, the convolution operation is:
F_1(Y) = max(0, W_1 *_l Y + B_1)
where W_1 and B_1 are the weight and bias variables of the cavity convolution kernels and *_l denotes cavity convolution. W_1 corresponds to n_1 filters of size c × f_1 × f_1, where c is the number of channels of the input image and f_1 is the filter size; B_1 is an n_1-dimensional vector with one element per convolution kernel. By the principle of convolution, the output of the first layer of the cavity convolution super-resolution network therefore consists of n_1 feature maps.
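A minimal NumPy sketch of this first layer, assuming "valid" boundary handling and the correlation-style convolution used by deep-learning frameworks (both our assumptions, as the patent does not specify them):

```python
import numpy as np

def dilated_conv2d(x, k, l):
    """'Valid' 2-D correlation-style convolution of x with kernel k and cavity
    (dilation) factor l. With l = 1 this is ordinary discrete convolution;
    larger l spaces the kernel taps l pixels apart, enlarging the receptive field."""
    kh, kw = k.shape
    eh, ew = (kh - 1) * l + 1, (kw - 1) * l + 1   # effective kernel extent
    oh, ow = x.shape[0] - eh + 1, x.shape[1] - ew + 1
    out = np.zeros((oh, ow))
    for i in range(kh):
        for j in range(kw):
            out += k[i, j] * x[i * l:i * l + oh, j * l:j * l + ow]
    return out

def first_layer(y, kernels, biases, l=2):
    """F_1(Y) = max(0, W_1 *_l Y + B_1): one feature map per cavity kernel."""
    return np.stack([np.maximum(0.0, dilated_conv2d(y, w, l) + b)
                     for w, b in zip(kernels, biases)])

y = np.random.rand(41, 41)                 # a 41 x 41 input block
w1 = np.random.randn(64, 9, 9) * 0.01      # n_1 = 64 kernels of 9 x 9
b1 = np.zeros(64)
f1 = first_layer(y, w1, b1, l=2)
print(f1.shape)   # (64, 25, 25): a 9 x 9 kernel at dilation 2 spans 17 pixels
```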
In particular, the cavity convolution operation is as follows:
Define a discrete function F: Z^2 → R. Let Ω_r = [-r, r]^2 ∩ Z^2, and let k: Ω_r → R be a discrete filter of size (2r+1)^2. The discrete convolution operation is defined as:
(F * k)(p) = Σ_{s+t=p} F(s) k(t)
The cavity convolution is generally expressed as:
(F *_l k)(p) = Σ_{s+lt=p} F(s) k(t)
where l is the cavity factor and *_l denotes cavity convolution; when l = 1 this is the ordinary discrete convolution. A network based on cavity convolution supports exponential growth of the receptive field without losing resolution information. Let F_0, F_1, ..., F_{n-1}: Z^2 → R be discrete functions and let k_0, k_1, ..., k_{n-2}: Ω_1 → R be discrete 3 × 3 filters. Applying the filters with exponentially growing cavity factors gives
F_{i+1} = F_i *_{2^i} k_i,  i = 0, 1, ..., n-2
Define the receptive field of an element p in F_{i+1} as the set of elements of F_0 that can change the value of F_{i+1}(p); the size of the receptive field of p is the number of elements in this set. It is readily shown that the receptive field of each element in F_{i+1} has size (2^{i+2} - 1) × (2^{i+2} - 1).
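The receptive-field growth can be checked numerically: stacking 3 × 3 filters with cavity factors 1, 2, 4, ... yields receptive fields 3, 7, 15, ..., consistent with the (2^{i+2} - 1) × (2^{i+2} - 1) expression. A small sketch (the helper name is ours):

```python
def receptive_field(num_layers):
    """Receptive field (one side, in pixels) of a stack of 3 x 3 convolutions
    with cavity factors 1, 2, 4, ..., 2^(num_layers - 1). Each layer extends
    the field by (kernel_size - 1) * dilation = 2 * 2^i pixels."""
    rf = 1
    for i in range(num_layers):
        rf += 2 * (2 ** i)
    return rf

print([receptive_field(n) for n in (1, 2, 3, 4)])  # [3, 7, 15, 31]
```

So n stacked layers see 2^(n+1) - 1 pixels: the field grows exponentially in depth while the feature maps keep full resolution, which is the property the text relies on.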
And step S320, establishing a single-layer neural network and learning a nonlinear mapping relation between the two image characteristics by using the mapping characteristics of the convolution kernel for the obtained image characteristics of the original image block and the image characteristics of the high-resolution image block.
The operation of this step is realized mainly in the second layer of the network. It nonlinearly maps each high-dimensional vector produced by the first layer of the network to another high-dimensional vector, and the newly generated high-dimensional vectors form another vector set. In this step, the feature maps of the 64 channels generated in step S310 are nonlinearly mapped with 1 × 1 convolution kernels, mapping low-resolution high-dimensional vectors into a high-dimensional vector space with high-resolution characteristics and generating 32 feature maps. Regarding the first-layer output as n_1-dimensional feature vectors, the second layer of the network maps these n_1-dimensional vectors to n_2 dimensions. Such an operation is equivalent to convolving with n_2 convolution kernels of size 1 × 1, and the second layer of the network is expressed as:
F_2(Y) = max(0, W_2 * F_1(Y) + B_2)
where W_2 comprises n_2 convolution kernels of size n_1 × f_2 × f_2 and B_2 is an n_2-dimensional vector. Each output of the second layer of the network is used to reconstruct a high-resolution block of the image. Finally, the 32 channel feature maps are reconstructed into high-resolution image blocks through a 5 × 5 convolution kernel, realizing image super-resolution reconstruction.
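Because a 1 × 1 convolution is a per-pixel linear map across channels, this second layer can be sketched as a channel-wise matrix product followed by ReLU. An illustrative sketch with n_1 = 64 and n_2 = 32 as in this embodiment (the function name is ours):

```python
import numpy as np

def mapping_layer(f1, w2, b2):
    """F_2 = max(0, W_2 * F_1 + B_2) with 1 x 1 kernels: each pixel's
    n_1-dimensional feature vector is mapped linearly to n_2 dimensions."""
    # f1: (n1, H, W); w2: (n2, n1); b2: (n2,)
    out = np.einsum('oc,chw->ohw', w2, f1) + b2[:, None, None]
    return np.maximum(0.0, out)

n1, n2, H, W = 64, 32, 25, 25
f1 = np.random.rand(n1, H, W)              # first-layer feature maps
w2 = np.random.randn(n2, n1) * 0.1         # n_2 kernels of size n_1 x 1 x 1
b2 = np.zeros(n2)
f2 = mapping_layer(f1, w2, b2)
print(f2.shape)   # (32, 25, 25)
```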
And step S330, carrying out aggregation reconstruction on the mapped high-resolution image blocks to generate a high-resolution image.
The operation of the step is mainly realized in the third layer of the network, the step generates the high-resolution image from the high-resolution image set of the second layer of the network, and the generated image is similar to a real original image. In the conventional method, the reconstruction operation generates the final complete high resolution image by averaging the predicted high resolution blocks. In fact, the averaging process can also be regarded as a pre-trained filter convolving on a feature map set. The operation of this step in the third layer of the network is inspired by the convolution operation to obtain the final high-resolution image. The third layer of the network is represented as:
F(Y) = W_3 * F_2(Y) + B_3
where W_3 corresponds to c convolution kernels of size n_2 × f_3 × f_3 and B_3 is a c-dimensional vector. If the high-resolution blocks lie in the image domain, the filter is treated as an averaging filter; if the high-resolution blocks lie in another domain, the convolution kernel W_3 first projects the coefficients onto the image domain and then averages.
From the above, a high-resolution image is obtained once the parameters that must be continuously updated in the cavity convolution super-resolution network, Θ = {W_1, W_2, W_3, B_1, B_2, B_3}, have been learned; the reconstructed image is F(Y; Θ) and the corresponding original image is X. The mean square error is used as the loss function:
L(Θ) = (1/n) Σ_{i=1}^{n} ||F(Y_i; Θ) - X_i||^2
where n is the number of training samples. The weights are updated by the stochastic gradient descent (SGD) method:
W_{i+1}^l = W_i^l - η · ∂L/∂W_i^l
where l ∈ {1, 2, 3} indexes the layer, i indexes the iteration, and η is the learning rate.
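The loss and update rule can be illustrated on a toy problem. A per-pixel linear model stands in for the three-layer network here, so this only demonstrates the form of the MSE loss and of the SGD step, not the patent's network:

```python
import numpy as np

rng = np.random.default_rng(0)
# Toy stand-in: learn a per-pixel linear map F(Y; W, B) = W*Y + B so that
# the MSE loss and the SGD update take the same form as in the text.
Y = rng.random((8, 41, 41))            # n = 8 "low-resolution" samples
W_true, B_true = 1.7, 0.3
X = W_true * Y + B_true                # corresponding "original" images

W, B, eta = 0.0, 0.0, 0.05             # parameters Theta and learning rate

def loss(W, B):
    """L(Theta) = (1/n) sum_i ||F(Y_i; Theta) - X_i||^2, averaged over pixels."""
    return np.mean((W * Y + B - X) ** 2)

before = loss(W, B)
for _ in range(200):                   # stochastic gradient descent
    i = rng.integers(len(Y))           # pick one random sample
    err = W * Y[i] + B - X[i]
    W -= eta * np.mean(2 * err * Y[i]) # dL/dW on this sample
    B -= eta * np.mean(2 * err)        # dL/dB on this sample
after = loss(W, B)
print(before > after)                  # True: the updates reduce the loss
```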
Under the same computational conditions, cavity convolution provides a larger receptive field. When a network layer needs a larger receptive field but computational resources are limited and the number or size of the convolution kernels cannot be increased, cavity convolution can be used. Cavity convolution integrates multi-scale content information without losing resolution and supports exponential growth of the receptive field. Because it integrates content information at different scales, it can be added to an existing network structure at any resolution, which favors network reusability. A conventional convolutional neural network uses pooling layers to reduce dimensionality, but in image super-resolution this has side effects: the pooled feature layers have low pixel resolution, and even after upsampling the feature maps lose some precision. The receptive field of the network is therefore enlarged by adopting cavity convolution instead of pooling layers.
In addition, in order to implement the method for reconstructing the super-resolution of the multi-scale cavity convolutional neural network in the above embodiment, an embodiment of the present invention further provides an apparatus for storing the method for reconstructing the super-resolution of the multi-scale cavity convolutional neural network, where the apparatus includes a control module and a storage medium for storing a control instruction, and the control module reads the control instruction in the storage medium and performs the following operations:
carrying out multi-scale data enhancement on the original image to obtain an enhanced image;
carrying out data preprocessing on the enhanced image to obtain a preprocessed image;
and constructing a cavity convolution super-resolution network, and performing super-resolution reconstruction on the preprocessed image by using the cavity convolution super-resolution network.
Further, the control module executes operation to perform multi-scale data enhancement on the original image, and when an enhanced image is obtained, the method comprises the following operations:
taking an original image as an initial training set;
scaling each image in the initial training set in different scales respectively, and forming an intermediate training set by combining the original images;
and respectively rotating each image in the intermediate training set by different angles, and forming a processing training set by combining each image in the intermediate training set, wherein the images in the processing training set are the enhanced images.
Further, the control module executes operations to perform data preprocessing on the enhanced image, and when a preprocessed image is obtained, the operations include:
extracting a brightness component of the enhanced image, cutting the brightness component, and downsampling the cut image according to different scale factors;
up-sampling the down-sampled image according to the corresponding scale factor to generate an initial high-resolution image with the resolution higher than that of the original image;
and cutting the initial high-resolution image and the original image into a high-resolution image block and an original image block with the size of N x N according to the step length M, wherein both M and N are constants.
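The cropping step can be sketched as follows; the list-of-rows image format and the demonstration values N = 3, M = 2 are arbitrary assumptions, as the embodiment only requires N and M to be constants:

```python
def extract_patches(image, patch_size, stride):
    """Cut a 2-D image (a list of rows) into patch_size x patch_size
    blocks, sliding by `stride` pixels, as done for both the initial
    high-resolution image and the original image."""
    h, w = len(image), len(image[0])
    patches = []
    for top in range(0, h - patch_size + 1, stride):
        for left in range(0, w - patch_size + 1, stride):
            patches.append([row[left:left + patch_size]
                            for row in image[top:top + patch_size]])
    return patches


image = [[r * 5 + c for c in range(5)] for r in range(5)]  # toy 5x5 image
print(len(extract_patches(image, patch_size=3, stride=2)))  # 4 patches
```

Running both the initial high-resolution image and the original through the same patch_size/stride keeps the resulting block pairs aligned for training.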
Further, when the control module executes the operation of constructing the hole convolution super-resolution network, the operation includes: replacing the pooling layer in the convolutional neural network, and the convolutional layer following the pooling layer, with a hole convolution layer.
Further, when the control module executes the operation of performing super-resolution reconstruction on the preprocessed image by using the hole convolution super-resolution network, the operation includes:
extracting image characteristics of the original image block and the corresponding high-resolution image block;
establishing a single-layer neural network and learning a nonlinear mapping relation between two image features by using the mapping characteristics of a convolution kernel for the obtained image features of the original image block and the image features of the high-resolution image block;
and performing aggregation reconstruction on the mapped high-resolution image blocks to generate a high-resolution image.
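A toy single-channel sketch of this three-stage pipeline (feature extraction, non-linear mapping, aggregation/reconstruction), each stage applying the form F(Y) = max(0, W * Y + B) from the claims; the kernels, hole factor, and single channel are illustrative assumptions, not the trained network:

```python
def conv2d_dilated(img, kernel, dilation=1, bias=0.0, relu=True):
    """Single-channel 2-D hole (dilated) convolution, valid mode,
    computing max(0, W * Y + B) when relu=True."""
    k = len(kernel)
    span = (k - 1) * dilation
    h, w = len(img), len(img[0])
    out = []
    for y in range(h - span):
        row = []
        for x in range(w - span):
            v = bias + sum(kernel[i][j] * img[y + i * dilation][x + j * dilation]
                           for i in range(k) for j in range(k))
            row.append(max(0.0, v) if relu else v)
        out.append(row)
    return out


def super_resolve_patch(patch, w1, w2, w3):
    # Stage 1: feature extraction from the low-resolution patch.
    f1 = conv2d_dilated(patch, w1)
    # Stage 2: 1x1 non-linear mapping between feature representations.
    f2 = conv2d_dilated(f1, w2)
    # Stage 3: linear aggregation/reconstruction (no ReLU).
    return conv2d_dilated(f2, w3, relu=False)
```

With an identity 3×3 kernel for w1 and 1×1 identity kernels for w2 and w3 the pipeline simply passes the patch centre through a ReLU, which confirms the plumbing; in training, the weights and biases would be learned by minimising the mean-square error against the original image blocks.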
While the preferred embodiments of the present invention have been described in detail, it will be understood by those skilled in the art that the foregoing and various other changes, omissions and deviations in the form and detail thereof may be made without departing from the scope of this invention.

Claims (10)

1. A super-resolution reconstruction method of a multi-scale cavity convolutional neural network, characterized by comprising the following steps:
carrying out multi-scale data enhancement on the original image to obtain an enhanced image;
carrying out data preprocessing on the enhanced image to obtain a preprocessed image;
constructing a cavity convolution super-resolution network, and performing super-resolution reconstruction on the preprocessed image by using the cavity convolution super-resolution network, wherein a convolution operation formula is as follows:
F_1(Y) = max(0, W_1 *_l Y + B_1)
wherein W_1 and B_1 denote the weight and bias of the hole convolution kernel, and *_l denotes hole convolution; W_1 corresponds to n_1 filters of size c × f_1 × f_1, c being the number of channels of the input image and f_1 the size of the filter;
defining a discrete function F: Z^2 → R, and letting Ω_r = [−r, r]^2 ∩ Z^2 and k: Ω_r → R be a discrete filter of size (2r + 1)^2, the discrete convolution operation is defined as:
(F * k)(p) = Σ_{s+t=p} F(s)·k(t)
and the hole convolution is generally expressed as:
(F *_l k)(p) = Σ_{s+l·t=p} F(s)·k(t)
wherein l is the hole factor and *_l denotes hole convolution; when l = 1 this is an ordinary discrete convolution operation; a network based on hole convolution supports exponential growth of the receptive field without losing resolution information; let F_0, F_1, ..., F_{n−1}: Z^2 → R be discrete functions and k_0, k_1, ..., k_{n−2}: Ω_1 → R be discrete 3×3 filters; applying the filters with exponentially growing hole factors,
F_{i+1} = F_i *_{2^i} k_i, for i = 0, 1, ..., n − 2;
the receptive field of an element p in F_{i+1} is defined as the set of elements of F_0 that can change the value of F_{i+1}(p), and the size of the receptive field is the number of elements in that set; it follows that the receptive field of each element in F_{i+1} is (2^{i+2} − 1) × (2^{i+2} − 1);
the parameters to be continuously updated in the hole convolution super-resolution network are Θ = {W_1, W_2, W_3, B_1, B_2, B_3}, wherein W_1 and B_1 denote the weight and bias of the hole convolution kernel, W_2 comprises n_2 convolution kernels each of size n_1 × f_2 × f_2, B_2 is an n_2-dimensional vector, W_3 corresponds to c convolution kernels of size n_2 × f_3 × f_3, and B_3 is a c-dimensional vector; the reconstructed image is F(Y; Θ) and the corresponding original image is X; the mean square error is used as the loss function:
L(Θ) = (1/n) · Σ_{i=1}^{n} ||F(Y_i; Θ) − X_i||^2
wherein n is the number of training samples;
the weights of the loss function are updated by stochastic gradient descent according to:
W_{i+1}^{l} = W_i^{l} − η · ∂L/∂W_i^{l}
wherein l ∈ {1, 2, 3} indexes the layer, i the iteration, and η is the learning rate.
2. The method for reconstructing the super-resolution of the multi-scale hole convolutional neural network of claim 1, wherein: the multi-scale data enhancement of the original image to obtain an enhanced image comprises the following steps:
taking an original image as an initial training set;
scaling each image in the initial training set in different scales respectively, and forming an intermediate training set by combining the original images;
and respectively rotating each image in the intermediate training set by different angles, and forming a processing training set by combining each image in the intermediate training set, wherein the images in the processing training set are the enhanced images.
3. The method for reconstructing the super-resolution of the multi-scale hole convolutional neural network of claim 1, wherein: the data preprocessing is performed on the enhanced image to obtain a preprocessed image, and the data preprocessing comprises the following steps:
extracting a brightness component of the enhanced image, cutting the brightness component, and downsampling the cut image according to different scale factors;
up-sampling the down-sampled image according to the corresponding scale factor to generate an initial high-resolution image with the resolution higher than that of the original image;
and cutting the initial high-resolution image and the original image into a high-resolution image block and an original image block with the size of N x N according to the step length M, wherein both M and N are constants.
4. The method for reconstructing the super-resolution of the multi-scale hole convolutional neural network of claim 1, wherein: the method for constructing the cavity convolution super-resolution network comprises the following steps:
replacing the pooling layer in the convolutional neural network, and the convolutional layer following the pooling layer, with a hole convolution layer.
5. The multi-scale hole convolutional neural network super-resolution reconstruction method of claim 3, wherein: the super-resolution reconstruction of the preprocessed image by using the cavity convolution super-resolution network comprises the following steps:
extracting image characteristics of the original image block and the corresponding high-resolution image block;
establishing a single-layer neural network and learning a nonlinear mapping relation between two image features by using the mapping characteristics of a convolution kernel for the obtained image features of the original image block and the image features of the high-resolution image block;
and carrying out aggregation reconstruction on the mapped high-resolution image blocks to generate a high-resolution image.
6. A device for storing a multi-scale cavity convolutional neural network super-resolution reconstruction method, characterized in that the device comprises a control module and a storage medium storing control instructions; the control module reads the control instructions from the storage medium and executes the following operations:
carrying out multi-scale data enhancement on the original image to obtain an enhanced image;
carrying out data preprocessing on the enhanced image to obtain a preprocessed image;
constructing a cavity convolution super-resolution network, and performing super-resolution reconstruction on the preprocessed image by using the cavity convolution super-resolution network, wherein a convolution operation formula is as follows:
F_1(Y) = max(0, W_1 *_l Y + B_1)
wherein W_1 and B_1 denote the weight and bias of the hole convolution kernel, and *_l denotes hole convolution; W_1 corresponds to n_1 filters of size c × f_1 × f_1, c being the number of channels of the input image and f_1 the size of the filter;
defining a discrete function F: Z^2 → R, and letting Ω_r = [−r, r]^2 ∩ Z^2 and k: Ω_r → R be a discrete filter of size (2r + 1)^2, the discrete convolution operation is defined as:
(F * k)(p) = Σ_{s+t=p} F(s)·k(t)
and the hole convolution is generally expressed as:
(F *_l k)(p) = Σ_{s+l·t=p} F(s)·k(t)
wherein l is the hole factor and *_l denotes hole convolution; when l = 1 this is an ordinary discrete convolution operation; a network based on hole convolution supports exponential growth of the receptive field without losing resolution information; let F_0, F_1, ..., F_{n−1}: Z^2 → R be discrete functions and k_0, k_1, ..., k_{n−2}: Ω_1 → R be discrete 3×3 filters; applying the filters with exponentially growing hole factors,
F_{i+1} = F_i *_{2^i} k_i, for i = 0, 1, ..., n − 2;
the receptive field of an element p in F_{i+1} is defined as the set of elements of F_0 that can change the value of F_{i+1}(p), and the size of the receptive field is the number of elements in that set; it follows that the receptive field of each element in F_{i+1} is (2^{i+2} − 1) × (2^{i+2} − 1);
the parameters to be continuously updated in the hole convolution super-resolution network are Θ = {W_1, W_2, W_3, B_1, B_2, B_3}, wherein W_1 and B_1 denote the weight and bias of the hole convolution kernel, W_2 comprises n_2 convolution kernels each of size n_1 × f_2 × f_2, B_2 is an n_2-dimensional vector, W_3 corresponds to c convolution kernels of size n_2 × f_3 × f_3, and B_3 is a c-dimensional vector; the reconstructed image is F(Y; Θ) and the corresponding original image is X; the mean square error is used as the loss function:
L(Θ) = (1/n) · Σ_{i=1}^{n} ||F(Y_i; Θ) − X_i||^2
wherein n is the number of training samples;
the weights of the loss function are updated by stochastic gradient descent according to:
W_{i+1}^{l} = W_i^{l} − η · ∂L/∂W_i^{l}
wherein l ∈ {1, 2, 3} indexes the layer, i the iteration, and η is the learning rate.
7. The apparatus of claim 6, wherein: the control module executes operation to perform multi-scale data enhancement on the original image, and when the enhanced image is obtained, the control module comprises the following operations: taking an original image as an initial training set;
zooming each image in the initial training set in different scales respectively, and combining the original images to form an intermediate training set;
and respectively rotating each image in the intermediate training set by different angles, and forming a processing training set by combining each image in the intermediate training set, wherein the images in the processing training set are the enhanced images.
8. The apparatus of claim 6, wherein: the control module executes operation to perform data preprocessing on the enhanced image, and when a preprocessed image is obtained, the control module comprises the following operations: extracting a brightness component of the enhanced image, cutting the brightness component, and downsampling the cut image according to different scale factors;
up-sampling the down-sampled image according to the corresponding scale factor to generate an initial high-resolution image with the resolution higher than that of the original image;
and cutting the initial high-resolution image and the original image into a high-resolution image block and an original image block with the size of N x N according to the step length M, wherein both M and N are constants.
9. The apparatus of claim 6, wherein: when the control module executes the operation to construct the cavity convolution super-resolution network, the method comprises the following operations:
replacing the pooling layer in the convolutional neural network, and the convolutional layer following the pooling layer, with a hole convolution layer.
10. The apparatus of claim 8, wherein: when the control module executes the operation and utilizes the cavity convolution super-resolution network to carry out the super-resolution reconstruction on the preprocessed image, the control module comprises the following operations:
extracting image characteristics of the original image block and the corresponding high-resolution image block; establishing a single-layer neural network and learning a nonlinear mapping relation between two image features by using the mapping characteristics of a convolution kernel for the obtained image features of the original image block and the image features of the high-resolution image block;
and carrying out aggregation reconstruction on the mapped high-resolution image blocks to generate a high-resolution image.
CN201811113028.0A 2018-09-21 2018-09-21 Multi-scale cavity convolutional neural network super-resolution reconstruction method and device Active CN109389556B (en)

Priority Applications (2)

Application Number Priority Date Filing Date Title
CN201811113028.0A CN109389556B (en) 2018-09-21 2018-09-21 Multi-scale cavity convolutional neural network super-resolution reconstruction method and device
PCT/CN2018/107916 WO2020056791A1 (en) 2018-09-21 2018-09-27 Method and apparatus for super-resolution reconstruction of multi-scale dilated convolution neural network

Applications Claiming Priority (1)

Application Number Priority Date Filing Date Title
CN201811113028.0A CN109389556B (en) 2018-09-21 2018-09-21 Multi-scale cavity convolutional neural network super-resolution reconstruction method and device

Publications (2)

Publication Number Publication Date
CN109389556A CN109389556A (en) 2019-02-26
CN109389556B true CN109389556B (en) 2023-03-21

Family

ID=65419007

Family Applications (1)

Application Number Title Priority Date Filing Date
CN201811113028.0A Active CN109389556B (en) 2018-09-21 2018-09-21 Multi-scale cavity convolutional neural network super-resolution reconstruction method and device

Country Status (2)

Country Link
CN (1) CN109389556B (en)
WO (1) WO2020056791A1 (en)

Families Citing this family (45)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN110428428B (en) * 2019-07-26 2022-03-25 长沙理工大学 Image semantic segmentation method, electronic equipment and readable storage medium
CN110533051B (en) * 2019-08-02 2023-01-17 中国民航大学 Automatic detection method for contraband in X-ray security inspection image based on convolutional neural network
US11540798B2 (en) 2019-08-30 2023-01-03 The Research Foundation For The State University Of New York Dilated convolutional neural network system and method for positron emission tomography (PET) image denoising
CN112488916B (en) * 2019-09-12 2023-12-26 武汉Tcl集团工业研究院有限公司 Training method and computer equipment for image super-resolution reconstruction model
CN110659724B (en) * 2019-09-12 2023-04-28 复旦大学 Target detection depth convolution neural network construction method based on target scale
CN110781776B (en) * 2019-10-10 2022-07-05 湖北工业大学 Road extraction method based on prediction and residual refinement network
CN111028147B (en) * 2019-11-25 2024-02-02 天津大学 Three-dimensional medical image super-resolution reconstruction method based on cascade cavity convolution
CN111160413B (en) * 2019-12-12 2023-11-17 天津大学 Thyroid nodule classification method based on multi-scale feature fusion
CN111144460A (en) * 2019-12-16 2020-05-12 华东师范大学 Algorithm for improving classification performance of convolutional neural network
CN111047515B (en) * 2019-12-29 2024-01-09 兰州理工大学 Attention mechanism-based cavity convolutional neural network image super-resolution reconstruction method
EP3846477B1 (en) * 2020-01-05 2023-05-03 Isize Limited Preprocessing image data
CN111242846B (en) * 2020-01-07 2022-03-22 福州大学 Fine-grained scale image super-resolution method based on non-local enhancement network
CN111414988B (en) * 2020-02-13 2023-04-18 湖北工业大学 Remote sensing image super-resolution method based on multi-scale feature self-adaptive fusion network
CN111429347A (en) * 2020-03-20 2020-07-17 长沙理工大学 Image super-resolution reconstruction method and device and computer-readable storage medium
CN111461983B (en) * 2020-03-31 2023-09-19 华中科技大学鄂州工业技术研究院 Image super-resolution reconstruction model and method based on different frequency information
CN111667409B (en) * 2020-06-09 2024-03-22 云南电网有限责任公司电力科学研究院 Super-resolution algorithm-based insulator image resolution enhancement method
CN112017116B (en) * 2020-07-23 2024-02-23 西北大学 Image super-resolution reconstruction network based on asymmetric convolution and construction method thereof
CN111985544A (en) * 2020-08-07 2020-11-24 中移(杭州)信息技术有限公司 Fractal graph style fusion method and device, electronic equipment and storage medium
CN112017191B (en) * 2020-08-12 2023-08-22 西北大学 Liver pathology image segmentation model establishment and segmentation method based on attention mechanism
CN112070667B (en) * 2020-08-14 2024-06-18 深圳市九分文化传媒有限公司 Multi-scale feature fusion video super-resolution reconstruction method
CN111915490A (en) * 2020-08-14 2020-11-10 深圳清研智城科技有限公司 License plate image super-resolution reconstruction model and method based on multi-scale features
CN112070689A (en) * 2020-08-24 2020-12-11 中国人民解放军陆军装甲兵学院 Data enhancement method based on depth image
CN112070669B (en) * 2020-08-28 2024-04-16 西安科技大学 Super-resolution image reconstruction method for arbitrary fuzzy core
CN112101366A (en) * 2020-09-11 2020-12-18 湖南大学 Real-time segmentation system and method based on hybrid expansion network
CN112200725B (en) * 2020-10-26 2023-11-28 深圳大学 Super-resolution reconstruction method and device, storage medium and electronic equipment
CN112288629B (en) * 2020-10-27 2024-03-01 河南威虎智能科技有限公司 Super-resolution method of cross-scale graph neural network
CN112330540A (en) * 2020-11-08 2021-02-05 哈尔滨理工大学 Image super-resolution method based on multi-scale optimization
CN112541878A (en) * 2020-12-24 2021-03-23 北京百度网讯科技有限公司 Method and device for establishing image enhancement model and image enhancement
CN112699967B (en) * 2021-01-18 2024-03-12 武汉大学 Remote airport target detection method based on improved deep neural network
CN112734915A (en) * 2021-01-19 2021-04-30 北京工业大学 Multi-view stereoscopic vision three-dimensional scene reconstruction method based on deep learning
CN112884654B (en) * 2021-03-03 2024-02-20 鄂尔多斯应用技术学院 WDSR image super-resolution reconstruction optimization method based on CNN
CN112950476A (en) * 2021-03-12 2021-06-11 广州冠图视觉科技有限公司 Method for improving resolution and definition of picture
CN113191947B (en) * 2021-03-23 2024-05-14 西安电子科技大学 Image super-resolution method and system
CN113298714B (en) * 2021-05-24 2024-04-26 西北工业大学 Image cross-scale super-resolution method based on deep learning
CN113450290B (en) * 2021-09-01 2021-11-26 中科方寸知微(南京)科技有限公司 Low-illumination image enhancement method and system based on image inpainting technology
CN113781306B (en) * 2021-09-09 2024-03-08 西北工业大学 Super-resolution reconstruction method for hyperspectral image based on double-stage strategy
CN114913069B (en) * 2022-04-26 2024-05-03 上海大学 Infrared image super-resolution reconstruction method based on deep neural network
CN115937018B (en) * 2022-11-03 2023-11-03 江苏济远医疗科技有限公司 Method for restoring image by using multichannel feedback network
CN115759202B (en) * 2022-11-22 2023-11-28 江苏济远医疗科技有限公司 Variable cavity convolution dense image super-resolution method
CN116309067B (en) * 2023-03-21 2023-09-29 安徽易刚信息技术有限公司 Light field image space super-resolution method
CN116452696B (en) * 2023-06-16 2023-08-29 山东省计算中心(国家超级计算济南中心) Image compressed sensing reconstruction method and system based on double-domain feature sampling
CN116703729B (en) * 2023-08-09 2023-12-19 荣耀终端有限公司 Image processing method, terminal, storage medium and program product
CN117132472B (en) * 2023-10-08 2024-05-31 兰州理工大学 Forward-backward separable self-attention-based image super-resolution reconstruction method
CN117172134B (en) * 2023-10-19 2024-01-16 武汉大学 Moon surface multiscale DEM modeling method based on fusion terrain features
CN118096556A (en) * 2024-04-23 2024-05-28 华东交通大学 Image reconstruction method, system, equipment and medium

Family Cites Families (12)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
EP3507773A1 (en) * 2016-09-02 2019-07-10 Artomatix Ltd. Systems and methods for providing convolutional neural network based image synthesis using stable and controllable parametric models, a multiscale synthesis framework and novel network architectures
US11024009B2 (en) * 2016-09-15 2021-06-01 Twitter, Inc. Super resolution using a generative adversarial network
CN106952228B (en) * 2017-03-10 2020-05-22 北京工业大学 Super-resolution reconstruction method of single image based on image non-local self-similarity
US10803378B2 (en) * 2017-03-15 2020-10-13 Samsung Electronics Co., Ltd System and method for designing efficient super resolution deep convolutional neural networks by cascade network training, cascade network trimming, and dilated convolutions
CN106991648B (en) * 2017-04-10 2024-01-02 中国石油大学(华东) Image super-resolution reconstruction method based on wavelet transformation and convolutional neural network
CN107392852B (en) * 2017-07-10 2020-07-07 深圳大学 Super-resolution reconstruction method, device and equipment for depth image and storage medium
CN107464216A (en) * 2017-08-03 2017-12-12 济南大学 A kind of medical image ultra-resolution ratio reconstructing method based on multilayer convolutional neural networks
CN107679477B (en) * 2017-09-27 2021-02-02 深圳市未来媒体技术研究院 Face depth and surface normal vector prediction method based on cavity convolution neural network
CN108537731B (en) * 2017-12-29 2020-04-14 西安电子科技大学 Image super-resolution reconstruction method based on compressed multi-scale feature fusion network
CN108428212A (en) * 2018-01-30 2018-08-21 中山大学 A kind of image magnification method based on double laplacian pyramid convolutional neural networks
CN108537733B (en) * 2018-04-11 2022-03-11 南京邮电大学 Super-resolution reconstruction method based on multi-path deep convolutional neural network
CN108550115B (en) * 2018-04-25 2020-06-16 中国矿业大学 Image super-resolution reconstruction method

Also Published As

Publication number Publication date
WO2020056791A1 (en) 2020-03-26
CN109389556A (en) 2019-02-26

Similar Documents

Publication Publication Date Title
CN109389556B (en) Multi-scale cavity convolutional neural network super-resolution reconstruction method and device
CN112233038B (en) True image denoising method based on multi-scale fusion and edge enhancement
CN110232394B (en) Multi-scale image semantic segmentation method
CN108537733B (en) Super-resolution reconstruction method based on multi-path deep convolutional neural network
CN110599401A (en) Remote sensing image super-resolution reconstruction method, processing device and readable storage medium
CN110136062B (en) Super-resolution reconstruction method combining semantic segmentation
US10198801B2 (en) Image enhancement using self-examples and external examples
CN110766632A (en) Image denoising method based on channel attention mechanism and characteristic pyramid
CN110136075B (en) Remote sensing image defogging method for generating countermeasure network based on edge sharpening cycle
CN111242846A (en) Fine-grained scale image super-resolution method based on non-local enhancement network
CN114757832A (en) Face super-resolution method and device based on cross convolution attention antagonistic learning
CN110443775B (en) Discrete wavelet transform domain multi-focus image fusion method based on convolutional neural network
CN113129212B (en) Image super-resolution reconstruction method and device, terminal device and storage medium
US20230252605A1 (en) Method and system for a high-frequency attention network for efficient single image super-resolution
CN114723630A (en) Image deblurring method and system based on cavity double-residual multi-scale depth network
CN113538246A (en) Remote sensing image super-resolution reconstruction method based on unsupervised multi-stage fusion network
Zhao et al. Deep pyramid generative adversarial network with local and nonlocal similarity features for natural motion image deblurring
CN116452810A (en) Multi-level semantic segmentation method and device, electronic equipment and storage medium
Esmaeilzehi et al. UPDResNN: A deep light-weight image upsampling and deblurring residual neural network
CN109993701B (en) Depth map super-resolution reconstruction method based on pyramid structure
CN117593187A (en) Remote sensing image super-resolution reconstruction method based on meta-learning and transducer
CN113096032A (en) Non-uniform blur removing method based on image area division
CN116977200A (en) Processing method and device of video denoising model, computer equipment and storage medium
CN115205148A (en) Image deblurring method based on double-path residual error network
CN115760658A (en) Image processing method, image processing device, storage medium and electronic equipment

Legal Events

Date Code Title Description
PB01 Publication
PB01 Publication
SE01 Entry into force of request for substantive examination
SE01 Entry into force of request for substantive examination
GR01 Patent grant
GR01 Patent grant