CN112132756A

CN112132756A - Attention mechanism-based single raindrop image enhancement method

Info

Publication number: CN112132756A
Application number: CN201910587306.4A
Authority: CN
Inventors: 郑顾平; 李金华; 曹锦纲
Original assignee: North China Electric Power University
Current assignee: North China Electric Power University
Priority date: 2019-06-24
Filing date: 2019-06-24
Publication date: 2020-12-25

Abstract

The invention discloses and provides a single raindrop image enhancement method based on an attention mechanism, which comprises the following steps: preprocessing a data set; performing feature extraction by adopting multi-scale expansion convolution with different expansion factors; an attention mechanism model was introduced to focus on the details of the raindrop regions in the image. The method provided by the invention can well remove the raindrop part in the raindrop image, so that the raindrop blurred image has better quality.

Description

Attention mechanism-based single raindrop image enhancement method

Technical Field

The invention relates to the field of image enhancement, in particular to a single raindrop image enhancement method.

Background

The image acquired in a rainy environment is often doped with raindrops or rainstripes to cause image blurring, more noise is easily introduced or larger raindrops cannot be removed by the conventional raindrop removing image enhancement algorithm, and in order to better remove raindrops on the image and restore more details of the image, the inventor provides a single raindrop image enhancement method based on attention-driven system.

The expanded convolution (also called dilation convolution or hole convolution) is different from other convolution methods in that it introduces a new parameter, namely a dilation rate parameter, which is used to indicate the size of dilation.

The structure of the dilation convolution is shown in fig. 1. Such as Yu et alLet us mention that F₀，F₁，....，F_n-1：Z₁→ R is a discrete function, let K₀，K₁，....，K_n-2：Z₂→ R is a discrete 3 x 3 filter, considering filtering with exponential increasing dilation:

F_i+1＝F_i×2′k_i，i＝0，1，2，....，n-2 (1)

f is to be_i+1The pixel receptive field area of the middle element p is defined as being in F₀Modifying on the basis of F_i+1Elemental values of (p) value, assume F_i+1The receptive field size of the pixel p in (a) is the number of these elements, and the receptive field size under the dilation convolution in the pixel can be expressed as:

F_i+1＝(2ⁱ⁺²-1)×(2ⁱ⁺²-1) (2)

the expansion convolution can weight the pixels and aggregate information, so that the perception visual field of the pixels can be enhanced under the condition of not losing the resolution, more characteristic information can be obtained, and the size of the output characteristic mapping can be ensured not to change.

The visual attention mechanism is a specific visual signal processing mechanism in the human brain, the essence of the attention mechanism is similar to the visual attention of human beings, and the main significance is to screen a large amount of information and select key high-value information relevant to the current task. The attention mechanism thought was originally applied to computer image vision research, and more researchers in recent years use the attention mechanism thought to combine with a neural network to perform related problem research.

Therefore, by means of the attention mechanism idea, a single raindrop image enhancement method based on the attention mechanism is provided, so that raindrop fuzzy image enhancement can be effectively improved, image details are richer, and a better image visual effect can be achieved.

Disclosure of Invention

In order to solve the above problems, the present invention provides a method for enhancing blurred raindrops and making details of an image richer.

The invention adopts the following technical scheme: firstly, extracting characteristic values of an input raindrop-containing picture by adopting multi-scale expansion convolution with different expansion factors; secondly, an attention mechanism is introduced into the model to pay attention to the details of the raindrop region in the image so as to help better remove raindrops and restore the details of the image: finally, raindrop-like blurred image enhancement is achieved end-to-end.

The method comprises the following steps:

(1) and carrying out uniform resolution processing on the data set pictures to divide a training set and a testing set.

(2) And establishing a raindrop fuzzy image enhancement model based on an attention mechanism, and initializing parameters of the network.

(3) And training the model by using the divided training set, and continuously updating the model parameters by performing back propagation through a loss function so as to minimize the calculated loss and optimize the performance effect of the model.

(4) Training every 50 pairs of pictures, setting the maximum iteration number to be 3000, storing model parameters, and testing the model by using a test set.

Preferably, the specific implementation includes: the attention mechanism model helps to extract features of the raindrop image, and the attention mechanism can enable a network to pay attention to a raindrop area, so that raindrop image enhancement can be better achieved. The attention mechanism model comprises three layers of residual error networks (ResBlock), a long and short memory neural network (LSTM) and a common convolution layer (conv) with a convolution kernel size of 3 x 3 and a step size of 1.

The preferred loss function consists of two parts, attention model loss L_attAnd a perceptual loss L_p。

L_p＝L_MSE(VGG(0)，VGG(T)) (4)

Pre-training the network by using VGG16, extracting high-level features and monitoring to ensure the quality of generated images, wherein A_tIs attention toAnd (3) drawing an attention machine generated by the mechanism model at the time T, wherein M is a binary mask, N is 4, theta is 0.8, O is a raindrop removing image output after model processing, and T is an original clear raindrop-free image corresponding to the raindrop image. An adaptive moment estimation (ADAM) algorithm is selected to optimize the loss function. The model continuously updates the weight of the neural network through a training data set, and after multiple times of training and tuning, the learning rate alpha is determined to be 0.0001, beta 1 is 0, beta 2 is 0.9, and in the training process, the value of the attention map is initialized to be 0.5.

The method has the advantages that by means of the technical scheme, the perception visual field can be effectively enhanced during feature extraction, more feature information is obtained, and raindrop areas in the image are paid more attention.

Drawings

FIG. 1 is a diagram of a dilated convolution;

FIG. 2 is a model for enhancing blurred images in rainy days based on attention mechanism

FIG. 3 attention mechanism model

Residual error network adopted by FIG. 4

FIG. 5 comparative graph of experimental results

Wherein:

in fig. 5: (a) raindrop pattern (b) clear pattern (c) model (d) tanaka (e) Eigen proposed by the invention

Detailed Description

In order to make the objects, technical solutions and advantages of the present invention more apparent, the present invention will be described in further detail with reference to the accompanying drawings. It should be understood that the detailed description and specific examples, while indicating the invention, are intended for purposes of illustration only and are not intended to limit the scope of the invention.

As shown in fig. 1, the overall structure of the raininess blurred image enhancement model based on the attention mechanism proposed by the present invention, wherein conv, dia _ c and convtranspose represent a convolutional layer, an extended convolutional layer and a deconvolution layer, respectively, and relu is an activation function. The invention provides an attention mechanism-based rainy day blurred image enhancement model, which comprises the following components:

(1) and (5) preliminarily extracting features. The input image was subjected to two convolution processes with convolution kernel sizes 7 x 64, 5 x 128 and a step size of 1,

and obtaining the preliminarily extracted features.

(2) And extracting features by multi-scale expansion convolution. Dilation convolution with convolution kernel 3 x 256, namely dia _ c1, dia _ c2, dia _ c3

The values of the factors (differentiation rates) are 2, 4 and 6, respectively, and three features are obtained after the expansion convolution processing.

(3) An attention map is generated. Inputting the three features into the attention mechanism model introduced in 3.2, outputting the result as three attention diagrams,

and performing feature fusion on the three attention diagrams and the original image.

(4) And outputting the image. The output was sequentially subjected to 3 convolution processes with a convolution kernel size of 3 × 256 and a step size of 1, 2 deconvolution operations with convolution kernel sizes of 4 × 128, 4 × 64 and a step size of 2, and convolution processes with a convolution kernel size of 3 × 64 and a step size of 1, and finally the image after rain division was output.

As shown in fig. 3, the attention mechanism model designed in the present invention is composed of three layers of residual error network (ResBlock), long and short memory neural network (LSTM), and convolutional layer (conv):

(1) residual error network: since the Normalization layer ignores absolute differences between image features, we use a residual network that removes the Normalization layer from the conventional residual network structure.

(2) Long and short memory neural networks (LSTM).

(3) And (3) rolling layers: a normal convolution with a convolution kernel size of 3 x 3 and a step size of 1 is used.

The experimental procedure was as follows:

To illustrate the superiority of the model proposed by the present invention, we performed a comparison experiment on the algorithms proposed by Tanaka et al and Eigen et al, and using the raindrop image data set self-constructed herein, the image quality evaluation results of the experiment are shown in table 1, and fig. 5 is an experimental effect diagram of 4 images in the selected test set.

Table 1 picture quality evaluation results

As can be seen from Table 1, the peak SNR of the model for rain removal proposed by Tanaka et al is 26.84, the peak SNR of the model proposed by Eigen et al is 27.41, and the peak SNR of the model proposed in this chapter reaches 28.37, while the structural similarity of the model proposed by the present invention reaches 0.9091, which is higher than that of the other two models. As can be seen from FIG. 5, the Tanaka et al method cannot remove dense raindrops and introduce much noise to the image, the Eigen et al method cannot process the image to obtain an image with abundant details, and the image has distortion, but the image processed by the model of the present invention has good effect in subjective visual perception. Therefore, the algorithm model of the invention is superior to other algorithms in terms of peak signal-to-noise ratio and structural similarity.

Claims

1. A single raindrop image enhancement method based on an attention mechanism is characterized in that a perception field of the single raindrop image enhancement method is enhanced by adopting multi-scale expansion convolution with different expansion factors to obtain more characteristic information; the model introduces an attention mechanism to construct an attention mechanism model to pay attention to a raindrop area in an image, so that raindrops are removed better, and the recovery of raindrop fuzzy image details is realized.

2. The method of claim 1, wherein the method comprises the steps of:

(1) and (5) preliminarily extracting features. The input image was subjected to two convolution processes with convolution kernel sizes of 7 x 64, 5 x 128 and step size of 1 to obtain the preliminary extracted features.

(2) And extracting features by multi-scale expansion convolution. Three features obtained after the three expanding convolution treatments.

(3) An attention map is generated. And respectively inputting the three features into the attention mechanism model, outputting the result into three attention diagrams, and performing feature fusion on the three attention diagrams and the original image.

3. The multi-scale dilation convolution of claim 2, the method comprising: the expansion convolution of the convolution kernel 3 × 256, namely dia _ c1, dia _ c2, dia _ c3, has expansion factor (expansion rate) values of 2, 4, and 6, respectively.

4. The attention mechanism model of claim 2, the method comprising the steps of:

the attention mechanism model is composed of three layers of residual error networks (ResBlock), a long and short memory neural network (LSTM) and a convolutional layer (conv):

(2) Long and short memory neural networks (LSTM);

5. The attention-based single raindrop image enhancement method according to claim 1, wherein: the network loss resistance L_advAnd a perceptual loss L_cComprises the following steps:

attention mechanism model loss L_attAnd a perceptual loss L_p。

L_p＝L_MSE(VGG(O)，VGG(T)) (2)

Pre-training the network by using VGG16, extracting high-level features and monitoring to ensure the quality of generated images, wherein A_tThe method is characterized in that an attention machine drawing is generated by an attention machine model at time T, M is a binary mask, N is 4, theta is 0.8, O is a raindrop removing image output after model processing, and T is an original clear raindrop-free image corresponding to the raindrop image.

The overall loss function of the model herein is:

L＝L_att+L_p (3)。