CN114677306A - Context aggregation image rain removing method based on edge information guidance - Google Patents

Context aggregation image rain removing method based on edge information guidance

Info

Publication number
CN114677306A
CN114677306A CN202210319123.6A
Authority
CN
China
Prior art keywords
information
image
rain
decoder
dnc
Prior art date
Legal status (The legal status is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the status listed.)
Granted
Application number
CN202210319123.6A
Other languages
Chinese (zh)
Other versions
CN114677306B (en)
Inventor
王军
左慧园
潘在宇
韩淑雨
李玉莲
Current Assignee (The listed assignees may be inaccurate. Google has not performed a legal analysis and makes no representation or warranty as to the accuracy of the list.)
China University of Mining and Technology CUMT
Original Assignee
China University of Mining and Technology CUMT
Priority date (The priority date is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the date listed.)
Filing date
Publication date
Application filed by China University of Mining and Technology CUMT filed Critical China University of Mining and Technology CUMT
Priority to CN202210319123.6A priority Critical patent/CN114677306B/en
Publication of CN114677306A publication Critical patent/CN114677306A/en
Application granted granted Critical
Publication of CN114677306B publication Critical patent/CN114677306B/en
Active legal-status Critical Current
Anticipated expiration legal-status Critical

Classifications

    • G PHYSICS
    • G06 COMPUTING; CALCULATING OR COUNTING
    • G06T IMAGE DATA PROCESSING OR GENERATION, IN GENERAL
    • G06T5/00 Image enhancement or restoration
    • G06T5/77 Retouching; Inpainting; Scratch removal
    • G PHYSICS
    • G06 COMPUTING; CALCULATING OR COUNTING
    • G06N COMPUTING ARRANGEMENTS BASED ON SPECIFIC COMPUTATIONAL MODELS
    • G06N3/00 Computing arrangements based on biological models
    • G06N3/02 Neural networks
    • G06N3/04 Architecture, e.g. interconnection topology
    • G06N3/045 Combinations of networks
    • G PHYSICS
    • G06 COMPUTING; CALCULATING OR COUNTING
    • G06N COMPUTING ARRANGEMENTS BASED ON SPECIFIC COMPUTATIONAL MODELS
    • G06N3/00 Computing arrangements based on biological models
    • G06N3/02 Neural networks
    • G06N3/08 Learning methods
    • G PHYSICS
    • G06 COMPUTING; CALCULATING OR COUNTING
    • G06T IMAGE DATA PROCESSING OR GENERATION, IN GENERAL
    • G06T5/00 Image enhancement or restoration
    • G06T5/73 Deblurring; Sharpening
    • G PHYSICS
    • G06 COMPUTING; CALCULATING OR COUNTING
    • G06T IMAGE DATA PROCESSING OR GENERATION, IN GENERAL
    • G06T2207/00 Indexing scheme for image analysis or image enhancement
    • G06T2207/20 Special algorithmic details
    • G06T2207/20081 Training; Learning

Landscapes

  • Engineering & Computer Science (AREA)
  • Physics & Mathematics (AREA)
  • Theoretical Computer Science (AREA)
  • General Physics & Mathematics (AREA)
  • General Health & Medical Sciences (AREA)
  • Molecular Biology (AREA)
  • Biophysics (AREA)
  • Computational Linguistics (AREA)
  • Data Mining & Analysis (AREA)
  • Evolutionary Computation (AREA)
  • Artificial Intelligence (AREA)
  • Biomedical Technology (AREA)
  • Computing Systems (AREA)
  • General Engineering & Computer Science (AREA)
  • Life Sciences & Earth Sciences (AREA)
  • Mathematical Physics (AREA)
  • Software Systems (AREA)
  • Health & Medical Sciences (AREA)
  • Image Analysis (AREA)
  • Image Processing (AREA)

Abstract

The invention discloses a context aggregation image rain removing method based on edge information guidance, which addresses the problem that current rain removal methods ignore the texture information and edge information of an image. A multi-scale information network is designed that comprises an upper-branch image rain removal network for obtaining coarse image rain removal information and a lower-branch edge information detection network for obtaining image edge information, together with context aggregation modules. The context aggregation modules aggregate the context information and use the aggregated information to guide the coarse image rain removal information, enhancing the upper branch's ability to represent the detail information of the image. Experimental results show that the method removes rain from the image while recovering richer texture information and edge information.

Description

Context aggregation image rain removing method based on edge information guidance
Technical Field
The invention belongs to the field of image processing and deep learning, and particularly relates to a context aggregation image rain removing method based on edge information guidance.
Background
Image edges are an important feature of an image: they mark discontinuities in the distribution of image characteristics such as pixel gray level and texture. Most of the information in an image is concentrated at its edges, and the edge structure and characteristics of an image often determine the image's overall characteristics. Deep learning has achieved excellent performance on the image rain removal task, but edge information is often ignored during rain removal: when rain streaks and raindrops are removed, edge information in the image is removed along with them, and the loss of this important edge information prevents the original image from being fully restored. Recovering edge information while removing rain is therefore equally important.
Most existing image rain removal methods either omit the restoration of image edge information or use a single backbone network to handle both rain removal and image detail restoration. Although deep-learning-based rain removal has matured, restoring important edge information while removing rain remains an open problem. The invention provides a context aggregation image rain removal method based on edge information guidance. A multi-scale information network is designed that comprises an upper-branch image rain removal network for obtaining coarse image rain removal information, a lower-branch edge information detection network for obtaining image edge information, and a context aggregation module. The context aggregation module aggregates the context information and uses the aggregated information to guide the coarse image rain removal information, enhancing the upper branch's representation of image detail. Experimental results show that the method removes rain while recovering richer texture and edge information, with no loss of resolution and a better rain removal effect.
Disclosure of Invention
The invention aims to provide a context aggregation image rain removal method based on edge information guidance, which recovers edge information while removing rain.
The technical solution for realizing the purpose of the invention is as follows: a context aggregation image rain removal method based on edge information guidance, comprising the following steps:
step 1, selecting N images, 100 < N < 10000, from the Rain200L synthetic rain removal dataset, carrying out normalization processing, taking the images resized to a uniform size (height × width = h × w) as the training sample set S, and turning to step 2;
step 2, constructing a multi-scale information network, wherein the multi-scale information network comprises an encoder Enc_P, a first decoder Dnc_R, a second decoder Dnc_E, an image output layer and three context aggregation modules EGCA_k, k = 1, 2, 3, and turning to step 3;
step 3, training the multi-scale information network by using the training sample set S to obtain the trained multi-scale information network:
step 3-1, inputting the training sample set S into the encoder Enc_P, extracting the image feature information of the training sample set S, and correspondingly obtaining the coarse image rain removal information and the image edge information by using the first decoder Dnc_R and the second decoder Dnc_E respectively;
step 3-2, performing context aggregation processing on the coarse image rain removal information and the image edge information by using the three context aggregation modules EGCA_k to obtain the aggregated information;
step 3-3, guiding the coarse image rain removal information with the aggregated information to obtain edge-information-guided image rain removal information, sending the edge-information-guided image rain removal information into the image output layer to obtain the rain-removed image and thereby the trained multi-scale information network, and turning to step 4;
step 4, reselecting M images, 100 < M < 10000, from the Rain200L synthetic rain removal dataset, unifying the size of the images to h × w through normalization processing to form the test sample set T, and turning to step 5;
step 5, inputting the rain-containing images of the test sample set T into the trained multi-scale information network to obtain rain-removed images, so that the images retain richer texture and edge information while the rain is removed and the rain removal result is more realistic.
Compared with the prior art, the invention has the advantages that:
(1) Existing image rain removal methods usually lose image detail while removing rain streaks, raindrops and similar information, so the rain removal result deviates from the original image. By recovering edge information while removing rain, the invention keeps the result closer to the original rain-free image.
(2) Existing image rain removal methods process rain removal and image detail restoration directly with a single backbone network, with poor results. The invention uses the context aggregation modules to aggregate the coarse image rain removal information obtained by the upper-branch image rain removal network with the image edge information obtained by the lower-branch edge information detection network, and uses the aggregated information to enhance the upper branch's representation of image detail, so that the image obtains richer texture and edge information while the rain is removed.
Drawings
FIG. 1 is a flowchart of a method for removing rain from a context aggregation image guided based on edge information according to the present invention.
FIG. 2 is a model diagram of a method for removing rain from a context aggregation image guided based on edge information according to the present invention.
FIG. 3 shows the results of comparison experiments between two semi-supervised image rain removal algorithms, SIRR and Syn2Real, and the method of the present invention on synthetic-domain rain-containing image samples.
Detailed Description
In order to make the objects, technical solutions and advantages of the present invention more apparent, embodiments of the present invention are described in further detail below.
With reference to fig. 1 and fig. 2, a context aggregation image rain removal method based on edge information guidance comprises the following steps:
step 1, selecting N images, 100 < N < 10000, from the Rain200L synthetic rain removal dataset, carrying out normalization processing, taking the images resized to a uniform size (height × width = h × w) as the training sample set S, and turning to step 2.
Step 2, constructing a multi-scale information network, wherein the multi-scale information network comprises an encoder Enc_P, a first decoder Dnc_R, a second decoder Dnc_E, an image output layer and three context aggregation modules EGCA_k, k = 1, 2, 3, specifically as follows:
1) the encoder Enc_P has four convolution blocks, defined as E_1, E_2, E_3, E_4; the encoder Enc_P network is defined as
F_m^P = E_m(F_{m-1}^P), m = 1, 2, 3, 4, with F_0^P = S,
wherein F_m^P represents the image feature information extracted via the m-th convolution block in the encoder, S represents the training sample set input to the encoder, and the size of F_m^P, i.e. height × width × number of channels, is h_m × w_m × c_m, where h_m = h/2^(m-1), w_m = w/2^(m-1), c_m = 32 × 2^(m-1).
2) The first decoder Dnc_R includes three convolution blocks, defined as Dr_1, Dr_2, Dr_3; the image feature information extracted by the encoder Enc_P is input into the first decoder Dnc_R to obtain the coarse image rain removal information, and the first decoder Dnc_R network is defined as
F_i^R = Dr_i(·), i = 1, 2, 3,
wherein F_i^R represents the coarse image rain removal information acquired after the i-th convolution block of the first decoder Dnc_R, and the size of F_i^R, i.e. height × width × number of channels, is h_i × w_i × c_i, where h_i = h/2^(i-1), w_i = w/2^(i-1), c_i = 32 × 2^(i-1).
3) The second decoder Dnc_E includes three convolution blocks, defined as De_1, De_2, De_3; the image feature information extracted by the encoder Enc_P is input into the second decoder Dnc_E to obtain the image edge information, and the second decoder Dnc_E network is defined as
F_j^E = De_j(·), j = 1, 2, 3,
wherein F_j^E represents the image edge information obtained via the j-th convolution block in the second decoder Dnc_E, and the size of F_j^E, i.e. height × width × number of channels, is h_j × w_j × c_j, where h_j = h/2^(j-1), w_j = w/2^(j-1), c_j = 32 × 2^(j-1).
4) Three context aggregation modules EGCA_k, k = 1, 2, 3, i.e. EGCA_1, EGCA_2, EGCA_3 (the overall layout of the encoder, the two decoders and the output layer is sketched in code below).
Turning to step 3.
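For illustration, the sketch below renders the topology of step 2 in TensorFlow/Keras. The internal layout of each convolution block is not given in the patent text, so the 3 × 3 kernels, stride-2 downsampling, transposed-convolution upsampling and ReLU activations are assumptions; only the feature sizes h_m = h/2^(m-1), c_m = 32 × 2^(m-1) and the two-branch, shared-encoder structure come from the description.

```python
# Minimal sketch of the multi-scale information network topology (step 2).
# Assumed (not specified in the patent): 3x3 kernels, stride-2 downsampling in
# E_2..E_4, transposed convolutions in Dr_i / De_j, ReLU activations.
import tensorflow as tf
from tensorflow.keras import layers

def encoder_block(x, channels, downsample):
    # E_m: one convolution block; halves the spatial size when downsample=True.
    stride = 2 if downsample else 1
    return layers.Conv2D(channels, 3, strides=stride, padding="same", activation="relu")(x)

def decoder_block(x, channels):
    # Dr_i / De_j: one convolution block that doubles the spatial size.
    return layers.Conv2DTranspose(channels, 3, strides=2, padding="same", activation="relu")(x)

def build_multiscale_net(h=256, w=256):
    s = layers.Input(shape=(h, w, 3))              # training sample S
    # Encoder Enc_P: features F_m^P with c_m = 32 * 2**(m-1), h_m = h / 2**(m-1)
    f1 = encoder_block(s,  32, downsample=False)   # 256 x 256 x 32
    f2 = encoder_block(f1, 64, downsample=True)    # 128 x 128 x 64
    f3 = encoder_block(f2, 128, downsample=True)   # 64  x 64  x 128
    f4 = encoder_block(f3, 256, downsample=True)   # 32  x 32  x 256
    # First decoder Dnc_R (upper branch): coarse de-rain features F_i^R
    r3 = decoder_block(f4, 128)                    # 64  x 64  x 128
    r2 = decoder_block(r3, 64)                     # 128 x 128 x 64
    r1 = decoder_block(r2, 32)                     # 256 x 256 x 32
    # Second decoder Dnc_E (lower branch): edge features F_j^E, sharing the encoder
    e3 = decoder_block(f4, 128)                    # 64  x 64  x 128
    e2 = decoder_block(e3, 64)                     # 128 x 128 x 64
    e1 = decoder_block(e2, 32)                     # 256 x 256 x 32
    # Image output layer: in the full method the EGCA-guided feature of step 3-3
    # would replace r1 here; the plain coarse feature is used in this sketch.
    out = layers.Conv2D(3, 3, padding="same")(r1)
    return tf.keras.Model(s, [out, r1, r2, r3, e1, e2, e3])
```

Because both decoders read the same encoder features, the two branches share the encoder weights, which is the weight-sharing property described in step 3-1 below.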
Step 3, training the multi-scale information network by using the training sample set S to obtain the trained multi-scale information network:
step 3-1, inputting the training sample set S into the encoder Enc_P, extracting the image feature information of the training sample set S, and then correspondingly obtaining the coarse image rain removal information and the image edge information by using the first decoder Dnc_R and the second decoder Dnc_E respectively, specifically as follows:
the encoder Enc_P is used to extract the image feature information of the training sample set S; the encoder Enc_P and the first decoder Dnc_R together construct the upper-branch image rain removal network for obtaining the coarse image rain removal information, and the encoder Enc_P and the second decoder Dnc_E together construct the lower-branch edge information detection network for obtaining the image edge information; using the same encoder for both branches lets the upper and lower branches share weights, which benefits both the image rain removal and the edge information detection process:
the extraction of the image feature information expands as follows:
when m = 1, F_1^P = E_1(S);
when m = 2, F_2^P = E_2(F_1^P);
when m = 3, F_3^P = E_3(F_2^P);
when m = 4, F_4^P = E_4(F_3^P);
wherein E_m(·) denotes the operation of the m-th convolution block in the encoder, m = 1, 2, 3, 4.
The acquisition of the coarse image rain removal information by the upper-branch image rain removal network expands as follows:
when i = 1, F_1^R = Dr_1(·);
when i = 2, F_2^R = Dr_2(·);
when i = 3, F_3^R = Dr_3(·);
wherein Dr_i(·) denotes the operation of obtaining the coarse image rain removal information via the i-th convolution block of the first decoder Dnc_R, i = 1, 2, 3.
The acquisition of the image edge information by the lower-branch edge information detection network expands as follows:
when j = 1, F_1^E = De_1(·);
when j = 2, F_2^E = De_2(·);
when j = 3, F_3^E = De_3(·);
wherein De_j(·) denotes the operation of acquiring the image edge information via the j-th convolution block in the second decoder Dnc_E, j = 1, 2, 3.
Step 3-2, utilizing three context aggregation modules EGCAkPerforming context aggregation processing on the rain removing information and the image edge information of the coarse adjustment image to obtain aggregated information
Figure BDA00035709666200000511
The method comprises the following specific steps:
firstly, the size of the image characteristic information is hi×wi×ciCoarse adjustment of image rain removal information
Figure BDA00035709666200000512
And the size of the image characteristic information is hj×wj×cjImage edge information
Figure BDA00035709666200000513
Respectively carrying out three convolution kernels with the size of 1 multiplied by 1 to obtain three image characteristic information with the sizes of hi×wi×ci/2、hi×wi×ciH and 2j×wj×cj[ 2 ] of
Figure BDA00035709666200000514
Convolution of image information, i.e.
Figure BDA00035709666200000515
A second step of
Figure BDA0003570966620000061
The convolution image information is subjected to image characteristic recombination transformation to obtain the image characteristic information with the size of hi×wi×ciFirst recombined image information of/2
Figure BDA0003570966620000062
And image feature information size of cj/2×hj×wjSecond reconstructed image information of
Figure BDA0003570966620000063
Namely, it is
Figure BDA0003570966620000064
Thirdly, the first recombined image information is processed
Figure BDA0003570966620000065
And second reconstructed image information
Figure BDA0003570966620000066
Matrix multiplication is carried out to obtain preliminary image characteristic information Feture1kThe size of the image feature information is (h)i×wj)×(hj×wj) I.e. by
Figure BDA0003570966620000067
Figure BDA0003570966620000068
Represents a matrix multiplication;
fourthly, the preliminary image characteristic information Feture1 is processedkAfter being processed by the normalization layer, the mixture is mixed with
Figure BDA0003570966620000069
Matrix multiplication is carried out to obtain final image characteristic information Feture2kThe size of the image characteristic information is hi×wi×ci2, i.e. that
Figure BDA00035709666200000610
Fifthly, the final characteristic diagram information Feture2kAfter a convolution kernel with the size of 1 multiplied by 1, the information after the polymerization treatment is obtained
Figure BDA00035709666200000611
Image characteristic information size is hi×wi×ciI.e. by
Figure BDA00035709666200000612
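The five steps of step 3-2 describe a non-local, attention-style aggregation. The sketch below follows those steps literally; the choice of softmax as the normalization layer and the split of the three 1 × 1 convolutions (two on the de-rain feature, one on the edge feature) are assumptions, since the text only fixes the output sizes.

```python
# Sketch of one context aggregation module EGCA_k following the five steps of step 3-2.
# Assumed: softmax as the normalization layer; two 1x1 convolutions applied to F_i^R
# and one to F_j^E (the text only gives the sizes h x w x c/2 of the three maps).
import tensorflow as tf
from tensorflow.keras import layers

def egca_block(f_r, f_e):
    """f_r: coarse de-rain feature (B, h, w, c); f_e: edge feature (B, h, w, c)."""
    h, w, c = f_r.shape[1], f_r.shape[2], f_r.shape[3]
    # Step 1: three 1x1 convolutions that halve the channel number.
    conv_r1 = layers.Conv2D(c // 2, 1)(f_r)                 # (B, h, w, c/2)
    conv_r2 = layers.Conv2D(c // 2, 1)(f_r)                 # (B, h, w, c/2)
    conv_e  = layers.Conv2D(c // 2, 1)(f_e)                 # (B, h, w, c/2)
    # Step 2: feature recombination (reshape) into matrices.
    first_rec  = tf.reshape(conv_r1, (-1, h * w, c // 2))   # first recombined map
    second_rec = tf.transpose(tf.reshape(conv_e, (-1, h * w, c // 2)), (0, 2, 1))  # (B, c/2, h*w)
    # Step 3: matrix multiplication -> preliminary map Feature1_k, (B, h*w, h*w).
    feature1 = tf.matmul(first_rec, second_rec)
    # Step 4: normalization layer (softmax assumed), then multiply with the remaining map.
    attn = tf.nn.softmax(feature1, axis=-1)
    remaining = tf.reshape(conv_r2, (-1, h * w, c // 2))
    feature2 = tf.matmul(attn, remaining)                   # (B, h*w, c/2)
    feature2 = tf.reshape(feature2, (-1, h, w, c // 2))     # Feature2_k: (B, h, w, c/2)
    # Step 5: 1x1 convolution restores the channel number -> aggregated info F_k^A.
    return layers.Conv2D(c, 1)(feature2)                    # (B, h, w, c)
```

A call of this sketch with the Example 1 channel sizes is shown after step 3-2 of Example 1 below.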
Step 3-3, utilizing the information after the polymerization treatment
Figure BDA00035709666200000613
Guiding the rain removing information of the rough-adjusted image to obtain edge information guided image rain removing information, sending the edge information guided image rain removing information to an image output layer to obtain a rain removing image, and further obtaining a trained multi-scale information network, wherein the method specifically comprises the following steps:
defining guiding coarse adjustment image rain removal information as edge information guiding image rain removal information
Figure BDA00035709666200000614
Namely, it is
Figure BDA00035709666200000615
Wherein,
Figure BDA0003570966620000071
representing the rain removal information of the coarse image obtained by the ith convolution block in the first decoder,
Figure BDA0003570966620000072
representing the image edge information obtained by the jth convolutional block in the second decoder,
Figure BDA0003570966620000073
and the information aggregated by the k-th context aggregation module is represented.
The process of guiding the coarse image rain removal information with the aggregated information F_k^A expands as follows:
when k = 1, the aggregated information F_1^A guides the coarse rain removal information to give F_1^G;
when k = 2, the aggregated information F_2^A guides the coarse rain removal information to give F_2^G;
when k = 3, the aggregated information F_3^A guides the coarse rain removal information to give F_3^G.
The guided rain removal information is then passed through the image output layer to obtain the rain-removed image, giving the trained multi-scale information network; turning to step 4.
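The operator that combines F_k^A with the coarse rain removal feature is defined in the patent's formula figures, which are not reproduced in this text; the sketch below is a hedged stand-in that assumes a simple element-wise addition purely for illustration.

```python
# Hedged sketch of step 3-3: each aggregated feature F_k^A guides the coarse de-rain
# feature at the matching scale. The fusion operator is NOT specified in the available
# text; element-wise addition is assumed here as one plausible instantiation.
import tensorflow as tf
from tensorflow.keras import layers

def guide(coarse_derain, aggregated):
    # F_k^G = F_k^R combined with F_k^A (addition assumed; concatenation + 1x1 conv
    # would be an equally plausible reading).
    return coarse_derain + aggregated

def image_output_layer(guided_full_res):
    # Maps the guided full-resolution feature back to an RGB rain-removed image.
    return layers.Conv2D(3, 3, padding="same")(guided_full_res)

# Usage with the k = 1 scale sizes from the description (batch of 1):
f1_r = tf.random.normal((1, 256, 256, 32))   # stands in for F_1^R
f1_a = tf.random.normal((1, 256, 256, 32))   # stands in for F_1^A from EGCA_1
rain_removed = image_output_layer(guide(f1_r, f1_a))
print(rain_removed.shape)                    # (1, 256, 256, 3)
```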
Step 4, reselecting M images, 100 < M < 10000, from the Rain200L synthetic rain removal dataset, unifying the size of the images to h × w through normalization processing to form the test sample set T, and turning to step 5.
Step 5, inputting the rain-containing images of the test sample set T into the trained multi-scale information network to obtain rain-removed images; the images retain richer texture and edge information while the rain is removed, making the rain removal result more realistic.
Example 1
With reference to fig. 1 and fig. 2, the context aggregation image rain removal method based on edge information guidance according to the present invention comprises the following steps:
step 1, selecting 1800 images from the Rain200L synthetic rain removal dataset, carrying out normalization processing, taking the images resized to 256 × 256 as the training sample set S, and turning to step 2.
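For reference, a minimal data-loading sketch for this step is given below. It assumes the rainy images of Rain200L are stored as PNG files on disk; the directory path and glob pattern are hypothetical placeholders, not part of the dataset's specification.

```python
# Minimal sketch of step 1: load rainy images, normalize to [0, 1] and resize to 256 x 256.
# The directory layout and file pattern below are hypothetical.
import tensorflow as tf

IMG_SIZE = 256

def load_and_preprocess(path):
    img = tf.io.read_file(path)
    img = tf.image.decode_png(img, channels=3)
    img = tf.image.convert_image_dtype(img, tf.float32)     # normalization to [0, 1]
    return tf.image.resize(img, (IMG_SIZE, IMG_SIZE))       # uniform 256 x 256 size

def make_training_set(pattern="Rain200L/train/rain/*.png"):  # hypothetical layout
    files = tf.data.Dataset.list_files(pattern, shuffle=True)
    return files.map(load_and_preprocess, num_parallel_calls=tf.data.AUTOTUNE)

train_set_S = make_training_set()
```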
Step 2, constructing a multi-scale information network, wherein the multi-scale information network comprises an encoder Enc_P, a first decoder Dnc_R, a second decoder Dnc_E, an image output layer and three context aggregation modules EGCA_1, EGCA_2, EGCA_3, specifically as follows:
1) the encoder Enc_P has four convolution blocks, defined as E_1, E_2, E_3, E_4; the encoder Enc_P network is defined as
F_m^P = E_m(F_{m-1}^P), m = 1, 2, 3, 4, with F_0^P = S,
wherein F_m^P represents the image feature information extracted via the m-th convolution block in the encoder and S represents the training sample set input to the encoder; F_1^P has size 256 × 256 × 32, F_2^P has size 128 × 128 × 64, F_3^P has size 64 × 64 × 128, and F_4^P has size 32 × 32 × 256.
2) The first decoder Dnc_R includes three convolution blocks, defined as Dr_1, Dr_2, Dr_3; the image feature information extracted by the encoder Enc_P is input into the first decoder Dnc_R to obtain the coarse image rain removal information, and the first decoder Dnc_R network is defined as
F_i^R = Dr_i(·), i = 1, 2, 3,
wherein F_i^R represents the coarse image rain removal information acquired after the i-th convolution block of the first decoder Dnc_R; F_1^R has size 256 × 256 × 32, F_2^R has size 128 × 128 × 64, and F_3^R has size 64 × 64 × 128.
3) The second decoder Dnc_E includes three convolution blocks, defined as De_1, De_2, De_3; the image feature information extracted by the encoder Enc_P is input into the second decoder Dnc_E to obtain the image edge information, and the second decoder Dnc_E network is defined as
F_j^E = De_j(·), j = 1, 2, 3,
wherein F_j^E represents the image edge information obtained by the j-th convolution block in the second decoder Dnc_E; F_1^E has size 256 × 256 × 32, F_2^E has size 128 × 128 × 64, and F_3^E has size 64 × 64 × 128.
4) Three context aggregation modules EGCA_1, EGCA_2, EGCA_3 (the feature sizes listed above are checked against the general formulas in the short sketch below). Turning to step 3.
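The Example 1 feature sizes follow directly from the formulas h_m = h/2^(m-1), w_m = w/2^(m-1), c_m = 32 × 2^(m-1) of the disclosure with h = w = 256; a quick standalone check:

```python
# Check of the Example 1 feature sizes against the formulas of the disclosure.
h = w = 256
for m in range(1, 5):                         # encoder blocks E_1 .. E_4
    hm, wm, cm = h // 2**(m - 1), w // 2**(m - 1), 32 * 2**(m - 1)
    print(f"F_{m}^P: {hm} x {wm} x {cm}")
# F_1^P: 256 x 256 x 32
# F_2^P: 128 x 128 x 64
# F_3^P: 64 x 64 x 128
# F_4^P: 32 x 32 x 256
# The decoder features F_i^R and F_j^E (i, j = 1, 2, 3) follow the same formulas.
```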
Step 3, training the multi-scale information network by using the training sample set S to obtain the trained multi-scale information network:
step 3-1, inputting the training sample set S into the encoder Enc_P, extracting the image feature information of the training sample set S, and then correspondingly obtaining the coarse image rain removal information and the image edge information by using the first decoder Dnc_R and the second decoder Dnc_E respectively, specifically as follows:
the encoder Enc_P is used to extract the image feature information F_m^P of the training sample set S; the encoder Enc_P and the first decoder Dnc_R together construct the upper-branch image rain removal network for obtaining the coarse image rain removal information F_i^R, and the encoder Enc_P and the second decoder Dnc_E together construct the lower-branch edge information detection network for obtaining the image edge information F_j^E; using the same encoder for both branches lets the upper and lower branches share weights, which benefits both the image rain removal and the edge information detection process:
the extraction of the image feature information expands as follows:
when m = 1, F_1^P = E_1(S);
when m = 2, F_2^P = E_2(F_1^P);
when m = 3, F_3^P = E_3(F_2^P);
when m = 4, F_4^P = E_4(F_3^P).
the acquisition of the coarse image rain removal information by the upper-branch image rain removal network expands as follows:
when i = 1, F_1^R = Dr_1(·);
when i = 2, F_2^R = Dr_2(·);
when i = 3, F_3^R = Dr_3(·).
the acquisition of the image edge information by the lower-branch edge information detection network expands as follows:
when j = 1, F_1^E = De_1(·);
when j = 2, F_2^E = De_2(·);
when j = 3, F_3^E = De_3(·).
step 3-2, performing context aggregation processing on the coarse image rain removal information and the image edge information by using the three context aggregation modules EGCA_k to obtain the aggregated information F_k^A, illustrated here for k = 1:
first, the coarse image rain removal information F_1^R of size 256 × 256 × 32 and the image edge information F_1^E of size 256 × 256 × 32 are passed through 1 × 1 convolution kernels that reduce the number of channels, giving three convolution feature maps of size 256 × 256 × 16;
second, two of these convolution feature maps undergo feature recombination (reshape) transformation, giving the first recombined feature map of size 256 × 256 × 16 and the second recombined feature map of size 16 × 256 × 256;
third, the first recombined feature map and the second recombined feature map are matrix-multiplied to obtain the preliminary feature map Feature1_1 of size (256 × 256) × (256 × 256);
fourth, the preliminary feature map Feature1_1 is processed by a normalization layer and then matrix-multiplied with the remaining convolution feature map to obtain the final feature map Feature2_1 of size 256 × 256 × 16;
fifth, the final feature map Feature2_1 is passed through a 1 × 1 convolution kernel to obtain the aggregated information F_1^A of size 256 × 256 × 32.
The aggregated information F_2^A and F_3^A of the second and third context aggregation modules is obtained in the same way at the corresponding scales.
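A usage example for the egca_block sketch given after step 3-2 of the disclosure above (it assumes that function is in scope). A reduced 32 × 32 spatial size is used only so that the (h·w) × (h·w) affinity matrix stays small in the demonstration; the channel arithmetic 32 → 16 → 32 matches the sizes listed above.

```python
# Usage of the egca_block sketch defined earlier (assumed to be in scope).
# Reduced spatial size keeps the (h*w) x (h*w) affinity matrix small for the demo;
# at the full 256 x 256 resolution the same code applies with a much larger matrix.
import tensorflow as tf

f_r = tf.random.normal((1, 32, 32, 32))    # stands in for F_1^R (coarse de-rain feature)
f_e = tf.random.normal((1, 32, 32, 32))    # stands in for F_1^E (edge feature)
f_a = egca_block(f_r, f_e)                 # aggregated information F_1^A
print(f_a.shape)                           # (1, 32, 32, 32): channels restored to 32
```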
Step 3-3, utilizing the information after the polymerization treatment
Figure BDA0003570966620000111
Removing rain information from coarse adjustment image
Figure BDA0003570966620000112
Guiding to obtain edge information guided image rain removal information
Figure BDA0003570966620000113
Sending the image rain removing information guided by the edge information into an image output layer to obtain a rain removing image, and further obtaining a trained multi-scale information network, wherein the rain removing information comprises the following specific steps:
defining guiding coarse adjustment image rain removing information as edge information guiding image rain removing information
Figure BDA0003570966620000114
Namely, it is
Figure BDA0003570966620000115
Wherein,
Figure BDA0003570966620000116
representing the rain removal information of the coarse image obtained by the ith convolution block in the first decoder,
Figure BDA0003570966620000117
representing the image edge information obtained by the jth convolutional block in the second decoder,
Figure BDA0003570966620000118
and the information aggregated by the k-th context aggregation module is represented.
Using information after aggregation processing
Figure BDA0003570966620000119
The specific process of guiding the rain removal information of the coarse adjustment image is as follows:
when k is equal to 1, the first step is carried out,
Figure BDA00035709666200001110
when the k is equal to 2, the reaction condition is as follows,
Figure BDA00035709666200001111
when k is 3, the number of the groups is 3,
Figure BDA00035709666200001112
then will be
Figure BDA00035709666200001113
And (4) obtaining a rain removing image after passing through the image output layer, further obtaining a trained multi-scale information network, and turning to the step 4.
Step 4, reselecting 1400 images from the Rain200L synthetic rain removal dataset, unifying the size of the images to 256 × 256 through normalization processing to form the test sample set T, and turning to step 5.
Step 5, inputting the rain-containing images of the test sample set T into the trained multi-scale information network to obtain rain-removed images; the images retain richer texture and edge information while the rain is removed, making the rain removal result more realistic.
The method of the invention uses the Python programming language and the TensorFlow framework to build the network and run the experiments on a host with an Nvidia 2080Ti GPU. First, the encoder Enc_P, the first decoder Dnc_R and the second decoder Dnc_E of the multi-scale information network are trained; each convolutional layer uses a ReLU activation function, the learning rate of the network is set to 2e-4, the batch size for training the encoder Enc_P, the first decoder Dnc_R and the second decoder Dnc_E is set to 3, and training runs for 400 iterations. Then the context aggregation modules are trained, using the ReLU activation function and a sigmoid activation function in the SE attention mechanism, with the learning rate set to 2e-4, the batch size set to 2, and 400 training iterations. During network training, input images are normalized to 256 × 256, giving the complete rain removal network model.
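A minimal training-loop sketch matching the stated settings (learning rate 2e-4, batch size 3 for the encoder/decoder stage, 400 iterations) is shown below. The optimizer (Adam) and the loss (mean absolute error against the clean image) are assumptions; the text above only fixes the learning rate, batch sizes, iteration count and activations.

```python
# Sketch of the encoder/decoder training stage described above. Optimizer (Adam) and
# L1 loss are assumptions; lr=2e-4, batch size 3 and 400 iterations follow the text.
import tensorflow as tf

def train_encoder_decoders(model, paired_dataset, iterations=400, batch_size=3, lr=2e-4):
    # paired_dataset is assumed to yield (rainy, clean) image pairs.
    opt = tf.keras.optimizers.Adam(learning_rate=lr)
    data = paired_dataset.batch(batch_size).repeat()
    for step, (rainy, clean) in enumerate(data.take(iterations)):
        with tf.GradientTape() as tape:
            derained = model(rainy, training=True)[0]        # first output: rain-removed image
            loss = tf.reduce_mean(tf.abs(derained - clean))  # L1 loss (assumed)
        grads = tape.gradient(loss, model.trainable_variables)
        opt.apply_gradients(zip(grads, model.trainable_variables))
        if step % 100 == 0:
            print(f"iteration {step}: loss = {loss.numpy():.4f}")
```

The context aggregation modules would then be trained in a second stage with batch size 2 at the same learning rate, as stated above.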
To better demonstrate the effect of the proposed algorithm on image rain removal, a model visualization experiment was designed according to Example 1. The rain removal effect of the image after each context aggregation module is visualized, and the contribution of each edge-information-guided stage is judged visually. Experiments with the two semi-supervised image rain removal algorithms SIRR and Syn2Real were also carried out on synthetic-domain rain-containing image samples; comparing their results with the experimental results of the invention shows that the invention not only achieves a good image rain removal effect but also recovers the detail information and edge information of the image.

Claims (6)

1. A context aggregation image rain removing method based on edge information guidance, characterized by comprising the following steps:
step 1, selecting N images, 100 < N < 10000, from the Rain200L synthetic rain removal dataset, carrying out normalization processing, taking the images resized to a uniform size (height × width = h × w) as the training sample set S, and turning to step 2;
step 2, constructing a multi-scale information network, wherein the multi-scale information network comprises an encoder Enc_P, a first decoder Dnc_R, a second decoder Dnc_E, an image output layer and three context aggregation modules EGCA_k, k = 1, 2, 3, and turning to step 3;
step 3, training the multi-scale information network by using the training sample set S to obtain the trained multi-scale information network:
step 3-1, inputting the training sample set S into the encoder Enc_P, extracting the image feature information of the training sample set S, correspondingly obtaining the coarse image rain removal information and the image edge information by using the first decoder Dnc_R and the second decoder Dnc_E respectively, and turning to step 3-2;
step 3-2, performing context aggregation processing on the coarse image rain removal information and the image edge information by using the three context aggregation modules EGCA_k to obtain the aggregated information, and turning to step 3-3;
step 3-3, guiding the coarse image rain removal information with the aggregated information to obtain edge-information-guided image rain removal information, sending the edge-information-guided image rain removal information into the image output layer to obtain the rain-removed image and thereby the trained multi-scale information network, and turning to step 4;
step 4, reselecting M images, 100 < M < 10000, from the Rain200L synthetic rain removal dataset, unifying the size of the images to h × w through normalization processing to form the test sample set T, and turning to step 5;
step 5, inputting the rain-containing images of the test sample set T into the trained multi-scale information network to obtain rain-removed images, so that the images retain richer texture and edge information while the rain is removed and the rain removal result is more realistic.
2. The method of claim 1, wherein in step 2 a multi-scale information network is constructed, the multi-scale information network comprising an encoder Enc_P, a first decoder Dnc_R, a second decoder Dnc_E, an image output layer, and three context aggregation modules EGCA_k with context aggregation module index k = 1, 2, 3, specifically as follows:
1) the encoder Enc_P has four convolution blocks, defined as E_1, E_2, E_3, E_4; the encoder Enc_P network is defined as
F_m^P = E_m(F_{m-1}^P), m = 1, 2, 3, 4, with F_0^P = S,
wherein F_m^P represents the image feature information extracted via the m-th convolution block in the encoder, S represents the training sample set input to the encoder, and the size of F_m^P, i.e. height × width × number of channels, is h_m × w_m × c_m, where h_m = h/2^(m-1), w_m = w/2^(m-1), c_m = 32 × 2^(m-1);
2) the first decoder Dnc_R includes three convolution blocks, defined as Dr_1, Dr_2, Dr_3; the image feature information extracted by the encoder Enc_P is input into the first decoder Dnc_R to obtain the coarse image rain removal information, and the first decoder Dnc_R network is defined as
F_i^R = Dr_i(·), i = 1, 2, 3,
wherein F_i^R represents the coarse image rain removal information acquired after the i-th convolution block of the first decoder Dnc_R, and the size of F_i^R, i.e. height × width × number of channels, is h_i × w_i × c_i, where h_i = h/2^(i-1), w_i = w/2^(i-1), c_i = 32 × 2^(i-1);
3) the second decoder Dnc_E includes three convolution blocks, defined as De_1, De_2, De_3; the image feature information extracted by the encoder Enc_P is input into the second decoder Dnc_E to obtain the image edge information, and the second decoder Dnc_E network is defined as
F_j^E = De_j(·), j = 1, 2, 3,
wherein F_j^E represents the image edge information obtained via the j-th convolution block in the second decoder Dnc_E, and the size of F_j^E, i.e. height × width × number of channels, is h_j × w_j × c_j, where h_j = h/2^(j-1), w_j = w/2^(j-1), c_j = 32 × 2^(j-1);
4) three context aggregation modules EGCA_k, k = 1, 2, 3, i.e. EGCA_1, EGCA_2, EGCA_3.
3. The method according to claim 2, wherein h_m = h_i = h_j, w_m = w_i = w_j, and c_m = c_i = c_j.
4. The method as claimed in claim 2, wherein in step 3-1 the training sample set S is input to the encoder Enc_P, the image feature information is extracted, and then the first decoder Dnc_R and the second decoder Dnc_E are respectively used to obtain the coarse image rain removal information and the image edge information, specifically as follows:
the encoder Enc_P is configured to extract the image feature information of the training sample set S, the encoder Enc_P and the first decoder Dnc_R together construct an upper-branch image rain removal network for obtaining the coarse image rain removal information, and the encoder Enc_P and the second decoder Dnc_E together construct a lower-branch edge information detection network for obtaining the image edge information:
the extraction of the image feature information expands as follows:
when m = 1, F_1^P = E_1(S);
when m = 2, F_2^P = E_2(F_1^P);
when m = 3, F_3^P = E_3(F_2^P);
when m = 4, F_4^P = E_4(F_3^P);
wherein E_m(·) denotes the operation of extracting information via the m-th convolution block in the encoder, m = 1, 2, 3, 4;
the acquisition of the coarse image rain removal information by the upper-branch image rain removal network expands as follows:
when i = 1, F_1^R = Dr_1(·);
when i = 2, F_2^R = Dr_2(·);
when i = 3, F_3^R = Dr_3(·);
wherein Dr_i(·) denotes the operation of obtaining the coarse image rain removal information via the i-th convolution block of the first decoder Dnc_R, i = 1, 2, 3;
the acquisition of the image edge information by the lower-branch edge information detection network expands as follows:
when j = 1, F_1^E = De_1(·);
when j = 2, F_2^E = De_2(·);
when j = 3, F_3^E = De_3(·);
wherein De_j(·) denotes the operation of acquiring the image edge information via the j-th convolution block in the second decoder Dnc_E, j = 1, 2, 3.
5. The method as claimed in claim 4, wherein in step 3-2 the three context aggregation modules EGCA_k are used to perform context aggregation processing on the coarse image rain removal information and the image edge information to obtain the aggregated information, specifically as follows:
first, the coarse image rain removal information F_i^R of size h_i × w_i × c_i and the image edge information F_j^E of size h_j × w_j × c_j are respectively passed through 1 × 1 convolution kernels to obtain three convolution feature maps of sizes h_i × w_i × c_i/2, h_i × w_i × c_i/2 and h_j × w_j × c_j/2;
second, two of the convolution feature maps undergo feature recombination transformation, giving the first recombined feature map of size h_i × w_i × c_i/2 and the second recombined feature map of size c_j/2 × h_j × w_j;
third, the first recombined feature map and the second recombined feature map are matrix-multiplied to obtain the preliminary feature map Feature1_k of size (h_i × w_i) × (h_j × w_j);
fourth, the preliminary feature map Feature1_k is processed by a normalization layer and then matrix-multiplied with the remaining convolution feature map to obtain the final feature map Feature2_k of size h_i × w_i × c_i/2;
fifth, the final feature map Feature2_k is passed through a 1 × 1 convolution kernel to obtain the aggregated information F_k^A of size h_i × w_i × c_i.
6. The context aggregation image rain removing method based on edge information guidance as claimed in claim 5, wherein in step 3-3 the aggregated information F_k^A is used to guide the coarse image rain removal information, specifically as follows:
the guided coarse image rain removal information is defined as the edge-information-guided image rain removal information F_k^G,
wherein F_i^R represents the coarse image rain removal information obtained by the i-th convolution block in the first decoder, F_j^E represents the image edge information obtained by the j-th convolution block in the second decoder, and F_k^A represents the information aggregated by the k-th context aggregation module;
the process of guiding the coarse image rain removal information with the aggregated information F_k^A expands as follows:
when k = 1, the aggregated information F_1^A guides the coarse rain removal information to give F_1^G;
when k = 2, the aggregated information F_2^A guides the coarse rain removal information to give F_2^G;
when k = 3, the aggregated information F_3^A guides the coarse rain removal information to give F_3^G;
the guided rain removal information is then passed through the image output layer to obtain the rain-removed image, thereby obtaining the trained multi-scale information network.
CN202210319123.6A 2022-03-29 2022-03-29 Context aggregation image rain removing method based on edge information guidance Active CN114677306B (en)

Priority Applications (1)

Application Number Priority Date Filing Date Title
CN202210319123.6A CN114677306B (en) 2022-03-29 2022-03-29 Context aggregation image rain removing method based on edge information guidance

Applications Claiming Priority (1)

Application Number Priority Date Filing Date Title
CN202210319123.6A CN114677306B (en) 2022-03-29 2022-03-29 Context aggregation image rain removing method based on edge information guidance

Publications (2)

Publication Number Publication Date
CN114677306A true CN114677306A (en) 2022-06-28
CN114677306B CN114677306B (en) 2022-11-15

Family

ID=82076161

Family Applications (1)

Application Number Title Priority Date Filing Date
CN202210319123.6A Active CN114677306B (en) 2022-03-29 2022-03-29 Context aggregation image rain removing method based on edge information guidance

Country Status (1)

Country Link
CN (1) CN114677306B (en)

Citations (14)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN108765344A (en) * 2018-05-30 2018-11-06 南京信息工程大学 A method of the single image rain line removal based on depth convolutional neural networks
US10482603B1 (en) * 2019-06-25 2019-11-19 Artificial Intelligence, Ltd. Medical image segmentation using an integrated edge guidance module and object segmentation network
CN111553851A (en) * 2020-04-08 2020-08-18 大连理工大学 Video rain removing method based on time domain rain line decomposition and spatial structure guidance
CN112070690A (en) * 2020-08-25 2020-12-11 西安理工大学 Single image rain removing method based on convolutional neural network double-branch attention generation
CN112070687A (en) * 2020-08-20 2020-12-11 武汉大学 Image rain removing method and system based on team recursive feedback mechanism
CN112184573A (en) * 2020-09-15 2021-01-05 西安理工大学 Context aggregation residual single image rain removing method based on convolutional neural network
CN112347859A (en) * 2020-10-15 2021-02-09 北京交通大学 Optical remote sensing image saliency target detection method
US20210150747A1 (en) * 2019-11-14 2021-05-20 Samsung Electronics Co., Ltd. Depth image generation method and device
CN113240613A (en) * 2021-06-07 2021-08-10 北京航空航天大学 Image restoration method based on edge information reconstruction
CN113408398A (en) * 2021-06-16 2021-09-17 西安电子科技大学 Remote sensing image cloud detection method based on channel attention and probability up-sampling
CN113450278A (en) * 2021-06-30 2021-09-28 中国矿业大学 Image rain removing method based on cross-domain collaborative learning
CN113673590A (en) * 2021-08-13 2021-11-19 广东工业大学 Rain removing method, system and medium based on multi-scale hourglass dense connection network
CN113962905A (en) * 2021-12-03 2022-01-21 四川大学 Single image rain removing method based on multi-stage feature complementary network
CN114187275A (en) * 2021-12-13 2022-03-15 贵州大学 Multi-stage and multi-scale attention fusion network and image rain removing method

Patent Citations (14)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN108765344A (en) * 2018-05-30 2018-11-06 南京信息工程大学 A method of the single image rain line removal based on depth convolutional neural networks
US10482603B1 (en) * 2019-06-25 2019-11-19 Artificial Intelligence, Ltd. Medical image segmentation using an integrated edge guidance module and object segmentation network
US20210150747A1 (en) * 2019-11-14 2021-05-20 Samsung Electronics Co., Ltd. Depth image generation method and device
CN111553851A (en) * 2020-04-08 2020-08-18 大连理工大学 Video rain removing method based on time domain rain line decomposition and spatial structure guidance
CN112070687A (en) * 2020-08-20 2020-12-11 武汉大学 Image rain removing method and system based on team recursive feedback mechanism
CN112070690A (en) * 2020-08-25 2020-12-11 西安理工大学 Single image rain removing method based on convolutional neural network double-branch attention generation
CN112184573A (en) * 2020-09-15 2021-01-05 西安理工大学 Context aggregation residual single image rain removing method based on convolutional neural network
CN112347859A (en) * 2020-10-15 2021-02-09 北京交通大学 Optical remote sensing image saliency target detection method
CN113240613A (en) * 2021-06-07 2021-08-10 北京航空航天大学 Image restoration method based on edge information reconstruction
CN113408398A (en) * 2021-06-16 2021-09-17 西安电子科技大学 Remote sensing image cloud detection method based on channel attention and probability up-sampling
CN113450278A (en) * 2021-06-30 2021-09-28 中国矿业大学 Image rain removing method based on cross-domain collaborative learning
CN113673590A (en) * 2021-08-13 2021-11-19 广东工业大学 Rain removing method, system and medium based on multi-scale hourglass dense connection network
CN113962905A (en) * 2021-12-03 2022-01-21 四川大学 Single image rain removing method based on multi-stage feature complementary network
CN114187275A (en) * 2021-12-13 2022-03-15 贵州大学 Multi-stage and multi-scale attention fusion network and image rain removing method

Non-Patent Citations (2)

* Cited by examiner, † Cited by third party
Title
YUXIA LI et al.: "Road segmentation of unmanned aerial vehicle remote sensing images using adversarial network with multiscale context aggregation", IEEE Journal of Selected Topics in Applied Earth Observations and Remote Sensing *
ZHU Deli et al.: "Single-image rain removal method based on attention generative adversarial network", Computer Engineering and Applications *

Also Published As

Publication number Publication date
CN114677306B (en) 2022-11-15

Similar Documents

Publication Publication Date Title
CN108510485B (en) Non-reference image quality evaluation method based on convolutional neural network
CN111047541B (en) Image restoration method based on wavelet transformation attention model
CN112508083B (en) Image rain and fog removing method based on unsupervised attention mechanism
CN109903236B (en) Face image restoration method and device based on VAE-GAN and similar block search
CN110458085B (en) Video behavior identification method based on attention-enhanced three-dimensional space-time representation learning
CN112184582B (en) Attention mechanism-based image completion method and device
CN112489164B (en) Image coloring method based on improved depth separable convolutional neural network
CN113298734B (en) Image restoration method and system based on mixed hole convolution
CN112967210B (en) Unmanned aerial vehicle image denoising method based on full convolution twin network
Song et al. A new recurrent plug-and-play prior based on the multiple self-similarity network
CN116168197A (en) Image segmentation method based on Transformer segmentation network and regularization training
CN113962905A (en) Single image rain removing method based on multi-stage feature complementary network
CN108090914B (en) Color image segmentation method based on statistical modeling and pixel classification
CN114677306B (en) Context aggregation image rain removing method based on edge information guidance
CN111814884A (en) Target detection network model upgrading method based on deformable convolution
CN111199288A (en) Novel multi-head attention mechanism
CN116883265A (en) Image deblurring method based on enhanced feature fusion mechanism
CN113034390B (en) Image restoration method and system based on wavelet prior attention
CN114638845A (en) Quantum image segmentation method and device based on double thresholds and storage medium
CN104574320B (en) A kind of image super-resolution restored method based on sparse coding coefficients match
CN114841895A (en) Image shadow removing method based on bidirectional mapping network
CN113781333A (en) Method for processing underwater image by GAN network based on guided filtering
CN110288525B (en) Multi-dictionary super-resolution image reconstruction method
Liu et al. Graph representation learning for spatial image steganalysis
Zhang et al. Scale-progressive multi-patch network for image dehazing

Legal Events

Date Code Title Description
PB01 Publication
SE01 Entry into force of request for substantive examination
GR01 Patent grant