CN110827213B

CN110827213B - Super-resolution image restoration method based on generation type countermeasure network

Info

Publication number: CN110827213B
Application number: CN201910963790.6A
Authority: CN
Inventors: 李云红; 穆兴; 汤汶; 朱绵云; 罗雪敏; 姚兰; 刘畅; 喻晓航
Original assignee: Xian Polytechnic University
Current assignee: Xian Polytechnic University
Priority date: 2019-10-11
Filing date: 2019-10-11
Publication date: 2023-03-31
Anticipated expiration: 2039-10-11
Also published as: CN110827213A

Abstract

The invention discloses a super-resolution image restoration method based on a generative countermeasure network, which specifically comprises the following steps: step 1, collecting and sorting real image data, processing the collected images into images with the same size, and forming a training data set by the processed images with the same size; step 2, constructing a generating type confrontation network model which comprises a generator and a discriminator; step 3, printing masks with the same size on each image in the training data set to obtain an image to be repaired, importing the image to be repaired into the generative confrontation network model, and training the generative confrontation network model by adopting a weak supervision learning mode to obtain a trained image; and 4, inputting the training image into the U-Net generation network to repair the trained image to be repaired, and outputting a repaired high-resolution image. The problems of low resolution and inconsistent visual effect after large-area image deletion repair in the prior art are solved.

Description

Super-resolution image restoration method based on generation type countermeasure network

Technical Field

The invention belongs to the technical field of computer vision and image processing, and particularly relates to a super-resolution image restoration method based on a generative confrontation network.

Background

In recent years, computer graphics and computer vision have been rapidly developed, and image restoration methods are diversified, and image restoration techniques have appeared early in the period of renaturation of art, which is a long-history technique, and artworks damaged by improper preservation or other different reasons often appear early, and craftsmen have done some filling work to maintain the integrity of artworks through the skill they master. With the rapid development of computer vision technology, digital image processing technology gradually replaces manual image restoration technology.

At present, image restoration is an important content in the field of computer vision, and aims to automatically restore lost content according to known content in an image, and the method has wide application value in the fields of image editing, movie and television special effect production, virtual reality, digital cultural heritage protection and the like. The traditional image restoration method is based on a diffusion method, for example, smooth prior is introduced through a parameter model or a partial differential equation, the local structure to be restored is spread from outside to inside or diffused, the texture is simple, the image restoration effect with small damaged area is good, but the complex image restoration is easy to generate the phenomena of blurring and artifacts, and the resolution is low after the image restoration; the image restoration method based on the variational partial differential equation algorithm is mainly started from pixel points, effective information is found out according to the damaged edge structure of an image, so that the expansion content and the direction are determined, the diffusion of the edge information of a to-be-restored area to the inside is ensured, and the purpose of image restoration is achieved; when selecting a pixel block in a to-be-repaired area, the texture structure-based image repairing method mainly takes a central point of the pixel block as an object, searches and compares the pixel block in a good area, and fills the compared optimal pixel block into the to-be-repaired area, so that the detailed repairing effect on image textures is good.

Disclosure of Invention

The invention aims to provide a super-resolution image restoration method based on a generative confrontation network, which solves the problems of low resolution and inconsistent visual effect after large-area image deletion restoration in the prior art.

The technical scheme adopted by the invention is that the super-resolution image restoration method based on the generative countermeasure network is implemented according to the following steps:

step 1, collecting and sorting real image data to form a real image data sample set, processing the collected images into images with the same size, and forming a training data set by the processed images with the same size;

step 2, constructing a generative confrontation network model, wherein the generative confrontation network model comprises a generator and a discriminator;

step 3, printing masks with the same size on each low-resolution image in the training data set formed in the step 1 to obtain an image to be repaired, importing the image to be repaired into the generative confrontation network model constructed in the step 2, training the generative confrontation network model by adopting a weak supervision learning mode to obtain a trained image, updating the weights of the generator and the VDB discriminator in the training process, and calculating system loss by utilizing confrontation loss, content loss, perception loss and reconstruction loss;

and 4, inputting the training image obtained in the step 3 into a U-Net generation network to repair the trained image to be repaired, and outputting a repaired high-resolution image.

The invention is also characterized in that:

in step 2, the objective function expression of the generative confrontation network model is:

wherein D represents a discriminator, G represents a generator, and E represents a mathematical expectation; x to p (x) represent the image distribution p (x) in the x obeying real image data sample set; z to G (z) represent a generation distribution G (z) of a z-obeyed image, x is real image data, z is random noise, D (x) represents a discriminant function, and D (z | x) represents a discriminant function of the random noise z under the condition of the real image data x.

In step 2, the pair of the variational arbiter bottleneck VDB to the arbiter objective function is:

J(D,E)＝min _D,E E _x～p(x) [E _z～E(z|x) [-log(D(x))]]+E _z～G(z) [E _z～E(z|x) [-log(1-D(z))]] (2)

wherein D represents a discriminator, E represents an expectation of encoded image information, and x to p ^* (x) Representing x obedience generating image data distribution p ^* (x) (ii) a z-E (z | x) represents z obeying Gaussian distribution E (z | x), KL is a method of divergence, also called relative entropy information probability distribution, r (z) is the characteristic distribution of random noise z, and I _c To generate mutual information of the image data by the generator, D (z) represents a discriminant function of the random noise z, and equation (3) is a constraint condition of equation (2).

By

Introducing a discriminator Lagrange optimization objective function:

wherein, beta is Lagrange multiplier, formula (5) is constraint condition of formula (4), and mutual information I _c The correlation between variables is expressed for limiting the correlation of the real image data x and the random noise z.

In step 2, the VDB discriminator is a binary classifier, and judges whether the image information is true or false by the correlation of the image information generated by the generator

Limiting mutual information of image data samples for limiting conditions, obfuscating true and false image information, and updating parameters of a VDB discriminator as follows:

wherein, I _r And (d) representing mutual characteristic information of the real image data, wherein r (x) is the characteristic distribution of the real image data.

In step 2, the generator adopts a U-Net generation network, the U-Net generation network is composed of a contraction path and an expansion path, the contraction path and the expansion path are symmetrical to each other, the U-Net generation network totally comprises 23 convolution layers, a maximum pooling layer of 2*2 is formed after every two convolution layers of 3*3 on the contraction path, a relu activation function is used for performing down-sampling operation on images in the training data set after each convolution layer, the number of channels is increased for each down-sampling operation to obtain context information of a real image, two convolution layers of 3*3 are formed after each convolution layer of 2*2 on the expansion path, a relu activation function is used for performing up-sampling operation on the images in the training data set after each convolution layer, a characteristic diagram from the corresponding contraction path is added in each up-sampling operation, and the shapes are kept the same after being cut.

In step 2, the discriminator adopts a VDB discrimination network which maps the real image data transmitted from the generator to the encoder onto gaussian distribution, classifies the images generated by the generator, and acts information bottleneck on random noise z.

In step 3, updating parameters of the generative confrontation network model by using a random gradient descent method, specifically:

for a set of real image data samples { x _i ,y _j Its loss function is:

wherein, y _j Is x for e {0,1} _i The label of (1);

let the training data set be

Image sample->

The input is obtained corresponding weak supervision output

The structured risk function of the image samples on the training dataset M is therefore:

wherein W and b represent all weights and bias vectors in the network, respectively;

the regularization term is used for preventing overfitting during training of the image restoration model; λ is a hyper-parameter which is a positive number and is used for controlling the weight, the larger λ is, the closer W is to '0', N represents the nth image sample, N represents the total number of samples of the training data set, l represents the number of layers, and m represents the number of neurons;

then the weight W of the l-th layer in the iterative operation of the gradient descent algorithm ^(l) And an offset vector b ^(l) The updating method comprises the following steps:

where α is the learning rate, where the optimum learning rate α is 0.001.

In step 3, the generator countermeasure loss of the generator countermeasure network model is:

L _G ＝-E _x～p(x) [log(1-D(z|x))]-E _x～G(x) [logD(z|y)] (13)

wherein D (z | y) represents a discriminant function distribution of random noise under a condition of real image data output;

the 1-norm distance is used to measure the difference between the undamaged area of the generator generated image and the undamaged area in the real image data:

the content loss is:

wherein, I represents the image loss area and the mutual information of the pixel points;

the perceptual loss is:

wherein, C _j H _j W _j For normalization processing on the image path, [ phi ] _j Pixel points in the real image data;

the reconstruction loss is:

L _pixel ＝||x _i -G(x _i )|| ₂ (16)

the system loss function is:

L＝L _G +L _ij +L _percep +L _pixel (17)。

the specific steps of step 4 are as follows:

step 4.1, inputting the trained image obtained in the step 3 into an encoder, and inputting random noise z into a generator;

step 4.2, a generator G (z) generates an image according to the input random noise z and the related information of the real image data in the step 1, and a U-Net generation network generates a repaired high-resolution image according to a system loss function in the image generation process;

step 4.3, the real image data input to the encoder is mapped to Gaussian distribution to obtain a real image mapping relation, meanwhile, the generator generates the mapping relation of the high-resolution image repaired in the step 4.2, then the real image mapping relation and the mapping relation of the repaired high-resolution image are input into a VDB (visual desktop bus) discrimination network, and the VDB network applies the repaired high-resolution image information bottleneck to a discriminator to judge;

and 4.4, outputting the repaired high-resolution image by the discriminator, returning to the step 4.2 if the repaired high-resolution image does not meet the mapping relation of the repaired high-resolution image, otherwise, outputting the repaired high-resolution image, and setting the iteration number of repairing the high-resolution image according to the visual effect of the repaired high-resolution image.

The beneficial effects of the invention are:

(1) The super-resolution image restoration method based on the generative confrontation network trains the generative confrontation network model in a weak supervision learning mode, does not need excessive labels from the mapping of a real image data sample to a feature space, can sequence the mapping relation between the real image data and the generated image, has low training cost, and is beneficial to training the required trained image;

(2) The invention relates to a super-resolution image restoration method based on a generating type countermeasure network.A generator adopts a U-Net network, the network is a full convolution neural network, the input and the output of the network are images, a full connection layer structure is not provided, the problem of pixel positioning is solved by a shallower layer, the restoration of damaged pixels of the images is realized by a deeper layer, and the high-resolution restoration and reconstruction of the images are realized;

(3) The invention relates to a super-resolution image restoration method based on a generative confrontation network.A discriminator utilizes a variable score discriminator bottleneck (VDB) to weaken the capability of the discriminator by limiting mutual information, thereby balancing the training of a generative confrontation network model, meeting the thought of zero sum game of the generative confrontation network, improving the difficulty of the discriminator in distinguishing true and false, preventing the generative confrontation network model from being over-learned in the training, simultaneously playing a role in reducing dimension on image characteristic information, optimizing the restoration performance of the generative confrontation network model, and ensuring that a shallow network realizes high-resolution image restoration;

(4) The super-resolution image restoration method based on the generative confrontation network adopts a random gradient algorithm to update a generative confrontation network model, introduces a hyper-parameter control weight, accelerates the training speed of the generative confrontation network model, and enables the generative confrontation network model to be converged;

(5) The super-resolution image restoration method based on the generative confrontation network adopts the confrontation loss, the content loss, the perception loss and the reconstruction loss to calculate the system loss, so that the restored image can reach high resolution, and introduces a regular term to prevent the generative confrontation network model from training and trying to fit, thereby improving the stability of the generative confrontation network model.

Drawings

FIG. 1 is a schematic block diagram of a super-resolution image modification method based on a generative confrontation network according to the present invention;

FIG. 2 is a diagram of a generator network structure of a super-resolution image restoration method based on a generator countermeasure network according to the present invention;

FIG. 3 is a diagram of an example of a super-resolution image restoration method based on a generative countermeasure network according to FIG. 1;

fig. 4 is a diagram 2 of an example of high-resolution image restoration based on the super-resolution image restoration method of the generative countermeasure network according to the present invention.

Detailed Description

The present invention will be described in detail below with reference to the accompanying drawings and specific embodiments.

The invention discloses a super-resolution image restoration method based on a generative countermeasure network, which is implemented according to the following steps as shown in figure 1:

step 1, collecting and sorting real image data to form a real image data sample set, processing the collected real images into images with the same size, and forming a training data set by the processed images with the same size; wherein, the real image data comprises an ancient fabric image damaged naturally;

step 2, constructing a generative confrontation network (GAN) model, wherein the generative confrontation network model comprises a generator and a discriminator;

the method for constructing the generative confrontation network model specifically comprises the following steps:

suppose the true image data sample set is { x } _i ,y _j At this time x _i For image data samples, y _j And if the label is a corresponding weak label, the objective function expression of the generated countermeasure network model is as follows:

wherein D represents a discriminator, G represents a generator, and E represents a mathematical expectation; x to p (x) represent the image distribution p (x) in the x obeying real image data sample set; z to G (z) represent a generation distribution G (z) of z-compliant images, x is real image data, z is random noise, D (x) represents a discriminant function, and D (z | x) represents a discriminant function of random noise z under the condition of the real image data x;

the Variational Discriminator Bottleneck (VDB) to the discriminator objective function in the generative confrontation network model is:

wherein D represents the discriminator, E represents the expectation of the encoded image information, i.e. the mathematical expectation, x-p ^* (x) Representing x obedience generating image data distribution p ^* (x) (ii) a z-E (z | x) represents z obeying Gaussian distribution E (z | x), KL is a method of divergence, also called relative entropy information probability distribution, r (z) is the characteristic distribution of random noise z, and I _c Generating mutual information of the image data for the generator, wherein D (z) represents a discriminant function of random noise z, and formula (3) is a constraint condition of formula (2);

by

Introducing a discriminator Lagrange optimization objective function:

wherein, beta is Lagrange multiplier, formula (5) is constraint condition of formula (4), and mutual information I _c Representing the correlation between variables for limiting the correlation of real image data x and random noise z;

the VDB discriminator is a binary discriminator for discriminating between true and false by generating the correlation of image information by the generator

Limiting mutual information of image data samples as a limiting condition, mixing true and false image information, improving the distinguishing difficulty of a discriminator, balancing a system model, and updating parameters of a VDB discriminator as follows:

wherein, I _r Representing the characteristic mutual information of the real image data, wherein r (x) is the characteristic distribution of the real image data;

as shown in fig. 2, the generator adopts U-Net to generate a network, the U-Net generating network is composed of a contraction path and an expansion path, and the contraction path and the expansion path are symmetrical to each other, the U-Net generating network totally includes 23 layers of convolution layers, after every two convolution layers of 3*3 on the contraction path, a maximum pooling layer of 2*2 is formed, and after each convolution layer, a relu activation function is used to perform down-sampling operation on an image original image in a training data set, wherein each down-sampling operation increases one cup of channel number for obtaining context information of a real image, after each convolution layer of 2*2 on the expansion path, two convolution layers of 3*3 are formed, and after each convolution layer, a relu activation function is used to perform up-sampling operation on an image in the training data set, wherein each up-sampling operation adds a feature map from the corresponding contraction path, performs clipping on a boundary portion of a lost pixel, and after that the image in the training data set and the contraction path keep the same shape as the feature map;

the discriminator adopts a VDB discrimination network which maps real image data transmitted to the encoder by the generator to Gaussian distribution, classifies images generated by the generator and acts information bottleneck on random noise z.

Step 3, printing masks with the same size on each low-resolution (LR) image in the training data set formed in the step 1 to obtain an image to be repaired, importing the image to be repaired into the generative confrontation network model constructed in the step 2, training the generative confrontation network model by adopting a weak supervision learning mode to obtain a trained image, updating the weights of the generator and the VDB discriminator in the training process, and calculating system loss by utilizing confrontation loss, content loss, perception loss and reconstruction loss;

the method comprises the following steps of updating the weights of a generative confrontation network model generator and a VDB (visual data base) discriminator by adopting a random gradient descent method, and specifically comprises the following steps:

for a set of real image data samples { x _i ,y _j Its loss function is:

wherein, y _j E {0,1} is x _i The label of (1);

let the training data set be

Image sample->

The input is obtained as corresponding weak supervision output

So that the image sample->

The structured risk function on the training dataset M is:

wherein, W and b respectively represent all weights and offset vectors in the network;

wherein α is a learning rate, wherein the optimum learning rate α is 0.001;

the process of calculating the system loss by using the confrontation loss, the content loss, the perception loss and the reconstruction loss specifically comprises the following steps:

the generator confrontation loss of the generator confrontation network model is as follows:

L _G ＝-E _x～p(x) [log(1-D(z|x))]-E _x～G(x) [logD(z|y)] (13)

wherein D (z | y) represents discriminant function distribution of random noise under a condition of real image data output;

in the image restoration problem, in order to match the restored damaged area of the image with the real image as much as possible, make the undamaged area clearer, and keep the high resolution of the restored image, the 1-norm distance is used to measure the difference between the undamaged area of the generator generated image and the undamaged area in the real image data:

the content loss is:

the perceptual loss is:

the reconstruction loss is:

L _pixel ＝||x _i -G(x _i )|| ₂ (16)

the system loss function is:

L＝L _G +L _ij +L _percep +L _pixel (17)；

step 4, inputting the training image obtained in the step 3 into a U-Net generation network to repair the image to be repaired after training, and outputting a repaired high-resolution image, wherein the specific process is as follows:

step 4.2, a generator G (z) generates an image according to the input random noise z and the related information (the upper, lower, left and right pixel information of the image missing region and the characteristic distribution information of the pixels) of the real image data in the step 1, and a U-Net generation network generates a repaired high-resolution (HR) image according to a system loss function in the image generation process;

Fig. 3 is a diagram of an example of high-resolution image restoration by using the image restoration method of the present invention, where fig. 3 is a first column, a second column, and a third column from left to right, respectively, where the first column of pictures is a low-resolution (LR) image in a training data set, the second column is a to-be-restored image obtained by masking the first column of images with the same size, and the third column is a restored high-resolution image.

Fig. 4 is a diagram 2 of an example of high-resolution image restoration by using the image restoration method of the present invention, where fig. 4 is a first column, a second column and a third column from left to right, respectively, where the first column is an ancient textile image damaged naturally, the second column is a mask with the same size as the first column image, and an image to be restored is obtained, and the third column is a restored high-resolution image.

By adopting the mode, the generative confrontation network model is trained in a weak supervision learning mode, so that the training cost is reduced, and the convergence speed of the generative confrontation network model is improved; the high-resolution (HR) image is generated through the U-Net generation network, high-resolution restoration of a damaged image (a masked image, namely an image to be restored) is achieved, the network weight and the random gradient descent algorithm are balanced by the VDB discriminator through the whole image generation type confrontation network model to update parameters, complex iterative operation is avoided, and a good visual perception effect of high-resolution restoration of the damaged image is achieved.

Claims

1. A super-resolution image restoration method based on a generative confrontation network is characterized by comprising the following steps:

step 3, printing a mask with the same size on each low-resolution image in the training data set formed in the step 1 to obtain an image to be repaired, importing the image to be repaired into the generative confrontation network model constructed in the step 2, training the generative confrontation network model by adopting a weak supervision learning mode to obtain a trained image, updating the weights of a generator and a VDB (visual desktop manager) discriminator in the training process, and calculating system loss by utilizing confrontation loss, content loss, perception loss and reconstruction loss;

2. The super-resolution image restoration method based on the generative confrontation network as claimed in claim 1, wherein in step 2, the objective function expression of the generative confrontation network model is:

wherein D represents a discriminator, G represents a generator, and E represents a mathematical expectation; x to p (x) represent the image distribution p (x) in the x obeying real image data sample set; z to G (z) denote a generation distribution G (z) of z-compliant images, x is real image data, z is random noise, D (x) denotes a discriminant function, and D (z | x) denotes a discriminant function of the random noise z under the condition of the real image data x.

3. The super-resolution image restoration method based on the generative countermeasure network as claimed in claim 1, wherein in step 2, the pair of classifier objective functions of the variational classifier bottleneck VDB is:

wherein D represents a discriminator, E represents an expectation of encoded image information, and x to p ^* (x) Representing x obedience generating image data distribution p ^* (x) (ii) a z-E (z | x) represents z obeying Gaussian distribution E (z | x), KL is a method of divergence, also called relative entropy information probability distribution, r (z) is the characteristic distribution of random noise z, and I _c For the life of a living beingThe generator generates mutual information of the image data, D (z) represents a discriminant function of random noise z, and formula (3) is a constraint condition of formula (2).

4. The super-resolution image restoration method based on the generative countermeasure network as claimed in claim 3, wherein the super-resolution image restoration method is implemented by

Introducing a discriminator Lagrange optimization objective function:

wherein beta is Lagrange multiplier, formula (5) is constraint condition of formula (4), and mutual information I _c The correlation between variables is expressed for limiting the correlation of the real image data x and the random noise z.

5. The super-resolution image restoration method based on the generative countermeasure network as claimed in claim 1, wherein in step 2, the VDB discriminator is a binary discriminator, and the correlation between the image information generated by the generator is used to discriminate between true and false

6. The super-resolution image restoration method based on the generative countermeasure network as claimed in claim 1, wherein in step 2, the generator employs U-Net to generate the network, the U-Net generates the network, the network is composed of a contraction path and an expansion path, and the contraction path and the expansion path are symmetrical to each other, the U-Net generates the network totally includes 23 layers of convolution layers, each two convolution layers of 3*3 on the contraction path are followed by a maximum pooling layer of 2*2, and each convolution layer is followed by a relu activation function to perform down-sampling operation on the images in the training data set, wherein each down-sampling operation increases one cup channel number for obtaining context information of a real image, each convolution layer of 2*2 on the expansion path is followed by two convolution layers of 3*3, and each convolution layer is followed by a relu activation function to perform up-sampling operation on the images in the training data set, wherein each up-sampling operation adds a feature map from the corresponding contraction path, and keeps the same shape after each up-sampling operation.

7. The super-resolution image restoration method based on the generative countermeasure network as claimed in claim 1, wherein in step 2, the discriminator employs a VDB discrimination network, which maps the real image data transmitted from the generator to the encoder onto gaussian distribution, classifies the images generated by the generator, and applies information bottleneck to random noise z.

8. The super-resolution image restoration method based on the generative confrontation network according to claim 1, wherein in step 3, the parameters of the generative confrontation network model are updated by a stochastic gradient descent method, specifically:

for real image dataSample set { x _i ,y _j Its loss function is:

wherein, y _j E {0,1} is x _i The label of (1);

let the training data set be

Image sample->

The input gets a corresponding weakly supervised output->

where α is the learning rate, where the optimum learning rate α is 0.001.

9. The super-resolution image restoration method based on the generative confrontation network as claimed in claim 1, wherein in step 3, the generative confrontation loss of the generative confrontation network model is:

L _G ＝-E _x～p(x) [log(1-D(z|x))]-E _x～G(x) [logD(z|y)] (13)

the content loss is:

the perceptual loss is:

the reconstruction loss is:

L _pixel ＝||x _i -G(x _i )|| ₂ (16)

the system loss function is:

L＝L _G +L _ij +L _percep +L _pixel (17)。

10. the super-resolution image restoration method based on the generative countermeasure network according to claim 1, wherein the specific steps of step 4 are as follows: