CN111027603B

CN111027603B - Image generation method for improving GAN model

Info

Publication number: CN111027603B
Application number: CN201911181133.2A
Authority: CN
Inventors: 舒军; 李振亚; 杨露; 吴柯; 蒋明威; 邓明舟; 舒心怡; 潘健; 王淑青
Original assignee: Hubei University of Technology
Current assignee: Hubei University of Technology
Priority date: 2019-11-27
Filing date: 2019-11-27
Publication date: 2022-07-05
Anticipated expiration: 2039-11-27
Also published as: CN111027603A

Abstract

The invention discloses an image generation method for improving a GAN model, which comprises the steps of firstly reading a local existing original image sample; pre-training an improved LeNet model; taking the improved LeNet model as a discrimination model of the improved GAN model; designing a generation model of an improved GAN model according to the generation model of the original GAN model; taking random noise as input of a generation model, and obtaining a generated image sample after deconvolution operation; taking the generated image sample and the existing original image sample as the input of the discrimination model to obtain the probability that the picture is true, and returning the result to the discrimination model in the step 3 and the generation model in the step 4; judging the model updating parameters to better distinguish the truth of the picture, and generating a model to generate a more real picture; and (4) the generated model and the discrimination model compete with each other and are mutually promoted, the training is finished after N steps, and finally the generated model generates a real sample. The invention not only can save cost for manufacturers, but also can provide more samples for subsequent classification problems.

Description

Image generation method for improving GAN model

Technical Field

The invention belongs to the technical field of image processing, relates to an image generation method, and particularly relates to an image generation method for improving a GAN (generic image warping) model.

Background

When the classification problem on the production line is processed, the number of samples provided by manufacturers is limited, and in order to increase the characteristics of the samples and improve the stability of the classification model, pictures need to be generated according to the existing samples.

At present, the GAN model is widely applied to image generation, but the problems of model collapse and difficult model training exist.

Disclosure of Invention

In order to solve the technical problem, the invention provides an image generation method for improving a GAN model by combining an improved LeNet model and the GAN model.

The technical scheme adopted by the invention is as follows: an image generation method for improving a GAN model is characterized by comprising the following steps:

step 1: reading a local original image sample;

step 2: pre-training an improved LeNet model;

and step 3: taking the improved LeNet model as a discrimination model of the improved GAN model;

and 4, step 4: designing a generation model of an improved GAN model according to the generation model of the original GAN model;

and 5: taking random noise as input of a generation model, and obtaining a generated image sample after deconvolution operation;

step 6: taking the generated image sample and the existing original image sample as the input of the discrimination model to obtain the probability that the picture is true, and returning the result to the discrimination model in the step 3 and the generation model in the step 4;

updating and judging model parameters by using a gradient ascending algorithm, wherein the parameters are weights of the convolution layers of each layer, distinguishing true and false of the picture, and generating a model to generate a more real picture;

and 7: and (4) the generated model and the discrimination model compete with each other and are mutually promoted, the training is finished after N steps, and finally the generated model generates a real sample.

Preferably, the improved LeNet model in step 2 is composed of 3 groups of same modules, each group of module structure includes 2 3 × 3 convolutions and one 1 × 1 convolution, each convolution needs to be subjected to normalization processing and relu activation function activation, then 3 convolution operations are subjected to residual error processing, and finally 2 × 2 pooling operation is connected.

Preferably, the pre-trained improved LeNet model in step 2 is trained by using a convolutional neural network training method to improve parameters of the LeNet model, where the parameters include weights of convolutional layers of the improved LeNet, and an obtained effect is that each weight reaches an optimal value so that the model can determine whether the picture is true or false.

Preferably, the generated model of the improved GAN model in step 4 includes 4 sets of the same modules, each set of modules includes 5 × 5 deconvolution and 1 × 1 convolution, and each deconvolution and convolution is then activated by a normalization process and a relu activation function.

Preferably, the generating model and the discriminating model in step 7 compete with each other and promote each other, and the specific implementation process is as follows: firstly, a generation model G receives random noise Z and generates a false sample; then, the generated sample and the original real sample are used as input of a discrimination model D to judge whether the sample is true or false; returning the result to G and D to enable the two models, wherein G generates a new batch of samples to deceive D, and D updates parameters to more accurately distinguish true from false; the two network models promote updating mutually until the generated sample D and the real sample have the same distribution, and G cannot distinguish the true and false of the generated sample and the real sample.

The invention provides an improved GAN by combining the improved LeNet and the GAN and provides a training method for generating pictures after classification, thereby not only saving the cost for manufacturers, but also providing more samples for subsequent classification problems.

Drawings

FIG. 1 is a flow chart of an embodiment of the present invention;

FIG. 2 is a diagram of a decision model according to an embodiment of the present invention;

FIG. 3 is a diagram of a generation model according to an embodiment of the present invention.

Detailed Description

In order to facilitate the understanding and implementation of the present invention for those of ordinary skill in the art, the present invention is further described in detail with reference to the accompanying drawings and examples, it is to be understood that the embodiments described herein are merely illustrative and explanatory of the present invention and are not restrictive thereof.

Referring to fig. 1, the image generation method for improving a GAN model provided by the present invention includes the following steps:

step 1: reading 5000 original image samples of local pictures, wherein the original image samples comprise two categories, such as crayfish: one is bads (black and damaged shrimp, etc.) and the other is goods (good lobster), with corresponding labels 0 and 1;

step 2: 5000 samples are used as input of an improved LeNet model, and model parameters are trained by a convolutional neural network training method.

The improved LeNet model of the embodiment is composed of 3 groups of same modules, each group of module structures comprises 2 3 × 3 convolutions and 1 × 1 convolution, each convolution needs to be subjected to normalization processing and relu activation function activation, then 3 convolution operations are subjected to residual error processing, and finally 2 × 2 pooling operations are connected.

The whole model inputs pictures, and the probability of judging whether the pictures are true or false is finally output after the pictures pass through the 3 groups of modules.

And step 3: the pre-trained improved LeNet model is used as a discrimination model of the improved GAN model, and the model is shown in figure 2.

The pre-training improved LeNet model of the embodiment trains parameters of the improved LeNet model by a convolutional neural network training method, wherein the parameters comprise weights of convolutional layers of the improved LeNet, and the obtained effect is that the weights reach an optimal value so that the model can judge whether a picture is true or false.

And 4, step 4: and designing a generation model of the improved GAN model according to the generation model of the original GAN model, wherein the model is shown in figure 3.

The generated model of the improved GAN model of the embodiment includes 4 sets of the same modules, each set of the modules includes 5 × 5 deconvolution and 1 × 1 convolution, and each deconvolution and convolution is then subjected to normalization processing and relu activation function activation.

The whole model is input with random noise, then full connection operation is carried out, activation is carried out through BN (normalization processing) and relu activation function, and then the picture is output through the operation of the 4 modules.

And 5: and (4) taking random noise as an input of a generation model, and performing operations such as deconvolution to obtain a generated image sample.

in the embodiment, a gradient ascent algorithm is used for updating and distinguishing model parameters, wherein the parameters are weights of convolution layers of each layer, the truth of a picture is distinguished, and a model is generated to generate a more real picture;

and 7: the two models compete with each other and promote each other, the training 20000 steps are finished, and finally the generated model can generate a real sample.

In this embodiment, the generation model and the discrimination model compete with each other and promote each other, and the specific implementation process is as follows: firstly, a generation model G receives random noise Z and generates a false sample; then, the generated sample and the original real sample are used as input of a discrimination model D to judge whether the sample is true or false; returning the result to G and D to enable the two models, wherein G generates a new batch of samples to deceive D, and D updates parameters to more accurately distinguish true from false; the two network models promote updating mutually until the generated sample D and the real sample have the same distribution, and G cannot distinguish the true and false of the generated sample and the real sample.

It should be understood that the above description of the preferred embodiments is given for clarity and not for any purpose of limitation, and that various changes, substitutions and alterations can be made herein without departing from the spirit and scope of the invention as defined by the appended claims.

Claims

1. An image generation method for improving a GAN model is characterized by comprising the following steps:

step 1: reading a local original image sample;

step 2: pre-training an improved LeNet model;

the improved LeNet model consists of 3 groups of same modules, each group of module structure comprises 2 3 × 3 convolutions and 1 × 1 convolution, each convolution needs to be subjected to normalization processing and relu activation function activation, then 3 convolution operations are subjected to residual error processing, and finally 2 × 2 pooling operation is connected;

the generation model of the improved GAN model comprises 4 groups of same modules, wherein each group of modules comprises 5 × 5 deconvolution and 1 × 1 convolution, and each deconvolution and convolution is subjected to normalization processing and relu activation function activation;

step 6: taking the generated image sample and the existing original image sample as input of a discrimination model to obtain the probability that the picture is true, and returning the result to the discrimination model in the step 3 and the generation model in the step 4;

and 7: the generated model and the discrimination model compete with each other and are mutually promoted, training is finished after N steps, and finally the generated model generates a real sample;

the generation model and the discrimination model compete with each other and promote each other, and the specific implementation process is as follows: firstly, a generation model G receives random noise Z and generates a false sample; then, the generated sample and the original real sample are used as input of a discrimination model D to judge whether the sample is true or false; returning the result to G and D to enable the two models, wherein G generates a new batch of samples to deceive D, and D updates parameters to more accurately distinguish true from false; the two network models promote updating mutually until the generated sample D and the real sample have the same distribution, and G cannot distinguish the true and false of the generated sample and the real sample.

2. The method of generating an image of an improved GAN model as claimed in claim 1, wherein: the pre-training improved LeNet model in the step 2 is specifically realized by the following steps: training parameters of an improved LeNet model by a convolutional neural network training method, wherein the parameters comprise the weight of each convolutional layer of the improved LeNet, and the obtained effect is that each weight reaches an optimal value so that the model can judge whether the picture is true or false.