CN111192206A - Method for improving image definition

Info

Publication number
CN111192206A
Authority
CN
China
Prior art keywords
image
loss function
network
generator
input
Prior art date
Legal status (The legal status is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the status listed.)
Pending
Application number
CN201911217831.3A
Other languages
Chinese (zh)
Inventor
王敏
范晓烨
付昱承
Current Assignee (The listed assignees may be inaccurate. Google has not performed a legal analysis and makes no representation or warranty as to the accuracy of the list.)
Hohai University HHU
Original Assignee
Hohai University HHU
Priority date (The priority date is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the date listed.)
Filing date
Publication date
Application filed by Hohai University HHU filed Critical Hohai University HHU
Priority to CN201911217831.3A
Publication of CN111192206A
Legal status: Pending (Current)

Classifications

    • G06T5/70
    • GPHYSICS
    • G06COMPUTING; CALCULATING OR COUNTING
    • G06NCOMPUTING ARRANGEMENTS BASED ON SPECIFIC COMPUTATIONAL MODELS
    • G06N3/00Computing arrangements based on biological models
    • G06N3/02Neural networks
    • G06N3/04Architecture, e.g. interconnection topology
    • G06N3/045Combinations of networks
    • GPHYSICS
    • G06COMPUTING; CALCULATING OR COUNTING
    • G06NCOMPUTING ARRANGEMENTS BASED ON SPECIFIC COMPUTATIONAL MODELS
    • G06N3/00Computing arrangements based on biological models
    • G06N3/02Neural networks
    • G06N3/08Learning methods
    • G06N3/084Backpropagation, e.g. using gradient descent
    • GPHYSICS
    • G06COMPUTING; CALCULATING OR COUNTING
    • G06TIMAGE DATA PROCESSING OR GENERATION, IN GENERAL
    • G06T2207/00Indexing scheme for image analysis or image enhancement
    • G06T2207/10Image acquisition modality
    • G06T2207/10004Still image; Photographic image

Abstract

The invention discloses a method for improving image definition, comprising the following steps: collecting feature information from an input image with a generator network, and generating an image from the collected feature information; feeding the generated image and the real image separately into a discriminator network, which judges whether the generated image is real and extracts feature information from both the generated image and the real image; calculating the loss functions from the extracted feature information and continuously updating the network parameters with the Adam algorithm until they reach their optimal values; and, once training is complete, generating a clear image. The method can effectively improve image definition and has very high application value.

Description

Method for improving image definition
Technical Field
The invention belongs to the fields of computer vision and deep learning, touches on several disciplines such as computer vision, digital image processing, artificial intelligence, and computer science, and particularly relates to a method for improving image definition.
Background
With the development of imaging technology, people's requirements for image definition keep rising, and high-definition equipment has already been applied on a large scale in many aspects of daily life; for example, a variety of ultra-high-definition devices were showcased at the celebration of China's seventieth anniversary. Improving image definition is therefore particularly important: it not only gives people a better sensory experience but also has broad application space in many research fields.
Many factors lower image definition, including atmospheric conditions, illumination, and human factors, as well as the shooting mode, image enlargement, aging equipment, and physical damage, so improving image definition has become an urgent need. General surveillance covers large-scale scenes, such as pedestrian flow in a square or traffic flow on a highway, while face recognition, license plate recognition, and the like require high-definition images. Target recognition places even higher demands on image clarity, since text, license plates, characters, and markings in the image must be presented accurately. Detailed feature recognition is mainly used in special settings such as bank counters, ATMs, and casinos, where finer features must be obtained on top of object recognition. Improving image definition is therefore significant in many respects.
However, the prior art offers no effective method for further improving image definition, so a new technical solution is urgently needed to solve this problem.
Disclosure of Invention
Purpose of the invention: to overcome the defects in the prior art, the present invention provides a method for improving image definition that can effectively improve image sharpness.
The technical scheme is as follows: in order to achieve the above object, the present invention provides a method for improving image sharpness, comprising the steps of:
S1: collecting feature information from an input image with a generator network, and generating an image from the collected feature information with the generator network;
S2: feeding the image generated by the generator network and the real image separately into a discriminator network, using the discriminator network to judge whether the generated image is real, and extracting feature information from both the generated image and the real image;
S3: calculating the loss functions from the feature information extracted in step S2, and continuously updating the network parameters with the Adam algorithm until they reach their optimal values;
S4: completing training and generating a clear image.
Further, in step S1, before the generator network collects feature information, the input image undergoes enhancement preprocessing; this preprocessing does not change the image pixel information and includes operations such as cropping and flipping.
Further, the enhancement preprocessing is specifically: mapping the value of each pixel of the image from 0-255 to 0-1, randomly cropping a 24x24 patch and feeding it into the model, and randomly flipping it horizontally or vertically; if the input image is flipped, the corresponding real high-definition image is flipped in the same way during training.
Further, step S1 is specifically: in the generator network, several residual network blocks extract features from the input image stage by stage; except for the first residual block, the input of each residual block is the pixel-level sum of the previous block's input and output; finally, the input of the first residual block and the output of the last residual block are summed at the pixel level, and dimension reduction yields an image with the same number of channels as the input image, which serves as the output of the whole generator network.
Further, the pixel-level summation in step S1 is implemented with a residual network and skip connections: feature maps passed from the shallow layers into the deep layers of the network are stacked with the deep-layer feature maps, and the subsequent convolutional layers weight the influence of the shallow and deep features on the final prediction.
Further, the judgment procedure of the discriminator network in step S2 is: after several convolution-activation-regularization operations, features are extracted from the image generated by the generator network and from the real image; a fully connected layer and a Sigmoid activation function then judge whether the generated image is real or fake, and the result is used in the discriminator loss function.
Further, the loss function in step S3 includes a generator loss function and a discriminator loss function. The discriminator loss function is calculated as

$l_{dis} = -[\log D(I^T) + \log(1 - D(G(I^L)))]$

where $l_{dis}$ denotes the discriminator loss function, $I^T$ denotes the true high-definition image, $I^L$ denotes the blurred image input to the generator network, and $G(I^L)$ denotes the image generated by the generator network. The discriminator should make $D(I^T)$ as large as possible, because its output for a real image should be judged true, and should make $D(G(I^L))$ as small as possible, because its output for a generated image should be judged false. These two objectives have opposite growth tendencies and cannot be optimized directly, so the quantity

$\log D(I^T) + \log(1 - D(G(I^L)))$

is computed instead; both terms now share the same optimization objective and can be optimized together. Since this combined expression is to be maximized while program implementations minimize, its negative,

$l_{dis} = -[\log D(I^T) + \log(1 - D(G(I^L)))]$,

is taken as the discriminator loss function;
the generator loss function comprises two parts, a content loss function and an adversarial loss function:
the content loss function is calculated as

$l_{con} = \frac{1}{WH} \sum_{x=1}^{W} \sum_{y=1}^{H} \left| I^T_{x,y} - G(I^L)_{x,y} \right|$

where $l_{con}$ denotes the content loss function, measured by the L1 norm; $W$ and $H$ denote the width and height of the image; $I^T_{x,y}$ denotes each pixel of the true high-definition image; $I^L$ denotes the blurred image input to the generator network; and $G(I^L)_{x,y}$ denotes each pixel of the image produced by the generator network through feature extraction and dimension reduction. Optimizing the L1 loss between the generated image and the real high-definition image seeks the minimum Manhattan distance as the optimal solution;
the adversarial loss function is calculated as

$l_{adv} = -\log D(G(I^L))$

where $l_{adv}$ denotes the adversarial loss function and $D(G(I^L))$ denotes the discriminator's output for the generated image fed into it. The logarithm appears in all of the above formulas to reduce the influence of one-sided effects and fluctuations in the data distribution, which avoids many numerical problems in the actual program implementation.
Further, the Adam algorithm in step S3 introduces second-order gradient correction, applies learning-rate decay partway through training, and continuously updates the parameters toward the optimum by computing the derivatives of the loss with respect to the trainable parameters in the specified computation graph.
Beneficial effects: compared with the prior art, the present invention uses a GAN framework in which the generator network takes the blurred image as input, with the real high-definition image serving as the training reference, and performs pixel-level mapping through a residual network with skip connections, stacking the feature maps passed from the shallow network into the deep network with the deep feature maps; finally, the Adam optimization algorithm computes the gradients of the loss functions by back-propagation and continuously updates the parameters. The method can effectively improve image definition and has good application prospects.
Drawings
FIG. 1 is a flow chart of the method of the present invention;
FIG. 2 is a flow chart of an image enhancement pre-processing method;
FIG. 3 is a generator workflow diagram;
FIG. 4 is a flow chart of the discriminator operation;
FIG. 5 is a flow chart of the model loss function of the present invention.
Detailed Description
The invention is further elucidated with reference to the drawings and the embodiments.
As shown in fig. 1, the present invention provides a method for improving image sharpness, comprising the following steps:
step 1: inputting an image, and performing an enhancement preprocessing operation on the image, wherein the operation is performed under the condition that image pixel information is not changed, and the operation comprises the following steps: cutting, turning over and the like.
Step 2: a generator network collects feature information from the input image; the collected feature information includes both low-level and high-level feature information.
Step 3: the generator network generates an image from the collected feature information.
Step 4: the image generated by the generator network and the real high-definition image are fed separately into a discriminator network, which judges whether the generated image is real and extracts feature information from both the generated image and the real image.
Step 5: the loss functions are calculated from the extracted feature information, and the network parameters are continuously updated with the Adam algorithm until they reach their optimal values.
Step 6: training is completed and a clear image is generated.
As shown in fig. 2, the enhancement preprocessing in step 1 of this embodiment is specifically: the value of each pixel of the input image is mapped from 0-255 to 0-1; a 24x24 patch is randomly cropped and fed into the image-definition-improving generative model; the patch is randomly flipped horizontally or vertically; and if the input image is flipped, the corresponding real high-definition image must be flipped in the same way during training. A sketch of this step is given below.
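A minimal NumPy sketch of this preprocessing, assuming paired blurred/sharp arrays of the same size; the function name and interface are illustrative, not from the patent:

```python
import numpy as np

def preprocess_pair(blurred, sharp, crop=24, rng=None):
    """Normalize a blurred/sharp pair to [0, 1], take a random crop,
    and apply identical random flips to both images."""
    rng = rng or np.random.default_rng()
    # Map each pixel value from 0-255 to 0-1.
    blurred = blurred.astype(np.float32) / 255.0
    sharp = sharp.astype(np.float32) / 255.0
    # Random 24x24 crop, using the same window for both images.
    h, w = blurred.shape[:2]
    top = rng.integers(0, h - crop + 1)
    left = rng.integers(0, w - crop + 1)
    blurred = blurred[top:top + crop, left:left + crop]
    sharp = sharp[top:top + crop, left:left + crop]
    # Random horizontal / vertical flip; the real high-definition
    # image is always flipped the same way as the input.
    if rng.random() < 0.5:
        blurred, sharp = blurred[:, ::-1], sharp[:, ::-1]
    if rng.random() < 0.5:
        blurred, sharp = blurred[::-1, :], sharp[::-1, :]
    return blurred.copy(), sharp.copy()
```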
As shown in fig. 3, the specific workflow of the generator in steps 2 and 3 of this embodiment is as follows: in the generator network, several residual network blocks extract features from the input image stage by stage; except for the first residual block, the input of each residual block is the pixel-level sum of the previous block's input and output; finally, the input of the first residual block and the output of the last residual block are summed at the pixel level, and dimension reduction yields an image with the same number of channels as the input image, which serves as the output of the whole generator network. The summation is implemented with a residual network and skip connections: feature maps passed from the shallow layers into the deep network are stacked with the deep-layer feature maps, and the convolutional layers weight the influence of the shallow and deep features on the final prediction when generating the image. A sketch of such a generator follows.
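A minimal PyTorch sketch of this structure. The number of residual blocks, the channel width, and the kernel sizes are assumptions, since the patent does not fix them:

```python
import torch
import torch.nn as nn

class ResidualBlock(nn.Module):
    def __init__(self, channels=64):
        super().__init__()
        self.body = nn.Sequential(
            nn.Conv2d(channels, channels, 3, padding=1),
            nn.ReLU(inplace=True),
            nn.Conv2d(channels, channels, 3, padding=1),
        )

    def forward(self, x):
        # Pixel-level sum of the block's input and output, so the next
        # block receives (previous input + previous output).
        return x + self.body(x)

class Generator(nn.Module):
    def __init__(self, in_channels=3, channels=64, num_blocks=8):
        super().__init__()
        self.head = nn.Conv2d(in_channels, channels, 3, padding=1)
        self.blocks = nn.Sequential(*[ResidualBlock(channels)
                                      for _ in range(num_blocks)])
        # Dimension reduction back to the input channel count.
        self.tail = nn.Conv2d(channels, in_channels, 3, padding=1)

    def forward(self, x):
        feat = self.head(x)      # input to the first residual block
        out = self.blocks(feat)  # stage-by-stage feature extraction
        # Long skip connection: the first block's input is summed
        # pixel-wise with the last block's output, then reduced.
        return self.tail(feat + out)
```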
As shown in fig. 4, the specific workflow of the discriminator in step 4 of this embodiment is as follows: in the discriminator network, the generator-produced image and the real high-definition image are passed into the discriminator, and after several convolution-activation-regularization operations their features are extracted separately. Finally, a fully connected (Dense) layer and a Sigmoid activation function judge whether the image is real or fake, and the result is used in the discriminator loss function. A sketch of such a discriminator follows.
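A minimal PyTorch sketch of this structure, assuming 24x24 three-channel inputs and hypothetical channel widths; LeakyReLU and BatchNorm stand in for the activation and regularization, which the patent does not specify:

```python
import torch
import torch.nn as nn

class Discriminator(nn.Module):
    def __init__(self, in_channels=3, base=64, image_size=24):
        super().__init__()
        layers, ch = [], in_channels
        # Repeated convolution-activation-regularization stages.
        for out_ch in (base, base * 2, base * 4):
            layers += [
                nn.Conv2d(ch, out_ch, 3, stride=2, padding=1),
                nn.LeakyReLU(0.2, inplace=True),
                nn.BatchNorm2d(out_ch),
            ]
            ch = out_ch
        self.features = nn.Sequential(*layers)
        feat_size = image_size // 8  # three stride-2 stages
        # Fully connected layer plus Sigmoid gives a real/fake score.
        self.classifier = nn.Sequential(
            nn.Flatten(),
            nn.Linear(ch * feat_size * feat_size, 1),
            nn.Sigmoid(),
        )

    def forward(self, x):
        return self.classifier(self.features(x))
```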
As shown in fig. 5, the loss function in step 5 consists of two parts, a generator loss function and a discriminator loss function:
(1) The discriminator loss function is calculated as

$l_{dis} = -[\log D(I^T) + \log(1 - D(G(I^L)))]$

where $l_{dis}$ denotes the discriminator loss function, $I^T$ denotes the true high-definition image, $I^L$ denotes the blurred image input to the generator network, and $G(I^L)$ denotes the image generated by the generator network. The discriminator should make $D(I^T)$ as large as possible, because its output for a real image should be judged true, and should make $D(G(I^L))$ as small as possible, because its output for a generated image should be judged false. These two objectives have opposite growth tendencies and cannot be optimized directly, so the quantity

$\log D(I^T) + \log(1 - D(G(I^L)))$

is computed instead; both terms now share the same optimization objective and can be optimized together. Since this combined expression is to be maximized while program implementations minimize, its negative is taken as the discriminator loss function, as in the sketch below.
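A minimal sketch of this computation, assuming d_real = D(I^T) and d_fake = D(G(I^L)) are the discriminator's Sigmoid outputs for a batch; the small epsilon guarding the logarithm is an assumption, suggested by the patent's remark on numerical problems:

```python
import torch

def discriminator_loss(d_real, d_fake, eps=1e-8):
    # l_dis = -[log D(I_T) + log(1 - D(G(I_L)))], averaged over the batch.
    return -(torch.log(d_real + eps)
             + torch.log(1.0 - d_fake + eps)).mean()
```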
(2) The generator loss function mainly comprises two parts: a content loss function and an adversarial loss function.
① The content loss function is calculated as

$l_{con} = \frac{1}{WH} \sum_{x=1}^{W} \sum_{y=1}^{H} \left| I^T_{x,y} - G(I^L)_{x,y} \right|$

where $l_{con}$ denotes the content loss function, measured by the L1 norm; $W$ and $H$ denote the width and height of the image; $I^T_{x,y}$ denotes each pixel of the true high-definition image; $I^L$ denotes the blurred image input to the generator network; and $G(I^L)_{x,y}$ denotes each pixel of the image produced by the generator network through feature extraction and dimension reduction. Optimizing the L1 loss between the generated image and the real high-definition image seeks the minimum Manhattan distance as the optimal solution; a sketch follows.
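A minimal sketch of the content loss, assuming generated and target are image tensors of the same shape (averaging over the batch and channel dimensions as well as over W and H is an implementation assumption):

```python
import torch

def content_loss(generated, target):
    # l_con = (1 / WH) * sum_{x,y} |I_T(x, y) - G(I_L)(x, y)|:
    # the mean absolute (Manhattan) distance between the generated
    # image and the real high-definition image.
    return torch.abs(target - generated).mean()
```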
② The adversarial loss function is calculated as

$l_{adv} = -\log D(G(I^L))$

where $l_{adv}$ denotes the adversarial loss function and $D(G(I^L))$ denotes the discriminator's output for the generated image fed into it. The logarithm appears in all of the above formulas to reduce the influence of one-sided effects and fluctuations in the data distribution, which avoids many numerical problems in the actual program implementation.
The content loss function and the adversarial loss function together form the generator loss function, whose minimum is continuously sought during optimization; see the sketch after this paragraph.
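A minimal sketch of the adversarial term and the combined generator loss. The relative weight adv_weight of the two terms is an assumption (the patent does not state how they are combined), as is the epsilon guarding the logarithm:

```python
import torch

def adversarial_loss(d_fake, eps=1e-8):
    # l_adv = -log D(G(I_L)): rewards the generator when the
    # discriminator scores its images as real.
    return -torch.log(d_fake + eps).mean()

def generator_loss(generated, target, d_fake, adv_weight=1e-3):
    # Generator loss = content loss (L1) + weighted adversarial loss.
    l_con = torch.abs(target - generated).mean()
    return l_con + adv_weight * adversarial_loss(d_fake)
```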
As shown in fig. 5, in step 5 the model formed by the generator loss function and the discriminator loss function is optimized with the Adam algorithm, an optimization algorithm that searches for a global optimum and introduces second-order gradient correction. The exponential decay rate of the first-moment estimate, β1, is set to 0.9, and the exponential decay rate of the second-moment estimate, β2, is set to 0.999. Training runs for 10000 iterations, with learning-rate decay applied after five thousand iterations at a decay factor of 0.1. Within each batch the generator is updated twice and the discriminator once, to prevent the discriminator from over-training. The parameters are continuously updated toward the optimum by computing the derivatives of the loss functions with respect to the trainable parameters in the given computation graph. A training-loop sketch follows.
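A hypothetical training-loop sketch wiring these pieces together with the hyperparameters of this embodiment (β1 = 0.9, β2 = 0.999, 10000 iterations, learning rate multiplied by 0.1 after 5000 iterations, two generator updates per discriminator update). The base learning rate of 1e-4 and the names generator, discriminator, and loader are assumptions; discriminator_loss and generator_loss are the sketches above:

```python
import torch

g_opt = torch.optim.Adam(generator.parameters(), lr=1e-4, betas=(0.9, 0.999))
d_opt = torch.optim.Adam(discriminator.parameters(), lr=1e-4, betas=(0.9, 0.999))
# Multiply the learning rate by 0.1 after 5000 iterations.
g_sched = torch.optim.lr_scheduler.StepLR(g_opt, step_size=5000, gamma=0.1)
d_sched = torch.optim.lr_scheduler.StepLR(d_opt, step_size=5000, gamma=0.1)

for step, (blurred, sharp) in zip(range(10000), loader):
    # One discriminator update per batch.
    d_opt.zero_grad()
    fake = generator(blurred).detach()
    d_loss = discriminator_loss(discriminator(sharp), discriminator(fake))
    d_loss.backward()
    d_opt.step()

    # Two generator updates per batch, so the discriminator
    # does not over-train relative to the generator.
    for _ in range(2):
        g_opt.zero_grad()
        fake = generator(blurred)
        g_loss = generator_loss(fake, sharp, discriminator(fake))
        g_loss.backward()
        g_opt.step()

    g_sched.step()
    d_sched.step()
```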

Claims (8)

1. A method for improving image definition, characterized by comprising the following steps:
S1: collecting feature information from an input image with a generator network, and generating an image from the collected feature information with the generator network;
S2: feeding the image generated by the generator network and the real image separately into a discriminator network, using the discriminator network to judge whether the generated image is real, and extracting feature information from both the generated image and the real image;
S3: calculating the loss functions from the feature information extracted in step S2, and continuously updating the network parameters with the Adam algorithm until they reach their optimal values;
S4: completing training and generating a clear image.
2. A method of improving image sharpness according to claim 1, wherein: in step S1, before the generator network performs feature information acquisition on the input image, the input image is subjected to enhancement preprocessing.
3. A method of improving image sharpness according to claim 2, wherein: the enhancement preprocessing is specifically: mapping the value of each pixel of the image from 0-255 to 0-1, randomly cropping a 24x24 patch and feeding it into the model, and randomly flipping it horizontally or vertically; if the input image is flipped, the corresponding real high-definition image is flipped in the same way during training.
4. A method of improving image sharpness according to claim 1, wherein: step S1 is specifically: in the generator network, several residual network blocks extract features from the input image stage by stage; except for the first residual block, the input of each residual block is the pixel-level sum of the previous block's input and output; finally, the input of the first residual block and the output of the last residual block are summed at the pixel level, and dimension reduction yields an image with the same number of channels as the input image, which serves as the output of the whole generator network.
5. A method of improving image sharpness according to claim 4, wherein: the pixel-level summation in step S1 is implemented with a residual network and skip connections: feature maps passed from the shallow layers into the deep layers of the network are stacked with the deep-layer feature maps, and the subsequent convolutional layers weight the influence of the shallow and deep features on the final prediction.
6. A method of improving image sharpness according to claim 1, wherein: the judgment procedure of the discriminator network in step S2 is: after several convolution-activation-regularization operations, features are extracted from the image generated by the generator network and from the real image, and a fully connected layer and a Sigmoid activation function judge whether the generated image is real or fake.
7. A method of improving image sharpness according to claim 1, wherein: the loss function in step S3 includes a generator loss function and a discriminator loss function; the discriminator loss function is calculated as

$l_{dis} = -[\log D(I^T) + \log(1 - D(G(I^L)))]$

where $l_{dis}$ denotes the discriminator loss function, $I^T$ denotes the real image, $I^L$ denotes the blurred image input to the generator network, and $G(I^L)$ denotes the image generated by the generator network;
the generator loss function comprises two parts, a content loss function and an adversarial loss function:
the content loss function is calculated as

$l_{con} = \frac{1}{WH} \sum_{x=1}^{W} \sum_{y=1}^{H} \left| I^T_{x,y} - G(I^L)_{x,y} \right|$

where $l_{con}$ denotes the content loss function, measured by the L1 norm; $W$ and $H$ denote the width and height of the image; $I^T_{x,y}$ denotes each pixel of the real image; $I^L$ denotes the blurred image input to the generator network; and $G(I^L)_{x,y}$ denotes each pixel of the image produced by the generator network through feature extraction and dimension reduction; optimization of the L1 loss function between the generated image and the real high-definition image seeks the minimum Manhattan distance as the optimal solution;
the adversarial loss function is calculated as

$l_{adv} = -\log D(G(I^L))$

where $l_{adv}$ denotes the adversarial loss function and $D(G(I^L))$ denotes the discriminator's output for the generated image fed into it.
8. A method of improving image sharpness according to claim 1, wherein: in step S3, the Adam algorithm introduces second-order gradient correction, applies learning-rate decay partway through training, and continuously updates the parameters toward the optimum by computing the derivatives of the loss with respect to the trainable parameters in the specified computation graph.
CN201911217831.3A 2019-12-03 2019-12-03 Method for improving image definition Pending CN111192206A (en)

Priority Applications (1)

Application Number: CN201911217831.3A (published as CN111192206A)
Priority Date: 2019-12-03
Filing Date: 2019-12-03
Title: Method for improving image definition

Applications Claiming Priority (1)

Application Number: CN201911217831.3A (published as CN111192206A)
Priority Date: 2019-12-03
Filing Date: 2019-12-03
Title: Method for improving image definition

Publications (1)

Publication Number: CN111192206A
Publication Date: 2020-05-22

Family

ID=70710761

Family Applications (1)

Application Number: CN201911217831.3A (published as CN111192206A)
Status: Pending
Priority Date: 2019-12-03
Filing Date: 2019-12-03
Title: Method for improving image definition

Country Status (1)

Country Link
CN (1) CN111192206A (en)


Patent Citations (4)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
US20190138838A1 (en) * 2017-11-09 2019-05-09 Boe Technology Group Co., Ltd. Image processing method and processing device
CN109785258A (en) * 2019-01-10 2019-05-21 华南理工大学 A kind of facial image restorative procedure generating confrontation network based on more arbiters
CN110136063A (en) * 2019-05-13 2019-08-16 南京信息工程大学 A kind of single image super resolution ratio reconstruction method generating confrontation network based on condition
CN110211045A (en) * 2019-05-29 2019-09-06 电子科技大学 Super-resolution face image method based on SRGAN network

Non-Patent Citations (1)

* Cited by examiner, † Cited by third party
Title
高媛 et al.: "Medical Image Super-Resolution Algorithm Based on Deep Residual Generative Adversarial Networks" (《基于深度残差生成对抗网络的医学影像超分辨率算法》), Journal of Computer Applications (《计算机应用》) *

Cited By (5)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN111784721A (en) * 2020-07-01 2020-10-16 华南师范大学 Ultrasonic endoscopic image intelligent segmentation and quantification method and system based on deep learning
WO2022077417A1 (en) * 2020-10-16 2022-04-21 京东方科技集团股份有限公司 Image processing method, image processing device and readable storage medium
CN112435192A (en) * 2020-11-30 2021-03-02 杭州小影创新科技股份有限公司 Lightweight image definition enhancing method
CN112435192B (en) * 2020-11-30 2023-03-14 杭州小影创新科技股份有限公司 Lightweight image definition enhancing method
CN113379624A (en) * 2021-05-31 2021-09-10 北京达佳互联信息技术有限公司 Image generation method, training method, device and equipment of image generation model

Similar Documents

Publication Publication Date Title
Zhang et al. Multi-level fusion and attention-guided CNN for image dehazing
CN111192206A (en) Method for improving image definition
CN108510485B (en) Non-reference image quality evaluation method based on convolutional neural network
CN109558811B (en) Motion recognition method based on motion foreground attention and unsupervised key frame extraction
CN111340123A (en) Image score label prediction method based on deep convolutional neural network
CN105657402A (en) Depth map recovery method
CN107680077A (en) A kind of non-reference picture quality appraisement method based on multistage Gradient Features
CN110781882A (en) License plate positioning and identifying method based on YOLO model
CN110827312A (en) Learning method based on cooperative visual attention neural network
CN111209858A (en) Real-time license plate detection method based on deep convolutional neural network
CN110992365A (en) Loss function based on image semantic segmentation and design method thereof
CN112633234A (en) Method, device, equipment and medium for training and applying face glasses-removing model
CN110599458A (en) Underground pipe network detection and evaluation cloud system based on convolutional neural network
CN116580184A (en) YOLOv 7-based lightweight model
CN112508851A (en) Mud rock lithology recognition system based on CNN classification algorithm
CN111178503A (en) Mobile terminal-oriented decentralized target detection model training method and system
CN111126155A (en) Pedestrian re-identification method for generating confrontation network based on semantic constraint
CN111369477A (en) Method for pre-analysis and tool self-adaptation of video recovery task
CN115966006A (en) Cross-age face recognition system based on deep learning model
CN109919964A (en) The method that Gaussian Background modeling technique based on mathematical morphology carries out image procossing
CN116258867A (en) Method for generating countermeasure sample based on low-perceptibility disturbance of key region
CN112200831B (en) Dynamic template-based dense connection twin neural network target tracking method
CN113554685A (en) Method and device for detecting moving target of remote sensing satellite, electronic equipment and storage medium
CN113723414A (en) Mask face shelter segmentation method and device
Wang et al. Multi-Patch and Feature Fusion Network for Single Image Dehazing

Legal Events

Date Code Title Description
PB01 Publication
SE01 Entry into force of request for substantive examination
RJ01 Rejection of invention patent application after publication

Application publication date: 20200522