CN111275646A

CN111275646A - Edge-preserving image smoothing method based on deep learning knowledge distillation technology

Info

Publication number: CN111275646A
Application number: CN202010066280.1A
Authority: CN
Inventors: 徐君; 程明明; 刘志昂; 韩琦
Original assignee: Nankai University
Current assignee: Nankai University
Priority date: 2020-01-20
Filing date: 2020-01-20
Publication date: 2020-06-12
Anticipated expiration: 2040-01-20
Also published as: CN111275646B

Abstract

An edge-preserving image smoothing method based on a deep learning knowledge distillation technology belongs to the technical field of image processing. The method is based on a novel deep learning knowledge distillation technology, and the edge detection capability of an edge detection teacher network is distilled into an image smoothing student network, so that the student network has the image smoothing capability of edge protection. The task of the student network is to perform image smoothing, and the task of the teacher network is to extract information of structural edges. The teacher network is used for distilling and transferring the structural edge knowledge to the student network, so that the student network has the capability of keeping the structural edge of the teacher network while performing an image smoothing task. The method can keep the smooth structural edge of the image on the premise of not additionally introducing a depth network for keeping the edge information, overcomes the visual influence of image quality loss such as poor image edge information and blurring in the traditional image smoothness enhancement method, and greatly improves the image smoothing effect.

Description

Edge-preserving image smoothing method based on deep learning knowledge distillation technology

Technical Field

The invention belongs to the technical field of image processing, and particularly relates to an edge-preserving image smoothing method based on a novel deep learning knowledge distillation technology.

Background

With the rapid development of multimedia technology and the wide demand of the industry in recent years, the application scenarios of image processing technology are expanding. The existing image can be divided into fusion of a global structure and a local texture, and the texture of the image can influence the automatic extraction of image structure information to a certain extent. In addition, due to the requirement of people on the photographing technology, the face needs to be beautified by skin grinding when the face is photographed, so that the face looks more attractive. When performing style migration, the structure edge needs to be preserved and the specific texture information needs to be ignored. Based on these requirements, we need edge-preserving smoothing processing on the image after the image is captured, so that the image can be better used for subsequent image beautification, analysis and processing. Image smoothing techniques are a type of computer vision task that preserves global structural information in an image and removes local texture information in the image for certain specific needs. The main purpose of image smoothing is to make the smoothed image more convenient for human or machine analysis and processing by enhancing the visual effect of the image structure relative to the image texture. The image smoothing technology has been widely applied to various fields such as mobile phone photographing technology and social media, and plays an important role in our life. For example, beauty cameras typically smooth human faces using image smoothing techniques.

However, there are some difficulties to be solved in the task of image smoothing. Such as how well image smoothing techniques preserve information of structural edges. Human eyes can instinctively extract meaningful structural information from complex texture information of an image directly. However, it is difficult for a computer program to automatically distinguish between structure information and texture information in an image, and the existing image smoothing technology may smooth out the edge portion of the structure in order to remove the texture information. Edge preserving image smoothing techniques can automatically preserve structural edge information while removing texture, a very challenging task that is very important for computer vision applications.

The article of distinguishing the knowledge in a neural network (NIPS 2014 Deep Learning network), which was published by Hinton et al in 2015, firstly proposes a framework of Deep Learning knowledge distillation, and introduces concepts of 'student network' and 'teacher network', so as to intensively migrate knowledge of a plurality of trained teacher networks into one student network. Knowledge distillation is used here to generalize the "hard label" of the classification dataset to the "soft label" so that the degree of similarity between images of different labels can be learned. After the teacher network is trained, the teacher network provides additional discriminant and similarity knowledge hidden outside the labels, and then the knowledge is migrated to the student network through a knowledge distillation technology. In recent years, with the improvement and improvement of computing devices, neural networks have been developed rapidly. Because of its powerful representation capability, neural networks have been widely used in the research fields of computer vision, natural language processing, etc. One of the major research hotspots on neural networks today is the deployment of neural networks in portable embedded devices. Due to the nature of the neural network, the larger the parameter quantity of the neural network is, the stronger the model performance is. However, the more the network model parameters are, the more memory resources the network needs to occupy, and the operation is very time-consuming, which is contrary to the requirements of the industry that the resources are few and the operation time is short. And knowledge of distillation techniques can help to solve this problem. Knowledge distillation techniques utilize one or more networks of teachers with large parameters to improve the performance of a network of students with small parameters. It is likely that the student network will not ultimately perform as well as the teacher's network, but will be more powerful than a student network trained separately from the teacher's network.

However, the conventional deep learning knowledge distillation method cannot be directly used in the task of edge-preserving image smoothing. The reason for this is that in our task, teacher networks mainly extract structural edge information, and student networks mainly remove local texture in images. The student network is distilled by directly using the trained teacher network, so that only the student network with image edge extraction and detection can be obtained, and the student network with image smoothing capability cannot be obtained. Therefore, a novel distillation technology based on deep learning knowledge is provided and applied to the task of edge-preserving image smoothing. The method can complete the task of image smoothing and has the capability of retaining structural edges.

Disclosure of Invention

The technical problem to be solved by the invention is to reserve the structural edge information of an image while removing the image texture in an image smoothing task. The invention aims to provide a novel deep learning knowledge distillation technology, which is used for carrying out knowledge migration on the structural edge extraction capability of a teacher network to a student network with a smooth image, so that the student network can have the structural edge retention capability of the teacher network while carrying out image smoothing.

The realization process of the invention is as follows:

optionally, selecting an existing deep learning model or retraining a deep learning model as the teacher network includes:

training a deep neural network to carry out an edge detection task;

optionally, the teacher network is used to assist in training the image smoothing student network, and the iteration includes:

inputting the input image I into a student network to obtain a smooth image Is;

respectively inputting the input image I and the smoothed image Is into a teacher network to obtain a structural edge label Ie of the input image and structural edge information Ise of the smoothed image;

obtaining a smooth image label Igt from the input image I by using an existing commonly-used image smoothing algorithm (such as L0);

respectively calculating the distance loss between the smoothed image Is and the smoothed image label Igt and the distance loss between the structural edge information Ise of the smoothed image and the structural edge label Ie of the input image by using a loss function;

carrying out weighted summation on the two losses to obtain a distance loss sum;

and (4) carrying out gradient return on the student network by using the sum of the losses, and updating the parameters of the student network.

The student network is trained using a plurality of iterations as described above.

Compared with the prior art, the invention has the beneficial effects that: the student network is updated by calculating the additional structural edge information calculation loss through the teacher network, so that the student network can learn the capability of image smoothing and can also have the capability of keeping the structural edge of the teacher network. Therefore, the negative influence that the structural edge cannot be well reserved by image smoothing in the conventional image smoothing method can be compensated.

Drawings

FIG. 1 is a flow chart of a novel deep learning knowledge distillation technique-based edge preserving image smoothing method.

FIG. 2 is a schematic diagram of a model of an edge-preserving image smoothing method based on a novel deep learning knowledge distillation technology.

Detailed Description

The technical solutions in the embodiments of the present invention will be clearly and completely described below with reference to the drawings in the examples of the present invention, and it is obvious that the described examples are only a part of the embodiments of the present invention, and not all of the embodiments. Variations, modifications, substitutions and alterations of the embodiments of the invention are within the scope of the invention, without departing from the principles of the invention.

The embodiment of the invention provides an image enhancement deep learning method based on a knowledge distillation technology, the flow is shown in figure 1, and the method comprises the following steps:

and S1, selecting a teacher network to calculate the structural edge graph.

Optionally, the structural edge information is an edge distribution map describing image content structure information in the image;

optionally, a trained deep neural network capable of extracting structural edge information is selected as a teacher network, and a deep neural network model with a similar function can be retrained, wherein a Holistic edge detection model or a classic VGG model can be adopted, and the teacher network is used for assisting in training an image smoothing student network.

And S2, calculating a smooth image by the student network.

Optionally, the student network structure uses a classical deep neural network, such as VGG or residual neural network;

specifically, an image to be smoothed Is input into a deep neural network model (such as a commonly-used VGGNet, which has 5 modules consisting of convolution layers, a correction linear unit and a maximum pooling operation, a module consisting of 3 full-link layers and a correction linear unit, and the last 1 softmax layer), the designed neural network structure Is a structure of a 'predicted image by image' (such as the VGGNet introduced earlier), and an output image which Is the same as the input image I in size and Is subjected to image smoothing Is output by the model.

And S3, calculating the structural edge information of the structural edge information label and the smoothed image by the teacher network.

Specifically, the input image I and the smoothed image Is are respectively input to an edge detection teacher network, and the teacher network predicts a structural edge label Ie of the input image and structural edge information Ise of the smoothed image. And in the edge detection teacher network, the structural edge label Ie of the input image is used as an edge image of the input image, and the structural edge information Ise of the smoothed image is an edge image detected by the edge detection teacher network on the smoothed image.

And S4, calculating the loss of the student network to update the student network parameters.

Optionally, a common loss function (such as L2 norm, L1 norm, cross entropy) Is used to calculate the loss | | | Is-Igt | | | _ F } {2} between the smoothed image Is and the smoothed image label Igt, and the loss | | | | Ise-Ie | | _ F } {2} between the structural edge information Ise of the smoothed image and the structural edge label Ie of the input image, respectively; the smoothed image label Igt in the present invention is obtained by using the conventional image smoothing algorithm (e.g. L0) to input image I.

Specifically, the two losses are weighted and summed to obtain a loss sum, the loss sum is used for carrying out conventional gradient return on the student network (the conventional gradient return can be automatically realized through an SGD (Adam algorithm) in a deep learning framework PyTorch or Tensorflow), and the student network parameters are updated.

And S5, training the student network by multiple iterations.

Specifically, the learning student network parameters are continuously updated by using the updating mode to train the student network, and a model schematic diagram is shown in fig. 2. The teacher network parameters are fixed in the training process, namely the teacher network parameters are not updated during the period of training the student network.

Claims

1. An edge-preserving image smoothing method based on deep learning knowledge distillation technology is characterized by comprising the following steps: the method carries out knowledge migration on the structural edge extraction capability of the teacher network to the student network with smooth images, so that the student network has the structural edge retention capability of the teacher network while carrying out image smoothing, and comprises the following steps:

s1, training a deep neural network with a reserved structural edge as a teacher network;

s2, inputting the input image into a student network, wherein the network outputs a smooth image which has the same size as the input image and is subjected to image enhancement;

s3, respectively inputting the input image and the smoothed image into a teacher network, and calculating structural edge information by the teacher network, wherein the edge information of the input image is used as a label, and the edge information of the smoothed image is used as predicted edge information;

s4, calculating loss information of the labels and the predicted edge information in the S3 as the loss of the student network, and updating parameters of the student network by using a reverse iteration strategy;

and S5, continuously updating the learning student network parameters by using the updating mode, and training the model parameters of the student network.

2. The edge-preserving image smoothing method based on the deep learning knowledge distillation technology according to claim 1, characterized in that: respectively inputting the input image and the smooth image into a teacher network, extracting structural edge labels of the input image and structural edge information of the smooth image, taking the structural edge labels of the input image extracted by the edge detection teacher network as an edge image of the input image, and taking the structural edge information of the smooth image as an edge image of the smoothed image.

3. The edge-preserving image smoothing method based on the deep learning knowledge distillation technology according to claim 1, characterized in that: the total loss function of the student network is formed by weighted summation of two parts of loss, wherein one part is to calculate the loss between the smoothed image and the smoothed image label by using a common loss function, and the other part is to calculate the loss between the structural edge information of the smoothed image and the structural edge label of the input image by using the common loss function.

4. The edge-preserving image smoothing method based on the deep learning knowledge distillation technology according to claim 1 or 2, characterized in that: the teacher network is trained in advance, and teacher network parameters are not updated in the process of training the student network.