CN114677304A - Image deblurring algorithm based on knowledge distillation and deep neural network - Google Patents
Image deblurring algorithm based on knowledge distillation and deep neural network Download PDFInfo
- Publication number
- CN114677304A CN114677304A CN202210313655.9A CN202210313655A CN114677304A CN 114677304 A CN114677304 A CN 114677304A CN 202210313655 A CN202210313655 A CN 202210313655A CN 114677304 A CN114677304 A CN 114677304A
- Authority
- CN
- China
- Prior art keywords
- image
- network
- training
- network model
- pairs
- Prior art date
- Legal status (The legal status is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the status listed.)
- Pending
Links
- 238000013528 artificial neural network Methods 0.000 title claims abstract description 31
- 238000013140 knowledge distillation Methods 0.000 title claims abstract description 21
- 238000012549 training Methods 0.000 claims abstract description 73
- 238000000034 method Methods 0.000 claims abstract description 32
- 238000012360 testing method Methods 0.000 claims abstract description 25
- 238000012795 verification Methods 0.000 claims abstract description 22
- 238000003062 neural network model Methods 0.000 claims abstract description 6
- 230000006870 function Effects 0.000 claims description 16
- 230000004927 fusion Effects 0.000 claims description 16
- 230000008569 process Effects 0.000 claims description 14
- ORILYTVJVMAKLC-UHFFFAOYSA-N Adamantane Natural products C1C(C2)CC3CC1CC2C3 ORILYTVJVMAKLC-UHFFFAOYSA-N 0.000 claims description 6
- 238000000137 annealing Methods 0.000 claims description 6
- 230000007246 mechanism Effects 0.000 claims description 6
- 238000007781 pre-processing Methods 0.000 claims description 6
- 238000012545 processing Methods 0.000 claims description 6
- 230000002238 attenuated effect Effects 0.000 claims description 5
- 230000007306 turnover Effects 0.000 claims description 5
- 238000005070 sampling Methods 0.000 claims description 4
- 238000006243 chemical reaction Methods 0.000 abstract 1
- 230000000694 effects Effects 0.000 description 8
- 238000010586 diagram Methods 0.000 description 3
- 238000011084 recovery Methods 0.000 description 3
- 238000005457 optimization Methods 0.000 description 2
- 238000011160 research Methods 0.000 description 2
- 238000007476 Maximum Likelihood Methods 0.000 description 1
- 230000005540 biological transmission Effects 0.000 description 1
- 230000015556 catabolic process Effects 0.000 description 1
- 238000013135 deep learning Methods 0.000 description 1
- 238000006731 degradation reaction Methods 0.000 description 1
- 238000001514 detection method Methods 0.000 description 1
- 238000004821 distillation Methods 0.000 description 1
- 238000005516 engineering process Methods 0.000 description 1
- 230000006872 improvement Effects 0.000 description 1
- 238000013507 mapping Methods 0.000 description 1
- 238000012986 modification Methods 0.000 description 1
- 230000004048 modification Effects 0.000 description 1
- 238000006467 substitution reaction Methods 0.000 description 1
Images
Classifications
-
- G—PHYSICS
- G06—COMPUTING; CALCULATING OR COUNTING
- G06T—IMAGE DATA PROCESSING OR GENERATION, IN GENERAL
- G06T5/00—Image enhancement or restoration
- G06T5/73—Deblurring; Sharpening
-
- G—PHYSICS
- G06—COMPUTING; CALCULATING OR COUNTING
- G06N—COMPUTING ARRANGEMENTS BASED ON SPECIFIC COMPUTATIONAL MODELS
- G06N3/00—Computing arrangements based on biological models
- G06N3/02—Neural networks
- G06N3/08—Learning methods
-
- G—PHYSICS
- G06—COMPUTING; CALCULATING OR COUNTING
- G06N—COMPUTING ARRANGEMENTS BASED ON SPECIFIC COMPUTATIONAL MODELS
- G06N5/00—Computing arrangements using knowledge-based models
- G06N5/02—Knowledge representation; Symbolic representation
-
- G—PHYSICS
- G06—COMPUTING; CALCULATING OR COUNTING
- G06T—IMAGE DATA PROCESSING OR GENERATION, IN GENERAL
- G06T2207/00—Indexing scheme for image analysis or image enhancement
- G06T2207/20—Special algorithmic details
- G06T2207/20081—Training; Learning
Landscapes
- Engineering & Computer Science (AREA)
- Theoretical Computer Science (AREA)
- Physics & Mathematics (AREA)
- General Physics & Mathematics (AREA)
- Software Systems (AREA)
- Artificial Intelligence (AREA)
- Computing Systems (AREA)
- General Engineering & Computer Science (AREA)
- Data Mining & Analysis (AREA)
- Mathematical Physics (AREA)
- Computational Linguistics (AREA)
- Evolutionary Computation (AREA)
- Health & Medical Sciences (AREA)
- Life Sciences & Earth Sciences (AREA)
- Biomedical Technology (AREA)
- Biophysics (AREA)
- General Health & Medical Sciences (AREA)
- Molecular Biology (AREA)
- Image Processing (AREA)
- Image Analysis (AREA)
Abstract
The invention discloses an image deblurring algorithm based on knowledge distillation and a deep neural network, which can be used for realizing high-quality conversion from a blurred image to a clear image. The algorithm comprises the following steps: constructing a training set, a verification set and a test set according to the public image data set; constructing a self-coding deep neural network based on an Unet structure, namely a teacher model; carrying out supervised training on the teacher model based on the high-definition image data set until convergence; constructing a deep neural network model based on a Unet structure and deformable convolution, namely a student model; based on a complete fuzzy image data set, a converged teacher network is combined, a knowledge distillation method is used for carrying out supervised training on a deblurring student model, and then a converged deep deblurring model is obtained; and evaluating the deblurring performance of the student network model on the blurred image test set. The invention obtains better deblurring performance, and obtains better structural similarity while obtaining higher peak signal-to-noise ratio.
Description
Technical Field
The invention belongs to the technical field of image deblurring in computer vision, and particularly relates to an image deblurring algorithm based on knowledge distillation and a deep neural network.
Background
Digital images are an important information carrier and are widely used in various fields, such as: digital audio and video, remote sensing detection, traffic management and the like. However, digital images are very susceptible to various damage factors in the processes of acquisition, transmission and storage, so that the quality of the digital images is reduced, and blurring is a main characteristic. Therefore, the method has important significance and great value for the research of deblurring the digital image.
The traditional image deblurring algorithm mainly models an image blurring process into a linear convolution model, and a solving model of the traditional image deblurring algorithm is mainly based on a Maximum a Posterior probability (Max a Posterior, MAP) model and a Maximum Likelihood Estimation (MLE) model to solve a minimized energy function. However, the traditional deblurring algorithm is limited to a limited scene type under the condition of space uniform blurring, the prior robustness of the used image is poor, and a non-convex and non-linear constraint function is introduced into an optimization model, so that the optimization process is large in calculated amount, low in restoration speed and not high in practical application value.
With the great success of the deep neural network in the field of computer vision, a lot of researches have proposed that the deep neural network is applied to the problem of image deblurring. Researchers consider that most of real scenes are complex spatial non-uniform fuzzy scenes, and are difficult to establish a proper degradation model, and try to directly learn a non-linear mapping function from a fuzzy image to a clear image by utilizing the strong fitting capability of a deep neural network. The method is often used for solving the problem of complex spatial non-uniform deblurring (such as dynamic scene blurring) and achieves a better recovery effect. The current image deblurring algorithm based on the deep neural network mainly comprises a multi-scale convolution neural network, a multi-image block hierarchical network, a GAN-based deep neural network, an image prior-based deep neural network and the like. However, the existing image deblurring algorithm based on the deep neural network has large overall parameter quantity, the training difficulty of the network is increased only by simply deepening the network depth, supervision and constraint on intermediate features are lacked in the network training process, and finally the deblurring performance is limited to be improved. Meanwhile, when the complex and variable spatial non-uniform blurring exists, the recovery effect of the algorithm still has limitation.
Disclosure of Invention
The invention aims to provide an image deblurring algorithm based on knowledge distillation and a deep neural network, and aims to solve the technical problems that the existing image deblurring algorithm has no higher practical application value and the existing image deblurring algorithm based on the deep neural network has limited improvement on deblurring performance.
In order to solve the technical problems, the specific technical scheme of the invention is as follows:
an image deblurring algorithm based on knowledge distillation and a deep neural network comprises the following steps:
step S1: constructing an end-to-end deep neural network model based on a Unet structure, namely a teacher network model;
step S2: acquiring a fuzzy image data set, and performing supervised training on a teacher network model by using a clear image set in the fuzzy image data set so as to obtain a converged teacher network model, wherein the converged teacher network model realizes that a high-definition image is input, outputs a plurality of intermediate features through forward operation and obtains a recovered high-definition image;
step S3: constructing an end-to-end image deblurring model based on a Unet structure and deformable convolution, namely a student network model;
step S4: and (5) performing supervision training on the intermediate characteristic output of the student network model by using the teacher network model converged in the step (S2) based on the fuzzy image data set in the step (S2) so as to obtain a converged student network model, wherein the converged student network model takes the fuzzy image as input and outputs a reconstructed deblurred image through forward operation.
Further, the step S2 specifically includes the following steps:
step S201: acquiring a public fuzzy image data set GOPRO, and dividing the data set GOPRO into a training set, a verification set and a test set, wherein the training set, the verification set and the test set respectively comprise 2000 pairs of training image pairs, 103 pairs of verification image pairs and 1111 pairs of test image pairs; training a teacher network model by using only real and clear images of the training set;
step S202: in the data preprocessing stage, image blocks with 256 × 256 resolutions are randomly cut from an original real clear image, random horizontal and vertical turnover data enhancement processing is carried out on the image blocks, and then the image blocks are input into a teacher network model for training after being normalized;
step S203: the method comprises the steps of extracting network characteristics of an input clear image through an encoder, reconstructing the network characteristics of a decoder, and finally fusing the characteristics to obtain a reconstructed clear image;
step S204: performing supervision training on the teacher network by using an L1 loss function;
step S205: and the trained teacher network model performs forward operation on the input clear image to obtain a plurality of decoding feature maps with different scales of the clear image, and finally outputs a restored clear image, wherein the intermediate decoding feature maps are used for subsequent supervised training of the student network model.
Further, the step S4 specifically includes the following steps:
step S401: using the data set GOPRO partitioned in step S2 as a training set, a verification set and a test set, wherein the training set includes 2000 pairs of blurred image and real sharp image pairs, the verification set includes 103 pairs of blurred image and real sharp image pairs, and the test set includes 1111 pairs of blurred image and real sharp image pairs;
step S402: in the data preprocessing stage, 256 × 256 image pairs are randomly cut from an original image pair, the image pairs are subjected to random horizontal and vertical turnover data enhancement processing, finally, the normalized image pairs are input into a student network model, and meanwhile, real clear images in the image pairs are input into a converged teacher network model;
step S403: extracting the input fuzzy image through the network characteristics of an encoder, reconstructing the network characteristics of a decoder, and finally performing characteristic fusion to output a reconstructed deblurred image; meanwhile, the teacher network model performs forward operation on the 256 × 256 real clear images to obtain intermediate feature maps of a plurality of clear images, and the intermediate feature maps are used for performing supervision training on the decoding feature output of the student network;
step S404: performing supervised training on the student network output image by adopting an L1 loss function and a frequency domain-based L1 loss function;
step S405: and performing forward operation on the input test set fuzzy image by using the optimized available student network model to obtain a recovered deblurred image.
Further, step S204 specifically includes the following steps:
in the training process, an Adam optimizer is used, the learning rate is continuously attenuated in a cosine annealing mode, when the Peak Signal-to-Noise Ratio (PSNR) and the Structural Similarity (SSIM) on the verification set do not increase any more after training, the network stops training, and an optimized available teacher network model is obtained.
Further, step S404 specifically includes the following steps: loss function L of student network populationtotalComprises the following steps:
Ltotal=L1+Lfft1+Lfeats
wherein L is1Represents the reconstruction loss, Lfft1Representing the reconstruction loss, L, based on the frequency domainfeatsRepresenting the intermediate feature reconstruction loss; in the training process, an Adam optimizer is used, the learning rate is continuously attenuated in a cosine annealing mode, when the peak signal-to-noise ratio and the structural similarity on the verification set do not increase any more after training, the student network stops training, and an optimized usable image deblurring network model is obtained.
Further, in step S1, the teacher network is composed of an encoder network and a decoder network, where the encoder network is composed of a plurality of residual error modules ResCAB including a channel attention mechanism and a downsampling layer, and after the feature map is downsampled, the feature resolution is reduced, but the convolution depth is continuously deepened, and the number of extracted feature channels is increased; the decoder network consists of a plurality of residual error modules ResCAB containing a channel attention mechanism and an up-sampling layer, and after the feature map is up-sampled, the feature resolution is increased, and the number of feature channels is reduced; meanwhile, jump connection is added between feature layers with the same resolution of the coding and decoding network, so that the learning difficulty and the training difficulty of the coding and decoding network are reduced; and finally, performing fusion reconstruction on the features by using a residual error module ResCAB, and outputting a reconstructed image.
Further, the student network model in step S3 is composed of an encoder network, a decoder network, and a feature fusion network; the encoder network consists of a plurality of residual modules ResCAB containing channel attention and downsampling layers; the decoder network consists of an improved residual error module, a deformable convolution module and the like, and jump connection exists between different scale layers of the encoder network and the decoder network; and finally, performing cross-scale feature fusion on the shallow feature and the decoding features with different scales by a feature fusion network.
Further, the method comprises the step of evaluating the deblurring performance of the student network model on the blurred image test set.
Further, the peak signal-to-noise ratio and the structural similarity are calculated according to the deblurred image obtained by the convergent student network forward operation and the real clear image in the blurred image data set, and the image deblurring performance of the algorithm is evaluated by combining the network parameter quantity and the forward operation time.
The image deblurring algorithm based on knowledge distillation and the deep neural network has the following advantages: the method is based on knowledge distillation technology and deep neural network architecture, and adds a teacher network model on the basis of normal supervision and training, so that supervision and constraint on intermediate features in the network training process are enhanced, and higher-quality deblurred images can be obtained. Compared with the current main image deblurring algorithm, the method can improve the deblurring performance of the network model under the conditions of keeping small network parameter quantity and short forward operation time, obtain higher peak signal-to-noise ratio (PSNR) and Structural Similarity (SSIM), and restore clearer images. In the field of deep learning, the invention proves that the knowledge distillation can improve the performance of a deep neural network in a specific field.
Drawings
FIG. 1 is a flow chart of the knowledge-based distillation and deep neural network image deblurring algorithm of the present invention;
FIG. 2 is a diagram of a teacher network model architecture of the present invention;
FIG. 3 is a flow chart of teacher network model training and prediction of the present invention;
FIG. 4 is an overall block diagram of a teacher network and student network model of the present invention;
FIG. 5 is a flow chart of student network model training and prediction of the present invention;
FIG. 6 is a comparison of the deblurred image effect of the method of the present invention with the current dominant image deblurring algorithm.
Detailed Description
In order to better understand the purpose, structure and function of the present invention, an image deblurring algorithm based on knowledge distillation and deep neural network is described in further detail below with reference to the attached drawings.
Example 1
Referring to fig. 1-6, the present implementation provides an image deblurring algorithm based on knowledge distillation and a deep neural network.
As shown in fig. 1, the algorithm specifically includes:
step S1: constructing an end-to-end deep neural network model based on a Unet structure, namely a teacher network model;
the specific structure of the teacher network model is shown in fig. 2. The teacher network mainly comprises an encoder network and a decoder network, wherein the encoder network mainly comprises a plurality of residual error modules ResCAB containing a channel attention mechanism and a down-sampling layer, when a feature map is down-sampled, the feature resolution is reduced, but the convolution depth is continuously deepened, and the number of extracted feature channels is increased. The decoder network is composed of a plurality of residual modules ResCAB containing channel attention mechanisms and an upsampling layer, and after the feature map is upsampled, the feature resolution is increased, and the number of feature channels is reduced. Meanwhile, jump connection is added between feature layers with the same resolution of the coding and decoding network, and the learning difficulty and the training difficulty of the network are reduced. And finally, performing fusion reconstruction on the features by using a residual error module ResCAB, and outputting a reconstructed image.
Step S2: acquiring a fuzzy image data set, and performing supervised training on a teacher network model by using a clear image set in the fuzzy image data set so as to obtain a converged teacher network model, wherein the converged teacher network model realizes that a high-definition image is input, outputs a plurality of intermediate features through forward operation and obtains a recovered high-definition image;
the teacher network model training and predicting process is shown in fig. 3, and includes:
step S201: acquiring a public fuzzy image data set GOPRO, and dividing the data set GOPRO into a training set, a verification set and a test set, wherein the training set, the verification set and the test set respectively comprise 2000 pairs of training image pairs, 103 pairs of verification image pairs and 1111 pairs of test image pairs; the teacher network model is trained using only the true sharp images of the training set.
Step S202: in the data preprocessing stage, image blocks with 256 × 256 resolutions are randomly cut from an original real clear image, data enhancement processing such as random horizontal and vertical overturning is carried out on the image blocks, and then the image blocks are normalized and input into a teacher neural network model for training;
step S203: and (3) extracting the network characteristics of the input clear image through an encoder, reconstructing the network characteristics of a decoder, and finally performing characteristic fusion to obtain a reconstructed clear image.
Step S204: performing supervision training on a teacher network by adopting an L1 loss function, using an Adam optimizer in the training process, continuously attenuating the learning rate in a cosine annealing mode, and stopping the training of the network when the PSNR and SSIM on a verification set do not increase any more after training so as to obtain an optimized available teacher network model;
step S205: and the trained teacher network model performs forward operation on the input clear image to obtain a plurality of decoding feature maps with different scales of the clear image, and finally outputs a restored clear image, wherein the intermediate decoding feature maps are used for subsequent supervised training of the student network model.
Step S3: constructing an end-to-end image deblurring model based on a Unet structure and deformable convolution, namely a student neural network model;
the concrete structure of the student network model is shown in fig. 4. The student network model mainly comprises an encoder network, a decoder network and a feature fusion network. The encoder network consists of a plurality of residual modules ResCAB containing channel attention and downsampling layers; the decoder network consists of an improved residual error module, a deformable convolution module and the like, and jump connection exists between different scale layers of the encoder network and the decoder network. And finally, the feature fusion network performs cross-scale feature fusion on the shallow features and the decoding features with different scales.
Step S4: and (5) performing supervision training on the intermediate characteristic output of the student network model by using the teacher network model converged in the step (S2) based on the fuzzy image data set in the step (S2) so as to obtain a converged student network model, wherein the converged student network model takes the fuzzy image as input and outputs a reconstructed deblurred image through forward operation.
The training and predicting process of the student network model is shown in fig. 5, and includes:
step S401: the data set GOPRO divided in step S2 is used as a training set, a verification set and a test set, wherein the training set includes 2000 pairs of blurred images and real sharp image pairs, the verification set includes 103 pairs of blurred images and real sharp image pairs, and the test set includes 1111 pairs of blurred images and real sharp image pairs.
Step S402: and in the data preprocessing stage, image pairs with 256 × 256 resolutions are randomly cut from the original image pairs, the images are subjected to random horizontal and vertical turnover data enhancement processing, finally, the normalized images are input into a student network model, and meanwhile, real clear images in the image pairs are input into a converged teacher network model.
Step S403: extracting the input fuzzy image through the network characteristics of an encoder, reconstructing the network characteristics of a decoder, and finally performing characteristic fusion to output a reconstructed deblurred image; meanwhile, the teacher network model performs forward operation on the 256 × 256 real clear images to obtain intermediate feature maps of a plurality of clear images, and the intermediate feature maps are used for performing supervision training on the decoding feature output of the student network.
Step S404: and carrying out supervised training on the student network output image by adopting an L1 loss function and a frequency domain-based L1 loss function. Student netLoss function L of the ensemble of the networktotalComprises the following steps:
Ltotal=L1+Lfft1+Lfeats(1)
wherein L is1Represents the reconstruction loss, Lfft1Representing the reconstruction loss, L, based on the frequency domainfeatsRepresenting the intermediate feature reconstruction loss. In the training process, an Adam optimizer is used, the learning rate is continuously attenuated in a cosine annealing mode, and when the Peak Signal-to-Noise Ratio (PSNR) and the Structural Similarity (SSIM) on a verification set are basically not increased any more, the student network stops training, so that an optimized usable image deblurring network model is obtained.
Step S405: and carrying out forward operation on the input test set fuzzy image by using the optimized student network model with fixed parameters to obtain a recovered deblurred image.
It should be noted that the teacher network model and the student network model provided in this embodiment are not limited to the Pytorch framework, and only need to train the dataset GORPO, and achieve convergence after iteration for several times (order of magnitude) in the training process, and finally achieve reconstruction of deblurred images.
And finally, evaluating the deblurring performance of the student network model on the blurred image test set. More specifically, the peak signal-to-noise ratio and the structural similarity are calculated according to the deblurred image obtained by the convergent student network forward operation and the real clear image in the blurred image data set, and the image deblurring performance of the algorithm is evaluated by combining the network parameter quantity and the forward operation time. Wherein, the larger the peak signal-to-noise ratio and the structural similarity value are, the better the quality of the restored image is.
As shown in the following table, the method of the present invention is compared with the deblurring performance of the current main image deblurring algorithm. From the test results in the following table, the method of the present invention obtains the highest PSNR value and SSIM value on the GOPRO test set, which indicates that the final deblurring effect achieved by the method of the present invention is the best. The PSNR of the method is still 0.56dB higher than the PNSR value of the best-performance HINet algorithm in other algorithms in the table, which shows that the deblurring performance of the algorithm is greatly improved.
Compared with the MPRIT and HINet algorithms with better deblurring effect, the method has smaller parameter quantity and lighter network in terms of network parameter quantity. The running time of the method is still short in the view of the network forward operation time.
Fig. 6 is a diagram showing an example of the deblurring image effect comparison of the method of the present invention and the current main image deblurring algorithm. The first row is respectively an original blurred image, a DeepDeblur, a DMPHN and an MPRNet algorithm deblurred image from left to right; the second row is from left to right respectively MIMO-Unet, HINet and the deblurred image and the real clear image of the method of the invention. From the point of view of the overall quality of the image and the recovery effect of the local image details, the deblurring effect of the method can be visually displayed to be better than that of other algorithms.
The details of the present invention are well known to those skilled in the art.
It is to be understood that the present invention has been described with reference to certain embodiments, and that various changes in the features and embodiments, or equivalent substitutions may be made therein by those skilled in the art without departing from the spirit and scope of the invention. In addition, many modifications may be made to adapt a particular situation or material to the teachings of the invention without departing from the essential scope thereof. Therefore, it is intended that the invention not be limited to the particular embodiment disclosed, but that the invention will include all embodiments falling within the scope of the appended claims.
Claims (9)
1. An image deblurring algorithm based on knowledge distillation and a deep neural network is characterized by comprising the following steps:
step S1: constructing an end-to-end deep neural network model based on a Unet structure, namely a teacher network model;
step S2: acquiring a fuzzy image data set, and performing supervised training on a teacher network model by using a clear image set in the fuzzy image data set so as to obtain a converged teacher network model, wherein the converged teacher network model realizes that a high-definition image is input, outputs a plurality of intermediate features through forward operation and obtains a recovered high-definition image;
step S3: constructing an end-to-end image deblurring model based on a Unet structure and deformable convolution, namely a student network model;
step S4: and (5) performing supervision training on the intermediate characteristic output of the student network model by using the teacher network model converged in the step (S2) based on the fuzzy image data set in the step (S2) so as to obtain a converged student network model, wherein the converged student network model takes the fuzzy image as input and outputs a reconstructed deblurred image through forward operation.
2. The knowledge distillation and deep neural network based image deblurring algorithm of claim 1, wherein the step S2 specifically comprises the steps of:
step S201: acquiring a public fuzzy image data set GOPRO, and dividing the data set GOPRO into a training set, a verification set and a test set, wherein the training set, the verification set and the test set respectively comprise 2000 pairs of training image pairs, 103 pairs of verification image pairs and 1111 pairs of test image pairs; training the teacher network model using only the real clear images of the training set;
step S202: in the data preprocessing stage, image blocks with 256 × 256 resolutions are randomly cut from an original real clear image, random horizontal and vertical turnover data enhancement processing is carried out on the image blocks, and then the image blocks are input into a teacher network model for training after being normalized;
step S203: the method comprises the steps of extracting network characteristics of an input clear image through an encoder, reconstructing the network characteristics of a decoder, and finally fusing the characteristics to obtain a reconstructed clear image;
step S204: performing supervision training on the teacher network by using an L1 loss function;
step S205: and the trained teacher network model performs forward operation on the input clear image to obtain a plurality of decoding feature maps with different scales of the clear image, and finally outputs a restored clear image, wherein the intermediate decoding feature maps are used for subsequent supervised training of the student network model.
3. The knowledge distillation and deep neural network based image deblurring algorithm of claim 2, wherein the step S4 specifically comprises the steps of:
step S401: using the data set GOPRO divided in step S2 as a training set, a verification set and a test set, wherein the training set includes 2000 pairs of blurred images and real sharp image pairs, the verification set includes 103 pairs of blurred images and real sharp image pairs, and the test set includes 1111 pairs of blurred images and real sharp image pairs;
step S402: in the data preprocessing stage, 256 × 256 image pairs are randomly cut from an original image pair, the image pairs are subjected to random horizontal and vertical turnover data enhancement processing, finally, the normalized image pairs are input into a student network model, and meanwhile, real clear images in the image pairs are input into a converged teacher network model;
step S403: extracting the input fuzzy image through the network characteristics of an encoder, reconstructing the network characteristics of a decoder, and finally performing characteristic fusion to output a reconstructed deblurred image; meanwhile, the teacher network model performs forward operation on the 256 × 256 real clear images to obtain intermediate feature maps of a plurality of clear images, and the intermediate feature maps are used for performing supervision training on the decoding feature output of the student network;
step S404: performing supervised training on the student network output image by adopting an L1 loss function and a frequency domain-based L1 loss function;
step S405: and performing forward operation on the input test set fuzzy image by using the optimized available student network model to obtain a recovered deblurred image.
4. The knowledge distillation and deep neural network based image deblurring algorithm of claim 3, wherein the step S204 specifically comprises the following steps:
in the training process, an Adam optimizer is used, the learning rate is continuously attenuated in a cosine annealing mode, when the peak signal-to-noise ratio and the structural similarity on the verification set do not increase any more after training, the network stops training, and an optimized available teacher network model is obtained.
5. The image deblurring algorithm based on knowledge distillation and deep neural network of claim 4, wherein the step S404 comprises the following steps: loss function L of student network populationtotalComprises the following steps:
Ltotal=L1+Lfft1+Lfeats
wherein L is1Represents the reconstruction loss, Lfft1Representing the reconstruction loss, L, based on the frequency domainfeatsRepresenting the intermediate feature reconstruction loss; in the training process, an Adam optimizer is used, the learning rate is continuously attenuated in a cosine annealing mode, when the peak signal-to-noise ratio and the structural similarity on the verification set do not increase any more after training, the student network stops training, and an optimized usable image deblurring network model is obtained.
6. The knowledge distillation and depth neural network based image deblurring algorithm of claim 1, wherein in step S1, the teacher network is composed of an encoder network and a decoder network, wherein the encoder network is composed of a plurality of residual error modules ResCAB containing channel attention mechanism and a downsampling layer, when the feature map is downsampled, the feature resolution is reduced, but the convolution depth is continuously deepened, and the number of extracted feature channels is increased; the decoder network consists of a plurality of residual error modules ResCAB containing a channel attention mechanism and an up-sampling layer, and after the feature map is up-sampled, the feature resolution is increased, and the number of feature channels is reduced; meanwhile, jump connection is added between feature layers with the same resolution of the coding and decoding network, so that the learning difficulty and the training difficulty of the coding and decoding network are reduced; and finally, performing fusion reconstruction on the features by using a residual error module ResCAB, and outputting a reconstructed image.
7. The knowledge distillation and deep neural network based image deblurring algorithm of claim 1, wherein the student network model in step S3 is composed of an encoder network, a decoder network and a feature fusion network; the encoder network consists of a plurality of residual modules ResCAB containing channel attention and a down-sampling layer; the decoder network consists of an improved residual error module, a deformable convolution module and the like, and jump connection exists between different scale layers of the encoder network and the decoder network; and finally, the feature fusion network performs cross-scale feature fusion on the shallow features and the decoding features with different scales.
8. The knowledge distillation and deep neural network based image deblurring algorithm of claim 1, further comprising evaluating deblurring performance of the student network model on a test set of blurred images.
9. The knowledge distillation and deep neural network based image deblurring algorithm of claim 8, wherein peak signal-to-noise ratio and structural similarity are calculated from deblurred images obtained by convergent student network forward operations and true sharp images in the blurred image dataset, and the image deblurring performance of the algorithm is evaluated in combination with the network parameters and forward operation time.
Priority Applications (1)
Application Number | Priority Date | Filing Date | Title |
---|---|---|---|
CN202210313655.9A CN114677304A (en) | 2022-03-28 | 2022-03-28 | Image deblurring algorithm based on knowledge distillation and deep neural network |
Applications Claiming Priority (1)
Application Number | Priority Date | Filing Date | Title |
---|---|---|---|
CN202210313655.9A CN114677304A (en) | 2022-03-28 | 2022-03-28 | Image deblurring algorithm based on knowledge distillation and deep neural network |
Publications (1)
Publication Number | Publication Date |
---|---|
CN114677304A true CN114677304A (en) | 2022-06-28 |
Family
ID=82076688
Family Applications (1)
Application Number | Title | Priority Date | Filing Date |
---|---|---|---|
CN202210313655.9A Pending CN114677304A (en) | 2022-03-28 | 2022-03-28 | Image deblurring algorithm based on knowledge distillation and deep neural network |
Country Status (1)
Country | Link |
---|---|
CN (1) | CN114677304A (en) |
Cited By (8)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
CN115019183A (en) * | 2022-07-28 | 2022-09-06 | 北京卫星信息工程研究所 | Remote sensing image model migration method based on knowledge distillation and image reconstruction |
CN115456914A (en) * | 2022-11-08 | 2022-12-09 | 浙江优众新材料科技有限公司 | Priori knowledge-based defocused image deblurring method, device and medium |
CN115511059A (en) * | 2022-10-12 | 2022-12-23 | 北华航天工业学院 | Network lightweight method based on convolutional neural network channel decoupling |
CN115601536A (en) * | 2022-12-02 | 2023-01-13 | 荣耀终端有限公司(Cn) | Image processing method and electronic equipment |
CN116385329A (en) * | 2023-06-06 | 2023-07-04 | 之江实验室 | Multilayer knowledge distillation medical image generation method and device based on feature fusion |
CN116823656A (en) * | 2023-06-27 | 2023-09-29 | 华东师范大学 | Image blind deblurring method and system based on frequency domain local feature attention mechanism |
WO2024040544A1 (en) * | 2022-08-26 | 2024-02-29 | Intel Corporation | Training neural network through many-to-one knowledge injection |
CN117831138A (en) * | 2024-03-05 | 2024-04-05 | 天津科技大学 | Multi-mode biological feature recognition method based on third-order knowledge distillation |
Citations (3)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
CN113592736A (en) * | 2021-07-27 | 2021-11-02 | 温州大学 | Semi-supervised image deblurring method based on fusion attention mechanism |
CN113947589A (en) * | 2021-10-26 | 2022-01-18 | 北京理工大学 | Missile-borne image deblurring method based on countermeasure generation network |
CN114066750A (en) * | 2021-10-22 | 2022-02-18 | 西北工业大学 | Self-encoder deblurring method based on domain transformation |
-
2022
- 2022-03-28 CN CN202210313655.9A patent/CN114677304A/en active Pending
Patent Citations (3)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
CN113592736A (en) * | 2021-07-27 | 2021-11-02 | 温州大学 | Semi-supervised image deblurring method based on fusion attention mechanism |
CN114066750A (en) * | 2021-10-22 | 2022-02-18 | 西北工业大学 | Self-encoder deblurring method based on domain transformation |
CN113947589A (en) * | 2021-10-26 | 2022-01-18 | 北京理工大学 | Missile-borne image deblurring method based on countermeasure generation network |
Non-Patent Citations (1)
Title |
---|
郭业才;朱文军;: "基于深度卷积神经网络的运动模糊去除算法", 南京理工大学学报, no. 03, 30 June 2020 (2020-06-30) * |
Cited By (12)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
CN115019183A (en) * | 2022-07-28 | 2022-09-06 | 北京卫星信息工程研究所 | Remote sensing image model migration method based on knowledge distillation and image reconstruction |
WO2024040544A1 (en) * | 2022-08-26 | 2024-02-29 | Intel Corporation | Training neural network through many-to-one knowledge injection |
CN115511059A (en) * | 2022-10-12 | 2022-12-23 | 北华航天工业学院 | Network lightweight method based on convolutional neural network channel decoupling |
CN115511059B (en) * | 2022-10-12 | 2024-02-09 | 北华航天工业学院 | Network light-weight method based on convolutional neural network channel decoupling |
CN115456914A (en) * | 2022-11-08 | 2022-12-09 | 浙江优众新材料科技有限公司 | Priori knowledge-based defocused image deblurring method, device and medium |
CN115456914B (en) * | 2022-11-08 | 2023-03-24 | 浙江优众新材料科技有限公司 | Priori knowledge-based defocused image deblurring method, device and medium |
CN115601536A (en) * | 2022-12-02 | 2023-01-13 | 荣耀终端有限公司(Cn) | Image processing method and electronic equipment |
CN116385329A (en) * | 2023-06-06 | 2023-07-04 | 之江实验室 | Multilayer knowledge distillation medical image generation method and device based on feature fusion |
CN116385329B (en) * | 2023-06-06 | 2023-08-29 | 之江实验室 | Multilayer knowledge distillation medical image generation method and device based on feature fusion |
CN116823656A (en) * | 2023-06-27 | 2023-09-29 | 华东师范大学 | Image blind deblurring method and system based on frequency domain local feature attention mechanism |
CN117831138A (en) * | 2024-03-05 | 2024-04-05 | 天津科技大学 | Multi-mode biological feature recognition method based on third-order knowledge distillation |
CN117831138B (en) * | 2024-03-05 | 2024-05-24 | 天津科技大学 | Multi-mode biological feature recognition method based on third-order knowledge distillation |
Similar Documents
Publication | Publication Date | Title |
---|---|---|
CN114677304A (en) | Image deblurring algorithm based on knowledge distillation and deep neural network | |
CN110120011B (en) | Video super-resolution method based on convolutional neural network and mixed resolution | |
WO2021056969A1 (en) | Super-resolution image reconstruction method and device | |
CN111192200A (en) | Image super-resolution reconstruction method based on fusion attention mechanism residual error network | |
CN110634105B (en) | Video high-space-time resolution signal processing method combining optical flow method and depth network | |
CN113052764B (en) | Video sequence super-resolution reconstruction method based on residual connection | |
CN117058160B (en) | Three-dimensional medical image segmentation method and system based on self-adaptive feature fusion network | |
CN112699844A (en) | Image super-resolution method based on multi-scale residual error level dense connection network | |
CN115239591A (en) | Image processing method, image processing apparatus, electronic device, storage medium, and program product | |
CN112884650A (en) | Image mixing super-resolution method based on self-adaptive texture distillation | |
CN115526779A (en) | Infrared image super-resolution reconstruction method based on dynamic attention mechanism | |
CN115330631A (en) | Multi-scale fusion defogging method based on stacked hourglass network | |
CN115293966A (en) | Face image reconstruction method and device and storage medium | |
CN114936977A (en) | Image deblurring method based on channel attention and cross-scale feature fusion | |
CN113393435B (en) | Video saliency detection method based on dynamic context sensing filter network | |
CN114022356A (en) | River course flow water level remote sensing image super-resolution method and system based on wavelet domain | |
CN113850721A (en) | Single image super-resolution reconstruction method, device and equipment and readable storage medium | |
CN117391920A (en) | High-capacity steganography method and system based on RGB channel differential plane | |
CN113949880B (en) | Extremely-low-bit-rate man-machine collaborative image coding training method and coding and decoding method | |
CN115797181A (en) | Image super-resolution reconstruction method for mine fuzzy environment | |
US20230186608A1 (en) | Method, device, and computer program product for video processing | |
CN115689918A (en) | Parallel single image rain removing method based on residual error prior attention mechanism | |
CN114219738A (en) | Single-image multi-scale super-resolution reconstruction network structure and method | |
CN110648291B (en) | Unmanned aerial vehicle motion blurred image restoration method based on deep learning | |
CN113674154A (en) | Single image super-resolution reconstruction method and system based on generation countermeasure network |
Legal Events
Date | Code | Title | Description |
---|---|---|---|
PB01 | Publication | ||
PB01 | Publication | ||
SE01 | Entry into force of request for substantive examination | ||
SE01 | Entry into force of request for substantive examination |