WO2020103171A1 - Bi-level optimization method for image deblurring - Google Patents
Bi-level optimization method for image deblurringInfo
- Publication number
- WO2020103171A1 WO2020103171A1 PCT/CN2018/117635 CN2018117635W WO2020103171A1 WO 2020103171 A1 WO2020103171 A1 WO 2020103171A1 CN 2018117635 W CN2018117635 W CN 2018117635W WO 2020103171 A1 WO2020103171 A1 WO 2020103171A1
- Authority
- WO
- WIPO (PCT)
- Prior art keywords
- image
- loss
- level
- model
- conv
- Prior art date
Links
- 238000000034 method Methods 0.000 title claims abstract description 32
- 238000005457 optimization Methods 0.000 title claims abstract description 27
- 238000012549 training Methods 0.000 claims abstract description 6
- 230000002452 interceptive effect Effects 0.000 claims abstract description 3
- 230000008569 process Effects 0.000 claims description 10
- 238000013461 design Methods 0.000 claims description 5
- 230000006835 compression Effects 0.000 claims description 4
- 238000007906 compression Methods 0.000 claims description 4
- 238000002474 experimental method Methods 0.000 claims description 2
- 239000000284 extract Substances 0.000 claims description 2
- 230000001105 regulatory effect Effects 0.000 claims description 2
- 230000003993 interaction Effects 0.000 claims 1
- 230000000694 effects Effects 0.000 abstract description 15
- 230000006870 function Effects 0.000 abstract description 14
- 230000001629 suppression Effects 0.000 abstract 1
- 238000013135 deep learning Methods 0.000 description 8
- 238000010586 diagram Methods 0.000 description 7
- 230000004438 eyesight Effects 0.000 description 3
- 238000011084 recovery Methods 0.000 description 3
- 238000013527 convolutional neural network Methods 0.000 description 2
- 238000005516 engineering process Methods 0.000 description 2
- 238000012545 processing Methods 0.000 description 2
- 230000003321 amplification Effects 0.000 description 1
- 238000004458 analytical method Methods 0.000 description 1
- 230000006399 behavior Effects 0.000 description 1
- 230000008859 change Effects 0.000 description 1
- 238000001514 detection method Methods 0.000 description 1
- 238000009499 grossing Methods 0.000 description 1
- 238000003384 imaging method Methods 0.000 description 1
- 230000006872 improvement Effects 0.000 description 1
- 238000012986 modification Methods 0.000 description 1
- 230000004048 modification Effects 0.000 description 1
- 238000003199 nucleic acid amplification method Methods 0.000 description 1
- 238000003909 pattern recognition Methods 0.000 description 1
- 238000007781 pre-processing Methods 0.000 description 1
- 230000008439 repair process Effects 0.000 description 1
- 238000011160 research Methods 0.000 description 1
- 230000011218 segmentation Effects 0.000 description 1
- 238000012360 testing method Methods 0.000 description 1
- 230000016776 visual perception Effects 0.000 description 1
Images
Classifications
-
- G—PHYSICS
- G06—COMPUTING; CALCULATING OR COUNTING
- G06T—IMAGE DATA PROCESSING OR GENERATION, IN GENERAL
- G06T5/00—Image enhancement or restoration
- G06T5/73—Deblurring; Sharpening
-
- G—PHYSICS
- G06—COMPUTING; CALCULATING OR COUNTING
- G06T—IMAGE DATA PROCESSING OR GENERATION, IN GENERAL
- G06T5/00—Image enhancement or restoration
- G06T5/70—Denoising; Smoothing
-
- G—PHYSICS
- G06—COMPUTING; CALCULATING OR COUNTING
- G06T—IMAGE DATA PROCESSING OR GENERATION, IN GENERAL
- G06T5/00—Image enhancement or restoration
- G06T5/77—Retouching; Inpainting; Scratch removal
Definitions
- the invention relates to the field of digital image processing, in particular to a Bi-Level optimization method for image deblurring.
- This method proposes a Bi-Level optimization method during the restoration of a blurred image.
- Deblurring technology is the subject of extensive research in the field of image and video processing. To a certain extent, blurring caused by camera shake seriously affects the imaging quality and visual perception of images. As an important branch of image preprocessing, the improvement of deblurring technology directly affects the performance of other computer vision algorithms, such as foreground segmentation, object detection, behavior analysis, etc. At the same time, it also affects the image coding performance. Therefore, it is imperative to study a high-performance deblurring algorithm.
- Documents 1-3 are the background information of the deep learning deblurring algorithm compared in the present invention: Document 1: Kupyn O, Budzan V, Mykhailych M, et al. DeblurGAN: Blind Motion Deblurring Using Conditional Adversarial Networks [J]. 1711.07064, 2017. Reference 2: Nah S, Kim T, H, Lee K, M. Deep multi-scale convolutional neural network for dynamic scene deblurring [C] // CVPR.2017, 1 (2): 3. Reference 3: Sun J, Cao W, Xu Z, et al. Learning. A convolutional neural network for non-uniform motion blur removal [C] // Proceedings of the IEEE conference Computer on Vision Vision and Pattern Recognition. 2015: 769-777.
- image deblurring algorithms can be divided into traditional algorithms based on probability models and deblurring algorithms based on deep learning.
- the traditional algorithm uses a convolution model to explain the cause of blur.
- the process of camera shake can be mapped to a blur kernel trajectory PSF (Point Spread Function).
- PSF Point Spread Function
- the problem of restoring a clear image when the blur kernel is unknown is an ill-posed problem, so it is usually necessary to estimate the blur kernel first, and then use the evaluated blur kernel to perform the deconvolution operation to obtain the restored image.
- the deep learning-based deblurring algorithm uses the deep network structure to obtain the latent information of the image, and then realize the blurred image restoration.
- the purpose of the present invention is to propose a Bi-Level optimization method for image deblurring.
- the Bi-Level optimization method is to optimize GAN (Generative Adversarial Network), aiming to solve the shortcomings of existing deep learning deblurring algorithms .
- GAN Geneative Adversarial Network
- the present invention proposes a Bi-Level optimization method.
- the design allows two levels of loss functions to alternate with each other for model optimization. Considering that the L 1 loss and the MSE loss satisfy the same relationship, L1 will introduce more noise but can retain more texture than MSE, and the perceived loss has an excessive smoothing effect.
- three kinds of losses are optimized. Introduced at the same time. At the same time, in order to balance the role of various condition losses in the optimization process, the present invention normalizes these losses to the same magnitude according to (Equation 4).
- the Bi-Level optimization method is divided into two steps. In the first step, a basic model is trained with MSE loss conditions, and in the second step, a two-level loss interactive iteration is used to fine-tune the model.
- the texture restoration is accurate.
- the similarity of image pixel values can ensure the similarity of depth features, but the contrary is not true; that is, the use of feature-level perceptual loss function may cause image detail information repair failure.
- the depth feature matches the pixel value. Considering that the pixel level MSE and L1 often cause noise amplification, and the perceived loss can effectively suppress the noise. By combining the above loss functions, the invention patent can guarantee to a certain extent. Pixel texture recovery is accurate, and it can ensure that the depth feature matches the pixel value.
- Figure 1 the process of generating an adversarial network
- Figure 5a is a comparison diagram of blurred images
- Figure 5b is a comparison diagram of the recovery effects of Nah et al.
- Figure 5c is a comparison diagram of the recovery effects of Kupyn et al.
- Figure 5d is a comparison graph of the restoration effect of the Bi-Level optimization method.
- Figure 1 generates the adversarial network process
- Figure 4 generator design Bi-Skip-Net + residual
- Table 1 is the discriminator parameter table, as shown in the figure
- the blurred image is input to the generator to obtain the restored image; after that, the restored image and the clear image are input to the discriminator to distinguish the clear image.
- the present invention uses Bi-Skip-Net to train the image residuals, and uses the blurred image + image residuals mode to restore the image.
- ⁇ is the weight of the conditional loss function.
- L and S respectively represent the output and true value of the model at different levels, and the value of ⁇ is 1 or 2, the entire conditional loss function is regulated by the number of channels c, width w and height h.
- the Li-Level optimization method of the present invention includes two steps (the number of epochs in the training process is N).
- Step1 When the number of iterations is less than 1 / 3N, the present invention adopts pixel level mean square error (MSE) as the loss function to train the model;
- MSE mean square error
- Step2 When the number of iterations is greater than or equal to 1 / 3N, the present invention uses a pixel-level L1 loss function and a feature-level perceptual loss function to alternately train the model. The present invention performs loss function replacement every two iterations during the experiment.
- FIG 3 is the structure diagram of Bi-Skip-Net.
- Bi-Skip-Net consists of a compression path (D *), a double-span connection path (S *) and an expansion path (U *) 3 parts.
- the compression path extracts the depth and shallow features of the image;
- the double-span connection path connects the image features with the upsampling features in the expansion path;
- the expansion path implements feature upsampling.
- Fig. 5 is a subjective comparison between the present invention and other algorithms.
- Fig. 5a is a comparison of blurred images
- Fig. 5b is a comparison of restoration effects of Nah et al .
- Fig. 5c is a comparison of restoration effects of Kupyn et al .
- Fig. 5d is Bi -A comparison diagram of the restoration effect of the Level optimization method, that is, a comparison diagram of the restoration effect of the present invention.
- Figure 5a, Figure 5b, Figure 5c, and Figure 5d are three identical photos, and two points are specifically selected with two boxes in each picture. The enlarged views of these two boxes are correspondingly placed in this Below the picture, you can see the restoration effect clearly.
- the comparison results are shown in Table 2.
- the de-blurring algorithm based on deep learning of the present invention utilizes the deep network structure to obtain the latent information of the image, and then realizes the restoration of the blurred image.
- the deep learning deblurring algorithm can realize two operations of fuzzy kernel estimation and non-blind deconvolution to restore the image, and it can also use the generational confrontation method to restore the image.
Landscapes
- Physics & Mathematics (AREA)
- General Physics & Mathematics (AREA)
- Engineering & Computer Science (AREA)
- Theoretical Computer Science (AREA)
- Image Processing (AREA)
- Image Analysis (AREA)
Abstract
A Bi-Level optimization method for image deblurring, which is designed to enable two levels of loss functions to alternative perform model optimization. The Bi-Level optimization mechanism is divided into two steps. In the first step, a basic model is trained using MSE loss conditions, and in the second step, two-level loss interactive iteration is adopted to perform fine-tuning on the model. At the initial stage of training, the divergence between a restoration effect and a clear image is relatively large, the effect of noise can be ignored, while during the later stage of training, the noise is continuously amplified and the negative effect becomes more obvious. Therefore, the perceptual loss is introduced for noise suppression, and in addition, the MSE loss is changed to L 1 loss to ensure structural continuity. The present method can achieve accurate texture restoration and matching of depth features with pixel values.
Description
本发明涉及数字图像处理领域,具体为用于图像去模糊的Bi-Level优化方法,该方法在模糊图像复原过程中提出了一种Bi-Level优化方法。The invention relates to the field of digital image processing, in particular to a Bi-Level optimization method for image deblurring. This method proposes a Bi-Level optimization method during the restoration of a blurred image.
去模糊技术是图像和视频处理领域被广泛研究的主题。基于相机抖动造成的模糊在一定意义上严重影响图像的成像质量,视觉观感。作为图像预处理领域一个及其重要的分支,去模糊技术的提升直接影响其他计算机视觉算法的性能,如前景分割,物体检测,行为分析等;同时它也影响着图像的编码性能。因此,研究一种高性能的去模糊算法势在必行。Deblurring technology is the subject of extensive research in the field of image and video processing. To a certain extent, blurring caused by camera shake seriously affects the imaging quality and visual perception of images. As an important branch of image preprocessing, the improvement of deblurring technology directly affects the performance of other computer vision algorithms, such as foreground segmentation, object detection, behavior analysis, etc. At the same time, it also affects the image coding performance. Therefore, it is imperative to study a high-performance deblurring algorithm.
文献1-3是本发明对比的深度学习去模糊算法的背景资料:文献1:Kupyn O,Budzan V,Mykhailych M,et al.DeblurGAN:Blind Motion Deblurring Using Conditional Adversarial Networks[J].arXiv preprint arXiv:1711.07064,2017。文献2:Nah S,Kim T H,Lee K M.Deep multi-scale convolutional neural network for dynamic scene deblurring[C]//CVPR.2017,1(2):3。文献3:Sun J,Cao W,Xu Z,et al.Learning a convolutional neural network for non-uniform motion blur removal[C]//Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition.2015:769-777。Documents 1-3 are the background information of the deep learning deblurring algorithm compared in the present invention: Document 1: Kupyn O, Budzan V, Mykhailych M, et al. DeblurGAN: Blind Motion Deblurring Using Conditional Adversarial Networks [J]. 1711.07064, 2017. Reference 2: Nah S, Kim T, H, Lee K, M. Deep multi-scale convolutional neural network for dynamic scene deblurring [C] // CVPR.2017, 1 (2): 3. Reference 3: Sun J, Cao W, Xu Z, et al. Learning. A convolutional neural network for non-uniform motion blur removal [C] // Proceedings of the IEEE conference Computer on Vision Vision and Pattern Recognition. 2015: 769-777.
一般来说,图像去模糊算法可以分为基于概率模型的传统算法和基于深度学习的去模糊算法。传统算法采用卷积模型来解释模糊成因,相机抖动的过程可以映射为模糊核轨迹PSF(Point Spread Function)。在模糊核未知的情况下还原清晰图像,这一问题属于不适定(ill-posed)问题,所以通常意义上需要先估计模糊核,再利用评估的模糊核进行返卷积操作得到复原图像。基于深度学习的去模糊算法则利用深层网络结构获取图像的潜在信息,进而实现模糊图像复原。深度学习的去模糊算法可以实现模糊核估计和非盲反卷积两个操作来进行图像复原,同时也可以采用生 成对抗方法来复原图像。本专利旨在解决深度学习去模糊算法存在的缺点:In general, image deblurring algorithms can be divided into traditional algorithms based on probability models and deblurring algorithms based on deep learning. The traditional algorithm uses a convolution model to explain the cause of blur. The process of camera shake can be mapped to a blur kernel trajectory PSF (Point Spread Function). The problem of restoring a clear image when the blur kernel is unknown is an ill-posed problem, so it is usually necessary to estimate the blur kernel first, and then use the evaluated blur kernel to perform the deconvolution operation to obtain the restored image. The deep learning-based deblurring algorithm uses the deep network structure to obtain the latent information of the image, and then realize the blurred image restoration. The deep learning deblurring algorithm can realize two operations of fuzzy kernel estimation and non-blind deconvolution to restore the image, and it can also use the generated confrontation method to restore the image. This patent aims to solve the shortcomings of the deep learning deblurring algorithm:
1)纹理恢复不准确,1) Texture restoration is not accurate,
2)深度特征与像素值不匹配问题。2) The problem that the depth feature does not match the pixel value.
发明的公开Disclosure of invention
本发明的目的在于:提出了用于图像去模糊的Bi-Level优化方法,该Bi-Level优化方法来对GAN(Generative Adversarial Network)进行优化,旨在解决现有深度学习去模糊算法存在的缺点。通过对比现有最优算法,本发明在图像复图像原性能上平均提升了1.3dB。The purpose of the present invention is to propose a Bi-Level optimization method for image deblurring. The Bi-Level optimization method is to optimize GAN (Generative Adversarial Network), aiming to solve the shortcomings of existing deep learning deblurring algorithms . By comparing the existing optimal algorithms, the present invention improves the original performance of the image complex image by 1.3 dB on average.
本发明提供的技术方案如下:The technical solutions provided by the present invention are as follows:
MSE损失可以保证优化过程在像素层级和特征层级上的同一性,但带来的问题是引入了大量的噪声;而感知损失在一定程度上是一种不错的替代方案,但却无法保证优化的同一性。为了解决这个条件损失不同一(non-identical)问题,同时降低优化的复杂度,本发明提出了一种Bi-Level优化方法。MSE loss can guarantee the identity of the optimization process at the pixel level and the feature level, but the problem is that a lot of noise is introduced; and the perceived loss is a good alternative to a certain extent, but it cannot guarantee the optimization Identity. In order to solve the problem of non-identical loss of condition and reduce the complexity of optimization, the present invention proposes a Bi-Level optimization method.
具体来说,设计让两种层级损失函数相互交替进行模型优化。考虑到L
1损失与MSE损失满足同一关系,L1相比MSE会引入更多的噪声但可以保留更多的纹理,而感知损失又有过度的平滑作用,本发明在优化过程中将三种损失同时引入。同时为了平衡各条件损失在优化过程中的作用,本发明将这些损失按照(式4)归一到同一量级。如图2所示,Bi-Level优化方法分为两个步骤,第一步用MSE损失条件训练出一个基本模型,第二步采用双层级损失交互迭代进行模型微调操作。这是因为在训练初期,复原效果与清晰图像之间的散度比较大,噪声的作用可以忽略不计,而在训练后期噪声被不断放大使其负面作用愈加明显,因此引入感知损失进行噪声抑制同时将MSE损失改为L
1损失以足够保证结构连续性。
Specifically, the design allows two levels of loss functions to alternate with each other for model optimization. Considering that the L 1 loss and the MSE loss satisfy the same relationship, L1 will introduce more noise but can retain more texture than MSE, and the perceived loss has an excessive smoothing effect. In the present invention, three kinds of losses are optimized. Introduced at the same time. At the same time, in order to balance the role of various condition losses in the optimization process, the present invention normalizes these losses to the same magnitude according to (Equation 4). As shown in Figure 2, the Bi-Level optimization method is divided into two steps. In the first step, a basic model is trained with MSE loss conditions, and in the second step, a two-level loss interactive iteration is used to fine-tune the model. This is because in the early stage of training, the divergence between the restoration effect and the clear image is relatively large, and the role of noise can be ignored. However, in the late stage of training, the noise is continuously amplified to make its negative effect more obvious. Change MSE loss to L 1 loss to ensure structural continuity.
本发明具有如下技术效果:The present invention has the following technical effects:
1、纹理恢复准确,在图像复原过程中,图像像素值的相似性可以保证深度特征的相似,反之却并不成立;即采用特征级的感知损失函数可能造成图像细节信息修 复失败。1. The texture restoration is accurate. In the process of image restoration, the similarity of image pixel values can ensure the similarity of depth features, but the contrary is not true; that is, the use of feature-level perceptual loss function may cause image detail information repair failure.
2、深度特征与像素值匹配,考虑到像素级的MSE、L1经常会造成噪声放大,而感知损失又可以有效抑制噪声,通过将以上损失函数进行结合,本发明专利在一定程度上既可以保证像素纹理回复准确,又可以保证深度特征与像素值匹配。2. The depth feature matches the pixel value. Considering that the pixel level MSE and L1 often cause noise amplification, and the perceived loss can effectively suppress the noise. By combining the above loss functions, the invention patent can guarantee to a certain extent. Pixel texture recovery is accurate, and it can ensure that the depth feature matches the pixel value.
附图的简要说明Brief description of the drawings
图1生成对抗网络流程;Figure 1 the process of generating an adversarial network;
图2 Bi-Level优化流程;Figure 2 Bi-Level optimization process;
图3 Bi-Skip-Net结构图;Figure 3 Bi-Skip-Net structure diagram;
图4生成器设计:Bi-Skip-Net+残差;Figure 4 generator design: Bi-Skip-Net + residual;
图5本发明与其它算法的主观对比,其中:Figure 5 Subjective comparison of the present invention with other algorithms, in which:
图5a为模糊图像对比图;Figure 5a is a comparison diagram of blurred images;
图5b为Nah等人的复原效果对比图;Figure 5b is a comparison diagram of the recovery effects of Nah et al;
图5c为Kupyn等人的复原效果对比图;Figure 5c is a comparison diagram of the recovery effects of Kupyn et al;
图5d为Bi-Level优化方法的复原效果对比图。Figure 5d is a comparison graph of the restoration effect of the Bi-Level optimization method.
实现本发明的最佳方式Best way to implement the invention
下面结合附图和实施例,对本发明进行详细的描述,但不以任何方式限制本发明的范围。The following describes the present invention in detail with reference to the drawings and embodiments, but does not limit the scope of the present invention in any way.
图1生成对抗网络流程,图4生成器设计:Bi-Skip-Net+残差,表1为判别器参数表,如图所示,Figure 1 generates the adversarial network process, Figure 4 generator design: Bi-Skip-Net + residual, Table 1 is the discriminator parameter table, as shown in the figure,
表1.判别器参数表Table 1. Discriminator parameter table
本发明用于图像去模糊的Bi-Level优化方法的具体步骤如下:The specific steps of the Bi-Level optimization method for image deblurring of the present invention are as follows:
(1)采用生成对抗网络流程来进行模糊图像复原,采用图4和表1来设计生成器和判别器;(1) The process of generating confrontation network is used to restore the blurred image, and the generator and discriminator are designed using Figure 4 and Table 1;
(2)如图1所示,将模糊图像输入到生成器得到复原图像;之后,将复原图像和清晰图像输入到判别器进行区分得到清晰图像。如图4所示,本发明用Bi-Skip-Net来训练图像残差,采用模糊图像+图像残差的模式来复原图像。(2) As shown in FIG. 1, the blurred image is input to the generator to obtain the restored image; after that, the restored image and the clear image are input to the discriminator to distinguish the clear image. As shown in FIG. 4, the present invention uses Bi-Skip-Net to train the image residuals, and uses the blurred image + image residuals mode to restore the image.
(3)采用如下的损失函数来训练网络;(3) Use the following loss function to train the network;
其中
为对抗损失函数,
为条件损失函数,λ为条件损失函数的权重。
among them To fight the loss function, For the conditional loss function, λ is the weight of the conditional loss function.
通过最小化式3来优化生成器G;Optimize generator G by minimizing Equation 3;
其中,L,S分别表示模型在不同层级的输出和真值,α取值为1或2,整个条件损失函数被通道数c,宽度w和高度h所规范。Among them, L and S respectively represent the output and true value of the model at different levels, and the value of α is 1 or 2, the entire conditional loss function is regulated by the number of channels c, width w and height h.
(4)采用图2所示的Li-Level优化方法来对网络进行优化;(4) Use the Li-Level optimization method shown in Figure 2 to optimize the network;
本发明的Li-Level优化方法包括两个步骤(设训练过程中epoch次数为N)。The Li-Level optimization method of the present invention includes two steps (the number of epochs in the training process is N).
Step1:当迭代次数小于1/3N,本发明采用像素级均方误差(MSE)作为损失函数训练模型;Step1: When the number of iterations is less than 1 / 3N, the present invention adopts pixel level mean square error (MSE) as the loss function to train the model;
Step2:当迭代次数大于等于1/3N,本发明采用像素级的L1损失函数和特征级的感知损失函数来交替训练模型,本发明在实验过程中每隔2此迭代进行一次损失函数替换。Step2: When the number of iterations is greater than or equal to 1 / 3N, the present invention uses a pixel-level L1 loss function and a feature-level perceptual loss function to alternately train the model. The present invention performs loss function replacement every two iterations during the experiment.
(5)将训练好的网络作为最终的复原模型。(5) Take the trained network as the final restoration model.
图3为Bi-Skip-Net结构图,如图3所示,如图所示,Bi-Skip-Net由压缩路径(D*)、双跨连接路径(S*)和扩张路径(U*)3部分组成。压缩路径提取图像的深度特征和浅层特征;双跨连接路径连接图像特征与扩张路径中的上采样特征;扩张路径实现特征上采样。Figure 3 is the structure diagram of Bi-Skip-Net. As shown in Figure 3, as shown in the figure, Bi-Skip-Net consists of a compression path (D *), a double-span connection path (S *) and an expansion path (U *) 3 parts. The compression path extracts the depth and shallow features of the image; the double-span connection path connects the image features with the upsampling features in the expansion path; the expansion path implements feature upsampling.
图5为本发明与其它算法的主观对比,其中,图5a为模糊图像对比图;图5b为Nah等人的复原效果对比图;图5c为Kupyn等人的复原效果对比图;图5d为Bi-Level优化方法的复原效果对比图,即本发明的复原效果对比图。其中,图5a、图5b、图5c和图5d都是三张相同的照片,每张图片中用两个方框特别选取了两个点,这两个方框的放大图对应地放在该图片的下方,可以看清复原效果。其对比结果见表2本发明与其它算法在GoPro数据集上的测试对比。Fig. 5 is a subjective comparison between the present invention and other algorithms. Among them, Fig. 5a is a comparison of blurred images; Fig. 5b is a comparison of restoration effects of Nah et al .; Fig. 5c is a comparison of restoration effects of Kupyn et al .; Fig. 5d is Bi -A comparison diagram of the restoration effect of the Level optimization method, that is, a comparison diagram of the restoration effect of the present invention. Among them, Figure 5a, Figure 5b, Figure 5c, and Figure 5d are three identical photos, and two points are specifically selected with two boxes in each picture. The enlarged views of these two boxes are correspondingly placed in this Below the picture, you can see the restoration effect clearly. The comparison results are shown in Table 2. The test comparison between the present invention and other algorithms on the GoPro data set.
表2.本发明与其它算法在GoPro数据集上的测试对比Table 2. Comparison between the present invention and other algorithms on GoPro dataset
需要注意的是,公布实施例的目的在于帮助进一步理解本发明,但是本领域的技术人员可以理解:在不脱离本发明及所附权利要求的精神和范围内,各种替换和 修改都是可能的。因此,本发明不应局限于实施例所公开的内容,本发明要求保护的范围以权利要求书界定的范围为准。It should be noted that the purpose of the disclosed embodiments is to help further understand the present invention, but those skilled in the art can understand that various replacements and modifications are possible without departing from the spirit and scope of the present invention and the appended claims of. Therefore, the present invention should not be limited to the contents disclosed in the embodiments, and the scope of protection claimed by the present invention is subject to the scope defined by the claims.
本发明基于深度学习的去模糊算法则利用深层网络结构获取图像的潜在信息,进而实现模糊图像复原。深度学习的去模糊算法可以实现模糊核估计和非盲反卷积两个操作来进行图像复原,同时也可以采用生成对抗方法来复原图像。The de-blurring algorithm based on deep learning of the present invention utilizes the deep network structure to obtain the latent information of the image, and then realizes the restoration of the blurred image. The deep learning deblurring algorithm can realize two operations of fuzzy kernel estimation and non-blind deconvolution to restore the image, and it can also use the generational confrontation method to restore the image.
Claims (4)
- 用于图像去模糊的Bi-Level优化方法,具体步骤是:Bi-Level optimization method for image deblurring, the specific steps are:第一步,用MSE损失条件训练出一个基本模型;The first step is to train a basic model with MSE loss conditions;第二步,采用双层级损失交互迭代进行模型微调操作。The second step is to use two-level loss interaction iteration to fine-tune the model.
- 根据权利要求1所述的用于图像去模糊的Bi-Level优化方法,其特征在于:The Bi-Level optimization method for image deblurring according to claim 1, characterized in that:(1)所述的生成器按以下方式设计:Bi-Skip-Net+残差,判别器参数依据表1:(1) The generator is designed in the following way: Bi-Skip-Net + residual, the discriminator parameters are based on Table 1:表1.判别器参数表Table 1. Discriminator parameter table
### 层Floor 参数维度Parameter dimension 步长Step 11 convconv 32x3x5x532x3x5x5 22 22 convconv 64x32x5x564x32x5x5 11 33 convconv 64x64x5x564x64x5x5 22 44 convconv 128x64x5x5128x64x5x5 11 55 convconv 128x128x5x5128x128x5x5 44 66 convconv 256x128x5x5256x128x5x5 11 77 convconv 256x256x5x5256x256x5x5 44 88 convconv 512x256x5x5512x256x5x5 11 99 convconv 512x512x4x4512x512x4x4 44 1010 fcfc 512x1x1x1512x1x1x1 -- (2)用Bi-Skip-Net来训练图像残差,采用模糊图像+图像残差的模式来复原图像;(2) Use Bi-Skip-Net to train the image residuals, and use the blurred image + image residuals mode to restore the image;(3)用MSE损失条件训练出一个基本模型,采用如下的损失函数来训练网络;(3) Train a basic model with MSE loss conditions, and use the following loss function to train the network;其中 为对抗损失函数, 为条件损失函数,λ为条件损失函数的权重。 among them To fight the loss function, Is the conditional loss function, and λ is the weight of the conditional loss function.通过最小化式3来优化生成器G;Optimize generator G by minimizing Equation 3;其中,L,S分别表示模型在不同层级的输出和真值,α取值为1或2,整个条件损失函数被通道数c,宽度w和高度h所规范;Among them, L and S respectively represent the output and true value of the model at different levels, and the value of α is 1 or 2, the entire conditional loss function is regulated by the number of channels c, width w and height h;(4)采用双层级损失交互迭代进行模型微调操作,具体是采用Li-Level优化机制来对网络进行优化;(4) Use double-level loss interactive iteration to fine-tune the model, specifically using Li-Level optimization mechanism to optimize the network;(5)将训练好的网络作为最终的复原模型。(5) Take the trained network as the final restoration model. - 根据权利要求2所述的用于图像去模糊的Bi-Level优化方法,其特征在于:The Bi-Level optimization method for image deblurring according to claim 2, characterized in that:所述的Bi-Skip-Net是由压缩路径(D*)、双跨连接路径(S*)和扩张路径(U*)3部分组成;压缩路径提取图像的深度特征和浅层特征;双跨连接路径连接图像特征与扩张路径中的上采样特征;扩张路径实现特征上采样。The Bi-Skip-Net is composed of three parts: compression path (D *), double-span connection path (S *) and expansion path (U *); the compression path extracts the depth and shallow features of the image; double-span The connection path connects the image features with the upsampling features in the expansion path; the expansion path implements feature upsampling.
- 根据权利要求2所述的用于图像去模糊的Bi-Level优化方法,其特征在于:步骤(4)所述的Li-Level优化机制包括两个步骤(设训练过程中epoch次数为N),The Bi-Level optimization method for image deblurring according to claim 2, wherein the Li-Level optimization mechanism in step (4) includes two steps (set the number of epochs in the training process to N),Step1:当迭代次数小于1/3N,本发明采用像素级均方误差(MSE)作为损失函数训练模型;Step1: When the number of iterations is less than 1 / 3N, the present invention adopts pixel level mean square error (MSE) as the loss function to train the model;Step2:当迭代次数大于等于1/3N,本发明采用像素级的L1损失函数和特征级的感知损失函数来交替训练模型,本发明在实验过程中每隔2此迭代进行一次损失函数替换。Step2: When the number of iterations is greater than or equal to 1 / 3N, the present invention uses a pixel-level L1 loss function and a feature-level perceptual loss function to alternately train the model. The present invention performs loss function replacement every two iterations during the experiment.
Applications Claiming Priority (2)
Application Number | Priority Date | Filing Date | Title |
---|---|---|---|
CN201811389239.7A CN109544475A (en) | 2018-11-21 | 2018-11-21 | Bi-Level optimization method for image deblurring |
CN201811389239.7 | 2018-11-21 |
Publications (1)
Publication Number | Publication Date |
---|---|
WO2020103171A1 true WO2020103171A1 (en) | 2020-05-28 |
Family
ID=65848630
Family Applications (1)
Application Number | Title | Priority Date | Filing Date |
---|---|---|---|
PCT/CN2018/117635 WO2020103171A1 (en) | 2018-11-21 | 2018-11-27 | Bi-level optimization method for image deblurring |
Country Status (2)
Country | Link |
---|---|
CN (1) | CN109544475A (en) |
WO (1) | WO2020103171A1 (en) |
Cited By (11)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
CN111783545A (en) * | 2020-06-02 | 2020-10-16 | 山西潞安环保能源开发股份有限公司五阳煤矿 | Coal mine unmanned aerial vehicle image enhancement method based on generation of countermeasure network |
CN112634163A (en) * | 2020-12-29 | 2021-04-09 | 南京大学 | Method for removing image motion blur based on improved cycle generation countermeasure network |
CN112785527A (en) * | 2021-01-28 | 2021-05-11 | 太原巍昂科电子科技有限责任公司 | Image deblurring method based on ResNet network |
CN112837245A (en) * | 2021-03-16 | 2021-05-25 | 西北工业大学 | Dynamic scene deblurring method based on multi-mode fusion |
CN113947589A (en) * | 2021-10-26 | 2022-01-18 | 北京理工大学 | Missile-borne image deblurring method based on countermeasure generation network |
CN114049497A (en) * | 2021-11-07 | 2022-02-15 | 西北工业大学 | Image depth characterization method based on self-adaptive contrast learning |
CN114066750A (en) * | 2021-10-22 | 2022-02-18 | 西北工业大学 | Self-encoder deblurring method based on domain transformation |
CN114331913A (en) * | 2022-01-06 | 2022-04-12 | 福州大学 | Motion blurred image restoration method based on residual attention block |
CN114331859A (en) * | 2021-07-15 | 2022-04-12 | 西安科技大学 | Image deblurring method based on improved generation countermeasure network |
CN114511466A (en) * | 2022-02-21 | 2022-05-17 | 北京大学深圳研究生院 | Blind person face image restoration method based on generation of confrontation network prior |
CN114783020A (en) * | 2022-04-03 | 2022-07-22 | 南京邮电大学 | Dynamic human face recognition method based on novel counterstudy deblurring theory |
Families Citing this family (4)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
CN110175961B (en) * | 2019-05-22 | 2021-07-27 | 艾特城信息科技有限公司 | Reticulation removing method based on human face image segmentation countermeasure thought |
US11544558B2 (en) * | 2019-08-30 | 2023-01-03 | Nec Corporation | Continual learning of artificial intelligence systems based on bi-level optimization |
CN114612319A (en) * | 2022-03-03 | 2022-06-10 | 北京航空航天大学 | Image restoration supervised learning method based on double-layer optimization weight redistribution |
CN117197002B (en) * | 2023-11-07 | 2024-02-02 | 松立控股集团股份有限公司 | Image restoration method based on perception diffusion |
Citations (5)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
US20080253678A1 (en) * | 2007-04-10 | 2008-10-16 | Arcsoft, Inc. | Denoise method on image pyramid |
US20100020208A1 (en) * | 2008-07-24 | 2010-01-28 | Florida State University Research Foundation | Systems and methods for training an active random field for real-time image denoising |
CN106157268A (en) * | 2016-07-28 | 2016-11-23 | 浙江工业大学 | A kind of degraded image restored method based on the convex approximation of L0 |
CN106709875A (en) * | 2016-12-30 | 2017-05-24 | 北京工业大学 | Compressed low-resolution image restoration method based on combined deep network |
CN107507141A (en) * | 2017-08-07 | 2017-12-22 | 清华大学深圳研究生院 | A kind of image recovery method based on adaptive residual error neutral net |
Family Cites Families (3)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
CN106548159A (en) * | 2016-11-08 | 2017-03-29 | 中国科学院自动化研究所 | Reticulate pattern facial image recognition method and device based on full convolutional neural networks |
CN108665058B (en) * | 2018-04-11 | 2021-01-05 | 徐州工程学院 | Method for generating countermeasure network based on segment loss |
CN108550125B (en) * | 2018-04-17 | 2021-07-30 | 南京大学 | Optical distortion correction method based on deep learning |
-
2018
- 2018-11-21 CN CN201811389239.7A patent/CN109544475A/en active Pending
- 2018-11-27 WO PCT/CN2018/117635 patent/WO2020103171A1/en active Application Filing
Patent Citations (5)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
US20080253678A1 (en) * | 2007-04-10 | 2008-10-16 | Arcsoft, Inc. | Denoise method on image pyramid |
US20100020208A1 (en) * | 2008-07-24 | 2010-01-28 | Florida State University Research Foundation | Systems and methods for training an active random field for real-time image denoising |
CN106157268A (en) * | 2016-07-28 | 2016-11-23 | 浙江工业大学 | A kind of degraded image restored method based on the convex approximation of L0 |
CN106709875A (en) * | 2016-12-30 | 2017-05-24 | 北京工业大学 | Compressed low-resolution image restoration method based on combined deep network |
CN107507141A (en) * | 2017-08-07 | 2017-12-22 | 清华大学深圳研究生院 | A kind of image recovery method based on adaptive residual error neutral net |
Cited By (15)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
CN111783545A (en) * | 2020-06-02 | 2020-10-16 | 山西潞安环保能源开发股份有限公司五阳煤矿 | Coal mine unmanned aerial vehicle image enhancement method based on generation of countermeasure network |
CN112634163A (en) * | 2020-12-29 | 2021-04-09 | 南京大学 | Method for removing image motion blur based on improved cycle generation countermeasure network |
CN112785527A (en) * | 2021-01-28 | 2021-05-11 | 太原巍昂科电子科技有限责任公司 | Image deblurring method based on ResNet network |
CN112837245A (en) * | 2021-03-16 | 2021-05-25 | 西北工业大学 | Dynamic scene deblurring method based on multi-mode fusion |
CN112837245B (en) * | 2021-03-16 | 2024-05-31 | 西北工业大学 | Dynamic scene deblurring method based on multi-mode fusion |
CN114331859A (en) * | 2021-07-15 | 2022-04-12 | 西安科技大学 | Image deblurring method based on improved generation countermeasure network |
CN114066750A (en) * | 2021-10-22 | 2022-02-18 | 西北工业大学 | Self-encoder deblurring method based on domain transformation |
CN114066750B (en) * | 2021-10-22 | 2024-03-05 | 西北工业大学 | Self-encoder deblurring method based on domain transformation |
CN113947589A (en) * | 2021-10-26 | 2022-01-18 | 北京理工大学 | Missile-borne image deblurring method based on countermeasure generation network |
CN114049497A (en) * | 2021-11-07 | 2022-02-15 | 西北工业大学 | Image depth characterization method based on self-adaptive contrast learning |
CN114049497B (en) * | 2021-11-07 | 2024-03-05 | 西北工业大学 | Image depth characterization method based on self-adaptive contrast learning |
CN114331913A (en) * | 2022-01-06 | 2022-04-12 | 福州大学 | Motion blurred image restoration method based on residual attention block |
CN114511466A (en) * | 2022-02-21 | 2022-05-17 | 北京大学深圳研究生院 | Blind person face image restoration method based on generation of confrontation network prior |
CN114511466B (en) * | 2022-02-21 | 2024-04-26 | 北京大学深圳研究生院 | Blind face image restoration method based on generation of countermeasure network priori |
CN114783020A (en) * | 2022-04-03 | 2022-07-22 | 南京邮电大学 | Dynamic human face recognition method based on novel counterstudy deblurring theory |
Also Published As
Publication number | Publication date |
---|---|
CN109544475A (en) | 2019-03-29 |
Similar Documents
Publication | Publication Date | Title |
---|---|---|
WO2020103171A1 (en) | Bi-level optimization method for image deblurring | |
WO2021208122A1 (en) | Blind video denoising method and device based on deep learning | |
WO2020087607A1 (en) | Bi-skip-net-based image deblurring method | |
Zheng et al. | Adaptive unfolding total variation network for low-light image enhancement | |
CN110782399B (en) | Image deblurring method based on multitasking CNN | |
Ju et al. | Single image dehazing via an improved atmospheric scattering model | |
Zhang et al. | Idr: Self-supervised image denoising via iterative data refinement | |
KR101328741B1 (en) | Apparatus and method for image enhancement using color channel | |
TWI509567B (en) | Method and image processing apparatus for image visibility restoration | |
CN111091503B (en) | Image defocusing and blurring method based on deep learning | |
CN103761710B (en) | The blind deblurring method of efficient image based on edge self-adaption | |
CN105765607A (en) | Multiple phase method for image deconvolution | |
Chowdhury et al. | Fuzzy logic based filtering for image de-noising | |
TW201830330A (en) | Image processing method and image processing system | |
CN111161360A (en) | Retinex theory-based image defogging method for end-to-end network | |
Zhao et al. | Legacy photo editing with learned noise prior | |
Malik et al. | Llrnet: A multiscale subband learning approach for low light image restoration | |
Teranishi et al. | Improvement of robustness blind image restoration method using failing detection process | |
Arora et al. | Low light image enhancement via global and local context modeling | |
CN107451971A (en) | The blind convolved image restoring method of low-light (level) of priori is combined based on dark and Gauss | |
Kaur et al. | An improved adaptive bilateral filter to remove gaussian noise from color images | |
Xie et al. | DHD-Net: A novel deep-learning-based dehazing network | |
CN106780398B (en) | A kind of image de-noising method based on noise prediction | |
CN106897975B (en) | Image denoising method for hypercube particle calculation | |
CN106952243A (en) | UUV Layer Near The Sea Surface infrared image self adaptation merger histogram stretches Enhancement Method |
Legal Events
Date | Code | Title | Description |
---|---|---|---|
121 | Ep: the epo has been informed by wipo that ep was designated in this application |
Ref document number: 18941012 Country of ref document: EP Kind code of ref document: A1 |
|
NENP | Non-entry into the national phase |
Ref country code: DE |
|
122 | Ep: pct application non-entry in european phase |
Ref document number: 18941012 Country of ref document: EP Kind code of ref document: A1 |