WO2020103171A1 - Bi-level optimization method for image deblurring - Google Patents

Bi-level optimization method for image deblurring

Info

Publication number
WO2020103171A1
WO2020103171A1 PCT/CN2018/117635 CN2018117635W WO2020103171A1 WO 2020103171 A1 WO2020103171 A1 WO 2020103171A1 CN 2018117635 W CN2018117635 W CN 2018117635W WO 2020103171 A1 WO2020103171 A1 WO 2020103171A1
Authority
WO
WIPO (PCT)
Prior art keywords
image
loss
level
model
conv
Prior art date
Application number
PCT/CN2018/117635
Other languages
French (fr)
Chinese (zh)
Inventor
李革
张毅伟
王荣刚
王文敏
高文
Original Assignee
北京大学深圳研究生院
Priority date (The priority date is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the date listed.)
Filing date
Publication date
Application filed by 北京大学深圳研究生院 filed Critical 北京大学深圳研究生院
Publication of WO2020103171A1 publication Critical patent/WO2020103171A1/en

Links

Images

Classifications

    • G06T5/73
    • G06T5/70
    • G06T5/77

Definitions

  • the invention relates to the field of digital image processing, in particular to a Bi-Level optimization method for image deblurring.
  • This method proposes a Bi-Level optimization method during the restoration of a blurred image.
  • Deblurring technology is the subject of extensive research in the field of image and video processing. To a certain extent, blurring caused by camera shake seriously affects the imaging quality and visual perception of images. As an important branch of image preprocessing, the improvement of deblurring technology directly affects the performance of other computer vision algorithms, such as foreground segmentation, object detection, behavior analysis, etc. At the same time, it also affects the image coding performance. Therefore, it is imperative to study a high-performance deblurring algorithm.
  • Documents 1-3 are the background information of the deep learning deblurring algorithm compared in the present invention: Document 1: Kupyn O, Budzan V, Mykhailych M, et al. DeblurGAN: Blind Motion Deblurring Using Conditional Adversarial Networks [J]. 1711.07064, 2017. Reference 2: Nah S, Kim T, H, Lee K, M. Deep multi-scale convolutional neural network for dynamic scene deblurring [C] // CVPR.2017, 1 (2): 3. Reference 3: Sun J, Cao W, Xu Z, et al. Learning. A convolutional neural network for non-uniform motion blur removal [C] // Proceedings of the IEEE conference Computer on Vision Vision and Pattern Recognition. 2015: 769-777.
  • image deblurring algorithms can be divided into traditional algorithms based on probability models and deblurring algorithms based on deep learning.
  • the traditional algorithm uses a convolution model to explain the cause of blur.
  • the process of camera shake can be mapped to a blur kernel trajectory PSF (Point Spread Function).
  • PSF Point Spread Function
  • the problem of restoring a clear image when the blur kernel is unknown is an ill-posed problem, so it is usually necessary to estimate the blur kernel first, and then use the evaluated blur kernel to perform the deconvolution operation to obtain the restored image.
  • the deep learning-based deblurring algorithm uses the deep network structure to obtain the latent information of the image, and then realize the blurred image restoration.
  • the purpose of the present invention is to propose a Bi-Level optimization method for image deblurring.
  • the Bi-Level optimization method is to optimize GAN (Generative Adversarial Network), aiming to solve the shortcomings of existing deep learning deblurring algorithms .
  • GAN Geneative Adversarial Network
  • the present invention proposes a Bi-Level optimization method.
  • the design allows two levels of loss functions to alternate with each other for model optimization. Considering that the L 1 loss and the MSE loss satisfy the same relationship, L1 will introduce more noise but can retain more texture than MSE, and the perceived loss has an excessive smoothing effect.
  • three kinds of losses are optimized. Introduced at the same time. At the same time, in order to balance the role of various condition losses in the optimization process, the present invention normalizes these losses to the same magnitude according to (Equation 4).
  • the Bi-Level optimization method is divided into two steps. In the first step, a basic model is trained with MSE loss conditions, and in the second step, a two-level loss interactive iteration is used to fine-tune the model.
  • the texture restoration is accurate.
  • the similarity of image pixel values can ensure the similarity of depth features, but the contrary is not true; that is, the use of feature-level perceptual loss function may cause image detail information repair failure.
  • the depth feature matches the pixel value. Considering that the pixel level MSE and L1 often cause noise amplification, and the perceived loss can effectively suppress the noise. By combining the above loss functions, the invention patent can guarantee to a certain extent. Pixel texture recovery is accurate, and it can ensure that the depth feature matches the pixel value.
  • Figure 1 the process of generating an adversarial network
  • Figure 5a is a comparison diagram of blurred images
  • Figure 5b is a comparison diagram of the recovery effects of Nah et al.
  • Figure 5c is a comparison diagram of the recovery effects of Kupyn et al.
  • Figure 5d is a comparison graph of the restoration effect of the Bi-Level optimization method.
  • Figure 1 generates the adversarial network process
  • Figure 4 generator design Bi-Skip-Net + residual
  • Table 1 is the discriminator parameter table, as shown in the figure
  • the blurred image is input to the generator to obtain the restored image; after that, the restored image and the clear image are input to the discriminator to distinguish the clear image.
  • the present invention uses Bi-Skip-Net to train the image residuals, and uses the blurred image + image residuals mode to restore the image.
  • is the weight of the conditional loss function.
  • L and S respectively represent the output and true value of the model at different levels, and the value of ⁇ is 1 or 2, the entire conditional loss function is regulated by the number of channels c, width w and height h.
  • the Li-Level optimization method of the present invention includes two steps (the number of epochs in the training process is N).
  • Step1 When the number of iterations is less than 1 / 3N, the present invention adopts pixel level mean square error (MSE) as the loss function to train the model;
  • MSE mean square error
  • Step2 When the number of iterations is greater than or equal to 1 / 3N, the present invention uses a pixel-level L1 loss function and a feature-level perceptual loss function to alternately train the model. The present invention performs loss function replacement every two iterations during the experiment.
  • FIG 3 is the structure diagram of Bi-Skip-Net.
  • Bi-Skip-Net consists of a compression path (D *), a double-span connection path (S *) and an expansion path (U *) 3 parts.
  • the compression path extracts the depth and shallow features of the image;
  • the double-span connection path connects the image features with the upsampling features in the expansion path;
  • the expansion path implements feature upsampling.
  • Fig. 5 is a subjective comparison between the present invention and other algorithms.
  • Fig. 5a is a comparison of blurred images
  • Fig. 5b is a comparison of restoration effects of Nah et al .
  • Fig. 5c is a comparison of restoration effects of Kupyn et al .
  • Fig. 5d is Bi -A comparison diagram of the restoration effect of the Level optimization method, that is, a comparison diagram of the restoration effect of the present invention.
  • Figure 5a, Figure 5b, Figure 5c, and Figure 5d are three identical photos, and two points are specifically selected with two boxes in each picture. The enlarged views of these two boxes are correspondingly placed in this Below the picture, you can see the restoration effect clearly.
  • the comparison results are shown in Table 2.
  • the de-blurring algorithm based on deep learning of the present invention utilizes the deep network structure to obtain the latent information of the image, and then realizes the restoration of the blurred image.
  • the deep learning deblurring algorithm can realize two operations of fuzzy kernel estimation and non-blind deconvolution to restore the image, and it can also use the generational confrontation method to restore the image.

Abstract

A Bi-Level optimization method for image deblurring, which is designed to enable two levels of loss functions to alternative perform model optimization. The Bi-Level optimization mechanism is divided into two steps. In the first step, a basic model is trained using MSE loss conditions, and in the second step, two-level loss interactive iteration is adopted to perform fine-tuning on the model. At the initial stage of training, the divergence between a restoration effect and a clear image is relatively large, the effect of noise can be ignored, while during the later stage of training, the noise is continuously amplified and the negative effect becomes more obvious. Therefore, the perceptual loss is introduced for noise suppression, and in addition, the MSE loss is changed to L 1 loss to ensure structural continuity. The present method can achieve accurate texture restoration and matching of depth features with pixel values.

Description

用于图像去模糊的Bi-Level优化方法Bi-Level optimization method for image deblurring 技术领域Technical field
本发明涉及数字图像处理领域,具体为用于图像去模糊的Bi-Level优化方法,该方法在模糊图像复原过程中提出了一种Bi-Level优化方法。The invention relates to the field of digital image processing, in particular to a Bi-Level optimization method for image deblurring. This method proposes a Bi-Level optimization method during the restoration of a blurred image.
背景技术Background technique
去模糊技术是图像和视频处理领域被广泛研究的主题。基于相机抖动造成的模糊在一定意义上严重影响图像的成像质量,视觉观感。作为图像预处理领域一个及其重要的分支,去模糊技术的提升直接影响其他计算机视觉算法的性能,如前景分割,物体检测,行为分析等;同时它也影响着图像的编码性能。因此,研究一种高性能的去模糊算法势在必行。Deblurring technology is the subject of extensive research in the field of image and video processing. To a certain extent, blurring caused by camera shake seriously affects the imaging quality and visual perception of images. As an important branch of image preprocessing, the improvement of deblurring technology directly affects the performance of other computer vision algorithms, such as foreground segmentation, object detection, behavior analysis, etc. At the same time, it also affects the image coding performance. Therefore, it is imperative to study a high-performance deblurring algorithm.
文献1-3是本发明对比的深度学习去模糊算法的背景资料:文献1:Kupyn O,Budzan V,Mykhailych M,et al.DeblurGAN:Blind Motion Deblurring Using Conditional Adversarial Networks[J].arXiv preprint arXiv:1711.07064,2017。文献2:Nah S,Kim T H,Lee K M.Deep multi-scale convolutional neural network for dynamic scene deblurring[C]//CVPR.2017,1(2):3。文献3:Sun J,Cao W,Xu Z,et al.Learning a convolutional neural network for non-uniform motion blur removal[C]//Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition.2015:769-777。Documents 1-3 are the background information of the deep learning deblurring algorithm compared in the present invention: Document 1: Kupyn O, Budzan V, Mykhailych M, et al. DeblurGAN: Blind Motion Deblurring Using Conditional Adversarial Networks [J]. 1711.07064, 2017. Reference 2: Nah S, Kim T, H, Lee K, M. Deep multi-scale convolutional neural network for dynamic scene deblurring [C] // CVPR.2017, 1 (2): 3. Reference 3: Sun J, Cao W, Xu Z, et al. Learning. A convolutional neural network for non-uniform motion blur removal [C] // Proceedings of the IEEE conference Computer on Vision Vision and Pattern Recognition. 2015: 769-777.
一般来说,图像去模糊算法可以分为基于概率模型的传统算法和基于深度学习的去模糊算法。传统算法采用卷积模型来解释模糊成因,相机抖动的过程可以映射为模糊核轨迹PSF(Point Spread Function)。在模糊核未知的情况下还原清晰图像,这一问题属于不适定(ill-posed)问题,所以通常意义上需要先估计模糊核,再利用评估的模糊核进行返卷积操作得到复原图像。基于深度学习的去模糊算法则利用深层网络结构获取图像的潜在信息,进而实现模糊图像复原。深度学习的去模糊算法可以实现模糊核估计和非盲反卷积两个操作来进行图像复原,同时也可以采用生 成对抗方法来复原图像。本专利旨在解决深度学习去模糊算法存在的缺点:In general, image deblurring algorithms can be divided into traditional algorithms based on probability models and deblurring algorithms based on deep learning. The traditional algorithm uses a convolution model to explain the cause of blur. The process of camera shake can be mapped to a blur kernel trajectory PSF (Point Spread Function). The problem of restoring a clear image when the blur kernel is unknown is an ill-posed problem, so it is usually necessary to estimate the blur kernel first, and then use the evaluated blur kernel to perform the deconvolution operation to obtain the restored image. The deep learning-based deblurring algorithm uses the deep network structure to obtain the latent information of the image, and then realize the blurred image restoration. The deep learning deblurring algorithm can realize two operations of fuzzy kernel estimation and non-blind deconvolution to restore the image, and it can also use the generated confrontation method to restore the image. This patent aims to solve the shortcomings of the deep learning deblurring algorithm:
1)纹理恢复不准确,1) Texture restoration is not accurate,
2)深度特征与像素值不匹配问题。2) The problem that the depth feature does not match the pixel value.
发明的公开Disclosure of invention
本发明的目的在于:提出了用于图像去模糊的Bi-Level优化方法,该Bi-Level优化方法来对GAN(Generative Adversarial Network)进行优化,旨在解决现有深度学习去模糊算法存在的缺点。通过对比现有最优算法,本发明在图像复图像原性能上平均提升了1.3dB。The purpose of the present invention is to propose a Bi-Level optimization method for image deblurring. The Bi-Level optimization method is to optimize GAN (Generative Adversarial Network), aiming to solve the shortcomings of existing deep learning deblurring algorithms . By comparing the existing optimal algorithms, the present invention improves the original performance of the image complex image by 1.3 dB on average.
本发明提供的技术方案如下:The technical solutions provided by the present invention are as follows:
MSE损失可以保证优化过程在像素层级和特征层级上的同一性,但带来的问题是引入了大量的噪声;而感知损失在一定程度上是一种不错的替代方案,但却无法保证优化的同一性。为了解决这个条件损失不同一(non-identical)问题,同时降低优化的复杂度,本发明提出了一种Bi-Level优化方法。MSE loss can guarantee the identity of the optimization process at the pixel level and the feature level, but the problem is that a lot of noise is introduced; and the perceived loss is a good alternative to a certain extent, but it cannot guarantee the optimization Identity. In order to solve the problem of non-identical loss of condition and reduce the complexity of optimization, the present invention proposes a Bi-Level optimization method.
具体来说,设计让两种层级损失函数相互交替进行模型优化。考虑到L 1损失与MSE损失满足同一关系,L1相比MSE会引入更多的噪声但可以保留更多的纹理,而感知损失又有过度的平滑作用,本发明在优化过程中将三种损失同时引入。同时为了平衡各条件损失在优化过程中的作用,本发明将这些损失按照(式4)归一到同一量级。如图2所示,Bi-Level优化方法分为两个步骤,第一步用MSE损失条件训练出一个基本模型,第二步采用双层级损失交互迭代进行模型微调操作。这是因为在训练初期,复原效果与清晰图像之间的散度比较大,噪声的作用可以忽略不计,而在训练后期噪声被不断放大使其负面作用愈加明显,因此引入感知损失进行噪声抑制同时将MSE损失改为L 1损失以足够保证结构连续性。 Specifically, the design allows two levels of loss functions to alternate with each other for model optimization. Considering that the L 1 loss and the MSE loss satisfy the same relationship, L1 will introduce more noise but can retain more texture than MSE, and the perceived loss has an excessive smoothing effect. In the present invention, three kinds of losses are optimized. Introduced at the same time. At the same time, in order to balance the role of various condition losses in the optimization process, the present invention normalizes these losses to the same magnitude according to (Equation 4). As shown in Figure 2, the Bi-Level optimization method is divided into two steps. In the first step, a basic model is trained with MSE loss conditions, and in the second step, a two-level loss interactive iteration is used to fine-tune the model. This is because in the early stage of training, the divergence between the restoration effect and the clear image is relatively large, and the role of noise can be ignored. However, in the late stage of training, the noise is continuously amplified to make its negative effect more obvious. Change MSE loss to L 1 loss to ensure structural continuity.
本发明具有如下技术效果:The present invention has the following technical effects:
1、纹理恢复准确,在图像复原过程中,图像像素值的相似性可以保证深度特征的相似,反之却并不成立;即采用特征级的感知损失函数可能造成图像细节信息修 复失败。1. The texture restoration is accurate. In the process of image restoration, the similarity of image pixel values can ensure the similarity of depth features, but the contrary is not true; that is, the use of feature-level perceptual loss function may cause image detail information repair failure.
2、深度特征与像素值匹配,考虑到像素级的MSE、L1经常会造成噪声放大,而感知损失又可以有效抑制噪声,通过将以上损失函数进行结合,本发明专利在一定程度上既可以保证像素纹理回复准确,又可以保证深度特征与像素值匹配。2. The depth feature matches the pixel value. Considering that the pixel level MSE and L1 often cause noise amplification, and the perceived loss can effectively suppress the noise. By combining the above loss functions, the invention patent can guarantee to a certain extent. Pixel texture recovery is accurate, and it can ensure that the depth feature matches the pixel value.
附图的简要说明Brief description of the drawings
图1生成对抗网络流程;Figure 1 the process of generating an adversarial network;
图2 Bi-Level优化流程;Figure 2 Bi-Level optimization process;
图3 Bi-Skip-Net结构图;Figure 3 Bi-Skip-Net structure diagram;
图4生成器设计:Bi-Skip-Net+残差;Figure 4 generator design: Bi-Skip-Net + residual;
图5本发明与其它算法的主观对比,其中:Figure 5 Subjective comparison of the present invention with other algorithms, in which:
图5a为模糊图像对比图;Figure 5a is a comparison diagram of blurred images;
图5b为Nah等人的复原效果对比图;Figure 5b is a comparison diagram of the recovery effects of Nah et al;
图5c为Kupyn等人的复原效果对比图;Figure 5c is a comparison diagram of the recovery effects of Kupyn et al;
图5d为Bi-Level优化方法的复原效果对比图。Figure 5d is a comparison graph of the restoration effect of the Bi-Level optimization method.
实现本发明的最佳方式Best way to implement the invention
下面结合附图和实施例,对本发明进行详细的描述,但不以任何方式限制本发明的范围。The following describes the present invention in detail with reference to the drawings and embodiments, but does not limit the scope of the present invention in any way.
图1生成对抗网络流程,图4生成器设计:Bi-Skip-Net+残差,表1为判别器参数表,如图所示,Figure 1 generates the adversarial network process, Figure 4 generator design: Bi-Skip-Net + residual, Table 1 is the discriminator parameter table, as shown in the figure,
表1.判别器参数表Table 1. Discriminator parameter table
Figure PCTCN2018117635-appb-000001
Figure PCTCN2018117635-appb-000001
Figure PCTCN2018117635-appb-000002
Figure PCTCN2018117635-appb-000002
本发明用于图像去模糊的Bi-Level优化方法的具体步骤如下:The specific steps of the Bi-Level optimization method for image deblurring of the present invention are as follows:
(1)采用生成对抗网络流程来进行模糊图像复原,采用图4和表1来设计生成器和判别器;(1) The process of generating confrontation network is used to restore the blurred image, and the generator and discriminator are designed using Figure 4 and Table 1;
(2)如图1所示,将模糊图像输入到生成器得到复原图像;之后,将复原图像和清晰图像输入到判别器进行区分得到清晰图像。如图4所示,本发明用Bi-Skip-Net来训练图像残差,采用模糊图像+图像残差的模式来复原图像。(2) As shown in FIG. 1, the blurred image is input to the generator to obtain the restored image; after that, the restored image and the clear image are input to the discriminator to distinguish the clear image. As shown in FIG. 4, the present invention uses Bi-Skip-Net to train the image residuals, and uses the blurred image + image residuals mode to restore the image.
(3)采用如下的损失函数来训练网络;(3) Use the following loss function to train the network;
Figure PCTCN2018117635-appb-000003
Figure PCTCN2018117635-appb-000003
其中
Figure PCTCN2018117635-appb-000004
为对抗损失函数,
Figure PCTCN2018117635-appb-000005
为条件损失函数,λ为条件损失函数的权重。
among them
Figure PCTCN2018117635-appb-000004
To fight the loss function,
Figure PCTCN2018117635-appb-000005
For the conditional loss function, λ is the weight of the conditional loss function.
Figure PCTCN2018117635-appb-000006
Figure PCTCN2018117635-appb-000006
通过最大化
Figure PCTCN2018117635-appb-000007
来优化判别器D;
By maximizing
Figure PCTCN2018117635-appb-000007
To optimize discriminator D;
Figure PCTCN2018117635-appb-000008
Figure PCTCN2018117635-appb-000008
通过最小化式3来优化生成器G;Optimize generator G by minimizing Equation 3;
其中
Figure PCTCN2018117635-appb-000009
设计如下:
among them
Figure PCTCN2018117635-appb-000009
The design is as follows:
Figure PCTCN2018117635-appb-000010
Figure PCTCN2018117635-appb-000010
其中,L,S分别表示模型在不同层级的输出和真值,α取值为1或2,整个条件损失函数被通道数c,宽度w和高度h所规范。Among them, L and S respectively represent the output and true value of the model at different levels, and the value of α is 1 or 2, the entire conditional loss function is regulated by the number of channels c, width w and height h.
(4)采用图2所示的Li-Level优化方法来对网络进行优化;(4) Use the Li-Level optimization method shown in Figure 2 to optimize the network;
本发明的Li-Level优化方法包括两个步骤(设训练过程中epoch次数为N)。The Li-Level optimization method of the present invention includes two steps (the number of epochs in the training process is N).
Step1:当迭代次数小于1/3N,本发明采用像素级均方误差(MSE)作为损失函数训练模型;Step1: When the number of iterations is less than 1 / 3N, the present invention adopts pixel level mean square error (MSE) as the loss function to train the model;
Step2:当迭代次数大于等于1/3N,本发明采用像素级的L1损失函数和特征级的感知损失函数来交替训练模型,本发明在实验过程中每隔2此迭代进行一次损失函数替换。Step2: When the number of iterations is greater than or equal to 1 / 3N, the present invention uses a pixel-level L1 loss function and a feature-level perceptual loss function to alternately train the model. The present invention performs loss function replacement every two iterations during the experiment.
(5)将训练好的网络作为最终的复原模型。(5) Take the trained network as the final restoration model.
图3为Bi-Skip-Net结构图,如图3所示,如图所示,Bi-Skip-Net由压缩路径(D*)、双跨连接路径(S*)和扩张路径(U*)3部分组成。压缩路径提取图像的深度特征和浅层特征;双跨连接路径连接图像特征与扩张路径中的上采样特征;扩张路径实现特征上采样。Figure 3 is the structure diagram of Bi-Skip-Net. As shown in Figure 3, as shown in the figure, Bi-Skip-Net consists of a compression path (D *), a double-span connection path (S *) and an expansion path (U *) 3 parts. The compression path extracts the depth and shallow features of the image; the double-span connection path connects the image features with the upsampling features in the expansion path; the expansion path implements feature upsampling.
图5为本发明与其它算法的主观对比,其中,图5a为模糊图像对比图;图5b为Nah等人的复原效果对比图;图5c为Kupyn等人的复原效果对比图;图5d为Bi-Level优化方法的复原效果对比图,即本发明的复原效果对比图。其中,图5a、图5b、图5c和图5d都是三张相同的照片,每张图片中用两个方框特别选取了两个点,这两个方框的放大图对应地放在该图片的下方,可以看清复原效果。其对比结果见表2本发明与其它算法在GoPro数据集上的测试对比。Fig. 5 is a subjective comparison between the present invention and other algorithms. Among them, Fig. 5a is a comparison of blurred images; Fig. 5b is a comparison of restoration effects of Nah et al .; Fig. 5c is a comparison of restoration effects of Kupyn et al .; Fig. 5d is Bi -A comparison diagram of the restoration effect of the Level optimization method, that is, a comparison diagram of the restoration effect of the present invention. Among them, Figure 5a, Figure 5b, Figure 5c, and Figure 5d are three identical photos, and two points are specifically selected with two boxes in each picture. The enlarged views of these two boxes are correspondingly placed in this Below the picture, you can see the restoration effect clearly. The comparison results are shown in Table 2. The test comparison between the present invention and other algorithms on the GoPro data set.
表2.本发明与其它算法在GoPro数据集上的测试对比Table 2. Comparison between the present invention and other algorithms on GoPro dataset
Figure PCTCN2018117635-appb-000011
Figure PCTCN2018117635-appb-000011
需要注意的是,公布实施例的目的在于帮助进一步理解本发明,但是本领域的技术人员可以理解:在不脱离本发明及所附权利要求的精神和范围内,各种替换和 修改都是可能的。因此,本发明不应局限于实施例所公开的内容,本发明要求保护的范围以权利要求书界定的范围为准。It should be noted that the purpose of the disclosed embodiments is to help further understand the present invention, but those skilled in the art can understand that various replacements and modifications are possible without departing from the spirit and scope of the present invention and the appended claims of. Therefore, the present invention should not be limited to the contents disclosed in the embodiments, and the scope of protection claimed by the present invention is subject to the scope defined by the claims.
工业实用性Industrial applicability
本发明基于深度学习的去模糊算法则利用深层网络结构获取图像的潜在信息,进而实现模糊图像复原。深度学习的去模糊算法可以实现模糊核估计和非盲反卷积两个操作来进行图像复原,同时也可以采用生成对抗方法来复原图像。The de-blurring algorithm based on deep learning of the present invention utilizes the deep network structure to obtain the latent information of the image, and then realizes the restoration of the blurred image. The deep learning deblurring algorithm can realize two operations of fuzzy kernel estimation and non-blind deconvolution to restore the image, and it can also use the generational confrontation method to restore the image.

Claims (4)

  1. 用于图像去模糊的Bi-Level优化方法,具体步骤是:Bi-Level optimization method for image deblurring, the specific steps are:
    第一步,用MSE损失条件训练出一个基本模型;The first step is to train a basic model with MSE loss conditions;
    第二步,采用双层级损失交互迭代进行模型微调操作。The second step is to use two-level loss interaction iteration to fine-tune the model.
  2. 根据权利要求1所述的用于图像去模糊的Bi-Level优化方法,其特征在于:The Bi-Level optimization method for image deblurring according to claim 1, characterized in that:
    (1)所述的生成器按以下方式设计:Bi-Skip-Net+残差,判别器参数依据表1:(1) The generator is designed in the following way: Bi-Skip-Net + residual, the discriminator parameters are based on Table 1:
    表1.判别器参数表Table 1. Discriminator parameter table
    ### Floor 参数维度Parameter dimension 步长Step 11 convconv 32x3x5x532x3x5x5 22 22 convconv 64x32x5x564x32x5x5 11 33 convconv 64x64x5x564x64x5x5 22 44 convconv 128x64x5x5128x64x5x5 11 55 convconv 128x128x5x5128x128x5x5 44 66 convconv 256x128x5x5256x128x5x5 11 77 convconv 256x256x5x5256x256x5x5 44 88 convconv 512x256x5x5512x256x5x5 11 99 convconv 512x512x4x4512x512x4x4 44 1010 fcfc 512x1x1x1512x1x1x1 --
    (2)用Bi-Skip-Net来训练图像残差,采用模糊图像+图像残差的模式来复原图像;(2) Use Bi-Skip-Net to train the image residuals, and use the blurred image + image residuals mode to restore the image;
    (3)用MSE损失条件训练出一个基本模型,采用如下的损失函数来训练网络;(3) Train a basic model with MSE loss conditions, and use the following loss function to train the network;
    Figure PCTCN2018117635-appb-100001
    Figure PCTCN2018117635-appb-100001
    其中
    Figure PCTCN2018117635-appb-100002
    为对抗损失函数,
    Figure PCTCN2018117635-appb-100003
    为条件损失函数,λ为条件损失函数的权重。
    among them
    Figure PCTCN2018117635-appb-100002
    To fight the loss function,
    Figure PCTCN2018117635-appb-100003
    Is the conditional loss function, and λ is the weight of the conditional loss function.
    Figure PCTCN2018117635-appb-100004
    Figure PCTCN2018117635-appb-100004
    通过最大化
    Figure PCTCN2018117635-appb-100005
    来优化判别器D;
    By maximizing
    Figure PCTCN2018117635-appb-100005
    To optimize discriminator D;
    Figure PCTCN2018117635-appb-100006
    Figure PCTCN2018117635-appb-100006
    通过最小化式3来优化生成器G;Optimize generator G by minimizing Equation 3;
    其中
    Figure PCTCN2018117635-appb-100007
    设计如下:
    among them
    Figure PCTCN2018117635-appb-100007
    The design is as follows:
    Figure PCTCN2018117635-appb-100008
    Figure PCTCN2018117635-appb-100008
    其中,L,S分别表示模型在不同层级的输出和真值,α取值为1或2,整个条件损失函数被通道数c,宽度w和高度h所规范;Among them, L and S respectively represent the output and true value of the model at different levels, and the value of α is 1 or 2, the entire conditional loss function is regulated by the number of channels c, width w and height h;
    (4)采用双层级损失交互迭代进行模型微调操作,具体是采用Li-Level优化机制来对网络进行优化;(4) Use double-level loss interactive iteration to fine-tune the model, specifically using Li-Level optimization mechanism to optimize the network;
    (5)将训练好的网络作为最终的复原模型。(5) Take the trained network as the final restoration model.
  3. 根据权利要求2所述的用于图像去模糊的Bi-Level优化方法,其特征在于:The Bi-Level optimization method for image deblurring according to claim 2, characterized in that:
    所述的Bi-Skip-Net是由压缩路径(D*)、双跨连接路径(S*)和扩张路径(U*)3部分组成;压缩路径提取图像的深度特征和浅层特征;双跨连接路径连接图像特征与扩张路径中的上采样特征;扩张路径实现特征上采样。The Bi-Skip-Net is composed of three parts: compression path (D *), double-span connection path (S *) and expansion path (U *); the compression path extracts the depth and shallow features of the image; double-span The connection path connects the image features with the upsampling features in the expansion path; the expansion path implements feature upsampling.
  4. 根据权利要求2所述的用于图像去模糊的Bi-Level优化方法,其特征在于:步骤(4)所述的Li-Level优化机制包括两个步骤(设训练过程中epoch次数为N),The Bi-Level optimization method for image deblurring according to claim 2, wherein the Li-Level optimization mechanism in step (4) includes two steps (set the number of epochs in the training process to N),
    Step1:当迭代次数小于1/3N,本发明采用像素级均方误差(MSE)作为损失函数训练模型;Step1: When the number of iterations is less than 1 / 3N, the present invention adopts pixel level mean square error (MSE) as the loss function to train the model;
    Step2:当迭代次数大于等于1/3N,本发明采用像素级的L1损失函数和特征级的感知损失函数来交替训练模型,本发明在实验过程中每隔2此迭代进行一次损失函数替换。Step2: When the number of iterations is greater than or equal to 1 / 3N, the present invention uses a pixel-level L1 loss function and a feature-level perceptual loss function to alternately train the model. The present invention performs loss function replacement every two iterations during the experiment.
PCT/CN2018/117635 2018-11-21 2018-11-27 Bi-level optimization method for image deblurring WO2020103171A1 (en)

Applications Claiming Priority (2)

Application Number Priority Date Filing Date Title
CN201811389239.7A CN109544475A (en) 2018-11-21 2018-11-21 Bi-Level optimization method for image deblurring
CN201811389239.7 2018-11-21

Publications (1)

Publication Number Publication Date
WO2020103171A1 true WO2020103171A1 (en) 2020-05-28

Family

ID=65848630

Family Applications (1)

Application Number Title Priority Date Filing Date
PCT/CN2018/117635 WO2020103171A1 (en) 2018-11-21 2018-11-27 Bi-level optimization method for image deblurring

Country Status (2)

Country Link
CN (1) CN109544475A (en)
WO (1) WO2020103171A1 (en)

Cited By (5)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN114049497A (en) * 2021-11-07 2022-02-15 西北工业大学 Image depth characterization method based on self-adaptive contrast learning
CN114066750A (en) * 2021-10-22 2022-02-18 西北工业大学 Self-encoder deblurring method based on domain transformation
CN114331913A (en) * 2022-01-06 2022-04-12 福州大学 Motion blurred image restoration method based on residual attention block
CN114511466A (en) * 2022-02-21 2022-05-17 北京大学深圳研究生院 Blind person face image restoration method based on generation of confrontation network prior
CN114511466B (en) * 2022-02-21 2024-04-26 北京大学深圳研究生院 Blind face image restoration method based on generation of countermeasure network priori

Families Citing this family (3)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN110175961B (en) * 2019-05-22 2021-07-27 艾特城信息科技有限公司 Reticulation removing method based on human face image segmentation countermeasure thought
US11544558B2 (en) * 2019-08-30 2023-01-03 Nec Corporation Continual learning of artificial intelligence systems based on bi-level optimization
CN117197002B (en) * 2023-11-07 2024-02-02 松立控股集团股份有限公司 Image restoration method based on perception diffusion

Citations (5)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
US20080253678A1 (en) * 2007-04-10 2008-10-16 Arcsoft, Inc. Denoise method on image pyramid
US20100020208A1 (en) * 2008-07-24 2010-01-28 Florida State University Research Foundation Systems and methods for training an active random field for real-time image denoising
CN106157268A (en) * 2016-07-28 2016-11-23 浙江工业大学 A kind of degraded image restored method based on the convex approximation of L0
CN106709875A (en) * 2016-12-30 2017-05-24 北京工业大学 Compressed low-resolution image restoration method based on combined deep network
CN107507141A (en) * 2017-08-07 2017-12-22 清华大学深圳研究生院 A kind of image recovery method based on adaptive residual error neutral net

Family Cites Families (3)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN106548159A (en) * 2016-11-08 2017-03-29 中国科学院自动化研究所 Reticulate pattern facial image recognition method and device based on full convolutional neural networks
CN108665058B (en) * 2018-04-11 2021-01-05 徐州工程学院 Method for generating countermeasure network based on segment loss
CN108550125B (en) * 2018-04-17 2021-07-30 南京大学 Optical distortion correction method based on deep learning

Patent Citations (5)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
US20080253678A1 (en) * 2007-04-10 2008-10-16 Arcsoft, Inc. Denoise method on image pyramid
US20100020208A1 (en) * 2008-07-24 2010-01-28 Florida State University Research Foundation Systems and methods for training an active random field for real-time image denoising
CN106157268A (en) * 2016-07-28 2016-11-23 浙江工业大学 A kind of degraded image restored method based on the convex approximation of L0
CN106709875A (en) * 2016-12-30 2017-05-24 北京工业大学 Compressed low-resolution image restoration method based on combined deep network
CN107507141A (en) * 2017-08-07 2017-12-22 清华大学深圳研究生院 A kind of image recovery method based on adaptive residual error neutral net

Cited By (7)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN114066750A (en) * 2021-10-22 2022-02-18 西北工业大学 Self-encoder deblurring method based on domain transformation
CN114066750B (en) * 2021-10-22 2024-03-05 西北工业大学 Self-encoder deblurring method based on domain transformation
CN114049497A (en) * 2021-11-07 2022-02-15 西北工业大学 Image depth characterization method based on self-adaptive contrast learning
CN114049497B (en) * 2021-11-07 2024-03-05 西北工业大学 Image depth characterization method based on self-adaptive contrast learning
CN114331913A (en) * 2022-01-06 2022-04-12 福州大学 Motion blurred image restoration method based on residual attention block
CN114511466A (en) * 2022-02-21 2022-05-17 北京大学深圳研究生院 Blind person face image restoration method based on generation of confrontation network prior
CN114511466B (en) * 2022-02-21 2024-04-26 北京大学深圳研究生院 Blind face image restoration method based on generation of countermeasure network priori

Also Published As

Publication number Publication date
CN109544475A (en) 2019-03-29

Similar Documents

Publication Publication Date Title
WO2020103171A1 (en) Bi-level optimization method for image deblurring
WO2021208122A1 (en) Blind video denoising method and device based on deep learning
Zheng et al. Adaptive unfolding total variation network for low-light image enhancement
CN110782399B (en) Image deblurring method based on multitasking CNN
WO2020087607A1 (en) Bi-skip-net-based image deblurring method
Ju et al. Single image dehazing via an improved atmospheric scattering model
KR101328741B1 (en) Apparatus and method for image enhancement using color channel
TWI509567B (en) Method and image processing apparatus for image visibility restoration
CN108564549B (en) Image defogging method based on multi-scale dense connection network
CN111091503B (en) Image defocusing and blurring method based on deep learning
CN105765607A (en) Multiple phase method for image deconvolution
Chowdhury et al. Fuzzy logic based filtering for image de-noising
TW201830330A (en) Image processing method and image processing system
CN111161360A (en) Retinex theory-based image defogging method for end-to-end network
Zhao et al. Legacy photo editing with learned noise prior
Malik et al. Llrnet: A multiscale subband learning approach for low light image restoration
Teranishi et al. Improvement of robustness blind image restoration method using failing detection process
Du et al. Perceptually optimized generative adversarial network for single image dehazing
Arora et al. Low light image enhancement via global and local context modeling
CN113344804A (en) Training method of low-light image enhancement model and low-light image enhancement method
Kaur et al. An improved adaptive bilateral filter to remove gaussian noise from color images
CN106897975B (en) Image denoising method for hypercube particle calculation
CN106952243A (en) UUV Layer Near The Sea Surface infrared image self adaptation merger histogram stretches Enhancement Method
Khmag et al. Blur removal in natural digital images using self-reference generative networks
Xie et al. DHD-Net: A novel deep-learning-based dehazing network

Legal Events

Date Code Title Description
121 Ep: the epo has been informed by wipo that ep was designated in this application

Ref document number: 18941012

Country of ref document: EP

Kind code of ref document: A1

NENP Non-entry into the national phase

Ref country code: DE

122 Ep: pct application non-entry in european phase

Ref document number: 18941012

Country of ref document: EP

Kind code of ref document: A1