CN110363727B - Image defogging method based on multi-scale dark channel prior cascade deep neural network - Google Patents

Image defogging method based on multi-scale dark channel prior cascade deep neural network

Info

Publication number
CN110363727B
CN110363727B (application CN201910673412.4A)
Authority
CN
China
Prior art keywords
image
scale
transmittance
original
foggy
Prior art date
Legal status
Active
Application number
CN201910673412.4A
Other languages
Chinese (zh)
Other versions
CN110363727A (en)
Inventor
崔智高
苏延召
李爱华
王涛
姜柯
蔡艳平
冯国彦
李庆辉
Current Assignee
Rocket Force University of Engineering of PLA
Original Assignee
Rocket Force University of Engineering of PLA
Priority date
Filing date
Publication date
Application filed by Rocket Force University of Engineering of PLA filed Critical Rocket Force University of Engineering of PLA
Priority to CN201910673412.4A
Publication of CN110363727A
Application granted
Publication of CN110363727B

Classifications

    • G PHYSICS
    • G06 COMPUTING; CALCULATING OR COUNTING
    • G06T IMAGE DATA PROCESSING OR GENERATION, IN GENERAL
    • G06T 5/00 Image enhancement or restoration
    • G06T 5/73 Deblurring; Sharpening
    • G06T 5/90 Dynamic range modification of images or parts thereof
    • G06T 2207/00 Indexing scheme for image analysis or image enhancement
    • G06T 2207/10 Image acquisition modality
    • G06T 2207/10004 Still image; Photographic image
    • G06T 2207/20 Special algorithmic details
    • G06T 2207/20024 Filtering details
    • G06T 2207/20081 Training; Learning
    • G06T 2207/20084 Artificial neural networks [ANN]

Landscapes

  • Physics & Mathematics (AREA)
  • General Physics & Mathematics (AREA)
  • Engineering & Computer Science (AREA)
  • Theoretical Computer Science (AREA)
  • Image Processing (AREA)
  • Image Analysis (AREA)

Abstract

The invention discloses an image defogging method based on a multi-scale dark channel prior cascade deep neural network, comprising the following steps: first, establishing a training set of foggy images; second, defogging a single random foggy image; third, computing the loss objective function of the original single foggy image; fourth, updating the weight parameter sets; fifth, taking a new single random foggy image and repeating steps two to four until the loss objective function of the original single foggy image is smaller than the loss objective function threshold, which determines the final cascade defogging model; sixth, defogging a single actual foggy image. The method uses a convolutional neural network to estimate the dark channel and the global illumination parameters on images of different scales, then fuses the dark channel and the defogged images scale by scale, and finally obtains the defogged image through supervised learning.

Description

Image defogging method based on multi-scale dark channel prior cascade deep neural network
Technical Field
The invention belongs to the technical field of image processing, and particularly relates to an image defogging method based on a multi-scale dark channel prior cascade deep neural network.
Background
Because of atmospheric scattering, images captured in bad weather such as fog and haze suffer from quality degradation: colors turn grayish white, contrast drops, and object features become hard to identify. This not only worsens the visual effect and reduces image legibility, but can also cause the image content to be misinterpreted. Image defogging refers to reducing or eliminating the adverse effects of airborne particles on an image by specific methods and means. Single-image defogging refers to defogging one image to obtain a clear image when only a single foggy image is available.
The existing single image defogging method mainly comprises three categories: the first category is image enhancement based methods, the second category is physical model based methods, and the third category is deep learning based methods.
The essence of image-enhancement-based methods is to enhance the degraded image and thus improve its quality, using techniques such as histogram equalization, logarithmic transformation, power-law transformation, sharpening, and wavelet transformation. These methods enhance the contrast of the image or highlight its features. Besides common contrast-enhancement methods, another widely used enhancement method is the Retinex method, based on color constancy and retinal-cortex theory. It decomposes the image into the product of an intrinsic image and an illumination image, thereby removing the influence that haze-occluded illumination has on imaging. Compared with traditional contrast-improvement methods, the defogged image obtained by Retinex has better local contrast and less color distortion. However, Retinex is itself an ill-posed problem, so only an approximate estimate can be computed, which limits the defogging effect to some extent.
Physical-model-based methods use the atmospheric scattering model I = JT + (1 - T)A, where I denotes the foggy image and J the fog-free image, to estimate the scene medium transmittance T and the global atmospheric illumination A, and thus recover a clear fog-free image. However, with only a single foggy image, estimating T and A is also an ill-posed problem, and only approximate estimates are possible. Methods that restore a foggy image to a fog-free one through the atmospheric scattering model fall into three classes: class 1 is based on depth information; class 2 consists of defogging algorithms based on the polarization characteristics of atmospheric light; class 3 is based on prior knowledge. The first two usually require manual assistance to obtain good results, while class 3 is currently the most common, including methods based on dark channel statistical priors and on color statistical priors. Because such priors are derived from statistical information, they cannot adapt to all scenes; for example, dark-channel-prior methods produce biased transmittance estimates in bright regions such as sky, making the whole defogged image dark. These methods also have the problem that many parameters must be set manually for each scene.
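A minimal sketch of the class-3 approach described above, in Python with NumPy: the dark channel of the hazy image is used to estimate the transmittance T, and the scattering model I = JT + (1 - T)A is then inverted. The patch size, omega, and t0 values are conventional choices from the dark-channel-prior literature, not values from this patent.

```python
import numpy as np

def dark_channel(img, patch=15):
    """Per-pixel minimum over RGB, then a min-filter over a local patch."""
    h, w, _ = img.shape
    mins = img.min(axis=2)
    pad = patch // 2
    padded = np.pad(mins, pad, mode='edge')
    dark = np.empty_like(mins)
    for i in range(h):
        for j in range(w):
            dark[i, j] = padded[i:i + patch, j:j + patch].min()
    return dark

def dehaze(I, A=1.0, omega=0.95, t0=0.1):
    """Estimate T from the dark channel, then invert I = J*T + (1-T)*A."""
    T = 1.0 - omega * dark_channel(I / A)
    t = np.clip(T, t0, 1.0)[..., None]   # clamp T away from zero
    return (I - A * (1.0 - t)) / t
```

The sky-region bias mentioned above shows up here directly: a bright region has a large dark channel, so T is underestimated and the recovered J is pushed dark.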
Deep-learning-based methods use artificially synthesized foggy image datasets and convolutional neural networks to achieve defogging, and fall into two types. (1) A deep neural network represents the atmospheric scattering model and automatically learns to estimate the corresponding T and A. Unlike prior-knowledge-based estimation of transmittance and atmospheric illumination, these methods learn mainly from data and can thus overcome the bias of some priors, but they usually need the scene depth to synthesize T for supervised learning. (2) The defogging process is treated directly as image transformation or image synthesis, without any assumption about or estimation of T and A. Image-synthesis-based methods generally preprocess the foggy image with contrast enhancement, white balance, and similar operations, then learn a weighting function with a neural network to fuse the preprocessed images and achieve defogging; however, they tend to depend strongly on the preprocessed images, and single-frame processing time is long. Image-transformation-based methods use a neural network to learn the nonlinear mapping between the foggy image and the fog-free image directly; however, they lack the contrast of real scenes and thus depend very strongly on the data.
Disclosure of Invention
The technical problem solved by the invention is to provide, against the defects of the prior art, an image defogging method based on a multi-scale dark channel prior cascade deep neural network: a convolutional neural network estimates the dark channel and the global illumination parameters on images of different scales, the dark channel and the defogged images are then fused scale by scale, and the defogged image is finally obtained through supervised learning.
In order to solve the technical problems, the invention adopts the technical scheme that: the image defogging method based on the multi-scale dark channel prior cascade deep neural network is characterized by comprising the following steps of:
step one, establishing a training set of foggy images: synthesizing a group of foggy image training sets from an image dataset with known depth according to the atmospheric scattering model;
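Step one can be sketched as follows: given a clean image J and its depth map, the transmittance is T = exp(-beta * depth) and the hazy image follows from the scattering model. The specific beta and A values are illustrative free parameters (varying them expands the training set), not values from the patent.

```python
import numpy as np

def synthesize_hazy(J, depth, beta=1.0, A=0.8):
    """Render a hazy image from clean image J and depth map via
    I = J*T + (1-T)*A with T = exp(-beta*depth)."""
    T = np.exp(-beta * depth)[..., None]   # per-pixel transmittance
    return J * T + (1.0 - T) * A
```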
step two, defogging of a single random foggy image, which comprises the following steps:
step 201, randomly extracting a foggy image from the foggy image training set of step one and normalizing its size to obtain an original single foggy image I^h of size 2^m × 2^n, where m and n are positive integers not less than 8;
step 202, down-sampling the original single foggy image I^h to obtain the first-scale original foggy image I_1^h, the second-scale original foggy image I_2^h, the third-scale original foggy image I_3^h, and the fourth-scale original foggy image I_4^h, where I_1^h has resolution 2^(m-4) × 2^(n-4), I_2^h has resolution 2^(m-3) × 2^(n-3), I_3^h has resolution 2^(m-2) × 2^(n-2), and I_4^h has resolution 2^(m-1) × 2^(n-1);
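The four-scale pyramid of step 202 can be produced, for example, by average pooling with factors 16, 8, 4, and 2; the pooling choice is our assumption, since the patent only specifies the output resolutions. For m = n = 8 (a 256 × 256 input) the scales are 16 × 16, 32 × 32, 64 × 64, and 128 × 128.

```python
import numpy as np

def downsample(img, factor):
    """Average-pool an (h, w, c) image by an integer factor."""
    h, w, c = img.shape
    return img.reshape(h // factor, factor, w // factor, factor, c).mean(axis=(1, 3))

def build_scales(I_h):
    """Factors 16, 8, 4, 2 give resolutions 2^(m-4) .. 2^(m-1) for a 2^m x 2^n input."""
    return [downsample(I_h, f) for f in (16, 8, 4, 2)]
```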
Step 203, utilizing the first deep convolutional network
Figure GDA00024113915400000310
For the original foggy image of the first scale
Figure GDA00024113915400000311
Estimating a first global atmospheric illumination A1First transmittance image T1And a first transmittance image T1Up-sampled image T of1 uI.e. first deep convolutional network
Figure GDA00024113915400000312
Is a first scale original hazy image
Figure GDA00024113915400000313
The output is first global atmospheric illumination A1First transmittance image T1And a first transmittance image T1Up-sampled image T of1 uWherein w is1For a first deep convolutional network
Figure GDA00024113915400000314
A first global atmospheric illumination
Figure GDA00024113915400000315
First transmittance image
Figure GDA00024113915400000316
First transmittance image T1Up-sampled image T of1 u=Deconv(T1) Conv (. cndot.) is a convolution module, Maxpool (. cndot.) is a maximum pooling module, Gfl (. cndot.) is a guided filtering module, Deconv (. cndot.) is a deconvolution module;
obtaining a first-scale defogged image D by using an atmospheric scattering model1Wherein, in the step (A),
Figure GDA0002411391540000041
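The scattering-model inversion used to obtain D_1 can be written out as follows; the clamp t_min is a standard numerical safeguard and our assumption, not a value from the patent.

```python
import numpy as np

def recover(I_k, A_k, T_k, t_min=0.05):
    """Invert the scattering model at one scale: D_k = (I_k - A_k*(1-T_k)) / T_k.
    T_k is an (h, w) transmittance map; it is clamped away from zero."""
    t = np.clip(T_k, t_min, 1.0)[..., None]
    return (I_k - A_k * (1.0 - t)) / t
```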
step 204, using the second deep convolutional network f^2 with weight parameter set w_2 to estimate, for the second-scale original foggy image I_2^h, the second global atmospheric illumination A_2, the second transmittance image T_2, and the up-sampled image T_2^u of T_2; that is, the input of f^2 is I_2^h and its outputs are A_2, T_2 and T_2^u, where A_2 and T_2 are computed by formulas given only as equation images in the original, and T_2^u = Deconv(T_2);
obtaining a second-scale defogging temporary image (denoted here D_2') by using the atmospheric scattering model, where Concat(·) is a superposition (concatenation) function (the formula is given as an equation image in the original);
fusing according to the fusion formula (an equation image in the original) to obtain a second-scale defogged image D_2;
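The fusion formula for D_2 appears only as an equation image in the source. A plausible sketch consistent with the surrounding description (concatenating the temporary image with the up-sampled previous-scale defogged result and transmittance, then applying a learned convolution) is below; the 1×1 convolution, the nearest-neighbour upsampling stand-in for Deconv, and all shapes are our assumptions.

```python
import numpy as np

def upsample2(img):
    """Nearest-neighbour 2x upsampling (a stand-in for the Deconv module)."""
    return img.repeat(2, axis=0).repeat(2, axis=1)

def fuse(D_temp, D_prev, T_prev_u, W):
    """Hypothetical fusion: Concat the temporary defogged image with the
    up-sampled previous-scale result and transmittance, then mix channels
    with a learned 1x1 convolution W of shape (c_in, 3)."""
    x = np.concatenate([D_temp, upsample2(D_prev), T_prev_u[..., None]], axis=2)
    return x @ W   # (h, w, c_in) @ (c_in, 3) -> (h, w, 3)
```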
Step 205, utilizing a third deep convolutional network
Figure GDA00024113915400000412
For original foggy image of third scale
Figure GDA00024113915400000413
Estimating third Global atmospheric illumination A3And a third transmittance image T3And a third transmittance image T3Up-sampled image of
Figure GDA00024113915400000414
I.e. the third deep convolutional network
Figure GDA00024113915400000415
Is the original foggy image of the third scale
Figure GDA00024113915400000416
The output is third global atmospheric illumination A3And a third transmittance image T3And a third transmittance image T3Up-sampled image of
Figure GDA00024113915400000417
Wherein, w3For a third deep convolutional network
Figure GDA00024113915400000418
A set of weight parameters of, a third global atmospheric illumination
Figure GDA00024113915400000419
Third transmittance image
Figure GDA00024113915400000420
Third transmittance image T3Up-sampled image of
Figure GDA00024113915400000421
Obtaining a third-scale defogging temporary image by using the atmospheric scattering model
Figure GDA00024113915400000422
Wherein the content of the first and second substances,
Figure GDA0002411391540000051
according to the formula
Figure GDA0002411391540000052
Fusing to obtain a third-scale defogged image D3
Step 206, utilizing a fourth deep convolutional network
Figure GDA0002411391540000053
For the fourth-scale original foggy image
Figure GDA0002411391540000054
Estimating a fourth global atmospheric illumination A4And a fourth transmittance image T4And a fourth transmittance image T4Up-sampled image of
Figure GDA0002411391540000055
I.e. the fourth deep convolutional network
Figure GDA0002411391540000056
Is the fourth scale original hazy image
Figure GDA0002411391540000057
The output is the fourth global atmospheric illumination A4And a fourth transmittance image T4And a fourth transmittance image T4Up-sampled image of
Figure GDA0002411391540000058
Wherein, w4For a fourth deep convolutional network
Figure GDA0002411391540000059
A fourth global atmospheric illumination
Figure GDA00024113915400000510
Fourth transmittance image
Figure GDA00024113915400000511
Fourth transmittance image T4Up-sampled image of
Figure GDA00024113915400000512
Obtaining a fourth-scale defogging temporary image by using the atmospheric scattering model
Figure GDA00024113915400000513
Wherein the content of the first and second substances,
Figure GDA00024113915400000514
according to the formula
Figure GDA00024113915400000515
Fusing to obtain a fourth-scale defogged image D4
Step 207, utilizing a fifth deep convolutional network
Figure GDA00024113915400000516
To original single fogged image
Figure GDA00024113915400000517
Estimating a fifth global atmospheric illumination A5And a fifth transmittance image T5I.e. fifth deep convolutional network
Figure GDA00024113915400000518
Is the original single foggy image
Figure GDA00024113915400000519
The output is the fifth global atmospheric illumination A5And a fifth transmittance image T5Wherein w is5For a fifth deep convolutional network
Figure GDA00024113915400000520
Weight parameter set of (1), fifth global atmospheric lighting
Figure GDA00024113915400000521
Fifth transmittance image
Figure GDA00024113915400000522
Obtaining an original defogging temporary image by using an atmospheric scattering model
Figure GDA00024113915400000523
Wherein the content of the first and second substances,
Figure GDA00024113915400000524
according to the formula
Figure GDA00024113915400000525
Fusing to obtain an original defogged image D5
Step three, according to the formula
Figure GDA0002411391540000061
Computing original single foggy images
Figure GDA0002411391540000062
The loss objective function L of (1), wherein i is a scale number, the numeric range of i is 1-5, and GiAs an image DiCorresponding reference truth image, NiAs an image DiNumber of upper pixels, LiAs an image DiCorresponding countermeasure loss;
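The exact loss formula exists only as an equation image in the source; a common form consistent with the symbols it names (per-scale pixel-averaged reconstruction error plus a weighted adversarial term) can be sketched as follows. The MSE choice and the weight lam are our assumptions.

```python
import numpy as np

def multiscale_loss(D, G, L_adv, lam=0.01):
    """Sum over the five scales of per-pixel MSE between defogged image D_i and
    ground truth G_i, plus a weighted adversarial term L_i. The exact weighting
    is assumed; the patent gives the formula only as an image."""
    total = 0.0
    for d, g, la in zip(D, G, L_adv):
        total += np.mean((d - g) ** 2) + lam * la
    return total
```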
step four, updating the weight parameter sets: feeding the loss objective function L of the original single foggy image I^h into an Adam optimizer to train and optimize the cascade defogging model f_w, obtaining each weight parameter set during the update;
step five, taking a new single random foggy image and repeating steps two to four until the loss objective function L of the original single foggy image I^h satisfies L < Δ, where Δ is the loss objective function threshold; at that point the training results w = {w_1, w_2, w_3, w_4, w_5} of the weight parameter sets of the cascade defogging model f_w are obtained, determining the final cascade defogging model f_w;
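Steps four and five together amount to the following loop; Adam is shown in minimal scalar form, and a toy quadratic loss stands in for the multi-scale loss L. The learning rate, the toy loss, and the starting weight are illustrative; the threshold 0.004 comes from the patent's stated preferred range 0 < Δ < 0.004.

```python
import numpy as np

class Adam:
    """Minimal scalar Adam optimizer, as used in step four to update the weights."""
    def __init__(self, lr=0.1, b1=0.9, b2=0.999, eps=1e-8):
        self.lr, self.b1, self.b2, self.eps = lr, b1, b2, eps
        self.m = self.v = 0.0
        self.t = 0

    def step(self, w, grad):
        self.t += 1
        self.m = self.b1 * self.m + (1 - self.b1) * grad          # first moment
        self.v = self.b2 * self.v + (1 - self.b2) * grad ** 2     # second moment
        mhat = self.m / (1 - self.b1 ** self.t)                   # bias correction
        vhat = self.v / (1 - self.b2 ** self.t)
        return w - self.lr * mhat / (np.sqrt(vhat) + self.eps)

# Step-five loop: iterate until the loss drops below the threshold delta.
opt = Adam(lr=0.1)
w, delta = 1.0, 0.004
for _ in range(1000):
    loss = w ** 2          # toy stand-in for the multi-scale loss L
    if loss < delta:
        break
    w = opt.step(w, 2 * w) # gradient of w**2
```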
step six, defogging a single actual foggy image: using the trained cascade defogging model f_w to defog the single actual foggy image and obtain its defogged image.
The above image defogging method based on the multi-scale dark channel prior cascade deep neural network is characterized in that: the value ranges of m and n are both 8 to 12.
The above image defogging method based on the multi-scale dark channel prior cascade deep neural network is characterized in that: when the first deep convolutional network f^1, the second deep convolutional network f^2, the third deep convolutional network f^3, and the fourth deep convolutional network f^4 of step two are first used, their weight parameter sets w_1, w_2, w_3, and w_4 are random initialization values.
The image defogging method based on the multi-scale dark channel prior cascade deep neural network is characterized in that: the image dataset of known depth comprises an NYU image dataset.
The image defogging method based on the multi-scale dark channel prior cascade deep neural network is characterized by comprising the following steps of: the value range of the loss objective function threshold value delta is as follows: 0< Δ < 0.004.
Compared with the prior art, the invention has the following advantages:
1. The method performs dark channel prior estimation starting from low resolution to obtain a preliminary defogging result, then continuously fuses the multi-scale features, namely the medium transmittance and the defogged images, to finally obtain a high-resolution defogged image. It estimates the global illumination and medium transmittance in the dark-channel-prior manner and performs defogging through convolutional fusion of spatial multi-scale defogging results and multi-scale loss-function training, giving good real-time performance, high accuracy, and convenient adoption.
2. The invention can realize the end-to-end high-resolution defogging result by using less weight parameters, and has the advantages of reliability, stability and good use effect.
3. The method has simple steps: it simulates the dark channel prior estimation and defogging process, uses a fully convolutional neural network for multi-scale global illumination estimation and multi-level feature fusion, and automatically learns the parameters that the dark channel estimation process would otherwise require to be set manually, which facilitates popularization and use.
4. The method utilizes the convolutional neural network to estimate the dark channel and the global illumination parameter on the images with different scales, then gradually fuses the dark channel and the defogged image, and finally obtains the defogged image through supervised learning.
5. The defogging of each single random foggy image is carried out at multiple image scales by a cascaded deep neural network; when the loss objective function of the original single foggy image is computed, loss functions are computed at several image scales and weighted-averaged; and an optimizer performs gradient-descent optimization to update the weight parameter sets, giving good real-time performance and high accuracy.
In conclusion, the method uses a convolutional neural network to estimate the dark channel and the global illumination parameters on images of different scales, fuses the dark channel and the defogged images scale by scale, and finally obtains the defogged image through supervised learning. It effectively exploits the feature-modeling capability of deep neural networks to fuse parameters across scales, can obtain a high-resolution defogged image with few model parameters, and adapts well to outdoor scenes, offering a small model, good real-time performance, and high accuracy, which makes it convenient to popularize and use.
The technical solution of the present invention is further described in detail by the accompanying drawings and embodiments.
Drawings
FIG. 1 is a block diagram of a process flow of the method of the present invention.
Detailed Description
As shown in fig. 1, the image defogging method based on the multi-scale dark channel prior cascade deep neural network of the invention comprises the following steps:
step one, establishing a training set of foggy images: synthesizing a group of foggy image training sets from an image dataset with known depth according to the atmospheric scattering model, effectively expanding the image data volume of the foggy image training set;
in this embodiment, the image data set with the known depth includes an NYU image data set, and the experimental result is trained by using a public standard data set, so that the method is strong in adaptability, high in image processing accuracy, and good in defogging effect.
Step two, defogging of a single random foggy image, which comprises the following steps:
step 201, randomly extracting a foggy image from the foggy image training set of step one and normalizing its size to obtain an original single foggy image I^h of size 2^m × 2^n, where m and n are positive integers not less than 8;
step 202, down-sampling the original single foggy image I^h to obtain the first-scale original foggy image I_1^h, the second-scale original foggy image I_2^h, the third-scale original foggy image I_3^h, and the fourth-scale original foggy image I_4^h, where I_1^h has resolution 2^(m-4) × 2^(n-4), I_2^h has resolution 2^(m-3) × 2^(n-3), I_3^h has resolution 2^(m-2) × 2^(n-2), and I_4^h has resolution 2^(m-1) × 2^(n-1);
Step 203, utilizing the first deep convolutional network
Figure GDA00024113915400000811
For the original foggy image of the first scale
Figure GDA00024113915400000812
Estimating a first global atmospheric illumination A1First transmittance image T1And a first transmittance image T1Up-sampled image T of1 uI.e. first deep convolutional network
Figure GDA00024113915400000813
Is a first scale original hazy image
Figure GDA00024113915400000814
The output is first global atmospheric illumination A1First transmittance image T1And a first transmittance image T1Up-sampled image T of1 uWherein w is1For a first deep convolutional network
Figure GDA0002411391540000091
A first global atmospheric illumination
Figure GDA0002411391540000092
First transmittance image
Figure GDA0002411391540000093
First transmittance image T1Up-sampled image T of1 u=Deconv(T1) Conv (. cndot.) is a convolution module, Maxpool (. cndot.) is a maximum pooling module, Gfl (. cndot.) is a guided filtering module, Deconv (. cndot.) is a deconvolution module;
obtaining a first-scale defogged image D by using an atmospheric scattering model1Wherein, in the step (A),
Figure GDA0002411391540000094
step 204, using the second deep convolutional network f^2 with weight parameter set w_2 to estimate, for the second-scale original foggy image I_2^h, the second global atmospheric illumination A_2, the second transmittance image T_2, and the up-sampled image T_2^u of T_2; that is, the input of f^2 is I_2^h and its outputs are A_2, T_2 and T_2^u, where A_2 and T_2 are computed by formulas given only as equation images in the original, and T_2^u = Deconv(T_2);
obtaining a second-scale defogging temporary image (denoted here D_2') by using the atmospheric scattering model, where Concat(·) is a superposition (concatenation) function (the formula is given as an equation image in the original);
fusing according to the fusion formula (an equation image in the original) to obtain a second-scale defogged image D_2;
Step 205, utilizing a third deep convolutional network
Figure GDA00024113915400000918
For original foggy image of third scale
Figure GDA00024113915400000919
Estimating third Global atmospheric illumination A3And a third transmittance image T3And a third transmittance image T3Up-sampled image T of3 uI.e. the third deep convolutional network
Figure GDA00024113915400000920
Is the original foggy image of the third scale
Figure GDA00024113915400000921
The output is third global atmospheric illumination A3And a third transmittance image T3And a third transmittance image T3Up-sampled image T of3 uWherein w is3For a third deep convolutional network
Figure GDA00024113915400000922
A set of weight parameters of, a third global atmospheric illumination
Figure GDA00024113915400000923
Third transmittance image
Figure GDA00024113915400000924
Third transmittance image T3Up-sampled image of
Figure GDA0002411391540000101
Obtaining a third-scale defogging temporary image by using the atmospheric scattering model
Figure GDA0002411391540000102
Wherein the content of the first and second substances,
Figure GDA0002411391540000103
according to the formula
Figure GDA0002411391540000104
Fusing to obtain a third-scale defogged image D3
Step 206, utilizing a fourth deep convolutional network
Figure GDA0002411391540000105
For the fourth-scale original foggy image
Figure GDA0002411391540000106
Estimating a fourth global atmospheric illumination A4And a fourth transmittance image T4And a fourth transmittance image T4Up-sampled image of
Figure GDA0002411391540000107
I.e. the fourth deep convolutional network
Figure GDA0002411391540000108
Is the fourth scale original hazy image
Figure GDA0002411391540000109
The output is the fourth global atmospheric illumination A4And a fourth transmittance image T4And a fourth transmittance image T4Up-sampled image of
Figure GDA00024113915400001010
Wherein, w4For a fourth deep convolutional network
Figure GDA00024113915400001011
A fourth global atmospheric illumination
Figure GDA00024113915400001012
Fourth transmittance image
Figure GDA00024113915400001013
Fourth transmittance image T4Up-sampled image of
Figure GDA00024113915400001014
Obtaining a fourth-scale defogging temporary image by using the atmospheric scattering model
Figure GDA00024113915400001015
Wherein the content of the first and second substances,
Figure GDA00024113915400001016
according to the formula
Figure GDA00024113915400001017
Fusing to obtain a fourth-scale defogged image D4
Step 207, utilizing a fifth deep convolutional network (the network symbol appears only as a formula image in the original) to estimate, for the original single foggy image I, a fifth global atmospheric illumination A5 and a fifth transmittance image T5; that is, the input of the fifth deep convolutional network is the original single foggy image I, and the outputs are the fifth global atmospheric illumination A5 and the fifth transmittance image T5, where w5 is the weight parameter set of the fifth deep convolutional network;
obtaining an original temporary defogged image J5 by using the atmospheric scattering model, wherein J5 = (I − A5)/T5 + A5;
fusing according to the fusion formula (formula image in the original) to obtain an original defogged image D5.
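The temporary defogged images in steps 203 through 207 all come from inverting the atmospheric scattering model I = J·T + A·(1 − T). The following NumPy sketch is an illustration, not the patent's implementation; the function name and the transmittance floor t_min are assumptions added here.

```python
import numpy as np

def invert_scattering_model(I, A, T, t_min=0.1):
    """Recover scene radiance J from a hazy image I using
    I = J * T + A * (1 - T)  =>  J = (I - A) / T + A.
    T is clamped from below to avoid division blow-up in dense haze."""
    T = np.clip(T, t_min, 1.0)
    return (I - A) / T + A

# Tiny example: a uniform scene half-obscured by haze.
I = np.full((2, 2, 3), 0.6)   # observed hazy intensities in [0, 1]
A = 1.0                        # global atmospheric illumination
T = np.full((2, 2, 1), 0.5)   # medium transmittance
J = invert_scattering_model(I, A, T)
# J = (0.6 - 1.0)/0.5 + 1.0 = 0.2 everywhere
```

The clamp on T mirrors the common dark-channel-prior practice of bounding the transmittance away from zero before division.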
In this embodiment, the weight parameter set w1 of the first deep convolutional network, the weight parameter set w2 of the second deep convolutional network, the weight parameter set w3 of the third deep convolutional network and the weight parameter set w4 of the fourth deep convolutional network in step two are initially random initialization values.
In this embodiment, dark channel prior estimation starts from the lowest resolution to obtain a preliminary defogging result, and the multi-scale features, namely the medium transmittance and the defogged images, are fused scale by scale to finally obtain a high-resolution defogged image. The global illumination and the medium transmittance are estimated in the manner of the dark channel prior, and defogging is performed through convolutional fusion of the spatial multi-scale defogging results and multi-scale loss function optimization training, giving good real-time performance and high accuracy. By simulating the dark channel prior estimation and defogging processes, a fully convolutional neural network performs global illumination estimation and multi-level feature fusion at multiple scales, and the parameters that would otherwise have to be set manually in dark channel estimation are learned automatically. The method estimates the dark channel and global illumination parameters on images of different scales with a convolutional neural network, then fuses the dark channels and defogged images stage by stage, and finally obtains the defogged image through supervised learning.
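The multi-scale processing described above operates on a power-of-two pyramid of the input image. A minimal sketch of such a pyramid, assuming simple 2×2 block averaging in place of the patent's learned down-sampling, is:

```python
import numpy as np

def downsample2x(img):
    """Halve each spatial dimension by averaging 2x2 blocks."""
    h, w = img.shape[0] // 2 * 2, img.shape[1] // 2 * 2
    img = img[:h, :w]
    return 0.25 * (img[0::2, 0::2] + img[1::2, 0::2]
                   + img[0::2, 1::2] + img[1::2, 1::2])

def build_pyramid(img, levels=4):
    """Return [I^(1), I^(2), I^(3), I^(4)]: successively halved copies,
    matching resolutions 2^(m-4) x 2^(n-4) up to 2^(m-1) x 2^(n-1)."""
    scales, cur = [], img
    for _ in range(levels):
        cur = downsample2x(cur)
        scales.append(cur)
    return scales[::-1]  # coarsest (first scale) ... finest (fourth scale)

pyramid = build_pyramid(np.zeros((256, 256, 3)))
# shapes: (16,16,3), (32,32,3), (64,64,3), (128,128,3)
```

With a 2^m × 2^n input, the four levels reproduce exactly the resolutions listed in step 202.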
Step three, computing the loss objective function L of the original single foggy image I according to the loss formula (formula image in the original), where i is the scale number with a value range of 1 to 5, Gi is the reference ground-truth image corresponding to image Di, Ni is the number of pixels in image Di, and Li is the adversarial loss corresponding to image Di;
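The source shows the loss objective only as a formula image; a plausible reading, given the later description of per-scale losses being weighted-averaged, is a sum over scales of a pixel-normalized reconstruction term plus the adversarial term Li. The equal weighting in this sketch is an illustrative assumption, not the patent's exact formula.

```python
import numpy as np

def multiscale_loss(defogged, truths, adv_losses):
    """Sum over scales i of the mean squared error between D_i and G_i
    (normalized by the pixel count N_i) plus the adversarial loss L_i.
    The exact combination in the patent appears only as a formula image;
    this equal-weight sum is an assumption."""
    total = 0.0
    for D, G, adv in zip(defogged, truths, adv_losses):
        n_pixels = D.size
        total += np.sum((D - G) ** 2) / n_pixels + adv
    return total

D = [np.zeros((4, 4)), np.zeros((8, 8))]
G = [np.ones((4, 4)), np.zeros((8, 8))]
loss = multiscale_loss(D, G, adv_losses=[0.0, 0.5])
# first scale: mean((0-1)^2) = 1.0; second scale: 0 + 0.5 -> total 1.5
```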
Step four, updating the weight parameter sets: the loss objective function L of the original single foggy image I is fed into an Adam optimizer to train and optimize the cascade defogging model fw, and each weight parameter set is updated during this process;
It should be noted that end-to-end high-resolution defogging can be achieved with fewer weight parameters, and the method is reliable and stable.
Step five, taking a new single random foggy image and repeating step two through step four until the loss objective function L of the original single foggy image I satisfies L < Δ; at this point the training results w = {w1, w2, w3, w4, w5} of the weight parameter sets of the cascade defogging model fw are obtained and the final cascade defogging model fw is determined, where Δ is the loss objective function threshold;
In this embodiment, the value range of the loss objective function threshold Δ is 0 < Δ < 0.004.
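Steps two through five amount to a loop: sample a foggy image, run the cascade, compute L, take an Adam step, and stop once L < Δ. A schematic version follows; model_step and sample_image are stand-ins for the patent's model and data pipeline, not its implementation.

```python
def train_until_threshold(model_step, sample_image, delta=0.004, max_iters=100000):
    """Repeat the defog-and-update cycle until the loss objective drops
    below the threshold delta (0 < delta < 0.004 in this embodiment).
    model_step(img) is assumed to run the cascade, apply one Adam
    update, and return the current loss; both callables are stand-ins."""
    loss = float("inf")
    for _ in range(max_iters):
        loss = model_step(sample_image())
        if loss < delta:
            break
    return loss

# Toy stand-in: the loss halves on every update, starting at 1.0.
state = {"loss": 1.0}
def fake_step(_img):
    state["loss"] *= 0.5
    return state["loss"]

final = train_until_threshold(fake_step, lambda: None, delta=0.004)
# 1.0 * 0.5^k first drops below 0.004 at k = 8 (0.00390625)
```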
Step six, defogging a single actual foggy image: the trained cascade defogging model fw is used to defog the single actual foggy image, obtaining the defogged image of that single actual foggy image.
In the embodiment, the value ranges of m and n are both 8-12.
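Step 201's normalization to a 2^m × 2^n size with m and n between 8 and 12 can be sketched as follows; the nearest-power-of-two rounding policy here is an assumption, as the patent does not specify how the target exponents are chosen.

```python
import math

def pow2_shape(height, width, lo=8, hi=12):
    """Pick exponents m, n in [lo, hi] whose powers of two are closest
    (in log scale) to the actual image dimensions; the image would then
    be resized to the returned (2^m, 2^n) shape."""
    def clamp_exp(d):
        e = round(math.log2(d))
        return min(max(int(e), lo), hi)
    return 2 ** clamp_exp(height), 2 ** clamp_exp(width)

print(pow2_shape(480, 640))   # a 480x640 input maps to (512, 512)
```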
In use, each single random foggy image is defogged at multiple image scales by the cascade deep neural network. When the loss objective function of the original single foggy image is computed, loss functions are calculated on the individual image scales and weighted-averaged; an optimizer then performs gradient descent to update the weight parameter sets, giving good real-time performance and high accuracy. The final cascade defogging model is determined once the loss objective function of the original single foggy image falls below the loss objective function threshold; finally, a single actual foggy image is defogged with the final cascade defogging model.
The above description is only a preferred embodiment of the present invention, and is not intended to limit the present invention, and all simple modifications, changes and equivalent structural changes made to the above embodiment according to the technical spirit of the present invention still fall within the protection scope of the technical solution of the present invention.

Claims (5)

1. The image defogging method based on the multi-scale dark channel prior cascade deep neural network is characterized by comprising the following steps of:
step one, establishing a training set of atomized images: synthesizing a group of foggy image training sets by using an image data set with known depth according to an atmospheric scattering model;
step two, defogging of a single random foggy image, which comprises the following steps:
step 201, randomly extracting a foggy image from the foggy image training set of step one, and normalizing the image size of the single random foggy image to obtain an original single foggy image I of size 2^m × 2^n, wherein m and n are positive integers not less than 8;
step 202, down-sampling the original single foggy image I to obtain a first-scale original foggy image I^(1), a second-scale original foggy image I^(2), a third-scale original foggy image I^(3) and a fourth-scale original foggy image I^(4), wherein the resolution of the first-scale original foggy image I^(1) is 2^(m-4) × 2^(n-4), the resolution of the second-scale original foggy image I^(2) is 2^(m-3) × 2^(n-3), the resolution of the third-scale original foggy image I^(3) is 2^(m-2) × 2^(n-2), and the resolution of the fourth-scale original foggy image I^(4) is 2^(m-1) × 2^(n-1);
step 203, utilizing a first deep convolutional network (the network symbol appears only as a formula image in the original) to estimate, for the first-scale original foggy image I^(1), a first global atmospheric illumination A1, a first transmittance image T1, and an up-sampled image T1↑ of the first transmittance image T1; that is, the input of the first deep convolutional network is the first-scale original foggy image I^(1), and the outputs are the first global atmospheric illumination A1, the first transmittance image T1 and its up-sampled image T1↑, where w1 is the weight parameter set of the first deep convolutional network, Conv(·) is a convolution module, Maxpool(·) is a maximum pooling module, Gfl(·) is a guided filtering module, and Deconv(·) is a deconvolution module;
obtaining a first-scale defogged image D1 by using the atmospheric scattering model, wherein D1 = (I^(1) − A1)/T1 + A1;
step 204, utilizing a second deep convolutional network (the network symbol appears only as a formula image in the original) to estimate, for the second-scale original foggy image I^(2), a second global atmospheric illumination A2, a second transmittance image T2, and an up-sampled image T2↑ of the second transmittance image T2; that is, the input of the second deep convolutional network is the second-scale original foggy image I^(2), and the outputs are the second global atmospheric illumination A2, the second transmittance image T2 and its up-sampled image T2↑, where w2 is the weight parameter set of the second deep convolutional network;
obtaining a second-scale temporary defogged image J2 by using the atmospheric scattering model, wherein J2 = (I^(2) − A2)/T2 + A2, and Concat(·) is a superposition (concatenation) function;
fusing according to the fusion formula (formula image in the original) to obtain a second-scale defogged image D2;
step 205, utilizing a third deep convolutional network (the network symbol appears only as a formula image in the original) to estimate, for the third-scale original foggy image I^(3), a third global atmospheric illumination A3, a third transmittance image T3, and an up-sampled image T3↑ of the third transmittance image T3; that is, the input of the third deep convolutional network is the third-scale original foggy image I^(3), and the outputs are the third global atmospheric illumination A3, the third transmittance image T3 and its up-sampled image T3↑, where w3 is the weight parameter set of the third deep convolutional network;
obtaining a third-scale temporary defogged image J3 by using the atmospheric scattering model, wherein J3 = (I^(3) − A3)/T3 + A3;
fusing according to the fusion formula (formula image in the original) to obtain a third-scale defogged image D3;
step 206, utilizing a fourth deep convolutional network (the network symbol appears only as a formula image in the original) to estimate, for the fourth-scale original foggy image I^(4), a fourth global atmospheric illumination A4, a fourth transmittance image T4, and an up-sampled image T4↑ of the fourth transmittance image T4; that is, the input of the fourth deep convolutional network is the fourth-scale original foggy image I^(4), and the outputs are the fourth global atmospheric illumination A4, the fourth transmittance image T4 and its up-sampled image T4↑, where w4 is the weight parameter set of the fourth deep convolutional network;
obtaining a fourth-scale temporary defogged image J4 by using the atmospheric scattering model, wherein J4 = (I^(4) − A4)/T4 + A4;
fusing according to the fusion formula (formula image in the original) to obtain a fourth-scale defogged image D4;
step 207, utilizing a fifth deep convolutional network (the network symbol appears only as a formula image in the original) to estimate, for the original single foggy image I, a fifth global atmospheric illumination A5 and a fifth transmittance image T5; that is, the input of the fifth deep convolutional network is the original single foggy image I, and the outputs are the fifth global atmospheric illumination A5 and the fifth transmittance image T5, where w5 is the weight parameter set of the fifth deep convolutional network;
obtaining an original temporary defogged image J5 by using the atmospheric scattering model, wherein J5 = (I − A5)/T5 + A5;
fusing according to the fusion formula (formula image in the original) to obtain an original defogged image D5;
step three, computing the loss objective function L of the original single foggy image I according to the loss formula (formula image in the original), where i is the scale number with a value range of 1 to 5, Gi is the reference ground-truth image corresponding to image Di, Ni is the number of pixels in image Di, and Li is the adversarial loss corresponding to image Di;
step four, updating the weight parameter sets: the loss objective function L of the original single foggy image I is fed into an Adam optimizer to train and optimize the cascade defogging model fw, and each weight parameter set is updated during this process;
step five, taking a new single random foggy image and repeating step two through step four until the loss objective function L of the original single foggy image I satisfies L < Δ; at this point the training results w = {w1, w2, w3, w4, w5} of the weight parameter sets of the cascade defogging model fw are obtained and the final cascade defogging model fw is determined, where Δ is the loss objective function threshold;
step six, defogging a single actual foggy image: the trained cascade defogging model fw is used to defog the single actual foggy image, obtaining the defogged image of that single actual foggy image.
2. The image defogging method based on the multi-scale dark channel prior cascade deep neural network as claimed in claim 1, wherein the value ranges of m and n are both 8 to 12.
3. The image defogging method based on the multi-scale dark channel prior cascade deep neural network as claimed in claim 1, wherein in step two the weight parameter set w1 of the first deep convolutional network, the weight parameter set w2 of the second deep convolutional network, the weight parameter set w3 of the third deep convolutional network and the weight parameter set w4 of the fourth deep convolutional network are initially random initialization values.
4. The image defogging method based on the multi-scale dark channel prior cascade deep neural network as claimed in claim 1, wherein: the image dataset of known depth comprises a NYU image dataset.
5. The image defogging method based on the multi-scale dark channel prior cascade deep neural network as claimed in claim 1, wherein the value range of the loss objective function threshold Δ is 0 < Δ < 0.004.
CN201910673412.4A 2019-07-24 2019-07-24 Image defogging method based on multi-scale dark channel prior cascade deep neural network Active CN110363727B (en)

Priority Applications (1)

Application Number Priority Date Filing Date Title
CN201910673412.4A CN110363727B (en) 2019-07-24 2019-07-24 Image defogging method based on multi-scale dark channel prior cascade deep neural network

Applications Claiming Priority (1)

Application Number Priority Date Filing Date Title
CN201910673412.4A CN110363727B (en) 2019-07-24 2019-07-24 Image defogging method based on multi-scale dark channel prior cascade deep neural network

Publications (2)

Publication Number Publication Date
CN110363727A CN110363727A (en) 2019-10-22
CN110363727B true CN110363727B (en) 2020-06-12

Family

ID=68220887

Family Applications (1)

Application Number Title Priority Date Filing Date
CN201910673412.4A Active CN110363727B (en) 2019-07-24 2019-07-24 Image defogging method based on multi-scale dark channel prior cascade deep neural network

Country Status (1)

Country Link
CN (1) CN110363727B (en)

Families Citing this family (6)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN111161160B (en) * 2019-12-04 2023-07-18 新奇点企业管理集团有限公司 Foggy weather obstacle detection method and device, electronic equipment and storage medium
CN111833272B (en) * 2020-07-17 2021-07-16 南京理工大学 Image defogging method and system based on progressive feature fusion
CN111861939B (en) * 2020-07-30 2022-04-29 四川大学 Single image defogging method based on unsupervised learning
CN112767275B (en) * 2021-01-25 2021-10-22 中国人民解放军火箭军工程大学 Single image defogging method based on artificial sparse annotation information guidance
CN115272122B (en) * 2022-07-31 2023-03-21 中国人民解放军火箭军工程大学 Priori-guided single-stage distillation image defogging method
CN115456913A (en) * 2022-11-07 2022-12-09 四川大学 Method and device for defogging night fog map

Citations (3)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN106780356A (en) * 2016-11-15 2017-05-31 天津大学 Image defogging method based on convolutional neural networks and prior information
US9965835B2 (en) * 2014-11-28 2018-05-08 Axis Ab Defogging images and video
CN109712083A (en) * 2018-12-06 2019-05-03 南京邮电大学 A kind of single image to the fog method based on convolutional neural networks

Family Cites Families (2)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
KR102584522B1 (en) * 2016-12-27 2023-10-05 한화비전 주식회사 Image processing device and image enhancing method
CN108230264B (en) * 2017-12-11 2020-05-15 华南农业大学 Single image defogging method based on ResNet neural network

Patent Citations (3)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
US9965835B2 (en) * 2014-11-28 2018-05-08 Axis Ab Defogging images and video
CN106780356A (en) * 2016-11-15 2017-05-31 天津大学 Image defogging method based on convolutional neural networks and prior information
CN109712083A (en) * 2018-12-06 2019-05-03 南京邮电大学 A kind of single image to the fog method based on convolutional neural networks

Non-Patent Citations (1)

* Cited by examiner, † Cited by third party
Title
"Research and Application of a Single-Image Dehazing Algorithm Based on Convolutional Neural Networks"; Zuo Qing (左庆); www.cnki.net; 2019-05-01; full text *

Also Published As

Publication number Publication date
CN110363727A (en) 2019-10-22

Similar Documents

Publication Publication Date Title
CN110363727B (en) Image defogging method based on multi-scale dark channel prior cascade deep neural network
CN110288550B (en) Single-image defogging method for generating countermeasure network based on priori knowledge guiding condition
CN106910175B (en) Single image defogging algorithm based on deep learning
Wang et al. Fast image dehazing method based on linear transformation
CN110555465B (en) Weather image identification method based on CNN and multi-feature fusion
CN111738942A (en) Generation countermeasure network image defogging method fusing feature pyramid
CN109584188B (en) Image defogging method based on convolutional neural network
CN111161360B (en) Image defogging method of end-to-end network based on Retinex theory
CN110349093B (en) Single image defogging model construction and defogging method based on multi-stage hourglass structure
CN109816605A (en) A kind of MSRCR image defogging method based on multichannel convolutive
CN111667433A (en) Unmanned aerial vehicle image defogging method based on simple linear iterative clustering optimization
CN114219732A (en) Image defogging method and system based on sky region segmentation and transmissivity refinement
CN112419163A (en) Single image weak supervision defogging method based on priori knowledge and deep learning
CN113160286A (en) Near-infrared and visible light image fusion method based on convolutional neural network
CN110189262B (en) Image defogging method based on neural network and histogram matching
CN110349113B (en) Adaptive image defogging method based on dark primary color priori improvement
CN112785517B (en) Image defogging method and device based on high-resolution representation
CN112950521B (en) Image defogging method and generator network
CN117726545A (en) Image defogging method using non-local foggy line and multiple exposure fusion
CN107301625B (en) Image defogging method based on brightness fusion network
CN116664448B (en) Medium-high visibility calculation method and system based on image defogging
CN113628143A (en) Weighted fusion image defogging method and device based on multi-scale convolution
CN113487509A (en) Remote sensing image fog removing method based on pixel clustering and transmissivity fusion
CN112907461A (en) Defogging and enhancing method for infrared degraded image in foggy day
CN116385293A (en) Foggy-day self-adaptive target detection method based on convolutional neural network

Legal Events

Date Code Title Description
PB01 Publication
SE01 Entry into force of request for substantive examination
GR01 Patent grant