US20190114747A1 - Image dehazing and restoration - Google Patents
- Publication number
- US20190114747A1 (application US16/092,053)
- Authority
- US
- United States
- Prior art keywords
- pixels
- digital image
- haze
- image
- color
- Prior art date
- Legal status (The legal status is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the status listed.)
- Granted
Classifications
-
- G—PHYSICS
- G06—COMPUTING; CALCULATING OR COUNTING
- G06T—IMAGE DATA PROCESSING OR GENERATION, IN GENERAL
- G06T5/00—Image enhancement or restoration
- G06T5/73—Deblurring; Sharpening
-
- G06T5/003—
-
- G—PHYSICS
- G06—COMPUTING; CALCULATING OR COUNTING
- G06F—ELECTRIC DIGITAL DATA PROCESSING
- G06F16/00—Information retrieval; Database structures therefor; File system structures therefor
- G06F16/90—Details of database functions independent of the retrieved data types
- G06F16/901—Indexing; Data structures therefor; Storage structures
- G06F16/9027—Trees
-
- G—PHYSICS
- G06—COMPUTING; CALCULATING OR COUNTING
- G06F—ELECTRIC DIGITAL DATA PROCESSING
- G06F18/00—Pattern recognition
- G06F18/20—Analysing
- G06F18/22—Matching criteria, e.g. proximity measures
-
- G06K9/6215—
-
- G—PHYSICS
- G06—COMPUTING; CALCULATING OR COUNTING
- G06T—IMAGE DATA PROCESSING OR GENERATION, IN GENERAL
- G06T7/00—Image analysis
- G06T7/90—Determination of colour characteristics
-
- G—PHYSICS
- G06—COMPUTING; CALCULATING OR COUNTING
- G06T—IMAGE DATA PROCESSING OR GENERATION, IN GENERAL
- G06T2207/00—Indexing scheme for image analysis or image enhancement
- G06T2207/10—Image acquisition modality
- G06T2207/10024—Color image
-
- G—PHYSICS
- G06—COMPUTING; CALCULATING OR COUNTING
- G06T—IMAGE DATA PROCESSING OR GENERATION, IN GENERAL
- G06T2207/00—Indexing scheme for image analysis or image enhancement
- G06T2207/20—Special algorithmic details
- G06T2207/20048—Transform domain processing
- G06T2207/20061—Hough transform
Definitions
- the invention relates to the field of digital image processing.
- the term image means a digital image as collected by a digital camera sensor from photons emitted by a physical scene, and stored on a non-transitory computer-readable storage medium.
- the digital image comprises pixels, where each pixel has three or more color channel values.
- Haze, for example, results from small particles in the air that scatter the light in the atmosphere.
- Fog results from tiny water droplets suspended in the air near the earth's surface.
- Haze and fog are independent of scene radiance and have two effects on the acquired image: they attenuate the signal of the viewed scene, and introduce an additive component to the image, termed the ambient light, or airlight (the color of a scene point at infinity).
- airlight means a set of values representing color, such as the red, green, and blue color channel values, that represent the color of the haze in the images where no objects are visualized.
- the image degradation caused by haze or fog increases with the distance from the camera, since the scene radiance decreases and the airlight magnitude increases.
- hazy or foggy images may be modeled as a per-pixel convex combination of a haze/fog-free image and the global airlight.
- Images taken in media other than air may suffer from similar degradation.
- some media, such as water, are characterized by wavelength-dependent transmission, which distorts the colors in the resulting image. Compensating for this wavelength-dependent transmission may require applying different attenuation coefficients for different color channels in the image. This is sometimes done with underwater imagery.
- One embodiment provides a method for dehazing a digital image, comprising: operating at least one hardware processor to: cluster pixels of a digital image into haze-lines, wherein each of the haze-lines is comprised of a sub-group of the pixels that are scattered non-locally over the digital image; estimate, based on the haze-lines, a transmission map of the digital image, wherein the transmission map encodes scene depth information for each pixel of the digital image; and calculate a dehazed digital image based on the transmission map.
- Another embodiment provides a method for restoring a digital image, comprising: operating at least one hardware processor to: convert a digital image of an underwater scene to a plurality of medium-compensated images that are each based on attenuation coefficient ratios of a different water type; and for each of the plurality of medium-compensated images: (a) cluster pixels of the medium-compensated image into haze-lines, wherein each of the haze-lines is comprised of a sub-group of the pixels that are scattered non-locally over the medium-compensated image, (b) estimate, based on the haze-lines, a transmission map of the medium-compensated image, wherein the transmission map encodes scene depth information for each pixel of the medium-compensated image, and (c) calculate, based on the transmission map and the attenuation coefficient ratios, a restored digital image of the underwater scene.
- the clustering of pixels comprises: representing colors of the pixels of the digital image in a spherical coordinate system whose origin is an estimated global airlight value; uniformly sampling the unit sphere of the representation, to output a plurality of color samples each associated with one of the pixels of the digital image; grouping the color samples based on their θ (Theta) and φ (Phi) angles in the spherical coordinate system, according to a mutual closest point on the unit sphere, thereby producing multiple groups each being one of the haze-lines.
- the clustering of pixels comprises representing the color differences between the pixels of the digital image and the airlight value on a pre-computed tessellation of the unit sphere, where the pre-computed tessellation is uniformly sampled and stored in Cartesian coordinates in a KD-tree.
- the clustering of pixels comprises searching for nearest neighbors on the KD-tree using Euclidean distance coordinates.
- the clustering of pixels comprises grouping the color samples based on the nearest neighbors, thereby producing multiple groups each being one of the haze-lines.
- the estimating of the transmission map comprises: estimating an initial transmission map as the quotient, for each individual pixel of the pixels of the digital image, of: (a) a distance of the individual pixel from an airlight value, and (b) a distance from a pixel which is farthest away from the airlight value and belongs to the same haze-line as the individual pixel; regularizing the initial transmission map by enforcing a smoothness of the digital image on the initial transmission.
- the estimating of the transmission map comprises: estimating an initial transmission map as the quotient, for each individual pixel of the pixels of the digital image, of: (a) a distance of the individual pixel from a veiling-light value, and (b) a distance from a pixel which is farthest away from the veiling-light value and belongs to the same haze-line as the individual pixel; regularizing the initial transmission map by enforcing a smoothness of the digital image on the initial transmission.
- the method further comprises operating said at least one hardware processor to: for each of the restored digital images: (a) perform global white balancing of the restored digital image, to output a white-balanced image, (b) calculate a standard deviation of a red channel of the white-balanced image and of a green channel of the white-balanced image; and output the white-balanced image having the lowest standard deviation.
- the method further comprises computing the estimated global veiling-light value by: generating an edge map of the digital image; thresholding the edge map, to produce multiple pixel blobs; and determining that a color or an average color of pixels making up a largest one of the multiple pixel blobs, is the global veiling-light value.
- Another embodiment provides a system that comprises: an image sensor configured to acquire the digital image of any one of the embodiments listed above; a non-transitory computer-readable storage medium having stored thereon program instructions to perform the steps of any one of the embodiments listed above; and at least one hardware processor configured to execute the program instructions.
- a further embodiment provides a computer program product comprising a non-transitory computer-readable storage medium having program code embodied therewith, the program code executable by at least one hardware processor to perform the steps of any one of the embodiments listed above.
- Another embodiment provides a method for estimating a set of airlight color channel values for a digital image.
- the method comprising operating at least one hardware processor to automatically perform the method actions.
- the method comprising an action of receiving a digital image comprising a plurality of pixels, each pixel comprising at least three color channel values.
- the method comprising for each of the plurality of pixels, an action of assigning, based on the color channel values, a Hough transform vote for each of the plurality of pixels to at least one of a plurality of candidate airlight color channel value sets, each of the sets comprising at least three airlight color channel values.
- the method comprising, based on the assigned votes, an action of selecting one of the sets as the airlight color channel value set of the digital image.
- each pixel color channel value and each airlight color channel value is one of a red channel value, a green channel value, and a blue channel value.
- the assigning comprises computing for each pixel a plurality of distances, in a color channel value space, between each pixel and a plurality of candidate haze-lines, wherein each of the plurality of candidate haze-lines is defined by (a) one of the plurality of candidate airlight color channel value sets and (b) one of a plurality of solid angles.
- the assigning comprises comparing the plurality of distances with an adaptive threshold, wherein the adaptive threshold is based on the distance from each pixel to the respective one of the plurality of candidate airlight color channel value sets.
- the assigning comprises, for each pixel, assigning at least one vote to some of the plurality of candidate airlight color channel value sets based on the comparison.
- the at least one of the plurality of candidate airlight color channel value sets that is voted for is brighter than the voting pixel.
- the method further comprises selecting, for each pixel, a plurality of subsets, each subset being a unique combination of at least two color channel values, thereby producing at least three limited color channel datasets.
- the method further comprises performing the steps of assigning and selecting for each of the at least three limited color channel datasets, producing at least three selected airlight color channel value sets.
- the method further comprises combining the at least three selected airlight color channel values to produce a single airlight color channel value set.
- the method further comprises grouping the plurality of pixel color values into a plurality of clusters, wherein the vote is assigned for each of the plurality of clusters.
- the plurality of clusters are grouped by at least one of a k-means algorithm and a Minimum Variance Quantization algorithm.
- the assigned vote for each of the plurality of clusters is weighted by a statistical parameter of each respective cluster.
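The airlight voting scheme described above can be illustrated with a small sketch. It is not the patented implementation: the function names, the base threshold `tau0`, the exact form of the adaptive threshold (growing linearly with the pixel-to-candidate distance), and the reading of "brighter" as "at least as bright in every channel" are all assumptions made for illustration. Each candidate haze-line is represented by a candidate airlight plus a unit direction vector standing in for a solid angle.

```python
import math

def point_line_distance(p, origin, direction):
    """Distance from point p to the line through `origin` along unit `direction`."""
    v = [pc - oc for pc, oc in zip(p, origin)]
    along = sum(vc * dc for vc, dc in zip(v, direction))
    perp_sq = sum(vc * vc for vc in v) - along * along
    return math.sqrt(max(perp_sq, 0.0))  # clamp tiny negative rounding errors

def estimate_airlight(pixels, candidates, directions, tau0=0.02):
    """Hough-style voting: each pixel votes for every candidate airlight that
    is at least as bright as the pixel and has a candidate haze-line passing
    close to the pixel. tau0 is a hypothetical base threshold."""
    votes = [0] * len(candidates)
    for i, a in enumerate(candidates):
        for p in pixels:
            # Assumed brightness test: candidate >= pixel in every channel.
            if any(ac < pc for ac, pc in zip(a, p)):
                continue
            # Assumed adaptive threshold: grows with distance to the candidate.
            tau = tau0 * (1.0 + math.dist(p, a) / math.sqrt(3.0))
            if any(point_line_distance(p, a, d) < tau for d in directions):
                votes[i] += 1
    best = max(range(len(candidates)), key=votes.__getitem__)
    return candidates[best], votes
```

A usage example on synthetic data: nine pixels lie on three lines through the airlight (0.8, 0.8, 0.8), so that candidate collects a vote from every pixel.

```python
directions = [(-1.0, 0.0, 0.0), (0.0, -1.0, 0.0), (0.0, 0.0, -1.0)]
pixels = [(0.6, 0.8, 0.8), (0.4, 0.8, 0.8), (0.2, 0.8, 0.8),
          (0.8, 0.6, 0.8), (0.8, 0.4, 0.8), (0.8, 0.2, 0.8),
          (0.8, 0.8, 0.6), (0.8, 0.8, 0.4), (0.8, 0.8, 0.2)]
candidates = [(1.0, 1.0, 1.0), (0.8, 0.8, 0.8), (0.6, 0.8, 0.8)]
airlight, votes = estimate_airlight(pixels, candidates, directions)
```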
- FIG. 1 illustrates the haze-lines prior
- FIGS. 2a-2c show a comparison between haze-lines and color lines
- FIGS. 3a-3d illustrate a validation of the haze-lines prior
- FIG. 4 shows an airlight-centered spherical representation
- FIGS. 5a-5b show distance distribution per haze-line
- FIGS. 6a-6h show intermediate and final results of the present dehazing technique
- FIGS. 7a, 7b, and 7c(i)-7c(ii) show a comparison of the present dehazing technique and previous methods, on natural images
- FIGS. 8a-8d show a comparison of transmission maps and dehazed images of the present dehazing technique versus previous methods
- FIG. 9 illustrates the advantages of the present, global, approach over a patch-based approach
- FIG. 10 illustrates color clustering
- FIG. 11 illustrates attenuation coefficients of Jerlov water types
- FIGS. 12a-12f demonstrate the effect of scattering medium on observed scene colors
- FIGS. 13a-13c demonstrate the importance of water type matching
- FIG. 14 shows a comparison of the present restoration technique and previous methods, on natural images
- FIG. 15 shows a comparison of the transmission maps of the present restoration technique and previous methods
- FIGS. 16a-16d show a comparison of the present restoration technique and previous methods, on a JPEG-compressed input image
- FIG. 17 shows an exemplary underwater image and a veiling light pixel blob computed for that image
- FIGS. 18a-18f show an exemplary selection of airlight values from Hough transform
- FIGS. 19a-19d show images used for comparison of selection of airlight values computed with Hough transform and with alternative techniques
- FIGS. 20a-20h show a comparison of selections of airlight values from Hough transform for images with and without visible sky
- FIGS. 21a-21d show a comparison of dehazing between Hough transform and an alternative technique
- Disclosed herein is a single-image dehazing technique that operates globally on a hazy image without having to divide the image into patches.
- the technique relies on the assumption that colors of a haze-free image are well approximated by a few hundred distinct colors that form tight clusters in red-green-blue (RGB) space, where each color is a set of three values, each representing the intensity of one color channel.
- a key observation of the present application is that pixels in a given cluster are often non-local, i.e., they are spread over the entire image plane and are located at different distances from the camera. In the presence of haze, these varying distances translate to different transmission coefficients.
- each color cluster in the hazy image becomes a shape (such as a line, arc, curve, and/or the like, or a combination thereof) in RGB space, which is termed here a “haze-line”.
- the RGB values of the cluster pixels are substantially along a line (such as substantially colinear) extending through the airlight (or veiling-light) RGB value.
- the correlation coefficient between a model shape and the pixel color values may be used to determine the haze-line.
- the haze-line model may be loosely (i.e., elastically) associated with the airlight point, rigidly associated with the airlight point (i.e., constrained fitting), and/or the like.
- the present technique recovers both the distance map and the haze-free image.
- the technique is linear in the size of the image, deterministic, and requires no training. It performs well on a wide variety of images and is competitive with other state-of-the-art methods.
- the adapted technique takes into account the different attenuation coefficient for the different color channels, affected by the medium in which the imaging takes place.
- the disclosed technique aims to recover, out of a hazy image, the RGB values of a haze-free image.
- Another, optional, aim is to recover the transmission (the coefficient of the convex combination) for each pixel, which provides a precursor to the scene depth.
- the present technique is referred to as “dehazing”, and such terminology is used throughout the specification.
- the technique may also apply to foggy images, underwater images, and/or the like.
- the atmospheric phenomena of haze and fog are similar in how they affect photographs. Accordingly, it is hereby intended that the term “haze”, and any grammatical inflections thereof, is interpreted as relating to haze, fog, or any like image degradation due to light's reflection, refraction, scattering, absorption, dispersion, and/or the like.
- the present technique is global and does not divide the image into patches. Patch-based methods take great care to avoid artifacts by either using multiple patch sizes or taking into consideration patch overlap and regularization using connections between distant pixels.
- the pixels that form the haze-lines are spread across the entire image and therefore capture a global phenomenon that is not limited to small image patches.
- our prior is more robust and significantly more efficient in run-time.
- the present technique is an efficient algorithm that is linear in the size of the image. We automatically detect haze-lines and use them to dehaze the image. Also presented here are the results of extensive experiments conducted by the inventors to validate the technique and report quantitative and qualitative results on many outdoor images.
- an edge map of the image is generated using an edge detection tool, such as, for example, the Structured Edge Detection Toolbox (P. Dollár and C. L. Zitnick. Structured forests for fast edge detection. In Proc. IEEE ICCV, 2013; available online at: https://github.com/pdollar/edges, last viewed Mar. 27, 2017), and the edge map is thresholded to produce multiple connected components (i.e., multiple pixel blobs).
- the pixels belonging to the largest connected component are classified as veiling-light pixels (x ∈ VL).
- An example may be seen in FIG. 17, where the bottom frame shows only the veiling light pixels of the top image.
- the veiling-light A is the color of these pixels, or, when the thresholding yielded multiple colors, the average color of these pixels.
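The veiling-light estimation described above can be sketched as follows. This is a minimal illustration, not the patented implementation: it assumes the edge map is already available as a 2D list of edge strengths (the Structured Edge Detection Toolbox is not called), that blobs are 4-connected regions of pixels whose edge strength is below the threshold, and that the function names and the default threshold are hypothetical.

```python
from collections import deque

def largest_smooth_blob(edge_map, threshold):
    """Return coordinates of the largest 4-connected blob of pixels whose
    edge strength is below `threshold` (assumed meaning of thresholding)."""
    h, w = len(edge_map), len(edge_map[0])
    seen = [[False] * w for _ in range(h)]
    best = []
    for y0 in range(h):
        for x0 in range(w):
            if seen[y0][x0] or edge_map[y0][x0] >= threshold:
                continue
            # Breadth-first flood fill of one connected component.
            blob, queue = [], deque([(y0, x0)])
            seen[y0][x0] = True
            while queue:
                y, x = queue.popleft()
                blob.append((y, x))
                for ny, nx in ((y + 1, x), (y - 1, x), (y, x + 1), (y, x - 1)):
                    if (0 <= ny < h and 0 <= nx < w and not seen[ny][nx]
                            and edge_map[ny][nx] < threshold):
                        seen[ny][nx] = True
                        queue.append((ny, nx))
            if len(blob) > len(best):
                best = blob
    return best

def estimate_veiling_light(image, edge_map, threshold=0.1):
    """Average RGB color over the largest smooth blob."""
    blob = largest_smooth_blob(edge_map, threshold)
    if not blob:
        raise ValueError("no pixel falls below the edge threshold")
    n = len(blob)
    return tuple(sum(image[y][x][c] for y, x in blob) / n for c in range(3))
```

Here `image[y][x]` is an RGB triple and `edge_map[y][x]` is the edge strength at the same pixel; a production version would operate on arrays rather than nested lists.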
- β denotes the attenuation coefficient of the atmosphere and d(x) denotes the distance of the scene at pixel x.
- β is wavelength dependent and therefore t is different per color channel, as discussed in S. G. Narasimhan and S. K. Nayar. Chromatic framework for vision in bad weather.
- Instant dehazing of images using polarization. In Proc. IEEE CVPR, 2001. This dependency has been assumed negligible in many previous single image dehazing methods, to reduce the number of unknowns. We follow this assumption.
- the transmission t(x) acts as the matting coefficient between the scene J and the airlight A.
- Eq. (1) has three measurements I(x) and four unknowns: J(x) and t(x), resulting in an under-determined estimation problem.
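Eq. (1) itself is not reproduced in this extract. Based on the statements above (a per-pixel convex combination of the haze-free image and the global airlight, with the transmission as the matting coefficient and an attenuation coefficient acting over the scene distance), the model presumably takes the following standard form; the exponential expression for the transmission is an assumption consistent with those definitions rather than a quotation:

```latex
\begin{align}
  I(x) &= t(x)\, J(x) + \bigl(1 - t(x)\bigr)\, A, \tag{1}\\
  t(x) &= e^{-\beta\, d(x)}, \\
  J(x) &= \frac{I(x) - A}{t(x)} + A .
\end{align}
```

With A known, the three color channels of I(x) give three measurements per pixel, while the three channels of J(x) plus the scalar t(x) give four unknowns, which is why the problem is under-determined. The last line shows how the haze-free color follows once t(x) has been estimated.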
- the present technique is based on the observation that the number of distinct colors in an image is orders of magnitude smaller than the number of pixels, as presented, for example, by Orchard et al. (1999), Id. This assumption has been used extensively in the past and is used for saving color images using indexed colormaps. The present inventors have validated and quantified it on the Berkeley Segmentation Dataset (BSDS300), available online at http://www.eecs.berkeley.edu/Research/Projects/CS/vision/grouping/segbench/, last viewed Apr. 2, 2016. This is a diverse dataset of clear outdoor natural images and thus represents the type of scenes that might be degraded by haze.
- In FIG. 1, a haze-free image is clustered using K-means into 500 clusters. The pixels belonging to four of these clusters are marked by different color markers in FIG. 1a and their RGB coordinates are plotted in FIG. 1b, demonstrating tight clusters. Note that the clusters include pixels distributed over the entire image that come from objects at different distances from the camera. A synthetic hazy image was generated from the clear image (FIG. 1c) by the method used in R. Fattal. Dehazing using color-lines. ACM Trans. Graph., 34(1):13, 2014. The same pixels as in FIG. 1a are marked. However, now, colors of pixels that belonged to the same color cluster are no longer similar.
- FIG. 2 demonstrates the haze-lines prior on a hazy outdoor image.
- Six different pixels identified by our method as belonging to the same haze-line are circled. All of them are on shaded tree trunks and branches, and are likely to have similar radiance J. However, their observed intensity I is quite different, as shown in FIG. 2b, where these pixels form a haze-line in RGB space that passes through the airlight.
- the present technique in some embodiments thereof, is composed of three core steps: clustering the pixels into haze-lines, estimating a transmission map, and dehazing.
- the estimation of the transmission map is divided into two: first, an estimation of an initial transmission map; second, a regularization step which yields a more accurate transmission map.
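The initial transmission estimate and the final dehazing step can be sketched as follows, assuming the pixels have already been assigned haze-line labels. The initial transmission follows the quotient given in the embodiments (a pixel's distance from the airlight divided by the largest such distance within the same haze-line). The regularization step is omitted, and the function names and the lower bound `t_min` are assumptions added for illustration, the latter only to avoid dividing by very small transmissions.

```python
import math

def initial_transmission(pixels, labels, airlight):
    """Per-pixel ratio r / r_max, where r is the distance from the airlight
    and r_max is the largest r within the pixel's haze-line."""
    radii = [math.dist(p, airlight) for p in pixels]
    r_max = {}
    for r, k in zip(radii, labels):
        r_max[k] = max(r_max.get(k, 0.0), r)
    return [r / r_max[k] if r_max[k] > 0 else 1.0
            for r, k in zip(radii, labels)]

def dehaze(pixels, transmission, airlight, t_min=0.1):
    """Invert the convex-combination haze model: J = (I - A) / t + A.
    t_min is a hypothetical lower bound on the transmission."""
    out = []
    for p, t in zip(pixels, transmission):
        t = max(t, t_min)
        out.append(tuple((c - a) / t + a for c, a in zip(p, airlight)))
    return out
```

For instance, with airlight (0.8, 0.8, 0.8), a haze-free pixel (0.2, 0.4, 0.0) and a second pixel of the same color seen through transmission 0.5, that is (0.5, 0.6, 0.4), both pixels fall on one haze-line; the estimated transmissions come out as approximately 1.0 and 0.5, and dehazing maps both back to approximately (0.2, 0.4, 0.0). The estimate relies on each haze-line containing at least one nearly haze-free pixel.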
- Embodiments of the present technique use an example of an RGB color channel input image.
- when a non-RGB input image is provided (such as CMYK, YIQ, YUV, YDbDr, YPbPr, YCbCr, xvYCC, HSV, HSL, etc.), it may first be converted to RGB using techniques known in the art.
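As a small illustration of converting a non-RGB pixel to RGB before dehazing, the sketch below uses Python's standard `colorsys` module, which covers only a few of the color spaces listed above (HSV, HSL, and YIQ). The wrapper function `to_rgb` and its string labels are hypothetical; other spaces such as CMYK or YCbCr would need their own conversions.

```python
import colorsys

def to_rgb(pixel, space):
    """Convert one pixel (components in [0, 1]) to an RGB triple.
    Only the spaces supported by the standard colorsys module are handled."""
    if space == "RGB":
        return tuple(pixel)
    if space == "HSV":
        return colorsys.hsv_to_rgb(*pixel)
    if space == "HSL":
        h, s, l = pixel
        return colorsys.hls_to_rgb(h, l, s)  # colorsys orders arguments H, L, S
    if space == "YIQ":
        return colorsys.yiq_to_rgb(*pixel)
    raise ValueError(f"unsupported color space: {space}")
```

For example, `to_rgb((0.0, 1.0, 1.0), "HSV")` yields pure red.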
- the present technique may operate on any color space, with or without respective modifications.
- the present technique may work directly on non-RGB color spaces with linear transformation to RGB space.
- equivalent embodiments may be applied to any spectral image space, such as two color channel, three color channel, four color channel, and/or the like.
- the maximum number of color channels that an embodiment may automatically process is limited by the limitations of the physical processing hardware, and may include fields of applications that have other technical problems from those described herein, such as the image dehazing of images depicting a landscape, seascape, and/or the like. Therefore, the number of color channels of an image to be automatically processed by an embodiment may be a range between 2 and 15, 3 and 20, 4 and 10, 5 and 25, or any combination thereof.
- range format is merely for convenience and brevity and should not be construed as an inflexible limitation on the scope of the invention. Accordingly, the description of a range should be considered to have specifically disclosed all the possible subranges as well as individual numerical values within that range. For example, description of a range such as from 1 to 6 should be considered to have specifically disclosed subranges such as from 1 to 3, from 1 to 4, from 1 to 5, from 2 to 4, from 2 to 6, from 3 to 6 etc., as well as individual numbers within that range, for example, 1, 2, 3, 4, 5, and 6. This applies regardless of the breadth of the range.
- whenever a numerical range is indicated herein, it is meant to include any cited numeral (fractional or integral) within the indicated range.
- the phrases “ranging/ranges between” a first indicated number and a second indicated number and “ranging/ranges from” a first indicated number “to” a second indicated number are used herein interchangeably and are meant to include the first and second indicated numbers and all the fractional and integral numerals therebetween.
- embodiments may be implemented for different imaging modalities, such as different camera images, stereo camera images, photon sensor images, electromagnetic radiation images, particle images, and/or the like.
- the primary criterion for application to a modality is that the “haze-lines” can be modelled in the color space as an analytical shape (line, arc, parabola, etc.) and that the “airlight” value can be used to remove the unwanted image characteristic.
- images may be in two dimensions, three dimensions, four dimensions, five dimensions, and/or the like.
- a two channel embodiment may use the techniques described herein to partially process the dehazing of an image, or otherwise remove unwanted image characteristics (i.e. glare, prismatic effects, and/or the like).
- multispectral or hyperspectral images may be processed, such as remote sensing atmospheric images comprising 5 color channels (i.e., atmospheric infrared transparency windows) to remove cloud cover, hazing, glare, and/or the like.
- Such augmented images may better be used to compute sea surface temperature, vegetation indices, and/or the like.
- dual-energy computed tomography images may be processed using embodiments of the techniques to remove ghosting.
- the first core step is finding the haze-lines.
- A may be estimated using conventional methods, such as those in R. Fattal. Single image dehazing. ACM Trans. Graph., 27(3):72, 2008; K. He, J. Sun, and X. Tang. Single image haze removal using dark channel prior. In Proc. IEEE CVPR, 2009; and R. Tan. Visibility in bad weather from a single image. In Proc. IEEE CVPR, 2008.
- r denotes the distance to the origin (i.e., ‖I − A‖)
- θ and φ denote the longitude and latitude, respectively.
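The change of coordinates described above may be illustrated with a minimal sketch. It assumes that θ is measured in the R-G plane and that φ is measured from the B axis so that it falls in [0, π]; the function name `to_spherical` and this axis convention are illustrative assumptions, not taken from the text.

```python
import math

def to_spherical(pixel, airlight):
    """Return (r, theta, phi) of the airlight-centered color I_A = I - A.

    Assumed convention: theta is the longitude in [0, 2*pi) measured in the
    R-G plane, phi is the angle from the B axis in [0, pi].
    """
    dx, dy, dz = (p - a for p, a in zip(pixel, airlight))
    r = math.sqrt(dx * dx + dy * dy + dz * dz)
    if r == 0.0:
        # The pixel equals the airlight, so its direction is undefined.
        return 0.0, 0.0, 0.0
    theta = math.atan2(dy, dx) % (2.0 * math.pi)
    phi = math.acos(max(-1.0, min(1.0, dz / r)))
    return r, theta, phi
```

Two pixels that lie on the same ray from the airlight share the same angles and differ only in r, which is the property the haze-line grouping relies on.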
- FIG. 4 shows the histogram of the Forest image ( FIG. 2 a ) projected onto a sphere.
- the sphere was sampled uniformly using 500 points.
- the color at each point [φ, θ] indicates the number of pixels x with these angles when writing I_A(x) in spherical coordinates (image size 768 × 1024 pixels).
- the color-mapping is logarithmic for illustration purposes.
- the histogram indicates that the pixels are highly concentrated in terms of their longitude and latitude.
- pixels belong to the same haze-line when their [φ(x), θ(x)] values are similar.
- Each point on the sphere in FIG. 4 represents a haze-line, in which all the pixels have approximately the same angles [φ(x), θ(x)].
- the pixels in each haze-line have similar values in the non-hazy image J with high probability.
- In order to determine which pixels are on the same haze-line, pixels should be grouped according to their angles [φ, θ].
- a two-dimensional (2D) histogram binning of θ and φ with uniform edges in the range [0, 2π] × [0, π] may not generate a uniform sampling of a sphere. Instead, the samples may be denser near the poles, as observed by G. Marsaglia. Choosing a point from the surface of a sphere. Ann. Math. Statist., 43(2):645-646, 04 1972, since the distance on the sphere is relative to sin(θ). Therefore, we sample the unit sphere uniformly, as shown in FIG. 4 , where each vertex is a sample point.
- Each vertex corresponds to a haze-line.
- the number of samples in FIG. 1 is smaller than the actual number we use.
- We group the pixels based on their [φ(x), θ(x)] values, according to the closest sample point on the surface of the sphere, where the angles are computed from the color differences between the RGB values of the pixels of the digital image and the airlight color values. This may be implemented efficiently by building a KD-Tree from the pre-defined tessellation and querying the tree for each pixel. This is much faster than running a clustering algorithm, such as K-means, a Minimum Variance Quantization algorithm, or the like.
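The grouping step may be sketched as follows. This is an illustration under stated assumptions: a Fibonacci lattice stands in for the pre-defined uniform tessellation, and a brute-force search stands in for the KD-Tree query (a KD-Tree would return the same nearest sample, only faster). The names `fibonacci_sphere` and `assign_haze_lines` are hypothetical.

```python
import math

def fibonacci_sphere(n):
    """Return n roughly uniform unit vectors (assumed stand-in tessellation)."""
    golden_angle = math.pi * (3.0 - math.sqrt(5.0))
    points = []
    for i in range(n):
        z = 1.0 - 2.0 * (i + 0.5) / n
        radius = math.sqrt(max(0.0, 1.0 - z * z))
        angle = golden_angle * i
        points.append((radius * math.cos(angle), radius * math.sin(angle), z))
    return points

def assign_haze_lines(pixels, airlight, samples):
    """Map each pixel to the index of the sample closest in direction to I - A."""
    labels = []
    for pixel in pixels:
        diff = [p - a for p, a in zip(pixel, airlight)]
        norm = math.sqrt(sum(d * d for d in diff))
        if norm == 0.0:
            labels.append(-1)  # pixel equals the airlight: no direction
            continue
        unit = [d / norm for d in diff]
        # For unit vectors, the largest dot product is the smallest
        # Euclidean distance, so this matches a nearest-neighbor query.
        best = max(
            range(len(samples)),
            key=lambda k: sum(u * s for u, s in zip(unit, samples[k])),
        )
        labels.append(best)
    return labels
```

Each returned label plays the role of a haze-line index; pixels with the same label are treated as belonging to the same haze-line regardless of their position in the image plane.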
- the technique yields a range of between 10-50, 50-100, 100-200, 200-300, 300-400, 400-500, 500-600, 600-700, 700-800, 800-900, or more than 900 haze-lines—each of these ranges constituting a different embodiment.
- the number of haze-lines is dependent on the number of colors in the image; generally, a very colorful image would yield a large number of haze-lines (e.g., above 400), while a relatively pale image would yield a lower number (e.g., below 50).
- FIG. 5 a depicts the layout of two different haze-lines in the image plane for the Forest image. Pixels belonging to two different haze-lines are depicted in green and blue, respectively.
- FIG. 5 b is a histogram of r(x) within each cluster. The horizontal axis is limited to the range [0, ‖A‖], as no pixel may have a radius outside that range in this particular image.
- the second core step of the present technique is to estimate the transmission map.
- this core step is broken into two.
- FIG. 5 b displays the radii histograms of the two clusters shown in FIG. 5 a .
- it is assumed that the farthest pixel from the airlight is haze-free, and that such a pixel exists for every haze-line. This assumption does not hold for all of the haze-lines in an image; however, the regularization step partially compensates for it.
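Under this assumption, the initial per-pixel transmission may be sketched as the ratio between a pixel's radius and the largest radius found on its haze-line. The function below is a minimal illustration; the name `initial_transmission` and the list-based inputs are assumptions made for the example.

```python
def initial_transmission(radii, labels):
    """Estimate the initial transmission as r(x) / r_max(haze-line of x).

    radii  -- distance of every pixel from the airlight in RGB space
    labels -- haze-line index of every pixel
    """
    r_max = {}
    for r, line in zip(radii, labels):
        r_max[line] = max(r_max.get(line, 0.0), r)
    # The farthest pixel on each haze-line is assumed haze-free (t = 1).
    return [
        r / r_max[line] if r_max[line] > 0.0 else 1.0
        for r, line in zip(radii, labels)
    ]
```

For example, two pixels on the same haze-line with radii 0.2 and 0.4 receive transmissions 0.5 and 1.0. A haze-line with only a few pixels, or with a narrow spread of radii, gives an unreliable maximum, which is why the later regularization weights this estimate by a per-haze-line confidence.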
- a regularization step may take place due to the following reason.
- the initial transmission is estimated using the haze-lines, without using any spatial information.
- nearby pixels that were clustered to different haze-lines might have significantly different transmission values, while in reality they are nearly at the same distance from the camera.
- the regularization enforces the image smoothness on the transmission. Where the image is smooth, we expect to find the same object at a similar distance and therefore expect the transmission to change smoothly.
- where there is a significant gradient (color variance) in the image, it is likely to correspond to a depth discontinuity, and we might see a discontinuity in the transmission as well.
- $$t_{LB}(x) = 1 - \min_{c \in \{R,G,B\}} \left\{ \frac{I_c(x)}{A_c} \right\} \qquad \text{Eq. (13)}$$
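The lower bound of Eq. (13) may be computed per pixel as in the following sketch. The clamping helper is an assumption about how the bound could be combined with the initial estimate (taking the larger of the two and capping at 1); both function names are hypothetical.

```python
def transmission_lower_bound(pixel, airlight):
    """Eq. (13): t_LB = 1 - min over channels of I_c / A_c."""
    return 1.0 - min(i / a for i, a in zip(pixel, airlight))

def clamp_transmission(t_initial, pixel, airlight):
    """Assumed use of the bound: keep the estimate within [t_LB, 1]."""
    return min(1.0, max(t_initial, transmission_lower_bound(pixel, airlight)))
```

For a pixel (0.4, 0.6, 0.8) and airlight (0.8, 0.8, 0.8), the channel ratios are 0.5, 0.75 and 1.0, so the bound is 0.5; an initial estimate of 0.3 would be raised to 0.5, while an estimate of 0.7 would be left unchanged.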
- the estimation in Eq. (12) is performed per-pixel, without imposing spatial coherency. This estimation may be inaccurate when only a small number of pixels were mapped to a particular haze-line, or in very hazy areas, where r(x) is very small and noise may affect the angles significantly.
- the transmission map should be smooth, except for depth discontinuities, as observed by Fattal et al. (2014), Id.; K. Nishino, L. Kratz, and S. Lombardi. Bayesian defogging. Int. Journal of Computer Vision ( IJCV ), 98(3):263-278, 2012; Tan (2008), Visibility in bad weather from a single image, in Proc. IEEE CVPR, 2008; and J.-P. Tarel and N. Hautiere. Fast visibility restoration from a single color or gray level image. In Computer Vision, 2009 IEEE 12th International Conference on, pages 2201-2208, September 2009 (hereinafter Tarel).
- λ denotes a parameter that controls the trade-off between the data and the smoothness terms
- N_x denotes the four nearest neighbors of x in the image plane
- σ(x) denotes the standard deviation of t̃_LB, which is calculated per haze-line.
- σ(x) plays a significant role since it allows us to apply our estimate only to pixels where the assumptions hold. When the variance is high, the initial estimation is less reliable. σ(x) increases as the number of pixels in a haze-line decreases. When the radii distribution in a given haze-line is small, our haze-line assumption does not hold since we do not observe pixels with different amounts of haze. In such cases, σ(x) increases as well.
- the third core step of the technique is the dehazing: once t̂(x) is calculated as the minimum of Eq. (13), the dehazed image is calculated using Eq. (1):
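The recovery step may be sketched as follows, assuming Eq. (1) is the standard haze formation model I(x) = t(x)·J(x) + (1 − t(x))·A, which is not reproduced in this text. The floor `t_min` is a hypothetical safeguard against division by very small transmissions and is not taken from the source.

```python
def dehaze_pixel(pixel, airlight, t, t_min=0.1):
    """Invert I = t*J + (1 - t)*A for J, channel by channel.

    t_min is an assumed floor that limits noise amplification when the
    estimated transmission is close to zero.
    """
    t = max(t, t_min)
    return tuple((i - (1.0 - t) * a) / t for i, a in zip(pixel, airlight))
```

For instance, a scene color J = (0.2, 0.4, 0.6) observed through t = 0.5 with A = (0.8, 0.8, 0.8) appears as I = (0.5, 0.6, 0.7), and applying the function to I with the same t and A returns J.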
- FIG. 6 a shows the input hazy image.
- the final, dehazed image is shown in FIG. 6 b .
- FIG. 6 c shows the distance r(x) in RGB space of every pixel in the hazy image to the airlight. Note that this distance decreases as haze increases.
- FIG. 6 d shows the maximum radii r̂_max(x) per haze-line, calculated according to Eq. (11). Observe that FIG. 6 d is much brighter than FIG. 6 c . Since larger values are represented by brighter colors, this indicates that the distance to the airlight is increased.
- FIG. 6 f shows the color-mapped data term confidence in Eq. (15) (warm colors depict high values).
- the ratio of FIGS. 6 c and 6 d yields the initial transmission t̃(x) that is shown in FIG. 6 g .
- the transmission map after regularization is shown in FIG. 6 h . While t̃(x) contains fine details even in grass areas that are at the same distance from the camera, t̂(x) does not exhibit this behavior. This indicates the regularization is advantageous.
- the clustering of pixels in spherical coordinates is performed by representing the color differences between the pixels of the digital image and the airlight value of the pixels of the digital image on a pre-computed tessellation of the unit sphere, where the pre-computed tessellation is uniformly sampled and stored in Cartesian coordinates in a k-dimensional tree (KD-tree).
- a KD-tree is a computerized data structure for organizing points in a space with k dimensions. It is a binary search tree with additional constraints imposed on it. KD-trees are very useful for nearest neighbor searches (e.g., in a color space). The search for nearest neighbors on the KD-tree may be performed using the Euclidean distance between coordinates.
- the pixel clusters are grouped with the color samples based on the nearest neighbors, thereby producing multiple groups each being one of the haze-lines.
- the algorithm is linear in N, the number of pixels in the image, and therefore fast.
- the clustering is done using a nearest neighbor search on a KD-Tree with a fixed number of points. Estimating the radius within each cluster is linear in N. Therefore, the initial radius estimation is O(N). Seeking the minimum of Eq. (15) requires solving a sparse linear system, which is also O(N). Restoring the dehazed image from the transmission map is O(N) as well.
- the present technique outperforms previous methods in most cases, and handles the noise well. As expected, our performance degrades when the noise variance increases. However, our technique maintains its ranking, with respect to other methods, regardless of the amount of noise. This shows that our algorithm is quite robust to noise, despite being pixel-based.
- FIGS. 7 and 8 compare results to six state-of-the-art single image dehazing methods: C. O. Ancuti and C. Ancuti. Single image dehazing by multiscale fusion. IEEE Trans. on Image Processing, 22(8):3271-3282, 2013 (hereinafter Ancuti); Fattal et al. (2014), Id.; K. B. Gibson and T. Q. Nguyen. An analysis of single image defogging methods using a color ellipsoid framework. EURASIP Journal on Image and Video Processing, 2013(1), 2013; R. Luzon-Gonzalez, J. L. Nieves, and J. Romero. Recovering of weather degraded images based on RGB response ratio constancy. Appl.
- FIG. 8 compares both the transmission maps and the dehazed images. It shows our technique is comparable to other methods, and in certain cases works better. For example, the two rows of trees are well separated in our result when compared to He et al. (2009), Id.
- FIG. 9 a shows an enlarged portion of an image, where clear artifacts are visible in the result of Fattal et al. (2014), Id. ( FIG. 9 c ), around the leaves and at the boundary between the trunk and the background.
- FIG. 9 d shows our result.
- a patch-based method is less likely to estimate the distance of such scenes accurately.
- the result of He et al. (2009), Id. does not exhibit these artifacts in FIG. 9 b , because the dehazing is less effective in this image and the details are less clear (e.g., the circled trunk). This phenomenon is also visible in FIG.
- FIG. 10 demonstrates this is not the case.
- the pumpkins (a crop of FIG. 6 a ) are lit from above, and therefore are brighter at the top and gradually become darker towards the ground ( FIG. 10 left).
- FIG. 10 right depicts the cluster map—each color symbolizes a different haze-line. The gradual tone change is evident in the cluster map.
- a Hough transform in RGB space is used to automatically calculate airlight values, such as a set of color channel values for an airlight coordinate in RGB space.
- Hough transforms find imperfect instances of haze-lines by a voting procedure, the voting procedure carried out in a parameter space.
- Haze-line candidates are automatically obtained as local maxima in an “accumulator space” that is constructed by the Hough transform.
- clusters of points are automatically modeled as haze-lines by the Hough transform, and each point in each cluster may vote for the airlight RGB values, such as in a naïve embodiment.
- a global airlight value may be automatically determined in hazy images quickly and efficiently.
- the method is based on the haze-line model introduced herein, which considers clusters of pixel intensities with similar colors to form lines in RGB space under haze. These lines may intersect at the airlight color, and we take advantage of this observation to find their point of intersection.
- airlight color channel value set means a set of three or more color channel values, corresponding to the pixel color channel values.
- the set is a set of RGB values. This has a dramatic effect on the number of airlight candidates we need to sample and evaluate.
- a pixel vote may be assigned to a candidate A when the distance to one of the lines is smaller than a threshold τ.
- This threshold is adaptive and depends on the distance between A and I(x) to allow for small intensity variations. That is, instead of working with cylinders (lines with a fixed threshold) we work with cones (lines with a variable threshold).
- a pixel may be allowed to vote only for an airlight that is brighter than the pixel, such as by computing the brightness from the color channel values and comparing. This is because bright objects are quite rare, as shown empirically to justify the dark channel prior, and usually do not contain information about the haze (e.g., a bright building close to the camera).
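The voting test for a single pixel, candidate and line orientation may be sketched as below. Several choices are assumptions made for illustration only: the threshold is taken to grow linearly with the distance between the candidate and the pixel, brightness is taken to be the sum of the color channels, and the default base threshold of 0.02 mirrors the value quoted elsewhere in this description. The function name `supports` is hypothetical.

```python
import math

def supports(candidate, pixel, direction, tau0=0.02):
    """Return True when `pixel` may vote for airlight `candidate`.

    direction -- unit vector of the tested haze-line through the candidate
    tau0      -- base threshold; the effective threshold widens with
                 distance (a cone instead of a cylinder)
    """
    # Assumed brightness measure: the sum of the color channels.
    if sum(candidate) <= sum(pixel):
        return False
    diff = [p - c for p, c in zip(pixel, candidate)]
    dist = math.sqrt(sum(d * d for d in diff))
    along = sum(d * u for d, u in zip(diff, direction))
    # Perpendicular distance from the pixel to the line.
    perpendicular = math.sqrt(max(0.0, dist * dist - along * along))
    tau = tau0 * (1.0 + dist)  # assumed linear widening of the cone
    return perpendicular < tau
```

Accumulating the result over all pixels (or cluster centers), orientations and candidates yields a voting array whose maximum indicates the estimated airlight.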
- the proposed scheme which includes collecting votes from all pixels for all angles and airlight candidates in the 3D RGB space, is computationally expensive. Therefore, we propose the following approximations, which significantly accelerate the computation while maintaining accuracy.
- the first is clustering the colors in the image and using the cluster centers instead of all the pixels.
- the second is performing the voting scheme in two dimensions. The voting is repeated three times, with only two of the (R,G,B) color channels being used each time.
- the algorithm's run-time depends on the following parameters: the number of pixels in the image P, the number of airlight candidates (in each color channel) M, the number of color clusters N and the number of haze-line orientations K.
- the conversion from RGB to an indexed image has a run-time complexity of O(NP), while the airlight estimation using the indexed image has a run-time complexity of O(NKM²).
- FIG. 18 shows an exemplary selection of airlight values 18 A, 18 B, and 18 C from Hough transform.
- Each cluster n is marked by a circle with a size proportional to w n .
- the ground-truth (GT) airlight is marked by a green circle while our estimate is marked by a purple diamond.
- the GT values are determined by a trained operator selecting the airlight values manually from the image.
- Each colored cluster votes for the GT value where different colors indicate different haze-lines.
- the three voting arrays 18 A, 18 B, and 18 C show accum_{c1,c2}, (c1, c2) ∈ {RG, GB, RB}, as a function of the candidate airlight values.
- the color-map indicates the number of votes, and an airlight value set is selected from the accumulated assigned votes from all pixels. In this case, the ground-truth air-light had the most votes in all planes (strong yellow color).
- the bottom row images 18 D, 18 E, and 18 F show the distribution of the clustered pixels in RGB space. We show 2D plots since these projections are used in the 2D voting procedure (step 8 of Algorithm 1) and may improve visualization.
- the airlight value is pointed at by the strongest haze-lines.
- the threshold τ_0 = 0.02 determines whether a pixel I_n supports a certain haze-line.
- FIG. 19 shows images used for comparison of airlight values computed with Hough transform and with alternative techniques. These images show the results from evaluating the accuracy of the estimated airlight on natural images with different techniques. The results are from the techniques of Sulami, He et al., Bahat et al. (described in Blind dehazing using internal patch recurrence, Proceedings of the 2016 IEEE International Conference on Computational Photography (ICCP), 13-15 May 2016, EISBN: 978-1-4673-8623-4, DOI: 10.1109/ICCPHOT.2016.7492870), and the present invention. Following is Table 2 summarizing the L2 errors between the techniques from analysis of 40 images.
- Images 19 A, 19 B, 19 C, and 19 D show examples of hazy images, along with their manually extracted ground-truth airlight (GT) colors (modified to gray scale). Following are tables of values corresponding to the airlight colors of the images by the different methods.
- the error bars corresponding to them in 19 E are labeled.
- our error is larger than Bahat. This may be caused by several bright pixels that have a high red value.
- for the Schechner image 19 B , our method outperforms all other methods.
- the Train image 19 C shows that all methods except Sulami perform well.
- all methods yield relatively high errors. This may be because the airlight is not uniform across the scene.
- FIG. 20 shows a comparison of selections of airlight values from Hough transform for images 20 A, 20 B, 20 C, 20 D, 20 E, 20 F, 20 G, and 20 H with and without visible sky.
- First we estimated the airlight using the entire image, and received an average error of 0.116 (median error 0.091).
- in the images, the cropped region is marked by a dotted line.
- the estimated airlight values of the full and cropped images are shown, as well as the GT value extracted manually from the images.
- Our cropped image estimations are close to the ones estimated from the full image. The largest error, both before and after cropping, was calculated for the right image on the second row from the top; it had an L2 error of 0.35.
- FIG. 21 shows a comparison of dehazing between Hough transform 21 C and 21 D and an alternative technique 21 A and 21 B. Shown are end-to-end dehazing results using both the airlight estimation described in He et al. and the airlight estimation of an embodiment of the current technique. Both results use an embodiment of the dehazing method described herein. Using different airlight values shows the effect of the estimation on the output dehazed image. Images 21 C and 21 D show a successful example where the airlight was accurately estimated for the Schechner image 19 B. The incorrect value estimated by He et al. leads to an incorrect transmission map 21 B, while the transmission in 21 D approximately describes the scene structure. In the transmission maps, the darker colors are farther from the sensor, and the lighter are closer. As seen in 21 B, with the previous technique the image intensities cause the brighter buildings to be considered farther (bottom right), while in 21 D the transmission map shows much better visual correspondence with the “depth” of the depicted objects.
- enhanced images may improve automatic segmentation, increase the quality of feature matching between images taken from multiple viewpoints, and aid in identification.
- Present embodiments aim to recover the object's colors in scenes photographed under ambient illumination in water using solely a single image as an input. Another aim is to recover a distance map of the photographed scene. This problem is closely related to the single image dehazing problem discussed above, in which images are degraded by weather conditions such as haze or fog. The above dehazing technique assumes that the attenuation is uniform across colors.
- the input image may be converted to a medium-compensated image where the attenuation coefficient is the same for all color channels. Then, the above image dehazing technique may be used to solve the problem. In alternative embodiments, other image dehazing techniques may be applied to the medium-compensated image.
- I c denotes the acquired image value in color channel c
- t c denotes the transmission of that color channel
- J c denotes the image value of the object that would have been acquired without the scattering and absorption of the water medium.
- I refers to the image obtained from the raw file after minimal processing such as demosaicing and black current subtraction, as disclosed by Akkaynak, D., Treibitz, T., Xiao, B., Gürkan, U.
- the transmission depends on object distance z and the water attenuation coefficient for each channel β_c :
- the attenuation of red colors may be an order of magnitude larger than the attenuation of blue and green, as observed, for example, by Mobley, C. D.: Light and water: radiative transfer in natural waters. Academic press (1994). Therefore, as opposed to the common assumption in single image dehazing, the transmission t is wavelength-dependent.
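The wavelength dependence may be illustrated with a short sketch that assumes the common exponential attenuation form t_c = exp(−β_c·z), which is not reproduced in this text. The function name and the coefficient values used in the usage note are hypothetical and chosen only to show that red decays faster than green and blue.

```python
import math

def channel_transmission(z, beta):
    """Return per-channel transmission, assuming t_c = exp(-beta_c * z).

    z    -- object distance (same length unit as 1 / beta)
    beta -- mapping from channel name to attenuation coefficient
    """
    return {channel: math.exp(-b * z) for channel, b in beta.items()}
```

With illustrative coefficients such as `{"R": 0.4, "G": 0.07, "B": 0.05}` and z = 5, the red transmission is far smaller than the green and blue ones, so a single transmission value per pixel, as assumed in ordinary dehazing, cannot describe the scene.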
- the Jerlov water types are I, IA, IB, II and III for open ocean waters, and 1 through 9 for coastal waters.
- Type I is the clearest and type III is the most turbid open ocean water.
- type 1 is clearest and type 9 is most turbid.
- FIG. 11 depicts the attenuation coefficients of Jerlov water types; the figure was adapted from data in Mobley et al.
- FIG. 1 shows the ratios of the attenuation coefficients, β_B/β_R vs. β_B/β_G, of Jerlov water types for wavelengths of peak camera sensitivity according to Jiang, J., Liu, D., Gu, J., Susstrunk, S.: What is the space of spectral sensitivity functions for digital color cameras? In: Proc. IEEE Workshop Applications of Computer Vision ( WACV ). (2013) 168-179.
- FIG. 12 demonstrates the effect of scattering medium on observed scene colors.
- FIG. 12 a shows a scene with four different clusters of similar colors J marked. Pixels of a haze-free color image are clustered using K-means. Pixels belonging to four of the clusters are marked. Note that the pixels are non-local and are spread all over the image plane.
- FIG. 12 b shows the same pixels in RGB space, with colors of the clusters corresponding to the highlighted pixels in FIG. 12 a.
- FIG. 12 c shows the same scene with added synthetic haze.
- the same clustered pixels are marked, but their observed colors are affected by different amounts of haze.
- FIG. 12 e shows the scene as if it was captured under water.
- FIGS. 12 d and 12 f show the corresponding pixels in RGB space for haze and underwater, respectively.
- the hazy pixels are distributed along haze-lines passing through the airlight, marked in black.
- the cluster pixels are distributed along curves that do not coincide with the straight lines spanned by the original color and the veiling-light (the haze-lines), due to the wavelength-dependent attenuation.
- the attenuation is mostly wavelength independent and the tight color clusters become haze-lines.
- the attenuation is wavelength-dependent, and the clusters become curves.
- a c is extracted from a patch in the image.
- Eq. (24) compensates for the intensity changes that happen in the path between the object and the camera.
- the ambient illumination is attenuated by the water column from the surface to the imaging depth, resulting in a colored (bluish) global illumination.
- FIG. 11 shows the approximate attenuation coefficient ratios β_BG, β_BR, calculated for different Jerlov water types ( FIG. 11 a ).
- parameters may be standard deviations of one or more color channel values, other statistical values of the RGB color values, and/or the like.
- the color charts are used only for validation. During the transmission estimation, we masked out the color charts, in order to estimate the transmission based on natural scene objects alone. The transmission of those pixels is determined during the regularization step based on neighboring values.
- FIG. 13 demonstrates the importance of choosing the correct water type. Using an incorrect value leads to an incorrect transmission estimation. As a result, some areas in the image may be under- or over-compensated in the restoration process.
- FIG. 13 a shows an input image with X-rite color checker, and two different outputs ( FIGS. 13 b and 13 c ) for two water types, including a zoom-in on the color-checker.
- the image of FIG. 13 a is restored using two different water types (different β_BR, β_BG values), with the results shown in FIGS. 13 b and 13 c , respectively.
- Using incorrect water type leads to incorrect colors of the restored image, as shown both qualitatively and quantitatively by the zoom-in on the color chart marked by a yellow rectangle.
- Qualitatively the rightmost chart shows a pink hue.
- the correct values indicate water type 3 (coastal waters), while the incorrect values are for open ocean waters.
- FIG. 14 compares prominent single underwater image restoration methods.
- the top row shows the original input to the restoration methods, with veiling-light estimation: using UDCP in orange, using WCID in red and averaging over a patch in yellow for Haze-lines and the proposed method.
- the rest of the rows, from top to bottom, show the output of a global contrast enhancement of each channel (which affects the white-balance), Haze-lines, UDCP, WCID, and the present restoration technique.
- the methods Haze-Lines, UDCP, and WCID do not restore the color of the sand in the foreground of Frames and Pier correctly, as some areas have a blue-green color-cast while others do not. This phenomenon is an indication of an incorrect wavelength-dependent correction, not a global white balance problem.
- the red color is attenuated much more than the blue and green, and is not amplified enough by these methods.
- the present restoration technique is able to compensate for the distance-dependent attenuation. For example, the Barracudas all have similar colors in the output image, regardless of their original distance. Similarly, the sand in the foreground of Frames has a uniform color.
- FIG. 15 shows an image along with the transmission maps estimated in the process for three methods: UDCP, WCID, and the present restoration technique. Due to the unconstrained nature of the problem, a prior must be used to estimate the transmission. Both UDCP and WCID are based on the dark channel prior, and estimate the transmission according to:
- FIGS. 15I .a to 15 II.e show the value of the dark channel
- FIGS. 15I .b, 15 I.c, 15 II.b, and 15 II.c are not restored properly due to the wrong estimation of the transmission.
- the prior of the present restoration technique produces better results ( FIG. 5 d ). For example, FIG. 15I .d shows that the color of the Sea Goldies fish is restored much better than the other methods.
- FIG. 15I .a shows the input image.
- FIGS. 15I .b thru 15 I.d show the output of three restoration methods: UDCP, WCID, and the present restoration technique.
- FIG. 15I .e shows a step in the transmission calculation using the dark channel prior:
- FIGS. 15I .f thru 15 I.h show the transmission maps estimated during the restoration process, corresponding to FIGS. 15I .b thru 15 I.d, color-mapped such that warm colors indicate a high transmission, and cold colors indicate a low transmission.
- FIG. 16 compares our result to methods proposed in Carlevaris-Bianco, N., Mohan, A., Eustice, R. M.: Initial results in underwater single image dehazing.
- Image Processing ICIP
- JPEG compression artifacts are amplified by the restoration method; however, the present restoration technique removes the blue hue completely from the farther corals, unlike previously proposed methods.
- 16 a is the input image
- 16 b - d show the output of three restoration methods: Carlevaris-Bianco et al. (2010), Id., Peng et al. (2015), Id., and the present restoration technique, respectively.
- the present invention may be a system, a method, and/or a computer program product.
- the computer program product may include a computer readable storage medium (or media) having computer readable program instructions thereon for causing a hardware processor to carry out aspects of the present invention.
- the computer readable storage medium may be a tangible device that may retain and store instructions for use by an instruction execution device.
- the computer readable storage medium may be, for example, but is not limited to, an electronic storage device, a magnetic storage device, an optical storage device, an electromagnetic storage device, a semiconductor storage device, or any suitable combination of the foregoing.
- a non-exhaustive list of more specific examples of the computer readable storage medium includes the following: a portable computer diskette, a hard disk, a random access memory (RAM), a read-only memory (ROM), an erasable programmable read-only memory (EPROM or Flash memory), a static random access memory (SRAM), a portable compact disc read-only memory (CD-ROM), a digital versatile disk (DVD), a memory stick, a floppy disk, a mechanically encoded device having instructions recorded thereon, and any suitable combination of the foregoing.
- a computer readable storage medium is not to be construed as being transitory signals per se, such as radio waves or other freely propagating electromagnetic waves, electromagnetic waves propagating through a waveguide or other transmission media (e.g., light pulses passing through a fiber-optic cable), or electrical signals transmitted through a wire. Rather, the computer readable storage medium is a non-transient (i.e., non-volatile) medium.
- Computer readable program instructions described herein may be downloaded to respective computing/processing devices (which comprise a hardware processor) from a computer readable storage medium or to an external computer or external storage device via a network, for example, the Internet, a local area network, a wide area network and/or a wireless network.
- the network may comprise copper transmission cables, optical transmission fibers, wireless transmission, routers, firewalls, switches, gateway computers and/or edge servers.
- a network adapter card or network interface in each computing/processing device receives computer readable program instructions from the network and forwards the computer readable program instructions for storage in a computer readable storage medium within the respective computing/processing device.
- Computer readable program instructions for carrying out operations of the present invention may be assembler instructions, instruction-set-architecture (ISA) instructions, machine instructions, machine dependent instructions, microcode, firmware instructions, state-setting data, or either source code or object code written in any combination of one or more programming languages, including an object oriented programming language such as Java, Smalltalk, C++ or the like, and conventional procedural programming languages, such as the “C” programming language or similar programming languages.
- the computer readable program instructions may execute entirely on the user's computer, partly on the user's computer, as a stand-alone software package, partly on the user's computer and partly on a remote computer or entirely on the remote computer or server.
- the remote computer may be connected to the user's computer through any type of network, including a local area network (LAN) or a wide area network (WAN), or the connection may be made to an external computer (for example, through the Internet using an Internet Service Provider).
- a hardware processor that is, for example, a microprocessor, programmable logic circuitry, a field-programmable gate array (FPGA), or programmable logic arrays (PLA), may execute the computer readable program instructions by utilizing state information of the computer readable program instructions to personalize the hardware processor, in order to perform aspects of the present invention.
- These specialized computer readable program instructions may be provided to a microprocessor of a general-purpose computer, special purpose computer, or other programmable data processing apparatus to produce a machine, such that the instructions, which execute via the microprocessor of the computer or other programmable data processing apparatus, create means for implementing the functions/acts specified in the flowchart and/or block diagram block or blocks.
- These computer readable program instructions may also be stored in a computer readable storage medium that may direct a computer, a programmable data processing apparatus, and/or other devices to function in a particular manner, such that the computer readable storage medium having instructions stored therein comprises an article of manufacture including instructions which implement aspects of the function/act specified in the flowchart and/or block diagram block or blocks.
- the computer readable program instructions may also be loaded onto a computer, other programmable data processing apparatus, or other device to cause a series of operational steps to be performed on the computer, other programmable apparatus or other device to produce a computer implemented process, such that the instructions which execute on the computer, other programmable apparatus, or other device implement the functions/acts specified in the flowchart and/or block diagram block or blocks.
- each block in the flowchart or block diagrams may represent a module, segment, or portion of instructions, which comprises one or more executable instructions for implementing the specified logical function(s).
- the functions noted in the block may occur out of the order noted in the figures.
- two blocks shown in succession may, in fact, be executed substantially concurrently, or the blocks may sometimes be executed in the reverse order, depending upon the functionality involved.
Abstract
Description
- This application claims priority to U.S. Provisional Patent Application No. 61/319,338, filed Apr. 7, 2016, entitled “Image Dehazing and Restoration”. The contents of that application, including all its color drawings, are incorporated herein by reference in their entirety.
- The invention relates to the field of digital image processing.
- Outdoor digital images often suffer from low contrast and limited visibility due to haze, fog, or other atmospheric phenomena. As used herein, the term image means a digital image as collected by a digital camera sensor from photons emitted by a physical scene, and stored on a non-transitory computer-readable storage medium. The digital image comprises pixels, where each pixel has three or more color channel values. Haze, for example, results from small particles in the air that scatter the light in the atmosphere. Fog results from tiny water droplets suspended in the air near the earth's surface. Haze and fog are independent of scene radiance and have two effects on the acquired image: they attenuate the signal of the viewed scene, and introduce an additive component to the image, termed the ambient light, or airlight (the color of a scene point at infinity). As used herein, the term airlight means a set of values representing color, such as the red, green, and blue color channel values, that represent the color of the haze in the images where no objects are visualized. The image degradation caused by haze or fog increases with the distance from the camera, since the scene radiance decreases and the airlight magnitude increases. Thus, hazy or foggy images may be modeled as a per-pixel convex combination of a haze/fog-free image and the global airlight.
- Images taken in media other than air may suffer from similar degradation. In addition, some media, such as water, are characterized by wavelength-dependent transmission, distorting the colors in the resulting image. Compensating for this wavelength-dependent transmission may require applying different attenuation coefficients for different color channels in the image. This is sometimes done with underwater imagery.
- The foregoing examples of the related art and limitations related therewith are intended to be illustrative and not exclusive. Other limitations of the related art will become apparent to those of skill in the art upon a reading of the specification and a study of the figures.
- The following embodiments and aspects thereof are described and illustrated in conjunction with systems, tools and methods which are meant to be exemplary and illustrative, not limiting in scope.
- One embodiment provides a method for dehazing a digital image, comprising: operating at least one hardware processor to: cluster pixels of a digital image into haze-lines, wherein each of the haze-lines is comprised of a sub-group of the pixels that are scattered non-locally over the digital image; estimate, based on the haze-lines, a transmission map of the digital image, wherein the transmission map encodes scene depth information for each pixel of the digital image; and calculate a dehazed digital image based on the transmission map.
- Another embodiment provides a method for restoring a digital image, comprising: operating at least one hardware processor to: convert a digital image of an underwater scene to a plurality of medium-compensated images that are each based on attenuation coefficient ratios of a different water type; and for each of the plurality of medium-compensated images: (a) cluster pixels of the medium-compensated image into haze-lines, wherein each of the haze-lines is comprised of a sub-group of the pixels that are scattered non-locally over the medium-compensated image, (b) estimate, based on the haze-lines, a transmission map of the medium-compensated image, wherein the transmission map encodes scene depth information for each pixel of the medium-compensated image, and (c) calculate, based on the transmission map and the attenuation coefficient ratios, a restored digital image of the underwater scene.
- In some embodiments, the clustering of pixels comprises: representing colors of the pixels of the digital image in a spherical coordinate system whose origin is an estimated global airlight value; uniformly sampling the unit sphere of the representation, to output a plurality of color samples each associated with one of the pixels of the digital image; grouping the color samples based on their θ (Theta) and φ (Phi) angles in the spherical coordinate system, according to a mutual closest point on the unit sphere, thereby producing multiple groups each being one of the haze-lines.
- In some embodiments, the clustering of pixels comprises representing the color differences between the pixels of the digital image and the airlight value on a pre-computed tessellation of the unit sphere, where the pre-computed tessellation is uniformly sampled and stored in Cartesian coordinates in a KD-tree. The clustering of pixels comprises searching for nearest neighbors on the KD-tree using Euclidean distance coordinates. The clustering of pixels comprises grouping the color samples based on the nearest neighbors, thereby producing multiple groups each being one of the haze-lines.
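- The clustering described above can be sketched as follows. This is a minimal illustration under stated assumptions, not the implementation of the embodiments: a Fibonacci spiral stands in for the pre-computed uniform tessellation, a brute-force nearest-neighbor search stands in for the KD-tree query, and the names `fibonacci_sphere` and `cluster_into_haze_lines` are hypothetical.

```python
import math

def fibonacci_sphere(n):
    """Approximately uniform samples on the unit sphere (assumed stand-in
    for the pre-computed tessellation)."""
    golden_angle = math.pi * (3.0 - math.sqrt(5.0))
    points = []
    for k in range(n):
        z = 1.0 - 2.0 * (k + 0.5) / n
        radius = math.sqrt(1.0 - z * z)
        points.append((radius * math.cos(golden_angle * k),
                       radius * math.sin(golden_angle * k),
                       z))
    return points

def cluster_into_haze_lines(pixels, airlight, samples):
    """Group pixel indices by the sphere sample nearest to the direction of I - A."""
    lines = {}
    for idx, color in enumerate(pixels):
        diff = tuple(c - a for c, a in zip(color, airlight))
        r = math.sqrt(sum(d * d for d in diff))
        if r == 0.0:
            continue  # pixel equals the airlight: direction undefined
        unit = tuple(d / r for d in diff)
        # For unit vectors, the smallest Euclidean distance is the largest dot product.
        best = max(range(len(samples)),
                   key=lambda k: sum(u * s for u, s in zip(unit, samples[k])))
        lines.setdefault(best, []).append(idx)
    return lines

# Two scene colors, each observed at three transmissions (hypothetical values).
A = (0.8, 0.8, 0.8)
J1, J2 = (0.1, 0.5, 0.2), (0.7, 0.1, 0.1)
pixels = [tuple(t * j + (1.0 - t) * a for j, a in zip(J, A))
          for J in (J1, J2) for t in (1.0, 0.6, 0.3)]
samples = fibonacci_sphere(500)
haze_lines = cluster_into_haze_lines(pixels, A, samples)
```

- In this sketch the three observations of each scene color fall into the same group, because changing the transmission moves a pixel along the ray from the airlight without changing its direction. A KD-tree would replace the linear scan when the number of pixels is large.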
- In some embodiments, the estimating of the transmission map comprises: estimating an initial transmission map as the quotient, for each individual pixel of the pixels of the digital image, of: (a) a distance of the individual pixel from an airlight value, and (b) a distance from a pixel which is farthest away from the airlight value and belongs to the same haze-line as the individual pixel; and regularizing the initial transmission map by enforcing a smoothness of the digital image on the initial transmission.
- In some embodiments, the estimating of the transmission map comprises: estimating an initial transmission map as the quotient, for each individual pixel of the pixels of the digital image, of: (a) a distance of the individual pixel from a veiling-light value, and (b) a distance from a pixel which is farthest away from the veiling-light value and belongs to the same haze-line as the individual pixel; and regularizing the initial transmission map by enforcing a smoothness of the digital image on the initial transmission.
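- The quotient that defines the initial transmission map can be illustrated with a short sketch. It assumes the haze-line label of each pixel is already known, and it omits the regularization step. The function name `initial_transmission` and the sample values are hypothetical.

```python
import math

def initial_transmission(pixels, labels, airlight):
    """Per-pixel ratio of the distance to the airlight over the largest
    such distance found on the same haze-line."""
    radii = [math.dist(color, airlight) for color in pixels]
    r_max = {}
    for radius, label in zip(radii, labels):
        r_max[label] = max(r_max.get(label, 0.0), radius)
    return [radius / r_max[label] if r_max[label] > 0.0 else 1.0
            for radius, label in zip(radii, labels)]

# One haze-line: a single scene color seen through three transmissions.
A = (0.8, 0.8, 0.8)
J = (0.2, 0.4, 0.1)
t_true = (1.0, 0.5, 0.25)
pixels = [tuple(t * j + (1.0 - t) * a for j, a in zip(J, A)) for t in t_true]
t_est = initial_transmission(pixels, labels=[0, 0, 0], airlight=A)
```

- The estimate matches the true transmission here only because the farthest pixel on the line is haze-free (t = 1). When no haze-free pixel is present on a haze-line, the quotient overestimates the transmission for that line.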
- In some embodiments, the method further comprises operating said at least one hardware processor to: for each of the restored digital images: (a) perform global white balancing of the restored digital image, to output a white-balanced image, (b) calculate a standard deviation of a red channel of the white-balanced image and of a green channel of the white-balanced image; and output the white-balanced image having the lowest standard deviation.
- In some embodiments, the method further comprises computing the estimated global veiling-light value by: generating an edge map of the digital image; thresholding the edge map, to produce multiple pixel blobs; and determining that a color or an average color of pixels making up a largest one of the multiple pixel blobs, is the global veiling-light value.
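- A minimal sketch of this veiling-light estimate follows. Several choices are assumptions made for illustration only: a forward-difference gradient replaces a dedicated edge detector, the blobs are taken to be 4-connected regions whose edge response is below the threshold, and the threshold value and the name `estimate_veiling_light` are hypothetical.

```python
from collections import deque

def estimate_veiling_light(image, edge_threshold=0.05):
    """Return (average color, pixel list) of the largest low-edge blob."""
    h, w = len(image), len(image[0])
    gray = [[sum(px) / len(px) for px in row] for row in image]

    def edge(y, x):
        # Forward differences; zero at the last column / last row.
        dx = gray[y][x + 1] - gray[y][x] if x + 1 < w else 0.0
        dy = gray[y + 1][x] - gray[y][x] if y + 1 < h else 0.0
        return abs(dx) + abs(dy)

    smooth = [[edge(y, x) < edge_threshold for x in range(w)] for y in range(h)]
    seen = [[False] * w for _ in range(h)]
    largest = []
    for y0 in range(h):
        for x0 in range(w):
            if not smooth[y0][x0] or seen[y0][x0]:
                continue
            # Breadth-first flood fill of one connected component.
            blob, queue = [], deque([(y0, x0)])
            seen[y0][x0] = True
            while queue:
                y, x = queue.popleft()
                blob.append((y, x))
                for ny, nx in ((y - 1, x), (y + 1, x), (y, x - 1), (y, x + 1)):
                    if 0 <= ny < h and 0 <= nx < w and smooth[ny][nx] and not seen[ny][nx]:
                        seen[ny][nx] = True
                        queue.append((ny, nx))
            if len(blob) > len(largest):
                largest = blob
    if not largest:
        raise ValueError("no low-edge region found")
    channels = len(image[0][0])
    color = tuple(sum(image[y][x][c] for y, x in largest) / len(largest)
                  for c in range(channels))
    return color, largest

# Synthetic 6x6 image: uniform "water" rows on top, checkerboard texture below.
water, dark, bright = (0.2, 0.5, 0.7), (0.1, 0.1, 0.1), (0.9, 0.9, 0.9)
demo = [[water] * 6 for _ in range(3)] + [
    [dark if (x + y) % 2 == 0 else bright for x in range(6)] for y in range(3)]
veil, blob = estimate_veiling_light(demo)
```

- In the synthetic image, the largest blob is the textureless region at the top, and its average color is returned as the veiling-light value.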
- Another embodiment provides a system that comprises: an image sensor configured to acquire the digital image of any one of the embodiments listed above; a non-transitory computer-readable storage medium having stored thereon program instructions to perform the steps of any one of the embodiments listed above; and at least one hardware processor configured to execute the program instructions.
- A further embodiment provides a computer program product comprising a non-transitory computer-readable storage medium having program code embodied therewith, the program code executable by at least one hardware processor to perform the steps of any one of the embodiments listed above.
- Another embodiment provides a method for estimating a set of airlight color channel values for a digital image. The method comprising operating at least one hardware processor to automatically perform the method actions. The method comprising an action of receiving a digital image comprising a plurality of pixels, each pixel comprising at least three color channel values. The method comprising for each of the plurality of pixels, an action of assigning, based on the color channel values, a Hough transform vote for each of the plurality of pixels to at least one of a plurality of candidate airlight color channel value sets, each of the sets comprising at least three airlight color channel values. The method comprising, based on the assigned votes, an action of selecting one of the sets as the airlight color channel value set of the digital image.
- In some embodiments, each pixel color channel value and each airlight color channel value is one of a red channel value, a green channel value, and a blue channel value.
- In some embodiments, the assigning comprises computing for each pixel a plurality of distances, in a color channel value space, between each pixel and a plurality of candidate haze-lines, wherein each of the plurality of candidate haze-lines is defined by (a) one of the plurality of candidate airlight color channel value sets and (b) one of a plurality of solid angles. The assigning comprises comparing the plurality of distances with an adaptive threshold, wherein the adaptive threshold is based on the distance from each pixel to the respective one of the plurality of candidate airlight color channel value sets. The assigning comprises, for each pixel, assigning at least one vote to some of the plurality of candidate airlight color channel value sets based on the comparison.
- In some embodiments, for each Hough transform vote, the at least one of the plurality of candidate airlight color channel value sets that is voted for, is brighter than the voting pixel.
- In some embodiments, the method further comprises selecting, for each pixel, a plurality of subsets, each subset being a unique combination of at least two color channel values, thereby producing at least three limited color channel datasets. The method further comprises performing the steps of assigning and selecting for each of the at least three limited color channel datasets, producing at least three selected airlight color channel value sets. The method further comprises combining the at least three selected airlight color channel values to produce a single airlight color channel value set.
- In some embodiments, the method further comprises grouping the plurality of pixel color values into a plurality of clusters, wherein the vote is assigned for each of the plurality of clusters.
- In some embodiments, the plurality of clusters are grouped by at least one of a k-means algorithm and a Minimum Variance Quantization algorithm.
- In some embodiments, the assigned vote for each of the plurality of clusters is weighted by a statistical parameter of each respective cluster.
- In addition to the exemplary aspects and embodiments described above, further aspects and embodiments will become apparent by reference to the figures and by study of the following detailed description.
- The patent or application file, or the file of U.S. Provisional Patent Application No. 61/319,338 to which priority is claimed, contains at least one drawing executed in color. Copies of these color drawing(s) will be provided by the U.S. Patent and Trademark Office upon request and payment of the necessary fee.
- Exemplary embodiments are illustrated in referenced figures. Dimensions of components and features shown in the figures are generally chosen for convenience and clarity of presentation and are not necessarily shown to scale. The figures are listed below.
- FIG. 1 illustrates the haze-lines prior;
- FIGS. 2a-2c show a comparison between haze-lines and color lines;
- FIGS. 3a-3d illustrate a validation of the haze-lines prior;
- FIG. 4 shows an airlight-centered spherical representation;
- FIGS. 5a-5b show distance distribution per haze-line;
- FIGS. 6a-6h show intermediate and final results of the present dehazing technique;
- FIGS. 7a, 7b, and 7c(i)-7c(ii) show a comparison of the present dehazing technique and previous methods, on natural images;
- FIGS. 8a-8d show a comparison of transmission maps and dehazed images of the present dehazing technique versus previous methods;
- FIG. 9 illustrates the advantages of the present, global, approach over a patch-based approach;
- FIG. 10 illustrates color clustering;
- FIG. 11 illustrates attenuation coefficients of Jerlov water types;
- FIGS. 12a-12f demonstrate the effect of scattering medium on observed scene colors;
- FIGS. 13a-13c demonstrate the importance of water type matching;
- FIG. 14 shows a comparison of the present restoration technique and previous methods, on natural images;
- FIG. 15 shows a comparison of the transmission maps of the present restoration technique and previous methods;
- FIGS. 16a-16d show a comparison of the present restoration technique and previous methods, on a JPEG-compressed input image;
- FIG. 17 shows an exemplary underwater image and a veiling light pixel blob computed for that image;
- FIGS. 18a-18f show an exemplary selection of airlight values from Hough transform;
- FIGS. 19a-19d show images used for comparison of selection of airlight values computed with Hough transform and with alternative techniques;
- FIGS. 20a-20h show a comparison of selections of airlight values from Hough transform for images with and without visible sky; and
- FIGS. 21a-21d show a comparison of dehazing between Hough transform and an alternative technique.
- Disclosed herein is a single-image dehazing technique that operates globally on a hazy image without having to divide the image into patches. The technique relies on the assumption that colors of a haze-free image are well approximated by a few hundred distinct colors that form tight clusters in red-green-blue (RGB) space, where each color is a set of three values, each representing the intensity of one color channel. A key observation of the present application is that pixels in a given cluster are often non-local, i.e., they are spread over the entire image plane and are located at different distances from the camera. In the presence of haze, these varying distances translate to different transmission coefficients. Therefore, each color cluster in the hazy image becomes a shape (such as a line, arc, curve, and/or a combination thereof) in RGB space, which is termed here a “haze-line”. For example, the RGB values of the cluster pixels lie substantially along a line (i.e., are substantially collinear) extending through the airlight (or veiling-light) RGB value. Optionally, the correlation coefficient between a model shape and the pixel color values may be used to determine the haze-line. Optionally, the haze-line model may be loosely (i.e. elastically) associated with the airlight point, rigidly associated with the airlight point (i.e. constrained fitting), and/or the like. Using these haze-lines, the present technique recovers both the distance map and the haze-free image. The technique is linear in the size of the image, deterministic, and requires no training. It performs well on a wide variety of images and is competitive with other state-of-the-art methods.
- Also disclosed is an adaptation of the single-image dehazing technique which makes it suitable for scenes characterized by wavelength-dependent transmission, such as under water. The adapted technique takes into account the different attenuation coefficient for the different color channels, affected by the medium in which the imaging takes place.
- Further disclosed are techniques for airlight RGB value determination from single images.
- The disclosed technique aims to recover, out of a hazy image, the RGB values of a haze-free image. Another, optional, aim is to recover the transmission (the coefficient of the convex combination) for each pixel, which provides a precursor to the scene depth. These are ill-posed problems that have an under-determined system of three equations and at least four unknowns per pixel, with inherent ambiguity between haze and object radiance.
- For simplicity of discussion, the present technique is referred to as “dehazing”, and such terminology is used throughout the specification. However, the technique may also apply to foggy images, underwater images, and/or the like. For example, the atmospheric phenomena of haze and fog are similar in how they affect photographs. Accordingly, it is hereby intended that the term “haze”, and any grammatical inflections thereof, is interpreted as relating to haze, fog, or any like image degradation due to light's reflection, refraction, scattering, absorption, dispersion, and/or the like.
- The technique uses the observation that colors of a haze-free image may be well approximated by a few hundred distinct colors, as presented, for example, by M. T. Orchard and C. A. Bouman. Color quantization of images. Signal Processing, IEEE Transactions on, 39(12):2677-2690, 1991. This implies that pixels in a hazy image may be modeled by lines in RGB space that pass through the airlight coordinate. These lines are termed here haze-lines, to stress this characteristic. Pixels along a haze-line come from objects that have similar radiance colors, located over the entire image plane. These objects may be located at different distances from the camera. Since their acquired color may be modeled by a convex combination of the radiance color and the airlight color, such objects may span a line in RGB space. We use these lines to estimate the per-pixel transmission based on the pixel's position along the line it belongs to.
- As opposed to recent state-of-the-art methods, the present technique is global and does not divide the image into patches. Patch-based methods take great care to avoid artifacts by either using multiple patch sizes or taking into consideration patch overlap and regularization using connections between distant pixels. In the present application, the pixels that form the haze-lines are spread across the entire image and therefore capture a global phenomenon that is not limited to small image patches. Thus, our prior is more robust and significantly more efficient in run-time.
- The present technique is an efficient algorithm that is linear in the size of the image. We automatically detect haze-lines and use them to dehaze the image. Also presented here are the results of extensive experiments conducted by the inventors to validate the technique and report quantitative and qualitative results on many outdoor images.
- We first present the haze model and then describe how we use non-local haze-lines for image dehazing.
- The common hazy image formation model, as discussed in W. E. K. Middleton. Vision through the atmosphere. Toronto: University of Toronto Press, 1952, is:
- $$I(x) = t(x) \cdot J(x) + [1 - t(x)] \cdot A, \qquad \text{Eq. (1)}$$
- where x denotes the image coordinate, I denotes the observed hazy image RGB values at x, t(x) denotes the transmission at x, and J denotes the true RGB radiance of the scene point imaged at x. The airlight A denotes a single color (i.e. RGB values) representing the airlight in image areas where t=0. It should be emphasized that, although the term “airlight” implies that open air photography is involved, its concept is nonetheless applicable to underwater photography, where it may be termed “veiling light”. In this disclosure, these terms may be used interchangeably.
- To estimate the veiling light, we assume an area without objects is visible in the image, in which the color of the pixels is determined by the veiling light alone. This is a reasonable assumption when the line of sight is horizontal. It does not hold when photographing a reef wall up close, or when the camera is pointed downwards. However, in these cases, the distance of objects from the camera usually varies less than in horizontal scenes, and a simple contrast stretch is likely to be sufficient.
- Optionally, in order to detect the pixels that belong to the veiling light, we generate an edge map of the image using an edge detection tool, such as, for example, the Structured Edge Detection Toolbox (P. Dollár and C. L. Zitnick. Structured forests for fast edge detection. In Proc. IEEE ICCV, 2013; available online at: https://github.com/pdollar/edges, last viewed Mar. 27, 2017) and threshold the edge map, to produce multiple connected components (i.e., multiple pixel blobs). We then look for and determine the largest connected component. The pixels belonging to the largest connected component are classified as veiling-light pixels (x∈VL). An example may be seen in FIG. 17, where the bottom frame shows only the veiling light pixels of the top image. The veiling-light A is the color of these pixels, or, when the thresholding yielded multiple colors, the average color of these pixels.
- The scene transmission t(x) is distance-dependent:
- $$t(x) = e^{-\beta d(x)} \qquad \text{Eq. (2)}$$
- where β denotes the attenuation coefficient of the atmosphere and d(x) denotes the distance of the scene at pixel x. Generally, β is wavelength dependent and therefore t is different per color channel, as discussed in S. G. Narasimhan and S. K. Nayar. Chromatic framework for vision in bad weather. In Proc. IEEE CVPR, 2000, and in Y. Y. Schechner, S. G. Narasimhan, and S. K. Nayar. Instant dehazing of images using polarization. In Proc. IEEE CVPR, 2001. This dependency has been assumed negligible in many previous single image dehazing methods, to reduce the number of unknowns. We follow this assumption. The transmission t(x) acts as the matting coefficient between the scene J and the airlight A. Thus, per-pixel x, Eq. (1) has three measurements I(x) and four unknowns: J(x) and t(x), resulting in an under-determined estimation problem.
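- The image formation model of Eq. (1) and Eq. (2) can be exercised for a single pixel as sketched below. The function names, the numeric values, and the lower bound `t_min` on the transmission (a safeguard against division by very small values) are assumptions made for illustration.

```python
import math

def transmission(distance, beta):
    """Eq. (2): t = exp(-beta * d), with a single beta for all channels."""
    return math.exp(-beta * distance)

def add_haze(J, t, A):
    """Eq. (1): I = t * J + (1 - t) * A, applied per color channel."""
    return tuple(t * j + (1.0 - t) * a for j, a in zip(J, A))

def remove_haze(I, t, A, t_min=0.1):
    """Invert Eq. (1) for J, given the transmission and the airlight.
    t_min is an assumed safeguard, not part of Eq. (1)."""
    t = max(t, t_min)
    return tuple((i - (1.0 - t) * a) / t for i, a in zip(I, A))

A = (0.8, 0.85, 0.9)                         # hypothetical airlight
J = (0.2, 0.4, 0.1)                          # hypothetical scene radiance
t = transmission(distance=50.0, beta=0.02)   # beta * d = 1
I = add_haze(J, t, A)
J_rec = remove_haze(I, t, A)
```

- With t and A known, the inversion recovers J exactly; the difficulty addressed by the technique is that neither t(x) nor J(x) is known in advance.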
- The present technique, as briefly discussed above, is based on the observation that the number of distinct colors in an image is orders of magnitude smaller than the number of pixels, as presented, for example, by Orchard et al. (1991), Id. This assumption has been used extensively in the past and is used for saving color images using indexed colormaps. The present inventors have validated and quantified it on the Berkeley Segmentation Dataset (BSDS300), available online at http://www.eecs.berkeley.edu/Research/Projects/CS/vision/grouping/segbench/, last viewed Apr. 2, 2016. This is a diverse dataset of clear outdoor natural images and thus represents the type of scenes that might be degraded by haze. We clustered the RGB pixel values of each image, such as by using K-means clustering, a Minimum Variance Quantization clustering, and/or the like, to a maximum of 500 clusters, and replaced every pixel in the image with its respective cluster center. The result is an image with 500 different RGB values at most (two orders of magnitude smaller than image size). The PSNR (Peak Signal to Noise Ratio) values of the images generated with the reduced color set, compared to the original ones, were high and ranged from 36.6 dB to 52.6 dB. A histogram of the obtained PSNR values is shown in FIG. 3, as well as the image that had the worst PSNR, before and after color quantization. This validates the haze-lines prior.
- The observation regarding a small number of distinct colors holds for haze-free images. In the presence of haze, object points that belong to the same color cluster end up with different acquired colors since they are located in disparate image areas and thus have different distances from the camera. This prior suggests that pixels that are clustered together in a haze-free image form a line in RGB space in a hazy image. Based on Eq. (1), the two end points of the line are the original color J and the airlight A. These are the haze-lines.
- This prior is demonstrated in FIG. 1. A haze-free image is clustered using K-means to 500 clusters. The pixels belonging to four of these clusters are marked by different color markers in FIG. 1a and their RGB coordinates are plotted in FIG. 1b, demonstrating tight clusters. Note that the clusters include pixels distributed over the entire image that come from objects with different distances from the camera. A synthetic hazy image was generated from the clear image (FIG. 1c) by the method used in R. Fattal. Dehazing using color-lines. ACM Trans. Graph., 34(1):13, 2014. The same pixels as in FIG. 1a are marked. However, now, colors of pixels that belonged to the same color cluster are no longer similar. This is depicted in RGB space in FIG. 1d, where the color coordinates of these pixels are distributed along a haze-line spanned by the original color and the airlight. The pixels marked by purple circles (originating from the sand patch) are located at similar distances, so their distribution along the haze-line is rather tight. However, the pixels marked by orange triangles (grassy areas) are found at different locations in the real world, so they are distributed along the haze-line.
- FIG. 2 demonstrates the haze-lines prior on a hazy outdoor image. Six different pixels identified by our method as belonging to the same haze-line are circled. All of them are on shaded tree trunks and branches, and are likely to have similar radiance J. However, their observed intensity I is quite different, as shown in FIG. 2b, where these pixels form a haze-line in RGB space that passes through the airlight.
- The present technique, in some embodiments thereof, is composed of three core steps: clustering the pixels into haze-lines, estimating a transmission map, and dehazing. Optionally, the estimation of the transmission map is divided into two: first, an estimation of an initial transmission map; second, a regularization step which yields a more accurate transmission map.
- Embodiments of the present technique use an example of an RGB color channel input image. When a non-RGB input image is received (such as CMYK, YIQ, YUV, YDbDr, YPbPr, YCbCr, xvYCC, HSV, HSL, etc.), it may first be converted to RGB using techniques known in the art. Alternatively, the present technique may operate on any color space, with or without respective modifications. For example, the present technique may work directly on non-RGB color spaces with linear transformation to RGB space.
- Optionally, equivalent embodiments may be applied to any spectral image space, such as two color channel, three color channel, four color channel, and/or the like. The maximum number of color channels that an embodiment may automatically process is limited by the limitations of the physical processing hardware, and may include fields of applications that have other technical problems from those described herein, such as the image dehazing of images depicting a landscape, seascape, and/or the like. Therefore, the number of color channels of an image to be automatically processed by an embodiment may be a range between 2 and 15, 3 and 20, 4 and 10, 5 and 25, or any combination thereof.
- Throughout this application, various embodiments of this invention may be presented in a range format. It should be understood that the description in range format is merely for convenience and brevity and should not be construed as an inflexible limitation on the scope of the invention. Accordingly, the description of a range should be considered to have specifically disclosed all the possible subranges as well as individual numerical values within that range. For example, description of a range such as from 1 to 6 should be considered to have specifically disclosed subranges such as from 1 to 3, from 1 to 4, from 1 to 5, from 2 to 4, from 2 to 6, from 3 to 6 etc., as well as individual numbers within that range, for example, 1, 2, 3, 4, 5, and 6. This applies regardless of the breadth of the range.
- Whenever a numerical range is indicated herein, it is meant to include any cited numeral (fractional or integral) within the indicated range. The phrases “ranging/ranges between” a first indicated number and a second indicated number and “ranging/ranges from” a first indicated number “to” a second indicated number are used herein interchangeably and are meant to include the first and second indicated numbers and all the fractional and integral numerals therebetween.
- Optionally, embodiments may be implemented for different imaging modalities, such as different camera images, stereo camera images, photon sensor images, electromagnetic radiation images, particle images, and/or the like. The primary criterion for application to a modality is that the “haze-lines” can be modelled in the color space as an analytical shape (line, arc, parabola, etc.) and that the “airlight” value can be used to remove the unwanted image characteristic.
- Optionally, images may be in two dimensions, three dimensions, four dimensions, five dimensions, and/or the like. For example, a two channel embodiment may use the techniques described herein to partially process the dehazing of an image, or otherwise remove unwanted image characteristics (i.e. glare, prismatic effects, and/or the like). For example, multispectral or hyperspectral images may be processed, such as remote sensing atmospheric images comprising 5 color channels (i.e. atmospheric infrared transparency windows) to remove cloud cover, hazing, glare, and/or the like. Such augmented images may better be used to compute sea surface temperature, vegetation indices, and/or the like. For example, dual-energy computed tomography images may be processed using embodiments of the techniques to remove ghosting.
- The first core step is finding the haze-lines. A may be estimated using conventional methods, such as those in R. Fattal. Single image dehazing. ACM Trans. Graph., 27(3):72, 2008; K. He, J. Sun, and X. Tang. Single image haze removal using dark channel prior. In Proc. IEEE CVPR, 2009; and R. Tan. Visibility in bad weather from a single image. In Proc. IEEE CVPR, 2008.
- Let us define $I_A$ as:
- $$I_A(x) = I(x) - A, \qquad \text{Eq. (3)}$$
- where the three-dimensional (3D) RGB coordinate system is translated such that the airlight is at the origin. Following Eq. (1),
- $$I_A(x) = t(x) \cdot [J(x) - A]. \qquad \text{Eq. (4)}$$
- We express $I_A(x)$ in spherical coordinates:
- $$I_A(x) = [r(x), \theta(x), \phi(x)] \qquad \text{Eq. (5)}$$
- Here r denotes the distance to the origin (i.e., ∥I−A∥), and θ and ϕ denote the longitude and latitude, respectively.
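- Eq. (3) through Eq. (5) can be sketched for individual pixels as follows. The angle conventions used here (longitude from `atan2` over the first two channels, latitude from `asin` over the third) are one possible choice and are an assumption, as are the function name `to_spherical` and the sample values.

```python
import math

def to_spherical(I, A):
    """Return (r, theta, phi) of I - A: radius, longitude, latitude."""
    x, y, z = (i - a for i, a in zip(I, A))
    r = math.sqrt(x * x + y * y + z * z)
    if r == 0.0:
        return 0.0, 0.0, 0.0  # pixel equals the airlight: angles undefined
    theta = math.atan2(y, x)                       # longitude
    phi = math.asin(max(-1.0, min(1.0, z / r)))    # latitude
    return r, theta, phi

A = (0.8, 0.8, 0.8)   # hypothetical airlight
J = (0.2, 0.4, 0.1)   # hypothetical haze-free color
coords = []
for t in (0.9, 0.5, 0.2):
    I = tuple(t * j + (1.0 - t) * a for j, a in zip(J, A))  # Eq. (1)
    coords.append((t, to_spherical(I, A)))
r_full = math.dist(J, A)
```

- For every transmission in the loop, the two angles are identical up to rounding and the radius equals t·∥J−A∥, which is the property the haze-line grouping relies on.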
- The colors of the pixels are now represented in a spherical coordinate system around the airlight.
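- As a minimal sketch of Eqs. (3)-(5), the following Python function translates one RGB color so that the airlight is at the origin and returns its radius and two angles. The function name and the particular angle convention (longitude from atan2 over the first two channels, latitude from asin of the third) are assumptions made for illustration; the text above does not fix them.

```python
import math

def to_spherical(pixel, airlight):
    """Return (r, theta, phi) of I(x) - A; names and angle convention are illustrative."""
    dr, dg, db = (p - a for p, a in zip(pixel, airlight))
    r = math.sqrt(dr * dr + dg * dg + db * db)        # r = ||I - A||
    if r == 0.0:
        return 0.0, 0.0, 0.0                           # angles are undefined at the airlight itself
    theta = math.atan2(dg, dr)                         # longitude, in (-pi, pi]
    phi = math.asin(max(-1.0, min(1.0, db / r)))       # latitude, in [-pi/2, pi/2]
    return r, theta, phi

# The same haze-free color J observed through two different transmissions t.
A = (0.8, 0.85, 0.9)
J = (0.2, 0.5, 0.1)
hazy = [tuple(t * j + (1 - t) * a for j, a in zip(J, A)) for t in (0.9, 0.3)]
coords = [to_spherical(p, A) for p in hazy]
```

The two entries of `coords` share their angles and differ only in r, which is the property the haze-line grouping relies on.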
FIG. 4 shows the histogram of the Forest image (FIG. 2a ) projected onto a sphere. The sphere was sampled uniformly using 500 points. The color at each point [ϕ,θ] indicates the number of pixels x with these angles when writing IA(x) in spherical coordinates (image size 768×1024 pixels). The equator (ϕ=0) is marked by a bold dashed blue line, while the longitudes -
- are marked by dotted blue lines. The color-mapping is logarithmic for illustration purposes. The histogram indicates that the pixels are highly concentrated in terms of their longitude and latitude.
- Let us look at Eq. (4). For given values of J and A, scene points at different distances from the camera differ only in the value of t. In the spherical coordinate system we defined, changes in t affect only r(x) without changing either ϕ(x) or θ(x). In other words, pixels x and y have similar RGB values in the underlying haze-free image when their [ϕ,θ] are similar:
-
J(x)≈J(y)⇒{ϕ(x)≈ϕ(y),θ(x)≈θ(y)},∀t. Eq. (6) - Therefore, pixels belong to the same haze-line when their [ϕ(x),θ(x)] values are similar. Each point on the sphere in
FIG. 4 represents a haze-line, in which all the pixels have approximately the same angles [ϕ(x),θ(x)]. The pixels in each haze-line have similar values in the non-hazy image J with high probability. - Note that there is inherent ambiguity between color and haze for colors which are collinear with the airlight:
-
J1−A=α(J2−A)⇒J1=(1−α)A+αJ2, Eq. (7)
- where α denotes a scale factor. In this case all single image dehazing methods may correct J1 and J2 to the same color. This is the only case in our method when two color clusters may be mapped to the same haze-line.
- In order to determine which pixels are on the same haze-line, pixels should be grouped according to their angles [ϕ,θ]. A two-dimensional (2D) histogram binning of θ and ϕ with uniform edges in the range [0,2π]×[0,π] may not generate a uniform sampling of a sphere. Instead, the samples may be denser near the poles, as observed by G. Marsaglia. Choosing a point from the surface of a sphere. Ann. Math. Statist., 43(2):645-646, 04 1972, since the distance on the sphere is relative to sin(θ). Therefore, we sample the unit sphere uniformly, as shown in
FIG. 4, where each vertex is a sample point. Each vertex corresponds to a haze-line. For clarity of display, the number of samples in FIG. 1 is smaller than the actual number we use. We group the pixels based on their [ϕ(x),θ(x)] values, assigning each pixel to the sample point on the sphere that is closest to its color difference from the airlight (i.e., the difference between the RGB values of the pixel and the airlight color channel values). This may be implemented efficiently by building a KD-Tree from the pre-defined tessellation and querying the tree for each pixel. This is much faster than running a clustering algorithm, such as K-means, the Minimum Variance Quantization algorithm, or the like.
- Based on the analysis of the prior described above, several hundred haze-lines represent an image to a good approximation. In some embodiments, the technique yields between 10-50, 50-100, 100-200, 200-300, 300-400, 400-500, 500-600, 600-700, 700-800, 800-900, or more than 900 haze-lines, each of these ranges constituting a different embodiment. In some embodiments, the number of haze-lines depends on the number of colors in the image; generally, a very colorful image would yield a large number of haze-lines (e.g., above 400), while a relatively pale image would yield a lower number (e.g., below 50). For example, in one experiment, an image of haystacks, which included a relatively low number of colors, was well dehazed using as few as 20 haze-lines.
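- The grouping of pixels by their nearest sample point on the unit sphere may be sketched as follows. Two simplifications are assumed here and are not taken from the text: a golden-spiral (Fibonacci) lattice stands in for the pre-defined uniform tessellation, and a brute-force search replaces the KD-Tree query, so this sketch is slower than the approach described above. All function names are hypothetical.

```python
import math

def fibonacci_sphere(n):
    """Roughly uniform unit vectors (golden-spiral lattice); an assumed stand-in tessellation."""
    golden = math.pi * (3.0 - math.sqrt(5.0))
    points = []
    for i in range(n):
        z = 1.0 - 2.0 * (i + 0.5) / n
        radius = math.sqrt(1.0 - z * z)
        points.append((radius * math.cos(golden * i), radius * math.sin(golden * i), z))
    return points

def assign_haze_lines(pixels, airlight, samples):
    """Label each pixel with the index of the sample direction closest to I(x) - A."""
    labels = []
    for pixel in pixels:
        diff = [p - a for p, a in zip(pixel, airlight)]
        # For unit-length samples, the largest dot product marks the nearest direction.
        labels.append(max(range(len(samples)),
                          key=lambda k: sum(d * s for d, s in zip(diff, samples[k]))))
    return labels
```

Pixels whose differences from the airlight are positive multiples of one another receive the same label, regardless of how much haze attenuates them.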
FIG. 5a depicts the layout of two different haze-lines in the image plane for the Forest image. Pixels belonging to two different haze-lines are depicted in green and blue, respectively. FIG. 5b is a histogram of r(x) within each cluster. The horizontal axis is limited to the range [0,∥A∥], as no pixel may have a radius outside that range in this particular image.
- The second core step of the present technique is to estimate the transmission map. Optionally, this core step is broken into two parts. First, estimation of the initial transmission: for a given haze-line defined by J and A, r(x) depends on object distance:
-
r(x)=t(x)∥J(x)−A∥,0≤t(x)≤1. Eq. (8)
- Thus, t=1 corresponds to the largest radial coordinate:
-
rmax=∥J−A∥. Eq. (9)
- Combining Eqs. (8,9) results in an expression for the transmission based on radii in the haze-line:
-
t(x)=r(x)/rmax. Eq. (10)
- Now, the question is how to find an estimate r̂max for the maximal radius. When a haze-line H contains a haze-free pixel, r̂max is the maximal radius of that haze-line:
-
r̂max(x)=max x∈H {r(x)}, Eq. (11)
- where the estimation is done per haze-line H.
FIG. 5b displays the radii histograms of the two clusters shown in FIG. 5a. We assume that the farthest pixel from the airlight is haze free, and that such a pixel exists for every haze-line. This assumption does not hold for all of the haze-lines in an image; however, the regularization step partially compensates for it. Combining Eqs. (10,11) results in a per-pixel estimation of the transmission:
-
t̃(x)=r(x)/r̂max(x). Eq. (12)
- Following the estimation of the initial transmission, a regularization step may take place, for the following reason. The initial transmission is estimated using the haze-lines, without using any spatial information. As a result, nearby pixels that were clustered to different haze-lines might have significantly different transmission values, while in reality they are nearly at the same distance from the camera. The regularization enforces the image smoothness on the transmission. Where the image is smooth, we expect to find the same object at a similar distance and therefore expect the transmission to change smoothly. On the other hand, when there is a significant gradient (color variance) in the image, it is likely to correspond to a depth discontinuity and we might see a discontinuity in the transmission as well.
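- The per-pixel initial transmission, i.e., each radius divided by the largest radius found in the same haze-line, may be sketched as below. The function name, the list-based inputs, and the fallback value used when a haze-line has zero radius are assumptions for illustration only.

```python
from collections import defaultdict

def initial_transmission(radii, labels):
    """Per-pixel r(x) divided by the maximum radius of the pixel's haze-line."""
    r_max = defaultdict(float)
    for r, h in zip(radii, labels):
        r_max[h] = max(r_max[h], r)
    # A haze-line whose pixels all equal the airlight has r_max == 0; returning 1.0
    # there is an arbitrary choice made for this sketch.
    return [r / r_max[h] if r_max[h] > 0 else 1.0 for r, h in zip(radii, labels)]

radii = [0.2, 0.8, 0.4, 0.5, 0.25]
labels = [0, 0, 0, 1, 1]          # haze-line index of each pixel
t_init = initial_transmission(radii, labels)
```

In this toy input the pixel with radius 0.8 is taken as the haze-free pixel of haze-line 0, so the other pixels on that line receive proportionally lower transmission.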
- Since the radiance J is non-negative (i.e., J≥0), Eq. (1) gives a lower bound tLB on the transmission:
-
tLB(x)=1−min c∈{R,G,B} {Ic(x)/Ac}. Eq. (13)
- In He et al. (described in Single image haze removal using dark channel prior, published in Proc. of IEEE Conf. Comput. Vis. Pattern Recognit. (CVPR) (June 2009), pp. 1956-1963, 978-1-4244-3991-1/09), the transmission estimate is based on an eroded version of tLB. We impose this bound on the estimated transmission, per-pixel:
-
t̃LB(x)=max{t̃(x),tLB(x)} Eq. (14)
- The estimation in Eq. (12) is performed per-pixel, without imposing spatial coherency. This estimation may be inaccurate when a small number of pixels is mapped to a particular haze-line, or in very hazy areas, where r(x) is very small and noise may affect the angles significantly. The transmission map should be smooth, except for depth discontinuities, as observed by Fattal et al. (2014), Id.; K. Nishino, L. Kratz, and S. Lombardi. Bayesian defogging. Int. Journal of Computer Vision (IJCV), 98(3):263-278, 2012; Tan (2008), Visibility in bad weather from a single image, in Proc. IEEE CVPR, 2008; and J.-P. Tarel and N. Hautiere. Fast visibility restoration from a single color or gray level image. In Computer Vision, 2009 IEEE 12th International Conference on, pages 2201-2208, September 2009 (hereinafter Tarel).
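- The lower bound and its use in Eq. (14) may be illustrated as follows. The sketch assumes the haze model I = t·J + (1−t)·A; with J ≥ 0 this gives I ≥ (1−t)·A in every channel, hence t ≥ 1 − Ic/Ac for each channel c, and the tightest bound uses the smallest ratio. The function names are hypothetical.

```python
def lower_bound(pixel, airlight):
    """Smallest transmission consistent with a non-negative radiance J."""
    return 1.0 - min(i / a for i, a in zip(pixel, airlight))

def clamp_transmission(t_init, pixel, airlight):
    """Eq. (14): never let the estimate fall below the lower bound."""
    return max(t_init, lower_bound(pixel, airlight))
```

For example, with airlight (0.8, 0.9, 0.9) and pixel (0.4, 0.45, 0.9) the channel ratios are 0.5, 0.5 and 1.0, so any initial estimate below 0.5 is raised to 0.5.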
- We seek a transmission map t̂(x) that is similar to t̃LB(x) and is smooth when the input image is smooth. Mathematically, we minimize the following function w.r.t. t̂(x):
-
Σx [t̂(x)−t̃LB(x)]²/σ²(x) + λ Σx Σy∈Nx [t̂(x)−t̂(y)]²/∥I(x)−I(y)∥², Eq. (15)
- where λ denotes a parameter that controls the trade-off between the data and the smoothness terms, Nx denotes the four nearest neighbors of x in the image plane, and σ(x) denotes the standard deviation of t̃LB, which is calculated per haze-line.
- σ(x) plays a significant role since it allows us to apply our estimate only to pixels where the assumptions hold. When the variance is high, the initial estimation is less reliable. σ(x) increases as the number of pixels in a haze-line decreases. When the spread of the radii in a given haze-line is small, our haze-line assumption does not hold, since we do not observe pixels with different amounts of haze. In such cases, σ(x) increases as well.
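- To illustrate how such a regularization behaves, the following sketch runs Gauss-Seidel sweeps on a single scanline. It assumes a cost made of a data term weighted by 1/σ²(x) plus λ times squared differences between neighbors divided by the squared intensity difference; this functional form, the small constant `eps`, the one-dimensional neighborhood, and the iterative solver are all assumptions of the sketch, whereas the technique described here uses four neighbors in the image plane and solves a sparse linear system.

```python
def regularize_1d(t_data, sigma, intensity, lam=0.1, eps=1e-4, sweeps=200):
    """Gauss-Seidel sweeps for an assumed data-plus-smoothness cost on one scanline."""
    t = list(t_data)
    n = len(t)
    for _ in range(sweeps):
        for i in range(n):
            num = t_data[i] / sigma[i] ** 2        # data term pulls toward the initial estimate
            den = 1.0 / sigma[i] ** 2
            for j in (i - 1, i + 1):
                if 0 <= j < n:
                    # Large weight where the image is flat, small weight across an edge.
                    w = 2.0 * lam / ((intensity[i] - intensity[j]) ** 2 + eps)
                    num += w * t[j]
                    den += w
            t[i] = num / den                        # minimizer of the cost in t[i] alone
    return t
```

Each update is a weighted average of the initial estimate and the neighboring values, so the result stays within the range of the input; it is smoothed inside flat image regions and left largely intact across strong intensity edges.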
- The third core step of the technique is the dehazing: once t̂(x) is calculated as the minimizer of Eq. (15), the dehazed image is calculated using Eq. (1):
-
Ĵ(x)={I(x)−[1−t̂(x)]A}/t̂(x). Eq. (16)
- The technique is summarized in Algorithm 1 below, and exemplary results thereof are demonstrated in FIG. 6.
- Intermediate and final results of our method: (a) the input hazy image, (b) the dehazed image, (c) the distance r(x) of every pixel in the hazy image to the airlight, (d) the estimated radii r̂max(x) calculated according to Eq. (11), (e) the input image, with the pixels x for which r(x)=r̂max(x) marked by cyan circles, (f) the data term confidence in Eq. (15), colormapped (warm colors show the larger values), (g) the estimated transmission map t̃(x) before the regularization, (h) the final transmission map t̂(x) after regularization. (g) and (h) are colormapped.
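- The recovery of the dehazed color in Eq. (16) is a direct inversion of the haze model and may be sketched per pixel as below. The floor `t_min` on the transmission is an assumption added to keep the division stable in very hazy regions; it is not part of Eq. (16), and the function name is hypothetical.

```python
def recover_radiance(pixel, airlight, t, t_min=0.1):
    """Invert I = t*J + (1 - t)*A for J, flooring t to limit noise amplification."""
    t = max(t, t_min)   # assumed safeguard, not part of Eq. (16)
    return tuple((i - (1.0 - t) * a) / t for i, a in zip(pixel, airlight))
```

Applying the function to a pixel synthesized with a known J, A and t (with t above the floor) returns J up to rounding.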
-
FIG. 6a shows the input hazy image. The final, dehazed image is shown in FIG. 6b. FIG. 6c shows the distance r(x) in RGB space of every pixel in the hazy image to the airlight. Note that this distance decreases as haze increases. FIG. 6d shows the maximum radii r̂max(x) per haze-line, calculated according to Eq. (11). Observe that FIG. 6d is much brighter than FIG. 6c. Since larger values are represented by brighter colors, this indicates that the distance to the airlight is increased. The pixels x with the maximum radius in their haze-line (for which r(x)=r̂max(x)) are marked by cyan circles on the hazy image in FIG. 6e. Note that these pixels are mostly in the foreground, where indeed there is a minimal amount of haze. We filtered out pixels that had a maximum radius in the haze-line, yet had σ>2, because the model assumptions do not hold for these haze-lines. This happens in the sky, because the distance to the airlight in RGB space is so small that clustering according to the angles is not reliable due to noise. In the regularization step this fact is taken into consideration through the data-term weight 1/σ²(x), which is shown in FIG. 6f as the colormapped data term confidence in Eq. (15) (warm colors depict high values). The ratio of FIGS. 6c and 6d yields the initial transmission t̃(x) that is shown in FIG. 6g. The transmission map after regularization is shown in FIG. 6h. While t̃(x) contains fine details even in grass areas that are at the same distance from the camera, t̂(x) does not exhibit this behavior. This indicates the regularization is advantageous.
Algorithm 1: Haze Removal.
Input: I(x), A
Output: Ĵ(x), t̂(x)
1: IA(x) = I(x) − A
2: Convert IA to spherical coordinates to obtain [r(x), ϕ(x), θ(x)]
3: Cluster the pixels according to [ϕ(x), θ(x)]. Each cluster H is a haze-line.
4: for each cluster H do
5:   Estimate maximum radius: r̂max(x) = max x∈H {r(x)}
6: for each pixel x do
7:   Estimate transmission: t̃(x) = r(x)/r̂max(x)
8: Perform regularization by calculating t̂(x) that minimizes Eq. (15)
9: Calculate the dehazed image using Eq. (16)
- Optionally, the clustering of pixels in spherical coordinates is performed by mapping the color differences between the pixels of the digital image and the airlight value onto a pre-computed tessellation of the unit sphere, where the pre-computed tessellation is uniformly sampled and stored in Cartesian coordinates in a k-dimensional tree (KD-tree). A KD-tree is a computerized data structure for organizing points in a space with k dimensions. It is a binary search tree with constraints imposed on it. KD-trees are very useful for nearest neighbor searches (e.g., in a color space). The search for nearest neighbors on the KD-tree may be performed using Euclidean distances. The pixels are grouped with the color samples based on the nearest neighbors, thereby producing multiple groups, each being one of the haze-lines.
- As to the computational complexity of the present technique, the algorithm is linear in N, the number of pixels in the image, and therefore fast. The clustering is done using a nearest neighbor search on a KD-Tree with a fixed number of points. Estimating the radius within each cluster is linear in N. Therefore, the initial radius estimation is O(N). Seeking the minimum of Eq. (15) requires solving a sparse linear system, which is also O(N). Restoring the dehazed image from the transmission map is O(N) as well.
- The inventors have evaluated the technique on a large dataset containing both natural and synthetic images and compared its performance to state-of-the-art algorithms. We assumed A is given, and used the airlight vector A calculated by M. Sulami, I. Geltzer, R. Fattal, and M. Werman. Automatic recovery of the atmospheric light in hazy images. In Proc. IEEE ICCP, 2014 (hereinafter Sulami). We used the same parameters for all of the images: in Eq. (15) we set λ=0.1 and we scaled 1/σ²(x) to be in the range [0,1] in order to avoid numeric issues. In order to find the haze-lines, we sampled uniformly 1000 points on the unit sphere (FIG. 4 shows only 500 for clarity).
- A synthetic dataset of hazy images of natural scenes was introduced by Fattal et al. (2014), Id., and is available online, at http://www.cs.huji.ac.il/~raananf/projects/dehaze_c1/results/, last viewed Apr. 4, 2016. The dataset contains eleven haze-free images, synthetic distance maps and corresponding simulated haze images. Identically-distributed zero-mean Gaussian noise with three different noise levels, σn=0.01, 0.025, 0.05, was added to these images (with image intensity scaled to [0,1]). Table 1 summarizes the L1 errors on non-sky pixels (same metric used in Fattal et al. (2014), Id.) of the transmission maps and the dehazed images. Our technique is compared to the method of Fattal et al. (2014), Id. and an implementation of He et al. (2009), Id. by Fattal et al. (2014), Id. For five images out of this dataset, results of both clear and noisy images are provided by Fattal et al. (2014), Id.
-
TABLE 1 Comparison of L1 errors over synthetic hazy images with various amounts of noise. The noise standard deviation σ is given and the images are scaled to the range [0, 1]. The table compares the L1 errors of the estimated transmission maps (left value) and the dehazed images (right value).
Image     σ      He et al. (2009)  Fattal et al. (2014)  The present technique
Road1     0      0.097/0.051       0.069/0.033           0.058/0.040
Road1     0.01   0.100/0.058       0.068/0.038           0.061/0.045
Road1     0.025  0.106/0.074       0.084/0.065           0.072/0.064
Road1     0.05   0.136/0.107       0.120/0.114           0.091/0.100
Lawn1     0      0.118/0.063       0.077/0.035           0.032/0.026
Lawn1     0.01   0.116/0.067       0.056/0.038           0.032/0.032
Lawn1     0.025  0.109/0.077       0.056/0.065           0.052/0.056
Lawn1     0.05   0.115/0.102       0.114/0.121           0.099/0.107
Mansion   0      0.074/0.043       0.042/0.022           0.080/0.049
Mansion   0.01   0.067/0.040       0.048/0.030           0.088/0.056
Mansion   0.025  0.057/0.044       0.065/0.051           0.104/0.072
Mansion   0.05   0.083/0.075       0.081/0.080           0.116/0.095
Church    0      0.07/0.048        0.039/0.025           0.047/0.032
Church    0.01   0.067/0.050       0.053/0.043           0.049/0.041
Church    0.025  0.058/0.059       0.089/0.081           0.047/0.057
Church    0.05   0.087/0.121       0.121/0.136           0.043/0.092
Raindeer  0      0.127/0.068       0.066/0.034           0.089/0.045
Raindeer  0.01   0.119/0.066       0.077/0.042           0.093/0.049
Raindeer  0.025  0.109/0.067       0.084/0.054           0.104/0.063
Raindeer  0.05   0.117/0.085       0.106/0.083           0.131/0.092
- As illustrated in Table 1, the present technique outperforms previous methods in most cases, and handles the noise well. As expected, our performance degrades when the noise variance increases. However, our technique maintains its ranking, with respect to other methods, regardless of the amount of noise. This shows that our algorithm is quite robust to noise, despite being pixel-based.
-
FIGS. 7 and 8 compare results to six state-of-the-art single image dehazing methods: C. O. Ancuti and C. Ancuti. Single image dehazing by multiscale fusion. IEEE Trans. on Image Processing, 22(8):3271-3282, 2013 (hereinafter Ancuti); Fattal et al. (2014), Id.; K. B. Gibson and T. Q. Nguyen. An analysis of single image defogging methods using a color ellipsoid framework. EURASIP Journal on Image and Video Processing, 2013(1), 2013; R. Luzon-Gonzalez, J. L. Nieves, and J. Romero. Recovering of weather degraded images based on RGB response ratio constancy. Appl. Opt., 2014; He et al. (2009), Id.; K. Nishino, L. Kratz, and S. Lombardi. Bayesian defogging. Int. Journal of Computer Vision (IJCV), 98(3):263-278, 2012; and K. Tang, J. Yang, and J. Wang. Investigating haze-relevant features in a learning framework for image dehazing. In Proc. IEEE CVPR, 2014 (hereinafter Tang). - As previously noted by Fattal et al. (2014), Id., the image after haze removal might look dim, since the scene radiance is usually not as bright as the airlight. For display, we performed a global linear contrast stretch on the output, clipping 0.5% of the pixel values both in the shadows and in the highlights. Pixels whose radius is maximal in their haze-line are marked in pink on the hazy input. We marked only pixels x for which σ(x)<2 and for clarity, only ones that belong to large clusters.
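- The display step mentioned above, a global linear contrast stretch that clips 0.5% of the values at each end, may be sketched as follows. The nearest-rank percentile rule and the function name are assumptions for illustration; the input is a flat list of all channel values of the dehazed image.

```python
def contrast_stretch(values, clip=0.005):
    """Global linear stretch to [0, 1], clipping a fraction `clip` at each end."""
    ordered = sorted(values)
    n = len(ordered)
    lo = ordered[int(clip * (n - 1))]            # shadow clipping level (assumed percentile rule)
    hi = ordered[int((1.0 - clip) * (n - 1))]    # highlight clipping level
    if hi <= lo:
        return [0.0 for _ in values]             # degenerate (constant) input
    return [min(1.0, max(0.0, (v - lo) / (hi - lo))) for v in values]
```

Because the same two levels are applied to every value, the stretch brightens a dim result without changing the relative order of the pixel values.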
- The method of Ancuti et al. (2013), Id. leaves haze in the results, as seen in the areas circled in yellow. In the result of Luzon-Gonzalez et al. (2014), Id. there are artifacts in the boundary between segments (pointed by arrows). The method of Nishino et al. (2012), Id. tends to oversaturate (e.g., House). The methods of He et al. (2009), Id. and Tang et al. (2014), Id. produce excellent results in general but lack some micro-contrast when compared to Fattal et al. (2014), Id. and to ours. This is evident in the zoomed-in buildings shown in Cityscape results, where in our result and in Fattal et al. (2014), Id. the windows are sharper than in He et al. (2009), Id. and Tang et al. (2014), Id. The result of Gibson et al. (2013), Id. was not enlarged as it has a low resolution. Results of Fattal et al. (2014), Id. are sometimes clipped, e.g., the leaves in House and in the sky in Forest. Our assumption regarding having a haze-free pixel in each haze-line does not hold in Cityscape, as evident by several hazy pixels that set a maximum radius, e.g. the red buildings. Despite that, the transmission in those areas is estimated correctly due to the regularization that propagates the depth information spatially from the other haze-lines.
-
FIG. 8 compares both the transmission maps and the dehazed images. It shows our technique is comparable to other methods, and in certain cases works better. For example, the two rows of trees are well separated in our result when compared to He et al. (2009), Id.
- A major advantage of the global approach of the present technique is the ability to cope well with fast variations in depth, when the details are smaller than the patch size.
FIG. 9a shows an enlarged portion of an image, where clear artifacts are visible in the result of Fattal et al. (2014), Id. (FIG. 9c), around the leaves and at the boundary between the trunk and the background. FIG. 9d shows our result. A patch-based method is less likely to estimate the distance of such scenes accurately. The result of He et al. (2009), Id. does not exhibit these artifacts in FIG. 9b, because the dehazing is less effective in this image and the details are less clear (e.g., the circled trunk). This phenomenon is also visible in FIG. 7 in the dehazed Cityscape image of Gibson et al. (2013), Id., where a halo between the trees in the foreground and the background is visible, and also in the train output of Fattal et al. (2014), Id. around the pole (marked by a yellow square).
- Using a fixed tessellation of the unit sphere might raise a concern that fine tones may not be distinguished.
FIG. 10 demonstrates this is not the case. The pumpkins (a crop of FIG. 6a) are lit from above, and therefore are brighter at the top and gradually become darker towards the ground (FIG. 10 left). FIG. 10 right depicts the cluster map; each color symbolizes a different haze-line. The gradual tone change is evident in the cluster map.
- The precise technique presented above in the framework of the experimental results section is considered to be an embodiment of the present invention.
- Optionally, a Hough transform in RGB space is used to automatically calculate airlight values, such as a set of color channel values for an airlight coordinate in RGB space. Hough transforms find imperfect instances of haze-lines by a voting procedure, the voting procedure carried out in a parameter space. Haze-line candidates are automatically obtained as local maxima in an “accumulator space” that is constructed by the Hough transform. For example, clusters of points are automatically modeled as haze-lines by the Hough transform, and each point in each cluster may vote for the airlight RGB values, such as in a naïve embodiment.
- Using the Hough transform, a global airlight value may be automatically determined in hazy images quickly and efficiently. The method is based on the haze-line model introduced herein, which considers clusters of pixel intensities with similar colors to form lines in RGB space under haze. These lines may intersect at the airlight color, and we take advantage of this observation to find their point of intersection.
- For example, given a candidate airlight coordinate in RGB space, we model pixels' intensities with a fixed set of lines emanating from the airlight candidate. That is, we wish to model pixels' values by an intersection point (i.e., the airlight) and a collection of lines (i.e., the Haze-Lines). An airlight in the correct RGB location may fit the data better than an airlight in a wrong location.
- We search for an airlight point so that all lines emanating from the airlight point, in the given line directions, may fit the data. For that we use the Hough transform, where the point with the highest vote is assumed to be the airlight color. Running the Hough transform in three or four dimensions (3D or 4D) may be computationally expensive, so we may use two optional techniques to automatically accelerate the technique. One option is to work in 2D color spaces instead of a 3D color space, for example, by automatically projecting pixels' values on the RG, GB and RB planes. The second option is by automatically clustering pixels' values to collect votes for a candidate airlight from cluster centers and weight each vote by a statistical parameter of the cluster, such as the cluster size, rather than collecting votes from all pixels. The actions for processing the digital images described herein may be performed completely automatically as no user intervention is required for the steps.
- For example, we reduce the problem from 3D to 2D by considering the projection of pixel values on the RG, GB and RB planes. We may combine the votes in the three planes to obtain the final airlight estimation, such as a single airlight value selected from multiple airlight candidates. As used herein the term airlight color channel value set means a set of three or more color channel values, corresponding to the pixel color channel values. For example, the set is a set of RGB values. This has a dramatic effect on the number of airlight candidates we need to sample and evaluate. Second, we may cluster all pixels in the image into roughly a thousand clusters. As a result of the optional improvements, the airlight value may be determined in a matter of seconds, as opposed to minutes in the naïve implementation. We demonstrate our method on other real-world images and synthetic data. Our method may be more efficient than state-of-the-art methods (linear vs. quadratic complexity) and performs on-par with them. For example, the proposed algorithm's complexity is linear in the number of pixels in the image, compared to alternatives which are quadratic. As a reference, the run-time of our MATLAB implementation on a desktop with a 4th generation Intel core i7 CPU @3.4 GHz and 32 GB of memory is on average 6 seconds for a 1 Mpixel image.
- Following is a detailed technical description of applying the Hough transform technique for determining airlight values. When using a Hough transform to estimate the airlight value, we may detect unknown parameters of a model given noisy data via a voting scheme. In this case, the voting procedure is carried out in a parameter space consisting of candidate airlight values in RGB space. In particular, we uniformly sample a fixed set of line angles {θk,ϕk}k=1 K. Given this set, we consider a discrete set of possible airlight values. The distance between a pixel I(x) and the line defined by the airlight value A and a pair of angles (θ,ϕ) is:
-
d(I(x),(A,ϕ,θ))=∥(A−I(x))×(cos(θ),sin(ϕ))∥. Eq. (35) - A pixel vote may be assigned to a candidate A when the distance to one of the lines is smaller than a threshold τ. This threshold is adaptive and depends on the distance between A and I(x) to allow for small intensity variations. For example, instead of working with cylinders (lines with a fixed threshold) we work with cones (lines with a variable threshold). Formally:
-
- In addition, we allow a pixel to vote only for an airlight that is brighter than the pixel, such as by computing the brightness from the color channel values and comparing. This is due to the fact that bright objects are quite rare, as shown empirically to justify the dark channel prior, and usually do not contain information about the haze (e.g., a bright building close to the camera).
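- The vote test for a single color in one two-channel plane may be sketched as below: the color supports a candidate airlight when it lies close to a line through the candidate at a given angle and the candidate is brighter in both channels. The distance is the magnitude of the 2D cross product between (A − I) and the line direction. The exact distance-dependent threshold is not reproduced in the text above, so the cone-shaped form `tau0 * (1 + distance)` used here is purely hypothetical, as is the function name.

```python
import math

def supports(color, airlight, theta, tau0=0.02):
    """True when a 2-D color votes for `airlight` along the line at angle `theta`."""
    dx, dy = airlight[0] - color[0], airlight[1] - color[1]
    # |(A - I) x (cos(theta), sin(theta))| is the distance from the color to the line.
    dist_to_line = abs(dx * math.sin(theta) - dy * math.cos(theta))
    tau = tau0 * (1.0 + math.hypot(dx, dy))      # hypothetical threshold growing with distance
    brighter = airlight[0] > color[0] and airlight[1] > color[1]
    return dist_to_line < tau and brighter
```

A threshold that grows with the distance from the candidate tolerates larger deviations for colors far from the airlight, which corresponds to working with cones rather than cylinders.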
- The best representation of the pixels' values from a hazy image may be found with airlight A and fixed line directions {θk,ϕk}k=1 K. This may be formulated as follows:
-
- where 1 [⋅] is an indicator function that equals 1 when true and equals 0 otherwise. The term 1[A>I(x)] equals 1 when all elements of A are greater than the corresponding elements of I(x).
- A huge value of A>>1 might be chosen as the solution, since it maximizes Eq. (37) with all of the pixels in the same large cone. To prevent this, we give a larger weight to values of A that are close to the pixels' values. Formally, we optimize:
-
- where f(y)=1+4·e^(−y) is a fast decaying weight that gives preference to values of A in the vicinity of the pixels' distribution.
- The proposed scheme, which includes collecting votes from all pixels for all angles and airlight candidates in the 3D RGB space, is computationally expensive. Therefore, we propose the following approximations, which significantly accelerate the computation while maintaining accuracy. The first, clustering the colors in the image and using the cluster centers instead of all the pixels. The second, performing the voting scheme in two dimensions. The voting is repeated three times, with only two of the (R,G,B) color channels being used each time.
- Color clusters may be quantized before the Hough voting, such as quantizing the image into N clusters. We may do this by converting the RGB image into an indexed image with a unique color palette of length N. This may give us a set of N typical color values, {In}n=1 N, where N is much smaller than the number of pixels in the image. In addition, we have {wn}n=1 N, the number of pixels in the image belonging to each cluster. During the Hough voting procedure, each representative color value In votes based on its distance to the candidate airlight, and the vote has a relative strength wn. Therefore, the final optimization function is:
-
- Calculating the full 3D accumulator for all possible airlight values is computationally expensive. Therefore, voting may be done in a lower dimension. The accumulator may be seen as the joint probability distribution of the airlight in all color channels, where the final selected value is the one with the maximal probability. By performing the accumulation two color channels at a time, we calculate three marginal probabilities, where each time the summation is performed on a different color channel. Finally, we look for a candidate airlight that may maximize the 3D volume created by the outer product of the marginal accumulators. The proposed Hough technique is summarized in Algorithm 2.
-
Algorithm 2: Airlight Estimation
Input: hazy image, I(x)
Output: airlight value, Â
1: cluster the pixels' colors and generate an indexed image Î(x) whose values are n ∈ {1, . . . , N}, a colormap {In}, n = 1, . . . , N, and cluster sizes {wn}, n = 1, . . . , N
2: for each pair of color channels (c1, c2) ∈ {RG, GB, RB} do
3:   initialize accum c1,c2 to zero
4:   for A = (m · ΔA, l · ΔA), m, l ∈ {0, . . . , M} do
5:
6:     for n ∈ {1, . . . , N} do
7:       d = |(A − In(c1, c2)) × (cos(θk), sin(θk))|
8:       if (d < τ) ∧ (m · ΔA > In(c1)) ∧ (l · ΔA > In(c2)) then accum c1,c2 (k, m, l) += wn · f(∥A − In∥)
9: Â = argmax{accumR,G ⊗ accumG,B ⊗ accumR,B}, where ⊗ is an outer product
10: Return Â
- The algorithm's run-time depends on the following parameters: the number of pixels in the image P, the number of airlight candidates (in each color channel) M, the number of color clusters N and the number of haze-line orientations K. The conversion from RGB to an indexed image has a run-time complexity of O(NP), while the airlight estimation using the indexed image has a run-time complexity of O(NKM²).
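- The final selection step, which combines the three two-channel accumulators into one three-channel estimate, may be sketched as follows. The sketch assumes each accumulator has already been summed over the line angles, so that it is an M×M table indexed by the two candidate channel indices; that reduction and the function name are assumptions. Multiplying the returned indices by ΔA gives the airlight value.

```python
def combine_marginals(acc_rg, acc_gb, acc_rb):
    """Return (r, g, b) indices maximizing acc_rg[r][g] * acc_gb[g][b] * acc_rb[r][b]."""
    m = len(acc_rg)
    best, best_score = None, float("-inf")
    for r in range(m):
        for g in range(m):
            for b in range(m):
                score = acc_rg[r][g] * acc_gb[g][b] * acc_rb[r][b]
                if score > best_score:
                    best, best_score = (r, g, b), score
    return best
```

The product is largest where all three planes agree, so a candidate supported in only one or two planes is outvoted by one supported in all three.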
- Reference is now made to
FIG. 18, which shows an exemplary selection of airlight values from voting arrays in different 2D planes, accum c1,c2, (c1, c2)∈{RG,GB,RB}, as a function of the candidate airlight values. The color-map indicates the number of votes, and an airlight value set is selected from the accumulated assigned votes from all pixels. In this case, the ground-truth airlight had the most votes in all planes (strong yellow color). The bottom row images
- We validate the proposed method on a diverse set of images. In all of our experiments we use the following parameters: N=1000, the number of color clusters for each image (some images have fewer typical colors, resulting in empty clusters and N<1000 in practice); K=40, the number of angles, i.e., haze-lines, in each plane; all of the pixels' intensities are normalized to the range [0,1], and therefore we set ΔA=0.02 and
-
- the threshold τ0=0.02 determines whether a pixel In supports a certain haze-line.
- Reference is now made to
FIG. 19 , which shows images used for comparison of airlight values computed with Hough transform and with alternative techniques. These images show the results from evaluating the accuracy of the estimated airlight on natural images with different techniques. The results are from the techniques of Sulami, He et al, Bahat et al (described in Blind dehazing using internal patch recurrence, Proceedings of the 2016 IEEE International Conference on Computational Photography (ICCP), 13-15 May 2016, EISBN: 978-1-4673-8623-4, DOI: 10.1109/ICCPHOT.2016.7492870), and the present invention. Following is table 2 summarizing the L2 errors between the techniques from analysis of 40 images. -
TABLE 2
Statistic         Ours   Bahat  He     Sulami
Mean L2 error     0.119  0.111  0.129  0.255
Median L2 error   0.063  0.086  0.109  0.191
Variance          0.014  0.013  0.013  0.061
- Generally, the present embodiment and Bahat outperformed the others. Compared to Bahat, our embodiment results in a lower median error, with slightly higher mean and variance. The performance depends on the extent to which the image adheres to the prior used by each method.
-
Images -
TABLE 3 (range: 0-1)
Image 19A, name = road.png
  He:      0.94118   0.97647   0.98824
  Sulami:  1.1842    1.2987    1.353
  Bahat:   0.85481   0.92456   0.94885
  ours:    0.96      0.99      1
  GT:      0.84428   0.93775   0.97859
Image 19B, name = schechner.png
  He:      0.76078   0.73333   0.6902
  Sulami:  0.48603   0.8293    1.1188
  Bahat:   0.84718   0.79573   0.73942
  ours:    0.52      0.63      0.8
  GT:      0.50074   0.59792   0.75972
Image 19C, name = train.png
  He:      0.70588   0.70588   0.71373
  Sulami:  510183.8  510183.5  510185
  Bahat:   0.75459   0.75287   0.7529
  ours:    0.72      0.71      0.72
  GT:      0.72548   0.72693   0.73123
Image 19D, name = underwaterVessel.png
  He:      0.56471   0.71765   0.68235
  Sulami:  0.16142   2.2662    3.5469
  Bahat:   0.42121   0.76008   0.93351
  ours:    0.42      0.75      0.92
  GT:      0.28975   0.60814   0.83459
-
TABLE 4 (range: 0-255)
Image 19A, name = road.png
  He:      240  249  252
  Sulami:  255  255  255
  Bahat:   218  236  242
  ours:    245  252  255
  GT:      215  239  250
Image 19B, name = schechner.png
  He:      194  187  176
  Sulami:  124  211  255
  Bahat:   216  203  189
  ours:    133  161  204
  GT:      128  152  194
Image 19C, name = train.png
  He:      180  180  182
  Sulami:  255  255  255
  Bahat:   192  192  192
  ours:    184  181  184
  GT:      185  185  186
Image 19D, name = underwaterVessel.png
  He:      144  183  174
  Sulami:  41   255  255
  Bahat:   107  194  238
  ours:    107  191  235
  GT:      74   155  213
- The error bars corresponding to them in 19E are labeled. In the
Road image 19A our error is larger than that of Bahat. This may be caused by several bright pixels that have a high red value. In the Schechner image 19B our method outperforms all other methods. In the Train image 19C, all methods except Sulami perform well. In the Vessel image 19D all methods yield relatively high errors. This may be because the airlight is not uniform across the scene. - Reference is now made to
FIG. 20, which shows a comparison of selections of airlight values from Hough transform for full and cropped images. In the images of FIG. 20, the cropped region is marked by a dotted line. The estimated airlight values of the full and cropped images are shown, as well as the GT value extracted manually from the images. Our cropped image estimations are close to the ones estimated from the full image. The largest error, both before and after cropping, was calculated for the right image on the second row from the top, which had an L2 error of 0.35. - Reference is now made to
FIG. 21, which shows a comparison of dehazing between Hough transform (21C and 21D) and an alternative technique. The alternative technique yields an incorrect transmission map 21B, while the transmission in 21D approximately describes the scene structure. In the transmission maps, the darker colors are farther from the sensor, and the lighter are closer. As seen in 21B, with the previous technique the image intensities cause the brighter buildings to be considered farther (bottom right), while in 21D the transmission map shows much better visual correspondence with the “depth” of the depicted objects. - Following are results of a comparative analysis of synthetic images. In Sulami, the images were simulated from haze-free RGB images and their distance maps, gathered from the Lightfields and the Middlebury datasets used respectively in He et al and Scharstein et al (described in A taxonomy and evaluation of dense two-frame stereo correspondence algorithms, IJCV, 47(1-3):7-42, 2002). The transmission maps were calculated by t(x)=e^{−βd(x)}, and β was chosen such that the most distant object in the scene received t=0.1. The airlight magnitude was uniformly sampled in the range [0.8,1.8] and the orientation was uniformly sampled from the 10° cone around [1,1,1]. The sampling process was repeated three times for each image and the results are reported in Sulami. We did not perform a per-image comparison of the techniques. Instead we report average and median errors in Table 5.
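- The choice of β for the synthetic transmission maps may be illustrated as follows. This is a minimal sketch under stated assumptions, not the simulation code used in Sulami: the function names and the flattened distance values are hypothetical, and only the rule t(x)=e^{−βd(x)} with t=0.1 at the most distant object is taken from the text.

```python
import math

def beta_for_min_transmission(distances, t_min=0.1):
    """Pick beta so that the farthest scene point receives transmission t_min."""
    d_max = max(distances)
    return -math.log(t_min) / d_max

def transmission(distances, beta):
    """Evaluate t(x) = exp(-beta * d(x)) for every entry of the distance map."""
    return [math.exp(-beta * d) for d in distances]

# Hypothetical flattened distance map (arbitrary units).
distances = [1.0, 2.5, 5.0, 10.0]
beta = beta_for_min_transmission(distances)
t = transmission(distances, beta)
```

Because β is derived from the largest distance, the smallest transmission in the map equals t_min by construction and all closer points receive larger values.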
-
TABLE 5
                               He      Tan     Tarel   Sulami  Ours
Orientation         Mean       3.218   3.576   3.253   0.581   0.043
                    Median     3.318   3.316   3.49    0.22    0.037
Magnitude           Mean       0.172   0.218   0.412   0.157   0.178
                    Median     0.141   0.208   0.393   0.116   0.095
l∞ Endpoint Error   Mean       0.147   0.177   0.278   0.103   0.141
                    Median     0.144   0.178   0.286   0.077   0.106
- Some of the images in this dataset are indoor images, whose depth distribution is significantly different from that of outdoor images. Despite that, our results are competitive. Specifically, our orientation estimation is the most accurate, which is significant. It has been shown in [15] that estimating the airlight's orientation is more important than its magnitude, since errors in the orientation induce color distortions in the dehazed image, whereas magnitude errors induce only brightness distortions.
FIG. 5 shows two examples of synthetic images used in this experiment. - As light propagates in water it is attenuated and scattered. Both effects depend on the distance the light travels and its wavelength, as shown, for example, by Mobley, C. D.: Light and water: radiative transfer in natural waters. Academic press (1994). The wavelength-dependent attenuation causes color distortions that depend on the object's distance and therefore cannot be globally compensated for. The scattering induces a distance dependent additive component on the scene that reduces contrast. As a result, many underwater images appear blue and lack vivid colors.
- Nevertheless, color and contrast are extremely important for visual surveys in the ocean. For example, enhanced images may improve automatic segmentation, increase the quality of feature matching between images taken from multiple viewpoints, and aid in identification.
- Present embodiments aim to recover the object's colors in scenes photographed under ambient illumination in water using solely a single image as an input. Another aim is to recover a distance map of the photographed scene. This problem is closely related to the single image dehazing problem discussed above, in which images are degraded by weather conditions such as haze or fog. The above dehazing technique assumes that the attenuation is uniform across colors.
- Under water, where the assumption of color-independent attenuation does not hold, there are theoretically three unknown transmission values per pixel (one per channel), yielding six unknowns with only three measurements. However, the color-dependent transmission is related to the distance via the attenuation coefficients. Based on this relation we show that the problem may be reduced to four unknowns per pixel as before, with two new global parameters—the ratios between the attenuation coefficients of the color channels.
- We show that when the attenuation ratios between the different channels are known, then the input image may be converted to a medium-compensated image where the attenuation coefficient is the same for all color channels. Then, the above image dehazing technique may be used to solve the problem. In alternative embodiments, other image dehazing techniques may be applied to the medium-compensated image. We are left with the question of how to estimate the two additional global parameters. To this end, we show that using the wrong parameters results in images with distorted colors. Hence, we automatically choose the parameters as the ones that yield the best looking image. This is defined as the image that best adheres to the gray world assumption, that was used before for above and under water imaging. We find the correct parameters by sampling the parameter space, which is bounded by known physical measurements of naturally occurring water types.
- The results of experiments conducted by the inventors demonstrate single image restoration of underwater scenes using the full physical image formation model. Thus, we are able to recover complex 3D scenes and, in addition, estimate the water properties.
- We follow the model developed in Schechner, Y. Y., Karpel, N.: Recovery of underwater visibility and structure by polarization analysis. IEEE J. Oceanic Engineering 30(3) (2005) 570-587. In each color channel c∈{R,G,B}, the image intensity at each pixel is composed of two components, attenuated signal and veiling-light:
-
I_c(x) = t_c(x)·J_c(x) + (1 − t_c(x))·A_c, Eq. (17) - where x denotes the pixel coordinate, I_c denotes the acquired image value in color channel c, t_c denotes the transmission of that color channel, and J_c denotes the image value of the object that would have been acquired without the scattering and absorption of the water medium. The global veiling-light component A_c denotes the scene value in areas with no objects (t=0). Eq. (17) applies to linear captured data, prior to in-camera processing such as color-space conversion, gamma correction and compression. Therefore, I refers to the image obtained from the raw file after minimal processing such as demosaicing and black current subtraction, as disclosed by Akkaynak, D., Treibitz, T., Xiao, B., Gürkan, U. A., Allen, J. J., Demirci, U., Hanlon, R. T.: Use of commercial off-the-shelf digital cameras for scientific data acquisition and scene-specific color calibration. JOSA A 31(2) (2014) 312-321; and in Sumner, R.: Processing raw images in matlab. https://users.soe.ucsc.edu/rcsumner/rawguide/RAWguide.pdf (2014), last viewed Apr. 4, 2016.
- The transmission depends on object distance z and the water attenuation coefficient for each channel βc:
-
t_c = exp(−β_c z). Eq. (18) - Under water, the attenuation of red colors may be an order of magnitude larger than the attenuation of blue and green, as observed, for example, by Mobley, C. D.: Light and water: radiative transfer in natural waters. Academic press (1994). Therefore, as opposed to the common assumption in single image dehazing, the transmission t is wavelength-dependent.
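- The image formation model of Eqs. (17) and (18) may be illustrated per pixel. The following is a minimal sketch: the function name `observe_pixel`, the scene color, the veiling-light and the attenuation coefficients are all hypothetical values chosen only so that red attenuates much faster than blue. They are not measurements from the present experiments.

```python
import math

def observe_pixel(J, A, beta, z):
    """Apply Eq. (17), using the per-channel transmission of Eq. (18)."""
    observed = []
    for J_c, A_c, beta_c in zip(J, A, beta):
        t_c = math.exp(-beta_c * z)            # Eq. (18)
        observed.append(t_c * J_c + (1.0 - t_c) * A_c)  # Eq. (17)
    return observed

# Hypothetical values, in (R, G, B) order.
J = (0.8, 0.5, 0.3)        # object color without the water medium
A = (0.1, 0.5, 0.7)        # global veiling-light
beta = (0.60, 0.08, 0.05)  # attenuation coefficients per unit distance

near = observe_pixel(J, A, beta, z=0.0)   # no medium: the object color itself
mid = observe_pixel(J, A, beta, z=2.0)    # red already strongly attenuated
far = observe_pixel(J, A, beta, z=50.0)   # red is essentially veiling-light
```

At z=0 the observed color equals J, and as z grows each channel drifts toward A at its own rate, which is why the clusters of a single object color form curves rather than straight haze-lines under water.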
- Jerlov, N. G.: Marine optics. Volume 14. Elsevier (1976) developed a frequently used classification scheme for oceanic waters, based on water clarity. The Jerlov water types are I, IA, IB, II and III for open ocean waters, and 1 through 9 for coastal waters. Type I is the clearest and type III is the most turbid open ocean water. Likewise, for coastal waters,
type 1 is clearest and type 9 is most turbid. FIG. 11 (Left) depicts the attenuation coefficients of Jerlov water types; the figure was adapted from data in Mobley et al. (1994), Id., based on measurements in Austin, R., Petzold, T.: Spectral dependence of the diffuse attenuation coefficient of light in ocean waters. Optical Engineering 25(3) (1986) 253471-253471. Solid lines mark open ocean waters while dashed lines mark coastal waters. - When capturing an image using a commercial camera, three color channels R,G,B are obtained. Thus, we are interested in three attenuation coefficients: (βR,βG,βB). We show below that the three attenuation coefficients themselves are not required for transmission estimation, but rather their ratios (two variables).
FIG. 11 (Right) shows the ratios of the attenuation coefficients, βB/βR vs. βB/βG, of Jerlov water types for wavelengths of peak camera sensitivity according to Jiang, J., Liu, D., Gu, J., Susstrunk, S.: What is the space of spectral sensitivity functions for digital color cameras? In: Proc. IEEE Workshop Applications of Computer Vision (WACV). (2013) 168-179. -
FIG. 12 demonstrates the effect of scattering medium on observed scene colors. -
FIG. 12a shows a scene with four different clusters of similar colors J marked. Pixels of a haze-free color image are clustered using K-means. Pixels belonging to four of the clusters are marked. Note that the pixels are non-local and are spread all over the image plane. -
FIG. 12b shows the same pixels in RGB space, with colors of the clusters corresponding to the highlighted pixels in FIG. 12a. -
FIG. 12c shows the same scene with added synthetic haze. The same clustered pixels are marked, but their observed colors are affected by different amounts of haze. -
FIG. 12e shows the scene as if it was captured under water. -
FIGS. 12d and 12f show the corresponding pixels in RGB space for haze and underwater, respectively. In FIG. 12d, the hazy pixels are distributed along haze-lines passing through the airlight, marked in black. In FIG. 12f, the cluster pixels are distributed along curves that do not coincide with the straight lines spanned by the original color and the veiling-light (the haze-lines), due to the wavelength-dependent attenuation. In haze, the attenuation is mostly wavelength independent and the tight color clusters become haze-lines. However, under water, the attenuation is wavelength-dependent, and the clusters become curves. - We first show that the absolute values of the attenuation coefficients are not required for recovery. Instead, we show how to reconstruct the scene using only two global ratios between the attenuation coefficients. Then, we show how to estimate the ratios of the attenuation coefficients from the image itself.
- We modify the non-local single image dehazing technique discussed above, to take into account different attenuation coefficients for the different color channels.
- Given attenuation ratios, we convert the input image into a medium compensated image where all three channels have the same attenuation coefficient. Then we apply the present dehazing technique (or a different one known in the art) to solve the problem.
- We assume Ac is extracted from a patch in the image.
- Combining and rearranging Eqs. (17) and (18) yields for the blue channel:
-
A_B − I_B = e^{−β_B z}·(A_B − J_B), Eq. (19) - and the same for the red channel:
-
A_R − I_R = e^{−β_R z}·(A_R − J_R). Eq. (20) - Raising Eq. (20) to the power of β_B/β_R yields
-
(A_R − I_R)^{β_B/β_R} = e^{−β_B z}·(A_R − J_R)^{β_B/β_R}. Eq. (21) - Denote the ratios between the attenuation coefficients:
-
β_BR = β_B/β_R, β_BG = β_B/β_G. Eq. (22) - Then, in this medium-compensated space we achieve a form similar to Eq. (1), with one unknown transmission per-pixel, common to all color channels:
-
[(I_R − A_R)^{β_BR}, (I_G − A_G)^{β_BG}, (I_B − A_B)] = t_B·[(J_R − A_R)^{β_BR}, (J_G − A_G)^{β_BG}, (J_B − A_B)], Eq. (23) - This form is similar to the haze-lines formulation. We expect to find haze-lines in the medium-compensated space, where the transmission of the blue channel spans the haze-lines.
- Scene Recovery: Once tB is estimated, we may compensate for the color attenuation using the following:
-
J_c = A_c + (I_c − A_c)/e^{−β_c z} = A_c + (I_c − A_c)/t_B^{β_c/β_B}, Eq. (24) - where c∈{R,G,B}.
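- The recovery step may be illustrated per pixel. Inverting Eq. (17) gives J_c = A_c + (I_c − A_c)/t_c, and Eqs. (18) and (22) give t_c = t_B^(β_c/β_B), i.e., an exponent of 1/β_BR for red, 1/β_BG for green and 1 for blue. The following is a minimal sketch under those relations; the function name `restore_pixel` and all numeric values are hypothetical, and the blue-channel transmission is taken as known rather than estimated.

```python
import math

def restore_pixel(I, A, t_B, beta_BR, beta_BG):
    """Invert Eq. (17) per channel, given the blue-channel transmission t_B."""
    # t_c = t_B ** (beta_c / beta_B) = t_B ** (1 / beta_Bc); the blue exponent is 1.
    exponents = (1.0 / beta_BR, 1.0 / beta_BG, 1.0)
    restored = []
    for I_c, A_c, e in zip(I, A, exponents):
        t_c = t_B ** e
        restored.append(A_c + (I_c - A_c) / t_c)
    return restored

# Round trip on hypothetical values, in (R, G, B) order.
J = (0.8, 0.5, 0.3)
A = (0.1, 0.5, 0.7)
beta = (0.60, 0.08, 0.05)
z = 3.0

# Forward model of Eqs. (17) and (18).
I = [math.exp(-b * z) * j + (1.0 - math.exp(-b * z)) * a
     for j, a, b in zip(J, A, beta)]

t_B = math.exp(-beta[2] * z)
J_hat = restore_pixel(I, A, t_B, beta[2] / beta[0], beta[2] / beta[1])
```

In this noise-free round trip J_hat reproduces J up to floating-point error. With real data the division by a small red-channel transmission amplifies noise, so distant pixels are expected to be less reliable.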
- Eq. (24) compensates for the intensity changes that happen in the path between the object and the camera. In addition, the ambient illumination is attenuated by the water column from the surface to the imaging depth, resulting in a colored (bluish) global illumination. We are interested in restoring the colors as when they were viewed under white light. Since this effect is global in the scene, we correct it by performing a global white balance on the result. This global white balance works well only because the distance-dependent attenuation and scattering effects have already been compensated for. Otherwise, as demonstrated in
FIG. 14, it has little effect. That is, the contrast in the image may not be uniform and farther objects may have a reduced contrast. - Finally, since Eq. (17) applies to the linear captured data, we convert the linear image to sRGB using a standard image processing pipeline, including color-space conversion from the sensor-specific to a standard sRGB, and a gamma curve as in Sumner, R.: Processing raw images in matlab. https://users.soe.ucsc.edu/rcsumner/rawguide/RAWguide.pdf (2014), last viewed Apr. 4, 2016.
- We have shown that accounting for color-dependent attenuation requires only two additional global parameters. Next we show how to estimate them automatically.
- Using the wrong coefficients results in reconstructions that are color skewed. We use this insight to search for the most appropriate water type. We perform the restoration multiple times using different attenuation coefficients corresponding to different water types, and choose the best result automatically based on a variant of the gray world assumption.
-
FIG. 11 (right) shows the approximate attenuation coefficient ratios βBG,βBR, calculated for different Jerlov water types (FIG. 11a). For each color channel, we take the attenuation at the peak wavelength of typical camera sensitivity responses based on Jiang et al. (2013), Id.: 475 nm, 525 nm and 600 nm for B, G, R, respectively. These values are a mere approximation, since they are based on a single wavelength, while cameras have a wideband response. However, our analysis shows this camera-independent approximation works well in practice: taking into account a wideband response did not yield a noticeable difference in the restoration. - According to the Gray-World assumption of Lu, H., Li, Y., Serikawa, S., Underwater image enhancement using guided trigonometric bilateral filter and fast automatic color correction. In: Image Processing (ICIP), 2013 20th IEEE International Conference on. (September 2013) 3412-3416, the average reflectance of surfaces in the world is achromatic. It has been used in the past for estimating attenuation coefficients underwater using known distances, such as by Bryson, M., Johnson-Roberson, M., Pizarro, O., Williams, S. B.: Colour-consistent structure-from-motion models using underwater imagery. In: Robotics: Science and Systems, Citeseer (2012) 1-8. A significant portion of images taken under water often contains water without any objects. The Gray-World assumption obviously does not hold there. Therefore, we apply the Gray-World assumption only at image regions that contain objects, i.e., those that were not identified as veiling-light pixels. Thus, among all results for different water types, we choose the image where the difference between the average values of the red, green, and blue channels is the smallest.
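- The selection among candidate restorations may be sketched as follows. The text does not specify how the difference between the three channel averages is measured; this sketch assumes the spread (maximum minus minimum) of the mean R, G, B values over non-veiling-light pixels, and the function names and pixel values are hypothetical.

```python
def gray_world_score(pixels, is_veiling):
    """Spread between the mean R, G, B values over non-veiling-light pixels.

    Assumption: the difference between channel averages is max(mean) - min(mean).
    """
    kept = [p for p, v in zip(pixels, is_veiling) if not v]
    means = [sum(p[c] for p in kept) / len(kept) for c in range(3)]
    return max(means) - min(means)

def pick_most_gray(candidates, is_veiling):
    """Return the index of the candidate restoration that is closest to gray."""
    scores = [gray_world_score(img, is_veiling) for img in candidates]
    return scores.index(min(scores))

# Two hypothetical restorations of the same three pixels (flattened image);
# the third pixel was identified as veiling-light and is ignored.
is_veiling = [False, False, True]
restored_type_a = [(0.2, 0.5, 0.6), (0.3, 0.5, 0.7), (0.0, 0.4, 0.9)]
restored_type_b = [(0.4, 0.5, 0.45), (0.5, 0.45, 0.5), (0.0, 0.4, 0.9)]

best = pick_most_gray([restored_type_a, restored_type_b], is_veiling)
```

Here the second candidate is selected, since its channel means over the object pixels are nearly equal while the first retains a blue cast.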
- We considered several other measures such as maximal contrast (such as in Tan (2008)), Gray-World assumption on all three color channels and the maximal eigen-value of the RGB-histogram (looking for a similar color distribution among channels). We found that a simple Gray world assumption on non-veiling pixels gave the best results and therefore we focus on this measure.
- The present restoration technique is summarized in Algorithm 3 below.
-
Algorithm 3: Underwater image restoration.
Input: I(x), A
Output: Ĵ(x), t̂(x)
1: for each (βBR, βBG) values of water types (FIG. 11 right) do
2:   for each c ∈ {R, G, B} do
3:     Ĩc(x) = sign(Ic(x) − Ac) · abs(Ic(x) − Ac)^βBc (see definition of βBc in Eq. (22))
4:   Convert Ĩc to spherical coordinates to obtain [r(x), ϕ(x), θ(x)]
5:   Cluster the pixels according to [ϕ(x), θ(x)]. Each cluster H is a haze-line.
6:   for each cluster H do
7:     Estimate maximum radius: r̂max(x) = max_{x∈H} {r(x)}
8:   for each pixel x do
9:     Estimate transmission: t̃(x) = r(x) / r̂max(x)
10:  Perform regularization by calculating t̂(x) that minimizes Eq. (15)
11:  Calculate the restored image using Eq. (24)
12:  Perform a global WB on the restored image
13:  Calculate the mean colors of non-veiling-light pixels
14: Return the image with the mean colors closest to gray
- Optionally, we choose and return the image that best conforms to the Gray-World assumption, on non-veiling-light pixels. Optionally, other methods may be used for computing a parameter from the image data and choosing one of the images based on that parameter. For example, parameters may be standard deviations of one or more color channel values, other statistical values of the RGB color values, and/or the like.
- We first discuss implementation details of the underwater haze-lines variation. According to Eq. (23), we expect to find haze-lines in the medium-compensated space:
-
[(I_R(x) − A_R)^{β_BR}, (I_G(x) − A_G)^{β_BG}, (I_B(x) − A_B)]. Eq. (25) - The ratios denoted βBR and βBG are often fractions, and the ambient light denoted A may be larger than the acquired color I. In order to avoid numerical problems, we calculate the colors in the medium-compensated space as follows:
-
Ĩ_c(x) = sign(I_c(x) − A_c)·abs(I_c(x) − A_c)^{β_Bc}, Eq. (26) - We then cluster the points in the medium-compensated space into haze-lines. Due to the smaller variety of colors in the underwater environment, which stems partially from the narrower spectrum of illumination, we use only 500 points sampled uniformly on a sphere, in contrast to the 1000 sampled points in the experiments of the dehazing techniques discussed above.
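- The conversion to the medium-compensated space, as written in line 3 of Algorithm 3, may be sketched per pixel. The function name `medium_compensate` and the numeric values are hypothetical; the blue channel is given an exponent of 1, consistent with Eq. (25). The example checks the property that motivates the conversion: two observations of the same object color at different distances differ only by a common scale factor, i.e., they lie on one haze-line through the origin.

```python
import math

def medium_compensate(I, A, beta_BR, beta_BG):
    """Map one (R, G, B) pixel into the medium-compensated space."""
    exponents = (beta_BR, beta_BG, 1.0)  # the blue channel keeps exponent 1
    out = []
    for I_c, A_c, e in zip(I, A, exponents):
        diff = I_c - A_c
        # sign(diff) * |diff| ** e avoids a negative base under a fractional power.
        out.append(math.copysign(abs(diff) ** e, diff))
    return out

def observe(J, A, beta, z):
    """Forward model of Eqs. (17) and (18), used only to build test pixels."""
    return [math.exp(-b * z) * j + (1.0 - math.exp(-b * z)) * a
            for j, a, b in zip(J, A, beta)]

# Hypothetical values, in (R, G, B) order.
J = (0.8, 0.5, 0.3)
A = (0.1, 0.6, 0.7)
beta = (0.60, 0.08, 0.05)
beta_BR, beta_BG = beta[2] / beta[0], beta[2] / beta[1]

c_near = medium_compensate(observe(J, A, beta, 1.0), A, beta_BR, beta_BG)
c_far = medium_compensate(observe(J, A, beta, 3.0), A, beta_BR, beta_BG)

# Per-channel ratio; equal across channels when the two points are collinear.
ratios = [n / f for n, f in zip(c_near, c_far)]
```

All three ratios equal the ratio of the blue-channel transmissions at the two distances, which is the sense in which the blue transmission spans the haze-lines.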
- Once the haze-lines are obtained, we calculate the transmission of each pixel according to the ratio between its distance to the veiling-light and the distance of the most distant pixel in that haze-line. While in air it is somewhat reasonable to assume there is an almost haze-free pixel in each haze-line, under water this assumption does not hold. Even scene points that are located at a distance of one meter from the camera have a transmission of about 0.9 in the blue channel, depending on water type. Therefore, we multiply the initial transmission estimation by 0.9 even before the regularization.
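- The initial transmission estimate described above may be sketched as follows, before any regularization. The function name `initial_transmission`, the radii and the haze-line labels are hypothetical, and all radii are assumed to be positive.

```python
from collections import defaultdict

def initial_transmission(radii, line_ids, scale=0.9):
    """Per-pixel estimate: r(x) / r_max of its haze-line, multiplied by `scale`.

    radii    -- distance of each pixel from the veiling-light (assumed > 0)
    line_ids -- haze-line (cluster) label of each pixel
    scale    -- 0.9, since even the farthest pixel of a line is not haze-free
    """
    r_max = defaultdict(float)
    for r, h in zip(radii, line_ids):
        r_max[h] = max(r_max[h], r)
    return [scale * r / r_max[h] for r, h in zip(radii, line_ids)]

# Four hypothetical pixels belonging to two haze-lines.
radii = [0.2, 0.4, 0.1, 0.5]
line_ids = [0, 0, 1, 1]
t0 = initial_transmission(radii, line_ids)
```

The pixel with the largest radius in each haze-line receives a transmission of 0.9 rather than 1, and the other pixels of that line are scaled proportionally.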
- We found the underwater data to be noisier than haze images. Therefore, we set
-
- in the regularization term to be 1 when the haze line has more than 50 pixels and when the radius of the pixels at i is larger than 0.1.
- We used raw images taken with a Canon 5DII and a Nikon D810 in three different locations, in tropical waters and in murkier coastal water. Two different color charts were placed in the scenes for verification: one, based on the X-rite color checker (by X-Rite Inc., Michigan, USA), and the second is QPcard-202 (by QPcard AB, Sweden), both encased for water protection, with matte coating.
- The color charts are used only for validation. During the transmission estimation, we masked out the color charts, in order to estimate the transmission based on natural scene objects alone. The transmission of those pixels is determined during the regularization step based on neighboring values.
-
FIG. 13 demonstrates the importance of choosing the correct water type. Using an incorrect value leads to an incorrect transmission estimation. As a result, some areas in the image may be under- or over-compensated in the restoration process. FIG. 13a shows an input image with X-rite color checker, and two different outputs (FIGS. 13b and 13c) for two water types, including a zoom-in on the color-checker. The color-checker is used only for validation of the restoration. Direct comparison of color-checker values requires that both images are captured under the same global illumination conditions. Since the global illumination in our case is unknown, we measure the angle θ in RGB space between the color of the gray-level patches in our result and the direction of a vector with equal color components [1,1,1]. In a perfect restoration θ=0 and cos(θ)=1. The correct water-type achieves a value of cos(θ)=0.99 while the incorrect achieves a value of cos(θ)=0.96. - In
FIG. 13a, an image is restored using two different water types (different βBR,βBG values), shown in FIGS. 13b and 13c, respectively. Using an incorrect water type leads to incorrect colors of the restored image, as shown both qualitatively and quantitatively by the zoom-in on the color chart marked by a yellow rectangle. Qualitatively: the rightmost chart shows a pink hue. Quantitatively: We measured the angles in RGB space between the gray level patches and the gray direction [1,1,1] (one angle per patch). The median angle of the patches is presented below the color charts (cos(θ)=1 is a perfect restoration). The correct values indicate water type 3 (coastal waters), while the incorrect values are for open ocean waters. -
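- The validation measure may be sketched as follows: the cosine of the angle between the mean color of each gray-level patch and the gray direction [1,1,1], followed by the median over patches. The function names and the patch values are hypothetical.

```python
import math
from statistics import median

def cos_to_gray(rgb):
    """Cosine of the angle between an RGB color and the direction [1, 1, 1]."""
    dot = sum(rgb)  # dot product with [1, 1, 1]
    norm = math.sqrt(sum(c * c for c in rgb))
    return dot / (norm * math.sqrt(3.0))

def median_gray_cosine(patch_means):
    """Median cosine over the gray-level patches of one color chart."""
    return median(cos_to_gray(p) for p in patch_means)

# Hypothetical mean colors of three gray-level patches after restoration.
patches = [(0.20, 0.21, 0.20), (0.50, 0.48, 0.52), (0.80, 0.80, 0.79)]
score = median_gray_cosine(patches)
```

The measure is invariant to the brightness of a patch, so it does not require the restored image and a reference to share the same global illumination level.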
TABLE 6
Each color chart has six or seven gray level patches. We sample the average value of each patch and calculate the cosine of angles between the patch color and the vector [1, 1, 1]. The median result is presented in the table. An ideal restoration would yield 1.
                        Frames (FIG. 14)       Pier (FIG. 14)         Rocks (FIG. 15)
Method                  Macbeth  QPcard-202    Macbeth  QPcard-202    Macbeth  QPcard-202
Haze-Lines              0.806    0.806         0.920    0.947         0.997    0.984
UDCP                    0.724    0.788         0.577    0.782         0.946    0.815
WCID                    0.782    0.805         0.715    0.952         0.930    0.807
The present technique   0.980    0.981         0.923    0.959         0.999    0.997
- We present comparative results of the present restoration technique against the following single underwater image restoration methods: a naive white-balance and contrast stretching, and two methods based on a dark channel prior, namely UDCP (Drews, P., Nascimento, E., Moraes, F., Botelho, S., Campos, M.: Transmission estimation in underwater single images. In: Proc. IEEE ICCV Underwater Vision Workshop. (2013) 825-830) and WCID (Chiang, J. Y., Chen, Y. C.: Underwater image enhancement by wavelength compensation and dehazing. IEEE Trans. Image Processing 21(4) (2012) 1756-1769). In addition, we include the result of the present dehazing technique (denoted Haze-Lines) as a baseline.
- Each of these two papers suggests a different method for choosing the veiling-light: in WCID it is chosen as the brightest pixel value among all local minima in a small neighborhood, while in UDCP it is estimated by finding the brightest pixel in the underwater dark channel I_dark^UDCP(x) = min_{y∈Ω(x)} [min_{c∈{G,B}} (I_c(y))]. We manually extract A by averaging a patch in the image, since we find the suggested methods often find bright sand pixels as the veiling-light. The top row of
FIG. 14 shows the pixel chosen as veiling-light by WCID marked by a red cross, the one chosen by UDCP marked by an orange plus, and the rectangle chosen manually marked in yellow. Using the same veiling-light value for all methods resulted in artifacts, hence we show the best result for each method, with different veiling-light values (the values are given in the supplementary material). -
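- The underwater dark channel used by UDCP for veiling-light selection, as defined above, may be sketched as follows. This is an illustration of the stated formula only, not the implementation of Drews et al.; the function names, the square patch Ω(x) of a given radius and the tiny example image are hypothetical.

```python
def udcp_dark_channel(image, radius=1):
    """Minimum of the G and B channels over a square patch around each pixel.

    image -- 2D list (rows) of (R, G, B) tuples
    """
    h, w = len(image), len(image[0])
    per_pixel = [[min(px[1], px[2]) for px in row] for row in image]
    dark = [[0.0] * w for _ in range(h)]
    for y in range(h):
        for x in range(w):
            rows = range(max(0, y - radius), min(h, y + radius + 1))
            cols = range(max(0, x - radius), min(w, x + radius + 1))
            dark[y][x] = min(per_pixel[j][i] for j in rows for i in cols)
    return dark

def brightest_dark_pixel(dark):
    """Coordinates (y, x) of the largest dark-channel value."""
    return max(((y, x) for y in range(len(dark)) for x in range(len(dark[0]))),
               key=lambda p: dark[p[0]][p[1]])

# Hypothetical 1x3 image.
image = [[(0.9, 0.2, 0.3), (0.1, 0.8, 0.7), (0.5, 0.6, 0.9)]]
dark = udcp_dark_channel(image, radius=1)
y, x = brightest_dark_pixel(dark)
veiling_light_candidate = image[y][x]
```

Because the red channel is excluded, a bright pixel with high green and blue values, such as sand near the camera, can score highly, which is consistent with the failure mode noted above.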
FIG. 14 compares prominent single underwater image restoration methods. - Applying a naive contrast enhancement is not enough, since the contrast degradation caused by the medium is spatially non-uniform. This is evident in the left column, where the farther Barracudas are almost indistinguishable from the background, and in the middle column (Frames), where the structure in the back is hardly noticeable.
- In
FIG. 14, the top row shows the original input to the restoration methods, with veiling-light estimation: using UDCP in orange, using WCID in red and averaging over a patch in yellow for Haze-lines and the proposed method. The rest of the rows, from top to bottom, show the output of a global contrast enhancement of each channel (which affects the white-balance), Haze-lines, UDCP, WCID, and the present restoration technique. - The methods Haze-Lines, UDCP, and WCID do not restore the color of the sand in the foreground of Frames and Pier correctly, as some areas have a blue-green color-cast while others do not. This phenomenon is an indication of an incorrect wavelength-dependent correction, not a global white balance problem. The red color is attenuated much more than the blue and green, and is not amplified enough by these methods. The present restoration technique is able to compensate for the distance-dependent attenuation. For example, the Barracudas all have similar colors in the output image, regardless of their original distance. Similarly, the sand in the foreground of Frames has a uniform color.
- In addition to the qualitative comparison of the images, we used color charts as a quantitative measure. The scenes Frames, Pier and Rocks contain two different color charts, at two different distances from the camera, in order to validate the quality of the restoration. The median cosines of the angles between the gray-level patches and the direction [1,1,1] are summarized in Table 6. The present restoration technique outperforms the other methods.
-
FIG. 15 shows an image along with the transmission maps estimated in the process for three methods: UDCP, WCID, and the present restoration technique. Due to the unconstrained nature of the problem, a prior must be used to estimate the transmission. Both UDCP and WCID are based on the dark channel prior, and estimate the transmission according to: -
- where in WCID the outer minimization is carried over c∈{R,G,B}, and in UDCP over c∈{G,B}.
-
FIGS. 15I .a to 15II.e show the value of the dark channel -
- per pixel x. In these particular cases, the dark channel assumption does not hold. The top of the scenes has no objects, and therefore their transmission should tend to zero. However, they are relatively dark and according to the prior contain no haze. The bright sand in the foreground has a significant value in all color channels, and therefore is estimated to contain veiling-light by the prior. In contrast, the non-local prior is able to distinguish the foreground sand from the background. The results shown in
FIGS. 15I.b, 15I.c, 15II.b, and 15II.c are not restored properly due to the wrong estimation of the transmission. The prior of the present restoration technique produces better results (FIG. 5d). For example, FIG. 15I.d shows that the color of the Sea Goldies fish is restored much better than the other methods. -
FIG. 15I.a shows the input image. FIGS. 15I.b thru 15I.d show the output of three restoration methods: UDCP, WCID, and the present restoration technique. FIG. 15I.e shows a step in the transmission calculation using the dark channel prior: -
-
FIGS. 15I.f thru 15I.h show the transmission maps estimated during the restoration process, corresponding to FIGS. 15I.b thru 15I.d, color-mapped such that warm colors indicate a high transmission, and cold colors indicate a low transmission. For the present restoration technique, we show the transmission of the red channel. -
FIG. 16 compares our result to methods proposed in Carlevaris-Bianco, N., Mohan, A., Eustice, R. M.: Initial results in underwater single image dehazing. In: IEEE OCEANS. (2010) 1-8; and Peng, Y. T., Zhao, X., Cosman, P. C.: Single underwater image enhancement using depth estimation based on blurriness. In: Image Processing (ICIP), 2015 IEEE International Conference on. (2015) 4952-4956. We did not have the raw files from these two papers, so we used the processed image as input to our technique. JPEG compression artifacts are amplified by the restoration method, however the present restoration technique removes the blue hue completely from the farther corals, unlike previously proposed methods. In this figure, 16 a is the input image, 16 b-d show the output of three restoration methods: Carlevaris-Bianco et al. (2010), Id., Peng et al. (2015), Id., and the present restoration technique, respectively. - The present invention may be a system, a method, and/or a computer program product. The computer program product may include a computer readable storage medium (or media) having computer readable program instructions thereon for causing a hardware processor to carry out aspects of the present invention.
- The computer readable storage medium may be a tangible device that may retain and store instructions for use by an instruction execution device. The computer readable storage medium may be, for example, but is not limited to, an electronic storage device, a magnetic storage device, an optical storage device, an electromagnetic storage device, a semiconductor storage device, or any suitable combination of the foregoing. A non-exhaustive list of more specific examples of the computer readable storage medium includes the following: a portable computer diskette, a hard disk, a random access memory (RAM), a read-only memory (ROM), an erasable programmable read-only memory (EPROM or Flash memory), a static random access memory (SRAM), a portable compact disc read-only memory (CD-ROM), a digital versatile disk (DVD), a memory stick, a floppy disk, a mechanically encoded device having instructions recorded thereon, and any suitable combination of the foregoing.
- A computer readable storage medium, as used herein, is not to be construed as being transitory signals per se, such as radio waves or other freely propagating electromagnetic waves, electromagnetic waves propagating through a waveguide or other transmission media (e.g., light pulses passing through a fiber-optic cable), or electrical signals transmitted through a wire. Rather, the computer readable storage medium is a non-transient (i.e., not-volatile) medium.
- Computer readable program instructions described herein may be downloaded to respective computing/processing devices (which comprise hardware processor) from a computer readable storage medium or to an external computer or external storage device via a network, for example, the Internet, a local area network, a wide area network and/or a wireless network. The network may comprise copper transmission cables, optical transmission fibers, wireless transmission, routers, firewalls, switches, gateway computers and/or edge servers. A network adapter card or network interface in each computing/processing device receives computer readable program instructions from the network and forwards the computer readable program instructions for storage in a computer readable storage medium within the respective computing/processing device.
- Computer readable program instructions for carrying out operations of the present invention may be assembler instructions, instruction-set-architecture (ISA) instructions, machine instructions, machine dependent instructions, microcode, firmware instructions, state-setting data, or either source code or object code written in any combination of one or more programming languages, including an object oriented programming language such as Java, Smalltalk, C++ or the like, and conventional procedural programming languages, such as the “C” programming language or similar programming languages. The computer readable program instructions may execute entirely on the user's computer, partly on the user's computer, as a stand-alone software package, partly on the user's computer and partly on a remote computer or entirely on the remote computer or server. In the latter scenario, the remote computer may be connected to the user's computer through any type of network, including a local area network (LAN) or a wide area network (WAN), or the connection may be made to an external computer (for example, through the Internet using an Internet Service Provider).
- In some embodiments, a hardware processor that is, for example, a microprocessor, programmable logic circuitry, a field-programmable gate array (FPGA), or a programmable logic array (PLA), may execute the computer readable program instructions by utilizing state information of the computer readable program instructions to personalize the hardware processor, in order to perform aspects of the present invention.
- Aspects of the present invention are described herein with reference to flowchart illustrations and/or block diagrams of methods, apparatus (systems), and computer program products according to embodiments of the invention. It may be understood that each block of the flowchart illustrations and/or block diagrams, and combinations of blocks in the flowchart illustrations and/or block diagrams, may be implemented by computer readable program instructions.
- These specialized computer readable program instructions may be provided to a microprocessor of a general-purpose computer, special purpose computer, or other programmable data processing apparatus to produce a machine, such that the instructions, which execute via the microprocessor of the computer or other programmable data processing apparatus, create means for implementing the functions/acts specified in the flowchart and/or block diagram block or blocks. These computer readable program instructions may also be stored in a computer readable storage medium that may direct a computer, a programmable data processing apparatus, and/or other devices to function in a particular manner, such that the computer readable storage medium having instructions stored therein comprises an article of manufacture including instructions which implement aspects of the function/act specified in the flowchart and/or block diagram block or blocks.
- The computer readable program instructions may also be loaded onto a computer, other programmable data processing apparatus, or other device to cause a series of operational steps to be performed on the computer, other programmable apparatus or other device to produce a computer implemented process, such that the instructions which execute on the computer, other programmable apparatus, or other device implement the functions/acts specified in the flowchart and/or block diagram block or blocks.
- The flowchart and block diagrams in the Figures illustrate the architecture, functionality, and operation of possible implementations of systems, methods, and computer program products according to various embodiments of the present invention. In this regard, each block in the flowchart or block diagrams may represent a module, segment, or portion of instructions, which comprises one or more executable instructions for implementing the specified logical function(s). In some alternative implementations, the functions noted in the block may occur out of the order noted in the figures. For example, two blocks shown in succession may, in fact, be executed substantially concurrently, or the blocks may sometimes be executed in the reverse order, depending upon the functionality involved. It will also be noted that each block of the block diagrams and/or flowchart illustration, and combinations of blocks in the block diagrams and/or flowchart illustration, may be implemented by special purpose hardware-based systems that perform the specified functions or acts or carry out combinations of special purpose hardware and computer instructions.
- The descriptions of the various embodiments of the present invention have been presented for purposes of illustration, but are not intended to be exhaustive or limited to the embodiments disclosed. Many modifications and variations will be apparent to those of ordinary skill in the art without departing from the scope and spirit of the described embodiments. The terminology used herein was chosen to best explain the principles of the embodiments, the practical application or technical improvement over technologies found in the marketplace, or to enable others of ordinary skill in the art to understand the embodiments disclosed herein.
Claims (22)
Priority Applications (1)
Application Number | Priority Date | Filing Date | Title |
---|---|---|---|
US16/092,053 US10885611B2 (en) | 2016-04-07 | 2017-04-06 | Image dehazing and restoration |
Applications Claiming Priority (3)
Application Number | Priority Date | Filing Date | Title |
---|---|---|---|
US201662319338P | 2016-04-07 | 2016-04-07 | |
US16/092,053 US10885611B2 (en) | 2016-04-07 | 2017-04-06 | Image dehazing and restoration |
PCT/IL2017/050426 WO2017175231A1 (en) | 2016-04-07 | 2017-04-06 | Image dehazing and restoration |
Related Parent Applications (1)
Application Number | Priority Date | Filing Date | Title |
---|---|---|---|
PCT/IL2017/050426 A-371-Of-International WO2017175231A1 (en) | 2016-04-07 | 2017-04-06 | Image dehazing and restoration |
Related Child Applications (1)
Application Number | Priority Date | Filing Date | Title |
---|---|---|---|
US17/139,056 Continuation US11810272B2 (en) | 2016-04-07 | 2020-12-31 | Image dehazing and restoration |
Publications (2)
Publication Number | Publication Date |
---|---|
US20190114747A1 (en) | 2019-04-18 |
US10885611B2 (en) | 2021-01-05 |
Family ID: 60001581
Family Applications (2)
Application Number | Priority Date | Filing Date | Title |
---|---|---|---|
US16/092,053 Active 2037-10-29 US10885611B2 (en) | 2016-04-07 | 2017-04-06 | Image dehazing and restoration |
US17/139,056 Active 2037-11-27 US11810272B2 (en) | 2016-04-07 | 2020-12-31 | Image dehazing and restoration |
Family Applications After (1)
Application Number | Priority Date | Filing Date | Title |
---|---|---|---|
US17/139,056 Active 2037-11-27 US11810272B2 (en) | 2016-04-07 | 2020-12-31 | Image dehazing and restoration |
Country Status (4)
Country | Link |
---|---|
US (2) | US10885611B2 (en) |
EP (1) | EP3440627A4 (en) |
IL (2) | IL300998A (en) |
WO (1) | WO2017175231A1 (en) |
Cited By (28)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
US20190019313A1 (en) * | 2017-07-11 | 2019-01-17 | Datacolor Inc. | Color identification in images |
CN110070049A (en) * | 2019-04-23 | 2019-07-30 | 北京市商汤科技开发有限公司 | Facial image recognition method and device, electronic equipment and storage medium |
CN110335210A (en) * | 2019-06-11 | 2019-10-15 | 长江勘测规划设计研究有限责任公司 | Underwater image restoration method |
CN111325696A (en) * | 2020-03-03 | 2020-06-23 | 杭州瑞利海洋装备有限公司 | Underwater acoustic image reverberation suppression method based on normal distribution interval estimation |
CN111539896A (en) * | 2020-04-30 | 2020-08-14 | 华中科技大学 | Domain-adaptive-based image defogging method and system |
CN111553405A (en) * | 2020-04-24 | 2020-08-18 | 青岛杰瑞工控技术有限公司 | Clustering fog recognition algorithm based on pixel density K-means |
CN111598886A (en) * | 2020-05-25 | 2020-08-28 | 中国科学院长春光学精密机械与物理研究所 | Pixel-level transmittance estimation method based on single image |
CN111598812A (en) * | 2020-05-25 | 2020-08-28 | 中国科学院长春光学精密机械与物理研究所 | Image defogging method based on RGB and HSV double-color space |
CN111899198A (en) * | 2020-08-06 | 2020-11-06 | 北京科技大学 | Defogging method and device for marine image |
CN111915501A (en) * | 2020-01-17 | 2020-11-10 | 杭州瞳创医疗科技有限公司 | Brightness balancing method for fundus image |
CN111986119A (en) * | 2020-09-01 | 2020-11-24 | 安徽萤瞳科技有限公司 | Dark channel image brightness value interference filtering method and sea fog image sea fog removing method |
CN112132742A (en) * | 2020-08-28 | 2020-12-25 | 稿定(厦门)科技有限公司 | Particle-based adaptive halo image generation method and device |
CN112529841A (en) * | 2020-11-16 | 2021-03-19 | 中国海洋大学 | Method and system for processing seabed gas plume in multi-beam water column data and application |
CN112733914A (en) * | 2020-12-31 | 2021-04-30 | 大连海事大学 | Underwater target visual identification and classification method based on support vector machine |
CN112804510A (en) * | 2021-01-08 | 2021-05-14 | 海南省海洋与渔业科学院 | Color fidelity processing method and device for deep water image, storage medium and camera |
CN112874438A (en) * | 2021-03-01 | 2021-06-01 | 上海应用技术大学 | Real-time defogging display windshield device |
US11030731B2 (en) * | 2016-12-27 | 2021-06-08 | Zhejiang Dahua Technology Co., Ltd. | Systems and methods for fusing infrared image and visible light image |
CN112950504A (en) * | 2021-03-02 | 2021-06-11 | 山东鲁能软件技术有限公司智能电气分公司 | Power transmission line inspection haze weather monocular hidden danger object distance measurement method and system |
US20210319541A1 (en) * | 2018-09-06 | 2021-10-14 | Carmel Haifa University Economic Corporation Ltd. | Model-free physics-based reconstruction of images acquired in scattering media |
CN113516607A (en) * | 2021-04-23 | 2021-10-19 | Oppo广东移动通信有限公司 | Image processing method, image processing apparatus, electronic device, and storage medium |
CN113763488A (en) * | 2021-07-21 | 2021-12-07 | 广东工业大学 | Remote sensing image demisting degree method combining dark channel pre-inspection algorithm and U-Net |
CN114066780A (en) * | 2022-01-17 | 2022-02-18 | 广东欧谱曼迪科技有限公司 | 4k endoscope image defogging method and device, electronic equipment and storage medium |
US11270425B2 (en) * | 2018-11-15 | 2022-03-08 | Qualcomm Technologies, Inc. | Coordinate estimation on n-spheres with spherical regression |
CN114359103A (en) * | 2022-01-04 | 2022-04-15 | 中国电建集团中南勘测设计研究院有限公司 | Hyperspectral image defogging method and device, computer product and storage medium |
CN114463211A (en) * | 2022-01-28 | 2022-05-10 | 华中科技大学 | Underwater image enhancement method based on turbidity classification |
CN114792294A (en) * | 2022-05-20 | 2022-07-26 | 陈恩依 | Underwater image color correction method based on attenuation coefficient |
WO2023086199A1 (en) * | 2021-11-12 | 2023-05-19 | Verily Life Sciences Llc | Dynamic smoke reduction in images from a surgical system |
CN117218033A (en) * | 2023-09-27 | 2023-12-12 | 仲恺农业工程学院 | Underwater image restoration method, device, equipment and medium |
Families Citing this family (36)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
US10051252B1 (en) | 2017-03-07 | 2018-08-14 | Filmic Inc. | Method of decaying chrominance in images |
CN108550120B (en) * | 2018-03-29 | 2020-03-27 | 青岛大学 | Method for restoring underwater image under variable frame |
CN108492274B (en) * | 2018-04-03 | 2020-08-07 | 中国人民解放军国防科技大学 | Long-wave infrared polarization feature extraction and fusion image enhancement method |
WO2020014531A1 (en) * | 2018-07-11 | 2020-01-16 | Raven Industries, Inc. | Adaptive color transformation to aid computer vision |
CN109410180B (en) * | 2018-09-30 | 2021-09-21 | 清华-伯克利深圳学院筹备办公室 | Attenuation coefficient determination method and device, computer equipment and storage medium |
CN109389569B (en) * | 2018-10-26 | 2021-06-04 | 大象智能科技(南京)有限公司 | Monitoring video real-time defogging method based on improved DehazeNet |
CN109636735B (en) * | 2018-11-02 | 2023-03-10 | 中国航空工业集团公司洛阳电光设备研究所 | Rapid video defogging method based on space-time consistency constraint |
CN109859129A (en) * | 2019-01-29 | 2019-06-07 | 哈工大机器人(岳阳)军民融合研究院 | A kind of underwater picture enhancing treating method and apparatus |
CN109934780A (en) * | 2019-02-21 | 2019-06-25 | 北京以萨技术股份有限公司 | A kind of traffic surveillance videos defogging method based on dark primary priori |
CN109934791A (en) * | 2019-04-02 | 2019-06-25 | 山东浪潮云信息技术有限公司 | A kind of image defogging method and system based on Style Transfer network |
CN110232666B (en) * | 2019-06-17 | 2020-04-28 | 中国矿业大学(北京) | Underground pipeline image rapid defogging method based on dark channel prior |
CN110298809B (en) * | 2019-07-08 | 2021-03-30 | 广东工业大学 | Image defogging method and device |
WO2021007554A1 (en) | 2019-07-11 | 2021-01-14 | Sneyders Yuri | Determining image feature height disparity |
CN111192213B (en) * | 2019-12-27 | 2023-11-14 | 浙江芯劢微电子股份有限公司 | Image defogging self-adaptive parameter calculation method, image defogging method and system |
CN111275645A (en) | 2020-01-20 | 2020-06-12 | 腾讯科技(深圳)有限公司 | Image defogging method, device and equipment based on artificial intelligence and storage medium |
CZ2020157A3 (en) * | 2020-03-20 | 2021-11-10 | Univerzita Hradec Králové | A method of processing a pre-processed image and the apparatus for this |
CN111510578B (en) * | 2020-03-31 | 2021-07-09 | 天津大学 | JPEG compressed image reconstruction method based on reinforcement learning |
CN111462022B (en) * | 2020-04-29 | 2022-11-01 | 青岛大学 | Underwater image sharpness enhancement method |
CN111681180B (en) * | 2020-05-25 | 2022-04-26 | 厦门大学 | Priori-driven deep learning image defogging method |
CN111754438B (en) * | 2020-06-24 | 2021-04-27 | 安徽理工大学 | Underwater image restoration model based on multi-branch gating fusion and restoration method thereof |
CN111833270B (en) * | 2020-07-13 | 2023-02-10 | 新疆大学 | Rapid sand-dust degradation image enhancement method |
CN112070683B (en) * | 2020-07-21 | 2024-03-12 | 西北工业大学 | Underwater polarized image restoration method based on polarization and wavelength attenuation combined optimization |
CN112183338B (en) * | 2020-09-28 | 2021-06-15 | 广东石油化工学院 | Video-based method, system and terminal for re-identifying people in smoke scene |
CN112419231A (en) * | 2020-10-15 | 2021-02-26 | 上海眼控科技股份有限公司 | Visibility determination method and device, computer equipment and storage medium |
CN112488943B (en) * | 2020-12-02 | 2024-02-02 | 北京字跳网络技术有限公司 | Model training and image defogging method, device and equipment |
CN112488955B (en) * | 2020-12-08 | 2023-07-14 | 大连海事大学 | Underwater image restoration method based on wavelength compensation |
US11528435B2 (en) | 2020-12-25 | 2022-12-13 | Industrial Technology Research Institute | Image dehazing method and image dehazing apparatus using the same |
CN113034391B (en) * | 2021-03-19 | 2023-08-08 | 西安电子科技大学 | Multi-mode fusion underwater image enhancement method, system and application |
CN113344802B (en) * | 2021-04-19 | 2024-08-20 | 大连海事大学 | Underwater image restoration method based on self-adaptive atmosphere light fusion |
CN113344830B (en) * | 2021-05-10 | 2024-06-21 | 深圳瀚维智能医疗科技有限公司 | Fusion method and device based on multiple single-channel temperature pictures |
CN113837971B (en) * | 2021-09-30 | 2023-08-04 | 重庆邮电大学 | Image defogging method based on dark channel and fractional order multi-transformation regularization |
KR102577361B1 (en) * | 2021-11-12 | 2023-09-11 | 중앙대학교 산학협력단 | Method and apparatus for image dehazing via complementary adversarial learning |
US11803942B2 (en) | 2021-11-19 | 2023-10-31 | Stmicroelectronics (Research & Development) Limited | Blended gray image enhancement |
CN113989164B (en) * | 2021-11-24 | 2024-04-09 | 河海大学常州校区 | Underwater color image restoration method, system and storage medium |
CN114549342B (en) * | 2022-01-13 | 2024-08-02 | 河南师范大学 | Restoration method for underwater image |
CN118446913A (en) * | 2024-04-12 | 2024-08-06 | 北京科技大学 | Underwater image enhancement and deblurring method and system based on depth iteration |
Family Cites Families (8)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
US6687414B1 (en) * | 1999-08-20 | 2004-02-03 | Eastman Kodak Company | Method and system for normalizing a plurality of signals having a shared component |
KR100378351B1 (en) * | 2000-11-13 | 2003-03-29 | 삼성전자주식회사 | Method and apparatus for measuring color-texture distance, and method and apparatus for sectioning image into a plurality of regions using the measured color-texture distance |
US7929753B2 (en) * | 2006-11-29 | 2011-04-19 | Kwe International Inc. | Image processing including, but not limited to, logarithimic coding in color coordinate systems including systems with a coordinate defined by a square root of a quadratic polynomial in tristimulus values and, possibly, by a sign of a function of one or more of tristimulus values |
US8290294B2 (en) * | 2008-09-16 | 2012-10-16 | Microsoft Corporation | Dehazing an image using a three-dimensional reference model |
US8350933B2 (en) * | 2009-04-08 | 2013-01-08 | Yissum Research Development Company Of The Hebrew University Of Jerusalem, Ltd. | Method, apparatus and computer program product for single image de-hazing |
WO2015125146A1 (en) | 2014-02-19 | 2015-08-27 | Yissum Research Development Company Of The Hebrew University Of Jerusalem Ltd. | Method and system for dehazing natural images using color-lines |
US20170132771A1 (en) | 2014-06-13 | 2017-05-11 | Board Of Regents Of The University Of Texas System | Systems and methods for automated hierarchical image representation and haze removal |
CN104504658A (en) * | 2014-12-15 | 2015-04-08 | 中国科学院深圳先进技术研究院 | Single image defogging method and device on basis of BP (Back Propagation) neural network |
2017
- 2017-04-06 US US16/092,053 (US10885611B2): Active
- 2017-04-06 WO PCT/IL2017/050426 (WO2017175231A1): Application Filing
- 2017-04-06 EP EP17778801.5A (EP3440627A4): Pending
- 2017-04-06 IL IL300998A (IL300998A): unknown
2018
- 2018-10-07 IL IL262175A (IL262175B2): unknown
2020
- 2020-12-31 US US17/139,056 (US11810272B2): Active
Cited By (29)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
US11030731B2 (en) * | 2016-12-27 | 2021-06-08 | Zhejiang Dahua Technology Co., Ltd. | Systems and methods for fusing infrared image and visible light image |
US20190019313A1 (en) * | 2017-07-11 | 2019-01-17 | Datacolor Inc. | Color identification in images |
US10692245B2 (en) * | 2017-07-11 | 2020-06-23 | Datacolor Inc. | Color identification in images |
US20210319541A1 (en) * | 2018-09-06 | 2021-10-14 | Carmel Haifa University Economic Corporation Ltd. | Model-free physics-based reconstruction of images acquired in scattering media |
US11270425B2 (en) * | 2018-11-15 | 2022-03-08 | Qualcomm Technologies, Inc. | Coordinate estimation on n-spheres with spherical regression |
CN110070049A (en) * | 2019-04-23 | 2019-07-30 | 北京市商汤科技开发有限公司 | Facial image recognition method and device, electronic equipment and storage medium |
CN110335210A (en) * | 2019-06-11 | 2019-10-15 | 长江勘测规划设计研究有限责任公司 | Underwater image restoration method |
CN111915501A (en) * | 2020-01-17 | 2020-11-10 | 杭州瞳创医疗科技有限公司 | Brightness balancing method for fundus image |
CN111325696A (en) * | 2020-03-03 | 2020-06-23 | 杭州瑞利海洋装备有限公司 | Underwater acoustic image reverberation suppression method based on normal distribution interval estimation |
CN111553405A (en) * | 2020-04-24 | 2020-08-18 | 青岛杰瑞工控技术有限公司 | Clustering fog recognition algorithm based on pixel density K-means |
CN111539896A (en) * | 2020-04-30 | 2020-08-14 | 华中科技大学 | Domain-adaptive-based image defogging method and system |
CN111598812A (en) * | 2020-05-25 | 2020-08-28 | 中国科学院长春光学精密机械与物理研究所 | Image defogging method based on RGB and HSV double-color space |
CN111598886A (en) * | 2020-05-25 | 2020-08-28 | 中国科学院长春光学精密机械与物理研究所 | Pixel-level transmittance estimation method based on single image |
CN111899198A (en) * | 2020-08-06 | 2020-11-06 | 北京科技大学 | Defogging method and device for marine image |
CN112132742A (en) * | 2020-08-28 | 2020-12-25 | 稿定(厦门)科技有限公司 | Particle-based adaptive halo image generation method and device |
CN111986119A (en) * | 2020-09-01 | 2020-11-24 | 安徽萤瞳科技有限公司 | Dark channel image brightness value interference filtering method and sea fog image sea fog removing method |
CN112529841A (en) * | 2020-11-16 | 2021-03-19 | 中国海洋大学 | Method and system for processing seabed gas plume in multi-beam water column data and application |
CN112733914A (en) * | 2020-12-31 | 2021-04-30 | 大连海事大学 | Underwater target visual identification and classification method based on support vector machine |
CN112804510A (en) * | 2021-01-08 | 2021-05-14 | 海南省海洋与渔业科学院 | Color fidelity processing method and device for deep water image, storage medium and camera |
CN112874438A (en) * | 2021-03-01 | 2021-06-01 | 上海应用技术大学 | Real-time defogging display windshield device |
CN112950504A (en) * | 2021-03-02 | 2021-06-11 | 山东鲁能软件技术有限公司智能电气分公司 | Power transmission line inspection haze weather monocular hidden danger object distance measurement method and system |
CN113516607A (en) * | 2021-04-23 | 2021-10-19 | Oppo广东移动通信有限公司 | Image processing method, image processing apparatus, electronic device, and storage medium |
CN113763488A (en) * | 2021-07-21 | 2021-12-07 | 广东工业大学 | Remote sensing image demisting degree method combining dark channel pre-inspection algorithm and U-Net |
WO2023086199A1 (en) * | 2021-11-12 | 2023-05-19 | Verily Life Sciences Llc | Dynamic smoke reduction in images from a surgical system |
CN114359103A (en) * | 2022-01-04 | 2022-04-15 | 中国电建集团中南勘测设计研究院有限公司 | Hyperspectral image defogging method and device, computer product and storage medium |
CN114066780A (en) * | 2022-01-17 | 2022-02-18 | 广东欧谱曼迪科技有限公司 | 4k endoscope image defogging method and device, electronic equipment and storage medium |
CN114463211A (en) * | 2022-01-28 | 2022-05-10 | 华中科技大学 | Underwater image enhancement method based on turbidity classification |
CN114792294A (en) * | 2022-05-20 | 2022-07-26 | 陈恩依 | Underwater image color correction method based on attenuation coefficient |
CN117218033A (en) * | 2023-09-27 | 2023-12-12 | 仲恺农业工程学院 | Underwater image restoration method, device, equipment and medium |
Also Published As
Publication number | Publication date |
---|---|
US20210201452A1 (en) | 2021-07-01 |
IL262175A (en) | 2018-11-29 |
US10885611B2 (en) | 2021-01-05 |
IL262175B1 (en) | 2023-03-01 |
EP3440627A4 (en) | 2019-12-04 |
EP3440627A1 (en) | 2019-02-13 |
IL262175B2 (en) | 2023-07-01 |
IL300998A (en) | 2023-04-01 |
WO2017175231A1 (en) | 2017-10-12 |
US11810272B2 (en) | 2023-11-07 |
Similar Documents
Publication | Title | Publication Date |
---|---|---|
US11810272B2 (en) | Image dehazing and restoration | |
Berman et al. | Single image dehazing using haze-lines | |
Berman et al. | Non-local image dehazing | |
Artusi et al. | A survey of specularity removal methods | |
Ancuti et al. | Color balance and fusion for underwater image enhancement | |
Krig | Computer vision metrics | |
Park et al. | Single image dehazing with image entropy and information fidelity | |
Finlayson et al. | Entropy minimization for shadow removal | |
Li et al. | A multi-scale fusion scheme based on haze-relevant features for single image dehazing | |
Imamoglu et al. | Hyperspectral image dataset for benchmarking on salient object detection | |
Xiong et al. | From pixels to physics: Probabilistic color de-rendering | |
US20160196637A1 (en) | Raw sensor image and video de-hazing and atmospheric light analysis methods and systems | |
Guo et al. | Image dehazing via enhancement, restoration, and fusion: A survey | |
Wang et al. | Specular reflection removal of ocean surface remote sensing images from UAVs | |
Besheer et al. | Modified invariant colour model for shadow detection | |
EP3973500A1 (en) | Physics-based recovery of lost colors in underwater and atmospheric images under wavelength dependent absorption and scattering | |
Gauci et al. | A Machine Learning approach for automatic land cover mapping from DSLR images over the Maltese Islands | |
Wang et al. | Multiscale single image dehazing based on adaptive wavelet fusion | |
Shen et al. | Image-matching enhancement using a polarized intensity-hue-saturation fusion method | |
Voronin et al. | A block-based method for the remote sensing images cloud detection and removal | |
Beigpour et al. | A comprehensive multi-illuminant dataset for benchmarking of the intrinsic image algorithms | |
Lee et al. | Joint defogging and demosaicking | |
Saxena et al. | An efficient single image haze removal algorithm for computer vision applications | |
Goud et al. | Evaluation of image fusion of multi focus images in spatial and frequency domain | |
Agarwal et al. | Specular reflection removal in cervigrams | |
Legal Events
Code | Title | Description |
---|---|---|
FEPP | Fee payment procedure | Free format text: ENTITY STATUS SET TO UNDISCOUNTED (ORIGINAL EVENT CODE: BIG.); ENTITY STATUS OF PATENT OWNER: SMALL ENTITY |
FEPP | Fee payment procedure | Free format text: ENTITY STATUS SET TO SMALL (ORIGINAL EVENT CODE: SMAL); ENTITY STATUS OF PATENT OWNER: SMALL ENTITY |
AS | Assignment | Owner name: RAMOT AT TEL-AVIV UNIVERSITY LTD., ISRAEL Free format text: ASSIGNMENT OF ASSIGNORS INTEREST;ASSIGNORS:BERMAN, DANA;AVIDAN, SHAI;SIGNING DATES FROM 20181022 TO 20181111;REEL/FRAME:047482/0325 Owner name: CARMEL HAIFA UNIVERSITY ECONOMIC CORPORATION LTD., Free format text: ASSIGNMENT OF ASSIGNORS INTEREST;ASSIGNOR:TREIBITZ, AVITAL;REEL/FRAME:047482/0275 Effective date: 20180902 Owner name: CARMEL HAIFA UNIVERSITY ECONOMIC CORPORATION LTD., ISRAEL Free format text: ASSIGNMENT OF ASSIGNORS INTEREST;ASSIGNOR:TREIBITZ, AVITAL;REEL/FRAME:047482/0275 Effective date: 20180902 |
FEPP | Fee payment procedure | Free format text: PETITION RELATED TO MAINTENANCE FEES GRANTED (ORIGINAL EVENT CODE: PTGR); ENTITY STATUS OF PATENT OWNER: SMALL ENTITY |
STPP | Information on status: patent application and granting procedure in general | Free format text: NON FINAL ACTION MAILED |
STPP | Information on status: patent application and granting procedure in general | Free format text: RESPONSE TO NON-FINAL OFFICE ACTION ENTERED AND FORWARDED TO EXAMINER |
STCF | Information on status: patent grant | Free format text: PATENTED CASE |
FEPP | Fee payment procedure | Free format text: MAINTENANCE FEE REMINDER MAILED (ORIGINAL EVENT CODE: REM.); ENTITY STATUS OF PATENT OWNER: SMALL ENTITY |