EP2128693A1 - Spatially Adaptive Photographic Flash Unit - Google Patents

Spatially Adaptive Photographic Flash Unit Download PDF

Info

Publication number
EP2128693A1
EP2128693A1 EP08104125A EP08104125A EP2128693A1 EP 2128693 A1 EP2128693 A1 EP 2128693A1 EP 08104125 A EP08104125 A EP 08104125A EP 08104125 A EP08104125 A EP 08104125A EP 2128693 A1 EP2128693 A1 EP 2128693A1
Authority
EP
European Patent Office
Prior art keywords
depth
camera
flash
flash unit
data
Prior art date
Legal status (The legal status is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the status listed.)
Withdrawn
Application number
EP08104125A
Other languages
German (de)
French (fr)
Inventor
Rolf Adelsberger
Markus Gross
Marc Levoy
Remo Ziegler
Current Assignee (The listed assignees may be inaccurate. Google has not performed a legal analysis and makes no representation or warranty as to the accuracy of the list.)
Eidgenoessische Technische Hochschule Zurich ETHZ
Original Assignee
Eidgenoessische Technische Hochschule Zurich ETHZ
Priority date (The priority date is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the date listed.)
Filing date
Publication date
Application filed by Eidgenoessische Technische Hochschule Zurich ETHZ filed Critical Eidgenoessische Technische Hochschule Zurich ETHZ
Priority to EP08104125A priority Critical patent/EP2128693A1/en
Priority to EP09726437.8A priority patent/EP2321697B1/en
Publication of EP2128693A1 publication Critical patent/EP2128693A1/en
Withdrawn legal-status Critical Current

Links

Images

Classifications

    • GPHYSICS
    • G03PHOTOGRAPHY; CINEMATOGRAPHY; ANALOGOUS TECHNIQUES USING WAVES OTHER THAN OPTICAL WAVES; ELECTROGRAPHY; HOLOGRAPHY
    • G03BAPPARATUS OR ARRANGEMENTS FOR TAKING PHOTOGRAPHS OR FOR PROJECTING OR VIEWING THEM; APPARATUS OR ARRANGEMENTS EMPLOYING ANALOGOUS TECHNIQUES USING WAVES OTHER THAN OPTICAL WAVES; ACCESSORIES THEREFOR
    • G03B15/00Special procedures for taking photographs; Apparatus therefor
    • G03B15/02Illuminating scene
    • HELECTRICITY
    • H04ELECTRIC COMMUNICATION TECHNIQUE
    • H04NPICTORIAL COMMUNICATION, e.g. TELEVISION
    • H04N23/00Cameras or camera modules comprising electronic image sensors; Control thereof
    • H04N23/70Circuitry for compensating brightness variation in the scene
    • H04N23/74Circuitry for compensating brightness variation in the scene by influencing the scene brightness using illuminating means
    • HELECTRICITY
    • H04ELECTRIC COMMUNICATION TECHNIQUE
    • H04NPICTORIAL COMMUNICATION, e.g. TELEVISION
    • H04N23/00Cameras or camera modules comprising electronic image sensors; Control thereof
    • H04N23/95Computational photography systems, e.g. light-field imaging systems
    • H04N23/951Computational photography systems, e.g. light-field imaging systems by using two or more images to influence resolution, frame rate or aspect ratio

Definitions

  • the present invention relates to photographic flash unit for a camera according to the preamble of claim 1.
  • Digital cameras have become inexpensive, robust, and small. As a result, most consumers own one, carry it everywhere they go, and expect it to take good pictures under a variety of shooting conditions. To accommodate dark scenes, most digital cameras include electronic flash - either integrated with the camera or in a detachable unit. More expensive units offer control over brightness, duration, and angular spread. Regardless of how these parameters are chosen, flash illumination always falls off sharply with distance from the camera, making most flash pictures look unnatural. It is plausible to address this problem by capturing a flash-noflash image pair, estimating depth as the ratio of the flash and noflash images, then inversely scaling pixel intensities by this depth. However, separately scaling the intensities of individual pixels does not account for possible interreflections among the corresponding features in the scene. This can lead to physically invalid scenes, and hence to images that appear unnatural.
  • Petschnigg et al. [Petschnigg et al. 2004] and Eisemann et al. [Eisemann and Durand 2004] take two images - one with flash and one without, then use the low noise of the flash image to reduce noise in the noflash image.
  • a spatially invariant flash on a scene with a large depth extent can lead to underexposed regions in the flash image. These dark regions are noisy, thereby diminishing the quality of the detail that can be transferred to the corresponding regions of the noflash image.
  • Using our spatially varying flash we can ameliorate this problem.
  • Nayar et. al [Nayar et al. 2006b] show how a programmable flash can be used to separate an image into direct and indirect reflections. However, they cannot compute how these reflections would change if only a part of the scene were lit. This is a necessary capability if one wishes to reduce illumination on nearby objects.
  • Mohan et al. [Mohan et al. 2007] presents an inexpensive method for acquiring high-quality photographic lighting of desktop-sized static objects such as museum artifacts. They take multiple pictures of the scene, while a moving-head spotlight is scanning the inside of a foam box enclosing the scene.
  • Nayar et. al [Nayar and Branzoi 2003] provide a good summary of previous work. They also present a way to enlarge the dynamic range of a system by adding a spatial light modulator (LCD) or DMD [Nayar et al. 2006a] to the optical path. This enables them to extend the dynamic range of the sensor by varying the exposure on a per-pixel basis.
  • the invention presented here is complementary to this method. It permits us to attenuate light that is too bright, while also providing a way to add light to regions that would otherwise be too dark.
  • a task of this invention to provide a photographic flash unit for a camera allowing a control and a compensation of the flash illumination with the distance of the object as well as a compensation of the effects due to reflexions of the object to be photographed.
  • the invention allows to be used in two different operation modes:
  • one single depth image is acquired, followed by a color image that captures the subject under spatially varying illumination. Since the depth image can be acquired during the focusing of the color camera, that is before the actual photograph, this is considered a «single-shot method».
  • one depth image and at least one color image are captured.
  • the color image can be used to refine the depth image.
  • the color images can serve as pilot images, from which the brightness of each feature in the scene is estimated. This information is used to further refine the spatially varying illumination during a final color image acquisition.
  • the basic idea consists in changing the physical illumination of the scene on a per-pixel basis.
  • the flash unit employs a rangefinder, e.g. an infrared time-of-flight device) to determine the distance to each object, a spatial light modulator SLM, e.g., an LCD-panel and a lense system
  • a rangefinder e.g. an infrared time-of-flight device
  • SLM spatial light modulator
  • this spatially adaptive flash unit sits atop the camera, it cannot simulate lights that arrive from the side or above the subject.
  • this spatially adaptive flash unit sits atop the camera, it cannot simulate lights that arrive from the side or above the subject.
  • by modulating the illumination of each object as a function of its distance from the camera it can simulate a light source located some distance behind the photographer. Illumination from such a source falls off quadratically with distance, but at a slower rate than from a flash unit mounted on the camera.
  • the physical light source is mounted above the camera, as for conventional flash.
  • the flash unit With the flash unit, only one snapshot is required by the main camera, preceded by a single capture using the rangefinder.
  • a second problem is that highly reflective objects may saturate the sensor, especially if they lie close to the camera.
  • the SLM can be controlled to reduce the illumination of these objects during the final image capture.
  • LCD liquid Crystal Display
  • micro-mirror technology some details see e.g. WO 2008/027230 A2 .
  • the LCD- Technique represents a preferred solution. The invention is however not restricted to this LCD-Technique.
  • FIG. 1 Overview of the flash unit
  • Figure 3 diagram intensity vs. depth for a real and virtual light sources.
  • the setup consists of a Spatially Adaptive Flash Unit 100 as well as a color camera 40.
  • the Spatially Adaptive Flash Unit 100 is positioned, such that the location roughly corresponds to a conventional flash unit mounted on the hot shoe of the color camera 40, cf. Fig. 1 .
  • This allocation of the different components has been chosen to get as close as possible to a candid shot setting.
  • this setup with a wired communication 105 is not limited to this setting, since the Spatially Adaptive Flash Unit 100 could also be triggered - sometimes also called released - wirelessly from the camera 40.
  • SAFU Stasonic Flash Unit
  • the Spatially Adaptive Flash Unit 100 consists of a projector 20 or a case 20 with lamp 21, a LCD panel 23 and a lens 22.
  • the Spatially Adaptive Flash Unit 100 contains additionally a depth camera 30 based on time-of-flight (ToF) technology.
  • the conceptual or procedural setup is shown in Fig. 2 .
  • the light bulb from a projector SANYO PLC-XW50 was removed and replaced by a Canon Speedlite 580EX II.
  • a mirrored tube 25 was built. This maximizes the light projected into the light path 24.
  • the rest of the light path 24, including three LCD's 23, a lens 22, and the depth camera 30, are not modified.
  • the alignment of the optical axis of the depth camera 30 and the optical axis of the lens 22 of the light are chosen to be as close as possible, in order to minimize the disparity between their views of the seen.
  • the before mentioned tube was built in this embodiment, other embodiments are possible with such a tube 25.
  • a Swissranger ToF-camera 30 from Mesa-Imaging was employed. It has a resolution of 176 ⁇ 144 pixels, providing the depth and amplitude at a modulation frequency of 20MHz. This allows for a depth range of 7.5m without ambiguities.
  • the ToF-camera 30 has a built in illumination source, which operates in the infrared spectrum of 850 nm. The accuracy is highly dependent on the reflected amplitude as well as on the background illumination. According to [Büttgen et al.] the best depth accuracy lies in the mm range. The captured depth maps partially suffer from substantial noise, which has to be filtered.
  • the color camera 40 is a Canon EOS 1D Mark III camera with a Canon EF 16-35mm f/2.8L USM lens.
  • the camera 40 is either connected by USB bidirectionally to a Personal Computer (not shown in the Figures) or bidirectionally to a processing unit 101 with a communication connection 102.
  • a Canon SDK all required settings of the camera 40 can be made.
  • the distortion, as well as the extrinsic and intrinsic matrices of the depth camera 30 and the color camera 40, are computed from six images of the printed checkerboard pattern.
  • This method can be used for both types of cameras, since the checkers are also visible in the infrared spectrum of the depth camera 30.
  • the case 20 is calibrated relative to the color camera 40 based on the method presented in [Shen and Menq 2002].
  • a novel checkerboard pattern of different color has been used. Instead of taking a black checkerboard pattern, which has to be covered or removed in order to recognize the projected one, a blue colored and printed checkerboard pattern was used, while projecting white for the first image. The same printed checkerboard pattern is used as a projection surface for the second image.
  • the projected pattern is a standard black and white checkerboard pattern.
  • the calibration pattern can be extracted from the red channel for the first image, while it is extracted from the blue channel for the second image. This modification has the advantage, that the plane of reference will be exactly the same for both images, leading to a better accuracy, and more simplicity than the previous approach.
  • Fig. 2 shows the single-shot-method and multishot-method. Both methods can be split into four steps:
  • a median filter is applied to remove outliers in depths returned by the ToF-camera 30, and a novel trilateral filter is applied to reduce noise.
  • the filtered depth data is then back projected, triangulated, and rendered into the case 20 view using graphics hardware.
  • This step performs warping as resolves visibility in real-time.
  • a conservative depth discontinuity diffusion filter was applied. This filter gives depth values that are correct or slightly closer than the actual depth, thereby diminishing the visibility of possible misalignments.
  • an attenuation mask is computed. This computation takes as input the filtered depths, the virtual light position, and a user defined artistic filter. This attenuation mask is loaded onto the LCD-Panel 23 of the Spatially Adaptive Flash Unit 100, and a final image is captured by triggering the flash unit 100 and camera to fire via the release 41.
  • the single-shot method permits compensation for the unnatural falloff of a conventional flash by virtually repositioning the light source 21 further back -behind the photographer.
  • per-pixel depths are required.
  • the time-of-flight camera 30 captures these depths at the same time the color camera is being aimed and focused.
  • This method where flash intensity is modulated based solely on depth, is called here as «Single-shot flash adjustment».
  • the «Single-shot flash adjustment» has the advantage of minimizing motion artifacts relative to methods that require multiple color images. However, variations in object reflectance cannot be taken into account by this method.
  • the captured depth data has a direct influence on the projected intensity per pixel. Noise in the depth data is, therefore, also visible as noise in the evaluated illumination. Since the human visual system is very sensitive to relative intensity variations, noise in the flash intensity becomes very easily visible. Although increasing the exposure time of the depth camera 30 improves the Signal Noise Ratio, it also increases motion blur artifacts. Therefore, the noise is reduced by applying different filters on the depth data allowing to keep a short exposure time.
  • Bilateral filtering [Tomasi and Manduchi 1998] has shown to perform well for filtering noise while preserving discontinuities in the signal. However, in order to remove the noise, the discontinuity of the signal always has to be higher than the noise level, which is not always the case when dealing with depth data from ToF-cameras. Eisemann and Durand [Eisemann and Durand 2004] and Petschnigg et al. [Petschnigg et al. 2004] introduced the cross bilateral filter - also known as joint bilateral filter - as a variant of the classical bilateral filter. They use a low noise image, meaning a high confidence in the high frequency data, as input to the range weight. This avoids smoothing across discontinuities of the signal.
  • c is the speed of light
  • f mod is the modulation frequency
  • BG is the background illumination
  • a sig is the mean number of electrons generated by the signal
  • a joint adaptive trilateral filter has been introduced, which takes the amplitude dependent standard deviation ⁇ D of the depth values into account.
  • the joint adaptive trilateral filter consists of a spatial proximity component g( ⁇ ), a depth dependent component f( ⁇ ) in the range domain and an amplitude dependent component h( ⁇ )in the range domain.
  • a m and a i are amplitude values at position m and i respectively.
  • the three functions g( ⁇ ), f( ⁇ ) and h( ⁇ ) are chosen to be Gaussian kernels, each with a kernel support of ⁇ , and the corresponding standard deviations ⁇ p , ⁇ d and ⁇ a respectively.
  • the filter is smoothing the depth values, while limiting the influence at depth or amplitude discontinuities.
  • the noise is not uniformly distributed, we adapt ⁇ d and ⁇ a , and therefore, the influence of f( ⁇ ) and h( ⁇ ).
  • a median filter is applied, which removes the outliers in one step.
  • the median filter has only two pixels with an amplitude below a threshold ⁇ m .
  • a threshold of ⁇ m 40 has shown to be a good value.
  • One possibility to improve the depth data in a sense as to comply with the «conservative approach» is shallow edge diffusion.
  • First all the big depth discontinuities are extracted using a Canny edge detector.
  • the obtained edges are simplified to a contour and every vertex of it is moved in the direction of its local gradient towards the region with the bigger depth values.
  • the amount of translation corresponds to the multiplication of a globally defined width and the local gradient of the vertex before translation.
  • a set P of points lying inside the polygon is created.
  • a modified bilateral filter is applied, cf Equation (8) to all the depth values d m lying inside of P, such that the depth edge is filtered and moved towards the region of bigger depth.
  • the value k c is normalizing the weights of the depth values d i . g( ⁇ ) and h p ( ⁇ ) are two Gaussian kernels defined on the support ⁇ P . d imin corresponds to the shallowest depth value of the kernel support.
  • d m 1 k c ⁇ i ⁇ ⁇ P d i ⁇ g ⁇ ⁇ m - i ⁇ ⁇ h p ⁇ ⁇ d i min - d i ⁇
  • This intensity falloff becomes visible in flash-photography and leads to unnaturally lit scenes.
  • a light source positioned further behind the photographer would result in a more homogeneously lit scene, which would appear more natural.
  • a spatially dependent attenuation function can be evaluated, the more desirable light position as a virtual light source can be simulated.
  • the photographer can choose two interdependent parameters, namely the position of the virtual light source along the optical axis of the Spatial Adaptive Flash Unit 100, and the minimal illumination for a specific subject of the scene. These preferences determine the depth compensation function leading to an attenuation per pixel.
  • the attenuation can in fact be interpreted as a filter by which the flash light is being modulated. It can also be used as artistic filters giving the photographer more creative freedom.
  • the parameters which can be set by the user, are chosen based on the possible needs of the photographer.
  • the photographer wants to pick the position of the light, and on the other hand, he wants to choose how much light is added to the subject of interest.
  • the former can be set by providing the depth offset d off relative to the photographers position, with the positive axis lying in front of the photographer.
  • the latter is defined by an intensity I sub ⁇ 0 ⁇ 1 ⁇ 2 .. 255 of the flash at the depth corresponding to the focal length d sub of the camera. It is believed, that this is a natural way of parameterizing the virtual light source, since the photographer wants to primarily have control over how much light is added to the subject in focus. Especially for hand-held shots, the exposure time should not drop below a certain threshold to avoid motion blur due to camera shake.
  • this method is not limited to this parameterization, but it is the one to be most useful for candid shot settings.
  • f c (d i ) corresponds to the intensity that a pixel at index i requires to compensate for the position of the virtual light source.
  • the «single-shot flash adjustment» is limited by a low resolution depth image, as well as a low resolution infrared amplitude image. Therefore, scenes featuring very fine detail might show artifacts due to wrong depth values. Furthermore, the lack of color information does not allow to correct for varying reflectance properties and overexposed regions automatically. To improve on these two limitations, a «multi-shot flash adjustment» is introduced. Since a color image has to be captured before taking the final shot with the corrected flash light, this is considered as a multi-shot approach. Assuming an already calibrated setup as for the «single-shot flash adjustment», the «multi-shot flash adjustment» can be split into following steps, see also Figure 2 :
  • n is the ratio between the width in pixels of the color camera, and the width in pixels of the depth camera, and for the height respectively.
  • the function g ⁇ ( ⁇ ) and h ⁇ c ⁇ •) are Gaussian kernels with a kernel support of ⁇ .
  • the color at position m is referred to as I ⁇ m .
  • the image pixel values z i are only dependent of the irradiance E i and the chosen exposure time ⁇ t for the camera response.
  • g(z i ) can be determined using a method presented by Debevec et. al [Debevec and Malik 1997]. Since the duration of the flash unit is usually much shorter than the exposure time, its response function g F (z i ) does not depend on the exposure time.
  • a new response curve g n (z i ) for the camera-flash system has to be found, which is similar to the response curve g(z i ) of the camera, but extends the camera's dynamic range independently of per-pixel depth. Furthermore, the response curve shall avoid overexposure in the flash-lit image.
  • g n (z i ) is bound by the lower bound g l (z i ) and the upper bound g(z i ).
  • the image pixels I xy of the no-flash image at position (x, y), and the image pixels of the flash image I' xy lie in the same range as z i .
  • Equation(19) The solution for g n (z i ) can now be approximated as a linear scaling of g(z i ) by factor k, see Equation(19).
  • I xy is the pixel value at coordinate x,y.
  • z i is the intensity, which lies in the same range as the intensity values I xy . Therefore, z i is used to describe the transfer function. But the z i could anytime be exchanged by I xy . complies with the lower bound g l (z i ).
  • g n (z i ) could be smoothed slightly in order to avoid discontinuities in the fixed range.

Landscapes

  • Engineering & Computer Science (AREA)
  • Multimedia (AREA)
  • Signal Processing (AREA)
  • Computing Systems (AREA)
  • Theoretical Computer Science (AREA)
  • Physics & Mathematics (AREA)
  • General Physics & Mathematics (AREA)
  • Studio Devices (AREA)

Abstract

Using photographic flash for candid shots often results in an unevenly lit scene, in which objects in the back appear dark. A spatially adaptive photographic flash (100) is disclosed, in which the intensity of illumination (21, 23) varies depending on the depth and reflectivity (30, 101) of features in the scene. Adaption to changes in depth are used in a single-shot method. Adaption to changes in reflectivity are used in a multishot method. The single-shot method requires only a depth image (30), whereas the multi-shot method requires at least one color image (40) in addition to the depth data (30).

Description

  • The present invention relates to photographic flash unit for a camera according to the preamble of claim 1.
  • Digital cameras have become inexpensive, robust, and small. As a result, most consumers own one, carry it everywhere they go, and expect it to take good pictures under a variety of shooting conditions. To accommodate dark scenes, most digital cameras include electronic flash - either integrated with the camera or in a detachable unit. More expensive units offer control over brightness, duration, and angular spread. Regardless of how these parameters are chosen, flash illumination always falls off sharply with distance from the camera, making most flash pictures look unnatural. It is tempting to address this problem by capturing a flash-noflash image pair, estimating depth as the ratio of the flash and noflash images, then inversely scaling pixel intensities by this depth. However, separately scaling the intensities of individual pixels does not account for possible interreflections among the corresponding features in the scene. This can lead to physically invalid scenes, and hence to images that appear unnatural.
  • Graphics and vision researchers have long recognized the importance of controlling illumination in computational photography methods. These methods according to the kind of illumination they employ can be classified as follows:
  • i) Single Spatially Invariant Flash
  • Petschnigg et al. [Petschnigg et al. 2004] and Eisemann et al. [Eisemann and Durand 2004] take two images - one with flash and one without, then use the low noise of the flash image to reduce noise in the noflash image. For this purpose they employ a joint bilateral filter (a.k.a. cross-bilateral filter). However, using a spatially invariant flash on a scene with a large depth extent can lead to underexposed regions in the flash image. These dark regions are noisy, thereby diminishing the quality of the detail that can be transferred to the corresponding regions of the noflash image. Using our spatially varying flash, we can ameliorate this problem.
  • ii) Single Spatially Invariant Flash using Multiple Shots
  • Several problems of flash images, such as strong highlights due to reflection of the flash by glossy surfaces, saturation of nearby objects, and poor illumination of distant objects are addressed by Agarwal et. al [Agrawal et al. 2005]. They capture a flash-noflash image pair, estimate depth as the ratio of the flash and noflash images, then inversely scale the gradients of each pixel in the flash image by this depth. The final image is computed by integrating this gradient field. However, separately scaling the intensities of individual pixels does not account for possible interreflections among features in the scene, as mentioned earlier. By contrast, our spatially adaptive flash applies different illumination along each ray. This produces a physically valid scene, and it requires capturing only one image rather than a pair of images.
  • iii) Programmable Illumination
  • Using a projector-like light source is a powerful alternative to spatially invariant flash illumination. Nayar et. al [Nayar et al. 2006b] show how a programmable flash can be used to separate an image into direct and indirect reflections. However, they cannot compute how these reflections would change if only a part of the scene were lit. This is a necessary capability if one wishes to reduce illumination on nearby objects. Mohan et al. [Mohan et al. 2007] presents an inexpensive method for acquiring high-quality photographic lighting of desktop-sized static objects such as museum artifacts. They take multiple pictures of the scene, while a moving-head spotlight is scanning the inside of a foam box enclosing the scene. This requires a controlled environment, which is typically not possible for candid shots. Other systems for programmatically controlled illumination include time-multiplexed lighting [Belhumeur 2007] and Paul Debevec's Light Stages [Debevec et al. 2000]. However, these methods are not suitable for candid photography.
  • iv) Dynamic Range
  • Many methods have been proposed for improving the dynamic range of an imaging system. These systems typically compromise either temporal or spatial resolution, or they employ multiple detectors. Nayar et. al [Nayar and Branzoi 2003] provide a good summary of previous work. They also present a way to enlarge the dynamic range of a system by adding a spatial light modulator (LCD) or DMD [Nayar et al. 2006a] to the optical path. This enables them to extend the dynamic range of the sensor by varying the exposure on a per-pixel basis. The invention presented here is complementary to this method. It permits us to attenuate light that is too bright, while also providing a way to add light to regions that would otherwise be too dark.
  • It is therefore, a task of this invention to provide a photographic flash unit for a camera allowing a control and a compensation of the flash illumination with the distance of the object as well as a compensation of the effects due to reflexions of the object to be photographed.
  • This task is solved by a photographic flash unit given with the features in claim 1.
  • Summary of the invention
  • With
    «a connection of the depth-camera with a spatial light modulator,
    • the spatial light modulator being arranged between the lamp and the lens,
    • the spatial light modulator being structured into a plurality of pixels,
      a transformation of depth-data, the depth-data containing a distance per pixel taken from the depth-camera, being displayed on the spatial light modulator in order to achieve per pixel a depth-dependent light adjustment of the flash emitted by the lamp»;
      a flash unit is provided which allows an illumination of a scene with a variety of light sources, where each light source illuminates a part of the scene depending of the distance from the camera used for capturing a picture to that part of the scene.
  • The invention allows to be used in two different operation modes:
    • single-shot operation;
    • multi-shot operation.
  • In the single-shot operation, one single depth image is acquired, followed by a color image that captures the subject under spatially varying illumination. Since the depth image can be acquired during the focusing of the color camera, that is before the actual photograph, this is considered a «single-shot method».
  • In the multi-shot operation, one depth image and at least one color image are captured. The color image can be used to refine the depth image. Furthermore, the color images can serve as pilot images, from which the brightness of each feature in the scene is estimated. This information is used to further refine the spatially varying illumination during a final color image acquisition.
  • The basic idea consists in changing the physical illumination of the scene on a per-pixel basis. The flash unit employs a rangefinder, e.g. an infrared time-of-flight device) to determine the distance to each object, a spatial light modulator SLM, e.g., an LCD-panel and a lense system
    • similar to a projector - to modulate the illumination. While this approach is limited by the power of the light source, the illumination it produces is physically plausible, as are the interreflections it triggers in the scene, so the images look natural.
  • Using such a flash unit it is possible to simulate a variety of virtual light sources. Unless the light source is a dramatic player in the scene, most photographers prefer sources that are diffuse, distant from the primary subject, and angled with respect to the direction of view. For interior shots, examples are ceiling lights or the sun passing through a side window. Since this spatially adaptive flash unit sits atop the camera, it cannot simulate lights that arrive from the side or above the subject. However, by modulating the illumination of each object as a function of its distance from the camera, it can simulate a light source located some distance behind the photographer. Illumination from such a source falls off quadratically with distance, but at a slower rate than from a flash unit mounted on the camera. In order to avoid unnatural shadows above the subject, the physical light source is mounted above the camera, as for conventional flash.
  • With the flash unit, only one snapshot is required by the main camera, preceded by a single capture using the rangefinder. For scenes in which flash provides the bulk of the illumination, a second problem is that highly reflective objects may saturate the sensor, especially if they lie close to the camera. By capturing an additional pilot image to estimate object reflectance's, the SLM can be controlled to reduce the illumination of these objects during the final image capture. One can treat this additional modulation as a sort of «physical tone mapping» - reducing the dynamic range of the scene by adjusting its illumination locally.
  • Projection displays have obtained popularity because of the smaller form-factor and larger size of screen. Among several types of projection displays, the ones using micro-Spatial Light Modulators SLMs are gaining recognition by consumers because of high performance of picture quality as well as lower cost than Flat Panel Displays. There are two types of a micro-SLM used for projection displays in the market. One is microLiquid Crystal Display (LCD) and the other is micro-mirror technology, some details see e.g. WO 2008/027230 A2 . In the context of this invention the LCD- Technique represents a preferred solution. The invention is however not restricted to this LCD-Technique.
  • The working principle of the invention will now be described more in detail with reference to the accompanying drawings wherein:
  • Figure 1
    Overview of the flash unit;
  • Figure 2
    steps of the single-shot-method and multishot-method;
  • Figure 3
    diagram intensity vs. depth for a real and virtual light sources.
  • The setup consists of a Spatially Adaptive Flash Unit 100 as well as a color camera 40. In a standard setup the Spatially Adaptive Flash Unit 100 is positioned, such that the location roughly corresponds to a conventional flash unit mounted on the hot shoe of the color camera 40, cf. Fig. 1. This allocation of the different components has been chosen to get as close as possible to a candid shot setting. However, this setup with a wired communication 105 is not limited to this setting, since the Spatially Adaptive Flash Unit 100 could also be triggered - sometimes also called released - wirelessly from the camera 40. However, if the Spatially Adaptive Flash Unit «SAFU» 100 is not attached rigidly to the camera 40, the multi-shot approach - details see later on - would require recalibration each time either is moved.
  • The Spatially Adaptive Flash Unit 100 consists of a projector 20 or a case 20 with lamp 21, a LCD panel 23 and a lens 22. The Spatially Adaptive Flash Unit 100 contains additionally a depth camera 30 based on time-of-flight (ToF) technology. The conceptual or procedural setup is shown in Fig. 2. For a concrete embodiment of the invention, the light bulb from a projector SANYO PLC-XW50 was removed and replaced by a Canon Speedlite 580EX II. In order to increase the intensity from the Spatially Adaptive Flash Unit 100, a mirrored tube 25 was built. This maximizes the light projected into the light path 24. The rest of the light path 24, including three LCD's 23, a lens 22, and the depth camera 30, are not modified. The alignment of the optical axis of the depth camera 30 and the optical axis of the lens 22 of the light are chosen to be as close as possible, in order to minimize the disparity between their views of the seen. the before mentioned tube was built in this embodiment, other embodiments are possible with such a tube 25.
  • A Swissranger ToF-camera 30 from Mesa-Imaging was employed. It has a resolution of 176 × 144 pixels, providing the depth and amplitude at a modulation frequency of 20MHz. This allows for a depth range of 7.5m without ambiguities. The ToF-camera 30 has a built in illumination source, which operates in the infrared spectrum of 850 nm. The accuracy is highly dependent on the reflected amplitude as well as on the background illumination. According to [Büttgen et al.] the best depth accuracy lies in the mm range. The captured depth maps partially suffer from substantial noise, which has to be filtered.
  • The color camera 40 is a Canon EOS 1D Mark III camera with a Canon EF 16-35mm f/2.8L USM lens. The camera 40 is either connected by USB bidirectionally to a Personal Computer (not shown in the Figures) or bidirectionally to a processing unit 101 with a communication connection 102. Using a Canon SDK all required settings of the camera 40 can be made.
  • The distortion, as well as the extrinsic and intrinsic matrices of the depth camera 30 and the color camera 40, are computed from six images of the printed checkerboard pattern. This method can be used for both types of cameras, since the checkers are also visible in the infrared spectrum of the depth camera 30. The case 20 is calibrated relative to the color camera 40 based on the method presented in [Shen and Menq 2002]. To increase the accuracy of the projector calibration, a novel checkerboard pattern of different color has been used. Instead of taking a black checkerboard pattern, which has to be covered or removed in order to recognize the projected one, a blue colored and printed checkerboard pattern was used, while projecting white for the first image. The same printed checkerboard pattern is used as a projection surface for the second image. The projected pattern is a standard black and white checkerboard pattern. The calibration pattern can be extracted from the red channel for the first image, while it is extracted from the blue channel for the second image. This modification has the advantage, that the plane of reference will be exactly the same for both images, leading to a better accuracy, and more simplicity than the previous approach.
  • It is important to note that this calibration has to be done only once, assuming the Spatially Adaptive Flash Unit 100 is mounted rigidly to the camera.
  • Fig. 2 shows the single-shot-method and multishot-method. Both methods can be split into four steps:
    • acquisition,
    • depth filtering,
    • reprojection and
    • lighting.
  • During the filter step, a median filter is applied to remove outliers in depths returned by the ToF-camera 30, and a novel trilateral filter is applied to reduce noise. The filtered depth data is then back projected, triangulated, and rendered into the case 20 view using graphics hardware. This step performs warping as resolves visibility in real-time. To cope with inaccuracies in calibration or depth values, a conservative depth discontinuity diffusion filter was applied. This filter gives depth values that are correct or slightly closer than the actual depth, thereby diminishing the visibility of possible misalignments. Next an attenuation mask is computed. This computation takes as input the filtered depths, the virtual light position, and a user defined artistic filter. This attenuation mask is loaded onto the LCD-Panel 23 of the Spatially Adaptive Flash Unit 100, and a final image is captured by triggering the flash unit 100 and camera to fire via the release 41.
  • The single-shot method permits compensation for the unnatural falloff of a conventional flash by virtually repositioning the light source 21 further back -behind the photographer. To simulate the falloff of such a light source 21, per-pixel depths are required. The time-of-flight camera 30 captures these depths at the same time the color camera is being aimed and focused. This method, where flash intensity is modulated based solely on depth, is called here as «Single-shot flash adjustment». The «Single-shot flash adjustment» has the advantage of minimizing motion artifacts relative to methods that require multiple color images. However, variations in object reflectance cannot be taken into account by this method.
  • Assuming an already calibrated setup, The «Single-shot flash adjustment» can be split into the following steps, see also Fig. 2:
  • 1
    Acquisition of depth and amplitude (infrared);
    2
    Depth filtering;
    3
    Reprojection from the depth-camera to the flash unit;
    4
    Per pixel, depth-dependent light adjustment;
    5
    Color camera Capture.
  • The acquisition using the depth-camera 30 is straight forward, and therefore for the person skilled in the art not elaborated any further.
  • Depth Filtering
  • The captured depth data has a direct influence on the projected intensity per pixel. Noise in the depth data is, therefore, also visible as noise in the evaluated illumination. Since the human visual system is very sensitive to relative intensity variations, noise in the flash intensity becomes very easily visible. Although increasing the exposure time of the depth camera 30 improves the Signal Noise Ratio, it also increases motion blur artifacts. Therefore, the noise is reduced by applying different filters on the depth data allowing to keep a short exposure time.
  • Bilateral filtering [Tomasi and Manduchi 1998] has shown to perform well for filtering noise while preserving discontinuities in the signal. However, in order to remove the noise, the discontinuity of the signal always has to be higher than the noise level, which is not always the case when dealing with depth data from ToF-cameras. Eisemann and Durand [Eisemann and Durand 2004] and Petschnigg et al. [Petschnigg et al. 2004] introduced the cross bilateral filter - also known as joint bilateral filter - as a variant of the classical bilateral filter. They use a low noise image, meaning a high confidence in the high frequency data, as input to the range weight. This avoids smoothing across discontinuities of the signal. This approach has been extended to a Joint Adaptive Trilateral Filter, where the noise model of the depth values of the ToF-Camera 30 is taken into account for the confidence of the range weights. According to [Büttgen et al.] , the standard deviation σ D of the depth data can be described as σ D = c 4 π f mod 2 B c demod A sig , with
    Figure imgb0001
    c demod = A B , and
    Figure imgb0002
    B = A sig + BG ,
    Figure imgb0003
  • where c is the speed of light, fmod is the modulation frequency, BG is the background illumination, Asig is the mean number of electrons generated by the signal, and A is the measured amplitude in electrons. Additional noise sources, such as thermal noise or dark current electrons are minimized by the camera and are neglected by this filter. Since c and fmod are constant and Cdemod can be approximated by 0.5 according to [Büttgen et al. ], the standard deviation of the depth distribution is σ D k A sig ,
    Figure imgb0004

    with k = c 4 Π f mod 2 2 .
    Figure imgb0005

    This relationship shows that σD, and therefore, the noise in the depth image, mainly depends on the amplitude.
  • Joint Adaptive Trilateral Filter
  • A joint adaptive trilateral filter has been introduced, which takes the amplitude dependent standard deviation σD of the depth values into account. The joint adaptive trilateral filter consists of a spatial proximity component g(·), a depth dependent component f(·) in the range domain and an amplitude dependent component h(·)in the range domain. The filtered depth value dm at position m can be described as: d m = 1 k c i Ω d i g m - i spatial f d m - d i h a m - a i range
    Figure imgb0006
    k c = i Ω g m - i f d m - d i h a m - a i
    Figure imgb0007
  • where am and ai are amplitude values at position m and i respectively. The three functions g(·), f(·) and h(·) are chosen to be Gaussian kernels, each with a kernel support of Ω, and the corresponding standard deviations σp, σd and σa respectively.
  • Intuitively, the filter is smoothing the depth values, while limiting the influence at depth or amplitude discontinuities. However, since the noise is not uniformly distributed, we adapt σd and σa, and therefore, the influence of f(·) and h(·). Having a good approximation of the uncertainty measure of the depth values equation (1), the definition of two amplitude dependent functions σd(·) and σa(·) as follows: σ d a m = σ d init τ a a m + 1
    Figure imgb0008
    σ a a m = σ a init - σ a min 1 + e - a m + τ a + σ a min .
    Figure imgb0009
  • The initial standard deviation σdinit is set to a low value, while the is set to a high value, dropping to σamin at ama.
  • Median Filter For very low amplitudes, the provided depth is very often an outlier. Although the joint adaptive trilateral filter would be able to remove such outliers, it would require several iterations to do so. Therefore, a median filter is applied, which removes the outliers in one step. To avoid unnecessarily blurring regions without outliers, the median filter has only two pixels with an amplitude below a threshold τm. A threshold of τm =40 has shown to be a good value.
  • Mesh-Based Reprojection
  • To attenuate the light according to the depth values, they have to be transformed from the depth camera view to the projector view. In order to solve this reprojection step efficiently, a trivial triangular mesh is formed using all the back projected pixels from the depth camera and render them into the projector view. This provides a depth value per projector pixel and solves the visibility issue using the graphics card.
  • Artifacts can be introduced through resolution differences or remaining inaccuracies in the depth values. Since there is not more information to improve the accuracy of the data during the single-shot acquisition, the quality of the result is improved by moving the error to regions where it is least visible. Depth values that are too large produce visible artifacts, appearing as unexpectedly bright illumination. These artifacts are especially noticeable on objects close to the camera. Similarly, depth values that are too small produce unexpectedly dim illumination. Most of these artifacts appear at depth discontinuities. This means, that dim illumination due to too small depth values, will be adjacent to shadows produced by the flash and thus be less noticeable than the ones caused by too large depth values. Therefore, it is advantageous to err on the side of computing depth values that are either correct or too shallow. This is referred as the «conservative approach».
  • Shallow Edge Diffusion
  • One possibility to improve the depth data in a sense as to comply with the «conservative approach», is shallow edge diffusion. First all the big depth discontinuities are extracted using a Canny edge detector. Then, the obtained edges are simplified to a contour and every vertex of it is moved in the direction of its local gradient towards the region with the bigger depth values. The amount of translation corresponds to the multiplication of a globally defined width and the local gradient of the vertex before translation. By connecting the first and last vertex of the original and the moved contour, a set P of points lying inside the polygon is created. Finally, a modified bilateral filter is applied, cf Equation (8) to all the depth values dm lying inside of P, such that the depth edge is filtered and moved towards the region of bigger depth. The value kc is normalizing the weights of the depth values di. g(·) and hp(·) are two Gaussian kernels defined on the support ΩP . dimin corresponds to the shallowest depth value of the kernel support. d m = 1 k c i Ω P d i g m - i h p d i min - d i
    Figure imgb0010
  • Depth Dependent Flash Adjustment
  • The intensity I0 of an illumination source at distance 1 decreases with the distance d to an intensity of I d = I 0 d 2 .
    Figure imgb0011

    This intensity falloff becomes visible in flash-photography and leads to unnaturally lit scenes. A light source positioned further behind the photographer, would result in a more homogeneously lit scene, which would appear more natural. Based on the depth values obtained from the previous step, a spatially dependent attenuation function can be evaluated, the more desirable light position as a virtual light source can be simulated. The photographer can choose two interdependent parameters, namely the position of the virtual light source along the optical axis of the Spatial Adaptive Flash Unit 100, and the minimal illumination for a specific subject of the scene. These preferences determine the depth compensation function leading to an attenuation per pixel.
  • The attenuation can in fact be interpreted as a filter by which the flash light is being modulated. It can also be used as artistic filters giving the photographer more creative freedom.
  • User Preferences
  • The parameters, which can be set by the user, are chosen based on the possible needs of the photographer. On the one hand, the photographer wants to pick the position of the light, and on the other hand, he wants to choose how much light is added to the subject of interest. The former can be set by providing the depth offset doff relative to the photographers position, with the positive axis lying in front of the photographer. The latter is defined by an intensity I sub 0 1 2 .. 255
    Figure imgb0012

    of the flash at the depth corresponding to the focal length dsub of the camera. It is believed, that this is a natural way of parameterizing the virtual light source, since the photographer wants to primarily have control over how much light is added to the subject in focus. Especially for hand-held shots, the exposure time should not drop below a certain threshold to avoid motion blur due to camera shake. Of course, this method is not limited to this parameterization, but it is the one to be most useful for candid shot settings.
  • Depth Compensation Function
  • Given the final depth values and the user parameters (both see above), a depth compensation function fc(·) can be defined as f c d i = I sub d sub - d off 2 d i 2 d i - d off 2 d sub 2
    Figure imgb0013
    f att d i = { I max - f c d i if f c d i < I max 0 if f c d i I max
    Figure imgb0014

    with di being the depth value at the pixel with index i. fc(di)corresponds to the intensity that a pixel at index i requires to compensate for the position of the virtual light source. fatt(•)is the attenuation function dimming the maximal lamp power of the Spatially Adaptive Flash Unit 100 to result in a projection intensity of fc(•). Since the compensation function is limited, a maximal compensation depth dcmax can be defined as d cmax = - I max d sub - I max I sub d sub - d off 2 d off d sub I sub d sub 2 - 2 I sub d sub d off + I sub d off 2 - I max d sub 2 ,
    Figure imgb0015

    where Imax =fc(dcmax). If dcmax is smaller than any depth in the scene, the system could notify the photographer about a possible underexposure and either recommend a lower Isub or a smaller offset doff .
  • Artistic Filters
  • In addition to compensating for depth, one can also adapt the color balance to keep the lighting mood of the scene.
  • Multi-Shot Flash Adjustment
  • The «single-shot flash adjustment» is limited by a low resolution depth image, as well as a low resolution infrared amplitude image. Therefore, scenes featuring very fine detail might show artifacts due to wrong depth values. Furthermore, the lack of color information does not allow to correct for varying reflectance properties and overexposed regions automatically. To improve on these two limitations, a «multi-shot flash adjustment» is introduced. Since a color image has to be captured before taking the final shot with the corrected flash light, this is considered as a multi-shot approach. Assuming an already calibrated setup as for the «single-shot flash adjustment», the «multi-shot flash adjustment» can be split into following steps, see also Figure 2:
  • 1
    Acquisition of depth and amplitude (infrared), and two color images;
    2
    Depth filtering using high resolution color;
    3
    Reprojection from the up sampled depth-data to the flash unit;
    4
    Per pixel, depth-dependent light adjustment/reflectance correction;
    5
    Color camera capture.
  • Since most of the steps are very similar to the «single-shot flash adjustment» approach, the focus will be on the enhanced filtering and on the flash-tone-mapping as described in the following paragraphs.
  • Depth Filtering with Color
  • For reasons mentioned before, a good depth filtering is very important. Usage of the high-resolution color image allows to filter and up-sample the low resolution depth data, and therefore, get higher detail. Kopf et. al [Kopf et al. 2007] presented the joint bilateral up-sampling for images, where the mapping of corresponding areas between a low resolution image and a high resolution image is given. In the setup presented the depth data from the depth camera is mapped to the color camera by using the reprojection step described above. To improve the correspondences between the depth and the color values, the depth data is filtered according to the definitions given above prior to the mapping. As a result the trivially up-sampled depth data D~ is obtained being of the same resolution as the color image I~ . To improve the performance of the filtering step, a technique motivated by the joint bilateral up sampling [Kopf et al. 2007] is applied.
    Figure imgb0016
    Figure imgb0017
    Instead of evaluating the full filter kernel, every nth pixel at position p on the filter kernel Ω is evaluated. n is the ratio between the width in pixels of the color camera, and the width in pixels of the depth camera, and for the height respectively. The closer the depth camera and the color camera are to a paraxial setting, the closer this approach is to the joint bilateral up-sampling. The function g~(·) and h~c·•) are Gaussian kernels with a kernel support of Ω~. The color at position m is referred to as I~m.
  • Flash-Tone-Mapping
  • In the previous sections a method to adjust the amount of light per pixel depending on the per pixel depth value has been presented. However, the reflectance of the scene has not been taken into account. Regions of the scene, which were not lit before adding the flash light, might saturate when doing so. Therefore, not taking into account the reflectance, would not allow to compensate for this. Using a flash tone-mapping with a locally variable projection, which is motivated by the dodging-and-burning approach presented by Reinhard et. al in [Reinhard et al. 2002] is proposed in this invention.
  • The response curve of the camera without flash g(zi) and the maximum extent of the response curve with flash gF(zi), can be defined as Camera : g z i = ln E i Δ t
    Figure imgb0018
    SAFU : g F z i = ln Δ t E i + Δ F i
    Figure imgb0019

    The image pixel values zi are only dependent of the irradiance Ei and the chosen exposure time Δ t for the camera response. g(zi) can be determined using a method presented by Debevec et. al [Debevec and Malik 1997]. Since the duration of the flash unit is usually much shorter than the exposure time, its response function gF(zi) does not depend on the exposure time.
  • Depth Independent Range Extension
  • A new response curve gn(zi) for the camera-flash system has to be found, which is similar to the response curve g(zi) of the camera, but extends the camera's dynamic range independently of per-pixel depth. Furthermore, the response curve shall avoid overexposure in the flash-lit image. gn(zi) is bound by the lower bound gl(zi) and the upper bound g(zi). The lower bound is determined as g l z i = ln Δ t E i + Δ F i max
    Figure imgb0020
    = ln Δ t E i + min Δ F xy z i , where
    Figure imgb0021
    Δ F xy z i = e g I xy ʹ - e g I xy , xy I xy .
    Figure imgb0022

    The image pixels Ixy of the no-flash image at position (x, y), and the image pixels of the flash image I'xy lie in the same range as zi. The solution for gn(zi) can now be approximated as a linear scaling of g(zi) by factor k, see Equation(19). Ixy is the pixel value at coordinate x,y. zi is the intensity, which lies in the same range as the intensity values Ixy. Therefore, zi is used to describe the transfer function. But the zi could anytime be exchanged by Ixy. complies with the lower bound gl(zi). gn(zi) could be smoothed slightly in order to avoid discontinuities in the fixed range. Given gn(zi) the corresponding luminance contributed by the flash can be calculated as follows g n z i = k g z i = min g l z i g z i g z i .
    Figure imgb0023
  • The scaling factor sxy of the required light per pixel (x, y) of the Spatially Adaptive Flash Unit 100 can now be evaluated as S xy = Δ F i max Δ F xy
    Figure imgb0024

    with ΔF imax = g n z i - g z i .
    Figure imgb0025

    Assuming only direct light contribution by the case 20, the maximum intensity of the corresponding case pixel will be scaled by the scaling factor sxy as well.
  • List of reference signs, glossary
  • 20
    Case
    21
    lamp for emitting a flash, light source
    22
    lens
    23
    Spatial light modulator (SLM), e.g. LCD-panel
    24
    flash propagation inside the case; light path
    25
    mirrored tube
    30
    depth-camera for measuring distances, rangefinder
    40
    Camera, color camera of high resolution
    41
    Release, trigger
    100
    Photographic flash unit, SAFU
    101
    Processing unit (PU), processing unit containing an attenuation function
    102
    bi-directional communication connection from the processing unit to depth-camera
    103
    Control signal to SLM
    104
    bi-directional communication between processing unit and color sensor
    105
    (bi-directional) communication between color sensor and ''flash''
    SAFU
    Spatially adaptive flash unit, flash unit
    ToF
    Time of Flight
    List of cited documents
    • AGRAWAL, A., RASKAR, R., NAYAR, S. K., AND LI, Y. 2005. Removing photography artifacts using gradient projection and flash-exposure sampling. In SIGGRAPH '05: ACM SIGGRAPH 2005 Papers, ACM, New York, NY, USA, 828-835.
    • BELHUMEUR, P. N. 2007. Multiplexing for optimal lighting. IEEE Trans. Pattern Anal. Mach. Intell. 29, 8, 1339-1354. Member-Yoav Y. Schechner and Member-Shree K. Nayar.
    • B" B., OGGIER,T., LEHMANN, R., AND LUSTENUTTGEN, M.,
      KAUFMANN, BERGER, F. Ccd/cmos lock-in pixel for range imaging: Challenges, limitations and state-of-the-art.
    • DEBEVEC, P. E., AND MALIK, J. 1997. Recovering high dynamic range radiance maps from photographs. In SIGGRAPH '97: Proceedings of the 24th annual conference on Computer graphics and interactive techniques, ACM Press/Addison-Wesley Publishing Co., New York, NY, USA, 369-378.
    • DEBEVEC,P., HAWKINS,T., TCHOU, C., DUIKER, H.-P., SAROKIN,W., AND SAGAR, M. 2000. Acquiring the reflectance field of a human face. In SIGGRAPH '00: Proceedings of the 27th annual conference on Computer graphics and interactive techniques, ACM Press/Addison-Wesley Publishing Co., New York, NY, USA, 145-156.
    • EISEMANN, E., AND DURAND, F. 2004. Flash photography enhancement via intrinsic relighting.
    • KOPF, J., COHEN,M. F., LISCHINSKI, D., AND UYTTENDAELE, M. 2007. Joint bilateral up sampling. ACM Trans. Graph. 26, 3, 96.
    • LEVOY, M., CHEN, B., VAISH,V., HOROWITZ, M., MCDOWALL, I., AND BOLAS, M. 2004. Synthetic aperture confocal imaging. ACM Trans. Graph. 23, 3, 825-834.
    • MOHAN, A., BAILEY, R. J., WAITE, J., TUMBLIN, J., GRIMM, C., AND BODENHEIMER, B. 2007. Tabletop computed lighting for practical digital photography. IEEE Trans. Vis. Comput. Graph. 13, 4, 652-662.
    • NAYAR, S. K., AND BRANZOI, V. 2003. Adaptive dynamic range imaging: Optical control of pixel exposures over space and time. iccv 02, 1168.
    • NAYAR, S. K., BRANZOI,V., AND BOULT, T. E. 2006. Programmable imaging: Towards a flexible camera. Int. J. Comput. Vision 70, 1, 7-22.
    • NAYAR, S. K., KRISHNAN, G., GROSSBERG, M. D., AND RASKAR, R. 2006. Fast separation of direct and global components of a scene using high frequency illumination. ACM Trans. Graph. 25, 3, 935-944.
    • PETSCHNIGG, G., SZELISKI, R., AGRAWALA, M., COHEN, M., HOPPE, H., AND TOYAMA, K. 2004. Digital photography with flash and no-flash image pairs. ACM Trans. Graph. 23, 3, 664-672.
    • RASKAR, R., WELCH, G., LOW, K.-L., AND BANDYOPADHYAY, D. 2001. Shader lamps: Animating real objects with image-based illumination. In Proceedings of the 12th Eurographics Workshop on Rendering Techniques, Springer-Verlag, London, UK, 89-102.
    • REINHARD, E., STARK, M., SHIRLEY,P., AND FERWERDA, J. 2002. Photographic tone reproduction for digital images. ACM Transactions on .
    • RUSINKIEWICZ, S., HALL-HOLT, O., AND LEVOY, M. 2002. Real-time 3d model acquisition. In SIGGRAPH '02: Proceedings of the 29th annual conference on Computer graphics and interactive techniques, ACM, New York, NY, USA, 438-446.
    • SEN,P., CHEN, B., GARG, G., MARSCHNER, S. R., HOROWITZ, M., LEVOY, M., AND LENSCH, H. P. A. 2005. Dual photography. ACM Trans. Graph. 24,3, 745-755.
    • SHEN, T.-S., AND MENQ, C.-H. 2002. Digital projector calibration for 3d active vision systems. Transactions of the ASME 124.
    • TOMASI, C., AND MANDUCHI, R. 1998. Bilateral filtering for gray and color images. In ICCV '98: Proceedings of the Sixth International Conference on Computer Vision, IEEE Computer Society, Washington, DC, USA, 839.
    • ZHANG, L., CURLESS, B., HERTZMANN, A., AND SEITZ, S. M. 2003. Shape and motion under varying illumination: Unifying structure from motion, photometric stereo, and multi-view stereo. In ICCV '03: Proceedings of the Ninth IEEE International Conference on Computer Vision, IEEE Computer Society, Washington, DC, USA, 618.

Claims (8)

  1. Photographic flash unit (100) for illuminating objects to be captured by a camera (40), the flash unit (100) comprising
    - a lamp (21) for emitting a flash,
    - a lens (22),
    - a depth-camera (30) for measuring distances per pixel to objects to be flashed and captured by the camera (40);
    where the measured distances are represented by depth-data;
    - a connection (105) between the camera (40) and the lamp for the release (41) of a flash;
    characterized in by
    a connection (102, 103) of the depth-camera (30) with a spatial light modulator (23),
    - the spatial light modulator (23) being arranged between the lamp (21) and the lens (22),
    - the spatial light modulator (23) being structured into a plurality of pixels,
    a transformation of depth-data, the depth-data containing a distance per pixel taken from the depth-camera (30), being displayed on the spatial light modulator (23) in order to achieve per pixel a depth-dependent light adjustment of the flash emitted by the lamp (21).
  2. Flash unit (100) according to claim 1,
    characterized in by
    where the spatial light modulator (23) is a LCD-Panel (23).
  3. Flash unit (100) according to claim 1 or 2,
    characterized in by
    a Processing Unit (101) connected between the depth-camera (30) and the spatial light modulator (23), where the Processing Unit (101) contains a depth compensation function fc in order to evaluate a modulation dependent on the distance per pixel taken from the depth-camera (30).
  4. Flash unit (100) according to claim 3,
    characterized in by
    a filtering function dm contained in the Processing Unit (101) where m represents at least one pixel, the filtering function dm eliminating noise in the depth-data.
  5. Flash unit (100) according to claim 4,
    characterized in by
    the filtering function dm being a median filter in order to remove outliers in the depth-data returned by the depth camera (30).
  6. Flash unit (100) according to one of the claims 3 to 5,
    characterized in by
    depth compensation function fc being fed with two interdependent parameters:
    - the position of a virtual light source along the optical axis of the Flash unit (100) and
    - a minimal illumination of a specific subject of the scene to be photographed.
  7. Flash unit (100) according to one of the claims 3 to 6,
    characterized in by
    a connection (104) between the camera (40) and the processing unit (101), where the processing unit (101) maps the depth-data to image-data captured by the camera (40) and the mapped-data is in the processing unit (101) filtered and transformed resulting in a modulation;
    the modulation is set on the spatial light modulator (23) before emitting a flash.
  8. Flash unit (100) according to one of the claim 1 to 7,
    characterized in by
    means allowing a rigid attachment of the camera (40) to the Flash unit (100).
EP08104125A 2008-04-04 2008-05-28 Spatially Adaptive Photographic Flash Unit Withdrawn EP2128693A1 (en)

Priority Applications (2)

Application Number Priority Date Filing Date Title
EP08104125A EP2128693A1 (en) 2008-05-28 2008-05-28 Spatially Adaptive Photographic Flash Unit
EP09726437.8A EP2321697B1 (en) 2008-04-04 2009-02-04 Spatially adaptive photographic flash unit

Applications Claiming Priority (1)

Application Number Priority Date Filing Date Title
EP08104125A EP2128693A1 (en) 2008-05-28 2008-05-28 Spatially Adaptive Photographic Flash Unit

Publications (1)

Publication Number Publication Date
EP2128693A1 true EP2128693A1 (en) 2009-12-02

Family

ID=39709007

Family Applications (1)

Application Number Title Priority Date Filing Date
EP08104125A Withdrawn EP2128693A1 (en) 2008-04-04 2008-05-28 Spatially Adaptive Photographic Flash Unit

Country Status (1)

Country Link
EP (1) EP2128693A1 (en)

Cited By (11)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
WO2011080383A1 (en) * 2009-12-30 2011-07-07 Nokia Corporation Method and apparatus for illumination
US8514269B2 (en) 2010-03-26 2013-08-20 Microsoft Corporation De-aliasing depth images
CN104012086A (en) * 2011-12-19 2014-08-27 思科技术公司 System and method for depth-guided image filtering in a video conference environment
US8885890B2 (en) 2010-05-07 2014-11-11 Microsoft Corporation Depth map confidence filtering
JP2016092808A (en) * 2014-11-11 2016-05-23 キヤノン株式会社 Imaging apparatus and imaging apparatus control method
US9584790B2 (en) 2013-06-03 2017-02-28 Microsoft Technology Licensing, Llc Edge preserving depth filtering
WO2017080875A1 (en) * 2015-11-10 2017-05-18 Koninklijke Philips N.V. Adaptive light source
EP3229558A1 (en) * 2016-04-08 2017-10-11 Rotolight Limited Lighting system and control thereof
CN107430317A (en) * 2015-03-02 2017-12-01 保富图公司 Flash generator and flash head and extension cable with identification electronic device
US10841472B2 (en) 2016-04-08 2020-11-17 Rotolight Limited Lighting system and control thereof
EP4254928A1 (en) * 2022-03-31 2023-10-04 Infineon Technologies AG Apparatus and method for processing an image

Citations (1)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
WO2006082112A1 (en) * 2005-02-03 2006-08-10 Sony Ericsson Mobile Communications Ab An optical device

Patent Citations (1)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
WO2006082112A1 (en) * 2005-02-03 2006-08-10 Sony Ericsson Mobile Communications Ab An optical device

Non-Patent Citations (22)

* Cited by examiner, † Cited by third party
Title
AGRAWAL, A. ET AL.: "Removing photography artifacts using gradient projection and flash-exposure sampling. In SIGGRAPH '05", ACM SIGGRAPH, 2005, pages 828 - 835, XP002442303, DOI: doi:10.1145/1073204.1073269
BELHUMEUR, P. N.: "Multiplexing for optimal lighting", IEEE TRANS. PATTERN ANAL. MACH. INTELL., vol. 29, no. 8, 2007, pages 1339 - 1354, XP011185890, DOI: doi:10.1109/TPAMI.2007.1151
DEBEVEC, P. E.; MALIK, J.: "SIGGRAPH '97: Proceedings of the 24th annual conference on Computer graphics and interactive techniques", 1997, ACM PRESS/ADDISON-WESLEY PUBLISHING CO., article "Recovering high dynamic range radiance maps from photographs", pages: 369 - 378
DEBEVEC,P. ET AL.: "SIGGRAPH '00: Proceedings of the 27th annual conference on Computer graphics and interactive techniques", 2000, ACM PRESS/ADDISON-WESLEY PUBLISHING CO., article "Acquiring the reflectance field of a human face", pages: 145 - 156
EISEMANN, E.; DURAND, F., FLASH PHOTOGRAPHY ENHANCEMENT VIA INTRINSIC RELIGHTING, 2004
KOPF, J. ET AL.: "Joint bilateral up sampling", ACM TRANS. GRAPH., vol. 26, no. 3, 2007, pages 96
LEVOY, M. ET AL.: "Synthetic aperture confocal imaging", ACM TRANS. GRAPH., vol. 23, no. 3, 2004, pages 825 - 834
MOHAN, A. ET AL.: "Tabletop computed lighting for practical digital photography", IEEE TRANS. VIS. COMPUT. GRAPH., vol. 13, no. 4, 2007, pages 652 - 662, XP011190838, DOI: doi:10.1109/TVCG.2007.1008
NAYAR, S. K. ET AL.: "Fast separation of direct and global components of a scene using high frequency illumination", ACM TRANS. GRAPH., vol. 25, no. 3, 2006, pages 935 - 944
NAYAR, S. K.; BRANZOI, V.: "Adaptive dynamic range imaging", OPTICAL CONTROL OF PIXEL EXPOSURES OVER SPACE AND TIME. ICCV 02, 2003, pages 1168, XP010662526, DOI: doi:10.1109/ICCV.2003.1238624
NAYAR, S. K.; BRANZOI,V.; BOULT, T. E.: "Programmable imaging: Towards a flexible camera", INT. J. COMPUT. VISION, vol. 70, no. 1, 2006, pages 7 - 22, XP019410082, DOI: doi:10.1007/s11263-005-3102-6
OGGIER,T. ET AL.: "Ccd/cmos lock-in pixel for range imaging", CHALLENGES, LIMITATIONS AND STATE-OF-THE-ART
PETSCHNIGG, G. ET AL.: "Digital photography with flash and no-flash image pairs", ACM TRANS. GRAPH., vol. 23, no. 3, 2004, pages 664 - 672
RASKAR R ET AL: "SPATIALLY AUGMENTED REALITY", AUGMENTED REALITY. PLACING ARTIFICIAL OBJECTS IN REAL SCIENCES.PROCEEDINGS OF IWAR. INTERNATIONAL WORKSHOP ON AUGMENTED REALITY, XX, XX, 1 November 1998 (1998-11-01), pages 63 - 72, XP008018391 *
RASKAR R ET AL: "Table-top spatially-augmented realty: bringing physical models to life with projected imagery", AUGMENTED REALITY, 1999. (IWAR '99). PROCEEDINGS. 2ND IEEE AND ACM INT ERNATIONAL WORKSHOP ON SAN FRANCISCO, CA, USA 20-21 OCT. 1999, LOS ALAMITOS, CA, USA,IEEE COMPUT. SOC, US, 20 October 1999 (1999-10-20), pages 64 - 71, XP010358746, ISBN: 978-0-7695-0359-2 *
RASKAR, R. ET AL.: "Proceedings of the 12th Eurographics Workshop on Rendering Techniques", 2001, SPRINGER-VERLAG, article "Shader lamps: Animating real objects with image- based illumination", pages: 89 - 102
REINHARD, E. ET AL.: "Photographic tone reproduction for digital images", ACM TRANSACTIONS ON GRAPHICS, vol. 21, no. 3, 2002, pages 267 - 276
RUSINKIEWICZ, S.; HALL-HOLT, O.; LEVOY, M.: "Real- time 3d model acquisition. In SIGGRAPH '02: Proceedings of the 29th annual conference on Computer graphics and interactive techniques", ACM, 2002, pages 438 - 446
SEN,P. ET AL.: "Dual photography.", ACM TRANS. GRAPH., vol. 24, no. 3, 2005, pages 745 - 755
SHEN, T.-S.; MENQ, C.-H.: "Digital projector calibration for 3d active vision systems", TRANSACTIONS OF THE ASME, 2002, pages 124
TOMASI, C.; MANDUCHI, R.: "Bilateral filtering for gray and color images. In ICCV '98", PROCEEDINGS OF THE SIXTH INTERNATIONAL CONFERENCE ON COMPUTER VISION, IEEE COMPUTER SOCIETY, 1998, pages 839, XP000926378
ZHANG, L. ET AL.: "Shape and motion under varying illumination: Unifying structure from motion, photometric stereo, and multi- view stereo", ICCV '03: PROCEEDINGS OF THE NINTH IEEE INTERNATIONAL CONFERENCE ON COMPUTER VISION, IEEE COMPUTER SOCIETY, 2003, pages 618, XP010662419

Cited By (24)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
US8459811B2 (en) 2009-12-30 2013-06-11 Nokia Corporation Method and apparatus for illumination
WO2011080383A1 (en) * 2009-12-30 2011-07-07 Nokia Corporation Method and apparatus for illumination
US8514269B2 (en) 2010-03-26 2013-08-20 Microsoft Corporation De-aliasing depth images
US8885890B2 (en) 2010-05-07 2014-11-11 Microsoft Corporation Depth map confidence filtering
CN104012086A (en) * 2011-12-19 2014-08-27 思科技术公司 System and method for depth-guided image filtering in a video conference environment
CN104012086B (en) * 2011-12-19 2017-05-31 思科技术公司 The system and method for image filtering are oriented to for the depth of field in video conference environment
US9584790B2 (en) 2013-06-03 2017-02-28 Microsoft Technology Licensing, Llc Edge preserving depth filtering
JP2016092808A (en) * 2014-11-11 2016-05-23 キヤノン株式会社 Imaging apparatus and imaging apparatus control method
CN107430317A (en) * 2015-03-02 2017-12-01 保富图公司 Flash generator and flash head and extension cable with identification electronic device
CN107430317B (en) * 2015-03-02 2020-07-03 保富图公司 Flash generator and flash head and extension cable with identification electronics
CN108369364A (en) * 2015-11-10 2018-08-03 亮锐控股有限公司 Adaptive light source
CN113433775A (en) * 2015-11-10 2021-09-24 亮锐控股有限公司 Adaptive light source
US11988943B2 (en) 2015-11-10 2024-05-21 Lumileds Llc Adaptive light source
US10484616B2 (en) 2015-11-10 2019-11-19 Lumileds Llc Adaptive light source
US10602074B2 (en) 2015-11-10 2020-03-24 Lumileds Holding B.V. Adaptive light source
WO2017080875A1 (en) * 2015-11-10 2017-05-18 Koninklijke Philips N.V. Adaptive light source
US11803104B2 (en) 2015-11-10 2023-10-31 Lumileds Llc Adaptive light source
KR20180084865A (en) * 2015-11-10 2018-07-25 루미리즈 홀딩 비.브이. The adaptive light source
US11184552B2 (en) 2015-11-10 2021-11-23 Lumileds Llc Adaptive light source
US11223777B2 (en) 2015-11-10 2022-01-11 Lumileds Llc Adaptive light source
CN113433775B (en) * 2015-11-10 2023-01-17 亮锐控股有限公司 Adaptive light source
US10841472B2 (en) 2016-04-08 2020-11-17 Rotolight Limited Lighting system and control thereof
EP3229558A1 (en) * 2016-04-08 2017-10-11 Rotolight Limited Lighting system and control thereof
EP4254928A1 (en) * 2022-03-31 2023-10-04 Infineon Technologies AG Apparatus and method for processing an image

Similar Documents

Publication Publication Date Title
EP2321697B1 (en) Spatially adaptive photographic flash unit
EP2128693A1 (en) Spatially Adaptive Photographic Flash Unit
Bimber et al. The visual computing of projector-camera systems
TWI668997B (en) Image device for generating panorama depth images and related image device
KR100810844B1 (en) Method and apparatus for optimizing capture device settings through depth information
JP5871862B2 (en) Image blur based on 3D depth information
Bimber et al. Superimposing pictorial artwork with projected imagery
JP3640156B2 (en) Pointed position detection system and method, presentation system, and information storage medium
US8654234B2 (en) Bi-directional screen
KR101696630B1 (en) Lighting system and method for image and object enhancement
CN109714539B (en) Image acquisition method and device based on gesture recognition and electronic equipment
EP3381015B1 (en) Systems and methods for forming three-dimensional models of objects
Bimber et al. Closed-loop feedback illumination for optical inverse tone-mapping in light microscopy
Wang et al. A context-aware light source
Adelsberger et al. Spatially adaptive photographic flash
US9075482B2 (en) Optical touch display
JP2004326188A (en) Large screen touch panel system and search/display system
CN105208286A (en) Photographing method and device for simulating low-speed shutter
CN108351576A (en) Method for shooting image by mobile device
WO2010023348A1 (en) Interactive displays
JP2009164949A (en) Image processor, image processing method, and program
JP2011254268A (en) Imaging apparatus, imaging method, and program
CN112954206B (en) Virtual makeup display method, system, device and storage medium thereof
WO2022099322A1 (en) A dark flash normal camera
Hudon Active illumination for high speed image acquisition and recovery of shape and albedo

Legal Events

Date Code Title Description
PUAI Public reference made under article 153(3) epc to a published international application that has entered the european phase

Free format text: ORIGINAL CODE: 0009012

AK Designated contracting states

Kind code of ref document: A1

Designated state(s): AT BE BG CH CY CZ DE DK EE ES FI FR GB GR HR HU IE IS IT LI LT LU LV MC MT NL NO PL PT RO SE SI SK TR

AX Request for extension of the european patent

Extension state: AL BA MK RS

AKY No designation fees paid
STAA Information on the status of an ep patent application or granted ep patent

Free format text: STATUS: THE APPLICATION IS DEEMED TO BE WITHDRAWN

18D Application deemed to be withdrawn

Effective date: 20100603

REG Reference to a national code

Ref country code: DE

Ref legal event code: 8566