EP3164992A1 - Method and apparatus for image capturing and simultaneous depth extraction - Google Patents

Method and apparatus for image capturing and simultaneous depth extraction

Info

Publication number
EP3164992A1
EP3164992A1 EP15814578.9A EP15814578A EP3164992A1 EP 3164992 A1 EP3164992 A1 EP 3164992A1 EP 15814578 A EP15814578 A EP 15814578A EP 3164992 A1 EP3164992 A1 EP 3164992A1
Authority
EP
European Patent Office
Prior art keywords
spectrum
image
coded aperture
disparity
basis
Prior art date
Legal status (The legal status is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the status listed.)
Withdrawn
Application number
EP15814578.9A
Other languages
German (de)
French (fr)
Other versions
EP3164992A4 (en)
Inventor
Vladimir Petrovich PARAMONOV
Ivan Andreevich PANCHENKO
Victor Valentinovich Bucha
Current Assignee (The listed assignees may be inaccurate. Google has not performed a legal analysis and makes no representation or warranty as to the accuracy of the list.)
Samsung Electronics Co Ltd
Original Assignee
Samsung Electronics Co Ltd
Priority date (The priority date is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the date listed.)
Filing date
Publication date
Application filed by Samsung Electronics Co Ltd
Priority claimed from PCT/KR2015/006966 (WO2016003253A1)
Publication of EP3164992A1
Publication of EP3164992A4
Current legal status: Withdrawn

Classifications

    • H ELECTRICITY
    • H04 ELECTRIC COMMUNICATION TECHNIQUE
    • H04N PICTORIAL COMMUNICATION, e.g. TELEVISION
    • H04N13/00 Stereoscopic video systems; Multi-view video systems; Details thereof
    • H04N13/20 Image signal generators
    • H04N13/204 Image signal generators using stereoscopic image cameras
    • H04N13/254 Image signal generators using stereoscopic image cameras in combination with electromagnetic radiation sources for illuminating objects
    • G PHYSICS
    • G06 COMPUTING; CALCULATING OR COUNTING
    • G06T IMAGE DATA PROCESSING OR GENERATION, IN GENERAL
    • G06T7/00 Image analysis
    • G06T7/50 Depth or shape recovery
    • G06T7/55 Depth or shape recovery from multiple images
    • H ELECTRICITY
    • H04 ELECTRIC COMMUNICATION TECHNIQUE
    • H04N PICTORIAL COMMUNICATION, e.g. TELEVISION
    • H04N13/00 Stereoscopic video systems; Multi-view video systems; Details thereof
    • H04N13/20 Image signal generators
    • H04N13/204 Image signal generators using stereoscopic image cameras
    • H04N13/207 Image signal generators using stereoscopic image cameras using a single 2D image sensor
    • H04N13/214 Image signal generators using stereoscopic image cameras using a single 2D image sensor using spectral multiplexing
    • H ELECTRICITY
    • H04 ELECTRIC COMMUNICATION TECHNIQUE
    • H04N PICTORIAL COMMUNICATION, e.g. TELEVISION
    • H04N13/00 Stereoscopic video systems; Multi-view video systems; Details thereof
    • H04N13/20 Image signal generators
    • H04N13/257 Colour aspects
    • H ELECTRICITY
    • H04 ELECTRIC COMMUNICATION TECHNIQUE
    • H04N PICTORIAL COMMUNICATION, e.g. TELEVISION
    • H04N13/00 Stereoscopic video systems; Multi-view video systems; Details thereof
    • H04N13/20 Image signal generators
    • H04N13/271 Image signal generators wherein the generated image signals comprise depth maps or disparity maps
    • H ELECTRICITY
    • H04 ELECTRIC COMMUNICATION TECHNIQUE
    • H04N PICTORIAL COMMUNICATION, e.g. TELEVISION
    • H04N13/00 Stereoscopic video systems; Multi-view video systems; Details thereof
    • H04N2013/0074 Stereoscopic image analysis
    • H04N2013/0081 Depth or disparity estimation from stereoscopic image signals

Definitions

  • Apparatuses and methods consistent with exemplary embodiments relate to computational photography, and more particularly, to light field capturing and processing.
  • One of the main applications of light field photography is in extraction of image depth information.
  • Examples of apparatuses for light field capturing or image depth information extraction may include a stereo camera, a plenoptic camera, a camera with a binary coded aperture, and a camera with a color coded aperture.
  • these apparatuses may require additional space, increase costs of cameras, or cause a reduction in optical efficiency.
  • a system for image capturing and depth extraction including: a lens system; a spectrum coded aperture including at least two regions that pass spectrum channels of an incident light field which are different from each other; and a sensor configured to record the at least two spectrum channels to form an image captured in a sensor basis; and a data processor configured to convert the image captured in the sensor basis into an image of a processing basis, extract a disparity from the image of the processing basis, and convert the disparity into depth information.
  • FIG. 1 is a diagram of a depth extraction/image restoration apparatus according to an exemplary embodiment
  • FIGS. 2a to 2f are diagrams of spectrum coded apertures according to exemplary embodiments
  • FIGS. 3a to 3i are diagrams for describing a channel shift
  • FIG. 4 is a high-level outline diagram of a depth information extraction/image restoration method according to an exemplary embodiment
  • FIG. 5 is a diagram for describing a parabola fitting according to an exemplary embodiment.
  • FIGS. 6a to 6d are diagrams for describing a depth extraction/image restoration apparatus according to an exemplary embodiment.
  • a system for image capturing and depth extraction including: a lens system; a spectrum coded aperture including at least two regions that pass spectrum channels of an incident light field which are different from each other; and a sensor configured to record the at least two spectrum channels to form an image captured in a sensor basis; and a data processor configured to convert the image captured in the sensor basis into an image of a processing basis, extract a disparity from the image of the processing basis, and convert the disparity into depth information.
  • the different spectrum channels may form a basis of the spectrum coded aperture.
  • the processing basis may be different from the sensor basis and the basis of the spectrum coded aperture.
  • the spectrum coded aperture may have three regions, and the three regions may include a transparent region in a central portion, and two regions having spectrum bandwidths respectively corresponding to yellow and cyan.
  • the processing basis may include three vectors, and the three vectors may include a vector corresponding to yellow, a vector corresponding to cyan, and a vector perpendicular to the two vectors.
  • the spectrum coded aperture may include two regions having spectrum bandwidths respectively corresponding to yellow and cyan.
  • the processing basis may include three vectors, and the three vectors may include a vector corresponding to yellow, a vector corresponding to cyan, and a vector perpendicular to the two vectors.
  • the spectrum coded aperture may include three congruent regions having spectrum bandwidths respectively corresponding to yellow, cyan, and magenta.
  • the processing basis may include vectors corresponding to yellow, cyan, and magenta.
  • the spectrum coded aperture may include three non-congruent regions having spectrum bandwidths respectively corresponding to yellow, cyan, and magenta.
  • the processing basis may include vectors respectively corresponding to yellow, cyan, and magenta.
  • the spectrum coded aperture may have a smooth bandwidth change over an aperture region.
  • the spectrum coded aperture may be fixed to the lens system.
  • the spectrum coded aperture may be attachable to and detachable from the lens system.
  • the spectrum coded aperture may be moved out of an optical train so that it does not participate in the image formation.
  • the captured image may be an image selected from a video sequence.
  • the spectrum coded aperture may be inserted into the lens system for the image selected from the video sequence.
  • the spectrum coded aperture may be inserted into an aperture stop of the lens system.
  • the lens system may include a single lens and the spectrum coded aperture may be located in the lens.
  • the spectrum coded aperture may correct a previous video image of the video sequence acquired by the sensor.
  • the spectrum coded aperture may have a combination of an opaque region and a congruent region, and the congruent region may be transparent or transmit ultraviolet light, infrared light, or visible light.
  • the spectrum coded aperture may have a combination of an opaque region and a non-congruent region, and the non-congruent region may be transparent or transmit ultraviolet light, infrared light, or visible light.
  • the spectrum coded aperture may be a spatial light modulator (SLM).
  • the data processor may include a preprocessing unit configured to convert the captured image, a disparity estimation unit configured to extract the disparity, and a conversion unit configured to convert the disparity into the depth information.
  • the data processor may further include an image restoration unit configured to restore the captured image based on the extracted disparity.
  • a method of image capturing and depth extraction including: recording at least two shifted spectrum channels of a light field to form an image captured from a video; converting the captured image into an image of a processing basis; estimating a disparity based on a correlation between pixels of the spectrum channels in the processing basis to extract a disparity map; restoring the captured image based on the extracted disparity map; and converting the disparity map into a depth map.
  • the estimating of the disparity may include: generating candidate images having respective shifts in the spectrum channels; computing a matching cost for the candidate images in the spectrum channels; propagating the matching cost into low-textured regions of the candidate images; and estimating a matching cost with sub-pixel accuracy based on the propagated matching cost.
  • the correlation between the pixels of the spectrum channels used for the disparity estimation may include a correlation metric computed in a sparse moving window.
  • the correlation between the pixels of the spectrum channels used for the disparity estimation may be computed by using at least one stereo matching algorithm.
  • the computing of the correlation by using the stereo matching algorithm may use a sum of absolute differences (SAD), a normalized cross correlation (NCC), or a Laplacian image contrast (LIC).
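Two of the matching metrics named above, SAD and NCC, can be sketched on one-dimensional signals as follows. This is a minimal illustration under stated assumptions, not the implementation of the embodiment: the function names `sad`, `ncc`, and `best_shift`, the integer search range, and the toy channel profiles are all introduced here for illustration only.

```python
import math

def sad(a, b):
    """Sum of absolute differences; lower means a closer match."""
    return sum(abs(x - y) for x, y in zip(a, b))

def ncc(a, b):
    """Normalized cross correlation in [-1, 1]; higher means a closer match."""
    mean_a, mean_b = sum(a) / len(a), sum(b) / len(b)
    num = sum((x - mean_a) * (y - mean_b) for x, y in zip(a, b))
    den = math.sqrt(sum((x - mean_a) ** 2 for x in a) *
                    sum((y - mean_b) ** 2 for y in b))
    return num / den if den > 0 else 0.0

def best_shift(ref, other, max_shift):
    """Integer shift d (|d| <= max_shift < len(ref)) minimizing the mean SAD
    between ref[i] and other[i + d] over the overlapping samples."""
    n = len(ref)
    best_d, best_cost = 0, float("inf")
    for d in range(-max_shift, max_shift + 1):
        lo, hi = max(0, -d), min(n, n - d)  # valid i so that 0 <= i + d < n
        cost = sad(ref[lo:hi], other[lo + d:hi + d]) / (hi - lo)
        if cost < best_cost:
            best_d, best_cost = d, cost
    return best_d

# Hypothetical intensity profiles of one edge seen in two spectrum channels.
channel_a = [0, 0, 1, 5, 9, 5, 1, 0, 0, 0]
channel_b = [0, 0] + channel_a[:-2]  # same profile displaced by two samples
shift = best_shift(channel_a, channel_b, 3)
```

SAD is cheap but sensitive to brightness differences between channels, whereas NCC removes the mean and scale of each window, which may matter when the channels pass different spectrum bands.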
  • the correlation metric may include a fast Fourier transform (FFT).
  • the correlation metric may include a recursive exponential filter (REF).
  • the restoring of the captured image may include performing image blurring.
  • the restoring of the captured image may include performing a spectrum channel alignment in the processing basis.
  • a mobile device for image capturing and depth extraction in ultraviolet light, infrared light, or visible light including: a lens system; at least one spectrum coded aperture including at least two regions that pass spectrum channels of an incident light field which are different from each other; a sensor configured to record the at least two spectrum channels to form an image captured in a sensor basis; and a coded aperture fixture configured to move at least one spectrum coded aperture relatively with respect to the lens system; and a data processor configured to convert the image captured in the sensor basis into an image of a processing basis, extract a disparity from the image of the processing basis, and convert the disparity into depth information.
  • the coded aperture fixture may be configured to switch between at least two spectrum coded apertures in an optical train.
  • the coded aperture fixture may be configured to shift all the spectrum coded apertures out of the optical train.
  • the coded aperture fixture may be inserted into an aperture stop.
  • the spectrum coded aperture may have a combination of an opaque region and a congruent region, and the congruent region may be transparent or transmit ultraviolet light, infrared light, or visible light.
  • the spectrum coded aperture may have a combination of an opaque region and a non-congruent region, and the non-congruent region may be transparent or transmit ultraviolet light, infrared light, or visible light.
  • an apparatus for image capturing including: a lens system; at least two spectrum coded apertures including a first aperture and a second aperture which have different characteristics of optical efficiency and depth discrimination from each other; a coded aperture fixture adapted to dispose the first aperture in front of the lens system; and a data processor configured to obtain depth information of an image captured through the first spectrum coded aperture, and control the coded aperture fixture to determine whether to switch the first aperture to the second aperture based on the depth information.
  • the first aperture may include a transparent region placed in the center of the first aperture and two regions separated by the transparent region.
  • the two regions pass different color spectrums, respectively.
  • the two regions may pass a yellow spectrum and a cyan spectrum, respectively.
  • the second aperture may include two equally divided regions which may pass yellow and cyan spectrums, respectively.
  • FIG. 1 is a diagram of a depth extraction/image restoration apparatus 101 according to an exemplary embodiment.
  • the depth extraction/image restoration apparatus 101 may include a camera 102 and a data processor 103.
  • the camera 102 may include an optical lens (objective lens) 104, a spectrum coded aperture 105, and a sensor 106.
  • the spectrum coded aperture 105 may be inserted into an optical system which is constituted by the combination of the lens 104, the sensor 106, and other optical parts.
  • the spectrum coded aperture 105 may be placed in an optical path that a ray of light follows through the optical system.
  • the spectrum coded aperture 105 may be placed in a diaphragm plane.
  • the sensor 106 may be configured to discriminate different spectrum bandwidths from one another.
  • the sensor 106 may be a sensor covered with a mosaic color/spectrum filter array, or a color stacked photodiode sensor.
  • the data processor 103 may include a preprocessing unit 108, a disparity estimation unit 109, an image restoration unit 110, and a disparity-to-depth conversion unit 111.
  • the data processor 103 may receive a raw image 107 captured by the camera 102.
  • the preprocessing unit 108 may convert the captured image 107 from a sensor basis to a processing basis in which a spectrum coded aperture filter may not be present.
  • the disparity estimation unit 109 may perform disparity estimation.
  • the image restoration unit 110 may perform image restoration.
  • the disparity-to-depth conversion unit 111 may perform disparity-to-depth conversion based on optical system parameters.
  • the spectrum coded aperture 105 may be divided into sub-regions that respectively have spectrum passbands.
  • the number, geometric structures, and spectrum passbands of the sub-regions may be changed according to the application's requirements for optical efficiency, depth map quality, and restored color image quality. Some of them are illustrated in FIGS. 2a to 2f.
  • FIGS. 2a to 2f are diagrams illustrating patterns of various spectrum coded apertures having a tradeoff relationship among the optical efficiency, the depth map, and the color image restoration image quality.
  • For light field coding, spectrum filters may be used. Examples of the spectrum filters may include a visible color filter, an infrared/ultraviolet filter, and a multi-pass filter having two or more passbands.
  • Main characteristics of a spectrum coded aperture are optical efficiency, depth discrimination ability, and color image restoration image quality.
  • the highest depth discrimination index may be obtained from a geometric structure of a spectrum coded aperture having the longest distance between the centers of aperture sub-regions corresponding to respective optical spectrum bands.
  • FIG. 2a shows an aperture pattern that has a relatively long distance between the centers of its sub-regions and a relatively small filter size in the sub-regions. Consequently, the opaque region of the coded aperture may be increased so that the optical system has a reduced optical efficiency. If the aperture design is modified to enhance optical efficiency as shown in FIG. 2b, the accuracy of the extracted disparity may typically deteriorate.
  • FIG. 2c shows a geometric structure of an aperture having a cyan filter and a yellow filter on its two halves.
  • FIG. 2d shows a geometric structure of an aperture having a transparent sub-region, a cyan filter, a yellow filter, and a green filter.
  • the yellow filter may have a passband including green and red light spectrums.
  • the cyan filter may have a passband including green and blue light spectrums.
  • the transparent region may not filter incoming light.
  • the green channel may not be distorted by these filters and may be used as a reference in an image restoration process.
  • the aperture structure in FIG. 2c may have a better depth map.
  • the aperture structure of FIG. 2d may have a superior optical efficiency to the aperture structure of FIG. 2c.
  • FIG. 2a shows an aperture having a circular filter and an opaque region, which may be used to obtain a high-quality depth map image when light is excessive.
  • the aperture structure of FIG. 2a may compensate for excessive light directed to the camera 102.
  • An aperture structure having an infrared filter and an ultraviolet filter on its two halves, arranged as shown in FIG. 2c, may be a fully opened aperture, may have the same optical efficiency, and may have excellent potential with respect to depth extraction.
  • FIG. 2e shows a spectrum coded aperture having three or more spectrum sub-regions with a hive arrangement
  • FIG. 2f illustrates a spectrum coded aperture having a smooth bandwidth change over an aperture region.
  • the light field, which is modified by the spectrum coded aperture 105, may be input to the image sensor 106, which generates the captured raw image 107.
  • the light field having passed through the spectrum coded aperture 105 may be coded. That is, the light field may be divided into different spectrum parts by passing through corresponding aperture sub-regions. Therefore, different views of the same scene may be extracted from a single captured image by dividing the single captured image into spectrum channels corresponding to the sub-regions of the spectrum coded aperture.
  • FIG. 3a illustrates the captured image 107 obtained by a sensor 106 that is capable of discriminating the corresponding spectrum bandwidth with respect to the spectrum coded aperture described above with reference to FIG. 2b.
  • owing to the presence of the spectrum coded aperture, the position of a defocused object 302 in FIG. 3a may be shifted in each spectrum channel according to the corresponding spectrum filter position, as shown in FIGS. 3d, 3e, and 3f, as compared to a focused object 301 in FIG. 3a.
  • Such views may be used for extracting a disparity map and restoring the captured image 107.
  • the results of image deblurring with respect to the spectrum channels are illustrated in FIGS. 3g, 3h, and 3i.
  • a deblurred color image is illustrated in FIG. 3b.
  • a deblurred image (restored image) aligned in the spectrum channel is illustrated in FIG. 3c.
  • FIG. 4 is a high-level outline diagram of the data processor 103.
  • a system input may be the raw image 107 captured by the camera 102.
  • the captured image 107 may be preprocessed by denoising and demosaicing techniques and be converted from a sensor spectrum basis to a processing basis.
  • the processing basis is not necessarily the spectrum filter basis. The input to the conversion is the set of image color channels acquired by the optical system sensor.
  • a conversion matrix needs to be estimated first.
  • as an example, the camera 102 may use the aperture structure having a cyan filter and a yellow filter, as described above with reference to FIG. 2c, and a red, green, blue (RGB) mosaic color filter array.
  • a third basis vector is defined as the vector product of the cyan and yellow basis vectors.
  • the sensor basis vectors are respectively a red basis, a green basis, and a blue basis for the camera sensor 106.
  • in the sensor spectrum basis, any observed color w may be decomposed by the aperture filter responses.
  • the conversion matrix may be inverted to obtain the image channels in the processing basis.
  • an inverse conversion matrix (a left inverse matrix and a right inverse matrix) may be used.
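The basis conversion described above can be sketched for the cyan/yellow aperture and an RGB sensor. This is a minimal sketch under assumptions that are not in the source: the filter responses are idealized as unit passbands in (R, G, B) order (cyan passing green and blue, yellow passing red and green), and the names `CYAN`, `YELLOW`, `THIRD`, `to_processing_basis`, and `to_sensor_basis` are hypothetical. A real system would estimate the conversion matrix from measured filter responses.

```python
def cross(u, v):
    """Vector product of two 3-vectors."""
    return (u[1] * v[2] - u[2] * v[1],
            u[2] * v[0] - u[0] * v[2],
            u[0] * v[1] - u[1] * v[0])

def dot(u, v):
    return sum(x * y for x, y in zip(u, v))

# Idealized (assumed) filter responses in the sensor basis (R, G, B).
CYAN = (0.0, 1.0, 1.0)       # passes green and blue
YELLOW = (1.0, 1.0, 0.0)     # passes red and green
THIRD = cross(CYAN, YELLOW)  # third basis vector, perpendicular to both

def to_processing_basis(rgb):
    """Coefficients (cyan, yellow, third) of an RGB color, i.e. the inverse
    of the matrix whose columns are CYAN, YELLOW, and THIRD."""
    det = dot(CYAN, cross(YELLOW, THIRD))
    inverse_rows = (cross(YELLOW, THIRD), cross(THIRD, CYAN), cross(CYAN, YELLOW))
    return tuple(dot(row, rgb) / det for row in inverse_rows)

def to_sensor_basis(coeffs):
    """Recompose an RGB color from processing-basis coefficients."""
    c, y, t = coeffs
    return tuple(c * CYAN[k] + y * YELLOW[k] + t * THIRD[k] for k in range(3))

green_in_processing_basis = to_processing_basis((0.0, 1.0, 0.0))
```

Because the conversion matrix here is square and non-singular, its ordinary inverse suffices; the left or right inverse mentioned above would be the counterpart when the two bases have different numbers of vectors.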
  • a disparity may be estimated for every pixel of the image. The matching cost for the disparity estimation may use a conventional cross-correlation of the shifted spectrum channels.
  • a generalized mutual correlation metric may be used in the disparity estimation unit 109 so as to process an arbitrary number of spectrum channels. The input is a set of n views of the same scene, acquired in n spectrum channels from slightly different viewpoints, and each view is an image frame.
  • a conventional correlation matrix may be formed from this set of views and a disparity value d.
  • the determinant of the correlation matrix is a good measure of the mutual correlation of the views.
  • when all the views are completely correlated, the correlation matrix is a singular matrix and the determinant thereof is 0.
  • when the views are completely uncorrelated, the determinant of the correlation matrix is 1.
  • to estimate the disparity with this metric, the disparity value d corresponding to the least value of the determinant needs to be found at each pixel of the image.
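The determinant-based metric can be sketched on one-dimensional signals as follows. This is an illustrative sketch only: the source's own matrix definition did not survive extraction, so the code assumes a matrix of pairwise Pearson correlation coefficients; the names `pearson`, `determinant`, `correlation_determinant`, and `estimate_disparity`, the per-channel shift directions, and the toy signal are assumptions introduced here.

```python
import math

def pearson(a, b):
    """Pearson correlation coefficient of two equal-length windows."""
    mean_a, mean_b = sum(a) / len(a), sum(b) / len(b)
    num = sum((x - mean_a) * (y - mean_b) for x, y in zip(a, b))
    den = math.sqrt(sum((x - mean_a) ** 2 for x in a) *
                    sum((y - mean_b) ** 2 for y in b))
    return num / den if den > 0 else 0.0

def determinant(matrix):
    """Determinant by Gaussian elimination with partial pivoting."""
    m = [row[:] for row in matrix]
    n, det = len(m), 1.0
    for c in range(n):
        p = max(range(c, n), key=lambda r: abs(m[r][c]))
        if abs(m[p][c]) < 1e-12:
            return 0.0  # numerically singular
        if p != c:
            m[c], m[p] = m[p], m[c]
            det = -det
        det *= m[c][c]
        for r in range(c + 1, n):
            f = m[r][c] / m[c][c]
            for k in range(c, n):
                m[r][k] -= f * m[c][k]
    return det

def correlation_determinant(views):
    """0 for fully correlated views, 1 for fully uncorrelated views."""
    n = len(views)
    m = [[1.0 if i == j else pearson(views[i], views[j]) for j in range(n)]
         for i in range(n)]
    return determinant(m)

def estimate_disparity(views, directions, max_d, margin):
    """Disparity d minimizing the determinant when channel i is read at an
    offset of d * directions[i]; margin must be >= max_d * max|direction|."""
    n = len(views[0])
    best_d, best_cost = 0, float("inf")
    for d in range(-max_d, max_d + 1):
        windows = [v[margin + d * s:n - margin + d * s]
                   for v, s in zip(views, directions)]
        cost = correlation_determinant(windows)
        if cost < best_cost:
            best_d, best_cost = d, cost
    return best_d

# Toy scene line and three channels displaced by -2, 0, and +2 samples.
base = [0, 1, 4, 2, 7, 3, 9, 5, 1, 6, 2, 8, 0, 3, 5, 1]
views = [[base[(k - 2 * s) % len(base)] for k in range(len(base))]
         for s in (-1, 0, 1)]
found = estimate_disparity(views, (-1, 0, 1), max_d=3, margin=3)
```

At the correct disparity the three windows coincide, every pairwise correlation is 1, and the matrix becomes singular, which is why the minimum of the determinant marks the best match.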
  • other operators for matching cost computation may also be used.
  • such operators may include conventional stereo matching metrics, Laplacian contrast metrics, and feature-based metrics.
  • an exponential moving window may be used because it is consistent with the sparse gradient prior of natural images and propagates the matching cost into low-textured regions.
  • exponential kernel filtering may be efficiently computed by using a recursive convolution in a spectrum domain.
  • This recursive form may also be used for computing an efficient approximation of a joint bilateral filter for propagating disparity information over low-textured regions.
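A recursive exponential filter of the kind referred to above can be sketched in one dimension. This is a minimal sketch under stated assumptions, not the source's formulation: it works directly on a 1-D cost signal rather than in a spectrum domain, and the names `exponential_filter` and `propagate_cost` and the decay parameter `a` are hypothetical.

```python
def exponential_filter(x, a):
    """Convolve x with the symmetric kernel a**|k| (0 <= a < 1) using one
    causal and one anti-causal recursive pass instead of a direct sum."""
    n = len(x)
    fwd, bwd = [0.0] * n, [0.0] * n
    acc = 0.0
    for i in range(n):                 # causal pass: sum over k <= i
        acc = x[i] + a * acc
        fwd[i] = acc
    acc = 0.0
    for i in range(n - 1, -1, -1):     # anti-causal pass: sum over k >= i
        acc = x[i] + a * acc
        bwd[i] = acc
    # x[i] is counted in both passes, so subtract it once.
    return [f + b - v for f, b, v in zip(fwd, bwd, x)]

def propagate_cost(cost, a):
    """Exponentially weighted average of a matching cost, normalized so
    that a constant cost is left unchanged, including at the borders."""
    num = exponential_filter(cost, a)
    den = exponential_filter([1.0] * len(cost), a)
    return [p / q for p, q in zip(num, den)]

impulse_response = exponential_filter([0, 0, 1, 0, 0], 0.5)
```

The cost of the two passes is linear in the signal length and independent of the effective window size, which is the practical appeal of the recursive form.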
  • Sub-pixel estimation may be performed by using a parabola fitting algorithm as shown in FIG. 5.
  • in the parabola fitting, three given points may be taken into consideration. The middle point may be the disparity estimated at integer accuracy, and the other two may be set as the previous argument and the next argument, respectively.
  • the argument of the maximum of the unique parabola passing through these three points may be computed analytically.
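The sub-pixel refinement can be sketched with the standard three-point parabola vertex formula. The source's own formula is not reproduced in this text, so the expression below is the textbook one and the names `parabola_vertex`, `c_prev`, `c_mid`, and `c_next` are assumptions.

```python
def parabola_vertex(d, c_prev, c_mid, c_next):
    """Argument of the extremum of the unique parabola through
    (d - 1, c_prev), (d, c_mid), and (d + 1, c_next).

    With c(x) = A*x**2 + B*x + C centred on d:
        c_prev - c_next            = -2*B
        c_prev - 2*c_mid + c_next  =  2*A
    so the vertex lies at d - B / (2*A).
    """
    curvature = c_prev - 2.0 * c_mid + c_next
    if curvature == 0.0:
        return float(d)  # the three points are collinear: no refinement
    return d + (c_prev - c_next) / (2.0 * curvature)

# Samples of the score -(x - 3.3)**2 at x = 2, 3, 4; the maximum is at 3.3.
refined = parabola_vertex(3, -1.69, -0.09, -0.49)
```

The same expression locates a minimum when a cost is minimized rather than a score maximized, since the vertex does not depend on the sign of the curvature.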
  • the image restoration unit 110 may perform preliminary image restoration based on the disparity estimation.
  • the captured image of FIG. 3a may be deblurred as shown in FIG. 3b.
  • a color alignment of the deblurred image may be performed as shown in FIG. 3c.
  • FIG. 3a illustrates an example of the image captured by the system.
  • FIG. 2b illustrates a geometric structure of a spectrum coded aperture.
  • the system may be focused on one object 301 and another object 302 may be defocused.
  • the defocused object 302 captured by the camera 102 may cause a spectrum channel misalignment on a photo array, in addition to the blur that a conventional imaging system would also produce, as in the blurred images 305, 306, and 307 shown in FIG. 3d, FIG. 3e, and FIG. 3f.
  • the image deblurring may be performed based on a deconvolution technology and be applied to images corresponding to different disparity values. For example, while the focused object 301 does not require the deblurring, the images 305, 306, and 307 of the defocused object 302 in the respective spectrum channels are deblurred with respect to the disparity levels thereof.
  • the deblurred image of FIG. 3b is still misaligned across the spectrum channels, as shown in FIGS. 3g, 3h, and 3i.
  • Misalignment vectors respectively corresponding to the spectrum channels may be estimated at the respective positions of the captured image 302.
  • a restored image 304 may be acquired by aligning the spectrum channels based on the misalignment vectors.
  • here, i is the number (index) of the spectrum channel, and the two components of each misalignment vector are its projections in an x-axis direction and a y-axis direction, respectively.
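The channel alignment step can be sketched as follows. This is a simplified illustration, not the source's procedure: it assumes a single integer misalignment vector per channel, whereas the text above estimates vectors at the respective positions of the image, and the names `align_channel` and `align_channels` are hypothetical.

```python
def align_channel(channel, dx, dy, fill=0):
    """Undo a misalignment (dx, dy): output pixel (x, y) is read from
    (x + dx, y + dy) of the misaligned channel; pixels outside get `fill`."""
    height, width = len(channel), len(channel[0])
    out = [[fill] * width for _ in range(height)]
    for y in range(height):
        for x in range(width):
            sx, sy = x + dx, y + dy
            if 0 <= sx < width and 0 <= sy < height:
                out[y][x] = channel[sy][sx]
    return out

def align_channels(channels, vectors):
    """Apply one misalignment vector (dx_i, dy_i) to each channel i."""
    return [align_channel(c, dx, dy) for c, (dx, dy) in zip(channels, vectors)]

# A point that should sit at the centre but appears one pixel to the right.
misaligned = [[0, 0, 0],
              [0, 0, 7],
              [0, 0, 0]]
aligned = align_channel(misaligned, dx=1, dy=0)
```

A per-pixel variant would look up (dx, dy) from the disparity map at each position instead of using one vector for the whole channel.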
  • the image may be converted from a spectrum filter basis to a display device basis.
  • the imaging system has a vignetting effect that results in a reduction of an image’s brightness at the periphery of the image, as compared to the center of the image.
  • the vignetting effect may be alleviated mathematically by applying an unvignetting coefficient to the captured image.
  • the unvignetting coefficient needs to be independently computed with respect to each spectrum channel. This process may be performed by the image restoration unit 110.
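One way to obtain and apply per-channel unvignetting coefficients is sketched below. This is an assumption-laden sketch, not the source's equation: it supposes a flat-field calibration image (a uniformly lit target with no zero pixels) is available for each spectrum channel, and the names `unvignetting_coefficients` and `unvignette` are hypothetical.

```python
def unvignetting_coefficients(flat_field):
    """Per-pixel gain that maps a flat-field image of one spectrum channel
    to the brightness measured at its centre pixel."""
    rows, cols = len(flat_field), len(flat_field[0])
    center = flat_field[rows // 2][cols // 2]
    return [[center / value for value in row] for row in flat_field]

def unvignette(channel, coefficients):
    """Multiply each pixel of a channel by its unvignetting coefficient."""
    return [[pixel * gain for pixel, gain in zip(pixel_row, gain_row)]
            for pixel_row, gain_row in zip(channel, coefficients)]

# Toy one-row channel whose borders receive half of the central brightness.
flat = [[0.5, 1.0, 0.5]]
gains = unvignetting_coefficients(flat)
corrected = unvignette([[2.0, 4.0, 2.0]], gains)
```

Computing the gains separately for each channel follows the requirement above, since the sub-regions of a spectrum coded aperture sit at different positions in the aperture and need not vignette identically.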
  • a final image refinement process may be used to reduce artifacts caused by inaccurate disparity estimation. Techniques based on human visual perception (for example, bilateral filtering, median filtering, or the like) and natural image priors (for example, a sparse gradient prior, a color lines prior, or the like) may be used.
  • the disparity-to-depth conversion unit 111 may convert the disparity into a depth map 114 with respect to a single lens optical system by using optical system parameters 112 generalized in a thin lens formula.
  • For a complex objective lens, this formula may depend on the design of the optical system.
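A disparity-to-depth conversion for an idealized single thin lens can be sketched as follows. The source's formula is not reproduced in this text, so the relation below is derived here from the thin lens formula under stated assumptions: two aperture sub-regions whose centers are a distance `baseline` apart, a sensor focused at `focus_distance`, all lengths in the same unit, and a sign convention in which objects nearer than the focus plane have positive disparity. All function and parameter names are hypothetical.

```python
def sensor_distance(focal_length, focus_distance):
    """Lens-to-sensor distance s from the thin lens formula 1/f = 1/z_f + 1/s."""
    return 1.0 / (1.0 / focal_length - 1.0 / focus_distance)

def depth_to_disparity(depth, focal_length, focus_distance, baseline, pixel_pitch):
    """Disparity in pixels between the two sub-aperture views:
    d = baseline * s * (1/z - 1/z_f) / pixel_pitch."""
    s = sensor_distance(focal_length, focus_distance)
    return baseline * s * (1.0 / depth - 1.0 / focus_distance) / pixel_pitch

def disparity_to_depth(disparity_px, focal_length, focus_distance, baseline, pixel_pitch):
    """Inverse of depth_to_disparity; valid while the inverse depth is positive."""
    s = sensor_distance(focal_length, focus_distance)
    inverse_depth = 1.0 / focus_distance + disparity_px * pixel_pitch / (baseline * s)
    return 1.0 / inverse_depth

# Hypothetical phone-like parameters, in metres.
params = dict(focal_length=0.004, focus_distance=1.0, baseline=0.001, pixel_pitch=1.4e-6)
d_half_metre = depth_to_disparity(0.5, **params)
z_back = disparity_to_depth(d_half_metre, **params)
```

Zero disparity maps to the focus distance, and the disparity grows with the baseline, which is consistent with the remark above that a longer distance between sub-region centers improves depth discrimination.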
  • the above-described image capturing apparatus may be extended for performing a temporal coding and a spectral coding.
  • the temporal coding may be performed while moving the spectrum coded aperture with respect to the image capturing apparatus. This extension may remove a motion blur as well as a known defocus blur caused by a movement of the spectrum coded aperture.
  • the above-described image capturing apparatus may extract depth information from a photograph as well as a video stream that is appropriately encoded by the coded aperture and is appropriately recorded by a detector array.
  • the spectrum coded aperture may be modified so as to mix a photograph and depth information on the image captured according to the presence or absence of the spectrum coded aperture.
  • the depth map extraction process may be performed by just using key frames (for example, every Nth frame) of a video sequence, and other frames may be restored by using image information and a depth map of the key frame. This process may increase time efficiency and image quality of the system.
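The key-frame scheme can be sketched as a simple scheduling loop. This is an illustrative sketch only: `process_sequence` and the `estimate_depth` callback are hypothetical, and the sketch merely reuses the key frame's depth map for the following frames rather than performing the restoration described above.

```python
def process_sequence(frames, n, estimate_depth):
    """Run the (expensive) depth estimator on every n-th frame only and
    pair each frame with the depth map of its most recent key frame."""
    results = []
    key_depth = None
    for index, frame in enumerate(frames):
        if index % n == 0:  # key frame: frames 0, n, 2n, ...
            key_depth = estimate_depth(frame)
        results.append((frame, key_depth))
    return results

# Stub estimator that records which frames it was called on.
calls = []
def fake_estimator(frame):
    calls.append(frame)
    return "depth-of-" + frame

frames = ["f%d" % i for i in range(10)]
paired = process_sequence(frames, 4, fake_estimator)
```

With N = 4 and ten frames the estimator runs three times instead of ten, which illustrates where the time saving mentioned above comes from.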
  • the type and the geometric structure of the spectrum coded aperture may be changed automatically according to the image captured by the detector array.
  • the aperture including the circular filter and the opaque region, as illustrated in FIG. 2a may be used instead of reducing the exposure time or increasing the f-number of the optical system.
  • the depth extraction/image restoration apparatus may be included in mobile phone camera or web camera equipment, but is not limited thereto.
  • the depth extraction/image restoration apparatus according to the exemplary embodiment may be used in a compact optical camera.
  • FIG. 6a is a diagram of a permanently fixed color coded aperture in an optical system of a camera, according to an exemplary embodiment. Since light passes through a fixed color filter aperture, the image quality of a color image may degrade. Each color band may be projected at different positions of a photograph array causing a ghost image effect. A depth estimation and a color image restoration may be performed by the above-described depth estimation method.
  • FIG. 6b is a diagram of a color coded aperture that is movable within an optical system by a mechanical or electromagnetic unit, according to an exemplary embodiment.
  • in a 3D mode, the color coded aperture may be present in the optical system to acquire depth information on a scene and a computationally restored color image.
  • in a 2D mode, the color coded aperture may be absent from the optical system, which then captures an original 2D image without distortion.
  • the slider (also referred to as an aperture fixture) may switch between the spectrum coded apertures, for example, according to a control signal from the data processor 103.
  • the present embodiment is not limited thereto, and the spectrum coded apertures may be switched manually or under the control of a central processing unit (CPU) in the smartphone.
  • the data processor 103 may extract depth information from the captured image and determine whether to change the aperture to another one based on the depth information.
  • the data processor 103 may send a control signal to the slider so that the previously used aperture is changed to another one which is known to have a better depth discrimination ability.
  • FIG. 6c is a diagram of a spectrum coded aperture with a spatial light modulator (SLM) capable of changing a spectrum passband of a coded color aperture, based on time, according to an exemplary embodiment.
  • the apparatus of FIG. 6c may operate in a 2D or 3D mode as described above with reference to the exemplary embodiment of FIG. 6b.
  • the apparatuses of FIGS. 6b and 6c may also acquire alternating video frames.
  • one frame may be obtained in the 2D mode and another frame may be obtained in the 3D mode. Consequently, the system may acquire two video streams.
  • One video stream may include original color frames acquired in the 2D mode, and the other video stream may include frames suitable for the depth extraction.
  • FIG. 6d is a diagram of a spectrum coded aperture that is attachable to a smartphone lens, according to an exemplary embodiment. Due to a larger size of an optical system, the apparatus of FIG. 6d may obtain better depth map image quality as well as better optical efficiency and video image quality than apparatuses with the attached spectrum coded aperture.
  • the apparatus includes a spectrum filtered aperture, and at least one of an RGB color filter, a red, green, blue, and white (RGBW) color filter, a cyan, magenta, yellow (CMY) filter, a cyan, magenta, yellow, green (CMYG) color filter, and an infrared (IR) filter, but is not limited thereto.
  • the exemplary embodiment may be applied to any digital camera, including a mobile phone camera, with minor hardware modification, and may generate the disparity/depth maps by using low-cost algorithms.
  • the acquired disparity map may be used in image splitting, custom blur type (bokeh), computational viewpoint disparity, image filtering, digital post-refocusing, and other special effects.
  • the term "unit" may mean a hardware component, such as a processor or a circuit, and/or a software component that is executed by a hardware component such as a processor.
  • an exemplary embodiment can be embodied as computer-readable code on a computer-readable recording medium.
  • the computer-readable recording medium is any data storage device that can store data that can be thereafter read by a computer system. Examples of the computer-readable recording medium include read-only memory (ROM), random-access memory (RAM), CD-ROMs, magnetic tapes, floppy disks, and optical data storage devices.
  • the computer-readable recording medium can also be distributed over network-coupled computer systems so that the computer-readable code is stored and executed in a distributed fashion.
  • an exemplary embodiment may be written as a computer program transmitted over a computer-readable transmission medium, such as a carrier wave, and received and implemented in general-use or special-purpose digital computers that execute the programs.
  • one or more units of the above-described apparatuses and devices can include circuitry, a processor, a microprocessor, etc., and may execute a computer program stored in a computer-readable medium.

Abstract

A system for image capturing and depth extraction includes a camera and a data processor. The camera includes: a spectrum coded aperture including at least two regions that pass spectrum channels of an incident light field which are different from each other; and a sensor configured to record the at least two spectrum channels to form an image captured in a sensor basis. The data processor is configured to convert the image captured in the sensor basis into an image of a processing basis, extract a disparity from the image of the processing basis, and convert the disparity into depth information.

Description

    METHOD AND APPARATUS FOR IMAGE CAPTURING AND SIMULTANEOUS DEPTH EXTRACTION
  • Apparatuses and methods consistent with exemplary embodiments relate to computational photography, and more particularly, to light field capturing and processing.
  • One of the main applications of light field photography is in extraction of image depth information. Examples of apparatuses for light field capturing or image depth information extraction may include a stereo camera, a plenoptic camera, a camera with a binary coded aperture, and a camera with a color coded aperture. However, these apparatuses may require additional space, increase costs of cameras, or cause a reduction in optical efficiency.
  • A system for image capturing and depth extraction including: a lens system; a spectrum coded aperture including at least two regions that pass spectrum channels of an incident light field which are different from each other; and a sensor configured to record the at least two spectrum channels to form an image captured in a sensor basis; and a data processor configured to convert the image captured in the sensor basis into an image of a processing basis, extract a disparity from the image of the processing basis, and convert the disparity into depth information.
  • The above and/or other aspects will be more apparent by describing certain exemplary embodiments, with reference to the accompanying drawings, in which:
  • FIG. 1 is a diagram of a depth extraction/image restoration apparatus according to an exemplary embodiment;
  • FIGS. 2a to 2f are diagrams of spectrum coded apertures according to exemplary embodiments;
  • FIGS. 3a to 3i are diagrams for describing a channel shift;
  • FIG. 4 is a high-level outline diagram of a depth information extraction/image restoration method according to an exemplary embodiment;
  • FIG. 5 is a diagram for describing a parabola fitting according to an exemplary embodiment; and
  • FIGS. 6a to 6d are diagrams for describing a depth extraction/image restoration apparatus according to an exemplary embodiment.
  • According to an aspect of an exemplary embodiment, there is provided a system for image capturing and depth extraction including: a lens system; a spectrum coded aperture including at least two regions that pass spectrum channels of an incident light field which are different from each other; and a sensor configured to record the at least two spectrum channels to form an image captured in a sensor basis; and a data processor configured to convert the image captured in the sensor basis into an image of a processing basis, extract a disparity from the image of the processing basis, and convert the disparity into depth information.
  • The different spectrum channels may form a basis of the spectrum coded aperture.
  • The processing basis may be different from the sensor basis and the basis of the spectrum coded aperture.
  • The spectrum coded aperture may have three regions, and the three regions may include a transparent region in a central portion, and two regions having spectrum bandwidths respectively corresponding to yellow and cyan.
  • The processing basis may include three vectors, and the three vectors may include a vector corresponding to yellow, a vector corresponding to cyan, and a vector perpendicular to the two vectors.
  • The spectrum coded aperture may include two regions having spectrum bandwidths respectively corresponding to yellow and cyan.
  • The processing basis may include three vectors, and the three vectors may include a vector corresponding to yellow, a vector corresponding to cyan, and a vector perpendicular to the two vectors.
  • The spectrum coded aperture may include three congruent regions having spectrum bandwidths respectively corresponding to yellow, cyan, and magenta.
  • The processing basis may include vectors corresponding to yellow, cyan, and magenta.
  • The spectrum coded aperture may include three non-congruent regions having spectrum bandwidths respectively corresponding to yellow, cyan, and magenta.
  • The processing basis may include vectors respectively corresponding to yellow, cyan, and magenta.
  • The spectrum coded aperture may have a smooth bandwidth change over an aperture region.
  • The spectrum coded aperture may be fixed to the lens system.
  • The spectrum coded aperture may be attachable to and detachable from the lens system.
  • The spectrum coded aperture may be moved out of the optical train so that it does not participate in the image formation.
  • The captured image may be an image selected from a video sequence.
  • The spectrum coded aperture may be inserted into the lens system for the image selected from the video sequence.
  • The spectrum coded aperture may be inserted into an aperture stop of the lens system.
  • The lens system may include a single lens and the spectrum coded aperture may be located in the lens.
  • The spectrum coded aperture may correct a previous video image of the video sequence acquired by the sensor.
  • The spectrum coded aperture may have a combination of an opaque region and a congruent region, and the congruent region may be transparent or transmit ultraviolet light, infrared light, or visible light.
  • The spectrum coded aperture may have a combination of an opaque region and a non-congruent region, and the non-congruent region may be transparent or transmit ultraviolet light, infrared light, or visible light.
  • The spectrum coded aperture may be a spatial light modulator (SLM).
  • The data processor may include a preprocessing unit configured to perform the converting the captured image, a disparity estimation unit configured to perform the extracting the disparity, and a conversion unit configured to perform the converting the disparity to the depth information.
  • The data processor may further include an image restoration unit configured to restore the captured image based on the extracted disparity.
  • According to another aspect of an exemplary embodiment, there is provided a method of image capturing and depth extraction including: recording at least two shifted spectrum channels of a light field to form an image captured from a video; converting the captured image into an image of a processing basis; estimating a disparity based on a correlation between pixels of the spectrum channels in the processing basis to extract a disparity map; restoring the captured image based on the extracted disparity map; and converting the disparity map into a depth map.
  • The estimating of the disparity may include: generating candidate images having respective shifts in the spectrum channels; computing matching cost involved in the candidate images in the spectrum channels; propagating a matching cost involved in a low textured region of the candidate images; and estimating a matching cost having a sub-pixel accuracy based on the propagated matching cost.
  • The correlation between the pixels of the spectrum channels used for the disparity estimation may include a correlation metric computed in a sparse moving window.
  • The correlation between the pixels of the spectrum channels used for the disparity estimation may be computed by using at least one stereo matching algorithm.
  • The computing of the correlation by using the stereo matching algorithm may include sum of absolute differences (SAD), normalized cross correlation (NCC), or Laplacian image contrast (LIC).
  • The correlation metric may include a fast Fourier transform (FFT).
  • The correlation metric may include a recursive exponential filter (REF).
  • The restoring of the captured image may include performing image blurring.
  • The restoring of the captured image may include performing a spectrum channel alignment in the processing basis.
  • According to another aspect of an exemplary embodiment, there is provided a mobile device for image capturing and depth extraction in ultraviolet light, infrared light, or visible light including: a lens system; at least one spectrum coded aperture including at least two regions that pass spectrum channels of an incident light field which are different from each other; a sensor configured to record the at least two spectrum channels to form an image captured in a sensor basis; and a coded aperture fixture configured to move at least one spectrum coded aperture relatively with respect to the lens system; and a data processor configured to convert the image captured in the sensor basis into an image of a processing basis, extract a disparity from the image of the processing basis, and convert the disparity into depth information.
  • The coded aperture fixture may be configured to replace at least two spectrum coded apertures in an optical train.
  • The coded aperture fixture may be configured to shift all the spectrum coded apertures from the optical train.
  • The coded aperture fixture may be inserted into an aperture stop.
  • The spectrum coded aperture may have a combination of an opaque region and a congruent region, and the congruent region may be transparent or transmit ultraviolet light, infrared light, or visible light.
  • The spectrum coded aperture may have a combination of an opaque region and a non-congruent region, and the non-congruent region may be transparent or transmit ultraviolet light, infrared light, or visible light.
  • According to another aspect of an exemplary embodiment, there is provided an apparatus for image capturing including: a lens system; at least two spectrum coded apertures including a first aperture and a second aperture which have different characteristics of optical efficiency and depth discrimination from each other; a coded aperture fixture adapted to dispose the first aperture in front of the lens system; and a data processor configured to obtain depth information of an image captured through the first spectrum coded aperture, and control the coded aperture fixture to determine whether to switch the first aperture to the second aperture based on the depth information.
  • The first aperture may include a transparent region placed in the center of the first aperture and two regions separated by the transparent region. The two regions pass different color spectrums, respectively.
  • The two regions may pass a yellow spectrum and a cyan spectrum, respectively.
  • The second aperture may include two equally divided regions which may pass yellow and cyan spectrums, respectively.
  • Exemplary embodiments are described in greater detail below with reference to the accompanying drawings.
  • In the following description, like drawing reference numerals are used for like elements, even in different drawings. The matters defined in the description, such as detailed construction and elements, are provided to assist in a comprehensive understanding of the exemplary embodiments. However, it is apparent that the exemplary embodiments can be practiced without those specifically defined matters. Also, well-known functions or constructions are not described in detail since they would obscure the description with unnecessary detail.
  • As used herein, the term “and/or” includes any and all combinations of one or more of the associated listed items.
  • It will be understood that when a region is referred to as being “connected to” or “coupled to” another region, it may be directly connected or coupled to the other region or intervening regions may be present. It will be understood that terms such as “comprise”, “include”, and “have”, when used herein, specify the presence of stated elements, but do not preclude the presence or addition of one or more other elements.
  • FIG. 1 is a diagram of a depth extraction/image restoration apparatus 101 according to an exemplary embodiment. The depth extraction/image restoration apparatus 101 may include a camera 102 and a data processor 103. The camera 102 may include an optical lens (objective lens) 104, a spectrum coded aperture 105, and a sensor 106. The spectrum coded aperture 105 may be inserted into an optical system which is constituted by the combination of the lens 104, the sensor 106, and other optical parts. The spectrum coded aperture 105 may be placed in an optical path that a ray of light follows through the optical system. The spectrum coded aperture 105 may be located in a diaphragm plane. The sensor 106 may be configured to discriminate different spectrum bandwidths from one another. For example, the sensor 106 may be a sensor covered with a mosaic color/spectrum filter array, or a color stacked photodiode sensor. The data processor 103 may include a preprocessing unit 108, a disparity estimation unit 109, an image restoration unit 110, and a disparity-to-depth conversion unit 111. The data processor 103 may receive a raw image 107 captured by the camera 102. The preprocessing unit 108 may convert the captured image 107 from a sensor basis to a processing basis in which a spectrum coded aperture filter may not be present. The disparity estimation unit 109 may perform disparity estimation. Then the image restoration unit 110 may perform image restoration. The disparity-to-depth conversion unit 111 may perform disparity-to-depth conversion based on optical system parameters.
  • The spectrum coded aperture 105 may be divided into sub-regions that respectively have spectrum passbands. The number, geometric structures, and spectrum passbands of the sub-regions may be changed according to applications of optical efficiency, a depth map, and color image restoration image quality. Some of them are illustrated in FIGS. 2a to 2f.
  • FIGS. 2a to 2f are diagrams illustrating patterns of various spectrum coded apertures having a tradeoff relationship among the optical efficiency, the depth map, and the color image restoration image quality. For light field coding, spectrum filters may be used. Examples of the spectrum filters may include a visibly recognizable color filter, an infrared/ultraviolet filter, and a multi-path filter having two or more passbands.
  • Main characteristics of a spectrum coded aperture are optical efficiency, depth discrimination ability, and color image restoration image quality. The highest depth discrimination index may be obtained from a geometric structure of a spectrum coded aperture having the longest distance between the centers of aperture sub-regions corresponding to respective optical spectrum bands. FIG. 2a shows an aperture pattern that has a relatively long distance between the centers of the sub-regions and a relatively small filter size in the sub-regions. Consequently, an opaque region of the coded aperture may be increased so that the optical system has a reduced optical efficiency. If the aperture design is deformed to enhance optical efficiency as shown in FIG. 2b, the accuracy of the extracted disparity may typically deteriorate.
  • For specific applications, there may exist a tradeoff between optical efficiency and depth discrimination ability. For example, FIG. 2c shows a geometric structure of an aperture having a cyan filter and a yellow filter on respective halves, and FIG. 2d shows a geometric structure of an aperture having a transparent sub-region, a cyan filter, a yellow filter, and a green filter. Here, the yellow filter may have a passband including green and red light spectrums. The cyan filter may have a passband including green and blue light spectrums. The transparent region may not filter incoming light. The green channel may not be distorted by these filters and may be used as a reference in an image restoration process. In comparison with the aperture structure of FIG. 2d, the aperture structure in FIG. 2c may yield a better depth map. However, the aperture structure of FIG. 2d may have a superior optical efficiency to the aperture structure of FIG. 2c. FIG. 2a shows an aperture having a circular filter and an opaque region, which may be used to obtain a high-quality depth map image when light is excessive. The aperture structure of FIG. 2a may compensate for excessive light directed to the camera 102. An aperture structure having infrared light and ultraviolet light on halves as shown in FIG. 2c may be a fully opened aperture and may have the same optical efficiency and have excellent potential with respect to depth extraction. However, an additional process such as image restoration and photograph array correction may be performed for an image captured through the aperture structure of FIG. 2. FIG. 2e shows a spectrum coded aperture having three or more spectrum sub-regions with a hive arrangement and FIG. 2f illustrates a spectrum coded aperture having a smooth bandwidth change over an aperture region.
  • The light field, which is corrected by the spectrum coded aperture 105, may be input to the image sensor 106 that generates the captured raw image 107.
  • The light field having passed through the spectrum coded aperture 105 may be coded. That is, the light field may be divided into different spectrum parts by passing through corresponding aperture sub-regions. Therefore, different views may be extracted from a single captured image with respect to the same scene by dividing the single captured image into spectrum channels correspondingly with respect to the spectrum coded aperture.
  • FIG. 3a illustrates the captured image 107 obtained by a sensor 106 that is capable of discriminating the corresponding spectrum bandwidth with respect to the spectrum coded aperture described above with reference to FIG. 2b. In the optical system, a position of a defocused object 302 (in FIG. 3a), which is obtained by the presence of the spectrum coded aperture, may be changed with respect to relatively corresponding spectrum filter positions as shown in FIGS. 3d, 3e, and 3f as compared to a focused object 301 (in FIG. 3a). Such a view may be used for extracting a disparity map and restoring the captured image 107. The results of image deblurring with respect to the spectrum channels are illustrated in FIGS. 3g, 3h, and 3i. A deblurred color image is illustrated in FIG. 3b. A deblurred image (restored image) aligned in the spectrum channel is illustrated in FIG. 3c.
  • FIG. 4 is a high-level outline diagram of the data processor 103. A system input may be the raw image 107 captured by the camera 102. In operation 108, the captured image 107 may be preprocessed by denoising and demosaicing techniques and be translated from a sensor spectrum basis to a processing basis. In general, the processing basis may not be a spectrum filter. is an image color channel acquired by an optical system sensor. In order to perform such a conversion, a conversion matrix Π needs to be estimated first. For simplicity, it is assumed that the camera 102 uses the aperture structure having a cyan filter and a yellow filter as described above with reference to FIG. 2c, and a red, green, blue (RGB) mosaic color filter array.
  • and are color filters that represent cyan and yellow filters in an RGB color space. In order to construct a conversion matrix that has an excellent condition number and is capable of a non-degenerate inverse conversion, a third basis vector is defined as a vector product . Vectors , and are respectively a red basis, a green basis, and a blue basis for the camera sensor 106. In the sensor spectrum basis,
  • <Equation 1>
  • An auxiliary matrix Π is represented as follows:
  • <Equation 2>
  • If the matrix Π is used, any observed color w may be decomposed by an aperture filter response.
  • <Equation 3>
  • means a channel intensity in the spectrum filter basis (cyan, X, and yellow). The matrix Π may be inversely converted. represents an image channel acquired in the processing basis. In the case of a different number of basis vectors in the sensor basis and the processing basis, an inverse conversion matrix (a left inverse matrix and a right inverse matrix) may be used.
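  • The basis conversion described above can be sketched as follows. This is a minimal illustration, not the patent's implementation: the filter response vectors `f_cyan` and `f_yellow` are idealized assumed values, and the function names are hypothetical. The third basis vector is the cross product of the two filter vectors, and Π is assumed to hold the basis vectors as its columns.

```python
import numpy as np

# Assumed idealized filter responses in the sensor RGB basis (illustrative values only).
f_cyan = np.array([0.0, 1.0, 1.0])    # passes green and blue
f_yellow = np.array([1.0, 1.0, 0.0])  # passes red and green
f_x = np.cross(f_cyan, f_yellow)      # third basis vector, perpendicular to both filters

# Columns of Pi are the processing-basis vectors (cyan, X, yellow) expressed in RGB.
Pi = np.column_stack([f_cyan, f_x, f_yellow])


def to_processing_basis(rgb_image):
    """Convert an (H, W, 3) RGB image into (cyan, X, yellow) coefficients."""
    pi_inv = np.linalg.inv(Pi)
    # Each pixel w (a row vector here) becomes Pi^{-1} w.
    return rgb_image @ pi_inv.T


def to_sensor_basis(proc_image):
    """Convert (cyan, X, yellow) coefficients back to RGB."""
    return proc_image @ Pi.T
```

  • Because the third vector is perpendicular to the other two, Π stays invertible as long as the two filter vectors are not parallel, which is the property the description relies on.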
  • In operation 109, a disparity may be estimated with respect to all pixels of the image. is a matching cost for disparity estimation and may use a conventional cross-correlation method of a shifted spectrum channel .
  • <Equation 4>
  • A generalized mutual correlation metric may be used in the disparity estimation unit 109 so as to process an arbitrary number of spectrum channels. represents a set of nth acquired views in the nth acquired spectrum channel with respect to the same scene from slightly different viewpoints. represents an frame. A conventional correlation matrix may be expressed by the set and a disparity value d .
  • <Equation 5>
  • where means a parallel shift in a corresponding channel.
  • A determinant of the matrix is a good measure of the mutual correlation . In practice, in a case where all channels are completely correlated, the matrix is a singular matrix and the determinant thereof is 0. In another aspect, in a case where data is completely uncorrelated, the determinant of the matrix is 1. In order to estimate the depth map by using such an operator, the disparity value d corresponding to the least value of the determinant needs to be found from each pixel of the image.
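  • The determinant-based matching cost can be illustrated with the sketch below. It is a simplified example under stated assumptions: the cost is evaluated over a whole patch rather than per pixel with a moving window, shifts are integer and circular (`np.roll`), and the per-channel shift directions are supplied by the caller. The function names are hypothetical.

```python
import numpy as np


def correlation_determinant(channels):
    """Determinant of the correlation matrix of equally sized channel patches."""
    flat = np.stack([c.ravel() for c in channels])
    return np.linalg.det(np.corrcoef(flat))


def estimate_disparity(channels, directions, candidates):
    """Return the candidate d whose compensating shifts minimize the determinant.

    directions holds one (dy, dx) unit shift per channel; a disparity d is
    assumed to displace channel i by d * directions[i] pixels.
    """
    costs = []
    for d in candidates:
        # Undo the assumed displacement of every channel for this candidate.
        shifted = [np.roll(c, shift=(-d * dy, -d * dx), axis=(0, 1))
                   for c, (dy, dx) in zip(channels, directions)]
        costs.append(correlation_determinant(shifted))
    return candidates[int(np.argmin(costs))], costs
```

  • When the candidate matches the true disparity, the compensated channels become fully correlated and the determinant drops toward 0; for mismatched candidates it stays closer to 1.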
  • Other operators for matching cost computation may be used. Examples of the operators may include conventional stereo matching metrics, Laplacian contrast metrics, and feature based metrics.
  • All statistical computations may use a conventional local moving window. However, in an exemplary embodiment, an exponential moving window may be used because this complies with a naturally sparse gradient prior and propagates a matching cost with respect to a low textured region. Furthermore, an exponential kernel filtering may be efficiently computed by using a recursive convolution in a spectrum domain.
  • <Equation 6>
  • where is a result of convolution with respect to an image I at an nth pixel, and is defined as follows:
  • <Equation 7>
  • where is an exponential damping factor that represents an image similarity required in a spatial domain.
  • This equation may also be used for computing an effective approximate value of a joint bilateral filter for propagating disparity information on a low textured region.
  • <Equation 8>
  • where is a disparity of an nth pixel, and is a function representing the degree of similarity of an image color.
  • <Equation 9>
  • where 1 represents the degree of similarity between color images in a range domain.
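  • The recursive exponential filtering mentioned above can be sketched in one dimension as follows. Since Equations 6 and 7 are not reproduced here, this is a generic two-pass recursive exponential filter and not necessarily the exact recursion of the embodiment; `alpha` stands in for the damping factor and the function names are hypothetical.

```python
import numpy as np


def exp_filter_1d(x, alpha):
    """Convolve x with the kernel alpha**|k| using one forward and one backward pass."""
    x = np.asarray(x, dtype=float)
    fwd = np.empty_like(x)
    bwd = np.empty_like(x)
    acc = 0.0
    for n in range(len(x)):              # causal pass
        acc = x[n] + alpha * acc
        fwd[n] = acc
    acc = 0.0
    for n in range(len(x) - 1, -1, -1):  # anti-causal pass
        acc = x[n] + alpha * acc
        bwd[n] = acc
    return fwd + bwd - x                 # the center sample was counted twice


def exp_smooth(x, alpha):
    """Normalized exponential moving window (weights sum to 1 at every sample)."""
    x = np.asarray(x, dtype=float)
    return exp_filter_1d(x, alpha) / exp_filter_1d(np.ones_like(x), alpha)
```

  • The cost per sample is constant regardless of the effective window size, which is why a recursive form is attractive for aggregating matching costs over large, low textured regions.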
  • Sub-pixel estimation may be performed by using a parabola fitting algorithm as shown in FIG. 5. In parabola fitting, three given points, , , may be taken into consideration. may be represented as (i.e., ), and and may be set as a previous argument and a next argument, respectively. A variable of a maximum value of a unique parabola satisfying , , and may be analytically computed in the following formula.
  • <Equation 10>
  • where and .
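  • A minimal sketch of three-point parabola fitting is given below. It uses the standard closed-form vertex of the parabola through three equally spaced samples; whether this matches Equation 10 term by term is an assumption, and the function name is hypothetical.

```python
def subpixel_refine(d, cost_prev, cost_at, cost_next):
    """Refine an integer disparity d using the costs at d - 1, d, and d + 1.

    Returns the abscissa of the vertex of the unique parabola through the
    three samples (a maximum or a minimum, depending on the cost's shape).
    """
    denom = cost_prev - 2.0 * cost_at + cost_next
    if denom == 0.0:
        # The three samples are collinear; no unique vertex exists.
        return float(d)
    return d + 0.5 * (cost_prev - cost_next) / denom
```

  • For example, sampling the parabola (x - 5.3)^2 at x = 4, 5, and 6 and passing the three values to `subpixel_refine` with d = 5 recovers the vertex at 5.3.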
  • The image restoration unit 110 may perform preliminary image restoration based on the disparity estimation. The captured image of FIG. 3a may be deblurred as shown in FIG. 3b. A color alignment of the deblurred image may be performed as shown in FIG. 3c. FIG. 3a illustrates an example of the image captured by the system. FIG. 2b illustrates a geometric structure of a spectrum coded aperture. The system may be focused on one object 301 and another object 302 may be defocused. The defocused object 302 captured by the camera 102 may cause a spectrum channel misalignment in a photo array to the extent that the blurred images 305, 306, and 307 as shown in FIG. 3d, FIG. 3e, and FIG. 3f are blurred with respect to a conventional imaging system. The image deblurring may be performed based on a deconvolution technology and be applied to images corresponding to different disparity values. For example, while the focused object 301 does not require the deblurring, the images 305, 306, and 307 of the defocused object 302 in the respective spectrum channels are deblurred with respect to the disparity levels thereof. The deblurred image of FIG. 3b is still misaligned with respect to the spectrum channels , , and , as shown in FIGS. 3g, 3h, and 3i. Misalignment vectors , , and respectively corresponding to the spectrum channels , , and may be estimated at the respective positions of the captured image 302. A restored image 304 may be acquired by the aligned spectrum channel, based on the misalignment vectors , , and .
  • <Equation 11>
  • where i is the number of spectrum channels, and and are projections in an x-axis direction and a y-axis direction of a vector , respectively.
  • The image may be converted from a spectrum filter basis to a display unit basis . The imaging system has a vignetting effect that results in a reduction of an image’s brightness at the periphery of the image, as compared to the center of the image. In such a system, the vignetting effect may be mathematically alleviated by the following equation.
  • <Equation 12>
  • where and are a captured image and a restored image at an pixel, respectively. is an unvignetting coefficient previously computed once during the calibration of the optical system.
  • <Equation 13>
  • where and are a captured image and an unvignetted image of a known image at an pixel, respectively.
  • In a case where the coded aperture is present, the unvignetting coefficient needs to be independently computed with respect to each spectrum channel. This process may be performed by the image restoration unit 110.
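  • The per-channel vignetting correction can be sketched as below. This is an illustration under assumptions rather than the calibrated procedure of the embodiment: the calibration target is assumed to be uniformly lit, the brightest pixel of each channel is assumed to be unvignetted, and the function names are hypothetical.

```python
import numpy as np


def unvignetting_coefficients(flat_capture, eps=1e-6):
    """Per-pixel, per-channel gains from an (H, W, C) capture of a uniform target."""
    flat = np.asarray(flat_capture, dtype=float)
    # Assumption: the brightest pixel of each channel represents the unvignetted level.
    reference = flat.max(axis=(0, 1), keepdims=True)
    return reference / np.maximum(flat, eps)


def remove_vignetting(image, coeffs):
    """Apply the precomputed gains to a captured image of the same shape."""
    return np.asarray(image, dtype=float) * coeffs
```

  • The coefficients are computed once during calibration and stored; each spectrum channel gets its own gain map because the coded aperture attenuates the channels differently across the field.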
  • A final image refinement process may be used to reduce artifacts caused by inaccurate disparity estimation. Technologies based on a human’s visual perception (for example, bilateral filtering, median filtering, or the like) and natural image priors (for example, sparse gradient prior, color lines prior, or the like) may be used.
  • The disparity-to-depth conversion unit 111 may convert the disparity into a depth map 114 with respect to a single lens optical system by using optical system parameters 112 generalized in a thin lens formula.
  • <Equation 14>
  • where is a lens center distance, and and are distances from each lens to an object plane and an image plane, respectively.
  • For a complex objective lens, this formula may depend on the design of the optical system.
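  • The disparity-to-depth conversion for a single thin lens can be sketched as follows. Equation 14 is not reproduced here, so the relation below is derived from standard thin-lens geometry and is an assumption rather than the patent's exact formula. All parameter names are hypothetical: `baseline` is the distance between the centers of two aperture sub-regions, `focus_distance` is the depth of the in-focus plane, and positive disparity is taken to mean an object farther away than the in-focus plane. All lengths are in meters.

```python
def _sensor_distance(focal_length, focus_distance):
    # Thin lens formula: 1/f = 1/z_focus + 1/s, solved for the lens-to-sensor distance s.
    return 1.0 / (1.0 / focal_length - 1.0 / focus_distance)


def depth_to_disparity(depth, focal_length, focus_distance, baseline, pixel_pitch):
    """Disparity in pixels between two aperture sub-region views of a point at `depth`."""
    s = _sensor_distance(focal_length, focus_distance)
    return baseline * s * (1.0 / focus_distance - 1.0 / depth) / pixel_pitch


def disparity_to_depth(disparity_px, focal_length, focus_distance, baseline, pixel_pitch):
    """Invert depth_to_disparity: object distance for a disparity given in pixels."""
    s = _sensor_distance(focal_length, focus_distance)
    inv_depth = 1.0 / focus_distance - disparity_px * pixel_pitch / (baseline * s)
    return 1.0 / inv_depth
```

  • Under these assumptions, zero disparity maps back to the in-focus distance, and the sensitivity of disparity to depth grows with the baseline, which is consistent with the observation that apertures with widely separated sub-regions discriminate depth better.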
  • The above-described image capturing apparatus may be extended for performing a temporal coding and a spectral coding. The temporal coding may be performed while moving the spectrum coded aperture with respect to the image capturing apparatus. This extension may remove a motion blur as well as a known defocus blur caused by a movement of the spectrum coded aperture.
  • The above-described image capturing apparatus may extract depth information from a photograph as well as a video stream that is appropriately encoded by the coded aperture and is appropriately registered by a detector array. In addition, the spectrum coded aperture may be modified so as to mix a photograph and depth information on the image captured according to the presence or absence of the spectrum coded aperture. For example, the depth map extraction process may be performed by just using a key frame (for example, every Nth frame) of a video sequence, and other frames may be restored by using image information and a depth map of the key frame. This process may increase time efficiency and image quality of the system.
  • Furthermore, the type of the spectrum coded aperture and the geometric structure may be changed according to the image automatically captured by the detector array. For example, when light is excessive, the aperture including the circular filter and the opaque region, as illustrated in FIG. 2a, may be used instead of reducing the exposure time or increasing the f-number of the optical system.
  • The depth extraction/image restoration apparatus according to the exemplary embodiment may be included in mobile phone camera or web camera equipment, but is not limited thereto. The depth extraction/image restoration apparatus according to the exemplary embodiment may be used in a compact optical camera.
  • FIG. 6a is a diagram of a permanently fixed color coded aperture in an optical system of a camera, according to an exemplary embodiment. Since light passes through a fixed color filter aperture, the image quality of a color image may degrade. Each color band may be projected at different positions of a photograph array causing a ghost image effect. A depth estimation and a color image restoration may be performed by the above-described depth estimation method.
  • FIG. 6b is a diagram of a color coded aperture that is movable in an optical system by a mechanical or electromagnetic unit, according to an exemplary embodiment. In a three-dimensional (3D) mode, the color coded aperture may be present in an optical system to acquire depth information on a scene and a computationally restored color image. In a two-dimensional (2D) mode, the color coded aperture may not be present in an optical system that captures an original 2D image without distortion.
  • As shown in FIG. 6b, at least two spectrum coded apertures may be attached to the smartphone. The slider (also referred to as an aperture fixture) may switch between the spectrum coded apertures, for example, according to a control signal from the data processor 103. However, the present embodiment is not limited thereto, and the spectrum coded apertures may be switched manually or under the control of a central processing unit (CPU) in the smartphone. When an image is captured through one of the spectrum coded apertures, the data processor 103 may extract depth information from the captured image and determine whether to change the aperture to another one based on the depth information. For example, if the data processor 103 determines that the depth discrimination of the image does not meet a requirement preset by a user input, the data processor 103 may send a control signal to the slider so that the previously used aperture is changed to another one which is known to have a better depth discrimination ability.
  • FIG. 6c is a diagram of a spectrum coded aperture with a spatial light modulator (SLM) capable of changing a spectrum passband of a coded color aperture, based on time, according to an exemplary embodiment. The apparatus of FIG. 6c may operate in a 2D or 3D mode as described above with reference to the exemplary embodiment of FIG. 6b.
  • In addition, the apparatuses of FIGS. 6b and 6c may also acquire alternating video frames. By changing the aperture before the frame is recorded, one frame may be obtained in the 2D mode and another frame may be obtained in the 3D mode. Consequently, the system may acquire two video streams. One video frame may include an original color frame acquired in the 2D mode, and another video stream may include a frame suitable for the depth extraction.
  • FIG. 6d is a diagram of a spectrum coded aperture that is attachable to a smartphone lens, according to an exemplary embodiment. Due to the larger size of its optical system, the apparatus of FIG. 6d may obtain better depth map quality, as well as better optical efficiency and video image quality, than apparatuses with the attached spectrum coded aperture.
  • The apparatus according to the exemplary embodiment includes a spectrum filtered aperture, and at least one of a red, green, blue (RGB) color filter, a red, green, blue, and white (RGBW) color filter, a cyan, magenta, yellow (CMY) filter, a cyan, magenta, yellow, green (CMYG) color filter, and an infrared (IR) filter, but is not limited thereto. A combination of sensors having color/spectrum spaces may be used.
  • The exemplary embodiment may be applied to any digital camera, including a mobile phone camera, with only a minor hardware modification, and may generate the disparity/depth maps using low-cost algorithms. The acquired disparity map may be used in image splitting, custom blur type (bokeh), computational viewpoint disparity, image filtering, and digital post-refocusing with other special effects.
  • In addition, the term “unit” as used herein may mean a hardware component, such as a processor or a circuit, and/or a software component that is executed by a hardware component such as a processor.
  • While not restricted thereto, an exemplary embodiment can be embodied as computer-readable code on a computer-readable recording medium. The computer-readable recording medium is any data storage device that can store data that can be thereafter read by a computer system. Examples of the computer-readable recording medium include read-only memory (ROM), random-access memory (RAM), CD-ROMs, magnetic tapes, floppy disks, and optical data storage devices. The computer-readable recording medium can also be distributed over network-coupled computer systems so that the computer-readable code is stored and executed in a distributed fashion. Also, an exemplary embodiment may be written as a computer program transmitted over a computer-readable transmission medium, such as a carrier wave, and received and implemented in general-use or special-purpose digital computers that execute the programs. Moreover, it is understood that in exemplary embodiments, one or more units of the above-described apparatuses and devices can include circuitry, a processor, a microprocessor, etc., and may execute a computer program stored in a computer-readable medium.
  • The foregoing exemplary embodiments are merely exemplary and are not to be construed as limiting. The present teaching can be readily applied to other types of apparatuses. Also, the description of the exemplary embodiments is intended to be illustrative, and not to limit the scope of the claims, and many alternatives, modifications, and variations will be apparent to those skilled in the art.

Claims (15)

  1. A system for image capturing and depth extraction, the system comprising:
    a lens system;
    a spectrum coded aperture including at least two regions that pass spectrum channels of an incident light field which are different from each other;
    a sensor configured to record the at least two spectrum channels to form an image captured in a sensor basis; and
    a data processor configured to convert the image captured in the sensor basis into an image of a processing basis, extract a disparity from the image of the processing basis, and convert the disparity into depth information.
  2. The system of claim 1, wherein the different spectrum channels form a basis of the spectrum coded aperture.
  3. The system of claim 2, wherein the processing basis is different from the sensor basis and the basis of the spectrum coded aperture.
  4. The system of claim 1, wherein the spectrum coded aperture has three regions including a transparent region in a central portion, and two regions having spectrum bandwidths respectively corresponding to yellow and cyan.
  5. The system of claim 1, wherein the at least two regions of the spectrum coded aperture have spectrum bandwidths respectively corresponding to yellow and cyan.
  6. The system of claim 1, wherein the spectrum coded aperture includes three congruent regions having spectrum bandwidths respectively corresponding to yellow, cyan, and magenta.
  7. The system of claim 1, wherein the spectrum coded aperture includes three non-congruent regions having spectrum bandwidths respectively corresponding to yellow, cyan, and magenta.
  8. The system of claim 1, wherein the spectrum coded aperture has a smooth bandwidth change over an aperture region.
  9. The system of claim 1, wherein the spectrum coded aperture is fixed to the lens system.
  10. The system of claim 1, wherein the spectrum coded aperture is attachable to and detachable from the lens system.
  11. The system of claim 1, wherein the spectrum coded aperture has a combination of an opaque region and a congruent region, and
    the congruent region is transparent or transmits ultraviolet light, infrared light, or visible light.
  12. The system of claim 1, wherein the spectrum coded aperture has a combination of an opaque region and a non-congruent region, and
    the non-congruent region is transparent or transmits ultraviolet light, infrared light, or visible light.
  13. The system of claim 1, wherein the data processor comprises a preprocessing unit configured to perform the converting the captured image, a disparity estimation unit configured to perform the extracting the disparity, and a conversion unit configured to perform the converting the disparity to the depth information.
  14. The system of claim 13, wherein the data processor further comprises an image restoration unit configured to restore the captured image based on the extracted disparity.
  15. A method of image capturing and depth extraction, the method comprising:
    recording at least two shifted spectrum channels of a light field to form an image captured from a video;
    converting the captured image into an image of a processing basis;
    estimating a disparity based on a correlation between pixels of the spectrum channels in the processing basis to extract a disparity map;
    restoring the captured image based on the extracted disparity map; and
    converting the disparity map into a depth map.
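The conversion, disparity estimation, and depth conversion steps recited in claims 1 and 15 can be sketched on a single scanline as follows. This is a simplified illustration under stated assumptions, not the claimed implementation: the basis matrix is an arbitrary example, disparities are restricted to integer pixel shifts, the depth relation is the textbook thin-lens model (this section gives no formula), the sign convention is chosen for the example, and the image restoration step is omitted. All function and parameter names are hypothetical.

```python
import math
from typing import List, Sequence


def to_processing_basis(pixel: Sequence[float],
                        matrix: Sequence[Sequence[float]]) -> List[float]:
    """Linear change of basis for one pixel (sensor -> processing basis)."""
    return [sum(m * c for m, c in zip(row, pixel)) for row in matrix]


def ncc(a: Sequence[float], b: Sequence[float]) -> float:
    """Normalized cross-correlation of two equal-length sequences."""
    n = len(a)
    mean_a, mean_b = sum(a) / n, sum(b) / n
    num = sum((x - mean_a) * (y - mean_b) for x, y in zip(a, b))
    den = math.sqrt(sum((x - mean_a) ** 2 for x in a)
                    * sum((y - mean_b) ** 2 for y in b))
    return num / den if den > 0 else 0.0


def estimate_disparity(ch_a: Sequence[float], ch_b: Sequence[float],
                       max_shift: int) -> int:
    """Integer shift d that best aligns ch_a[i] with ch_b[i + d]."""
    n = len(ch_a)
    best_shift, best_score = 0, -2.0  # NCC is never below -1
    for d in range(-max_shift, max_shift + 1):
        lo, hi = max(0, -d), min(n, n - d)  # overlap of the two windows
        score = ncc(ch_a[lo:hi], [ch_b[i + d] for i in range(lo, hi)])
        if score > best_score:
            best_shift, best_score = d, score
    return best_shift


def disparity_to_depth(disparity_px: float, pixel_pitch: float,
                       baseline: float, sensor_distance: float,
                       focus_distance: float) -> float:
    """Thin-lens depth from disparity (all lengths in meters).

    Assumed model: disparity = baseline * sensor_distance
                               * (1/focus_distance - 1/depth),
    where baseline is the distance between the centroids of the two
    aperture regions. Zero disparity means the object is in focus.
    """
    inv_depth = (1.0 / focus_distance
                 - disparity_px * pixel_pitch / (baseline * sensor_distance))
    return math.inf if inv_depth <= 0 else 1.0 / inv_depth


# Illustrative RGB -> (cyan, magenta, yellow)-like basis; not from the text.
RGB_TO_CMY = [[0, 1, 1], [1, 0, 1], [1, 1, 0]]
print(to_processing_basis([1, 2, 3], RGB_TO_CMY))  # [5, 4, 3]

channel_a = [0, 0, 1, 5, 2, 0, 0, 0, 0, 0]
channel_b = [0, 0, 0, 0, 1, 5, 2, 0, 0, 0]  # channel_a shifted by 2 pixels
shift = estimate_disparity(channel_a, channel_b, max_shift=3)
print(shift)  # 2
depth_m = disparity_to_depth(shift, pixel_pitch=1e-6, baseline=0.002,
                             sensor_distance=0.005, focus_distance=1.0)
```

A full disparity map would repeat the correlation search in a local window around every pixel; the single global shift here only shows the principle of matching shifted spectrum channels.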

Applications Claiming Priority (3)

Application Number Priority Date Filing Date Title
RU2014127469/08A RU2595759C2 (en) 2014-07-04 2014-07-04 Method and image capturing device and simultaneous extraction of depth
KR1020150083666A KR20160004912A (en) 2014-07-04 2015-06-12 Method and apparatus for image capturing and simultaneous depth extraction
PCT/KR2015/006966 WO2016003253A1 (en) 2014-07-04 2015-07-06 Method and apparatus for image capturing and simultaneous depth extraction

Publications (2)

Publication Number Publication Date
EP3164992A1 (en) 2017-05-10
EP3164992A4 (en) 2018-02-21

Family

Family Applications (1)

Application Number Priority Date Filing Date Title Status
EP15814578.9A 2014-07-04 2015-07-06 Method and apparatus for image capturing and simultaneous depth extraction Withdrawn, EP3164992A4 (en)

Country Status (4)

Country Link
EP (1) EP3164992A4 (en)
KR (1) KR20160004912A (en)
CN (1) CN106471804B (en)
RU (1) RU2595759C2 (en)

Cited By (1)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
US11893668B2 (en) 2021-03-31 2024-02-06 Leica Camera Ag Imaging system and method for generating a final digital image via applying a profile to image information

Families Citing this family (5)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
TWI669538B (en) * 2018-04-27 2019-08-21 點晶科技股份有限公司 Three-dimensional image capturing module and method for capturing three-dimensional image
CN110891131A (en) 2018-09-10 2020-03-17 北京小米移动软件有限公司 Camera module, processing method and device, electronic equipment and storage medium
JP7256368B2 (en) * 2019-02-06 2023-04-12 ミツミ電機株式会社 ranging camera
CN112526801B (en) * 2019-09-03 2022-01-25 宏达国际电子股份有限公司 Double-lens imaging module and extraction method thereof
CN113362224A (en) * 2021-05-31 2021-09-07 维沃移动通信有限公司 Image processing method and device, electronic equipment and readable storage medium

Family Cites Families (9)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
US7061693B2 (en) * 2004-08-16 2006-06-13 Xceed Imaging Ltd. Optical method and system for extended depth of focus
US8531662B2 (en) * 2008-06-17 2013-09-10 Koninklijke Philips N.V. Method and device for optically examining the interior of turbid media
JP4538766B2 (en) * 2008-08-21 2010-09-08 ソニー株式会社 Imaging device, display device, and image processing device
US8363093B2 (en) * 2009-07-27 2013-01-29 Eastman Kodak Company Stereoscopic imaging using split complementary color filters
EP2537332A1 (en) * 2010-02-19 2012-12-26 Dual Aperture, Inc. Processing multi-aperture image data
KR101220413B1 (en) * 2010-10-15 2013-01-09 중앙대학교 산학협력단 Apparatus and method for enhancing image quality of image captured by using multiple color-filter aperture
CN103827920B (en) * 2011-09-28 2018-08-14 皇家飞利浦有限公司 It is determined according to the object distance of image
CN102595171B (en) * 2012-02-03 2014-05-14 浙江工商大学 Imaging method and imaging system of dynamic optical fields of multichannel space-time coding apertures
CN104335246B (en) * 2012-05-01 2018-09-04 Fotonation开曼有限公司 The camera model of pattern is formed with pi optical filters group

Also Published As

Publication number Publication date
CN106471804A (en) 2017-03-01
RU2014127469A (en) 2016-01-27
EP3164992A4 (en) 2018-02-21
RU2595759C2 (en) 2016-08-27
KR20160004912A (en) 2016-01-13
CN106471804B (en) 2019-01-04

Legal Events

Code Title Description
STAA Information on the status of an EP patent application or granted EP patent. Free format text: STATUS: THE INTERNATIONAL PUBLICATION HAS BEEN MADE
PUAI Public reference made under Article 153(3) EPC to a published international application that has entered the European phase. Free format text: ORIGINAL CODE: 0009012
STAA Information on the status of an EP patent application or granted EP patent. Free format text: STATUS: REQUEST FOR EXAMINATION WAS MADE
17P Request for examination filed. Effective date: 20161222
AK Designated contracting states. Kind code of ref document: A1. Designated state(s): AL AT BE BG CH CY CZ DE DK EE ES FI FR GB GR HR HU IE IS IT LI LT LU LV MC MK MT NL NO PL PT RO RS SE SI SK SM TR
AX Request for extension of the European patent. Extension state: BA ME
DAV Request for validation of the European patent (deleted)
DAX Request for extension of the European patent (deleted)
A4 Supplementary search report drawn up and despatched. Effective date: 20180119
RIC1 Information provided on IPC code assigned before grant. Ipc: H04N 13/00 20180101AFI20180115BHEP; Ipc: G06T 7/55 20170101ALI20180115BHEP
STAA Information on the status of an EP patent application or granted EP patent. Free format text: STATUS: EXAMINATION IS IN PROGRESS
17Q First examination report despatched. Effective date: 20190515
STAA Information on the status of an EP patent application or granted EP patent. Free format text: STATUS: EXAMINATION IS IN PROGRESS
STAA Information on the status of an EP patent application or granted EP patent. Free format text: STATUS: THE APPLICATION IS DEEMED TO BE WITHDRAWN
18D Application deemed to be withdrawn. Effective date: 20210421