WO2022195537A1 - Microlens amplitude masks for flying pixel removal in time-of-flight imaging - Google Patents

Microlens amplitude masks for flying pixel removal in time-of-flight imaging

Info

Publication number
WO2022195537A1
Authority
WO
WIPO (PCT)
Prior art keywords
pixel
image
depth
computer
implemented method
Prior art date
Application number
PCT/IB2022/052448
Other languages
French (fr)
Inventor
Ilya CHUGUNOV
Seung-Hwan Baek
Qiang Fu
Wolfgang Heidrich
Felix Heide
Original Assignee
The Trustees Of Princeton University
The King Abdullah University Of Science And Technology
Priority date (The priority date is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the date listed.)
Filing date
Publication date
Application filed by The Trustees Of Princeton University, The King Abdullah University Of Science And Technology filed Critical The Trustees Of Princeton University
Priority to US17/915,536 priority Critical patent/US11657523B2/en
Priority to US17/850,321 priority patent/US20220414913A1/en
Publication of WO2022195537A1 publication Critical patent/WO2022195537A1/en

Classifications

    • GPHYSICS
    • G06COMPUTING; CALCULATING OR COUNTING
    • G06TIMAGE DATA PROCESSING OR GENERATION, IN GENERAL
    • G06T7/00Image analysis
    • G06T7/50Depth or shape recovery
    • GPHYSICS
    • G01MEASURING; TESTING
    • G01SRADIO DIRECTION-FINDING; RADIO NAVIGATION; DETERMINING DISTANCE OR VELOCITY BY USE OF RADIO WAVES; LOCATING OR PRESENCE-DETECTING BY USE OF THE REFLECTION OR RERADIATION OF RADIO WAVES; ANALOGOUS ARRANGEMENTS USING OTHER WAVES
    • G01S17/00Systems using the reflection or reradiation of electromagnetic waves other than radio waves, e.g. lidar systems
    • G01S17/88Lidar systems specially adapted for specific applications
    • G01S17/89Lidar systems specially adapted for specific applications for mapping or imaging
    • G01S17/8943D imaging with simultaneous measurement of time-of-flight at a 2D array of receiver pixels, e.g. time-of-flight cameras or flash lidar
    • GPHYSICS
    • G01MEASURING; TESTING
    • G01SRADIO DIRECTION-FINDING; RADIO NAVIGATION; DETERMINING DISTANCE OR VELOCITY BY USE OF RADIO WAVES; LOCATING OR PRESENCE-DETECTING BY USE OF THE REFLECTION OR RERADIATION OF RADIO WAVES; ANALOGOUS ARRANGEMENTS USING OTHER WAVES
    • G01S7/00Details of systems according to groups G01S13/00, G01S15/00, G01S17/00
    • G01S7/48Details of systems according to groups G01S13/00, G01S15/00, G01S17/00 of systems according to group G01S17/00
    • G01S7/481Constructional features, e.g. arrangements of optical elements
    • G01S7/4816Constructional features, e.g. arrangements of optical elements of receivers alone
    • GPHYSICS
    • G01MEASURING; TESTING
    • G01SRADIO DIRECTION-FINDING; RADIO NAVIGATION; DETERMINING DISTANCE OR VELOCITY BY USE OF RADIO WAVES; LOCATING OR PRESENCE-DETECTING BY USE OF THE REFLECTION OR RERADIATION OF RADIO WAVES; ANALOGOUS ARRANGEMENTS USING OTHER WAVES
    • G01S7/00Details of systems according to groups G01S13/00, G01S15/00, G01S17/00
    • G01S7/48Details of systems according to groups G01S13/00, G01S15/00, G01S17/00 of systems according to group G01S17/00
    • G01S7/491Details of non-pulse systems
    • G01S7/4912Receivers
    • GPHYSICS
    • G06COMPUTING; CALCULATING OR COUNTING
    • G06TIMAGE DATA PROCESSING OR GENERATION, IN GENERAL
    • G06T2207/00Indexing scheme for image analysis or image enhancement
    • G06T2207/10Image acquisition modality
    • G06T2207/10028Range image; Depth image; 3D point clouds

Definitions

  • the embodiments of the present invention generally relate to image processing, and more particularly, toward techniques for three-dimensional (3D) image processing and depth determination.
  • typical time-of-flight (ToF) depth capture cameras collect light incident to a lens, focus the incident light onto a sensor (i.e., along the sensor plane), and measure an output at each pixel of a pixel array sensor (e.g., a complementary metal-oxide-semiconductor (“CMOS”) type sensor).
  • CMOS complementary metal-oxide-semiconductor
  • one or more individual pixels receive a mixed-light signal.
  • the mixed-light may originate from multiple object surfaces at varying depths.
  • ToF imaging can be further categorized into direct and indirect techniques.
  • Direct ToF devices such as light detection and ranging (“LiDAR”) send out pulses of light, scanning over a scene and directly measuring their round-trip time using photodiodes or photon detectors. While accurate and long-ranged, these systems can produce only a few spatial measurements at a time, resulting in sparse depth maps. Furthermore, their specialized detectors are orders of magnitude more expensive than conventional CMOS sensors.
  • AMCW ToF imaging is a type of indirect ToF.
  • AMCW devices instead flood the whole scene with periodically modulated light and infer depth from phase differences between captures (i.e., using a plurality of correlation images at varying phase offsets). These captures can be acquired with a standard CMOS sensor, making AMCW ToF cameras an affordable solution for dense depth measurement.
  • direct ToF e.g., LiDAR
  • indirect ToF e.g., AMCW
  • the resultant estimated depth for a given pixel is incorrect when mixed-light is received.
  • a so-called “flying pixel” has an estimated depth that is between the objects of varying depths. As neighboring pixels also include mixed light, they cannot be reliably used to disambiguate the flying pixel artifact.
  • Fig. 1 illustrates a correlation imager system 100 according to the related art.
  • ToF camera 110 illuminates (depicted as illumination 111) a target 120 with continuously modulated light.
  • the light is reflected by target 120 (depicted as reflected signal 112), which results in an accrued depth-dependent phase shift 113.
  • the light is collected on camera sensor 114, converted to an electrical signal 115, and correlated with an on-board reference signal 116 to produce a time-of-flight correlation measurement 117 and one or more correlation images.
  • Fig. 2 illustrates the use of multiple correlation images to calculate depth according to the related art.
  • the phase of the reflected light can be determined.
  • the correlation values C at each of the varying phase offsets can be used to extract the true phase φ of the measured signal according to Eq. (1), where 2πn is a phase ambiguity for certain depths.
  • the phase φ is calculated.
  • a phase map 220 of the correlation images can be converted into a depth map 230.
  • depth z is calculated according to Eq. (2), where c is the speed of light and ω is a modulation frequency of the amplitude modulated light that is used for illumination (depicted as illumination 111 in Fig. 1).
  • indirect ToF methods are still subject to fundamental limitations of the sensing process including noise from ambient light, photon shot, phase wrapping, multipath interference (MPI), and flying pixels.
  • MPI multipath interference
  • Fig. 3 illustrates the reflection of a mixed-light signal according to the related art.
  • foreground object 221 and background object 222 are illuminated (depicted as illumination 111).
  • ToF camera lens 218 receives reflected light from both objects through aperture 219, and focuses the light to produce foreground signal 231 and background signal 232 on pixel 220 of sensor 114.
  • Mixed light including foreground signal 231 and background signal 232 is used to calculate the depth of the target object (e.g., either foreground object 221 or background object 222).
  • the mixed light produces a mixed depth measurement: the calculated depth does not accurately reflect the depth of the target object, and a flying pixel 240 is produced.
  • Flying pixels, such as flying pixel 240, frequently occur around or near depth edges, where light paths from both an object and its background or foreground are integrated over the aperture.
  • the inventors have developed mask-ToF learning microlens masks for flying pixel correction in ToF imaging to overcome the limitations and drawbacks of the related art devices.
  • the present invention is directed to microlens amplitude masks for flying pixel removal in time-of-flight imaging that substantially obviates one or more problems due to limitations and disadvantages of the related art. Additional features and advantages of the invention will be set forth in the description which follows, and in part will be apparent from the description, or may be learned by practice of the invention. The objectives and other advantages of the invention will be realized and attained by the structure particularly pointed out in the written description and claims hereof as well as the appended drawings.
  • flying pixels are pervasive artifacts that occur at object boundaries, where background and foreground light mix to produce erroneous measurements that can negatively impact downstream 3D vision tasks, such as depth determination.
  • the embodiments of the present invention generate a microlens-level occlusion mask pattern which modulates the selection of foreground and background light on a per-pixel basis.
  • the embodiments of the present invention are able to effectively decode these modulated measurements to produce high fidelity depth reconstructions with significantly reduced flying pixel counts.
  • the microlens amplitude masks for flying pixel removal in time-of-flight imaging includes systems, devices, methods, and instructions for image depth determination, including receiving an image, adding noise to the image, determining a set of correlation images, each correlation image having a varying phase offset, for each pixel of the image, generating a masked pixel by applying a mask array, and for each masked pixel, determining the depth of the masked pixel to generate a depth map for the image on a per pixel basis.
  • the microlens amplitude masks for flying pixel removal in time-of-flight imaging includes systems, devices, methods, and instructions for image depth determination, including a time-of-flight system for image depth determination, the system including a lens configured to receive incident light, and a light sensor having a plurality of pixels, the light sensor configured to receive the incident light through a plurality of masks, each pixel corresponding to a respective mask that selectively blocks incident light paths to provide differentiable apertures for neighboring pixels.
  • Fig. 1 illustrates a correlation imager system according to the related art.
  • Fig. 2 illustrates the use of multiple correlation images to calculate depth according to the related art.
  • Fig. 3 illustrates the reflection of a mixed-light signal according to the related art.
  • Fig. 4 illustrates a camera system having a microlens mask according to an example embodiment of the present invention.
  • Figs. 5A and 5B illustrate the data used for generation of a mask pattern according to an example embodiment of the present invention.
  • Fig. 6 illustrates the generation of a masked pixel according to an example embodiment of the present invention.
  • Fig. 7 illustrates the generation of a decoded depth construction according to an example embodiment of the present invention.
  • Fig. 8 illustrates the updating of a mask pattern according to an example embodiment of the present invention.
  • FIG. 9 illustrates a computer-implemented method for depth determination according to an example embodiment of the present invention.
  • Flying pixels are pervasive artifacts in ToF imaging which occur at object discontinuities, where both foreground and background light signal is integrated over the camera aperture.
  • the light mixes at a sensor pixel to produce erroneous depth estimates, which then adversely affect downstream 3D vision tasks, such as depth determination.
  • the embodiments of the present invention introduce a custom-shaped sub-aperture for each sensor pixel.
  • the embodiments of the present invention generate a microlens-level occlusion mask which effectively generates a custom-shaped sub-aperture for each sensor pixel.
  • Fig. 4 illustrates a camera system 400 having a microlens mask 411 according to an example embodiment of the present invention.
  • Microlens mask 411, selected from a plurality of microlens mask patterns 410, is disposed between the sensor (e.g., CMOS sensor pixel 420) and microlens 430.
  • the aperture of microlens mask 411 is configured to selectively block incident light paths to enable a custom aperture for each pixel. This modulates the selection of foreground and background light mixtures on a per-pixel basis and further encodes scene geometric information directly into the ToF measurements.
  • microlens mask 411 provides spatially varying susceptibility to noise and flying pixels, and is used to de-noise and reduce the occurrence of flying pixels.
  • use of microlens mask 411 with its learned mask pattern (as described below), further enables measurements from neighboring pixels with different effective apertures to provide additional data to accurately identify and rectify flying pixels.
  • a mask 411 may be photolithographically disposed on each pixel of sensor 420 during fabrication of the sensor.
  • a custom optical relay system was used to validate the mask pattern.
  • the mask 411 can be fabricated directly on each pixel of sensor 420.
  • Although camera system 400 depicts a microlens 430, microlens mask 411, and pixel of sensor 420, the embodiments are not so limited.
  • a variety of lens sizes and types can be used, a mask array having a plurality of masks 411 can be used, and a variety of sensor types can be used.
  • Figs. 5A and 5B illustrate the data used for generation of a mask pattern according to an example embodiment of the present invention.
  • the generation of a mask is driven by an image having a set of light field data 510 that includes scene view data from multiple viewing angles.
  • a set of correlation values 521 (as a function of cos(φ + ψ), where φ is the phase and ψ is the phase offset) produces a set of correlation images 511A, 511B, 511C, 511D that are determined for each sub-aperture view at varying phase offsets (e.g., 0, π, π/2, 3π/2, respectively).
  • Correlation values can include weights to encode depth data of depth map 530.
  • depth data of depth map 530 is determined using ToF measurements.
  • Simulated noise 522 is added to light field data 510 or the set of correlation images 511A, 511B, 511C, 511D at varying phase offsets (e.g., 0, π, π/2, 3π/2, respectively).
  • simulated noise 522 can include noise according to a Poisson distribution or a Skellam distribution that approximates Gaussian noise.
  • the introduction of noise can be to simulate system and/or environmental perturbations.
  • Sub-aperture views 541-549 correspond to a subsection of a respective correlation image, such as a subsection or pixel group 540.
  • each of sub-aperture views 541-549 corresponds to respective viewing angle data contained in light field data 510.
  • the set of light field data 510 of correlation image 511 with depth map 530 are used to determine ToF amplitude measurements.
  • the time of flight measurements are decoded or otherwise extracted from the set of light field data 510 to determine initial depth estimate for depth map 530.
  • Fig. 6 illustrates the generation of a masked pixel according to an example embodiment of the present invention.
  • a masked pixel 660 is produced.
  • sub-aperture pixels 640 are weighted according to a mask array 650.
  • simulated noise can be added, and the weighted sub-aperture pixels are combined with the simulated noise to produce an initial depth estimate on a per pixel basis.
  • Fig. 7 illustrates the generation of a decoded depth construction according to an example embodiment of the present invention.
  • each generated masked pixel 660 (e.g., generated using the masking process illustrated in Fig. 6) is processed by a convolution refinement network 770 to output a decoded and refined depth reconstruction map 780.
  • Initial depth estimates are input as masked pixels 660 to convolution refinement network 770 that decodes the spatially varying pixel measurements to produce refined (e.g., more accurate, more granular, etc.) depth estimates as refined depth reconstruction map 780.
  • an estimated depth map can be generated from multiple (e.g., four) masked correlation images.
  • the depth can be estimated using Eq. (1) and Eq. (2), or alternatively, other depth estimation techniques can be used, such as the discrete Fourier transform.
  • Convolution refinement network 770 is a residual encoder-decoder model, implemented using a memory and a graphical processing unit (“GPU”) or other processor, that takes an initial depth estimate and mask information as input and outputs refined depth reconstruction map 780.
  • refined depth reconstruction map 780 can be calculated according to Eq. (3), where D̂* is the refined depth map, R is the convolution refinement network, P(C) is the initial depth estimate, M is the mask, D̂ is the initial depth estimate, and D̂_R is the refined residual depth which, when added to D̂, serves to correct the now spatially multiplexed effects of noise and flying pixels.
  • Eq. (3) in contrast to Eq. (1) and Eq. (2) introduces the use of an initial depth calculation.
  • convolution refinement network 770 does not generate depth from phase, and the processing and computational needs of convolution refinement network 770 are substantially reduced as compared to a conventional deep reconstruction network.
  • convolution refinement network 770 quickly determines high level depth and mask features, as well as determines other image information where raw phase data might significantly differ from a training set.
  • the sequential depth estimation and refinement approach also enables calibration procedures implemented by the sensor manufacturers. Real depth data can be supplied to convolution refinement network 770 without having to retrain and learn calibration offsets.
  • the encoder-decoder model of convolution refinement network 770 is configured to aggregate the spatial information and utilize mask structural cues to produce refined depth estimates.
  • the errors between initial depth estimates and refined depth estimates can be used to improve mask patterns.
  • Fig. 8 illustrates the updating of a mask pattern according to an example embodiment of the present invention.
  • errors in the depth calculations of convolution refinement network 770 (e.g., between the initial depth and refined depth) are calculated. Because the errors are calculated with respect to the light field depth, they can be used to improve convolution refinement network 770 and mask array 650 (e.g., as illustrated in Fig. 6). In this way, starting at an initial mask, the embodiments can simultaneously determine an encoding, or otherwise update a mask pattern, and decoding network weights.
  • the updated mask pattern can be applied on (e.g., photolithographically) each pixel of a sensor (e.g., sensor 420) during fabrication of the sensor.
  • With a global aperture of the related art, as illustrated in Fig. 3, all pixels are equally susceptible to flying pixels, and if one sensor pixel returns a flying pixel, its neighboring sensor pixels likely will too.
  • the addition of spatially variable susceptibility via a microlens mask means that neighboring pixels are no longer equally susceptible to noise and/or flying pixels.
  • a sensor pixel with a wide effective aperture can be trusted with regards to noise statistics, but is likely to return flying pixels if near an object boundary. Contrastingly, a neighboring pixel with a narrow aperture will likely produce noisier measurements, but be less affected by depth discontinuities.
  • FIG. 9 illustrates a computer-implemented method 900 for depth determination according to an example embodiment of the present invention.
  • the computer-implemented method can be implemented using one or more memory devices (e.g., a non-transitory memory), one or more processing devices (e.g., a CPU, GPU, etc.), and/or one or more communication channels to transmit one or more instructions.
  • memory devices e.g., a non-transitory memory
  • processing devices e.g., a CPU, GPU, etc.
  • communication channels to transmit one or more instructions.
  • method 900 receives an image (e.g., an image containing a set of light field data 510 as illustrated in Fig. 5).
  • the image may include one or more objects, one or more surfaces, and is captured by a ToF camera.
  • method 900 adds simulated noise (e.g., noise 522 as illustrated in Fig. 5) to the image.
  • Simulated noise is added to the image.
  • simulated noise can include noise according to a Poisson distribution or a Skellam distribution that approximates Gaussian noise.
  • method 900 generates a set of correlation images, each correlation image having a varying phase offset (e.g., correlation images 511A, 511B, 511C, 511D as illustrated in Fig. 5), at 930.
  • a varying phase offset e.g., correlation images 511A, 511B, 511C, 511D as illustrated in Fig. 5
  • For each pixel of the image, method 900 generates a masked pixel by applying a mask array.
  • a mask array such as mask array 650, including a set of micro-lens masks 651-659
  • a masked pixel (such as masked pixel 660) is produced.
  • sub-aperture pixels 640 are weighted according to a mask array 650. The weighted sub-aperture pixels are combined with the simulated noise to produce an initial depth estimate on a per pixel basis.
  • method 900 determines the depth of the masked pixel to generate a depth map for the image on a per pixel basis.
  • the respective depths of masked pixels can be determined using a convolution refinement network (such as convolution refinement network 770).
  • convolution refinement network 770 such as convolution refinement network 770
  • other known depth determination techniques may be used.
  • a pinhole aperture produces an extremely noisy reconstruction
  • an open aperture produces blurred edges with a plethora of flying pixels
  • the mask pattern provides substantially improved depth determination with acceptable SNR and substantially reduced flying pixels.
  • the mask pattern achieves a 30% reduction in flying pixels as compared to an identical light throughput using a global aperture mask.
  • the results generalize to scenes of varying geometry and surface material.
  • the results were achieved without re-training or fine-tuning the convolution refinement network.
  • the embodiments of the invention can be readily applied to numerous applications. Some non-exhaustive examples include cameras for mobile phones or tablets, autonomous vehicles, collision avoidance, delivery robotics, cartography including topography and other 3D maps, gaming, augmented reality (“AR”), virtual reality (“VR”), facial identification, and others. It will be apparent to those skilled in the art that various modifications and variations can be made in the microlens amplitude masks for flying pixel removal in time-of-flight imaging of the present invention without departing from the spirit or scope of the invention. Thus, it is intended that the present invention cover the modifications and variations of this invention provided they come within the scope of the appended claims and their equivalents.
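The masked-pixel step described in the list above (sub-aperture pixels weighted by a mask array, then combined) can be sketched as follows. This is a minimal illustration, not the patented implementation: the idealized cos(φ + ψ) correlation model, the 3×3 layout of nine sub-aperture views, the 20 MHz modulation frequency, the binary mask values, and all function names are assumptions, and simulated noise is left out to keep the effect of the mask visible.

```python
import numpy as np

C_LIGHT = 299_792_458.0  # speed of light in m/s
PSI = np.array([0.0, np.pi, 0.5 * np.pi, 1.5 * np.pi])  # phase offsets 0, pi, pi/2, 3pi/2


def sub_aperture_correlations(depths, mod_freq_hz):
    """Noise-free correlation values, shape (n_views, 4), one row per sub-aperture view."""
    phi = 4.0 * np.pi * mod_freq_hz * np.asarray(depths, dtype=float) / C_LIGHT
    return np.cos(phi[:, None] + PSI[None, :])


def masked_pixel(correlations, mask):
    """Weight each sub-aperture view by its mask entry and sum over the views."""
    weights = np.asarray(mask, dtype=float).reshape(-1)
    return weights @ correlations  # shape (4,): one value per phase offset


def depth_from_correlations(c, mod_freq_hz):
    """Four-phase depth estimate from correlation values ordered as in PSI."""
    phi = np.mod(np.arctan2(c[3] - c[2], c[0] - c[1]), 2.0 * np.pi)
    return C_LIGHT * phi / (4.0 * np.pi * mod_freq_hz)


# Hypothetical pixel on a depth edge: four views see a foreground at 1 m,
# five views (including the central one) see a background at 3 m.
F_MOD = 20e6  # assumed modulation frequency in Hz
view_depths = [1.0, 1.0, 1.0, 1.0, 3.0, 3.0, 3.0, 3.0, 3.0]
corr = sub_aperture_correlations(view_depths, F_MOD)

open_mask = np.ones((3, 3))                                   # global open aperture
selective_mask = np.array([[0, 0, 0], [0, 1, 1], [1, 1, 1]])  # blocks foreground views

z_open = depth_from_correlations(masked_pixel(corr, open_mask), F_MOD)
z_selective = depth_from_correlations(masked_pixel(corr, selective_mask), F_MOD)
print(f"open aperture: {z_open:.3f} m, selective mask: {z_selective:.3f} m")
```

Under these assumptions the open aperture reports a depth between the two surfaces (a flying pixel), while the selective mask recovers the background depth at the cost of passing fewer views, which is the throughput trade-off the list describes.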

Abstract

The microlens amplitude masks for flying pixel removal in time-of-flight imaging includes systems, devices, methods, and instructions for image depth determination, including receiving an image, adding noise to the image, determining a set of correlation images, each correlation image having a varying phase offset, for each pixel of the image, generating a masked pixel by applying a mask array, and for each masked pixel, determining the depth of the masked pixel to generate a depth map for the image on a per pixel basis.

Description

MICROLENS AMPLITUDE MASKS FOR FLYING PIXEL REMOVAL IN TIME-OF-FLIGHT IMAGING
Federally Sponsored Research
This invention was made with government support under Grant No. IIS-2047359 awarded by the National Science Foundation (NSF). The United States Government has certain rights in the invention.
BACKGROUND OF THE INVENTION
Field of the Invention
The embodiments of the present invention generally relate to image processing, and more particularly, toward techniques for three-dimensional (3D) image processing and depth determination.
Discussion of the Related Art
At present, typical time-of-flight (ToF) depth capture cameras collect light incident to a lens, focus the incident light onto a sensor (i.e., along the sensor plane), and measure an output at each pixel of a pixel array sensor (e.g., a complementary metal-oxide-semiconductor (“CMOS”) type sensor). In some instances, one or more individual pixels receive a mixed-light signal. Depending on the subject matter of the image, the mixed light may originate from multiple object surfaces at varying depths.
ToF imaging can be further categorized into direct and indirect techniques. Direct ToF devices such as light detection and ranging (“LiDAR”) send out pulses of light, scanning over a scene and directly measuring their round-trip time using photodiodes or photon detectors. While accurate and long-ranged, these systems can produce only a few spatial measurements at a time, resulting in sparse depth maps. Furthermore, their specialized detectors are orders of magnitude more expensive than conventional CMOS sensors.
Amplitude modulated continuous wave (“AMCW”) ToF imaging is a type of indirect ToF. AMCW devices instead flood the whole scene with periodically modulated light and infer depth from phase differences between captures (i.e., using a plurality of correlation images at varying phase offsets). These captures can be acquired with a standard CMOS sensor, making AMCW ToF cameras an affordable solution for dense depth measurement. In current ToF imaging applications, both direct ToF (e.g., LiDAR) and indirect ToF (e.g., AMCW), the resultant estimated depth for a given pixel is incorrect when mixed light is received. A so-called “flying pixel” has an estimated depth that is between the objects of varying depths. As neighboring pixels also include mixed light, they cannot be reliably used to disambiguate the flying pixel artifact.
Fig. 1 illustrates a correlation imager system 100 according to the related art. As illustrated in Fig. 1, ToF camera 110 illuminates (depicted as illumination 111) a target 120 with continuously modulated light. The light is reflected by target 120 (depicted as reflected signal 112), which results in an accrued depth-dependent phase shift 113. The light is collected on camera sensor 114, converted to an electrical signal 115, and correlated with an on-board reference signal 116 to produce a time-of-flight correlation measurement 117 and one or more correlation images.
Fig. 2 illustrates the use of multiple correlation images to calculate depth according to the related art. As illustrated in Fig. 2, by collecting multiple (e.g., four) correlation images 211A, 211B, 211C, 211D with varying phase offsets (e.g., 0, π, π/2, 3π/2, respectively), the phase of the reflected light can be determined. Here, the correlation values C at each of the varying phase offsets can be used to extract the true phase φ of the measured signal according to:
[Equation (1); original image: imgf000004_0001]
where 2πn is a phase ambiguity for certain depths. For each pixel, the phase φ is calculated. Subsequently, a phase map 220 of the correlation images can be converted into a depth map 230. For each pixel, depth z is calculated according to:
[Equation (2); original image: imgf000004_0002]
where c is the speed of light and ω is a modulation frequency of the amplitude modulated light that is used for illumination (depicted as illumination 111 in Fig. 1).
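The two equations referenced above survive only as image placeholders in this text. As a minimal sketch of the phase and depth recovery they describe, the code below assumes an idealized correlation of the form cos(φ + ψ), from which the widely used four-phase estimate φ = atan2(C_{3π/2} − C_{π/2}, C_0 − C_π) follows, and it assumes the modulation frequency is given in Hz so that z = cφ / (4πf). If ω is instead an angular frequency, the equivalent form is z = cφ / (2ω). The function names and the 100 MHz example frequency are hypothetical, and the exact sign convention of the published equation may differ.

```python
import numpy as np

C_LIGHT = 299_792_458.0  # speed of light in m/s


def correlation(phi, psi, amplitude=1.0, offset=0.0):
    """Idealized correlation value for true phase phi and phase offset psi."""
    return amplitude * np.cos(phi + psi) + offset


def phase_from_correlations(c_0, c_pi, c_half_pi, c_three_half_pi):
    """Wrapped phase in [0, 2*pi) from four correlation values.

    Any constant offset and amplitude cancel in the differences and the ratio.
    """
    phi = np.arctan2(c_three_half_pi - c_half_pi, c_0 - c_pi)
    return np.mod(phi, 2.0 * np.pi)


def depth_from_phase(phi, mod_freq_hz):
    """Depth for a round-trip phase phi, with the modulation frequency in Hz."""
    return C_LIGHT * phi / (4.0 * np.pi * mod_freq_hz)


# Round trip for a surface at 1 m, inside the unambiguous range c / (2 f) of about 1.5 m.
F_MOD = 100e6  # assumed modulation frequency in Hz
z_true = 1.0
phi_true = 4.0 * np.pi * F_MOD * z_true / C_LIGHT
samples = [correlation(phi_true, psi, amplitude=0.7, offset=2.0)
           for psi in (0.0, np.pi, 0.5 * np.pi, 1.5 * np.pi)]
z_est = depth_from_phase(phase_from_correlations(*samples), F_MOD)
print(f"estimated depth: {z_est:.6f} m")
```

Depths beyond c / (2f) wrap around to smaller values, which is the 2πn ambiguity mentioned in the text.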
However, the related art techniques are subject to various limitations and drawbacks. For example, indirect ToF methods are still subject to fundamental limitations of the sensing process including noise from ambient light, photon shot, phase wrapping, multipath interference (MPI), and flying pixels.
Fig. 3 illustrates the reflection of a mixed-light signal according to the related art. As illustrated in Fig. 3, foreground object 221 and background object 222 are illuminated (depicted as illumination 111). ToF camera lens 218 receives reflected light from both objects through aperture 219, and focuses the light to produce foreground signal 231 and background signal 232 on pixel 220 of sensor 114.
Mixed light including foreground signal 231 and background signal 232 is used to calculate the depth of the target object (e.g., either foreground object 221 or background object 222). However, the mixed light produces a mixed depth measurement: the calculated depth does not accurately reflect the depth of the target object, and a flying pixel 240 is produced.
Flying pixels, such as flying pixel 240, frequently occur around or near depth edges, where light paths from both an object and its background or foreground are integrated over the aperture.
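To see why mixed light yields a depth between the two surfaces, each return can be modeled as a phasor whose angle encodes depth; a pixel that integrates both returns measures the angle of their sum. The sketch below is illustrative only: it assumes equal reflectance for both surfaces, a single hypothetical 20 MHz modulation frequency, and a simple linear mixing fraction, none of which come from the source.

```python
import numpy as np

C_LIGHT = 299_792_458.0  # speed of light in m/s


def depth_to_phase(z, mod_freq_hz):
    """Round-trip phase accrued by light reflected from depth z."""
    return 4.0 * np.pi * mod_freq_hz * z / C_LIGHT


def phase_to_depth(phi, mod_freq_hz):
    """Inverse of depth_to_phase."""
    return C_LIGHT * phi / (4.0 * np.pi * mod_freq_hz)


def mixed_pixel_depth(z_fg, z_bg, fg_fraction, mod_freq_hz):
    """Depth reported by a pixel that integrates foreground and background light."""
    phasor = (fg_fraction * np.exp(1j * depth_to_phase(z_fg, mod_freq_hz))
              + (1.0 - fg_fraction) * np.exp(1j * depth_to_phase(z_bg, mod_freq_hz)))
    phi = np.mod(np.angle(phasor), 2.0 * np.pi)
    return phase_to_depth(phi, mod_freq_hz)


F_MOD = 20e6  # assumed modulation frequency in Hz
z_fg, z_bg = 1.0, 3.0
z_mix = mixed_pixel_depth(z_fg, z_bg, fg_fraction=0.5, mod_freq_hz=F_MOD)
print(f"foreground {z_fg} m, background {z_bg} m, mixed pixel reports {z_mix:.3f} m")
```

The reported depth corresponds to neither surface, so the reconstructed point floats in the empty space between them.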
One common solution to reduce flying pixel count is to narrow the camera aperture. However, a narrower aperture is not light efficient: it reduces overall light throughput and increases the system’s susceptibility to noise.
Unfortunately, such a masking approach (i.e., reducing aperture size) significantly lowers the signal-to-noise ratio (“SNR”). Thus, there exists a strict SNR versus flying pixel tradeoff for typical ToF depth cameras.
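The noise side of this tradeoff can be illustrated with a small Monte Carlo sketch. It assumes Poisson photon noise on four correlation measurements and models a narrower aperture simply as a lower light throughput; the photon budget, modulation contrast, and function name are hypothetical values chosen for illustration rather than parameters from the source.

```python
import numpy as np

PSI = np.array([0.0, np.pi, 0.5 * np.pi, 1.5 * np.pi])  # phase offsets 0, pi, pi/2, 3pi/2


def phase_noise_std(throughput, photons_full=2000.0, trials=5000, seed=0):
    """Standard deviation of the four-phase estimate under Poisson photon noise.

    throughput is the fraction of the full-aperture light that reaches the pixel.
    """
    rng = np.random.default_rng(seed)
    phi_true = 1.0  # arbitrary true phase in radians
    # Expected photon counts: a constant term plus a modulated term, scaled by throughput.
    mean_counts = throughput * photons_full * (1.0 + 0.5 * np.cos(phi_true + PSI))
    counts = rng.poisson(mean_counts, size=(trials, 4)).astype(float)
    phi = np.arctan2(counts[:, 3] - counts[:, 2], counts[:, 0] - counts[:, 1])
    return float(np.std(phi))


std_wide = phase_noise_std(throughput=1.0)    # open aperture
std_narrow = phase_noise_std(throughput=0.1)  # narrow aperture, one tenth of the light
print(f"phase std, wide aperture: {std_wide:.4f} rad")
print(f"phase std, narrow aperture: {std_narrow:.4f} rad")
```

Because depth is proportional to phase, the larger phase spread at low throughput translates directly into noisier depth, which is the cost paid for fewer flying pixels.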
Accordingly, the inventors have developed mask-ToF learning microlens masks for flying pixel correction in ToF imaging to overcome the limitations and drawbacks of the related art devices.
SUMMARY
Accordingly, the present invention is directed to microlens amplitude masks for flying pixel removal in time-of-flight imaging that substantially obviates one or more problems due to limitations and disadvantages of the related art. Additional features and advantages of the invention will be set forth in the description which follows, and in part will be apparent from the description, or may be learned by practice of the invention. The objectives and other advantages of the invention will be realized and attained by the structure particularly pointed out in the written description and claims hereof as well as the appended drawings.
As discussed above, flying pixels are pervasive artifacts that occur at object boundaries, where background and foreground light mix to produce erroneous measurements that can negatively impact downstream 3D vision tasks, such as depth determination. The embodiments of the present invention generate a microlens-level occlusion mask pattern which modulates the selection of foreground and background light on a per-pixel basis.
When configured in an end-to-end fashion with a depth refinement network, the embodiments of the present invention are able to effectively decode these modulated measurements to produce high fidelity depth reconstructions with significantly reduced flying pixel counts.
To achieve these and other advantages and in accordance with the purpose of the present invention, as embodied and broadly described, the microlens amplitude masks for flying pixel removal in time-of-flight imaging includes systems, devices, methods, and instructions for image depth determination, including receiving an image, adding noise to the image, determining a set of correlation images, each correlation image having a varying phase offset, for each pixel of the image, generating a masked pixel by applying a mask array, and for each masked pixel, determining the depth of the masked pixel to generate a depth map for the image on a per pixel basis.
In another aspect, the microlens amplitude masks for flying pixel removal in time-of-flight imaging includes systems, devices, methods, and instructions for image depth determination, including a time-of-flight system for image depth determination, the system comprising a lens configured to receive incident light, and a light sensor having a plurality of pixels, the light sensor configured to receive the incident light through a plurality of masks, each pixel corresponding to a respective mask that selectively blocks incident light paths to provide differentiable apertures for neighboring pixels.
It is to be understood that both the foregoing general description and the following detailed description are exemplary and explanatory and are intended to provide further explanation of the invention as claimed.
BRIEF DESCRIPTION OF THE DRAWINGS
The accompanying drawings, which are included to provide a further understanding of the invention and are incorporated in and constitute a part of this specification, illustrate embodiments of the invention and together with the description serve to explain the principles of the invention.
Fig. 1 illustrates a correlation imager system according to the related art.
Fig. 2 illustrates the use of multiple correlation images to calculate depth according to the related art.
Fig. 3 illustrates the reflection of a mixed-light signal according to the related art.
Fig. 4 illustrates a camera system having a microlens mask according to an example embodiment of the present invention.
Figs. 5A and 5B illustrate the data used for generation of a mask pattern according to an example embodiment of the present invention.
Fig. 6 illustrates the generation of a masked pixel according to an example embodiment of the present invention.
Fig. 7 illustrates the generation of a decoded depth construction according to an example embodiment of the present invention.
Fig. 8 illustrates the updating of a mask pattern according to an example embodiment of the present invention.
Fig. 9 illustrates a computer-implemented method for depth determination according to an example embodiment of the present invention.
DETAILED DESCRIPTION OF THE INVENTION
Reference will now be made in detail to the embodiments of the present invention, examples of which are illustrated in the accompanying drawings. Wherever possible, like reference numbers will be used for like elements.
Flying pixels are pervasive artifacts in ToF imaging which occur at object discontinuities, where both foreground and background light signal is integrated over the camera aperture. The light mixes at a sensor pixel to produce erroneous depth estimates, which then adversely affect downstream 3D vision tasks, such as depth determination. The embodiments of the present invention introduce a custom-shaped sub-aperture for each sensor pixel. For example, the embodiments of the present invention generate a microlens-level occlusion mask which effectively generates a custom-shaped sub-aperture for each sensor pixel. By customizing the aperture for each sensor pixel, the effects of flying pixels are significantly reduced.
Fig. 4 illustrates a camera system 400 having a microlens mask 411 according to an example embodiment of the present invention.
Microlens mask 411, selected from a plurality of microlens mask patterns 410, is disposed between the sensor (e.g., CMOS sensor pixel 420) and microlens 430. The aperture of microlens mask 411 is configured to selectively block incident light paths to enable a custom aperture for each pixel. This modulates the selection of foreground and background light mixtures on a per-pixel basis and further encodes scene geometric information directly into the ToF measurements. Thus, microlens mask 411 provides spatially varying susceptibility to noise and flying pixels, and is used to de-noise and reduce the occurrence of flying pixels. In addition, use of microlens mask 411, with its learned mask pattern (as described below), further enables measurements from neighboring pixels with different effective apertures to provide additional data to accurately identify and rectify flying pixels.
For example, a mask 411 may be photolithographically disposed on each pixel of sensor 420 during fabrication of the sensor. A custom optical relay system was used to validate the mask pattern. In another example, the mask 411 can be fabricated directly on each pixel of sensor 420. Although camera system 400 depicts a microlens 430, microlens mask 411, and pixel of sensor 420, the embodiments are not so limited. A variety of lens sizes and types can be used, a mask array having a plurality of masks 411 can be used, and a variety of sensor types can be used.
Figs. 5A and 5B illustrate the data used for generation of a mask pattern according to an example embodiment of the present invention.
As illustrated in Fig. 5A, the generation of a mask is driven by an image having a set of light field data 510 that includes scene view data from multiple viewing angles. Using as input the set of light field data 510, a set of correlation values 521 (as a function of cos(φ + ψ), where φ is the phase and ψ is the phase offset) produces a set of correlation images 511A, 511B, 511C, 511D that are determined for each sub-aperture view at varying phase offsets (e.g., 0, π, π/2, 3π/2, respectively). Correlation values can include weights to encode depth data of depth map 530. Typically, depth data of depth map 530 is determined using ToF measurements.
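The correlation step can be sketched in a few lines of NumPy. This is an illustrative sketch only: the continuous-wave phase model (phase = 4π·f_mod·depth / c), the 20 MHz modulation frequency, and the names `correlation_images`, `f_mod`, and `PHASE_OFFSETS` are assumptions introduced here and are not taken from the specification.

```python
import numpy as np

C_LIGHT = 299_792_458.0  # speed of light in m/s

# Phase offsets in the order listed in the text: 0, pi, pi/2, 3*pi/2.
PHASE_OFFSETS = (0.0, np.pi, np.pi / 2.0, 3.0 * np.pi / 2.0)


def correlation_images(depth, amplitude, f_mod=20e6, offsets=PHASE_OFFSETS):
    """Return amplitude * cos(phase + offset) for each offset, stacked on axis 0.

    depth and amplitude may have any matching shape, e.g. (H, W) for one
    sub-aperture view or (U, V, H, W) for a whole light field.
    """
    depth = np.asarray(depth, dtype=float)
    # Assumed model: round-trip phase of a continuous-wave signal at f_mod.
    phase = 4.0 * np.pi * f_mod * depth / C_LIGHT
    return np.stack([amplitude * np.cos(phase + psi) for psi in offsets])


depth = np.array([[1.0, 1.0], [3.0, 3.0]])  # metres
amplitude = np.ones_like(depth)
corr = correlation_images(depth, amplitude)
print(corr.shape)  # (4, 2, 2)
```

Because the offsets come in pairs separated by π, the images at 0 and π (and at π/2 and 3π/2) are negatives of each other under this model, which is what later allows the phase to be recovered from their differences.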
Simulated noise 522 is added to light field data 510 or the set of correlation images 511A, 511B, 511C, 511D at varying phase offsets (e.g., 0, π, π/2, 3π/2, respectively). For example, simulated noise 522 can include noise according to a Poisson distribution or a Skellam distribution that approximates Gaussian noise. The introduction of noise can be to simulate system and/or environmental perturbations.
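One way to simulate such noise is shown below, relying on the fact that the difference of two independent Poisson variables follows a Skellam distribution. The two-bucket interpretation, the `photon_scale` and `background` parameters, and the function name `add_skellam_noise` are hypothetical choices made for illustration, not values from the specification.

```python
import numpy as np


def add_skellam_noise(correlation, photon_scale=1000.0, background=50.0, rng=None):
    """Add Skellam-distributed noise to a signed correlation image.

    The signed value is split into two non-negative photon counts (an assumed
    two-bucket model). Each count is drawn from a Poisson distribution and the
    difference of the two draws is Skellam distributed.
    """
    rng = np.random.default_rng() if rng is None else rng
    correlation = np.asarray(correlation, dtype=float)
    bucket_a = photon_scale * np.clip(correlation, 0.0, None) + background
    bucket_b = photon_scale * np.clip(-correlation, 0.0, None) + background
    counts = rng.poisson(bucket_a) - rng.poisson(bucket_b)
    return counts / photon_scale  # back to the original signal scale


rng = np.random.default_rng(0)
clean = np.full((200, 200), 0.5)
noisy = add_skellam_noise(clean, rng=rng)
```

The noisy image is unbiased (its mean matches the clean signal), while a larger `background` or smaller `photon_scale` raises the noise level, mimicking ambient light or a smaller effective aperture.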
As illustrated in Fig. 5B, to determine the microlens mask, ToF data of correlation image 511 with sub-aperture views 541-549 are used. Sub-aperture views 541-549 correspond to a subsection of a respective correlation image, such as a subsection or pixel group 540. In addition, each of sub-aperture views 541-549 corresponds to respective viewing angle data contained in light field data 510.
As there are no available datasets, the set of light field data 510 of correlation image 511 with depth map 530 is used to determine ToF amplitude measurements. In some embodiments, the time-of-flight measurements are decoded or otherwise extracted from the set of light field data 510 to determine an initial depth estimate for depth map 530.
Fig. 6 illustrates the generation of a masked pixel according to an example embodiment of the present invention.
By multiplication of a set of sub-aperture pixels 640 (e.g., including sub-aperture pixels 641-649) by a mask array 650 (e.g., including a set of micro-lens masks 651-659) and summing the results on a per pixel basis, a masked pixel 660 is produced. Here, sub-aperture pixels 640 are weighted according to a mask array 650. As discussed above, simulated noise can be added, and the weighted sub-aperture pixels are combined with the simulated noise to produce an initial depth estimate on a per pixel basis.
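The multiply-and-sum operation described above can be sketched as follows. The array layout (a 3x3 grid of sub-aperture views indexed on the first two axes) and the name `masked_pixels` are assumptions made for this example.

```python
import numpy as np


def masked_pixels(subaperture_views, mask_array):
    """Weight each sub-aperture view by its mask entry and sum over views.

    subaperture_views: shape (U, V, H, W), one image per viewing angle.
    mask_array: shape (U, V, H, W) for per-pixel masks, or (U, V, 1, 1)
                to apply the same mask to every pixel.
    Returns an (H, W) image of masked pixels.
    """
    views = np.asarray(subaperture_views, dtype=float)
    masks = np.asarray(mask_array, dtype=float)
    return (views * masks).sum(axis=(0, 1))


views = np.ones((3, 3, 2, 2))       # 3x3 sub-aperture views of a 2x2 image
open_mask = np.ones((3, 3, 1, 1))   # every light path passes
pinhole = np.zeros((3, 3, 1, 1))
pinhole[1, 1] = 1.0                 # only the central light path passes

print(masked_pixels(views, open_mask)[0, 0])  # 9.0
print(masked_pixels(views, pinhole)[0, 0])    # 1.0
```

The open mask collects nine times the light of the pinhole mask, which illustrates the trade-off the mask pattern negotiates: more light paths improve signal level but mix more viewing angles at object boundaries.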
Fig. 7 illustrates the generation of a decoded depth construction according to an example embodiment of the present invention.
For a given masked correlation image, each generated masked pixel 660 (e.g., generated using the masking process as illustrated in Fig. 6) is processed by a convolution refinement network 770 to output a decoded and refined depth reconstruction map 780. Initial depth estimates are input as masked pixels 660 to convolution refinement network 770 that decodes the spatially varying pixel measurements to produce refined (e.g., more accurate, more granular, etc.) depth estimates as refined depth reconstruction map 780.
In some embodiments, an estimated depth map can be generated from multiple (e.g., four) masked correlation images. The depth can be estimated using Eq. (1) and Eq. (2), or alternatively, other depth estimation techniques can be used, such as the discrete Fourier transform.
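As a rough illustration of estimating depth from four masked correlation images, the sketch below uses the common four-phase arctangent estimate. It assumes correlation values of the form a·cos(φ + ψ) with offsets 0, π, π/2, 3π/2 and a hypothetical 20 MHz modulation frequency; it is not claimed to reproduce Eq. (1) and Eq. (2) exactly.

```python
import numpy as np

C_LIGHT = 299_792_458.0  # speed of light in m/s


def depth_from_correlations(c_0, c_pi, c_half_pi, c_three_half_pi, f_mod=20e6):
    """Recover depth and amplitude from four correlation images.

    With C_psi = a * cos(phi + psi):
      c_0 - c_pi                  = 2 a cos(phi)
      c_three_half_pi - c_half_pi = 2 a sin(phi)
    """
    sin_term = c_three_half_pi - c_half_pi
    cos_term = c_0 - c_pi
    phase = np.mod(np.arctan2(sin_term, cos_term), 2.0 * np.pi)
    amplitude = 0.5 * np.hypot(sin_term, cos_term)
    depth = C_LIGHT * phase / (4.0 * np.pi * f_mod)
    return depth, amplitude


# Round trip: synthesize correlation values for known depths, then decode them.
true_depth = np.array([1.0, 3.0, 5.5])  # metres, within the unambiguous range
true_phase = 4.0 * np.pi * 20e6 * true_depth / C_LIGHT
offsets = (0.0, np.pi, np.pi / 2.0, 3.0 * np.pi / 2.0)
c = [0.8 * np.cos(true_phase + psi) for psi in offsets]
est_depth, est_amplitude = depth_from_correlations(*c)
```

Taking differences of opposite-phase measurements cancels any constant offset common to both, and the estimate is only unambiguous for depths below c / (2·f_mod), about 7.5 m at the assumed frequency.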
Convolution refinement network 770 is a residual encoder-decoder model, implemented using a memory and a graphics processing unit (“GPU”) or other processor, that utilizes an initial depth estimate and mask information as input to produce refined depth reconstruction map 780. For example, refined depth reconstruction map 780 can be calculated according to
$$\hat{D}^{*} = R\big(P(C), M\big) + P(C) = \hat{D}_{R} + \hat{D} \qquad (3)$$
where $\hat{D}^{*}$ is the refined depth map, $R$ is the convolution refinement network, $P(C)$ is the initial depth estimate, $M$ is the mask, $\hat{D}$ is the initial depth estimate, and $\hat{D}_{R}$ is the refined residual depth which, when added to $\hat{D}$, serves to correct the now spatially multiplexed effects of noise and flying pixels.
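The residual structure described here, in which the network predicts a correction that is added to the initial depth estimate, can be sketched as follows. The function `toy_residual_net` is a hypothetical stand-in (a mask-weighted 3x3 neighborhood average) and not the learned encoder-decoder; the per-pixel `weight` map summarizing the mask is likewise an assumption.

```python
import numpy as np


def toy_residual_net(depth, weight):
    """Stand-in for the learned network: residual toward a weighted 3x3 mean.

    weight is a positive per-pixel confidence derived from the mask
    (e.g. the open fraction of each pixel's aperture); this is an assumption.
    """
    h, w = depth.shape
    pad_d = np.pad(depth * weight, 1, mode="edge")
    pad_w = np.pad(weight, 1, mode="edge")
    num = sum(pad_d[i:i + h, j:j + w] for i in range(3) for j in range(3))
    den = sum(pad_w[i:i + h, j:j + w] for i in range(3) for j in range(3))
    return num / den - depth


def refine(initial_depth, mask_info, residual_net):
    """Refined depth = initial depth + residual predicted from depth and mask."""
    return initial_depth + residual_net(initial_depth, mask_info)


flat = np.full((4, 4), 2.0)              # a flat wall 2 m away
weight = np.random.default_rng(0).uniform(0.2, 1.0, size=(4, 4))
refined_flat = refine(flat, weight, toy_residual_net)
```

Because only a residual is predicted, a scene that already has a correct initial estimate (such as the flat wall above) passes through unchanged, and the network's capacity is spent on corrections rather than on computing depth from phase.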
Eq. (3), in contrast to Eq. (1) and Eq. (2), introduces the use of an initial depth calculation. In addition, convolution refinement network 770 does not generate depth from phase, and the processing and computational needs of convolution refinement network 770 are substantially reduced as compared to a conventional deep reconstruction network. As a result, convolution refinement network 770 quickly determines high level depth and mask features, as well as determines other image information where raw phase data might significantly differ from a training set. The sequential depth estimation and refinement approach also enables calibration procedures implemented by the sensor manufacturers. Real depth data can be supplied to convolution refinement network 770 without having to retrain and learn calibration offsets.
Thus, the encoder-decoder model of convolution refinement network 770 is configured to aggregate the spatial information and utilize mask structural cues to produce refined depth estimates. The errors between initial depth estimates and refined depth estimates can be used to improve mask patterns.
Fig. 8 illustrates the updating of a mask pattern according to an example embodiment of the present invention.
At convolution refinement network 770, errors in depth calculations (e.g., between the initial depth and refined depth) are calculated. When calculated with respect to the light field depth, the errors can be used to improve convolution refinement network 770 and mask array 650 (e.g., as illustrated in Fig. 6). In this way, starting at an initial mask, the embodiments can simultaneously determine an encoding or otherwise update a mask pattern and decoding network weights. The updated mask pattern can be applied on (e.g., photolithographically) each pixel of a sensor (e.g., sensor 420) during fabrication of the sensor.
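The idea of using an error signal to update the mask pattern can be illustrated with a deliberately simplified toy. The sketch below is an assumption-laden stand-in: it replaces the full pipeline with a linear measurement model, omits the refinement network entirely, and uses projected gradient descent with a hand-derived gradient. The names `samples`, `targets`, and `target_mask` are hypothetical.

```python
import numpy as np

rng = np.random.default_rng(0)
n_samples, n_paths = 200, 9  # 9 light paths, e.g. a 3x3 sub-aperture grid

# Hypothetical training data: sub-aperture samples and reference values that
# stand in for supervision derived from the light field depth.
samples = rng.uniform(0.0, 1.0, size=(n_samples, n_paths))
target_mask = np.array([1, 0, 1, 0, 1, 0, 1, 0, 1], dtype=float)
targets = samples @ target_mask


def loss_and_grad(mask):
    """Mean squared error of the masked measurement and its gradient."""
    error = samples @ mask - targets
    loss = np.mean(error ** 2)
    grad = 2.0 * samples.T @ error / n_samples
    return loss, grad


mask = np.full(n_paths, 0.5)  # initial mask: every path half open
initial_loss, _ = loss_and_grad(mask)
for _ in range(500):
    _, grad = loss_and_grad(mask)
    # Gradient step, then project back to physically valid transmissions.
    mask = np.clip(mask - 0.1 * grad, 0.0, 1.0)
final_loss, _ = loss_and_grad(mask)
```

In an end-to-end setting the same error would also be propagated to the decoding network weights, typically with an automatic differentiation framework rather than a manual gradient.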
With a global aperture of the related art, as illustrated in Fig. 3, all pixels are equally susceptible to flying pixels, and if one sensor pixel returns a flying pixel, likely so will its neighboring sensor pixels. Returning to the embodiments of the present invention, the addition of spatially variable susceptibility via a microlens mask, as illustrated in Fig. 4, for example, means that neighboring pixels are no longer equally susceptible to noise and/or flying pixels. A sensor pixel with a wide effective aperture can be trusted with regards to noise statistics, but is likely to return flying pixels if near an object boundary. Contrastingly, a neighboring pixel with a narrow aperture will likely produce noisier measurements, but be less affected by depth discontinuities. By aggregating information in pixel neighborhoods, wide aperture pixels can be used to de-noise local measurements, and narrow aperture pixels can be used to reduce the occurrence of flying pixels.
Fig. 9 illustrates a computer-implemented method 900 for depth determination according to an example embodiment of the present invention. The computer-implemented method can be implemented using one or more memory devices (e.g., a non-transitory memory), one or more processing devices (e.g., a CPU, GPU, etc.), and/or one or more communication channels to transmit one or more instructions.
At 910, method 900 receives an image (e.g., an image containing a set of light field data 510 as illustrated in Fig. 5A). The image may include one or more objects or surfaces, and is captured by a ToF camera.
Next, at 920, method 900 adds simulated noise (e.g., noise 522 as illustrated in Fig. 5A) to the image. For example, simulated noise can include noise according to a Poisson distribution or a Skellam distribution that approximates Gaussian noise.
Subsequently, at 930, for the image, method 900 generates a set of correlation images, each correlation image having a varying phase offset (e.g., correlation images 511A, 511B, 511C, 511D as illustrated in Fig. 5A).
At 940, for each pixel of the image, method 900 generates a masked pixel by applying a mask array. As discussed in connection with Fig. 6, by multiplying a set of sub-aperture pixels (such as sub-aperture pixels 641-649) with a mask array (such as mask array 650, including a set of micro-lens masks 651-659) and summing the results on a per pixel basis, a masked pixel (such as masked pixel 660) is produced. Here, sub-aperture pixels 640 are weighted according to a mask array 650. The weighted sub-aperture pixels are combined with the simulated noise to produce an initial depth estimate on a per pixel basis.
Lastly, for each masked pixel, method 900 determines the depth of the masked pixel to generate a depth map for the image on a per pixel basis. Here, the respective depths of masked pixels can be determined using a convolution refinement network (such as convolution refinement network 770). Alternatively, or additionally, other known depth determination techniques may be used.
In implementation, it was demonstrated that a pinhole aperture produces an extremely noisy reconstruction; an open aperture produces blurred edges with a plethora of flying pixels; and the mask pattern provides substantially improved depth determination with acceptable SNR and substantially reduced flying pixels. For real scene captures, the mask pattern achieves a 30% reduction in flying pixels as compared to an identical light throughput using a global aperture mask. In addition, the results generalize to scenes of varying geometry and surface material. Moreover, the results were achieved without re-training or fine-tuning the convolution refinement network.
The embodiments of the invention can be readily applied to numerous applications. Some non-exhaustive examples include cameras for mobile phones or tablets, autonomous vehicles, collision avoidance, delivery robotics, cartography including topography and other 3D maps, gaming, augmented reality (“AR”), virtual reality (“VR”), facial identification, and others.
It will be apparent to those skilled in the art that various modifications and variations can be made in the microlens amplitude masks for flying pixel removal in time-of-flight imaging of the present invention without departing from the spirit or scope of the invention. Thus, it is intended that the present invention cover the modifications and variations of this invention provided they come within the scope of the appended claims and their equivalents.

Claims

WHAT IS CLAIMED IS:
1. A computer-implemented method for image depth determination, the computer-implemented method comprising: receiving an image; adding noise to the image; for the image, determining a set of correlation images, each correlation image having a varying phase offset; for each pixel of the image, generating a masked pixel by applying a mask array; and for each masked pixel, determining the depth of the masked pixel to generate a depth map for the image on a per pixel basis.
2. The computer-implemented method according to claim 1, wherein the image is captured at a sensor having a mask pattern.
3. The computer-implemented method according to claim 2, wherein the mask pattern is photolithographically applied to each pixel of the sensor.
4. The computer-implemented method according to any of the preceding claims, wherein each pixel of the image is captured using a respective aperture.
5. The computer-implemented method according to any of the preceding claims, wherein neighboring pixels have different apertures.
6. The computer-implemented method according to claim 5, wherein a first neighboring pixel has a different signal-to-noise ratio than a second neighboring pixel.
7. The computer-implemented method according to claim 6, wherein the first neighboring pixel has a different susceptibility to being a flying pixel than the second neighboring pixel.
8. The computer-implemented method according to any of the preceding claims, wherein the noise is added according to a Poisson distribution or a Skellam distribution.
9. The computer-implemented method according to any of the preceding claims, further comprising: processing each masked pixel by a convolution refinement network.
10. The computer-implemented method according to claim 9, wherein the convolution refinement network receives a first depth estimate for each pixel and outputs the depth for each pixel.
11. The computer-implemented method according to claim 9 or 10, wherein the convolution refinement network outputs a correction to the mask array based upon the difference between the first depth estimate for each pixel and the depth for each pixel.
12. A time-of-flight system for image depth determination, the system comprising: a lens configured to receive incident light; and a light sensor having a plurality of pixels, the light sensor configured to receive the incident light through a plurality of masks, each pixel corresponding to a respective mask that selectively blocks incident light paths to provide differentiable apertures for neighboring pixels.
13. The system according to claim 12, wherein a first neighboring pixel has a different signal-to-noise ratio than a second neighboring pixel.
14. The system according to claim 13, wherein the first neighboring pixel has a different susceptibility to being a flying pixel than the second neighboring pixel.
PCT/IB2022/052448 2021-03-17 2022-03-17 Microlens amplitude masks for flying pixel removal in time-of-flight imaging WO2022195537A1 (en)

Priority Applications (2)

Application Number Priority Date Filing Date Title
US17/915,536 US11657523B2 (en) 2021-03-17 2022-03-17 Microlens amplitude masks for flying pixel removal in time-of-flight imaging
US17/850,321 US20220414913A1 (en) 2021-06-25 2022-06-27 Polka lines: learning structured illumination and reconstruction for active stereo

Applications Claiming Priority (2)

Application Number Priority Date Filing Date Title
US202163162336P 2021-03-17 2021-03-17
US63/162,336 2021-03-17

Related Child Applications (1)

Application Number Title Priority Date Filing Date
US17/850,321 Continuation-In-Part US20220414913A1 (en) 2021-06-25 2022-06-27 Polka lines: learning structured illumination and reconstruction for active stereo

Publications (1)

Publication Number Publication Date
WO2022195537A1 true WO2022195537A1 (en) 2022-09-22

Family

ID=83322174

Family Applications (1)

Application Number Title Priority Date Filing Date
PCT/IB2022/052448 WO2022195537A1 (en) 2021-03-17 2022-03-17 Microlens amplitude masks for flying pixel removal in time-of-flight imaging

Country Status (2)

Country Link
US (1) US11657523B2 (en)
WO (1) WO2022195537A1 (en)

Citations (5)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
US20150192510A1 (en) * 2012-06-22 2015-07-09 The Regents Of The University Of Colorado, A Body Corporate Imaging or measurement methods and systems
US20160357008A1 (en) * 2015-06-02 2016-12-08 The Johns Hopkins University Fiber-optic methods and devices enabling multiphoton imaging with improved signal-to-noise ratio
US20180041759A1 (en) * 2015-03-02 2018-02-08 Dolby International Ab Content-adaptive perceptual quantizer for high dynamic range images
WO2021004263A1 (en) * 2019-07-11 2021-01-14 Oppo广东移动通信有限公司 Depth map processing method and apparatus, electronic device and readable storage medium
WO2021033191A1 (en) * 2019-08-20 2021-02-25 Technology Innovation Momentum Fund (Israel) Limited Partnership Method and apparatus for authentication of a three-dimensional object

Family Cites Families (23)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
EP2487504A1 (en) * 2011-02-10 2012-08-15 Technische Universität München Method of enhanced depth image acquisition
US8542268B2 (en) * 2011-03-02 2013-09-24 Ricoh Co., Ltd. Compact multi-aperture three-dimensional imaging system
US9819879B2 (en) * 2011-07-12 2017-11-14 Samsung Electronics Co., Ltd. Image filtering apparatus and method based on noise prediction using infrared ray (IR) intensity
US9213883B2 (en) * 2012-01-10 2015-12-15 Samsung Electronics Co., Ltd. Method and apparatus for processing depth image
US8619082B1 (en) * 2012-08-21 2013-12-31 Pelican Imaging Corporation Systems and methods for parallax detection and correction in images captured using array cameras that contain occlusions using subsets of images to perform depth estimation
KR101896301B1 (en) * 2013-01-03 2018-09-07 삼성전자주식회사 Apparatus and method for processing depth image
KR20150037366A (en) * 2013-09-30 2015-04-08 삼성전자주식회사 Method for decreasing noise of depth image, image processing apparatus and image generating apparatus using thereof
US9729857B2 (en) * 2014-04-08 2017-08-08 Semyon Nisenzon High resolution depth map computation using multiresolution camera clusters for 3D image generation
US9805294B2 (en) * 2015-02-12 2017-10-31 Mitsubishi Electric Research Laboratories, Inc. Method for denoising time-of-flight range images
LU92688B1 (en) * 2015-04-01 2016-10-03 Iee Int Electronics & Eng Sa Method and system for real-time motion artifact handling and noise removal for tof sensor images
US9992477B2 (en) * 2015-09-24 2018-06-05 Ouster, Inc. Optical system for collecting distance information within a field
US10983213B2 (en) * 2017-03-29 2021-04-20 Luminar Holdco, Llc Non-uniform separation of detector array elements in a lidar system
JP7324518B2 (en) * 2018-04-01 2023-08-10 オプシス テック リミテッド Noise adaptive solid-state lidar system
US11573304B2 (en) * 2018-04-27 2023-02-07 Liturex (Guangzhou) Co. Ltd LiDAR device with a dynamic spatial filter
US11563911B2 (en) * 2018-10-10 2023-01-24 Northwestern University Method and system for time-of-flight imaging with high lateral resolution
US11393115B2 (en) * 2018-11-27 2022-07-19 Infineon Technologies Ag Filtering continuous-wave time-of-flight measurements, based on coded modulation images
US11272156B2 (en) * 2019-02-15 2022-03-08 Analog Devices International Unlimited Company Spatial correlation sampling in time-of-flight imaging
US11405535B2 (en) * 2019-02-28 2022-08-02 Qualcomm Incorporated Quad color filter array camera sensor configurations
US11416998B2 (en) * 2019-07-30 2022-08-16 Microsoft Technology Licensing, Llc Pixel classification to reduce depth-estimation error
US11818462B2 (en) * 2019-08-30 2023-11-14 Qualcomm Incorporated Phase detection autofocus sensor apparatus and method for depth sensing
US11550037B2 (en) * 2020-05-01 2023-01-10 The Charles Stark Draper Laboratory, Inc. Monostatic LiDAR transceiver system
US11600010B2 (en) * 2020-06-03 2023-03-07 Lucid Vision Labs, Inc. Time-of-flight camera having improved dynamic range and method of generating a depth map
US11721031B2 (en) * 2020-10-28 2023-08-08 Stmicroelectronics (Research & Development) Limited Scalable depth sensor

Non-Patent Citations (1)

* Cited by examiner, † Cited by third party
Title
CHUGUNOV ILYA; BAEK SEUNG-HWAN; FU QIANG; HEIDRICH WOLFGANG; HEIDE FELIX: "Mask-ToF: Learning Microlens Masks for Flying Pixel Correction in Time-of-Flight Imaging", 2021 IEEE/CVF CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION (CVPR), IEEE, 20 June 2021 (2021-06-20), pages 9112 - 9122, XP034010835, DOI: 10.1109/CVPR46437.2021.00900 *

Also Published As

Publication number Publication date
US11657523B2 (en) 2023-05-23
US20230118593A1 (en) 2023-04-20

Legal Events

Date Code Title Description
121 Ep: the epo has been informed by wipo that ep was designated in this application

Ref document number: 22770738

Country of ref document: EP

Kind code of ref document: A1

NENP Non-entry into the national phase

Ref country code: DE

122 Ep: pct application non-entry in european phase

Ref document number: 22770738

Country of ref document: EP

Kind code of ref document: A1