CN114723797B - TOF depth imaging method based on deep learning - Google Patents
Info
- Publication number
- CN114723797B (application CN202110015239.6A)
- Authority
- CN
- China
- Prior art keywords
- imaging
- tof
- network
- depth
- function
- Prior art date
- Legal status (The legal status is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the status listed.)
- Active
Classifications
-
- G—PHYSICS
- G06—COMPUTING; CALCULATING OR COUNTING
- G06T—IMAGE DATA PROCESSING OR GENERATION, IN GENERAL
- G06T7/00—Image analysis
- G06T7/50—Depth or shape recovery
- G06T7/514—Depth or shape recovery from specularities
-
- G—PHYSICS
- G06—COMPUTING; CALCULATING OR COUNTING
- G06N—COMPUTING ARRANGEMENTS BASED ON SPECIFIC COMPUTATIONAL MODELS
- G06N3/00—Computing arrangements based on biological models
- G06N3/02—Neural networks
- G06N3/08—Learning methods
-
- G06T5/70—
-
- G—PHYSICS
- G06—COMPUTING; CALCULATING OR COUNTING
- G06T—IMAGE DATA PROCESSING OR GENERATION, IN GENERAL
- G06T2207/00—Indexing scheme for image analysis or image enhancement
- G06T2207/10—Image acquisition modality
- G06T2207/10028—Range image; Depth image; 3D point clouds
-
- G—PHYSICS
- G06—COMPUTING; CALCULATING OR COUNTING
- G06T—IMAGE DATA PROCESSING OR GENERATION, IN GENERAL
- G06T2207/00—Indexing scheme for image analysis or image enhancement
- G06T2207/20—Special algorithmic details
- G06T2207/20081—Training; Learning
-
- G—PHYSICS
- G06—COMPUTING; CALCULATING OR COUNTING
- G06T—IMAGE DATA PROCESSING OR GENERATION, IN GENERAL
- G06T2207/00—Indexing scheme for image analysis or image enhancement
- G06T2207/20—Special algorithmic details
- G06T2207/20084—Artificial neural networks [ANN]
Abstract
The invention discloses a TOF depth imaging method based on deep learning. The method comprises the following specific steps: (1) input a depth map to a TOF imaging network, simulate the modulation function of TOF with a learnable matrix, and shift it according to the pixel values of the depth map; (2) simulate the demodulation function of TOF with another learnable matrix and integrate it against the output of the previous step; (3) after integration, add ambient light and noise, and input the result to a denoising imaging sub-network; (4) train the modulation function, the demodulation function and the denoising imaging sub-network simultaneously; (5) modulate the laser diode with the trained modulation function to illuminate the scene; (6) measure the reflected signal with an APD and multiply it by the trained demodulation function; (7) pass the multiplied signal through a low-pass filter and measure the voltage value; (8) input the voltage values to the trained denoising imaging sub-network to complete TOF depth imaging. The method enhances the robustness of TOF depth imaging to noise and improves the imaging precision.
Description
Technical Field
The invention relates to the fields of computational photography and deep learning, in particular to a TOF (time of flight) depth imaging technology based on deep learning.
Background
In recent years, with the surge of artificial intelligence, its application to computational photography has become a frontier research hotspot in computer vision, digital signal processing, optics and related fields.
Depth-map processing, as one application of artificial intelligence, is attracting wide attention, and research on depth imaging is of great significance for autonomous driving, geographic remote sensing, medical imaging and other fields. Depth maps have a wide range of applications: compared with two-dimensional images, their distance information captures richer positional relations between objects, i.e., it separates foreground from background. Building on this, applications such as three-dimensional modeling can be completed, and target recognition and tracking can be performed rapidly. Meanwhile, depth information still supports traditional tasks on the target image such as segmentation, labeling, recognition and tracking.
Conventional time-of-flight (TOF) depth imaging methods acquire depth information either directly, by using a pulse wave and measuring the time interval between two signals, or indirectly, by measuring the phase using sinusoidal signals as modulation and demodulation functions. However, conventional TOF depth imaging suffers from several drawbacks: it is strongly affected by noise, so the measurement precision is relatively low; the result is noticeably disturbed by the properties of the measured object, the external environment and external light sources; and systematic and random errors influence the result significantly, requiring additional post-processing of the data.
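The sinusoidal (continuous-wave) variant mentioned above can be made concrete. Below is a minimal sketch of classic four-phase depth recovery, assuming correlation samples taken at 0, 90, 180 and 270 degree demodulation offsets; the sampling convention, function name and the 20 MHz figure are illustrative, not taken from the patent:

```python
import math

C = 299_792_458.0  # speed of light in m/s

def four_phase_depth(a0, a1, a2, a3, f_mod):
    """Recover depth from four correlation samples of a sinusoidally
    modulated return, taken at 0/90/180/270 degree demodulation offsets."""
    phase = math.atan2(a3 - a1, a0 - a2)  # phase shift of the return signal
    phase %= 2 * math.pi                  # wrap into [0, 2*pi)
    return C * phase / (4 * math.pi * f_mod)

# A 20 MHz modulation gives an unambiguous range of C / (2 * f_mod), about 7.5 m.
```

It is this phase-based scheme whose fixed sinusoidal codes the patent replaces with learned modulation and demodulation functions.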
Disclosure of Invention
To overcome the drawbacks of existing TOF imaging methods, the invention aims to provide a TOF depth imaging method based on deep learning.
In order to achieve the above purpose, the technical scheme adopted by the invention is as follows:
a deep learning-based TOF depth imaging method comprising the steps of:
step 1, inputting a depth map in a training data set into a TOF imaging network, wherein the TOF imaging network comprises a modulation function, a demodulation function and a denoising imaging sub-network;
step 2, simulating a modulation function of actual TOF imaging by using a learnable matrix, wherein the matrix carries out corresponding shift according to the value of each pixel of the depth map;
step 3, simulating a demodulation function of the actual TOF imaging by using another learnable matrix, and performing integral operation with the output of the step 2;
step 4, adding ambient light, photon noise and sensor read noise to the result integrated in step 3 to form a noisy measurement map;
step 5, inputting the noisy measurement graph to a denoising imaging sub-network of the TOF imaging network, and training the denoising imaging sub-network, the modulation function and the demodulation function simultaneously by using a deep learning method to obtain a trained modulation function, a trained demodulation function and a trained denoising imaging sub-network;
step 6, modulating the laser diode by using the modulation function trained in the step 5, and driving the laser diode to emit laser to illuminate the scene;
step 7, the reflected light reflected by the surface of the scene object is focused on a detector after passing through the beam splitter, and the detector receives a reflected signal with modulation information;
step 8, multiplying the reflected signal in the step 7 with the demodulation function trained in the step 5 through a multiplier, integrating through a low-pass filter, and acquiring the voltage output by the low-pass filter through an analog-to-digital converter;
step 9, repeating the steps 6 to 8, scanning each point in the scene, and completing depth measurement of all points in the scene to obtain a noisy measurement map;
and 10, inputting the noisy measurement map obtained in the step 9 to the denoising imaging sub-network obtained in the training in the step 5 to obtain a depth map of the scene, and completing TOF depth imaging.
Compared with existing TOF imaging methods, the proposed method is robust to scene noise and achieves high depth imaging precision, solving the problem that existing TOF imaging methods struggle to obtain an accurate depth map under heavy noise. Even under strong interference from ambient light and external light sources, the method maintains high depth imaging precision and is little affected by noise.
Drawings
FIG. 1 is a flow chart of a method according to an embodiment of the invention.
Detailed Description
This embodiment provides a TOF depth imaging method based on deep learning; the overall flow is shown in FIG. 1. The method includes the following steps:
step 1, selecting a proper depth data set, expanding the data set, including cutting, turning and rotating, and enhancing the generalization capability of the data set.
Step 2: choose PyTorch as the deep learning framework, set the batch size to 4, select a suitable learning rate and decay it every 10 epochs, and input the depth maps of the data set into the TOF imaging network.
Step 3: simulate the modulation function M(t) in TOF imaging with a learnable matrix parameter(rand(number)) initialized from a Gaussian distribution, where number is the number of samples of the simulated modulation function, rand denotes a matrix initialized with Gaussian-distributed values, and parameter denotes making the matrix learnable. Likewise, simulate the demodulation function D(t) in TOF imaging with another learnable matrix parameter(rand(number)). Compute the cross-correlation of the modulation function M(t) and the demodulation function D(t), C(τ) = Σ_t D(t)·M(t − τ), obtaining the modulation-demodulation cross-correlation matrix. This step finds, by deep learning, the modulation and demodulation functions that optimize the imaging quality, which are then applied to the modulation and demodulation circuits of the light source.
Step 4: use the value of each pixel of the depth map to index the cross-correlation matrix, which implements the shift-and-integrate operation and thereby simulates demodulation.
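Steps 3 and 4 can be sketched in numpy (the patent uses PyTorch learnable parameters; plain Gaussian-initialized arrays stand in for them here, and the sample count 64 is illustrative). Entry τ of the cross-correlation is exactly the noise-free measurement a return delayed by τ samples would produce, so indexing it with the per-pixel depth simulates the shift-and-integrate operation:

```python
import numpy as np

def crosscorr_matrix(m, d):
    """C[tau] = sum_t d[t] * m[t - tau]: the measurement produced by a
    return of the modulation m delayed by tau samples, demodulated by d."""
    return np.array([np.dot(d, np.roll(m, tau)) for tau in range(len(m))])

rng = np.random.default_rng(0)
n = 64                          # number of samples of the (de)modulation codes
m = rng.normal(size=n)          # stands in for the learnable modulation M(t)
d = rng.normal(size=n)          # stands in for the learnable demodulation D(t)
xcorr = crosscorr_matrix(m, d)

depth_px = rng.integers(0, n, size=(8, 8))  # depth map, quantized to delay bins
measurement = xcorr[depth_px]               # step 4: indexing = shift + integrate
```

Because the indexing is just a table lookup into a differentiable cross-correlation, gradients can flow back into m and d during training.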
Step 5: according to the noise produced in the actual TOF imaging process, add photon noise and sensor read noise to the integrated image to form a noisy image. In actual TOF imaging, noise is present both when the detector receives the reflected light signal and when the voltage is read out; adding the corresponding photon noise and sensor read noise during network training makes the recovered depth map robust to this noise.
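A sketch of the step-5 noise model under common assumptions: photon (shot) noise is Poisson in an assumed photon count, and sensor read noise is additive Gaussian. The scale and read_sigma values are illustrative, not from the patent:

```python
import numpy as np

def add_tof_noise(measurement, ambient, rng, scale=1000.0, read_sigma=0.01):
    """Add ambient light, Poisson photon noise and Gaussian read noise."""
    signal = np.clip(measurement + ambient, 0.0, None)  # photon counts are non-negative
    photons = rng.poisson(signal * scale) / scale       # photon (shot) noise
    read = rng.normal(0.0, read_sigma, size=measurement.shape)
    return photons + read                               # noisy measurement map
```

Training the denoising sub-network on measurements corrupted this way is what gives the final depth maps their noise robustness.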
Step 6: input the noisy image into the denoising reconstruction sub-network of the TOF imaging network, which denoises and reconstructs the noisy measurement image to restore the depth map input in step 2.
Step 7: choose the mean squared error as the loss function and train the modulation function, the demodulation function and the denoising imaging sub-network simultaneously. After training with the deep learning method, validate the network on the depth maps of the test set, obtaining the trained modulation function M(t), demodulation function D(t) and denoising reconstruction sub-network.
Step 8: modulate the laser diode with the modulation function M(t) trained in step 7, driving the laser diode circuit to emit the light that illuminates the scene.
Step 9: the emitted light is reflected by objects in the scene; part of the reflected light passes through the beam splitter and is focused by a lens onto the APD (here an APD430 detector), which receives the reflected signal αM(t − t0) + β, where α is the reflectivity, t0 is the time delay of the spatial transfer, and β is the radiation component caused by external light sources.
Step 10: the reflected signal αM(t − t0) + β is multiplied in a multiplier by the demodulation signal D(t) obtained from network training, and then passed through a low-pass filter to remove the high-frequency components, leaving a DC offset that represents the depth measurement of the scene point. An ADC collects the DC component at the low-pass filter output.
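The multiply-and-low-pass measurement of steps 9 and 10 can be checked numerically, modelling the low-pass filter as the mean (DC component) over one code period; the α, β and delay values below are illustrative:

```python
import numpy as np

def lockin_measure(m, d, delay, alpha=0.8, beta=0.2):
    """Multiply the return alpha*M(t - t0) + beta by D(t) and keep the DC
    component, as the multiplier + low-pass filter + ADC chain does."""
    reflected = alpha * np.roll(m, delay) + beta  # alpha*M(t - t0) + beta
    return float(np.mean(reflected * d))          # DC offset after low-pass
```

The DC value equals α·C(delay)/N plus a β-dependent offset, so sweeping the scene point by point (step 11) reproduces the same indexed cross-correlation measurement map the network was trained on.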
Step 11: repeat steps 8, 9 and 10 to complete the depth measurement of all points in the scene, obtaining a noisy depth measurement map.
Step 12: input the noisy depth measurement map into the trained denoising reconstruction sub-network to obtain a noise-free depth map, completing TOF depth imaging.
Claims (2)
1. A deep learning-based TOF depth imaging method, comprising the steps of:
step 1, inputting a depth map in a training data set into a TOF imaging network, wherein the TOF imaging network comprises a modulation function, a demodulation function and a denoising imaging sub-network;
step 2, simulating a modulation function of actual TOF imaging by using a learnable matrix, and performing corresponding shift according to the value of each pixel of the depth map;
step 3, simulating a demodulation function of the actual TOF imaging by using another learnable matrix, and performing integral operation with the output of the step 2;
step 4, adding ambient light, photon noise and sensor read noise to the result integrated in step 3 to form a noisy measurement map;
step 5, inputting the noisy measurement graph to a denoising imaging sub-network of the TOF imaging network, and training the denoising imaging sub-network, the modulation function and the demodulation function simultaneously by using a deep learning method to obtain a trained modulation function, a trained demodulation function and a trained denoising imaging sub-network;
step 6, modulating the laser diode by using the modulation function trained in the step 5, and driving the laser diode to emit laser to illuminate the scene;
step 7, the reflected light reflected by the surface of the scene object is focused on a detector after passing through the beam splitter, and the detector receives a reflected signal with modulation information;
step 8, multiplying the reflected signal in step 7 with the demodulation function trained in step 5 through a multiplier, integrating through a low-pass filter, and acquiring the voltage output by the low-pass filter through an analog-to-digital converter;
step 9, repeating the steps 6 to 8, scanning each point in the scene, and completing depth measurement of all points in the scene to obtain a noisy measurement map;
and 10, inputting the noisy measurement map obtained in the step 9 to the denoising imaging sub-network obtained in the training in the step 5 to obtain a depth map of the scene, and completing TOF depth imaging.
2. The TOF depth imaging method according to claim 1, wherein in step 7, assuming that the modulation function used is M(t), the reflected signal received by the detector is f(t) = αM(t − t0) + β, where α is the reflectivity, t0 is the time delay of the spatial transfer, and β is the radiation component caused by the external light source.
Priority Applications (1)
Application Number | Priority Date | Filing Date | Title |
---|---|---|---|
CN202110015239.6A CN114723797B (en) | 2021-01-06 | 2021-01-06 | TOF depth imaging method based on deep learning |
Applications Claiming Priority (1)
Application Number | Priority Date | Filing Date | Title |
---|---|---|---|
CN202110015239.6A CN114723797B (en) | 2021-01-06 | 2021-01-06 | TOF depth imaging method based on deep learning |
Publications (2)
Publication Number | Publication Date |
---|---|
CN114723797A (en) | 2022-07-08
CN114723797B true CN114723797B (en) | 2024-04-12 |
Family
ID=82234992
Family Applications (1)
Application Number | Title | Priority Date | Filing Date |
---|---|---|---|
CN202110015239.6A Active CN114723797B (en) | 2021-01-06 | 2021-01-06 | TOF depth imaging method based on deep learning |
Country Status (1)
Country | Link |
---|---|
CN (1) | CN114723797B (en) |
Citations (1)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
CN110458778A (en) * | 2019-08-08 | 2019-11-15 | 深圳市灵明光子科技有限公司 | A kind of depth image denoising method, device and storage medium |
Family Cites Families (1)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
JP7271119B2 (en) * | 2017-10-20 | 2023-05-11 | ソニーセミコンダクタソリューションズ株式会社 | Depth image acquisition device, control method, and depth image acquisition system |
Patent Citations (1)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
CN110458778A (en) * | 2019-08-08 | 2019-11-15 | 深圳市灵明光子科技有限公司 | A kind of depth image denoising method, device and storage medium |
Also Published As
Publication number | Publication date |
---|---|
CN114723797A (en) | 2022-07-08 |
Similar Documents
Publication | Publication Date | Title |
---|---|---|
Zanuttigh et al. | Time-of-flight and structured light depth cameras | |
CN110168311B (en) | Method and system for classifying objects in a point cloud data set | |
US10302424B2 (en) | Motion contrast depth scanning | |
US10677923B2 (en) | Optoelectronic modules for distance measurements and/or multi-dimensional imaging | |
Gokturk et al. | A time-of-flight depth sensor-system description, issues and solutions | |
CN107392965B (en) | Range finding method based on combination of deep learning and binocular stereo vision | |
WO2017025885A1 (en) | Doppler time-of-flight imaging | |
CN111045029B (en) | Fused depth measuring device and measuring method | |
Inglis et al. | A pipeline for structured light bathymetric mapping | |
CN102494663A (en) | Measuring system of swing angle of swing nozzle and measuring method of swing angle | |
Shi et al. | Extrinsic calibration and odometry for camera-LiDAR systems | |
Al-Temeemy et al. | Laser-based structured light technique for 3D reconstruction using extreme laser stripes extraction method with global information extraction | |
CN114723797B (en) | TOF depth imaging method based on deep learning | |
Leite et al. | Fusing heterogeneous tri-dimensional information for reconstructing submerged structures in harsh sub-sea environments | |
CN113052890A (en) | Depth truth value acquisition method, device and system and depth camera | |
Conde et al. | Adaptive high dynamic range for time-of-flight cameras | |
CN115290004B (en) | Underwater parallel single-pixel imaging method based on compressed sensing and HSI | |
Li et al. | Measurement linearity and accuracy optimization for time-of-flight range imaging cameras | |
Nagamatsu et al. | Self-calibrated dense 3D sensor using multiple cross line-lasers based on light sectioning method and visual odometry | |
Paredes et al. | CS-ToF sensing by means of greedy bi-lateral fusion and near-to-optimal low-density codes | |
Jawad et al. | Measuring object dimensions and its distances based on image processing technique by analysis the image using sony camera | |
CN115496883A (en) | Data-driven TOF depth imaging method capable of removing multipath errors | |
Li et al. | Fisher information guidance for learned time-of-flight imaging | |
CN111308482B (en) | Filtered continuous wave time-of-flight measurement based on coded modulated images | |
RU2746088C1 (en) | Digital device for determining the spatial orientation of an airborne object relative to a passive optoelectronic complex |
Legal Events
Date | Code | Title | Description |
---|---|---|---|
PB01 | Publication | ||
SE01 | Entry into force of request for substantive examination | ||
GR01 | Patent grant | ||