CN113971639A - Depth estimation based under-exposed LDR image reconstruction HDR image - Google Patents

Depth estimation based under-exposed LDR image reconstruction HDR image Download PDF

Info

Publication number
CN113971639A
CN113971639A CN202110993297.6A CN202110993297A CN113971639A CN 113971639 A CN113971639 A CN 113971639A CN 202110993297 A CN202110993297 A CN 202110993297A CN 113971639 A CN113971639 A CN 113971639A
Authority
CN
China
Prior art keywords
hdr
image
images
ldr
depth estimation
Prior art date
Legal status (The legal status is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the status listed.)
Pending
Application number
CN202110993297.6A
Other languages
Chinese (zh)
Inventor
张涛
梁杰
王昊
Current Assignee (The listed assignees may be inaccurate. Google has not performed a legal analysis and makes no representation or warranty as to the accuracy of the list.)
Tianjin University
Original Assignee
Tianjin University
Priority date (The priority date is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the date listed.)
Filing date
Publication date
Application filed by Tianjin University filed Critical Tianjin University
Priority to CN202110993297.6A priority Critical patent/CN113971639A/en
Publication of CN113971639A publication Critical patent/CN113971639A/en
Pending legal-status Critical Current

Links

Classifications

    • GPHYSICS
    • G06COMPUTING; CALCULATING OR COUNTING
    • G06TIMAGE DATA PROCESSING OR GENERATION, IN GENERAL
    • G06T5/00Image enhancement or restoration
    • G06T5/90Dynamic range modification of images or parts thereof
    • G06T5/92Dynamic range modification of images or parts thereof based on global image properties
    • GPHYSICS
    • G06COMPUTING; CALCULATING OR COUNTING
    • G06NCOMPUTING ARRANGEMENTS BASED ON SPECIFIC COMPUTATIONAL MODELS
    • G06N3/00Computing arrangements based on biological models
    • G06N3/02Neural networks
    • G06N3/04Architecture, e.g. interconnection topology
    • G06N3/045Combinations of networks
    • GPHYSICS
    • G06COMPUTING; CALCULATING OR COUNTING
    • G06NCOMPUTING ARRANGEMENTS BASED ON SPECIFIC COMPUTATIONAL MODELS
    • G06N3/00Computing arrangements based on biological models
    • G06N3/02Neural networks
    • G06N3/08Learning methods
    • G06N3/088Non-supervised learning, e.g. competitive learning
    • GPHYSICS
    • G06COMPUTING; CALCULATING OR COUNTING
    • G06TIMAGE DATA PROCESSING OR GENERATION, IN GENERAL
    • G06T7/00Image analysis
    • G06T7/50Depth or shape recovery
    • GPHYSICS
    • G06COMPUTING; CALCULATING OR COUNTING
    • G06TIMAGE DATA PROCESSING OR GENERATION, IN GENERAL
    • G06T2207/00Indexing scheme for image analysis or image enhancement
    • G06T2207/20Special algorithmic details
    • G06T2207/20081Training; Learning
    • GPHYSICS
    • G06COMPUTING; CALCULATING OR COUNTING
    • G06TIMAGE DATA PROCESSING OR GENERATION, IN GENERAL
    • G06T2207/00Indexing scheme for image analysis or image enhancement
    • G06T2207/20Special algorithmic details
    • G06T2207/20084Artificial neural networks [ANN]
    • GPHYSICS
    • G06COMPUTING; CALCULATING OR COUNTING
    • G06TIMAGE DATA PROCESSING OR GENERATION, IN GENERAL
    • G06T2207/00Indexing scheme for image analysis or image enhancement
    • G06T2207/20Special algorithmic details
    • G06T2207/20172Image enhancement details
    • G06T2207/20208High dynamic range [HDR] image processing

Landscapes

  • Engineering & Computer Science (AREA)
  • Physics & Mathematics (AREA)
  • Theoretical Computer Science (AREA)
  • General Physics & Mathematics (AREA)
  • General Health & Medical Sciences (AREA)
  • General Engineering & Computer Science (AREA)
  • Biophysics (AREA)
  • Computational Linguistics (AREA)
  • Data Mining & Analysis (AREA)
  • Evolutionary Computation (AREA)
  • Artificial Intelligence (AREA)
  • Molecular Biology (AREA)
  • Computing Systems (AREA)
  • Biomedical Technology (AREA)
  • Life Sciences & Earth Sciences (AREA)
  • Mathematical Physics (AREA)
  • Software Systems (AREA)
  • Health & Medical Sciences (AREA)
  • Computer Vision & Pattern Recognition (AREA)
  • Image Processing (AREA)
  • Image Analysis (AREA)

Abstract

Digital cameras can only capture the brightness of a real scene within a limited range. Texture details present in High Dynamic Range (HDR) data are lost due to quantization and saturation of the camera sensor, which presents great difficulties for subsequent image-based applications (depth estimation, etc.). As an indicator of HDR image reconstruction effect, depth estimation errors caused by incorrect tones reflected in LDR images may support improving learning efficiency of HDR image generation networks. Therefore, we propose an end-to-end framework consisting of two connected CNNs for the most critical issues in HDR image reconstruction. By performing a large number of quantitative and qualitative experiments on the reference and recently released challenging data sets, it is demonstrated that the proposed method achieves extraordinary performance compared to the latest single image HDR reconstruction and depth estimation algorithms.

Description

Depth estimation based under-exposed LDR image reconstruction HDR image
Technical Field
The invention relates to a reconstruction technology of HDR images, depth estimation is introduced into an image reconstruction model, and the whole network is superior to other HDR image reconstruction methods in an unsupervised learning mode.
Background
Digital cameras can only capture the brightness of a real scene within a limited range. Due to quantization and saturation of the camera sensor, texture details present in the High Dynamic Range (HDR) data are lost, causing great difficulties for subsequent image-based applications (depth estimation, etc.). Unlike existing learning-based methods, which introduce depth estimation into a learning model for HDR image reconstruction, the proposed method facilitates each other. As an indicator of HDR image reconstruction effect, depth estimation errors caused by incorrect tones in LDR images may support improving learning efficiency of HDR image reconstruction networks. On the other hand, a good HDR image may also improve depth estimation accuracy. Therefore, we propose an end-to-end framework consisting of two connected CNNs for HDR image reconstruction and depth estimation. Since underexposed images are the most critical issue in HDR image reconstruction, the reconstruction of HDR images from underexposed LDR images is mainly studied herein. By performing a large number of quantitative and qualitative experiments on the reference and recently released challenging data sets, it is demonstrated that the proposed method achieves extraordinary performance compared to the latest single image HDR reconstruction and depth estimation algorithms.
Currently, HDR image reconstruction techniques are widely used. HDR images generally contain more texture information, can bring better visual experience to human beings, and are therefore widely used in the fields of photography, physics-based rendering, computer games, movies, medical and industrial imaging, and the like. Since digital high resolution cameras are typically very expensive, high resolution image reconstruction algorithms that generate high resolution images from images captured by conventional digital cameras have become a common method of high resolution imaging. Whereas high resolution imaging can preserve details in both the extremely dark and bright regions, it has great potential to aid various tasks, in particular depth estimation, which relies on accurate color and intensity information to explore the pixel spatial relationships to estimate depth, with much less depth error for normally exposed LDR images. Under-exposed LDR images are usually noisy, resulting in loss of illumination information, which is not favorable for extraction of effective features. In contrast, HDR retains rich texture and color information. Intuitively, in dark environments, humans cannot accurately estimate the depth of a scene. Inspired by the depth estimation performance of computer vision and human eye vision under different illumination conditions, in order to better acquire the depth information of an exposed area, the HDR technology can be used for solving the problem of insufficient depth information of the exposed area, so that the lost depth information is recovered, the error of depth estimation is reduced, and the reconstruction of an HDR image is facilitated.
Inverse tone mapping can recover information lost due to underexposure/overexposure and color quantization. The most common methods are to fuse a series of Low Dynamic Range (LDR) images of different exposure times produced by a conventional camera or to directly learn a single LDR to HDR mapping, generating HDR images, although these methods reconstruct HDR images that can meet visual requirements, in many applications they do not reflect satisfactory results.
Disclosure of Invention
The present invention proposes an efficient method to reconstruct HDR images and corresponding depth maps from underexposed LDR images. Firstly, HDR reconstruction is combined with depth estimation, and the HDR reconstruction problem is solved. Compared to most other methods accepting a normal exposure image as input, the overexposure model network herein generates LDR images of different exposures with an underexposed LDR image as input; compared with most methods for reconstructing HDR images, the learning framework does not need the HDR images as ground-route and trains the HDR reconstruction network by taking depth loss as a constraint condition.
Detailed Description
The present invention combines the knowledge of HDR reconstruction with depth estimation to design a model. The core idea of the invention is to integrate depth estimation into the HDR image reconstruction model and develop two connected depth neural networks to solve each task. The method comprises the following specific steps:
the first step is the HDR reconstruction network.
The method of inferring 32 bit HDR images directly from 8 bit LDR images is difficult to train with unsupervised learning, and HDR reconstruction networks do not map LDRs directly to HDR images, but employ indirect methods to generate LDR images of different exposures, which are then merged into HDR images. The specific method is to give the input LDR image, infer from the given underexposed image that the LDR image with higher exposure than the input image, then output the overexposed image, and finally generate the final HDR image from these encompassed LDR images.
The second step is a depth estimation network.
And taking the HDR image as an input of a depth estimation network to perform depth estimation on the scene. The framework provided by the invention adopts an unsupervised depth estimation network based on CNN, and a depth map is trained from an image as input. The specific method is that a pair of three-dimensional HDR images are respectively sent into the same CNN network model, corresponding disparity maps are output, then reconstructed HDR left and right images are obtained through a sampler, and the predicted parity check can be aligned with the opposite input images.
CNN firstly adopts an encoder structure to extract geometric features, a decoder structure further constrains output parallax, nearest pixels at peripheral positions are linearly searched through bilinear sampling, and a distorted image is formed
Figure BSA0000251157100000021
Ultimately generating a reconstructed HDR image
Figure BSA0000251157100000022
And
Figure BSA0000251157100000023
warping images
Figure BSA0000251157100000024
Predictive disparity map
Figure BSA0000251157100000025
The relationship with the original input image I is as follows:
Figure BSA0000251157100000026

Claims (3)

1. the present invention combines the knowledge of HDR reconstruction with depth estimation to design a model. The core idea of the invention is to integrate depth estimation into an HDR image reconstruction model and develop two connected depth neural networks to solve each task, and the method is characterized by comprising the following steps:
1) since the method of inferring the 32-bit HDR image directly from the 8-bit LDR image is difficult to train for unsupervised learning, the present invention proposes an efficient method to generate HDR images and corresponding accurate depth maps under the exposed LDR image;
2) the method combines a training HDR reconstruction network and a depth estimation network, adopts an unsupervised end-to-end training strategy, integrates depth estimation into an HDR image reconstruction model, and develops two connected depth networks to solve each task;
3) in the invention, residual connection is added between the convolution layer and the deconvolution layer to transmit information between different exposure images, the characteristic diagram contains a large amount of image details, and the characteristic diagram can be shared between different exposure images, which is beneficial to the deconvolution layer to recover better images, and in addition, the residual connection can also reversely transmit the gradient to the bottom, thereby being convenient for better training network;
4) the invention discusses the influence of a depth loss function on an HDR image and a depth image, and adopts an ablation research method.
2. The method of generating an HDR image of claim 1, wherein step 1) comprises:
(1) using the underexposed image as an input, deducing LDR images with higher exposure rate than the input image, wherein the information recorded by each LDR image corresponds to the content of different dynamic ranges in the real scene;
(2) the overexposed images are output and finally the final HDR image is generated from these LDR images.
3. The method of claim 1, wherein step 2) comprises:
(1) firstly, giving an input LDR image, and recovering detail information lost due to underexposure by using an HDR image generation network, wherein the HDR image generation network consists of a convolutional layer and an anti-convolutional layer and is used for learning different exposure representations, the convolutional layer serves as a feature extractor and retains main components of objects in the image, and the anti-convolutional layers are combined to recover the details of the image;
(2) taking HDR images as input of a depth estimation network, performing depth estimation on a scene, learning the whole network in an unsupervised mode, constraining the network learning process by utilizing the relation between the HDR images and the LDR images, and training by combining the LDR generated by HDR and the LDR generated by losing the consistency between the LDR images generated by RGB under different exposure amounts until the HDR and the LDR are well generated;
(3) respectively sending a pair of three-dimensional HDR images into the same network model, outputting corresponding disparity maps, then obtaining reconstructed HDR left and right images through a sampler, and aligning predicted parity with opposite input images;
(4) the depth estimation network adopts an encoder to extract geometric features, the decoder structure further constrains output parallax, nearest pixels of the surrounding positions are searched linearly through bilinear sampling, and finally a reconstructed image is generated.
(5) In order to make the reconstructed image consistent with the original image, appearance matching loss, parallax smoothness loss and left-right parallax consistency loss are respectively set.
CN202110993297.6A 2021-08-27 2021-08-27 Depth estimation based under-exposed LDR image reconstruction HDR image Pending CN113971639A (en)

Priority Applications (1)

Application Number Priority Date Filing Date Title
CN202110993297.6A CN113971639A (en) 2021-08-27 2021-08-27 Depth estimation based under-exposed LDR image reconstruction HDR image

Applications Claiming Priority (1)

Application Number Priority Date Filing Date Title
CN202110993297.6A CN113971639A (en) 2021-08-27 2021-08-27 Depth estimation based under-exposed LDR image reconstruction HDR image

Publications (1)

Publication Number Publication Date
CN113971639A true CN113971639A (en) 2022-01-25

Family

ID=79586367

Family Applications (1)

Application Number Title Priority Date Filing Date
CN202110993297.6A Pending CN113971639A (en) 2021-08-27 2021-08-27 Depth estimation based under-exposed LDR image reconstruction HDR image

Country Status (1)

Country Link
CN (1) CN113971639A (en)

Citations (5)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
US20130108183A1 (en) * 2010-07-06 2013-05-02 Koninklijke Philips Electronics N.V. Generation of high dynamic range images from low dynamic range images in multiview video coding
CN105705979A (en) * 2013-11-07 2016-06-22 三星电子株式会社 Method and system for creating a camera refocus effect
US9697592B1 (en) * 2015-12-30 2017-07-04 TCL Research America Inc. Computational-complexity adaptive method and system for transferring low dynamic range image to high dynamic range image
CN108805836A (en) * 2018-05-31 2018-11-13 大连理工大学 Method for correcting image based on the reciprocating HDR transformation of depth
CN111242883A (en) * 2020-01-10 2020-06-05 西安电子科技大学 Dynamic scene HDR reconstruction method based on deep learning

Patent Citations (5)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
US20130108183A1 (en) * 2010-07-06 2013-05-02 Koninklijke Philips Electronics N.V. Generation of high dynamic range images from low dynamic range images in multiview video coding
CN105705979A (en) * 2013-11-07 2016-06-22 三星电子株式会社 Method and system for creating a camera refocus effect
US9697592B1 (en) * 2015-12-30 2017-07-04 TCL Research America Inc. Computational-complexity adaptive method and system for transferring low dynamic range image to high dynamic range image
CN108805836A (en) * 2018-05-31 2018-11-13 大连理工大学 Method for correcting image based on the reciprocating HDR transformation of depth
CN111242883A (en) * 2020-01-10 2020-06-05 西安电子科技大学 Dynamic scene HDR reconstruction method based on deep learning

Non-Patent Citations (1)

* Cited by examiner, † Cited by third party
Title
陈文;王强;: "基于卷积神经网络的LDR图像重建HDR图像的方法研究", 包装工程, no. 05, 10 March 2020 (2020-03-10) *

Similar Documents

Publication Publication Date Title
Yang et al. Image correction via deep reciprocating HDR transformation
Yan et al. Deep HDR imaging via a non-local network
Lee et al. Deep chain hdri: Reconstructing a high dynamic range image from a single low dynamic range image
Zhou et al. Cross-view enhancement network for underwater images
Wang et al. Deep learning for hdr imaging: State-of-the-art and future trends
CN111242883B (en) Dynamic scene HDR reconstruction method based on deep learning
CN111986084A (en) Multi-camera low-illumination image quality enhancement method based on multi-task fusion
CN110225260B (en) Three-dimensional high dynamic range imaging method based on generation countermeasure network
CN113096029A (en) High dynamic range image generation method based on multi-branch codec neural network
CN111105376B (en) Single-exposure high-dynamic-range image generation method based on double-branch neural network
Zamir et al. Learning digital camera pipeline for extreme low-light imaging
CN110910336B (en) Three-dimensional high dynamic range imaging method based on full convolution neural network
CN113344773B (en) Single picture reconstruction HDR method based on multi-level dual feedback
Moriwaki et al. Hybrid loss for learning single-image-based HDR reconstruction
Yan et al. High dynamic range imaging via gradient-aware context aggregation network
Pan et al. Multi-exposure high dynamic range imaging with informative content enhanced network
CN111784596A (en) General endoscope image enhancement method and device based on generation of antagonistic neural network
CN113409247B (en) Multi-exposure fusion image quality evaluation method
Fu et al. Low-light raw video denoising with a high-quality realistic motion dataset
CN117237207A (en) Ghost-free high dynamic range light field imaging method for dynamic scene
CN112435165A (en) Two-stage video super-resolution reconstruction method based on generation countermeasure network
Tian et al. Deformable convolutional network constrained by contrastive learning for underwater image enhancement
Ye et al. Single exposure high dynamic range image reconstruction based on deep dual-branch network
CN116402908A (en) Dense light field image reconstruction method based on heterogeneous imaging
CN116245968A (en) Method for generating HDR image based on LDR image of transducer

Legal Events

Date Code Title Description
PB01 Publication
PB01 Publication
SE01 Entry into force of request for substantive examination
SE01 Entry into force of request for substantive examination