CN109003297B - 一种单目深度估计方法、装置、终端和存储介质 - Google Patents
一种单目深度估计方法、装置、终端和存储介质 Download PDFInfo
- Publication number
- CN109003297B CN109003297B CN201810790093.0A CN201810790093A CN109003297B CN 109003297 B CN109003297 B CN 109003297B CN 201810790093 A CN201810790093 A CN 201810790093A CN 109003297 B CN109003297 B CN 109003297B
- Authority
- CN
- China
- Prior art keywords
- depth
- model
- discrimination
- image
- generation
- Prior art date
- Legal status (The legal status is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the status listed.)
- Active
Links
- 238000000034 method Methods 0.000 title claims abstract description 48
- 238000012549 training Methods 0.000 claims abstract description 77
- 230000006870 function Effects 0.000 claims description 87
- 238000004422 calculation algorithm Methods 0.000 claims description 12
- 238000005457 optimization Methods 0.000 claims description 11
- 230000003042 antagnostic effect Effects 0.000 claims description 10
- 238000004590 computer program Methods 0.000 claims description 5
- 239000000203 mixture Substances 0.000 claims description 4
- 239000000126 substance Substances 0.000 claims description 4
- 238000010586 diagram Methods 0.000 description 10
- 238000005070 sampling Methods 0.000 description 5
- 238000005516 engineering process Methods 0.000 description 3
- 230000003287 optical effect Effects 0.000 description 3
- 230000009286 beneficial effect Effects 0.000 description 2
- 238000004364 calculation method Methods 0.000 description 2
- 238000002939 conjugate gradient method Methods 0.000 description 2
- 238000013527 convolutional neural network Methods 0.000 description 2
- 238000011478 gradient descent method Methods 0.000 description 2
- 238000003062 neural network model Methods 0.000 description 2
- 238000011176 pooling Methods 0.000 description 2
- 238000012545 processing Methods 0.000 description 2
- 230000000644 propagated effect Effects 0.000 description 2
- 238000011160 research Methods 0.000 description 2
- 238000013528 artificial neural network Methods 0.000 description 1
- 230000003190 augmentative effect Effects 0.000 description 1
- 125000004122 cyclic group Chemical group 0.000 description 1
- 238000013500 data storage Methods 0.000 description 1
- 238000013135 deep learning Methods 0.000 description 1
- 239000000284 extract Substances 0.000 description 1
- 239000000835 fiber Substances 0.000 description 1
- 238000010295 mobile communication Methods 0.000 description 1
- 239000013307 optical fiber Substances 0.000 description 1
- 230000000750 progressive effect Effects 0.000 description 1
- 230000008707 rearrangement Effects 0.000 description 1
- 239000004065 semiconductor Substances 0.000 description 1
- 239000007787 solid Substances 0.000 description 1
- 238000006467 substitution reaction Methods 0.000 description 1
- 238000012360 testing method Methods 0.000 description 1
Images
Classifications
-
- G—PHYSICS
- G06—COMPUTING; CALCULATING OR COUNTING
- G06T—IMAGE DATA PROCESSING OR GENERATION, IN GENERAL
- G06T7/00—Image analysis
- G06T7/50—Depth or shape recovery
-
- G—PHYSICS
- G06—COMPUTING; CALCULATING OR COUNTING
- G06N—COMPUTING ARRANGEMENTS BASED ON SPECIFIC COMPUTATIONAL MODELS
- G06N3/00—Computing arrangements based on biological models
- G06N3/02—Neural networks
- G06N3/04—Architecture, e.g. interconnection topology
- G06N3/045—Combinations of networks
-
- G—PHYSICS
- G06—COMPUTING; CALCULATING OR COUNTING
- G06T—IMAGE DATA PROCESSING OR GENERATION, IN GENERAL
- G06T2207/00—Indexing scheme for image analysis or image enhancement
- G06T2207/10—Image acquisition modality
- G06T2207/10028—Range image; Depth image; 3D point clouds
-
- G—PHYSICS
- G06—COMPUTING; CALCULATING OR COUNTING
- G06T—IMAGE DATA PROCESSING OR GENERATION, IN GENERAL
- G06T2207/00—Indexing scheme for image analysis or image enhancement
- G06T2207/20—Special algorithmic details
- G06T2207/20081—Training; Learning
-
- G—PHYSICS
- G06—COMPUTING; CALCULATING OR COUNTING
- G06T—IMAGE DATA PROCESSING OR GENERATION, IN GENERAL
- G06T2207/00—Indexing scheme for image analysis or image enhancement
- G06T2207/20—Special algorithmic details
- G06T2207/20084—Artificial neural networks [ANN]
Landscapes
- Engineering & Computer Science (AREA)
- Physics & Mathematics (AREA)
- Theoretical Computer Science (AREA)
- General Physics & Mathematics (AREA)
- General Health & Medical Sciences (AREA)
- Molecular Biology (AREA)
- Biophysics (AREA)
- Computational Linguistics (AREA)
- Data Mining & Analysis (AREA)
- Evolutionary Computation (AREA)
- Artificial Intelligence (AREA)
- Biomedical Technology (AREA)
- Computing Systems (AREA)
- General Engineering & Computer Science (AREA)
- Life Sciences & Earth Sciences (AREA)
- Mathematical Physics (AREA)
- Software Systems (AREA)
- Health & Medical Sciences (AREA)
- Computer Vision & Pattern Recognition (AREA)
- Image Analysis (AREA)
Abstract
Description
Claims (10)
Priority Applications (1)
Application Number | Priority Date | Filing Date | Title |
---|---|---|---|
CN201810790093.0A CN109003297B (zh) | 2018-07-18 | 2018-07-18 | 一种单目深度估计方法、装置、终端和存储介质 |
Applications Claiming Priority (1)
Application Number | Priority Date | Filing Date | Title |
---|---|---|---|
CN201810790093.0A CN109003297B (zh) | 2018-07-18 | 2018-07-18 | 一种单目深度估计方法、装置、终端和存储介质 |
Publications (2)
Publication Number | Publication Date |
---|---|
CN109003297A CN109003297A (zh) | 2018-12-14 |
CN109003297B true CN109003297B (zh) | 2020-11-24 |
Family
ID=64599844
Family Applications (1)
Application Number | Title | Priority Date | Filing Date |
---|---|---|---|
CN201810790093.0A Active CN109003297B (zh) | 2018-07-18 | 2018-07-18 | 一种单目深度估计方法、装置、终端和存储介质 |
Country Status (1)
Country | Link |
---|---|
CN (1) | CN109003297B (zh) |
Families Citing this family (10)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
CN109635770A (zh) * | 2018-12-20 | 2019-04-16 | 上海瑾盛通信科技有限公司 | 活体检测方法、装置、存储介质及电子设备 |
CN109753071B (zh) * | 2019-01-10 | 2022-04-22 | 上海物景智能科技有限公司 | 一种机器人贴边行走方法及系统 |
CN110264505B (zh) * | 2019-06-05 | 2021-07-30 | 北京达佳互联信息技术有限公司 | 一种单目深度估计方法、装置、电子设备及存储介质 |
CN112241976A (zh) * | 2019-07-19 | 2021-01-19 | 杭州海康威视数字技术股份有限公司 | 一种训练模型的方法及装置 |
CN110599532A (zh) * | 2019-09-18 | 2019-12-20 | 厦门美图之家科技有限公司 | 图像的深度估计模型优化、深度估计处理方法及装置 |
CN110674759A (zh) * | 2019-09-26 | 2020-01-10 | 深圳市捷顺科技实业股份有限公司 | 一种基于深度图的单目人脸活体检测方法、装置及设备 |
CN111429501A (zh) * | 2020-03-25 | 2020-07-17 | 贝壳技术有限公司 | 深度图预测模型生成方法和装置、深度图预测方法和装置 |
CN111428859A (zh) * | 2020-03-05 | 2020-07-17 | 北京三快在线科技有限公司 | 自动驾驶场景的深度估计网络训练方法、装置和自主车辆 |
CN111861949B (zh) * | 2020-04-21 | 2023-07-04 | 北京联合大学 | 一种基于生成对抗网络的多曝光图像融合方法及系统 |
TWI825566B (zh) * | 2022-01-24 | 2023-12-11 | 宏碁股份有限公司 | 立體影像產生裝置與立體影像產生方法 |
Citations (6)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
WO2015122674A1 (ko) * | 2014-02-13 | 2015-08-20 | 고려대학교 산학협력단 | 깊이 지도를 생성하는 방법 및 장치 |
CN107133934A (zh) * | 2017-05-18 | 2017-09-05 | 北京小米移动软件有限公司 | 图像补全方法及装置 |
CN107437077A (zh) * | 2017-08-04 | 2017-12-05 | 深圳市唯特视科技有限公司 | 一种基于生成对抗网络的旋转面部表示学习的方法 |
CN108090902A (zh) * | 2017-12-30 | 2018-05-29 | 中国传媒大学 | 一种基于多尺度生成对抗网络的无参考图像质量客观评价方法 |
CN108122249A (zh) * | 2017-12-20 | 2018-06-05 | 长沙全度影像科技有限公司 | 一种基于gan网络深度学习模型的光流估计方法 |
CN108197525A (zh) * | 2017-11-20 | 2018-06-22 | 中国科学院自动化研究所 | 人脸图像生成方法及装置 |
-
2018
- 2018-07-18 CN CN201810790093.0A patent/CN109003297B/zh active Active
Patent Citations (6)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
WO2015122674A1 (ko) * | 2014-02-13 | 2015-08-20 | 고려대학교 산학협력단 | 깊이 지도를 생성하는 방법 및 장치 |
CN107133934A (zh) * | 2017-05-18 | 2017-09-05 | 北京小米移动软件有限公司 | 图像补全方法及装置 |
CN107437077A (zh) * | 2017-08-04 | 2017-12-05 | 深圳市唯特视科技有限公司 | 一种基于生成对抗网络的旋转面部表示学习的方法 |
CN108197525A (zh) * | 2017-11-20 | 2018-06-22 | 中国科学院自动化研究所 | 人脸图像生成方法及装置 |
CN108122249A (zh) * | 2017-12-20 | 2018-06-05 | 长沙全度影像科技有限公司 | 一种基于gan网络深度学习模型的光流估计方法 |
CN108090902A (zh) * | 2017-12-30 | 2018-05-29 | 中国传媒大学 | 一种基于多尺度生成对抗网络的无参考图像质量客观评价方法 |
Non-Patent Citations (4)
Title |
---|
Deep Convolutional neural fields for depth estimation from a single image;F.Liu et al.;《In Proceedings of the IEEE Conference on Computer Vision and pattern Recognition》;20151231;第5162-5170页 * |
LSD-SLAM:large-scale direct monocular SLAM;Jakob Engel et al.;《In Eurpean Conference on Computer Vision. Springer》;20141231;第834-849页 * |
何东超.基于深度学习和用户交互的单张图像深度恢复算法研究.《中国优秀硕士学位论文全文数据库 信息科技辑》.2018, * |
基于深度学习和用户交互的单张图像深度恢复算法研究;何东超;《中国优秀硕士学位论文全文数据库 信息科技辑》;20180615;第5章 * |
Also Published As
Publication number | Publication date |
---|---|
CN109003297A (zh) | 2018-12-14 |
Similar Documents
Publication | Publication Date | Title |
---|---|---|
CN109003297B (zh) | 一种单目深度估计方法、装置、终端和存储介质 | |
CN109087349B (zh) | 一种单目深度估计方法、装置、终端和存储介质 | |
US10614574B2 (en) | Generating image segmentation data using a multi-branch neural network | |
CN107274445B (zh) | 一种图像深度估计方法和系统 | |
US20200034971A1 (en) | Image Object Segmentation Based on Temporal Information | |
CN111738110A (zh) | 基于多尺度注意力机制的遥感图像车辆目标检测方法 | |
CN110717851A (zh) | 图像处理方法及装置、神经网络的训练方法、存储介质 | |
CN108764039B (zh) | 神经网络、遥感影像的建筑物提取方法、介质及计算设备 | |
CN114758337B (zh) | 一种语义实例重建方法、装置、设备及介质 | |
CN112581379A (zh) | 图像增强方法以及装置 | |
CN112990232B (zh) | 面向多种高空作业施工现场的安全带佩戴识别与检测方法 | |
CN111047630A (zh) | 神经网络和基于神经网络的目标检测及深度预测方法 | |
CN110633718B (zh) | 用于确定环境图像中的行驶区域的方法和装置 | |
CN111222522A (zh) | 神经网络训练、路面检测、智能驾驶控制方法和装置 | |
CN114419490A (zh) | 一种基于注意力金字塔的sar船只目标检测方法 | |
CN108734712B (zh) | 背景分割的方法、装置及计算机存储介质 | |
CN111292331B (zh) | 图像处理的方法与装置 | |
CN116258756B (zh) | 一种自监督单目深度估计方法及系统 | |
CN111611835A (zh) | 一种船只检测方法及装置 | |
CN116012483A (zh) | 一种图像渲染的方法、装置、存储介质及电子设备 | |
CN116009581A (zh) | 输电线路的无人机巡检方法、无人机控制终端及存储介质 | |
CN112651351B (zh) | 一种数据处理的方法和装置 | |
CN116883770A (zh) | 深度估计模型的训练方法、装置、电子设备及存储介质 | |
US20240046601A1 (en) | Deep recognition model training method, electronic device and readable storage medium | |
CN117078984B (zh) | 双目图像处理方法、装置、电子设备及存储介质 |
Legal Events
Date | Code | Title | Description |
---|---|---|---|
PB01 | Publication | ||
PB01 | Publication | ||
SE01 | Entry into force of request for substantive examination | ||
SE01 | Entry into force of request for substantive examination | ||
GR01 | Patent grant | ||
GR01 | Patent grant | ||
TR01 | Transfer of patent right |
Effective date of registration: 20210924 Address after: Room 501 / 503-505, 570 shengxia Road, China (Shanghai) pilot Free Trade Zone, Pudong New Area, Shanghai, 201203 Patentee after: HISCENE INFORMATION TECHNOLOGY Co.,Ltd. Patentee after: HUAZHONG University OF SCIENCE AND TECHNOLOGY Address before: Room 501 / 503-505, 570 shengxia Road, China (Shanghai) pilot Free Trade Zone, Pudong New Area, Shanghai, 201203 Patentee before: HISCENE INFORMATION TECHNOLOGY Co.,Ltd. |
|
TR01 | Transfer of patent right | ||
TR01 | Transfer of patent right | ||
TR01 | Transfer of patent right |
Effective date of registration: 20211223 Address after: Room 501 / 503-505, 570 shengxia Road, China (Shanghai) pilot Free Trade Zone, Pudong New Area, Shanghai, 201203 Patentee after: HISCENE INFORMATION TECHNOLOGY Co.,Ltd. Address before: Room 501 / 503-505, 570 shengxia Road, China (Shanghai) pilot Free Trade Zone, Pudong New Area, Shanghai, 201203 Patentee before: HISCENE INFORMATION TECHNOLOGY Co.,Ltd. Patentee before: Huazhong University of Science and Technology |
|
PE01 | Entry into force of the registration of the contract for pledge of patent right |
Denomination of invention: A monocular depth estimation method, device, terminal and storage medium Effective date of registration: 20221008 Granted publication date: 20201124 Pledgee: Industrial Bank Co.,Ltd. Shanghai Xuhui sub branch Pledgor: HISCENE INFORMATION TECHNOLOGY Co.,Ltd. Registration number: Y2022310000277 |
|
PE01 | Entry into force of the registration of the contract for pledge of patent right | ||
CP02 | Change in the address of a patent holder |
Address after: 201210 7th Floor, No. 1, Lane 5005, Shenjiang Road, China (Shanghai) Pilot Free Trade Zone, Pudong New Area, Shanghai Patentee after: HISCENE INFORMATION TECHNOLOGY Co.,Ltd. Address before: Room 501 / 503-505, 570 shengxia Road, China (Shanghai) pilot Free Trade Zone, Pudong New Area, Shanghai, 201203 Patentee before: HISCENE INFORMATION TECHNOLOGY Co.,Ltd. |
|
CP02 | Change in the address of a patent holder | ||
PC01 | Cancellation of the registration of the contract for pledge of patent right |
Date of cancellation: 20230906 Granted publication date: 20201124 Pledgee: Industrial Bank Co.,Ltd. Shanghai Xuhui sub branch Pledgor: HISCENE INFORMATION TECHNOLOGY Co.,Ltd. Registration number: Y2022310000277 |
|
PC01 | Cancellation of the registration of the contract for pledge of patent right | ||
PE01 | Entry into force of the registration of the contract for pledge of patent right |
Denomination of invention: A monocular depth estimation method, device, terminal, and storage medium Effective date of registration: 20231107 Granted publication date: 20201124 Pledgee: Industrial Bank Co.,Ltd. Shanghai Caohejing sub branch Pledgor: HISCENE INFORMATION TECHNOLOGY Co.,Ltd. Registration number: Y2023310000719 |
|
PE01 | Entry into force of the registration of the contract for pledge of patent right |