CN112381879A - 基于图像和三维模型的物体姿态估计方法、系统及介质 - Google Patents
基于图像和三维模型的物体姿态估计方法、系统及介质 Download PDFInfo
- Publication number
- CN112381879A CN112381879A CN202011278095.5A CN202011278095A CN112381879A CN 112381879 A CN112381879 A CN 112381879A CN 202011278095 A CN202011278095 A CN 202011278095A CN 112381879 A CN112381879 A CN 112381879A
- Authority
- CN
- China
- Prior art keywords
- image
- features
- dimensional model
- view
- dimensional
- Prior art date
- Legal status (The legal status is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the status listed.)
- Pending
Links
- 238000000034 method Methods 0.000 title claims abstract description 47
- 238000013507 mapping Methods 0.000 claims abstract description 35
- 230000000007 visual effect Effects 0.000 claims abstract description 25
- 238000013527 convolutional neural network Methods 0.000 claims abstract description 10
- 238000000605 extraction Methods 0.000 claims description 34
- 238000012549 training Methods 0.000 claims description 29
- 238000011176 pooling Methods 0.000 claims description 13
- 238000005457 optimization Methods 0.000 claims description 8
- 238000003860 storage Methods 0.000 claims description 8
- 238000009877 rendering Methods 0.000 claims description 6
- 239000013598 vector Substances 0.000 claims description 4
- 238000004422 calculation algorithm Methods 0.000 claims description 2
- 238000005516 engineering process Methods 0.000 abstract description 6
- 230000010365 information processing Effects 0.000 abstract description 2
- 230000006870 function Effects 0.000 description 18
- 238000010586 diagram Methods 0.000 description 5
- 238000012360 testing method Methods 0.000 description 4
- ORILYTVJVMAKLC-UHFFFAOYSA-N Adamantane Natural products C1C(C2)CC3CC1CC2C3 ORILYTVJVMAKLC-UHFFFAOYSA-N 0.000 description 2
- 230000009286 beneficial effect Effects 0.000 description 2
- 238000004364 calculation method Methods 0.000 description 2
- 238000004590 computer program Methods 0.000 description 2
- 239000000463 material Substances 0.000 description 2
- 230000003287 optical effect Effects 0.000 description 2
- 230000004075 alteration Effects 0.000 description 1
- 238000013528 artificial neural network Methods 0.000 description 1
- 230000003190 augmentative effect Effects 0.000 description 1
- 238000010276 construction Methods 0.000 description 1
- 238000013135 deep learning Methods 0.000 description 1
- 238000011161 development Methods 0.000 description 1
- 238000006073 displacement reaction Methods 0.000 description 1
- 238000011423 initialization method Methods 0.000 description 1
- 238000009434 installation Methods 0.000 description 1
- 238000004519 manufacturing process Methods 0.000 description 1
- 238000012986 modification Methods 0.000 description 1
- 230000004048 modification Effects 0.000 description 1
- 238000010606 normalization Methods 0.000 description 1
- 239000013307 optical fiber Substances 0.000 description 1
- 230000008569 process Effects 0.000 description 1
- 238000006467 substitution reaction Methods 0.000 description 1
Images
Classifications
-
- G—PHYSICS
- G06—COMPUTING; CALCULATING OR COUNTING
- G06T—IMAGE DATA PROCESSING OR GENERATION, IN GENERAL
- G06T7/00—Image analysis
- G06T7/70—Determining position or orientation of objects or cameras
- G06T7/73—Determining position or orientation of objects or cameras using feature-based methods
-
- G—PHYSICS
- G06—COMPUTING; CALCULATING OR COUNTING
- G06F—ELECTRIC DIGITAL DATA PROCESSING
- G06F18/00—Pattern recognition
- G06F18/20—Analysing
- G06F18/21—Design or setup of recognition systems or techniques; Extraction of features in feature space; Blind source separation
- G06F18/213—Feature extraction, e.g. by transforming the feature space; Summarisation; Mappings, e.g. subspace methods
-
- G—PHYSICS
- G06—COMPUTING; CALCULATING OR COUNTING
- G06F—ELECTRIC DIGITAL DATA PROCESSING
- G06F18/00—Pattern recognition
- G06F18/20—Analysing
- G06F18/21—Design or setup of recognition systems or techniques; Extraction of features in feature space; Blind source separation
- G06F18/214—Generating training patterns; Bootstrap methods, e.g. bagging or boosting
- G06F18/2148—Generating training patterns; Bootstrap methods, e.g. bagging or boosting characterised by the process organisation or structure, e.g. boosting cascade
-
- G—PHYSICS
- G06—COMPUTING; CALCULATING OR COUNTING
- G06N—COMPUTING ARRANGEMENTS BASED ON SPECIFIC COMPUTATIONAL MODELS
- G06N3/00—Computing arrangements based on biological models
- G06N3/02—Neural networks
- G06N3/04—Architecture, e.g. interconnection topology
- G06N3/045—Combinations of networks
-
- G—PHYSICS
- G06—COMPUTING; CALCULATING OR COUNTING
- G06N—COMPUTING ARRANGEMENTS BASED ON SPECIFIC COMPUTATIONAL MODELS
- G06N3/00—Computing arrangements based on biological models
- G06N3/02—Neural networks
- G06N3/08—Learning methods
- G06N3/084—Backpropagation, e.g. using gradient descent
-
- G—PHYSICS
- G06—COMPUTING; CALCULATING OR COUNTING
- G06V—IMAGE OR VIDEO RECOGNITION OR UNDERSTANDING
- G06V10/00—Arrangements for image or video recognition or understanding
- G06V10/40—Extraction of image or video features
- G06V10/44—Local feature extraction by analysis of parts of the pattern, e.g. by detecting edges, contours, loops, corners, strokes or intersections; Connectivity analysis, e.g. of connected components
-
- G—PHYSICS
- G06—COMPUTING; CALCULATING OR COUNTING
- G06T—IMAGE DATA PROCESSING OR GENERATION, IN GENERAL
- G06T2200/00—Indexing scheme for image data processing or generation, in general
- G06T2200/04—Indexing scheme for image data processing or generation, in general involving 3D image data
-
- G—PHYSICS
- G06—COMPUTING; CALCULATING OR COUNTING
- G06T—IMAGE DATA PROCESSING OR GENERATION, IN GENERAL
- G06T2207/00—Indexing scheme for image analysis or image enhancement
- G06T2207/20—Special algorithmic details
- G06T2207/20081—Training; Learning
-
- G—PHYSICS
- G06—COMPUTING; CALCULATING OR COUNTING
- G06T—IMAGE DATA PROCESSING OR GENERATION, IN GENERAL
- G06T2207/00—Indexing scheme for image analysis or image enhancement
- G06T2207/20—Special algorithmic details
- G06T2207/20084—Artificial neural networks [ANN]
Landscapes
- Engineering & Computer Science (AREA)
- Theoretical Computer Science (AREA)
- Physics & Mathematics (AREA)
- Data Mining & Analysis (AREA)
- General Physics & Mathematics (AREA)
- Computer Vision & Pattern Recognition (AREA)
- General Engineering & Computer Science (AREA)
- Life Sciences & Earth Sciences (AREA)
- Evolutionary Computation (AREA)
- Artificial Intelligence (AREA)
- Biomedical Technology (AREA)
- General Health & Medical Sciences (AREA)
- Evolutionary Biology (AREA)
- Health & Medical Sciences (AREA)
- Bioinformatics & Computational Biology (AREA)
- Biophysics (AREA)
- Computational Linguistics (AREA)
- Bioinformatics & Cheminformatics (AREA)
- Molecular Biology (AREA)
- Computing Systems (AREA)
- Mathematical Physics (AREA)
- Software Systems (AREA)
- Multimedia (AREA)
- Image Analysis (AREA)
Abstract
Description
Claims (10)
Priority Applications (2)
Application Number | Priority Date | Filing Date | Title |
---|---|---|---|
CN202011278095.5A CN112381879A (zh) | 2020-11-16 | 2020-11-16 | 基于图像和三维模型的物体姿态估计方法、系统及介质 |
PCT/CN2021/124660 WO2022100379A1 (zh) | 2020-11-16 | 2021-10-19 | 基于图像和三维模型的物体姿态估计方法、系统及介质 |
Applications Claiming Priority (1)
Application Number | Priority Date | Filing Date | Title |
---|---|---|---|
CN202011278095.5A CN112381879A (zh) | 2020-11-16 | 2020-11-16 | 基于图像和三维模型的物体姿态估计方法、系统及介质 |
Publications (1)
Publication Number | Publication Date |
---|---|
CN112381879A true CN112381879A (zh) | 2021-02-19 |
Family
ID=74584723
Family Applications (1)
Application Number | Title | Priority Date | Filing Date |
---|---|---|---|
CN202011278095.5A Pending CN112381879A (zh) | 2020-11-16 | 2020-11-16 | 基于图像和三维模型的物体姿态估计方法、系统及介质 |
Country Status (2)
Country | Link |
---|---|
CN (1) | CN112381879A (zh) |
WO (1) | WO2022100379A1 (zh) |
Cited By (6)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
CN113409290A (zh) * | 2021-06-29 | 2021-09-17 | 北京兆维电子(集团)有限责任公司 | 一种液晶屏外观缺陷检测方法、装置及存储介质 |
CN113643366A (zh) * | 2021-07-12 | 2021-11-12 | 中国科学院自动化研究所 | 一种多视角三维对象姿态估计方法及装置 |
WO2022100379A1 (zh) * | 2020-11-16 | 2022-05-19 | 华南理工大学 | 基于图像和三维模型的物体姿态估计方法、系统及介质 |
CN115115780A (zh) * | 2022-06-29 | 2022-09-27 | 聚好看科技股份有限公司 | 基于多视角rgbd相机的三维重建方法及系统 |
CN115223023A (zh) * | 2022-09-16 | 2022-10-21 | 杭州得闻天下数字文化科技有限公司 | 基于立体视觉和深度神经网络的人体轮廓估计方法及装置 |
CN117315152A (zh) * | 2023-09-27 | 2023-12-29 | 杭州一隅千象科技有限公司 | 双目立体成像方法及其系统 |
Families Citing this family (4)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
CN115219492B (zh) * | 2022-05-25 | 2023-03-28 | 中国科学院自动化研究所 | 一种三维物体的外观图像采集方法及装置 |
CN114821145B (zh) * | 2022-06-28 | 2022-09-23 | 山东百盟信息技术有限公司 | 一种基于数据修复的非完整多视角图像数据聚类方法 |
CN116643648B (zh) * | 2023-04-13 | 2023-12-19 | 中国兵器装备集团自动化研究所有限公司 | 一种三维场景匹配交互方法、装置、设备及存储介质 |
CN116168137B (zh) * | 2023-04-21 | 2023-07-11 | 湖南马栏山视频先进技术研究院有限公司 | 一种基于神经辐射场的新视角合成方法、装置及存储器 |
Citations (5)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
WO2017156243A1 (en) * | 2016-03-11 | 2017-09-14 | Siemens Aktiengesellschaft | Deep-learning based feature mining for 2.5d sensing image search |
CN108491880A (zh) * | 2018-03-23 | 2018-09-04 | 西安电子科技大学 | 基于神经网络的物体分类和位姿估计方法 |
CN109063301A (zh) * | 2018-07-24 | 2018-12-21 | 杭州师范大学 | 一种基于热力图的单幅图像室内物体姿态估计方法 |
CN109816725A (zh) * | 2019-01-17 | 2019-05-28 | 哈工大机器人(合肥)国际创新研究院 | 一种基于深度学习的单目相机物体位姿估计方法及装置 |
CN109934847A (zh) * | 2019-03-06 | 2019-06-25 | 视辰信息科技(上海)有限公司 | 弱纹理三维物体姿态估计的方法和装置 |
Family Cites Families (2)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
US11532094B2 (en) * | 2018-12-05 | 2022-12-20 | Qualcomm Technologies, Inc. | Systems and methods for three-dimensional pose determination |
CN112381879A (zh) * | 2020-11-16 | 2021-02-19 | 华南理工大学 | 基于图像和三维模型的物体姿态估计方法、系统及介质 |
-
2020
- 2020-11-16 CN CN202011278095.5A patent/CN112381879A/zh active Pending
-
2021
- 2021-10-19 WO PCT/CN2021/124660 patent/WO2022100379A1/zh active Application Filing
Patent Citations (5)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
WO2017156243A1 (en) * | 2016-03-11 | 2017-09-14 | Siemens Aktiengesellschaft | Deep-learning based feature mining for 2.5d sensing image search |
CN108491880A (zh) * | 2018-03-23 | 2018-09-04 | 西安电子科技大学 | 基于神经网络的物体分类和位姿估计方法 |
CN109063301A (zh) * | 2018-07-24 | 2018-12-21 | 杭州师范大学 | 一种基于热力图的单幅图像室内物体姿态估计方法 |
CN109816725A (zh) * | 2019-01-17 | 2019-05-28 | 哈工大机器人(合肥)国际创新研究院 | 一种基于深度学习的单目相机物体位姿估计方法及装置 |
CN109934847A (zh) * | 2019-03-06 | 2019-06-25 | 视辰信息科技(上海)有限公司 | 弱纹理三维物体姿态估计的方法和装置 |
Cited By (9)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
WO2022100379A1 (zh) * | 2020-11-16 | 2022-05-19 | 华南理工大学 | 基于图像和三维模型的物体姿态估计方法、系统及介质 |
CN113409290A (zh) * | 2021-06-29 | 2021-09-17 | 北京兆维电子(集团)有限责任公司 | 一种液晶屏外观缺陷检测方法、装置及存储介质 |
CN113409290B (zh) * | 2021-06-29 | 2023-12-15 | 北京兆维电子(集团)有限责任公司 | 一种液晶屏外观缺陷检测方法、装置及存储介质 |
CN113643366A (zh) * | 2021-07-12 | 2021-11-12 | 中国科学院自动化研究所 | 一种多视角三维对象姿态估计方法及装置 |
CN113643366B (zh) * | 2021-07-12 | 2024-03-05 | 中国科学院自动化研究所 | 一种多视角三维对象姿态估计方法及装置 |
CN115115780A (zh) * | 2022-06-29 | 2022-09-27 | 聚好看科技股份有限公司 | 基于多视角rgbd相机的三维重建方法及系统 |
CN115223023A (zh) * | 2022-09-16 | 2022-10-21 | 杭州得闻天下数字文化科技有限公司 | 基于立体视觉和深度神经网络的人体轮廓估计方法及装置 |
CN117315152A (zh) * | 2023-09-27 | 2023-12-29 | 杭州一隅千象科技有限公司 | 双目立体成像方法及其系统 |
CN117315152B (zh) * | 2023-09-27 | 2024-03-29 | 杭州一隅千象科技有限公司 | 双目立体成像方法及其系统 |
Also Published As
Publication number | Publication date |
---|---|
WO2022100379A1 (zh) | 2022-05-19 |
Similar Documents
Publication | Publication Date | Title |
---|---|---|
CN112381879A (zh) | 基于图像和三维模型的物体姿态估计方法、系统及介质 | |
Mescheder et al. | Occupancy networks: Learning 3d reconstruction in function space | |
Tang et al. | Emergent correspondence from image diffusion | |
Flynn et al. | Deepstereo: Learning to predict new views from the world's imagery | |
Riegler et al. | Atgv-net: Accurate depth super-resolution | |
Hu et al. | Render4Completion: Synthesizing multi-view depth maps for 3D shape completion | |
Li et al. | Detail-preserving and content-aware variational multi-view stereo reconstruction | |
CN110942512B (zh) | 基于元学习的室内场景重建方法 | |
CN113838191A (zh) | 一种基于注意力机制和单目多视角的三维重建方法 | |
CN112862949A (zh) | 基于多视图的物体3d形状重建方法 | |
Song et al. | Deep novel view synthesis from colored 3d point clouds | |
Chang et al. | Candidate-based matching of 3-D point clouds with axially switching pose estimation | |
CN114972634A (zh) | 基于特征体素融合的多视角三维可变形人脸重建方法 | |
CN114494395A (zh) | 基于平面先验的深度图生成方法、装置、设备及存储介质 | |
Huang et al. | A bayesian approach to multi-view 4d modeling | |
CN116091705A (zh) | 基于神经辐射场的变拓扑动态场景重建与编辑方法和装置 | |
Gadelha et al. | Inferring 3D shapes from image collections using adversarial networks | |
Ghosh et al. | Depth completion using a view-constrained deep prior | |
CN114998630A (zh) | 一种从粗到精的地对空图像配准方法 | |
CN114022458A (zh) | 骨架检测方法、装置、电子设备和计算机可读存储介质 | |
Brito | Autocalibration for structure from motion | |
Ventura et al. | Multiresolution co-clustering for uncalibrated multiview segmentation | |
Kim et al. | Category-specific upright orientation estimation for 3D model classification and retrieval | |
Yuan | A 3D photo-realistic environment simulator for mobile robots | |
Guo et al. | Shape generation via learning an adaptive multimodal prior |
Legal Events
Date | Code | Title | Description |
---|---|---|---|
PB01 | Publication | ||
PB01 | Publication | ||
SE01 | Entry into force of request for substantive examination | ||
SE01 | Entry into force of request for substantive examination | ||
TA01 | Transfer of patent application right | ||
TA01 | Transfer of patent application right |
Effective date of registration: 20240226 Address after: 510641 Industrial Building, Wushan South China University of Technology, Tianhe District, Guangzhou City, Guangdong Province Applicant after: Guangzhou South China University of Technology Asset Management Co.,Ltd. Country or region after: China Address before: 511458 Guangdong, Guangzhou, Nansha District, 25 South Road, South China Road, Guangzhou, China Applicant before: SOUTH CHINA University OF TECHNOLOGY Country or region before: China |
|
TA01 | Transfer of patent application right | ||
TA01 | Transfer of patent application right |
Effective date of registration: 20240402 Address after: 518057, Building 4, 512, Software Industry Base, No. 19, 17, and 18 Haitian Road, Binhai Community, Yuehai Street, Nanshan District, Shenzhen City, Guangdong Province Applicant after: Cross dimension (Shenzhen) Intelligent Digital Technology Co.,Ltd. Country or region after: China Address before: 510641 Industrial Building, Wushan South China University of Technology, Tianhe District, Guangzhou City, Guangdong Province Applicant before: Guangzhou South China University of Technology Asset Management Co.,Ltd. Country or region before: China |