CN117576665B - Automatic driving-oriented single-camera three-dimensional target detection method and system - Google Patents
Automatic driving-oriented single-camera three-dimensional target detection method and system Download PDFInfo
- Publication number
- CN117576665B CN117576665B CN202410077692.3A CN202410077692A CN117576665B CN 117576665 B CN117576665 B CN 117576665B CN 202410077692 A CN202410077692 A CN 202410077692A CN 117576665 B CN117576665 B CN 117576665B
- Authority
- CN
- China
- Prior art keywords
- depth
- dimensional
- uncertainty
- target
- predicted
- Prior art date
- Legal status (The legal status is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the status listed.)
- Active
Links
- 238000001514 detection method Methods 0.000 title claims abstract description 131
- 238000000034 method Methods 0.000 claims abstract description 47
- 230000004927 fusion Effects 0.000 claims abstract description 28
- 238000000605 extraction Methods 0.000 claims abstract description 18
- 238000004364 calculation method Methods 0.000 claims description 9
- 238000010606 normalization Methods 0.000 claims description 8
- 230000004913 activation Effects 0.000 claims description 2
- 230000003044 adaptive effect Effects 0.000 claims description 2
- 238000011176 pooling Methods 0.000 claims description 2
- 230000006870 function Effects 0.000 description 8
- 238000010586 diagram Methods 0.000 description 6
- 230000008569 process Effects 0.000 description 6
- 238000012549 training Methods 0.000 description 6
- 238000011156 evaluation Methods 0.000 description 2
- 238000012986 modification Methods 0.000 description 2
- 230000004048 modification Effects 0.000 description 2
- 238000005457 optimization Methods 0.000 description 2
- 238000012795 verification Methods 0.000 description 2
- 238000013459 approach Methods 0.000 description 1
- 238000013528 artificial neural network Methods 0.000 description 1
- 230000009286 beneficial effect Effects 0.000 description 1
- 230000008901 benefit Effects 0.000 description 1
- 230000015556 catabolic process Effects 0.000 description 1
- 230000008859 change Effects 0.000 description 1
- 230000007547 defect Effects 0.000 description 1
- 238000006731 degradation reaction Methods 0.000 description 1
- 230000001419 dependent effect Effects 0.000 description 1
- 230000000694 effects Effects 0.000 description 1
- 238000003384 imaging method Methods 0.000 description 1
- 238000013507 mapping Methods 0.000 description 1
- 238000003062 neural network model Methods 0.000 description 1
- 238000011897 real-time detection Methods 0.000 description 1
- 238000011160 research Methods 0.000 description 1
Classifications
-
- G—PHYSICS
- G06—COMPUTING; CALCULATING OR COUNTING
- G06V—IMAGE OR VIDEO RECOGNITION OR UNDERSTANDING
- G06V20/00—Scenes; Scene-specific elements
- G06V20/50—Context or environment of the image
- G06V20/56—Context or environment of the image exterior to a vehicle by using sensors mounted on the vehicle
- G06V20/58—Recognition of moving objects or obstacles, e.g. vehicles or pedestrians; Recognition of traffic objects, e.g. traffic signs, traffic lights or roads
-
- G—PHYSICS
- G06—COMPUTING; CALCULATING OR COUNTING
- G06T—IMAGE DATA PROCESSING OR GENERATION, IN GENERAL
- G06T7/00—Image analysis
- G06T7/50—Depth or shape recovery
- G06T7/55—Depth or shape recovery from multiple images
- G06T7/593—Depth or shape recovery from multiple images from stereo images
-
- G—PHYSICS
- G06—COMPUTING; CALCULATING OR COUNTING
- G06V—IMAGE OR VIDEO RECOGNITION OR UNDERSTANDING
- G06V10/00—Arrangements for image or video recognition or understanding
- G06V10/20—Image preprocessing
- G06V10/25—Determination of region of interest [ROI] or a volume of interest [VOI]
-
- G—PHYSICS
- G06—COMPUTING; CALCULATING OR COUNTING
- G06V—IMAGE OR VIDEO RECOGNITION OR UNDERSTANDING
- G06V10/00—Arrangements for image or video recognition or understanding
- G06V10/70—Arrangements for image or video recognition or understanding using pattern recognition or machine learning
- G06V10/764—Arrangements for image or video recognition or understanding using pattern recognition or machine learning using classification, e.g. of video objects
-
- G—PHYSICS
- G06—COMPUTING; CALCULATING OR COUNTING
- G06V—IMAGE OR VIDEO RECOGNITION OR UNDERSTANDING
- G06V10/00—Arrangements for image or video recognition or understanding
- G06V10/70—Arrangements for image or video recognition or understanding using pattern recognition or machine learning
- G06V10/77—Processing image or video features in feature spaces; using data integration or data reduction, e.g. principal component analysis [PCA] or independent component analysis [ICA] or self-organising maps [SOM]; Blind source separation
- G06V10/80—Fusion, i.e. combining data from various sources at the sensor level, preprocessing level, feature extraction level or classification level
- G06V10/806—Fusion, i.e. combining data from various sources at the sensor level, preprocessing level, feature extraction level or classification level of extracted features
-
- G—PHYSICS
- G06—COMPUTING; CALCULATING OR COUNTING
- G06V—IMAGE OR VIDEO RECOGNITION OR UNDERSTANDING
- G06V10/00—Arrangements for image or video recognition or understanding
- G06V10/70—Arrangements for image or video recognition or understanding using pattern recognition or machine learning
- G06V10/82—Arrangements for image or video recognition or understanding using pattern recognition or machine learning using neural networks
-
- G—PHYSICS
- G06—COMPUTING; CALCULATING OR COUNTING
- G06V—IMAGE OR VIDEO RECOGNITION OR UNDERSTANDING
- G06V20/00—Scenes; Scene-specific elements
- G06V20/60—Type of objects
- G06V20/64—Three-dimensional objects
-
- G—PHYSICS
- G06—COMPUTING; CALCULATING OR COUNTING
- G06V—IMAGE OR VIDEO RECOGNITION OR UNDERSTANDING
- G06V2201/00—Indexing scheme relating to image or video recognition or understanding
- G06V2201/07—Target detection
-
- Y—GENERAL TAGGING OF NEW TECHNOLOGICAL DEVELOPMENTS; GENERAL TAGGING OF CROSS-SECTIONAL TECHNOLOGIES SPANNING OVER SEVERAL SECTIONS OF THE IPC; TECHNICAL SUBJECTS COVERED BY FORMER USPC CROSS-REFERENCE ART COLLECTIONS [XRACs] AND DIGESTS
- Y02—TECHNOLOGIES OR APPLICATIONS FOR MITIGATION OR ADAPTATION AGAINST CLIMATE CHANGE
- Y02T—CLIMATE CHANGE MITIGATION TECHNOLOGIES RELATED TO TRANSPORTATION
- Y02T10/00—Road transport of goods or passengers
- Y02T10/10—Internal combustion engine [ICE] based vehicles
- Y02T10/40—Engine management systems
Landscapes
- Engineering & Computer Science (AREA)
- Theoretical Computer Science (AREA)
- Physics & Mathematics (AREA)
- General Physics & Mathematics (AREA)
- Multimedia (AREA)
- Computer Vision & Pattern Recognition (AREA)
- Evolutionary Computation (AREA)
- Computing Systems (AREA)
- Artificial Intelligence (AREA)
- Databases & Information Systems (AREA)
- Health & Medical Sciences (AREA)
- General Health & Medical Sciences (AREA)
- Medical Informatics (AREA)
- Software Systems (AREA)
- Image Analysis (AREA)
- Image Processing (AREA)
Abstract
Description
Claims (8)
Priority Applications (1)
Application Number | Priority Date | Filing Date | Title |
---|---|---|---|
CN202410077692.3A CN117576665B (en) | 2024-01-19 | 2024-01-19 | Automatic driving-oriented single-camera three-dimensional target detection method and system |
Applications Claiming Priority (1)
Application Number | Priority Date | Filing Date | Title |
---|---|---|---|
CN202410077692.3A CN117576665B (en) | 2024-01-19 | 2024-01-19 | Automatic driving-oriented single-camera three-dimensional target detection method and system |
Publications (2)
Publication Number | Publication Date |
---|---|
CN117576665A CN117576665A (en) | 2024-02-20 |
CN117576665B true CN117576665B (en) | 2024-04-16 |
Family
ID=89890470
Family Applications (1)
Application Number | Title | Priority Date | Filing Date |
---|---|---|---|
CN202410077692.3A Active CN117576665B (en) | 2024-01-19 | 2024-01-19 | Automatic driving-oriented single-camera three-dimensional target detection method and system |
Country Status (1)
Country | Link |
---|---|
CN (1) | CN117576665B (en) |
Families Citing this family (1)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
CN118447468B (en) * | 2024-07-08 | 2024-09-20 | 山西省财政税务专科学校 | Monocular three-dimensional detection method and device based on spatial relationship between adjacent targets |
Citations (5)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
CN111325794A (en) * | 2020-02-23 | 2020-06-23 | 哈尔滨工业大学 | Visual simultaneous localization and map construction method based on depth convolution self-encoder |
US11004233B1 (en) * | 2020-05-01 | 2021-05-11 | Ynjiun Paul Wang | Intelligent vision-based detection and ranging system and method |
CN113159151A (en) * | 2021-04-12 | 2021-07-23 | 中国科学技术大学 | Multi-sensor depth fusion 3D target detection method for automatic driving |
CN115222789A (en) * | 2022-07-15 | 2022-10-21 | 杭州飞步科技有限公司 | Training method, device and equipment for instance depth estimation model |
CN116580085A (en) * | 2023-03-13 | 2023-08-11 | 联通(上海)产业互联网有限公司 | Deep learning algorithm for 6D pose estimation based on attention mechanism |
-
2024
- 2024-01-19 CN CN202410077692.3A patent/CN117576665B/en active Active
Patent Citations (5)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
CN111325794A (en) * | 2020-02-23 | 2020-06-23 | 哈尔滨工业大学 | Visual simultaneous localization and map construction method based on depth convolution self-encoder |
US11004233B1 (en) * | 2020-05-01 | 2021-05-11 | Ynjiun Paul Wang | Intelligent vision-based detection and ranging system and method |
CN113159151A (en) * | 2021-04-12 | 2021-07-23 | 中国科学技术大学 | Multi-sensor depth fusion 3D target detection method for automatic driving |
CN115222789A (en) * | 2022-07-15 | 2022-10-21 | 杭州飞步科技有限公司 | Training method, device and equipment for instance depth estimation model |
CN116580085A (en) * | 2023-03-13 | 2023-08-11 | 联通(上海)产业互联网有限公司 | Deep learning algorithm for 6D pose estimation based on attention mechanism |
Non-Patent Citations (1)
Title |
---|
基于单目图像的自动驾驶三维目标检测算法研究;乔德文;《中国优秀硕士学位论文全文数据库 工程科技Ⅱ辑》;20240115;C035-406 * |
Also Published As
Publication number | Publication date |
---|---|
CN117576665A (en) | 2024-02-20 |
Similar Documents
Publication | Publication Date | Title |
---|---|---|
US9990736B2 (en) | Robust anytime tracking combining 3D shape, color, and motion with annealed dynamic histograms | |
CN114565900A (en) | Target detection method based on improved YOLOv5 and binocular stereo vision | |
CN111201451A (en) | Method and device for detecting object in scene based on laser data and radar data of scene | |
CN117576665B (en) | Automatic driving-oriented single-camera three-dimensional target detection method and system | |
CN110197106A (en) | Object designation system and method | |
KR20210090384A (en) | Method and Apparatus for Detecting 3D Object Using Camera and Lidar Sensor | |
US20220129685A1 (en) | System and Method for Determining Object Characteristics in Real-time | |
CN110992424B (en) | Positioning method and system based on binocular vision | |
CN113092807B (en) | Urban overhead road vehicle speed measuring method based on multi-target tracking algorithm | |
CN113281718B (en) | 3D multi-target tracking system and method based on laser radar scene flow estimation | |
CN114495064A (en) | Monocular depth estimation-based vehicle surrounding obstacle early warning method | |
CN114372523A (en) | Binocular matching uncertainty estimation method based on evidence deep learning | |
CN111862147B (en) | Tracking method for multiple vehicles and multiple lines of human targets in video | |
CN116310673A (en) | Three-dimensional target detection method based on fusion of point cloud and image features | |
CN114608522B (en) | Obstacle recognition and distance measurement method based on vision | |
CN113112547A (en) | Robot, repositioning method thereof, positioning device and storage medium | |
CN115937520A (en) | Point cloud moving target segmentation method based on semantic information guidance | |
CN117523514A (en) | Cross-attention-based radar vision fusion data target detection method and system | |
CN115909268A (en) | Dynamic obstacle detection method and device | |
CN112699748B (en) | Human-vehicle distance estimation method based on YOLO and RGB image | |
CN114220138A (en) | Face alignment method, training method, device and storage medium | |
CN111709269B (en) | Human hand segmentation method and device based on two-dimensional joint information in depth image | |
CN112712062A (en) | Monocular three-dimensional object detection method and device based on decoupling truncated object | |
CN116740519A (en) | Three-dimensional target detection method, system and storage medium for close-range and long-range multi-dimensional fusion | |
CN114140497A (en) | Target vehicle 3D real-time tracking method and system |
Legal Events
Date | Code | Title | Description |
---|---|---|---|
PB01 | Publication | ||
PB01 | Publication | ||
SE01 | Entry into force of request for substantive examination | ||
SE01 | Entry into force of request for substantive examination | ||
GR01 | Patent grant | ||
GR01 | Patent grant | ||
EE01 | Entry into force of recordation of patent licensing contract |
Application publication date: 20240220 Assignee: Nanjing Benli Information Technology Co.,Ltd. Assignor: NANJING University OF POSTS AND TELECOMMUNICATIONS Contract record no.: X2024980016890 Denomination of invention: A single camera 3D object detection method and system for autonomous driving Granted publication date: 20240416 License type: Common License Record date: 20240927 Application publication date: 20240220 Assignee: Nanjing Eryuefei Network Technology Co.,Ltd. Assignor: NANJING University OF POSTS AND TELECOMMUNICATIONS Contract record no.: X2024980016831 Denomination of invention: A single camera 3D object detection method and system for autonomous driving Granted publication date: 20240416 License type: Common License Record date: 20240927 |
|
EE01 | Entry into force of recordation of patent licensing contract |
Application publication date: 20240220 Assignee: Nanjing Zhujin Intelligent Technology Co.,Ltd. Assignor: NANJING University OF POSTS AND TELECOMMUNICATIONS Contract record no.: X2024980017765 Denomination of invention: A single camera 3D object detection method and system for autonomous driving Granted publication date: 20240416 License type: Common License Record date: 20241010 Application publication date: 20240220 Assignee: Nanjing Shangyao Electronic Technology Co.,Ltd. Assignor: NANJING University OF POSTS AND TELECOMMUNICATIONS Contract record no.: X2024980017764 Denomination of invention: A single camera 3D object detection method and system for autonomous driving Granted publication date: 20240416 License type: Common License Record date: 20241010 Application publication date: 20240220 Assignee: Nanjing Qida Network Technology Co.,Ltd. Assignor: NANJING University OF POSTS AND TELECOMMUNICATIONS Contract record no.: X2024980017763 Denomination of invention: A single camera 3D object detection method and system for autonomous driving Granted publication date: 20240416 License type: Common License Record date: 20241010 Application publication date: 20240220 Assignee: Nanjing Donglai Information Technology Co.,Ltd. Assignor: NANJING University OF POSTS AND TELECOMMUNICATIONS Contract record no.: X2024980017666 Denomination of invention: A single camera 3D object detection method and system for autonomous driving Granted publication date: 20240416 License type: Common License Record date: 20241009 Application publication date: 20240220 Assignee: Nanjing Zijin Information Technology Co.,Ltd. Assignor: NANJING University OF POSTS AND TELECOMMUNICATIONS Contract record no.: X2024980017766 Denomination of invention: A single camera 3D object detection method and system for autonomous driving Granted publication date: 20240416 License type: Common License Record date: 20241010 |
|
EE01 | Entry into force of recordation of patent licensing contract |
Application publication date: 20240220 Assignee: Nanjing Yuanshen Intelligent Technology R&D Co.,Ltd. Assignor: NANJING University OF POSTS AND TELECOMMUNICATIONS Contract record no.: X2024980018301 Denomination of invention: A single camera 3D object detection method and system for autonomous driving Granted publication date: 20240416 License type: Common License Record date: 20241012 Application publication date: 20240220 Assignee: Nanjing Yuze Robot Technology Co.,Ltd. Assignor: NANJING University OF POSTS AND TELECOMMUNICATIONS Contract record no.: X2024980018300 Denomination of invention: A single camera 3D object detection method and system for autonomous driving Granted publication date: 20240416 License type: Common License Record date: 20241012 Application publication date: 20240220 Assignee: Nanjing Zhongyang Information Technology Co.,Ltd. Assignor: NANJING University OF POSTS AND TELECOMMUNICATIONS Contract record no.: X2024980018299 Denomination of invention: A single camera 3D object detection method and system for autonomous driving Granted publication date: 20240416 License type: Common License Record date: 20241012 Application publication date: 20240220 Assignee: Nanjing Fangtai Intelligent Technology Co.,Ltd. Assignor: NANJING University OF POSTS AND TELECOMMUNICATIONS Contract record no.: X2024980018298 Denomination of invention: A single camera 3D object detection method and system for autonomous driving Granted publication date: 20240416 License type: Common License Record date: 20241012 Application publication date: 20240220 Assignee: Nanjing Gaoxi Information Technology Co.,Ltd. Assignor: NANJING University OF POSTS AND TELECOMMUNICATIONS Contract record no.: X2024980018297 Denomination of invention: A single camera 3D object detection method and system for autonomous driving Granted publication date: 20240416 License type: Common License Record date: 20241012 Application publication date: 20240220 Assignee: Nanjing Fuliang Network Technology Co.,Ltd. Assignor: NANJING University OF POSTS AND TELECOMMUNICATIONS Contract record no.: X2024980018296 Denomination of invention: A single camera 3D object detection method and system for autonomous driving Granted publication date: 20240416 License type: Common License Record date: 20241012 Application publication date: 20240220 Assignee: Nanjing Yixun Intelligent Equipment Co.,Ltd. Assignor: NANJING University OF POSTS AND TELECOMMUNICATIONS Contract record no.: X2024980018292 Denomination of invention: A single camera 3D object detection method and system for autonomous driving Granted publication date: 20240416 License type: Common License Record date: 20241012 Application publication date: 20240220 Assignee: Nanjing Yihe Information Technology Co.,Ltd. Assignor: NANJING University OF POSTS AND TELECOMMUNICATIONS Contract record no.: X2024980018291 Denomination of invention: A single camera 3D object detection method and system for autonomous driving Granted publication date: 20240416 License type: Common License Record date: 20241012 Application publication date: 20240220 Assignee: Nanjing Xingzhuo Intelligent Equipment Co.,Ltd. Assignor: NANJING University OF POSTS AND TELECOMMUNICATIONS Contract record no.: X2024980018289 Denomination of invention: A single camera 3D object detection method and system for autonomous driving Granted publication date: 20240416 License type: Common License Record date: 20241012 Application publication date: 20240220 Assignee: Nanjing Tichi Information Technology Co.,Ltd. Assignor: NANJING University OF POSTS AND TELECOMMUNICATIONS Contract record no.: X2024980018288 Denomination of invention: A single camera 3D object detection method and system for autonomous driving Granted publication date: 20240416 License type: Common License Record date: 20241012 Application publication date: 20240220 Assignee: Nanjing Jindong Technology Co.,Ltd. Assignor: NANJING University OF POSTS AND TELECOMMUNICATIONS Contract record no.: X2024980018286 Denomination of invention: A single camera 3D object detection method and system for autonomous driving Granted publication date: 20240416 License type: Common License Record date: 20241012 Application publication date: 20240220 Assignee: Nanjing Jinsheng Artificial Intelligence Technology Co.,Ltd. Assignor: NANJING University OF POSTS AND TELECOMMUNICATIONS Contract record no.: X2024980018283 Denomination of invention: A single camera 3D object detection method and system for autonomous driving Granted publication date: 20240416 License type: Common License Record date: 20241012 Application publication date: 20240220 Assignee: Nanjing Jingda Environmental Protection Technology Co.,Ltd. Assignor: NANJING University OF POSTS AND TELECOMMUNICATIONS Contract record no.: X2024980018281 Denomination of invention: A single camera 3D object detection method and system for autonomous driving Granted publication date: 20240416 License type: Common License Record date: 20241012 Application publication date: 20240220 Assignee: Nanjing Hancong Robot Technology Co.,Ltd. Assignor: NANJING University OF POSTS AND TELECOMMUNICATIONS Contract record no.: X2024980018278 Denomination of invention: A single camera 3D object detection method and system for autonomous driving Granted publication date: 20240416 License type: Common License Record date: 20241012 Application publication date: 20240220 Assignee: Jiangsu Huida Information Technology Industry Development Research Institute Co.,Ltd. Assignor: NANJING University OF POSTS AND TELECOMMUNICATIONS Contract record no.: X2024980018270 Denomination of invention: A single camera 3D object detection method and system for autonomous driving Granted publication date: 20240416 License type: Common License Record date: 20241012 Application publication date: 20240220 Assignee: Nanjing Extreme New Materials Research Co.,Ltd. Assignor: NANJING University OF POSTS AND TELECOMMUNICATIONS Contract record no.: X2024980018268 Denomination of invention: A single camera 3D object detection method and system for autonomous driving Granted publication date: 20240416 License type: Common License Record date: 20241012 Application publication date: 20240220 Assignee: Nanjing Youqi Intelligent Technology Co.,Ltd. Assignor: NANJING University OF POSTS AND TELECOMMUNICATIONS Contract record no.: X2024980018261 Denomination of invention: A single camera 3D object detection method and system for autonomous driving Granted publication date: 20240416 License type: Common License Record date: 20241012 Application publication date: 20240220 Assignee: Nanjing Haohang Intelligent Technology Co.,Ltd. Assignor: NANJING University OF POSTS AND TELECOMMUNICATIONS Contract record no.: X2024980018249 Denomination of invention: A single camera 3D object detection method and system for autonomous driving Granted publication date: 20240416 License type: Common License Record date: 20241012 Application publication date: 20240220 Assignee: Nanjing Pengjia Robot Technology Co.,Ltd. Assignor: NANJING University OF POSTS AND TELECOMMUNICATIONS Contract record no.: X2024980018246 Denomination of invention: A single camera 3D object detection method and system for autonomous driving Granted publication date: 20240416 License type: Common License Record date: 20241012 Application publication date: 20240220 Assignee: Nanjing Nuoyan Intelligent Technology Co.,Ltd. Assignor: NANJING University OF POSTS AND TELECOMMUNICATIONS Contract record no.: X2024980018241 Denomination of invention: A single camera 3D object detection method and system for autonomous driving Granted publication date: 20240416 License type: Common License Record date: 20241012 Application publication date: 20240220 Assignee: Nanjing Junshang Network Technology Co.,Ltd. Assignor: NANJING University OF POSTS AND TELECOMMUNICATIONS Contract record no.: X2024980018234 Denomination of invention: A single camera 3D object detection method and system for autonomous driving Granted publication date: 20240416 License type: Common License Record date: 20241012 |