CN109102543B - 基于图像分割的物体定位方法、设备和存储介质 - Google Patents
基于图像分割的物体定位方法、设备和存储介质 Download PDFInfo
- Publication number
- CN109102543B CN109102543B CN201810943480.3A CN201810943480A CN109102543B CN 109102543 B CN109102543 B CN 109102543B CN 201810943480 A CN201810943480 A CN 201810943480A CN 109102543 B CN109102543 B CN 109102543B
- Authority
- CN
- China
- Prior art keywords
- neural network
- target
- training
- image
- image segmentation
- Prior art date
- Legal status (The legal status is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the status listed.)
- Active
Links
- 238000000034 method Methods 0.000 title claims abstract description 50
- 238000003709 image segmentation Methods 0.000 title claims abstract description 37
- 238000012549 training Methods 0.000 claims abstract description 81
- 238000013528 artificial neural network Methods 0.000 claims abstract description 67
- 238000013527 convolutional neural network Methods 0.000 claims description 16
- 238000002372 labelling Methods 0.000 claims description 11
- 230000004807 localization Effects 0.000 claims description 10
- 238000011176 pooling Methods 0.000 claims description 7
- 238000004590 computer program Methods 0.000 claims description 6
- 238000005070 sampling Methods 0.000 claims description 6
- 239000000203 mixture Substances 0.000 claims description 2
- 238000003062 neural network model Methods 0.000 abstract description 4
- 230000011218 segmentation Effects 0.000 abstract description 2
- 230000000007 visual effect Effects 0.000 description 5
- 230000006870 function Effects 0.000 description 4
- 210000002569 neuron Anatomy 0.000 description 4
- 238000013145 classification model Methods 0.000 description 2
- 238000010586 diagram Methods 0.000 description 2
- 238000010801 machine learning Methods 0.000 description 2
- 238000012545 processing Methods 0.000 description 2
- 238000011161 development Methods 0.000 description 1
- 230000018109 developmental process Effects 0.000 description 1
- 230000000694 effects Effects 0.000 description 1
- 239000000284 extract Substances 0.000 description 1
- 238000012986 modification Methods 0.000 description 1
- 230000004048 modification Effects 0.000 description 1
- 230000003287 optical effect Effects 0.000 description 1
- 238000012216 screening Methods 0.000 description 1
Images
Classifications
-
- G—PHYSICS
- G06—COMPUTING; CALCULATING OR COUNTING
- G06T—IMAGE DATA PROCESSING OR GENERATION, IN GENERAL
- G06T7/00—Image analysis
- G06T7/70—Determining position or orientation of objects or cameras
-
- G—PHYSICS
- G06—COMPUTING; CALCULATING OR COUNTING
- G06F—ELECTRIC DIGITAL DATA PROCESSING
- G06F18/00—Pattern recognition
- G06F18/20—Analysing
- G06F18/21—Design or setup of recognition systems or techniques; Extraction of features in feature space; Blind source separation
- G06F18/214—Generating training patterns; Bootstrap methods, e.g. bagging or boosting
-
- G—PHYSICS
- G06—COMPUTING; CALCULATING OR COUNTING
- G06F—ELECTRIC DIGITAL DATA PROCESSING
- G06F18/00—Pattern recognition
- G06F18/20—Analysing
- G06F18/24—Classification techniques
-
- G—PHYSICS
- G06—COMPUTING; CALCULATING OR COUNTING
- G06F—ELECTRIC DIGITAL DATA PROCESSING
- G06F18/00—Pattern recognition
- G06F18/20—Analysing
- G06F18/24—Classification techniques
- G06F18/241—Classification techniques relating to the classification model, e.g. parametric or non-parametric approaches
-
- G—PHYSICS
- G06—COMPUTING; CALCULATING OR COUNTING
- G06N—COMPUTING ARRANGEMENTS BASED ON SPECIFIC COMPUTATIONAL MODELS
- G06N3/00—Computing arrangements based on biological models
- G06N3/02—Neural networks
- G06N3/04—Architecture, e.g. interconnection topology
-
- G—PHYSICS
- G06—COMPUTING; CALCULATING OR COUNTING
- G06N—COMPUTING ARRANGEMENTS BASED ON SPECIFIC COMPUTATIONAL MODELS
- G06N3/00—Computing arrangements based on biological models
- G06N3/02—Neural networks
- G06N3/04—Architecture, e.g. interconnection topology
- G06N3/045—Combinations of networks
-
- G—PHYSICS
- G06—COMPUTING; CALCULATING OR COUNTING
- G06N—COMPUTING ARRANGEMENTS BASED ON SPECIFIC COMPUTATIONAL MODELS
- G06N3/00—Computing arrangements based on biological models
- G06N3/02—Neural networks
- G06N3/08—Learning methods
-
- G—PHYSICS
- G06—COMPUTING; CALCULATING OR COUNTING
- G06T—IMAGE DATA PROCESSING OR GENERATION, IN GENERAL
- G06T7/00—Image analysis
- G06T7/10—Segmentation; Edge detection
- G06T7/11—Region-based segmentation
-
- G—PHYSICS
- G06—COMPUTING; CALCULATING OR COUNTING
- G06V—IMAGE OR VIDEO RECOGNITION OR UNDERSTANDING
- G06V10/00—Arrangements for image or video recognition or understanding
- G06V10/40—Extraction of image or video features
- G06V10/44—Local feature extraction by analysis of parts of the pattern, e.g. by detecting edges, contours, loops, corners, strokes or intersections; Connectivity analysis, e.g. of connected components
- G06V10/443—Local feature extraction by analysis of parts of the pattern, e.g. by detecting edges, contours, loops, corners, strokes or intersections; Connectivity analysis, e.g. of connected components by matching or filtering
- G06V10/449—Biologically inspired filters, e.g. difference of Gaussians [DoG] or Gabor filters
- G06V10/451—Biologically inspired filters, e.g. difference of Gaussians [DoG] or Gabor filters with interaction between the filter responses, e.g. cortical complex cells
- G06V10/454—Integrating the filters into a hierarchical structure, e.g. convolutional neural networks [CNN]
-
- G—PHYSICS
- G06—COMPUTING; CALCULATING OR COUNTING
- G06V—IMAGE OR VIDEO RECOGNITION OR UNDERSTANDING
- G06V10/00—Arrangements for image or video recognition or understanding
- G06V10/70—Arrangements for image or video recognition or understanding using pattern recognition or machine learning
- G06V10/764—Arrangements for image or video recognition or understanding using pattern recognition or machine learning using classification, e.g. of video objects
-
- G—PHYSICS
- G06—COMPUTING; CALCULATING OR COUNTING
- G06V—IMAGE OR VIDEO RECOGNITION OR UNDERSTANDING
- G06V10/00—Arrangements for image or video recognition or understanding
- G06V10/70—Arrangements for image or video recognition or understanding using pattern recognition or machine learning
- G06V10/82—Arrangements for image or video recognition or understanding using pattern recognition or machine learning using neural networks
-
- G—PHYSICS
- G06—COMPUTING; CALCULATING OR COUNTING
- G06T—IMAGE DATA PROCESSING OR GENERATION, IN GENERAL
- G06T2207/00—Indexing scheme for image analysis or image enhancement
- G06T2207/20—Special algorithmic details
- G06T2207/20081—Training; Learning
-
- G—PHYSICS
- G06—COMPUTING; CALCULATING OR COUNTING
- G06T—IMAGE DATA PROCESSING OR GENERATION, IN GENERAL
- G06T2207/00—Indexing scheme for image analysis or image enhancement
- G06T2207/20—Special algorithmic details
- G06T2207/20084—Artificial neural networks [ANN]
-
- G—PHYSICS
- G06—COMPUTING; CALCULATING OR COUNTING
- G06T—IMAGE DATA PROCESSING OR GENERATION, IN GENERAL
- G06T2207/00—Indexing scheme for image analysis or image enhancement
- G06T2207/30—Subject of image; Context of image processing
- G06T2207/30108—Industrial image inspection
- G06T2207/30164—Workpiece; Machine component
Landscapes
- Engineering & Computer Science (AREA)
- Theoretical Computer Science (AREA)
- Physics & Mathematics (AREA)
- General Physics & Mathematics (AREA)
- Computer Vision & Pattern Recognition (AREA)
- Evolutionary Computation (AREA)
- Artificial Intelligence (AREA)
- Data Mining & Analysis (AREA)
- Life Sciences & Earth Sciences (AREA)
- General Health & Medical Sciences (AREA)
- Health & Medical Sciences (AREA)
- Software Systems (AREA)
- General Engineering & Computer Science (AREA)
- Computing Systems (AREA)
- Multimedia (AREA)
- Molecular Biology (AREA)
- Biomedical Technology (AREA)
- Databases & Information Systems (AREA)
- Bioinformatics & Cheminformatics (AREA)
- Bioinformatics & Computational Biology (AREA)
- Evolutionary Biology (AREA)
- Medical Informatics (AREA)
- Biophysics (AREA)
- Computational Linguistics (AREA)
- Mathematical Physics (AREA)
- Biodiversity & Conservation Biology (AREA)
- Image Analysis (AREA)
- Image Processing (AREA)
Abstract
Description
Claims (9)
Priority Applications (2)
Application Number | Priority Date | Filing Date | Title |
---|---|---|---|
CN201810943480.3A CN109102543B (zh) | 2018-08-17 | 2018-08-17 | 基于图像分割的物体定位方法、设备和存储介质 |
US16/437,287 US11144787B2 (en) | 2018-08-17 | 2019-06-11 | Object location method, device and storage medium based on image segmentation |
Applications Claiming Priority (1)
Application Number | Priority Date | Filing Date | Title |
---|---|---|---|
CN201810943480.3A CN109102543B (zh) | 2018-08-17 | 2018-08-17 | 基于图像分割的物体定位方法、设备和存储介质 |
Publications (2)
Publication Number | Publication Date |
---|---|
CN109102543A CN109102543A (zh) | 2018-12-28 |
CN109102543B true CN109102543B (zh) | 2021-04-02 |
Family
ID=64850247
Family Applications (1)
Application Number | Title | Priority Date | Filing Date |
---|---|---|---|
CN201810943480.3A Active CN109102543B (zh) | 2018-08-17 | 2018-08-17 | 基于图像分割的物体定位方法、设备和存储介质 |
Country Status (2)
Country | Link |
---|---|
US (1) | US11144787B2 (zh) |
CN (1) | CN109102543B (zh) |
Families Citing this family (19)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
CN109670532B (zh) * | 2018-11-23 | 2022-12-09 | 腾讯医疗健康(深圳)有限公司 | 生物体器官组织图像的异常识别方法、装置及系统 |
CN111695585A (zh) * | 2019-03-14 | 2020-09-22 | 顶级手套国际有限公司 | 手套的取放方法及其系统 |
CN109969178B (zh) * | 2019-03-26 | 2021-09-21 | 齐鲁工业大学 | 基于多元传感器多物料自主搬运装置及方法 |
CN110210487A (zh) * | 2019-05-30 | 2019-09-06 | 上海商汤智能科技有限公司 | 一种图像分割方法及装置、电子设备和存储介质 |
CN114424250A (zh) * | 2019-07-19 | 2022-04-29 | 法弗人工智能有限公司 | 结构建模 |
US11003928B2 (en) | 2019-08-08 | 2021-05-11 | Argo AI, LLC | Using captured video data to identify active turn signals on a vehicle |
CN111402278B (zh) * | 2020-02-21 | 2023-10-27 | 华为云计算技术有限公司 | 分割模型训练方法、图像标注方法及相关装置 |
CN111476840B (zh) * | 2020-05-14 | 2023-08-22 | 阿丘机器人科技(苏州)有限公司 | 目标定位方法、装置、设备及计算机可读存储介质 |
CN111709293B (zh) * | 2020-05-18 | 2023-10-03 | 杭州电子科技大学 | 一种基于ResUNet神经网络的化学结构式分割方法 |
CN112037177B (zh) * | 2020-08-07 | 2024-08-16 | 浙江大华技术股份有限公司 | 一种车厢装载率的评估方法和装置以及存储介质 |
CN112802107A (zh) * | 2021-02-05 | 2021-05-14 | 梅卡曼德(北京)机器人科技有限公司 | 基于机器人的夹具组的控制方法及装置 |
CN112547528B (zh) * | 2021-03-01 | 2021-05-25 | 华鹏飞股份有限公司 | 基于分类识别的物流分拣方法及系统 |
CN112975985B (zh) * | 2021-03-22 | 2022-09-27 | 梅卡曼德(北京)机器人科技有限公司 | 抓取机器人及其控制方法和定位模型训练方法 |
CN113011567B (zh) * | 2021-03-31 | 2023-01-31 | 深圳精智达技术股份有限公司 | 一种卷积神经网络模型的训练方法及装置 |
CN113505776A (zh) * | 2021-07-16 | 2021-10-15 | 青岛新奥清洁能源有限公司 | 一种用于燃气表读数的智能识别方法及装置 |
US20230154166A1 (en) * | 2021-11-12 | 2023-05-18 | Microsoft Technology Licensing, Llc | Adaptive artificial intelligence for three-dimensional object detection using synthetic training data |
CN116197885B (zh) * | 2021-11-28 | 2023-11-24 | 梅卡曼德(北京)机器人科技有限公司 | 基于压叠检测的图像数据过滤方法、装置、设备和介质 |
CN114463579A (zh) * | 2022-01-13 | 2022-05-10 | 中铁第四勘察设计院集团有限公司 | 点云分类方法、装置、电子设备及存储介质 |
CN115116026B (zh) * | 2022-05-26 | 2024-04-09 | 江苏大学 | 一种物流搬运机器人自动循迹方法及系统 |
Citations (3)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
US7466848B2 (en) * | 2002-12-13 | 2008-12-16 | Rutgers, The State University Of New Jersey | Method and apparatus for automatically detecting breast lesions and tumors in images |
CN104732493A (zh) * | 2015-03-18 | 2015-06-24 | 西安电子科技大学 | 一种基于Primal Sketch分类和SVD域改进MMSE估计的SAR图像去噪算法 |
CN105184803A (zh) * | 2015-09-30 | 2015-12-23 | 西安电子科技大学 | 一种姿态测量方法和装置 |
Family Cites Families (8)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
US7903857B2 (en) * | 2006-04-17 | 2011-03-08 | Siemens Medical Solutions Usa, Inc. | Robust click-point linking with geometric configuration context: interactive localized registration approach |
CN104680508B (zh) * | 2013-11-29 | 2018-07-03 | 华为技术有限公司 | 卷积神经网络和基于卷积神经网络的目标物体检测方法 |
US20160225053A1 (en) * | 2015-01-29 | 2016-08-04 | Clear Research Corporation | Mobile visual commerce system |
US10331974B2 (en) * | 2016-11-08 | 2019-06-25 | Nec Corporation | Action recognition system with landmark localization on objects in images using convolutional neural networks |
CN108229514A (zh) * | 2016-12-29 | 2018-06-29 | 北京市商汤科技开发有限公司 | 物体检测方法、装置和电子设备 |
CN106874914B (zh) * | 2017-01-12 | 2019-05-14 | 华南理工大学 | 一种基于深度卷积神经网络的工业机械臂视觉控制方法 |
WO2018187632A1 (en) * | 2017-04-05 | 2018-10-11 | Carnegie Mellon University | Deep learning methods for estimating density and/or flow of objects, and related methods and software |
CN107959883B (zh) * | 2017-11-30 | 2020-06-09 | 广州市百果园信息技术有限公司 | 视频编辑推送方法、系统及智能移动终端 |
-
2018
- 2018-08-17 CN CN201810943480.3A patent/CN109102543B/zh active Active
-
2019
- 2019-06-11 US US16/437,287 patent/US11144787B2/en active Active
Patent Citations (3)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
US7466848B2 (en) * | 2002-12-13 | 2008-12-16 | Rutgers, The State University Of New Jersey | Method and apparatus for automatically detecting breast lesions and tumors in images |
CN104732493A (zh) * | 2015-03-18 | 2015-06-24 | 西安电子科技大学 | 一种基于Primal Sketch分类和SVD域改进MMSE估计的SAR图像去噪算法 |
CN105184803A (zh) * | 2015-09-30 | 2015-12-23 | 西安电子科技大学 | 一种姿态测量方法和装置 |
Also Published As
Publication number | Publication date |
---|---|
US11144787B2 (en) | 2021-10-12 |
US20200057917A1 (en) | 2020-02-20 |
CN109102543A (zh) | 2018-12-28 |
Similar Documents
Publication | Publication Date | Title |
---|---|---|
CN109102543B (zh) | 基于图像分割的物体定位方法、设备和存储介质 | |
CN111784685B (zh) | 一种基于云边协同检测的输电线路缺陷图像识别方法 | |
CN107944396B (zh) | 一种基于改进深度学习的刀闸状态识别方法 | |
CN107609485B (zh) | 交通标志的识别方法、存储介质、处理设备 | |
CN111179249A (zh) | 一种基于深度卷积神经网络的电力设备检测方法和装置 | |
CN108305260B (zh) | 一种图像中角点的检测方法、装置及设备 | |
CN108648169A (zh) | 高压输电塔绝缘子缺陷自动识别的方法及装置 | |
CN107464245B (zh) | 一种图像结构边缘的定位方法及装置 | |
US12017368B2 (en) | Mix-size depalletizing | |
CN107730553B (zh) | 一种基于伪真值搜寻法的弱监督物体检测方法 | |
CN113128610A (zh) | 一种工业零件位姿估计方法及系统 | |
CN109389105B (zh) | 一种基于多任务的虹膜检测和视角分类方法 | |
CN106887006B (zh) | 堆叠物体的识别方法、设备和机器分拣系统 | |
CN113435407B (zh) | 一种输电系统的小目标识别方法及装置 | |
CN110929795A (zh) | 高速焊线机焊点快速识别与定位方法 | |
CN111738036A (zh) | 图像处理方法、装置、设备及存储介质 | |
CN115272204A (zh) | 一种基于机器视觉的轴承表面划痕检测方法 | |
CN113516146A (zh) | 一种数据分类方法、计算机及可读存储介质 | |
US20210216767A1 (en) | Method and computing system for object recognition or object registration based on image classification | |
CN111369526A (zh) | 基于半监督深度学习的多类型旧桥裂痕识别方法 | |
CN110837809A (zh) | 血液自动分析方法、系统、血细胞分析仪及存储介质 | |
CN111144425B (zh) | 检测拍屏图片的方法、装置、电子设备及存储介质 | |
CN109146885B (zh) | 图像分割方法、设备和计算机可读存储介质 | |
CN114842188A (zh) | 一种基于深度学习算法的茶叶嫩芽采摘点定位方法 | |
CN111126402B (zh) | 一种图像处理方法、装置、电子设备及存储介质 |
Legal Events
Date | Code | Title | Description |
---|---|---|---|
PB01 | Publication | ||
PB01 | Publication | ||
SE01 | Entry into force of request for substantive examination | ||
SE01 | Entry into force of request for substantive examination | ||
CB02 | Change of applicant information |
Address after: B701-702, industrialization building, Shenzhen Virtual University Park, No.2, Yuexing Third Road, Nanshan District, Shenzhen, Guangdong Province Applicant after: Shenzhen Lan pangzi machine intelligence Co.,Ltd. Address before: B701-702, industrialization building, Shenzhen Virtual University Park, No.2, Yuexing Third Road, Nanshan District, Shenzhen, Guangdong Province Applicant before: SHENZHEN DORABOT Inc. |
|
CB02 | Change of applicant information | ||
GR01 | Patent grant | ||
GR01 | Patent grant | ||
PP01 | Preservation of patent right |
Effective date of registration: 20240722 Granted publication date: 20210402 |
|
PP01 | Preservation of patent right |