JP5357331B2 - 無作為多項ロジットを用いる意味場面区画 - Google Patents

無作為多項ロジットを用いる意味場面区画 Download PDF

Info

Publication number
JP5357331B2
JP5357331B2 JP2012514018A JP2012514018A JP5357331B2 JP 5357331 B2 JP5357331 B2 JP 5357331B2 JP 2012514018 A JP2012514018 A JP 2012514018A JP 2012514018 A JP2012514018 A JP 2012514018A JP 5357331 B2 JP5357331 B2 JP 5357331B2
Authority
JP
Japan
Prior art keywords
texton
rml
classifier
images
image
Prior art date
Legal status (The legal status is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the status listed.)
Expired - Fee Related
Application number
JP2012514018A
Other languages
English (en)
Japanese (ja)
Other versions
JP2012529110A (ja
JP2012529110A5 (enExample
Inventor
ランガナサン,アナンス
Current Assignee (The listed assignees may be inaccurate. Google has not performed a legal analysis and makes no representation or warranty as to the accuracy of the list.)
Honda Motor Co Ltd
Original Assignee
Honda Motor Co Ltd
Priority date (The priority date is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the date listed.)
Filing date
Publication date
Application filed by Honda Motor Co Ltd filed Critical Honda Motor Co Ltd
Publication of JP2012529110A publication Critical patent/JP2012529110A/ja
Publication of JP2012529110A5 publication Critical patent/JP2012529110A5/ja
Application granted granted Critical
Publication of JP5357331B2 publication Critical patent/JP5357331B2/ja
Expired - Fee Related legal-status Critical Current
Anticipated expiration legal-status Critical

Links

Classifications

    • GPHYSICS
    • G06COMPUTING OR CALCULATING; COUNTING
    • G06VIMAGE OR VIDEO RECOGNITION OR UNDERSTANDING
    • G06V10/00Arrangements for image or video recognition or understanding
    • G06V10/40Extraction of image or video features
    • G06V10/46Descriptors for shape, contour or point-related descriptors, e.g. scale invariant feature transform [SIFT] or bags of words [BoW]; Salient regional features
    • G06V10/462Salient features, e.g. scale invariant feature transforms [SIFT]
    • G06V10/464Salient features, e.g. scale invariant feature transforms [SIFT] using a plurality of salient features, e.g. bag-of-words [BoW] representations
    • GPHYSICS
    • G06COMPUTING OR CALCULATING; COUNTING
    • G06FELECTRIC DIGITAL DATA PROCESSING
    • G06F18/00Pattern recognition
    • G06F18/20Analysing
    • G06F18/21Design or setup of recognition systems or techniques; Extraction of features in feature space; Blind source separation
    • G06F18/211Selection of the most significant subset of features
    • G06F18/2115Selection of the most significant subset of features by evaluating different subsets according to an optimisation criterion, e.g. class separability, forward selection or backward elimination
    • GPHYSICS
    • G06COMPUTING OR CALCULATING; COUNTING
    • G06FELECTRIC DIGITAL DATA PROCESSING
    • G06F18/00Pattern recognition
    • G06F18/20Analysing
    • G06F18/24Classification techniques
    • G06F18/241Classification techniques relating to the classification model, e.g. parametric or non-parametric approaches
    • G06F18/2415Classification techniques relating to the classification model, e.g. parametric or non-parametric approaches based on parametric or probabilistic models, e.g. based on likelihood ratio or false acceptance rate versus a false rejection rate
    • GPHYSICS
    • G06COMPUTING OR CALCULATING; COUNTING
    • G06VIMAGE OR VIDEO RECOGNITION OR UNDERSTANDING
    • G06V10/00Arrangements for image or video recognition or understanding
    • G06V10/70Arrangements for image or video recognition or understanding using pattern recognition or machine learning
    • G06V10/764Arrangements for image or video recognition or understanding using pattern recognition or machine learning using classification, e.g. of video objects
    • GPHYSICS
    • G06COMPUTING OR CALCULATING; COUNTING
    • G06VIMAGE OR VIDEO RECOGNITION OR UNDERSTANDING
    • G06V10/00Arrangements for image or video recognition or understanding
    • G06V10/70Arrangements for image or video recognition or understanding using pattern recognition or machine learning
    • G06V10/77Processing image or video features in feature spaces; using data integration or data reduction, e.g. principal component analysis [PCA] or independent component analysis [ICA] or self-organising maps [SOM]; Blind source separation
    • G06V10/771Feature selection, e.g. selecting representative features from a multi-dimensional feature space

Landscapes

  • Engineering & Computer Science (AREA)
  • Theoretical Computer Science (AREA)
  • Computer Vision & Pattern Recognition (AREA)
  • Physics & Mathematics (AREA)
  • General Physics & Mathematics (AREA)
  • Artificial Intelligence (AREA)
  • Evolutionary Computation (AREA)
  • Data Mining & Analysis (AREA)
  • Multimedia (AREA)
  • Health & Medical Sciences (AREA)
  • Software Systems (AREA)
  • General Health & Medical Sciences (AREA)
  • Databases & Information Systems (AREA)
  • Computing Systems (AREA)
  • Medical Informatics (AREA)
  • Bioinformatics & Cheminformatics (AREA)
  • Life Sciences & Earth Sciences (AREA)
  • Bioinformatics & Computational Biology (AREA)
  • Evolutionary Biology (AREA)
  • General Engineering & Computer Science (AREA)
  • Probability & Statistics with Applications (AREA)
  • Image Analysis (AREA)
  • Image Processing (AREA)
JP2012514018A 2009-06-04 2010-05-28 無作為多項ロジットを用いる意味場面区画 Expired - Fee Related JP5357331B2 (ja)

Applications Claiming Priority (5)

Application Number Priority Date Filing Date Title
US21793009P 2009-06-04 2009-06-04
US61/217,930 2009-06-04
US12/789,292 US8442309B2 (en) 2009-06-04 2010-05-27 Semantic scene segmentation using random multinomial logit (RML)
US12/789,292 2010-05-27
PCT/US2010/036656 WO2010141369A1 (en) 2009-06-04 2010-05-28 Semantic scene segmentation using random multinomial logit (rml)

Publications (3)

Publication Number Publication Date
JP2012529110A JP2012529110A (ja) 2012-11-15
JP2012529110A5 JP2012529110A5 (enExample) 2013-07-18
JP5357331B2 true JP5357331B2 (ja) 2013-12-04

Family

ID=43298064

Family Applications (1)

Application Number Title Priority Date Filing Date
JP2012514018A Expired - Fee Related JP5357331B2 (ja) 2009-06-04 2010-05-28 無作為多項ロジットを用いる意味場面区画

Country Status (4)

Country Link
US (1) US8442309B2 (enExample)
JP (1) JP5357331B2 (enExample)
DE (1) DE112010002232B4 (enExample)
WO (1) WO2010141369A1 (enExample)

Families Citing this family (23)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
US8891869B2 (en) * 2011-03-31 2014-11-18 Sony Corporation System and method for effectively performing an integrated segmentation procedure
WO2012166840A2 (en) * 2011-06-01 2012-12-06 The Board Of Trustees Of The Leland Stanford Junior University Learning of image processing pipeline for digital imaging devices
CN102663418B (zh) * 2012-03-21 2014-04-23 清华大学 一种基于回归模型的图像集合建模与匹配方法
FR2996939B1 (fr) * 2012-10-12 2014-12-19 Commissariat Energie Atomique Procede de classification d'un objet multimodal
CN103268635B (zh) * 2013-05-15 2016-08-10 北京交通大学 一种几何网格场景模型的分割及语义标注方法
US9488483B2 (en) * 2013-05-17 2016-11-08 Honda Motor Co., Ltd. Localization using road markings
EP3120300A4 (en) * 2014-03-19 2017-11-22 Neurala Inc. Methods and apparatus for autonomous robotic control
CN105389583A (zh) * 2014-09-05 2016-03-09 华为技术有限公司 图像分类器的生成方法、图像分类方法和装置
CN106327469B (zh) * 2015-06-29 2019-06-18 北京航空航天大学 一种语义标签引导的视频对象分割方法
US20170200041A1 (en) * 2016-01-13 2017-07-13 Regents Of The University Of Minnesota Multi-modal data and class confusion: application in water monitoring
CN106021376B (zh) * 2016-05-11 2019-05-10 上海点融信息科技有限责任公司 用于处理用户信息的方法和设备
US10963741B2 (en) * 2016-06-07 2021-03-30 Toyota Motor Europe Control device, system and method for determining the perceptual load of a visual and dynamic driving scene
US11205103B2 (en) 2016-12-09 2021-12-21 The Research Foundation for the State University Semisupervised autoencoder for sentiment analysis
US10635927B2 (en) 2017-03-06 2020-04-28 Honda Motor Co., Ltd. Systems for performing semantic segmentation and methods thereof
CN106971150B (zh) * 2017-03-15 2020-09-08 国网山东省电力公司威海供电公司 基于逻辑回归的排队异常检测方法及装置
US11798297B2 (en) * 2017-03-21 2023-10-24 Toyota Motor Europe Nv/Sa Control device, system and method for determining the perceptual load of a visual and dynamic driving scene
CN110120085B (zh) * 2018-02-07 2023-03-31 深圳市腾讯计算机系统有限公司 一种动态纹理视频生成方法、装置、服务器及存储介质
KR102718664B1 (ko) 2018-05-25 2024-10-18 삼성전자주식회사 영상 처리를 위한 네트워크 조정 방법 및 장치
US12106225B2 (en) 2019-05-30 2024-10-01 The Research Foundation For The State University Of New York System, method, and computer-accessible medium for generating multi-class models from single-class datasets
JP7242882B2 (ja) * 2019-09-27 2023-03-20 富士フイルム株式会社 情報処理装置、情報処理装置の作動方法、情報処理装置の作動プログラム
WO2023003662A1 (en) * 2021-07-21 2023-01-26 Canoo Technologies Inc. Augmented pseudo-labeling for object detection learning with unlabeled images
CN114373027A (zh) * 2021-12-17 2022-04-19 杭州电子科技大学上虞科学与工程研究院有限公司 基于灰度共生矩阵的瓷砖图像数据集生成方法
CN114821210B (zh) * 2022-03-17 2025-06-03 西北工业大学 一种基于多分类逻辑回归的特征选择方法

Family Cites Families (15)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
US4945478A (en) 1987-11-06 1990-07-31 Center For Innovative Technology Noninvasive medical imaging system and method for the identification and 3-D display of atherosclerosis and the like
DE19623033C1 (de) 1996-06-08 1997-10-16 Aeg Electrocom Gmbh Verfahren und Anordnung zur Mustererkennung auf statistischer Basis
US6711278B1 (en) 1998-09-10 2004-03-23 Microsoft Corporation Tracking semantic objects in vector image sequences
US7274810B2 (en) 2000-04-11 2007-09-25 Cornell Research Foundation, Inc. System and method for three-dimensional image rendering and analysis
FR2832832A1 (fr) * 2001-11-23 2003-05-30 Ge Med Sys Global Tech Co Llc Procede de detection et de caracterisation automatique de nodules dans une image tomographique et systeme d'imagerie medicale par tomodensimetrie correspondant
US7313268B2 (en) * 2002-10-31 2007-12-25 Eastman Kodak Company Method for using effective spatio-temporal image recomposition to improve scene classification
EP1609100A2 (en) 2003-03-19 2005-12-28 Customiser Ltd. Recognition of patterns in data
US7110000B2 (en) * 2003-10-31 2006-09-19 Microsoft Corporation Synthesis of progressively-variant textures and application to arbitrary surfaces
US20050221266A1 (en) * 2004-04-02 2005-10-06 Mislevy Robert J System and method for assessment design
JP4260060B2 (ja) * 2004-05-12 2009-04-30 ジーイー・メディカル・システムズ・グローバル・テクノロジー・カンパニー・エルエルシー X線ct装置および画像再構成装置
WO2007139070A1 (ja) * 2006-05-29 2007-12-06 Panasonic Corporation 光源推定装置、光源推定システムおよび光源推定方法、並びに、画像高解像度化装置および画像高解像度化方法
US20080027917A1 (en) 2006-07-31 2008-01-31 Siemens Corporate Research, Inc. Scalable Semantic Image Search
US7840059B2 (en) * 2006-09-21 2010-11-23 Microsoft Corporation Object recognition using textons and shape filters
US20090083790A1 (en) 2007-09-26 2009-03-26 Tao Wang Video scene segmentation and categorization
US8213725B2 (en) * 2009-03-20 2012-07-03 Eastman Kodak Company Semantic event detection using cross-domain knowledge

Also Published As

Publication number Publication date
DE112010002232T5 (de) 2012-07-05
WO2010141369A1 (en) 2010-12-09
JP2012529110A (ja) 2012-11-15
DE112010002232B4 (de) 2021-12-23
US8442309B2 (en) 2013-05-14
US20100310159A1 (en) 2010-12-09

Similar Documents

Publication Publication Date Title
JP5357331B2 (ja) 無作為多項ロジットを用いる意味場面区画
CN111310574B (zh) 一种车载视觉实时多目标多任务联合感知方法和装置
CN113762209B (zh) 一种基于yolo的多尺度并行特征融合路标检测方法
CN113468967B (zh) 基于注意力机制的车道线检测方法、装置、设备及介质
US12205041B2 (en) Training a generative adversarial network for performing semantic segmentation of images
CN111598030B (zh) 一种航拍图像中车辆检测和分割的方法及系统
US10699151B2 (en) System and method for performing saliency detection using deep active contours
EP3608844B1 (en) Methods for training a crnn and for semantic segmentation of an inputted video using said crnn
CN113168510B (zh) 通过细化形状先验分割对象
US20120263346A1 (en) Video-based detection of multiple object types under varying poses
Vaiyapuri et al. Automatic Vehicle License Plate Recognition Using Optimal Deep Learning Model.
CN112651274B (zh) 路上障碍物检测装置、路上障碍物检测方法及记录介质
CN112613387A (zh) 一种基于YOLOv3的交通标志检测方法
CN111476226B (zh) 一种文本定位方法、装置及模型训练方法
CN115082676B (zh) 一种伪标签模型的训练方法、装置、设备及存储介质
Wang et al. A feature-supervised generative adversarial network for environmental monitoring during hazy days
CN113807354B (zh) 图像语义分割方法、装置、设备和存储介质
CN109685830A (zh) 目标跟踪方法、装置和设备及计算机存储介质
CN114445462B (zh) 基于自适应卷积的跨模态视觉跟踪方法及装置
CN114463772B (zh) 基于深度学习的交通标志检测与识别方法及系统
CN115641317A (zh) 面向病理图像的动态知识回溯多示例学习及图像分类方法
CN115439499A (zh) 一种基于生成对抗网络的雨天图像去雨方法及装置
CN110969104A (zh) 基于二值化网络检测可行驶区域的方法、系统及存储介质
JP2023021924A (ja) 画像分類方法及び装置、並びに画像分類器の訓練を向上させる方法及び装置
CN120278916A (zh) 基于改进msr-yolo的交通标志去雾检测方法

Legal Events

Date Code Title Description
A521 Request for written amendment filed

Free format text: JAPANESE INTERMEDIATE CODE: A523

Effective date: 20130528

A621 Written request for application examination

Free format text: JAPANESE INTERMEDIATE CODE: A621

Effective date: 20130528

A871 Explanation of circumstances concerning accelerated examination

Free format text: JAPANESE INTERMEDIATE CODE: A871

Effective date: 20130528

A975 Report on accelerated examination

Free format text: JAPANESE INTERMEDIATE CODE: A971005

Effective date: 20130618

TRDD Decision of grant or rejection written
A01 Written decision to grant a patent or to grant a registration (utility model)

Free format text: JAPANESE INTERMEDIATE CODE: A01

Effective date: 20130730

A61 First payment of annual fees (during grant procedure)

Free format text: JAPANESE INTERMEDIATE CODE: A61

Effective date: 20130829

R150 Certificate of patent or registration of utility model

Ref document number: 5357331

Country of ref document: JP

Free format text: JAPANESE INTERMEDIATE CODE: R150

Free format text: JAPANESE INTERMEDIATE CODE: R150

R250 Receipt of annual fees

Free format text: JAPANESE INTERMEDIATE CODE: R250

R250 Receipt of annual fees

Free format text: JAPANESE INTERMEDIATE CODE: R250

R250 Receipt of annual fees

Free format text: JAPANESE INTERMEDIATE CODE: R250

R250 Receipt of annual fees

Free format text: JAPANESE INTERMEDIATE CODE: R250

R250 Receipt of annual fees

Free format text: JAPANESE INTERMEDIATE CODE: R250

R250 Receipt of annual fees

Free format text: JAPANESE INTERMEDIATE CODE: R250

R250 Receipt of annual fees

Free format text: JAPANESE INTERMEDIATE CODE: R250

LAPS Cancellation because of no payment of annual fees