WO2005111918A1 - Method for combining boosted classifiers for efficient multi-class object detection - Google Patents

Method for combining boosted classifiers for efficient multi-class object detection Download PDF

Info

Publication number
WO2005111918A1
WO2005111918A1 PCT/US2005/015854 US2005015854W WO2005111918A1 WO 2005111918 A1 WO2005111918 A1 WO 2005111918A1 US 2005015854 W US2005015854 W US 2005015854W WO 2005111918 A1 WO2005111918 A1 WO 2005111918A1
Authority
WO
WIPO (PCT)
Prior art keywords
class
weak classifiers
ensemble
objects
detection
Prior art date
Legal status (The legal status is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the status listed.)
Ceased
Application number
PCT/US2005/015854
Other languages
English (en)
French (fr)
Inventor
Claus Bahlmann
Ying Zhu
Dorin Comaniciu
Thorsten KÖHLER
Martin Pellkofer
Current Assignee (The listed assignees may be inaccurate. Google has not performed a legal analysis and makes no representation or warranty as to the accuracy of the list.)
Siemens AG
Siemens Corp
Siemens Corporate Research Inc
Original Assignee
Siemens AG
Siemens Corp
Siemens Corporate Research Inc
Priority date (The priority date is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the date listed.)
Filing date
Publication date
Application filed by Siemens AG, Siemens Corp, Siemens Corporate Research Inc filed Critical Siemens AG
Priority to JP2007513229A priority Critical patent/JP4999101B2/ja
Priority to EP05745708.7A priority patent/EP1745414B1/en
Publication of WO2005111918A1 publication Critical patent/WO2005111918A1/en
Anticipated expiration legal-status Critical
Ceased legal-status Critical Current

Links

Classifications

    • GPHYSICS
    • G06COMPUTING OR CALCULATING; COUNTING
    • G06VIMAGE OR VIDEO RECOGNITION OR UNDERSTANDING
    • G06V10/00Arrangements for image or video recognition or understanding
    • G06V10/70Arrangements for image or video recognition or understanding using pattern recognition or machine learning
    • G06V10/77Processing image or video features in feature spaces; using data integration or data reduction, e.g. principal component analysis [PCA] or independent component analysis [ICA] or self-organising maps [SOM]; Blind source separation
    • G06V10/771Feature selection, e.g. selecting representative features from a multi-dimensional feature space
    • GPHYSICS
    • G06COMPUTING OR CALCULATING; COUNTING
    • G06FELECTRIC DIGITAL DATA PROCESSING
    • G06F18/00Pattern recognition
    • G06F18/20Analysing
    • G06F18/21Design or setup of recognition systems or techniques; Extraction of features in feature space; Blind source separation
    • G06F18/211Selection of the most significant subset of features
    • GPHYSICS
    • G06COMPUTING OR CALCULATING; COUNTING
    • G06FELECTRIC DIGITAL DATA PROCESSING
    • G06F18/00Pattern recognition
    • G06F18/20Analysing
    • G06F18/21Design or setup of recognition systems or techniques; Extraction of features in feature space; Blind source separation
    • G06F18/214Generating training patterns; Bootstrap methods, e.g. bagging or boosting
    • GPHYSICS
    • G06COMPUTING OR CALCULATING; COUNTING
    • G06VIMAGE OR VIDEO RECOGNITION OR UNDERSTANDING
    • G06V10/00Arrangements for image or video recognition or understanding
    • G06V10/70Arrangements for image or video recognition or understanding using pattern recognition or machine learning
    • G06V10/77Processing image or video features in feature spaces; using data integration or data reduction, e.g. principal component analysis [PCA] or independent component analysis [ICA] or self-organising maps [SOM]; Blind source separation
    • G06V10/774Generating sets of training patterns; Bootstrap methods, e.g. bagging or boosting
    • G06V10/7747Organisation of the process, e.g. bagging or boosting

Definitions

  • the present invention is directed to a method for combining boosted classifiers for efficient multi-class object detection, and more particularly, to a system and method for training a detection system that identifies multi-class objects using boosted classifiers.
  • Boosting techniques are particularly effective for detecting a single object class.
  • Many detection applications require multiple object class detection in order to be effective.
  • An example of such an application is vehicle detection where separate object classes may be defined for vehicles, trucks, pedestrians and traffic signs.
  • Another example of a detection application that requires multiple object classes is people detection. Particularly, if the people are in motion, it is more effective to define people sub-classes based on the difference poses or actions of the people. For example, such subclasses could include sitting, standing and walking.
  • the object detection is solved by evaluating the strong classifier h on candidate image patches x e X .
  • is a threshold allowing the user to balance false alarm and miss detection rate.
  • An optimal selection of the weak classifiers h t and a proper weighting ⁇ t is obtained from an AdaBoost training algorithm.
  • AdaBoost to train one individual ensemble of weak classifiers H (/) and weights ⁇ (/ for each class /, that is,
  • a more efficient multi-class detection method and system is devised by a joint design of key components including feature and classifier design of individual detectors.
  • a method for training a system for detecting multi-class objects in an image or a video sequence is described.
  • a common ensemble of weak classifiers for a set of object classes is identified.
  • a separate weighting scheme is adapted for the ensemble of weak classifiers.
  • the present invention is also directed to a method for detecting objects of multiple classes in an image or a video sequence.
  • Each class is assigned a detector that is implemented by a weighted combination of weak classifiers such that all of the detectors are based on a common ensemble of weak classifiers. Then weights are individually set for each class.
  • FIG. 1 is a system block diagram of a system for detecting and tracking multi-class objects in accordance with the present invention.
  • FIG. 2 illustrates a multi-class object detection framework in accordance with the present invention.
  • the present invention is directed to a system and method for combining boosted classifiers for efficient multi-class object detection.
  • the present invention is also directed to a method for training classifiers for multi-class object detection.
  • the present invention can be used in a multitude of applications that require the detection of different objects.
  • Such a system may, for example, be used for surveillance applications, such as for detecting and tracking a person or facial features.
  • the present invention could also be used to detect and track objects on an assembly line. Other applications could be created for detecting and tracking human organs for medical applications. It is to be understood by those skilled in the art that the present invention may be used in other environments as well. Another environment in which the present invention would be useful is in the detection and tracking of vehicles. In addition to detecting different types of vehicles, there is also sometimes a need for detecting pedestrians, traffic signs and other vehicle-environment related objects. For purposes of describing the present invention, it will be described in the on-road obstacle detection environment.
  • FIG. 1 illustrates a block diagram of a system for implementing the present invention.
  • One or more cameras 102 are used to capture images of a road and its surroundings. As would be expected with a typical road image, the image includes background images, such as buildings, trees, and houses, and vehicles driving on the road.
  • the images are communicated to a processor 104 which computes confidence scores using the component classifiers in a database 106.
  • the images are analyzed in accordance with the present invention to identify different classes of objects. Examples of such objects include cars, trucks, pedestrians and traffic signs.
  • Once an object is identified at a sufficient confidence level, the object is identified via an output device 108.
  • the output device 108 provides an output signal which communicates to the user the presence of one or more objects.
  • the output signal may be an audible signal or other type of warning signal.
  • the output device 308 may also include a display for viewing the detected objects.
  • the display provides a view of the images taken by the cameras 302 which may then be enhanced to indicate objects that have been detected and which are being tracked.
  • the present invention is directed to a number of training techniques which are used to identify and select a number of weak classifiers which are then stored in database 106 and used to detect the various objects.
  • a premise of the present invention is an underlying principle that all L detectors share a common set of features or weak classifiers.
  • each individual strong classifier is adapted to its individual set of weights ⁇ (1) posterior to the weak classifier selection.
  • individual weighting there is only a minimal increase in the computation complexity compared to single class detection.
  • the majority of the complexity derived in the computations of Equations (3) and (4) of the prior art arise from the evaluation of the weak classifiers and not the weighting.
  • FIG. 2 illustrates a multi-class object detection framework in accordance with the present invention.
  • Training of the system refers to the selection of a common set of weak classifiers H and the adaptation of individual weights ⁇ (1) .
  • the remaining open issue is the selection of the common weak classifier ensemble H.
  • three different techniques are described for selecting H.
  • the first technique assumes that one distinguished class l 0 exists.
  • the distinguished class can be, for instance, a class with a high occurrence of objects in the scene.
  • the training set of class / 0 is used to select a common ensemble
  • AdaBoost AdaBoost
  • the candidate weak classifiers are taken solely from H.
  • An example of an application where this technique might be used is a manufacturing application. Since the predominant object being detected might be an item on an assembly line the weak classifiers associated with that object would be used as the training set.
  • a second technique for creating a training set is used when the assignment of one distinguished class is not reasonable. In such an instance, a more symmetric procedure would be appropriate.
  • a common classifier H is trained based on the union of all training samples.
  • each individual classifier / is fine-tuned by adjusting ⁇ ⁇ with respect to H and the positive training samples solely from class I.
  • H is optimized in order to discriminate all positive objects from the negatives.
  • the individual detection problem can be specialized by adapting the respective impact of the individual weak classifiers.
  • facial detection uses classes that are made up of different facial poses. However, many of the classifiers for the poses are similar (e.g., eyes, nose, and mouth).
  • vehicle detection uses classes that are made up of different vehicles. In each case, normally certain features such as the edges and corners of the vehicle are looked at.
  • a third technique obtains H by collecting the most appropriate weak classifiers for each object class I.
  • individual sets of weak classifiers are selected in a first training step.
  • the final set H is obtained from a combination of ⁇ H 1) ,.--,H (L) ⁇ .
  • One approach is to use the union
  • This technique is best used for applications that require a distribution of data resources.
  • the application may also detect, traffic signs and pedestrians. Additional objects that may be detected include road lanes, guard rails and background objects like trees, houses and other buildings.
  • each of the classes uses weak classifiers that are distinct from the other classes. However, there still may be some common weak classifiers among all of the classes.
  • the framework shown in FIG. 2 can be extended to be applicable for multi-class classification.
  • the classifier assigns the most probable class label
  • the present invention uses a set of boosted weak classifiers that is common within all individual class detectors and thus has be to computed only once. A possible major loss in detection accuracy by this restriction is prevented by a posterior re-training of the weak classifier weights, individually to each object class to reduce its misclassification error.
  • the training complexity of the present invention does not increase significantly compared to the prior art approaches since re-adjustment of the individual class weights with AdaBoost is based only on H and thus on a much small set of weak classifier candidates.

Landscapes

  • Engineering & Computer Science (AREA)
  • Theoretical Computer Science (AREA)
  • Computer Vision & Pattern Recognition (AREA)
  • Artificial Intelligence (AREA)
  • Data Mining & Analysis (AREA)
  • General Physics & Mathematics (AREA)
  • Evolutionary Computation (AREA)
  • Physics & Mathematics (AREA)
  • Databases & Information Systems (AREA)
  • Software Systems (AREA)
  • Medical Informatics (AREA)
  • General Health & Medical Sciences (AREA)
  • Health & Medical Sciences (AREA)
  • Multimedia (AREA)
  • Computing Systems (AREA)
  • Life Sciences & Earth Sciences (AREA)
  • Bioinformatics & Cheminformatics (AREA)
  • Bioinformatics & Computational Biology (AREA)
  • Evolutionary Biology (AREA)
  • General Engineering & Computer Science (AREA)
  • Image Analysis (AREA)
PCT/US2005/015854 2004-05-10 2005-05-05 Method for combining boosted classifiers for efficient multi-class object detection Ceased WO2005111918A1 (en)

Priority Applications (2)

Application Number Priority Date Filing Date Title
JP2007513229A JP4999101B2 (ja) 2004-05-10 2005-05-05 効率的なマルチクラス対象物検出のためにブースト分類器を組み合わせる方法
EP05745708.7A EP1745414B1 (en) 2004-05-10 2005-05-05 Method for combining boosted classifiers for efficient multi-class object detection

Applications Claiming Priority (4)

Application Number Priority Date Filing Date Title
US56955204P 2004-05-10 2004-05-10
US60/569,552 2004-05-10
US11/111,640 2005-04-21
US11/111,640 US7769228B2 (en) 2004-05-10 2005-04-21 Method for combining boosted classifiers for efficient multi-class object detection

Publications (1)

Publication Number Publication Date
WO2005111918A1 true WO2005111918A1 (en) 2005-11-24

Family

ID=34968880

Family Applications (1)

Application Number Title Priority Date Filing Date
PCT/US2005/015854 Ceased WO2005111918A1 (en) 2004-05-10 2005-05-05 Method for combining boosted classifiers for efficient multi-class object detection

Country Status (4)

Country Link
US (1) US7769228B2 (enExample)
EP (1) EP1745414B1 (enExample)
JP (1) JP4999101B2 (enExample)
WO (1) WO2005111918A1 (enExample)

Families Citing this family (31)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
US9310892B2 (en) 2000-11-06 2016-04-12 Nant Holdings Ip, Llc Object information derived from object images
US7680324B2 (en) 2000-11-06 2010-03-16 Evryx Technologies, Inc. Use of image-derived information as search criteria for internet and other search engines
US7565008B2 (en) 2000-11-06 2009-07-21 Evryx Technologies, Inc. Data capture and identification system and process
US8224078B2 (en) 2000-11-06 2012-07-17 Nant Holdings Ip, Llc Image capture and identification system and process
US7899243B2 (en) 2000-11-06 2011-03-01 Evryx Technologies, Inc. Image capture and identification system and process
US7508979B2 (en) * 2003-11-21 2009-03-24 Siemens Corporate Research, Inc. System and method for detecting an occupant and head pose using stereo detectors
WO2007107315A1 (de) * 2006-03-22 2007-09-27 Daimler Ag Multisensorieller hypothesen-basierter objektdetektor und objektverfolger
EP2150437B1 (en) 2007-04-30 2014-06-18 Mobileye Technologies Limited Rear obstruction detection
JP4891197B2 (ja) 2007-11-01 2012-03-07 キヤノン株式会社 画像処理装置および画像処理方法
US8433669B2 (en) * 2007-11-14 2013-04-30 International Business Machines Corporation Configuring individual classifiers with multiple operating points for cascaded classifier topologies under resource constraints
US8306940B2 (en) * 2009-03-20 2012-11-06 Microsoft Corporation Interactive visualization for generating ensemble classifiers
CN101853389A (zh) * 2009-04-01 2010-10-06 索尼株式会社 多类目标的检测装置及检测方法
US8666988B2 (en) * 2009-05-14 2014-03-04 International Business Machines Corporation Configuring classifier trees and classifying data
JP4806101B2 (ja) * 2010-02-01 2011-11-02 株式会社モルフォ 物体検出装置及び物体検出方法
EP2681693B1 (en) * 2011-03-04 2018-08-01 LBT Innovations Limited Method for improving classification results of a classifier
EP2574958B1 (en) * 2011-09-28 2017-02-22 Honda Research Institute Europe GmbH Road-terrain detection method and system for driver assistance systems
US9269017B2 (en) 2013-11-15 2016-02-23 Adobe Systems Incorporated Cascaded object detection
US9208404B2 (en) * 2013-11-15 2015-12-08 Adobe Systems Incorporated Object detection with boosted exemplars
US9710729B2 (en) * 2014-09-04 2017-07-18 Xerox Corporation Domain adaptation for image classification with class priors
CN105718937B (zh) * 2014-12-03 2019-04-05 财团法人资讯工业策进会 多类别对象分类方法及系统
IN2015MU01794A (enExample) * 2015-05-05 2015-06-19 Manharlal Shah Bhavin
CN106295666B (zh) * 2015-05-14 2020-03-03 佳能株式会社 获取分类器、检测对象的方法和装置及图像处理设备
WO2017044550A1 (en) * 2015-09-11 2017-03-16 Intel Corporation A real-time multiple vehicle detection and tracking
CN106327527B (zh) * 2016-08-11 2019-05-14 电子科技大学 基于Online Boosting的目标轮廓跟踪方法
WO2018057866A1 (en) * 2016-09-23 2018-03-29 Apple Inc. Multi-perspective imaging system and method
CN108021940B (zh) * 2017-11-30 2023-04-18 中国银联股份有限公司 基于机器学习的数据分类方法及系统
US10552299B1 (en) 2019-08-14 2020-02-04 Appvance Inc. Method and apparatus for AI-driven automatic test script generation
US10628630B1 (en) 2019-08-14 2020-04-21 Appvance Inc. Method and apparatus for generating a state machine model of an application using models of GUI objects and scanning modes
US11636385B2 (en) * 2019-11-04 2023-04-25 International Business Machines Corporation Training an object detector using raw and unlabeled videos and extracted speech
CN111881764B (zh) * 2020-07-01 2023-11-03 深圳力维智联技术有限公司 一种目标检测方法、装置、电子设备及存储介质
US20220254144A1 (en) * 2021-02-05 2022-08-11 Home Depot Product Authority, Llc Product image classification

Family Cites Families (7)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CA2167748A1 (en) * 1995-02-09 1996-08-10 Yoav Freund Apparatus and methods for machine learning hypotheses
US6453307B1 (en) * 1998-03-03 2002-09-17 At&T Corp. Method and apparatus for multi-class, multi-label information categorization
US6456993B1 (en) * 1999-02-09 2002-09-24 At&T Corp. Alternating tree-based classifiers and methods for learning them
US6546379B1 (en) * 1999-10-26 2003-04-08 International Business Machines Corporation Cascade boosting of predictive models
US7076473B2 (en) * 2002-04-19 2006-07-11 Mitsubishi Electric Research Labs, Inc. Classification with boosted dyadic kernel discriminants
US7362919B2 (en) * 2002-12-12 2008-04-22 Eastman Kodak Company Method for generating customized photo album pages and prints based on people and gender profiles
JP5025893B2 (ja) * 2004-03-29 2012-09-12 ソニー株式会社 情報処理装置および方法、記録媒体、並びにプログラム

Non-Patent Citations (8)

* Cited by examiner, † Cited by third party
Title
A. TORRALBA; K. P. MURPHY; W. T. FREEMAN, SHARING VISUAL FEATURES FOR MULTICLASS AND MULTIVIEW OBJECT DETECTION, April 2004 (2004-04-01)
P. VIOLA; M. JONES, RAPID OBJECT DETECTION USING A BOOSTED CASCADE OF SIMPLE FEATURES, 2001
R. XIAO; M. LI; H. ZHANG, ROBUST MULTIPOSE FACE DETECTION IN IMAGES, January 2004 (2004-01-01)
SCHNEIDERMAN H ET AL: "A statistical method for 3D object detection applied to faces and cars", PROCEEDINGS IEEE COMPUTER SOCIETY CONFERENCE OM COMPUTER VISION AND PATTERN RECOGNITION, vol. 1, September 2000 (2000-09-01), pages 746 - 751, XP002313459 *
TIEU K ET AL: "BOOSTING IMAGE RETRIEVAL", PROCEEDINGS 2000 IEEE CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION. CVPR 2000. HILTON HEAD ISLAND, SC, JUNE 13-15, 2000, PROCEEDINGS OF THE IEEE COMPUTER CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION, LOS ALAMITOS, CA : IEEE COMP. SO, vol. VOL. 1 OF 2, 13 June 2000 (2000-06-13), pages 228 - 235, XP001035602, ISBN: 0-7803-6527-5 *
TORRALBA A ET AL.: "Sharing Visual Features for Multiclass and Multiview Object Detection", MASSACHUSETTS INSTITUTE OF TECHNOLOGY, COMPUTER SCIENCE AND ARTIFICIAL INTELLIGENCE LABORATORY, AI MEMO, no. 2004-008, April 2004 (2004-04-01), Cambridge, Massachusetts, USA, XP002337421 *
VIOLA P ET AL: "Rapid object detection using a boosted cascade of simple features", PROCEEDINGS 2001 IEEE CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION. CVPR 2001. KAUAI, HAWAII, DEC. 8 - 14, 2001, PROCEEDINGS OF THE IEEE COMPUTER CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION, LOS ALAMITOS, CA, IEEE COMP. SOC, US, vol. VOL. 1 OF 2, 8 December 2001 (2001-12-08), pages 511 - 518, XP010583787, ISBN: 0-7695-1272-0 *
XIAO R ET AL: "ROBUST MULTIPOSE FACE DETECTION IN IMAGES", IEEE TRANSACTIONS ON CIRCUITS AND SYSTEMS FOR VIDEO TECHNOLOGY, IEEE INC. NEW YORK, US, vol. 14, no. 1, January 2004 (2004-01-01), pages 31 - 41, XP001186968, ISSN: 1051-8215 *

Also Published As

Publication number Publication date
EP1745414B1 (en) 2018-10-17
EP1745414A1 (en) 2007-01-24
JP4999101B2 (ja) 2012-08-15
US7769228B2 (en) 2010-08-03
JP2007537542A (ja) 2007-12-20
US20050249401A1 (en) 2005-11-10

Similar Documents

Publication Publication Date Title
EP1745414B1 (en) Method for combining boosted classifiers for efficient multi-class object detection
US9454819B1 (en) System and method for static and moving object detection
EP1782335B1 (en) Method for traffic sign detection
US7340443B2 (en) Cognitive arbitration system
EP1606769B1 (en) System and method for vehicle detection and tracking
US7639841B2 (en) System and method for on-road detection of a vehicle using knowledge fusion
US8320643B2 (en) Face authentication device
JP5259798B2 (ja) 映像解析方法およびシステム
US8515126B1 (en) Multi-stage method for object detection using cognitive swarms and system for automated response to detected objects
US20040017930A1 (en) System and method for detecting and tracking a plurality of faces in real time by integrating visual ques
US20050271280A1 (en) System or method for classifying images
WO2006073647A2 (en) Method and apparatus for unsupervised learning of discriminative edge measures for vehicle matching between non-overlapping cameras
Meuter et al. A decision fusion and reasoning module for a traffic sign recognition system
Zhou et al. Adaptive fusion of particle filtering and spatio-temporal motion energy for human tracking
EP2259221A1 (en) Computer system and method for tracking objects in video data
Qian et al. Intelligent surveillance systems
JP2008500640A (ja) 検出および追跡のためのグラフィカルオブジェクトモデル
Wang et al. A two-layer night-time vehicle detector
Al Najjar et al. A hybrid adaptive scheme based on selective Gaussian modeling for real-time object detection
Antonakakis et al. A Two-Phase ResNet for Object Detection in Aerial Images
Seitner et al. Pedestrian tracking based on colour and spatial information
Karthikeyan et al. Deep Neural Network Based Smart Intrusion Detection and Alerting System
Azhari et al. UAD Lecturers' introductory system through surveillance cameras with eigenface method
Chau Dynamic and robust object tracking for activity recognition
Madake et al. Electronic system for detection of vacant seats in public transport for visually impaired people

Legal Events

Date Code Title Description
AK Designated states

Kind code of ref document: A1

Designated state(s): AE AG AL AM AT AU AZ BA BB BG BR BW BY BZ CA CH CN CO CR CU CZ DE DK DM DZ EC EE EG ES FI GB GD GE GH GM HR HU ID IL IN IS JP KE KG KM KP KR KZ LC LK LR LS LT LU LV MA MD MG MK MN MW MX MZ NA NI NO NZ OM PG PH PL PT RO RU SC SD SE SG SK SL SM SY TJ TM TN TR TT TZ UA UG US UZ VC VN YU ZA ZM ZW

AL Designated countries for regional patents

Kind code of ref document: A1

Designated state(s): BW GH GM KE LS MW MZ NA SD SL SZ TZ UG ZM ZW AM AZ BY KG KZ MD RU TJ TM AT BE BG CH CY CZ DE DK EE ES FI FR GB GR HU IE IS IT LT LU MC NL PL PT RO SE SI SK TR BF BJ CF CG CI CM GA GN GQ GW ML MR NE SN TD TG

121 Ep: the epo has been informed by wipo that ep was designated in this application
WWE Wipo information: entry into national phase

Ref document number: 2007513229

Country of ref document: JP

NENP Non-entry into the national phase

Ref country code: DE

WWW Wipo information: withdrawn in national office

Country of ref document: DE

WWE Wipo information: entry into national phase

Ref document number: 2005745708

Country of ref document: EP

WWP Wipo information: published in national office

Ref document number: 2005745708

Country of ref document: EP