CN101807256B - Object identification detection method based on multiresolution frame - Google Patents

Object identification detection method based on multiresolution frame Download PDF

Info

Publication number
CN101807256B
CN101807256B CN 201010134143 CN201010134143A CN101807256B CN 101807256 B CN101807256 B CN 101807256B CN 201010134143 CN201010134143 CN 201010134143 CN 201010134143 A CN201010134143 A CN 201010134143A CN 101807256 B CN101807256 B CN 101807256B
Authority
CN
China
Prior art keywords
resolution
image
multiresolution
negative sample
positive
Prior art date
Legal status (The legal status is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the status listed.)
Expired - Fee Related
Application number
CN 201010134143
Other languages
Chinese (zh)
Other versions
CN101807256A (en
Inventor
张加万
付磊
张怡
高中杰
Current Assignee (The listed assignees may be inaccurate. Google has not performed a legal analysis and makes no representation or warranty as to the accuracy of the list.)
Tianjin University
Original Assignee
Tianjin University
Priority date (The priority date is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the date listed.)
Filing date
Publication date
Application filed by Tianjin University filed Critical Tianjin University
Priority to CN 201010134143 priority Critical patent/CN101807256B/en
Publication of CN101807256A publication Critical patent/CN101807256A/en
Application granted granted Critical
Publication of CN101807256B publication Critical patent/CN101807256B/en
Expired - Fee Related legal-status Critical Current
Anticipated expiration legal-status Critical

Links

Images

Abstract

The invention belongs to the field of computer vision and relates to an object identification detection method based on a multiresolution frame. The method comprises the following steps: combining a Speeded Up Robust Features (SURF) descriptor with a common simple model (Bag-of-word model) in text categorization and an SVM (support vector machine), thereby constructing a supervised-learning two-dimensional object categorizer based on SURF; and on the basis of the categorizer, combining the SVM categorizer based on SURF with the image multiresolution theory, thereby detecting the object in the spaces with different resolutions. The invention can effectively solve the problems caused by scale change, revolution change, translation change, illumination change, visual angle change and the like in object identification and detection. Under the conditions of a multiresolution object detection frame, the invention can effectively solve the problem of long detection time in using a moving window method, and accurately and quickly detect the position of the object.

Description

A kind of object identification detection method based on the multiresolution framework
Technical field
The invention belongs to computer vision field, relate to a kind of object identification detection method.
Background technology
The background technology that relates among the present invention has:
(1). fast robust (Speeded Up Robust Features, SURF) image local feature descriptor (seeing document [1]): SURF has yardstick and invariable rotary, the feature that has fine robustness for illumination variation, and than other local features, this feature has been taked different feature extraction modes, increased the extraction rate of feature, reduced simultaneously the dimension of describing, and has preferably feature specificity, reach good discrimination, the many aspects in computer vision had obtained application in recent years.
(2). in numerous object identification detection algorithms, object detection algorithm based on supervised learning can detect the object of wanting classification in the image quickly and accurately, and not the object detection of other classifications not out, i.e. recognition object when detecting, and this object detection algorithm can carry out object detection to single image, also can obtain reasonable detection effect for complex scene.For example, the people such as Agarwal (seeing document [2]) propose to use the Forstner point-of-interest just detecting sub-detected image spy, then by SnoW (Sparse Network of Winnows) learning framework training classifier, utilize this sorter to form the act of categorization figure of multiscale space, realize the detection of object by analytic activity figure.The people such as Dalal (seeing document [3]) propose to utilize the gradient direction histogram, realize the detection of people's face in conjunction with the svm classifier device, and reach good effect.But these algorithms still face a lot of problems and challenge, such as problems such as dimensional variation, rotation variation, translation variation, illumination variation, visual angle change.
Summary of the invention
The above-mentioned deficiency that the object of the invention is to the customer service prior art, providing a kind of can have the accurate object identification detection method that also detects fast object space of the complex conditions such as dimensional variation, visual angle change, rotation variation, brightness variation, partial occlusion.For this reason, the present invention adopts following technical scheme:
A kind of object identification detection method based on the multiresolution framework comprises object detection two parts under structure multiresolution framework sorter and the multiresolution framework, and step is as follows:
The first step: the positive and negative sample image of selecting to divide respectively type objects, positive sample is the sub-category subject image of wanting, and negative sample is any other subject image of non-this type objects, and establishing original image resolution is R, be configured to the training set T (R) of sorter;
Second step: training set T (R) is done respectively the sampling of different sampling rate σ, and the resolution that obtains image is respectively r=1, and 2,3 ... image training set T (r) under each resolution of R;
The 3rd step: obtain object classification device C (r) according to following method, finally obtain a hierarchical classifier H (r) who comprises each resolution, H (r) is made of R independent sorter C (r):
(a) from image training set T (r), extract respectively the SURF feature descriptor of all images in positive sample and the negative sample, positive and negative sample characteristics descriptor is stored respectively among two set; (b) utilize the K-means clustering method, the feature descriptor that aligns the negative sample collection carries out cluster analysis; (c) with the cluster combination of positive and negative sample set, form the characteristics of image dictionary; (d) according to the index of positive and negative sample set in cluster, obtain every width of cloth image clustering histogram, and according to the corresponding corresponding label of positive negative sample, histogram data and label are carried out the support vector machine training, obtain the sorter C (r) under the resolution r;
The 4th step: the image that image to be detected is extracted respectively R resolution different resolution according to the structure flow process of multiresolution sorter, then the test pattern of different resolution is extracted successively again the multi-scale image of different scale s, s=1,2,3 ... S, scale factor is β;
The 5th step: each metric space at each image in different resolution uses the window of formed objects to detect, and window uses different windows to detect between different resolution, and the window size of each resolution is (w r, h r)=(w, h)/a R-r, w, h are respectively the length of original resolution window and wide, a is fixed constant;
The 6th step: for each resolution r and yardstick s, the state of all initial detecting windows all is initialized as 1, use sorter C (r) with each metric space image of same window size detection resolution r, on metric space s, to detect is 0, the window area that does not namely comprise object abandons, be that 1 window area that namely comprises object keeps with detection, be delivered among the same metric space s of r+1 resolution and go, carry out successively, until detect highest resolution R, obtain the zone that different scale space s among the highest resolution R comprises object;
The 7th step: each metric space obtains the final position of object by the method with the Mean-shift cluster under original resolution R.
As preferred implementation, wherein the step (b) in the 3rd step comprises the following steps: to utilize the K-means clustering method, the feature descriptor of positive and negative sample set is gathered into respectively the K class, each self-forming K key word, and obtain the cluster index of feature descriptor in every width of cloth image, be which cluster each character symbol exists in the middle of, write down simultaneously the center value of each cluster centre.
The present invention is by summing up relative merits and the scope of application of current main object identification detection technique, according to the characteristics that the image local feature descriptor of relatively good performance is arranged in object classification identification at present, constructed a kind of object classification device based on local feature description's symbol, and the problem that faces in the implementation algorithm according to existing object detection technology, propose a kind of object classification device region-by-region that under image multiresolution framework, utilizes and carried out the method for object detection, obtained good target recognition detection effect.The problem that the dimensional variation that faces during the present invention can effectively solve object identification and detect, rotation variation, translation variation, illumination variation, visual angle change etc. are brought, under multiresolution object detection framework, can effectively solve the problem of using the object detection overlong time that the moving window method brings, accurately and detect fast the position of object.
Description of drawings
Fig. 1 is based on the object identification detection method overview flow chart of multiresolution framework;
(a) of Fig. 2, (b) (c) (d) (e) (f) image be respectively σ=0, Isosorbide-5-Nitrae, 16,64,256 multi-scale Representation;
Fig. 3 multiresolution framework;
Fig. 4 (a), (b) are respectively multiresolution object detection effect and traditional technique in measuring effect under 640 * 480 resolution.
The single resolution object of table 1 detects and the multiresolution object detection time table of comparisons.
Embodiment
The present invention proposes the object identification detection method based on the multiresolution framework, fast robust feature descriptor (SURF) is combined in conjunction with naive model Bag-of-word (word bag model) model and SVM (support vector machine) commonly used in the text classification, constructed the supervised learning binary object sorter based on the SURF feature; On the basis of above-mentioned sorter, will based on the theoretical combination of the multiresolution of the svm classifier device of SURF feature and image, under different resolution space, carry out the detection of object.The problem that the dimensional variation that faces during the present invention can effectively solve object identification and detect, rotation variation, translation variation, illumination variation, visual angle change etc. are brought, under multiresolution object detection framework, can effectively solve the problem of using the object detection overlong time that the moving window method brings, accurately and detect fast the position of object.
The present invention is based on the object identification detection method of multiresolution framework, and Fig. 1 is overview flow chart, specifically may further comprise the steps:
1. the structure of multiresolution framework sorter;
Below be the fast robust tagsort device flow process under the multiresolution framework:
(1). choose training set and test set for the sorter structure.(mainly use ETH-80 for object detection algorithm of the present invention from an image data base, Caltech101 and 2,006 three kinds of databases of The PASCAL Visual Object Classes Challenge, document [4] specifically sees reference, [5] and [6]) in select respectively to divide the positive and negative sample image of type objects, for object detecting system, the positive general employing of sample only comprises the image of this type objects, the least possible background that comprises, to reduce the interference of background, can carry out pre-service during positive sample set structure, positive sample is done some manually to be cut apart, only extract the positive sample of subject image conduct in the image, the general employing of negative sample does not comprise the image that object only comprises the image of background or comprises other objects;
(2). the resolution of establishing image is respectively r=1,2,3......R, R is original image resolution, the sampling rate of image is σ, and the sampling (can adopt simple partiting row sampling, sampling rate σ=0.5) that the training set T (R) under the former resolution R is made respectively different sampling rate σ obtains the image collection T (r) under each resolution, these image collections become respectively the training set under each resolution, have consisted of so a plurality of training set;
(3). under each resolution r, extract respectively SURF feature and the character symbol of each image among each training set T (r), by Bag-of-word model and SVM constructed object sorter C (r) separately under each resolution r, can adopt different number of clusters under the different resolution, so corresponding sorter under each image resolution ratio can arrange the threshold value of different sorter C (r) judgment object classifications under different resolution.High resolving power from the low resolution of image to image training classifier gradually finally obtains a hierarchical classifier H (r) who comprises each resolution like this, and H (r) is made of R independent sorter C (r).Wherein the concrete construction process of C (r) is as follows:
(a) from resolution r hypograph set T (r), extract respectively the SURF feature descriptor of all images in positive sample and the negative sample, positive and negative sample characteristics descriptor is stored respectively among two set; (b). utilize the K-means clustering method, the feature descriptor of positive and negative sample set is gathered into respectively the K class, each self-forming K key word, and obtain the cluster index of feature descriptor in every width of cloth image, be which cluster each character symbol exists in the middle of, write down simultaneously the center value of each cluster centre; (c). with the cluster combination of positive and negative sample set, form a characteristics of image dictionary that size is 2K, form the inquiry of characteristics of image histogram in order to test pattern; (d) according to the index of positive and negative sample set in cluster in (b), obtain every width of cloth image clustering histogram, and according to the corresponding corresponding label of positive negative sample, histogram data and label are sent into SVM train, obtain the sorter C (r) under the resolution r.
2. the detection of object under the multiresolution framework:
(1) test pattern is extracted respectively the image of R resolution different resolution according to the structure flow process of multiresolution sorter, then the test pattern of different resolution is extracted successively again the multi-scale image of different scale s, s=1,2,3 ... S, scale factor is β, and yardstick is larger, and the scale factor of the gaussian filtering of doing is larger.The multi-resolution image of test pattern and the multi-scale image under each resolution have so just been obtained;
(2) obtain the image of each each yardstick of resolution after, each metric space at each image in different resolution uses the window of formed objects to detect, window uses different windows to detect between different resolution, and the window size of each resolution is (w r, h r)=(w, h)/a R-r, w wherein, h are respectively the length of original resolution window and wide, a is the constant of fixing.Detection different metric space under same resolution can carry out simultaneously, is independent of each other, and must carry out from low to high between the different resolution;
(3) for each resolution r and yardstick s, the state of all initial detecting windows all is initialized as 1, namely acquiescence comprises object, use sorter C (r) with each metric space image of same window size detection resolution r, on metric space s, to detect is 0, the window area that does not namely comprise object abandons, be that 1 window area that namely comprises object keeps with detection, being delivered to a high resolution is to go among the same metric space s of r+1 resolution, carry out successively, until detect highest resolution R, this has just obtained the zone that different scale space s among the highest resolution R comprises the soil body;
(4) owing to use the method for this window classification also insensitive in some little changes of yardstick and position for object, so sorter can be near a position duplicate detection repeatedly, in order to obtain the last position of object, need again each metric space under the original resolution to obtain the final position of object by the method with the Mean-shift cluster.
Fig. 2 is the multi-scale Representation of image, Fig. 3 is the multiresolution framework, Fig. 4 (a), (b) are respectively the contrast that multiresolution detects effect and the given standard detection effect of database (what test use is The PASCAL Visual Object Classes Challenge 2006 image data bases), and the difference of two width of cloth figure is carried out mark with rectangle frame.As can be seen from Table 1: the sorter by the fast robust feature detects under the framework at the object multiresolution and has obtained good detection effect, can comprising the complete object under test that detects under few spatial context of trying one's best, obtain reasonable verification and measurement ratio.
Figure GDA0000102929250000041
The single resolution object of table 1 detects and the multiresolution object detection time table of comparisons
List of references
[1]Bay.H,Tuytelaars.T,Gool.LV.Surf:Speeded?up?robust?features.InThe?ninth?European?Conference?on?Computer?Vision,2006.
[2]Agarwal,S.,A.Awan,and?D.Roth,Learning?to?detect?objects?in?images?via?a?sparse,part-based?representation.Pattern?Analysis?and?Machine?Intelligence,IEEE?Transactions?on,2004.26(11):p.1475-1490.
[3]Dalal,N.and?B.Triggs.Histograms?of?oriented?gradients?for?human?detection.in?Computer?Vision?and?Pattern?Recognition,2005.CVPR?2005.IEEE?Computer?Society?Conference?on.2005.
[4]http://www.mis.informatik.tu-darmstadt.de/Research/Projects/categorization/eth80-db.html.
[5]http://www.vision.caltech.edu/ImageDatasets/Caltech101/
[6]http://www.pascal-network.org/challenges/VOC/voc2006/index.html

Claims (2)

1. the object identification detection method based on the multiresolution framework comprises object detection two parts under structure multiresolution framework sorter and the multiresolution framework, and step is as follows:
The first step: the positive and negative sample image of selecting to divide respectively type objects, positive sample is the sub-category subject image of wanting, negative sample is any other subject image of non-this type objects, and establishing original image resolution is R, is configured to the training set T (R) of sorter;
Second step: training set T (R) is done respectively the sampling of different sampling rate σ, and the resolution that obtains image is respectively r=1, and 2,3 ... image training set T (r) under each resolution of R;
The 3rd step: obtain object classification device C (r) according to following method, finally obtain a hierarchical classifier H (r) who comprises each resolution, H (r) is made of R independent sorter C (r):
(a) from image training set T (r), extract respectively the SURF feature descriptor of all images in positive sample and the negative sample, positive and negative sample characteristics descriptor is stored respectively among two set; (b) utilize the K-means clustering method, the feature descriptor that aligns the negative sample collection carries out cluster analysis; (c) with the cluster combination of positive and negative sample set, form the characteristics of image dictionary; (d) according to the index of positive and negative sample set in cluster, obtain every width of cloth image clustering histogram, and according to the corresponding corresponding label of positive negative sample, histogram data and label are carried out the support vector machine training, obtain resolution and be the sorter C (r) under the r;
The 4th step: the image that image to be detected is extracted respectively R resolution different resolution according to the structure flow process of multiresolution sorter, then the test pattern of different resolution is extracted successively again the multi-scale image of different scale s, s=1,2,3 ... S, scale factor is β;
The 5th step: each metric space at each image in different resolution uses the window of formed objects to detect, and window uses different windows to detect between different resolution, and the window size of each resolution is (w r, h r)=(w, h)/a R-r, w, h are respectively the length of original resolution window and wide, a is fixed constant;
The 6th step: for each resolution r and yardstick s, the state of all initial detecting windows all is initialized as 1, use sorter C (r) with same window size detection resolution each metric space image as r, on metric space s, to detect is 0, the window area that does not namely comprise object abandons, be that 1 window area that namely comprises object keeps with detection, be delivered to resolution and be among the same metric space s of r+1 and go, carry out successively, until detect highest resolution, be the original image resolution of R, obtain the zone that different scale space s in the highest resolution comprises object;
The 7th step: be the final position that each metric space obtains object under the original image resolution of R by the method with the Mean-shift cluster.
2. the object identification detection method based on the multiresolution framework according to claim 1, it is characterized in that, the step (b) in the 3rd step wherein comprises the following steps: to utilize the K-means clustering method, the feature descriptor of positive and negative sample set is gathered into respectively the K class, each self-forming K key word, and obtain the cluster index of feature descriptor in every width of cloth image, namely which cluster each character symbol exists in the middle of, writes down simultaneously the center value of each cluster centre.
CN 201010134143 2010-03-29 2010-03-29 Object identification detection method based on multiresolution frame Expired - Fee Related CN101807256B (en)

Priority Applications (1)

Application Number Priority Date Filing Date Title
CN 201010134143 CN101807256B (en) 2010-03-29 2010-03-29 Object identification detection method based on multiresolution frame

Applications Claiming Priority (1)

Application Number Priority Date Filing Date Title
CN 201010134143 CN101807256B (en) 2010-03-29 2010-03-29 Object identification detection method based on multiresolution frame

Publications (2)

Publication Number Publication Date
CN101807256A CN101807256A (en) 2010-08-18
CN101807256B true CN101807256B (en) 2013-03-20

Family

ID=42609044

Family Applications (1)

Application Number Title Priority Date Filing Date
CN 201010134143 Expired - Fee Related CN101807256B (en) 2010-03-29 2010-03-29 Object identification detection method based on multiresolution frame

Country Status (1)

Country Link
CN (1) CN101807256B (en)

Families Citing this family (17)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN102622607B (en) * 2012-02-24 2013-09-25 河海大学 Remote sensing image classification method based on multi-feature fusion
CN102708380B (en) * 2012-05-08 2014-02-12 东南大学 Indoor common object identification method based on machine vision
CN102750708B (en) * 2012-05-11 2014-10-15 天津大学 Affine motion target tracing algorithm based on fast robust feature matching
CN103186795B (en) * 2013-03-28 2016-06-29 浙江大学 Writing brush word style of calligraphy recognition methods based on similarity sensitive coding
CN103310208B (en) * 2013-07-10 2016-05-11 西安电子科技大学 The distinctive human face posture recognition methods of describing based on local geometric vision phrase
CN103680145B (en) * 2013-12-06 2016-01-20 河海大学 A kind of people's car automatic identifying method based on local image characteristics
CN103745200B (en) * 2014-01-02 2017-01-04 哈尔滨工程大学 A kind of facial image recognition method based on word bag model
CN104239897B (en) * 2014-09-04 2017-05-17 天津大学 Visual feature representing method based on autoencoder word bag
CN104766096B (en) * 2015-04-17 2017-11-10 南京大学 A kind of image classification method based on multiple dimensioned global characteristics and local feature
CN104899611B (en) * 2015-06-26 2018-09-25 小米科技有限责任公司 Determine the method and device of card position in image
CN106355188B (en) * 2015-07-13 2020-01-21 阿里巴巴集团控股有限公司 Image detection method and device
CN107316001A (en) * 2017-05-31 2017-11-03 天津大学 Small and intensive method for traffic sign detection in a kind of automatic Pilot scene
CN108805063A (en) * 2018-05-31 2018-11-13 王红军 A kind of multiresolution visual perception method to object and environment
CN108921875B (en) * 2018-07-09 2021-08-17 哈尔滨工业大学(深圳) Real-time traffic flow detection and tracking method based on aerial photography data
CN109255789B (en) * 2018-07-26 2020-08-25 上海广播电视信息网络有限公司 Image segmentation system based on computer processing
CN111914795B (en) * 2020-08-17 2022-05-27 四川大学 Method for detecting rotating target in aerial image
JP2023119326A (en) * 2022-02-16 2023-08-28 Tvs Regza株式会社 Video image analysis apparatus and video image analysis method

Citations (3)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN101061510A (en) * 2004-11-19 2007-10-24 皇家飞利浦电子股份有限公司 False positive reduction in computer-assisted detection (CAD)
CN101226590A (en) * 2008-01-31 2008-07-23 湖南创合制造有限公司 Method for recognizing human face
CN101594314A (en) * 2008-05-30 2009-12-02 电子科技大学 A kind of spam image-recognizing method and device based on high-order autocorrelation characteristic

Family Cites Families (2)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
US20090175514A1 (en) * 2004-11-19 2009-07-09 Koninklijke Philips Electronics, N.V. Stratification method for overcoming unbalanced case numbers in computer-aided lung nodule false positive reduction
TW200842733A (en) * 2007-04-17 2008-11-01 Univ Nat Chiao Tung Object image detection method

Patent Citations (3)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN101061510A (en) * 2004-11-19 2007-10-24 皇家飞利浦电子股份有限公司 False positive reduction in computer-assisted detection (CAD)
CN101226590A (en) * 2008-01-31 2008-07-23 湖南创合制造有限公司 Method for recognizing human face
CN101594314A (en) * 2008-05-30 2009-12-02 电子科技大学 A kind of spam image-recognizing method and device based on high-order autocorrelation characteristic

Also Published As

Publication number Publication date
CN101807256A (en) 2010-08-18

Similar Documents

Publication Publication Date Title
CN101807256B (en) Object identification detection method based on multiresolution frame
CN101976258B (en) Video semantic extraction method by combining object segmentation and feature weighing
Mu et al. Discriminative local binary patterns for human detection in personal album
CN102682287B (en) Pedestrian detection method based on saliency information
Zhang et al. Text extraction from natural scene image: A survey
Yi et al. Text detection in natural scene images by stroke gabor words
CN103514456B (en) Image classification method and device based on compressed sensing multi-core learning
CN101894262B (en) Method and apparatus for classifying image
CN101763503B (en) Face recognition method of attitude robust
CN102938065B (en) Face feature extraction method and face identification method based on large-scale image data
Yap et al. A comparative study of mobile-based landmark recognition techniques
CN104951793B (en) A kind of Human bodys' response method based on STDF features
CN101930549B (en) Second generation curvelet transform-based static human detection method
CN111126240B (en) Three-channel feature fusion face recognition method
Huang et al. Isolated handwritten Pashto character recognition using a K-NN classification tool based on zoning and HOG feature extraction techniques
Arandjelovic Crowd detection from still images
Bai et al. Scene text localization using gradient local correlation
CN102693427A (en) Method and device for forming detector for detecting images
CN102346851B (en) Image segmentation method based on NJW (Ng-Jordan-Weiss) spectral clustering mark
CN106326916A (en) Multi-scale feature estimation and high-order BING feature-based target detection method
CN103345621A (en) Face classification method based on sparse concentration index
CN103971096A (en) Multi-pose face recognition method based on MB-LBP features and face energy diagram
CN103605993A (en) Image-to-video face identification method based on distinguish analysis oriented to scenes
CN107103289B (en) Method and system for handwriting identification by using handwriting outline characteristics
Zheng et al. Weight-based sparse coding for multi-shot person re-identification

Legal Events

Date Code Title Description
C06 Publication
PB01 Publication
C10 Entry into substantive examination
SE01 Entry into force of request for substantive examination
C14 Grant of patent or utility model
GR01 Patent grant
CF01 Termination of patent right due to non-payment of annual fee
CF01 Termination of patent right due to non-payment of annual fee

Granted publication date: 20130320