CN101807256B - Object identification detection method based on multiresolution frame - Google Patents
Object identification detection method based on multiresolution frame Download PDFInfo
- Publication number
- CN101807256B CN101807256B CN 201010134143 CN201010134143A CN101807256B CN 101807256 B CN101807256 B CN 101807256B CN 201010134143 CN201010134143 CN 201010134143 CN 201010134143 A CN201010134143 A CN 201010134143A CN 101807256 B CN101807256 B CN 101807256B
- Authority
- CN
- China
- Prior art keywords
- resolution
- image
- multiresolution
- negative sample
- positive
- Prior art date
- Legal status (The legal status is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the status listed.)
- Expired - Fee Related
Links
Images
Abstract
The invention belongs to the field of computer vision and relates to an object identification detection method based on a multiresolution frame. The method comprises the following steps: combining a Speeded Up Robust Features (SURF) descriptor with a common simple model (Bag-of-word model) in text categorization and an SVM (support vector machine), thereby constructing a supervised-learning two-dimensional object categorizer based on SURF; and on the basis of the categorizer, combining the SVM categorizer based on SURF with the image multiresolution theory, thereby detecting the object in the spaces with different resolutions. The invention can effectively solve the problems caused by scale change, revolution change, translation change, illumination change, visual angle change and the like in object identification and detection. Under the conditions of a multiresolution object detection frame, the invention can effectively solve the problem of long detection time in using a moving window method, and accurately and quickly detect the position of the object.
Description
Technical field
The invention belongs to computer vision field, relate to a kind of object identification detection method.
Background technology
The background technology that relates among the present invention has:
(1). fast robust (Speeded Up Robust Features, SURF) image local feature descriptor (seeing document [1]): SURF has yardstick and invariable rotary, the feature that has fine robustness for illumination variation, and than other local features, this feature has been taked different feature extraction modes, increased the extraction rate of feature, reduced simultaneously the dimension of describing, and has preferably feature specificity, reach good discrimination, the many aspects in computer vision had obtained application in recent years.
(2). in numerous object identification detection algorithms, object detection algorithm based on supervised learning can detect the object of wanting classification in the image quickly and accurately, and not the object detection of other classifications not out, i.e. recognition object when detecting, and this object detection algorithm can carry out object detection to single image, also can obtain reasonable detection effect for complex scene.For example, the people such as Agarwal (seeing document [2]) propose to use the Forstner point-of-interest just detecting sub-detected image spy, then by SnoW (Sparse Network of Winnows) learning framework training classifier, utilize this sorter to form the act of categorization figure of multiscale space, realize the detection of object by analytic activity figure.The people such as Dalal (seeing document [3]) propose to utilize the gradient direction histogram, realize the detection of people's face in conjunction with the svm classifier device, and reach good effect.But these algorithms still face a lot of problems and challenge, such as problems such as dimensional variation, rotation variation, translation variation, illumination variation, visual angle change.
Summary of the invention
The above-mentioned deficiency that the object of the invention is to the customer service prior art, providing a kind of can have the accurate object identification detection method that also detects fast object space of the complex conditions such as dimensional variation, visual angle change, rotation variation, brightness variation, partial occlusion.For this reason, the present invention adopts following technical scheme:
A kind of object identification detection method based on the multiresolution framework comprises object detection two parts under structure multiresolution framework sorter and the multiresolution framework, and step is as follows:
The first step: the positive and negative sample image of selecting to divide respectively type objects, positive sample is the sub-category subject image of wanting, and negative sample is any other subject image of non-this type objects, and establishing original image resolution is R, be configured to the training set T (R) of sorter;
Second step: training set T (R) is done respectively the sampling of different sampling rate σ, and the resolution that obtains image is respectively r=1, and 2,3 ... image training set T (r) under each resolution of R;
The 3rd step: obtain object classification device C (r) according to following method, finally obtain a hierarchical classifier H (r) who comprises each resolution, H (r) is made of R independent sorter C (r):
(a) from image training set T (r), extract respectively the SURF feature descriptor of all images in positive sample and the negative sample, positive and negative sample characteristics descriptor is stored respectively among two set; (b) utilize the K-means clustering method, the feature descriptor that aligns the negative sample collection carries out cluster analysis; (c) with the cluster combination of positive and negative sample set, form the characteristics of image dictionary; (d) according to the index of positive and negative sample set in cluster, obtain every width of cloth image clustering histogram, and according to the corresponding corresponding label of positive negative sample, histogram data and label are carried out the support vector machine training, obtain the sorter C (r) under the resolution r;
The 4th step: the image that image to be detected is extracted respectively R resolution different resolution according to the structure flow process of multiresolution sorter, then the test pattern of different resolution is extracted successively again the multi-scale image of different scale s, s=1,2,3 ... S, scale factor is β;
The 5th step: each metric space at each image in different resolution uses the window of formed objects to detect, and window uses different windows to detect between different resolution, and the window size of each resolution is (w
r, h
r)=(w, h)/a
R-r, w, h are respectively the length of original resolution window and wide, a is fixed constant;
The 6th step: for each resolution r and yardstick s, the state of all initial detecting windows all is initialized as 1, use sorter C (r) with each metric space image of same window size detection resolution r, on metric space s, to detect is 0, the window area that does not namely comprise object abandons, be that 1 window area that namely comprises object keeps with detection, be delivered among the same metric space s of r+1 resolution and go, carry out successively, until detect highest resolution R, obtain the zone that different scale space s among the highest resolution R comprises object;
The 7th step: each metric space obtains the final position of object by the method with the Mean-shift cluster under original resolution R.
As preferred implementation, wherein the step (b) in the 3rd step comprises the following steps: to utilize the K-means clustering method, the feature descriptor of positive and negative sample set is gathered into respectively the K class, each self-forming K key word, and obtain the cluster index of feature descriptor in every width of cloth image, be which cluster each character symbol exists in the middle of, write down simultaneously the center value of each cluster centre.
The present invention is by summing up relative merits and the scope of application of current main object identification detection technique, according to the characteristics that the image local feature descriptor of relatively good performance is arranged in object classification identification at present, constructed a kind of object classification device based on local feature description's symbol, and the problem that faces in the implementation algorithm according to existing object detection technology, propose a kind of object classification device region-by-region that under image multiresolution framework, utilizes and carried out the method for object detection, obtained good target recognition detection effect.The problem that the dimensional variation that faces during the present invention can effectively solve object identification and detect, rotation variation, translation variation, illumination variation, visual angle change etc. are brought, under multiresolution object detection framework, can effectively solve the problem of using the object detection overlong time that the moving window method brings, accurately and detect fast the position of object.
Description of drawings
Fig. 1 is based on the object identification detection method overview flow chart of multiresolution framework;
(a) of Fig. 2, (b) (c) (d) (e) (f) image be respectively σ=0, Isosorbide-5-Nitrae, 16,64,256 multi-scale Representation;
Fig. 3 multiresolution framework;
Fig. 4 (a), (b) are respectively multiresolution object detection effect and traditional technique in measuring effect under 640 * 480 resolution.
The single resolution object of table 1 detects and the multiresolution object detection time table of comparisons.
Embodiment
The present invention proposes the object identification detection method based on the multiresolution framework, fast robust feature descriptor (SURF) is combined in conjunction with naive model Bag-of-word (word bag model) model and SVM (support vector machine) commonly used in the text classification, constructed the supervised learning binary object sorter based on the SURF feature; On the basis of above-mentioned sorter, will based on the theoretical combination of the multiresolution of the svm classifier device of SURF feature and image, under different resolution space, carry out the detection of object.The problem that the dimensional variation that faces during the present invention can effectively solve object identification and detect, rotation variation, translation variation, illumination variation, visual angle change etc. are brought, under multiresolution object detection framework, can effectively solve the problem of using the object detection overlong time that the moving window method brings, accurately and detect fast the position of object.
The present invention is based on the object identification detection method of multiresolution framework, and Fig. 1 is overview flow chart, specifically may further comprise the steps:
1. the structure of multiresolution framework sorter;
Below be the fast robust tagsort device flow process under the multiresolution framework:
(1). choose training set and test set for the sorter structure.(mainly use ETH-80 for object detection algorithm of the present invention from an image data base, Caltech101 and 2,006 three kinds of databases of The PASCAL Visual Object Classes Challenge, document [4] specifically sees reference, [5] and [6]) in select respectively to divide the positive and negative sample image of type objects, for object detecting system, the positive general employing of sample only comprises the image of this type objects, the least possible background that comprises, to reduce the interference of background, can carry out pre-service during positive sample set structure, positive sample is done some manually to be cut apart, only extract the positive sample of subject image conduct in the image, the general employing of negative sample does not comprise the image that object only comprises the image of background or comprises other objects;
(2). the resolution of establishing image is respectively r=1,2,3......R, R is original image resolution, the sampling rate of image is σ, and the sampling (can adopt simple partiting row sampling, sampling rate σ=0.5) that the training set T (R) under the former resolution R is made respectively different sampling rate σ obtains the image collection T (r) under each resolution, these image collections become respectively the training set under each resolution, have consisted of so a plurality of training set;
(3). under each resolution r, extract respectively SURF feature and the character symbol of each image among each training set T (r), by Bag-of-word model and SVM constructed object sorter C (r) separately under each resolution r, can adopt different number of clusters under the different resolution, so corresponding sorter under each image resolution ratio can arrange the threshold value of different sorter C (r) judgment object classifications under different resolution.High resolving power from the low resolution of image to image training classifier gradually finally obtains a hierarchical classifier H (r) who comprises each resolution like this, and H (r) is made of R independent sorter C (r).Wherein the concrete construction process of C (r) is as follows:
(a) from resolution r hypograph set T (r), extract respectively the SURF feature descriptor of all images in positive sample and the negative sample, positive and negative sample characteristics descriptor is stored respectively among two set; (b). utilize the K-means clustering method, the feature descriptor of positive and negative sample set is gathered into respectively the K class, each self-forming K key word, and obtain the cluster index of feature descriptor in every width of cloth image, be which cluster each character symbol exists in the middle of, write down simultaneously the center value of each cluster centre; (c). with the cluster combination of positive and negative sample set, form a characteristics of image dictionary that size is 2K, form the inquiry of characteristics of image histogram in order to test pattern; (d) according to the index of positive and negative sample set in cluster in (b), obtain every width of cloth image clustering histogram, and according to the corresponding corresponding label of positive negative sample, histogram data and label are sent into SVM train, obtain the sorter C (r) under the resolution r.
2. the detection of object under the multiresolution framework:
(1) test pattern is extracted respectively the image of R resolution different resolution according to the structure flow process of multiresolution sorter, then the test pattern of different resolution is extracted successively again the multi-scale image of different scale s, s=1,2,3 ... S, scale factor is β, and yardstick is larger, and the scale factor of the gaussian filtering of doing is larger.The multi-resolution image of test pattern and the multi-scale image under each resolution have so just been obtained;
(2) obtain the image of each each yardstick of resolution after, each metric space at each image in different resolution uses the window of formed objects to detect, window uses different windows to detect between different resolution, and the window size of each resolution is (w
r, h
r)=(w, h)/a
R-r, w wherein, h are respectively the length of original resolution window and wide, a is the constant of fixing.Detection different metric space under same resolution can carry out simultaneously, is independent of each other, and must carry out from low to high between the different resolution;
(3) for each resolution r and yardstick s, the state of all initial detecting windows all is initialized as 1, namely acquiescence comprises object, use sorter C (r) with each metric space image of same window size detection resolution r, on metric space s, to detect is 0, the window area that does not namely comprise object abandons, be that 1 window area that namely comprises object keeps with detection, being delivered to a high resolution is to go among the same metric space s of r+1 resolution, carry out successively, until detect highest resolution R, this has just obtained the zone that different scale space s among the highest resolution R comprises the soil body;
(4) owing to use the method for this window classification also insensitive in some little changes of yardstick and position for object, so sorter can be near a position duplicate detection repeatedly, in order to obtain the last position of object, need again each metric space under the original resolution to obtain the final position of object by the method with the Mean-shift cluster.
Fig. 2 is the multi-scale Representation of image, Fig. 3 is the multiresolution framework, Fig. 4 (a), (b) are respectively the contrast that multiresolution detects effect and the given standard detection effect of database (what test use is The PASCAL Visual Object Classes Challenge 2006 image data bases), and the difference of two width of cloth figure is carried out mark with rectangle frame.As can be seen from Table 1: the sorter by the fast robust feature detects under the framework at the object multiresolution and has obtained good detection effect, can comprising the complete object under test that detects under few spatial context of trying one's best, obtain reasonable verification and measurement ratio.
The single resolution object of table 1 detects and the multiresolution object detection time table of comparisons
List of references
[1]Bay.H,Tuytelaars.T,Gool.LV.Surf:Speeded?up?robust?features.InThe?ninth?European?Conference?on?Computer?Vision,2006.
[2]Agarwal,S.,A.Awan,and?D.Roth,Learning?to?detect?objects?in?images?via?a?sparse,part-based?representation.Pattern?Analysis?and?Machine?Intelligence,IEEE?Transactions?on,2004.26(11):p.1475-1490.
[3]Dalal,N.and?B.Triggs.Histograms?of?oriented?gradients?for?human?detection.in?Computer?Vision?and?Pattern?Recognition,2005.CVPR?2005.IEEE?Computer?Society?Conference?on.2005.
[4]http://www.mis.informatik.tu-darmstadt.de/Research/Projects/categorization/eth80-db.html.
[5]http://www.vision.caltech.edu/ImageDatasets/Caltech101/
[6]http://www.pascal-network.org/challenges/VOC/voc2006/index.html
Claims (2)
1. the object identification detection method based on the multiresolution framework comprises object detection two parts under structure multiresolution framework sorter and the multiresolution framework, and step is as follows:
The first step: the positive and negative sample image of selecting to divide respectively type objects, positive sample is the sub-category subject image of wanting, negative sample is any other subject image of non-this type objects, and establishing original image resolution is R, is configured to the training set T (R) of sorter;
Second step: training set T (R) is done respectively the sampling of different sampling rate σ, and the resolution that obtains image is respectively r=1, and 2,3 ... image training set T (r) under each resolution of R;
The 3rd step: obtain object classification device C (r) according to following method, finally obtain a hierarchical classifier H (r) who comprises each resolution, H (r) is made of R independent sorter C (r):
(a) from image training set T (r), extract respectively the SURF feature descriptor of all images in positive sample and the negative sample, positive and negative sample characteristics descriptor is stored respectively among two set; (b) utilize the K-means clustering method, the feature descriptor that aligns the negative sample collection carries out cluster analysis; (c) with the cluster combination of positive and negative sample set, form the characteristics of image dictionary; (d) according to the index of positive and negative sample set in cluster, obtain every width of cloth image clustering histogram, and according to the corresponding corresponding label of positive negative sample, histogram data and label are carried out the support vector machine training, obtain resolution and be the sorter C (r) under the r;
The 4th step: the image that image to be detected is extracted respectively R resolution different resolution according to the structure flow process of multiresolution sorter, then the test pattern of different resolution is extracted successively again the multi-scale image of different scale s, s=1,2,3 ... S, scale factor is β;
The 5th step: each metric space at each image in different resolution uses the window of formed objects to detect, and window uses different windows to detect between different resolution, and the window size of each resolution is (w
r, h
r)=(w, h)/a
R-r, w, h are respectively the length of original resolution window and wide, a is fixed constant;
The 6th step: for each resolution r and yardstick s, the state of all initial detecting windows all is initialized as 1, use sorter C (r) with same window size detection resolution each metric space image as r, on metric space s, to detect is 0, the window area that does not namely comprise object abandons, be that 1 window area that namely comprises object keeps with detection, be delivered to resolution and be among the same metric space s of r+1 and go, carry out successively, until detect highest resolution, be the original image resolution of R, obtain the zone that different scale space s in the highest resolution comprises object;
The 7th step: be the final position that each metric space obtains object under the original image resolution of R by the method with the Mean-shift cluster.
2. the object identification detection method based on the multiresolution framework according to claim 1, it is characterized in that, the step (b) in the 3rd step wherein comprises the following steps: to utilize the K-means clustering method, the feature descriptor of positive and negative sample set is gathered into respectively the K class, each self-forming K key word, and obtain the cluster index of feature descriptor in every width of cloth image, namely which cluster each character symbol exists in the middle of, writes down simultaneously the center value of each cluster centre.
Priority Applications (1)
Application Number | Priority Date | Filing Date | Title |
---|---|---|---|
CN 201010134143 CN101807256B (en) | 2010-03-29 | 2010-03-29 | Object identification detection method based on multiresolution frame |
Applications Claiming Priority (1)
Application Number | Priority Date | Filing Date | Title |
---|---|---|---|
CN 201010134143 CN101807256B (en) | 2010-03-29 | 2010-03-29 | Object identification detection method based on multiresolution frame |
Publications (2)
Publication Number | Publication Date |
---|---|
CN101807256A CN101807256A (en) | 2010-08-18 |
CN101807256B true CN101807256B (en) | 2013-03-20 |
Family
ID=42609044
Family Applications (1)
Application Number | Title | Priority Date | Filing Date |
---|---|---|---|
CN 201010134143 Expired - Fee Related CN101807256B (en) | 2010-03-29 | 2010-03-29 | Object identification detection method based on multiresolution frame |
Country Status (1)
Country | Link |
---|---|
CN (1) | CN101807256B (en) |
Families Citing this family (17)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
CN102622607B (en) * | 2012-02-24 | 2013-09-25 | 河海大学 | Remote sensing image classification method based on multi-feature fusion |
CN102708380B (en) * | 2012-05-08 | 2014-02-12 | 东南大学 | Indoor common object identification method based on machine vision |
CN102750708B (en) * | 2012-05-11 | 2014-10-15 | 天津大学 | Affine motion target tracing algorithm based on fast robust feature matching |
CN103186795B (en) * | 2013-03-28 | 2016-06-29 | 浙江大学 | Writing brush word style of calligraphy recognition methods based on similarity sensitive coding |
CN103310208B (en) * | 2013-07-10 | 2016-05-11 | 西安电子科技大学 | The distinctive human face posture recognition methods of describing based on local geometric vision phrase |
CN103680145B (en) * | 2013-12-06 | 2016-01-20 | 河海大学 | A kind of people's car automatic identifying method based on local image characteristics |
CN103745200B (en) * | 2014-01-02 | 2017-01-04 | 哈尔滨工程大学 | A kind of facial image recognition method based on word bag model |
CN104239897B (en) * | 2014-09-04 | 2017-05-17 | 天津大学 | Visual feature representing method based on autoencoder word bag |
CN104766096B (en) * | 2015-04-17 | 2017-11-10 | 南京大学 | A kind of image classification method based on multiple dimensioned global characteristics and local feature |
CN104899611B (en) * | 2015-06-26 | 2018-09-25 | 小米科技有限责任公司 | Determine the method and device of card position in image |
CN106355188B (en) * | 2015-07-13 | 2020-01-21 | 阿里巴巴集团控股有限公司 | Image detection method and device |
CN107316001A (en) * | 2017-05-31 | 2017-11-03 | 天津大学 | Small and intensive method for traffic sign detection in a kind of automatic Pilot scene |
CN108805063A (en) * | 2018-05-31 | 2018-11-13 | 王红军 | A kind of multiresolution visual perception method to object and environment |
CN108921875B (en) * | 2018-07-09 | 2021-08-17 | 哈尔滨工业大学(深圳) | Real-time traffic flow detection and tracking method based on aerial photography data |
CN109255789B (en) * | 2018-07-26 | 2020-08-25 | 上海广播电视信息网络有限公司 | Image segmentation system based on computer processing |
CN111914795B (en) * | 2020-08-17 | 2022-05-27 | 四川大学 | Method for detecting rotating target in aerial image |
JP2023119326A (en) * | 2022-02-16 | 2023-08-28 | Tvs Regza株式会社 | Video image analysis apparatus and video image analysis method |
Citations (3)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
CN101061510A (en) * | 2004-11-19 | 2007-10-24 | 皇家飞利浦电子股份有限公司 | False positive reduction in computer-assisted detection (CAD) |
CN101226590A (en) * | 2008-01-31 | 2008-07-23 | 湖南创合制造有限公司 | Method for recognizing human face |
CN101594314A (en) * | 2008-05-30 | 2009-12-02 | 电子科技大学 | A kind of spam image-recognizing method and device based on high-order autocorrelation characteristic |
Family Cites Families (2)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
US20090175514A1 (en) * | 2004-11-19 | 2009-07-09 | Koninklijke Philips Electronics, N.V. | Stratification method for overcoming unbalanced case numbers in computer-aided lung nodule false positive reduction |
TW200842733A (en) * | 2007-04-17 | 2008-11-01 | Univ Nat Chiao Tung | Object image detection method |
-
2010
- 2010-03-29 CN CN 201010134143 patent/CN101807256B/en not_active Expired - Fee Related
Patent Citations (3)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
CN101061510A (en) * | 2004-11-19 | 2007-10-24 | 皇家飞利浦电子股份有限公司 | False positive reduction in computer-assisted detection (CAD) |
CN101226590A (en) * | 2008-01-31 | 2008-07-23 | 湖南创合制造有限公司 | Method for recognizing human face |
CN101594314A (en) * | 2008-05-30 | 2009-12-02 | 电子科技大学 | A kind of spam image-recognizing method and device based on high-order autocorrelation characteristic |
Also Published As
Publication number | Publication date |
---|---|
CN101807256A (en) | 2010-08-18 |
Similar Documents
Publication | Publication Date | Title |
---|---|---|
CN101807256B (en) | Object identification detection method based on multiresolution frame | |
CN101976258B (en) | Video semantic extraction method by combining object segmentation and feature weighing | |
Mu et al. | Discriminative local binary patterns for human detection in personal album | |
CN102682287B (en) | Pedestrian detection method based on saliency information | |
Zhang et al. | Text extraction from natural scene image: A survey | |
Yi et al. | Text detection in natural scene images by stroke gabor words | |
CN103514456B (en) | Image classification method and device based on compressed sensing multi-core learning | |
CN101894262B (en) | Method and apparatus for classifying image | |
CN101763503B (en) | Face recognition method of attitude robust | |
CN102938065B (en) | Face feature extraction method and face identification method based on large-scale image data | |
Yap et al. | A comparative study of mobile-based landmark recognition techniques | |
CN104951793B (en) | A kind of Human bodys' response method based on STDF features | |
CN101930549B (en) | Second generation curvelet transform-based static human detection method | |
CN111126240B (en) | Three-channel feature fusion face recognition method | |
Huang et al. | Isolated handwritten Pashto character recognition using a K-NN classification tool based on zoning and HOG feature extraction techniques | |
Arandjelovic | Crowd detection from still images | |
Bai et al. | Scene text localization using gradient local correlation | |
CN102693427A (en) | Method and device for forming detector for detecting images | |
CN102346851B (en) | Image segmentation method based on NJW (Ng-Jordan-Weiss) spectral clustering mark | |
CN106326916A (en) | Multi-scale feature estimation and high-order BING feature-based target detection method | |
CN103345621A (en) | Face classification method based on sparse concentration index | |
CN103971096A (en) | Multi-pose face recognition method based on MB-LBP features and face energy diagram | |
CN103605993A (en) | Image-to-video face identification method based on distinguish analysis oriented to scenes | |
CN107103289B (en) | Method and system for handwriting identification by using handwriting outline characteristics | |
Zheng et al. | Weight-based sparse coding for multi-shot person re-identification |
Legal Events
Date | Code | Title | Description |
---|---|---|---|
C06 | Publication | ||
PB01 | Publication | ||
C10 | Entry into substantive examination | ||
SE01 | Entry into force of request for substantive examination | ||
C14 | Grant of patent or utility model | ||
GR01 | Patent grant | ||
CF01 | Termination of patent right due to non-payment of annual fee | ||
CF01 | Termination of patent right due to non-payment of annual fee |
Granted publication date: 20130320 |