CN102810159A - Human body detecting method based on SURF (Speed Up Robust Feature) efficient matching kernel - Google Patents

Human body detecting method based on SURF (Speed Up Robust Feature) efficient matching kernel

Info

Publication number
CN102810159A
Authority
CN
China
Prior art keywords
image
human body
surf
window
training
Prior art date
Legal status (The legal status is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the status listed.)
Granted
Application number
CN2012101965262A
Other languages
Chinese (zh)
Other versions
CN102810159B (en)
Inventor
韩红
王瑞
谢福强
李晓君
顾建银
张红蕾
韩启强
刘三军
郭玉言
甘露
Current Assignee (The listed assignees may be inaccurate. Google has not performed a legal analysis and makes no representation or warranty as to the accuracy of the list.)
Xidian University
Original Assignee
Xidian University
Priority date (The priority date is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the date listed.)
Filing date
Publication date
Application filed by Xidian University
Priority to CN201210196526.2A
Publication of CN102810159A
Application granted
Publication of CN102810159B
Expired - Fee Related
Anticipated expiration

Landscapes

  • Image Analysis (AREA)

Abstract

The invention provides a human body detection method based on a SURF efficient matching kernel, which mainly addresses the inability of existing methods to handle cluttered image backgrounds well. The method comprises the following steps: negative samples are obtained by bootstrapping from the INRIA (Institut National de Recherche en Informatique et en Automatique) database and combined with the positive samples in the database to form the whole human body training sample set; SURF descriptor feature points are extracted from the training samples at different image scales; feature points are drawn by random sampling to form the initial vector basis of a visual vocabulary; constrained singular value decomposition is applied to the initial vector basis to obtain the maximum kernel function features; the maximum kernel function features at the different image scales are weighted to obtain the features over all image scales; an SVM (support vector machine) classifier is trained on the resulting features to obtain a detection classifier; and the image to be detected is fed to the classifier to obtain the final detection result. The method detects human bodies accurately and can be used for intelligent monitoring, driver assistance systems and virtual video.

Description

Human body detection method based on a SURF efficient matching kernel
Technical field
The invention belongs to the technical field of image processing and relates to a method for detecting human bodies in static images. It can be used for intelligent monitoring, driver assistance systems, human motion capture, pornographic image filtering and virtual video.
Background art
Human detection is a technology with very broad application prospects in computer vision and has promising applications in many fields. However, the diversity of human poses, cluttered backgrounds, clothing texture, illumination conditions, self-occlusion and other factors make human detection a very difficult problem. At present, human detection methods mainly fall into three groups: methods based on motion characteristics, methods based on human body models, and methods based on statistical classification.
Methods based on motion characteristics exploit the periodic change of posture and the symmetry of the human body during steady motion. A self-similarity matrix is constructed in the time domain, and the periodic motion of the human body is used to distinguish it from the motion of other objects, so that moving humans can be detected. However, the algorithmic complexity of this approach is high, and it places high demands on the stability of the human motion.
Methods based on human body models require an explicit human body model; human bodies are then recognized according to the parts of the model and the relations between them. Such methods can handle occlusion and can infer the pose of the human body. Their drawbacks are that the model is difficult to construct and complex to solve.
Methods based on statistical classification learn a classifier from a set of training data by machine learning, use the classifier to represent the human body, and then classify and recognize the input with it. Their advantages are that the detection results are stable and the performance is good; their drawbacks are that a large amount of training data is needed and that insufficient illumination and cluttered backgrounds are difficult to handle. Among these, the human detection method based on the SURF (Speeded-Up Robust Features) efficient matching kernel feeds the classifier an image feature built on a local image representation, which avoids the traditional difficulty with backgrounds and yields better human detection results.
Summary of the invention
The object of the invention is to address the above deficiencies of the prior art by proposing a human detection method based on a SURF efficient matching kernel, so as to reduce the complexity of image feature extraction, improve the representational power of the features, and effectively improve the accuracy of human detection.
The technical solution of the invention is realized through the following steps:
(1) Obtain negative samples from the INRIA (Institut National de Recherche en Informatique et en Automatique) database by a bootstrap operation, and combine them with the positive samples in the database to form the whole human body training sample set;
(2) Divide every training sample image into a grid of 8 × 8 pixels, sample each grid at image scales of 16 and 25 pixels, and extract the SURF descriptor feature points F of all training images;
(3) Randomly sample the SURF descriptor feature points F of all training images to obtain a 350-dimensional visual vocabulary for the whole training set, and form the initial basis vector R from this 350-dimensional visual vocabulary;
(4) Perform dictionary learning on the initial basis vector R using constrained kernel singular value decomposition (CKSVD) to obtain the maximum kernel function feature r;
(5) Suppress similar maximum kernel function features r by maximum eigenvalue extraction, extract the kernel function feature values in descending order, and delete the elements equal to the maximum value to obtain the feature vector G; compute the weighted sum of the image features G at the different image scales to obtain the feature G′ over all image scales:
$G' = G \times A_l$
where $A_l$ is the weight of the l-th image scale, l = 1, 2,
[formula given as an image in the original: Figure BDA00001765522400021]
$w_l = 1/p_l$, and p is the pixel size of the image scale at which the SURF feature points are extracted, p = {16, 25};
(6) Store the features G′ over all image scales, select from G′ the low-dimensional features h that approximately follow a Gaussian distribution, and use them as the SURF efficient matching kernel feature X of the final image;
(7) Train a support vector machine (SVM) classifier on the resulting SURF efficient matching kernel features X to obtain the final classifier used for detection;
(8) Input the image to be detected and use the obtained classifier to determine the final detection result.
Compared with the prior art, the invention has the following advantages:
1. Because the invention uses SURF efficient matching kernel image features, it avoids the blurring produced by traditional edge-based and contour-based image representations and obtains better human detection results.
2. Because the extracted image features have lower dimensionality than those of traditional image description methods, the invention effectively reduces feature extraction time and the amount of computation.
3. Because it is a human detection method based on local visual feature information, the invention obtains better results when processing images with cluttered backgrounds.
Description of drawings
Fig. 1 is a flow chart of the invention;
Fig. 2 shows some of the positive sample images used in the invention;
Fig. 3 shows some of the negative sample images used in the invention;
Fig. 4 compares the detection performance of the invention with that of an existing method;
Fig. 5 shows the results of detecting human bodies in images with the invention.
Embodiment
With reference to Fig. 1, the specific implementation steps of the invention are as follows:
Step 1: Obtain a large number of negative sample images from the INRIA (Institut National de Recherche en Informatique et en Automatique) database by a bootstrap operation, and combine them with the positive sample images in the database to form the training sample set. The positive sample images are shown in Fig. 2 and the negative sample images in Fig. 3.
Step 2: Extract the SURF descriptor feature points F of the training sample set.
2a) Divide the j-th training image into a grid of 8 × 8 pixels and sample each grid at image scales of 16 and 25 pixels to obtain the SURF (Speeded-Up Robust Features) descriptor feature points $F_j$ of the j-th training image;
2b) Following step 2a), extract the SURF descriptor feature points F of all training images, where $F = \{F_1, \ldots, F_j, \ldots, F_M\}$, j ∈ [1, M], and M is the number of training samples.
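The dense sampling in step 2 can be pictured with a small sketch. The text does not spell out how the 8 × 8 grid and the 16 and 25 pixel scales interact, so the code below assumes grid nodes spaced 8 pixels apart with square patches of side 16 and 25 centred on each node, and keeps only patches lying fully inside the image. The function name `grid_patches` and its parameters are illustrative, and the SURF descriptor computation itself is not shown.

```python
def grid_patches(height, width, step=8, scales=(16, 25)):
    """Return (x, y, size) square patches centred on a regular grid.

    Assumption: grid nodes are `step` pixels apart and a patch of side
    `size` is centred on each node; patches crossing the border are dropped.
    """
    patches = []
    for size in scales:
        half = size / 2.0
        for y in range(0, height, step):
            for x in range(0, width, step):
                inside = (x - half >= 0 and y - half >= 0 and
                          x + half <= width and y + half <= height)
                if inside:
                    patches.append((x, y, size))
    return patches


# A 128 x 64 training window (height 128, width 64), as used in the experiments.
patches = grid_patches(128, 64)
```

Each returned patch would then be handed to a SURF descriptor extractor to produce one feature point of $F_j$.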
Step 3: Obtain the initial basis vector R of the visual vocabulary.
3a) For each training sample image, on the 8 × 8 image grid and at the 16 and 25 pixel scales, randomly sample 15 of the SURF feature points obtained in step 2, denoted by the symbol given as an image in the original (Figure BDA00001765522400031), where i denotes the i-th training image;
3b) Repeat step 3a) for all training samples and denote the randomly extracted SURF feature points by F′. Cluster the similar SURF feature points in F′ with the k-means clustering method using 350 cluster centres to obtain the 350-dimensional visual vocabulary of the whole training set, which forms the initial basis vector R of the visual vocabulary.
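As a rough illustration of how step 3b) turns sampled descriptors into a visual vocabulary, here is a minimal k-means sketch in plain Python. It is not the patented implementation: the patent uses 350 centres on SURF descriptors, whereas this toy example clusters eight 2-D points into two centres, and the function name `kmeans`, the fixed iteration count and the random seed are assumptions made for the example.

```python
import random


def kmeans(points, k, iters=20, seed=0):
    """Plain Lloyd's k-means; returns k cluster centres (the 'visual words')."""
    rng = random.Random(seed)
    centres = rng.sample(points, k)  # initialise from k distinct input points
    for _ in range(iters):
        clusters = [[] for _ in range(k)]
        for p in points:
            # Assign each point to its nearest centre (squared Euclidean distance).
            nearest = min(
                range(k),
                key=lambda c: sum((a - b) ** 2 for a, b in zip(p, centres[c])),
            )
            clusters[nearest].append(p)
        for j, members in enumerate(clusters):
            if members:  # keep the old centre if a cluster is empty
                centres[j] = tuple(sum(col) / len(members) for col in zip(*members))
    return centres


# Toy stand-in for the sampled descriptors F': two well-separated groups.
toy = [(0, 0), (0, 1), (1, 0), (1, 1), (10, 10), (10, 11), (11, 10), (11, 11)]
vocabulary = sorted(kmeans(toy, k=2))
```

With real data, `points` would hold the randomly sampled SURF descriptors from all training images and `k` would be 350.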
Step 4: Obtain the maximum kernel function feature vector r of the initial basis vector R.
4a) Project the initial basis vector R onto a 350-dimensional space using the projection coefficients v to obtain the projection R′ of R:
$R' = R v,$
$v = [v_1, \ldots, v_i, \ldots, v_N],$
$v_i = (R^{T} R)^{-1} (R^{T} r_i), \quad i \in [1, N],$
where $r_i$ is the maximum kernel feature of the i-th feature point extracted from an image, $v_i$ is the low-dimensional projection coefficient of the i-th feature point extracted from an image, and N is the number of feature points randomly selected from an image;
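The coefficient formula $v_i = (R^{T} R)^{-1} (R^{T} r_i)$ in step 4a) is the ordinary least-squares solution for expressing $r_i$ in the basis R. The sketch below evaluates it for a toy basis with only two columns, so the 2 × 2 inverse can be written out by hand; the function name `projection_coefficients` and the two-column restriction are assumptions for illustration, and a real basis with 350 columns would need a general linear solver.

```python
def projection_coefficients(R, r):
    """Least-squares coefficients v = (R^T R)^{-1} R^T r for a two-column basis R.

    R is a list of rows [R_k0, R_k1]; r is a list with one entry per row.
    """
    # Entries of the symmetric Gram matrix R^T R = [[a, b], [b, d]].
    a = sum(row[0] * row[0] for row in R)
    b = sum(row[0] * row[1] for row in R)
    d = sum(row[1] * row[1] for row in R)
    # Right-hand side R^T r.
    g0 = sum(row[0] * x for row, x in zip(R, r))
    g1 = sum(row[1] * x for row, x in zip(R, r))
    det = a * d - b * b
    if det == 0:
        raise ValueError("basis columns are linearly dependent")
    # Apply the closed-form inverse of the 2 x 2 Gram matrix.
    return ((d * g0 - b * g1) / det, (a * g1 - b * g0) / det)


R = [[1, 0], [0, 1], [0, 0]]   # basis spanning the first two coordinates
r_i = [2, 3, 5]                # feature with a component outside that span
v_i = projection_coefficients(R, r_i)
```

Here the projection $R v_i$ equals [2, 3, 0]: the part of `r_i` outside the span of R is discarded.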
4b) In the projection space, approximate the projection R′ of the initial basis vector R by the maximum kernel function feature vector r to obtain the approximation function f(r):
$f(r) = \arg\min \| r - R' \|,$
Substituting R′ = Rv into the above gives:
$f(r) = \arg\min \| r - R v \|,$
where ‖·‖ denotes the 2-norm and arg min ‖·‖ denotes minimization;
4c) Expand v and r in f(r) = arg min ‖r - Rv‖ to obtain the quadratic approximation function f(v, r) of the maximum kernel function feature vector r with respect to the initial basis vector R:
$f(v, r) = \frac{1}{N} \sum_{i=1}^{N} \| r_i - R v_i \|^{2},$
where $r = [r_1, \ldots, r_i, \ldots, r_N]$ denotes the maximum kernel function feature vector;
4d) Solve the approximation function f(v, r) by stochastic gradient descent to obtain the maximum kernel function feature vector r.
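The link between the objective in 4c) and the coefficient formula in 4a) can be made explicit with a short derivation. This is a sketch that treats R and $r_i$ as fixed and differentiates only with respect to $v_i$; the step size $\eta$ is an assumed symbol that does not appear in the original text.

```latex
\begin{align*}
f(v, r) &= \frac{1}{N} \sum_{i=1}^{N} \| r_i - R v_i \|^{2}, \\
\nabla_{v_i} f &= -\frac{2}{N}\, R^{T} \left( r_i - R v_i \right), \\
\nabla_{v_i} f = 0 \;&\Longrightarrow\; R^{T} R\, v_i = R^{T} r_i
  \;\Longrightarrow\; v_i = (R^{T} R)^{-1} R^{T} r_i, \\
\text{stochastic step for a sampled } i:\quad
v_i &\leftarrow v_i + 2\eta\, R^{T} \left( r_i - R v_i \right).
\end{align*}
```

The stationary point reproduces the projection coefficients of step 4a), so iterating the stochastic step on randomly chosen feature points moves each $v_i$ toward that least-squares solution.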
Step 5: Sort the elements of the maximum kernel function feature vector r in descending order and delete the elements equal to the maximum value to obtain the feature vector G; compute the weighted sum of the feature vectors G at the different image scales to obtain the image feature G′ over all image scales:
$G' = G \times A_l$
where $A_l$ is the weight of the l-th image scale, l = 1, 2,
[formula given as an image in the original: Figure BDA00001765522400042]
$w_l = 1/p_l$, and p is the pixel size of the image scale at which the SURF descriptor feature points are extracted, p = {16, 25}.
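The scale weighting in step 5 can be sketched numerically. The definition of $A_l$ survives only as an image reference in this text, so the code below assumes that $A_l$ is $w_l$ normalized so the weights sum to one; that normalization, along with the function names `scale_weights` and `combine_scales`, is an assumption made for illustration rather than something the text confirms.

```python
def scale_weights(pixel_sizes=(16, 25)):
    """Weights A_l from w_l = 1 / p_l, normalized to sum to 1 (assumed form)."""
    raw = [1.0 / p for p in pixel_sizes]
    total = sum(raw)
    return [w / total for w in raw]


def combine_scales(features_per_scale, weights):
    """Element-wise weighted sum of one feature vector per image scale."""
    length = len(features_per_scale[0])
    return [
        sum(w * feats[i] for w, feats in zip(weights, features_per_scale))
        for i in range(length)
    ]


A = scale_weights()                       # roughly [0.61, 0.39]
G_prime = combine_scales([[1.0, 2.0], [3.0, 4.0]], A)
```

Under this assumption the finer 16 pixel scale receives the larger weight, because $w_l$ is inversely proportional to the patch size.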
Step 6: Store the image features G′ over all image scales, select from G′ the low-dimensional features h that approximately follow a Gaussian distribution, and use them as the SURF efficient matching kernel feature X of the final image.
Step 7: Train a support vector machine (SVM) classifier on the SURF efficient matching kernel features X to obtain the final classifier for human detection.
Step 8: Use the obtained human detection classifier to determine the final detection result.
(8a) Input the image to be detected. Take the 128 × 64 pixel region in the upper-left corner of the image as the first scanning window, and take each translation by 8 pixels to the right or 16 pixels downward as a new scanning window. Feed the resulting set of scanning windows to the classifier obtained in step 7 to obtain the classifier score of each scanning window;
(8b) Judge from the classifier scores of the scanning windows whether the image under test contains a human body. If the classifier reports scanning windows containing a human body, select among them the scanning window with the highest classifier score as the main window;
(8c) Test the main window against the other human body windows for combination: when another human body window lies around the main window and overlaps it by more than 1/2, combine that window with the main window to obtain the combined human body window;
(8d) Keep the combined human body window and delete the main window and all human body windows that took part in the combination;
(8e) If human body windows remain, again select the one with the highest classifier score as the main window and repeat steps (8b)-(8d);
(8f) Mark all detection results on the image under test as its final human detection result; each detection is represented by a rectangular box that encloses the detected human body.
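Steps (8a) to (8e) amount to a sliding-window scan followed by a greedy merge of overlapping detections. The sketch below is one possible reading, not the patented code. It assumes boxes are (x, y, width, height) tuples, measures overlap as intersection area divided by the smaller box area, and combines a group of windows by averaging their coordinates; the text specifies neither the overlap measure nor the combination rule, so both are assumptions, as are the function names.

```python
def scan_windows(img_h, img_w, win_h=128, win_w=64, step_x=8, step_y=16):
    """Step (8a): 128 x 64 windows shifted 8 px right and 16 px down."""
    return [(x, y, win_w, win_h)
            for y in range(0, img_h - win_h + 1, step_y)
            for x in range(0, img_w - win_w + 1, step_x)]


def overlap_ratio(a, b):
    """Intersection area over the smaller box area (assumed overlap measure)."""
    ax, ay, aw, ah = a
    bx, by, bw, bh = b
    iw = min(ax + aw, bx + bw) - max(ax, bx)
    ih = min(ay + ah, by + bh) - max(ay, by)
    if iw <= 0 or ih <= 0:
        return 0.0
    return (iw * ih) / min(aw * ah, bw * bh)


def merge_detections(boxes, scores, min_overlap=0.5):
    """Steps (8b)-(8e): repeatedly take the best window and absorb its neighbours."""
    remaining = sorted(zip(scores, boxes), reverse=True)
    merged = []
    while remaining:
        _, main = remaining.pop(0)          # highest remaining score is the main window
        group, rest = [main], []
        for score, box in remaining:
            if overlap_ratio(main, box) > min_overlap:
                group.append(box)           # combined with the main window
            else:
                rest.append((score, box))   # left for a later round
        remaining = rest
        # Assumed combination rule: average the coordinates of the group.
        merged.append(tuple(sum(b[i] for b in group) / len(group) for i in range(4)))
    return merged


windows = scan_windows(160, 80)
final = merge_detections(
    [(0, 0, 64, 128), (8, 0, 64, 128), (200, 0, 64, 128)],
    [0.9, 0.8, 0.7],
)
```

In the example, the first two boxes overlap by 0.875 and are merged into one detection, while the third stays separate. In a full pipeline, only windows that the SVM labels as human would be passed to `merge_detections`.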
The effect of the invention is verified by the following simulation experiments:
1) Simulation conditions: The simulation experiments were implemented in Matlab 2009a on an HP workstation running Windows. The positive and negative samples required for the experiments were all taken from the INRIA (Institut National de Recherche en Informatique et en Automatique) database. 2416 positive samples and 13500 negative samples were used as the training set, and 1132 positive samples and 4050 negative samples as the test set. All positive and negative sample images are 128 × 64 pixels. Fig. 2 shows some of the positive sample images and Fig. 3 some of the negative sample images.
2) Simulation content and analysis of results
Simulation 1: Image features were classified with the invention and with an existing method; the classification performance is shown in Fig. 4. In Fig. 4, the upper curve is the classification performance curve of the invention and the lower curve is that of the existing method. Fig. 4 shows that the classification performance of the invention is higher than that of the existing method.
Simulation 2: The method of the invention and the existing method were each used to detect human bodies in the same image from the Massachusetts Institute of Technology (MIT) database; the detection results are shown in Fig. 5. Fig. 5(a) shows the result of the existing method before window merging, Fig. 5(b) the final detection result of the existing method, Fig. 5(c) the result of the present method before window merging, and Fig. 5(d) the final detection result of the present method. Fig. 5 shows that the method of the invention has higher human detection accuracy.
In summary, the invention reduces the complexity of image feature extraction while improving the expressive power of the features, which makes the method well suited to human detection in still images; compared with the existing method, it also greatly reduces the false alarm rate of human detection.

Claims (4)

1. A human body detection method based on a SURF efficient matching kernel, comprising the following steps:
(1) obtaining negative samples from the INRIA (Institut National de Recherche en Informatique et en Automatique) database by a bootstrap operation, and combining them with the positive samples in the database to form the whole human body training sample set;
(2) dividing every training sample image into a grid of 8 × 8 pixels, sampling each grid at image scales of 16 and 25 pixels, and extracting the SURF (Speeded-Up Robust Features) descriptor feature points F of all training images;
(3) randomly sampling the SURF descriptor feature points F of all training images to obtain a 350-dimensional visual vocabulary for the whole training set, and forming the initial basis vector R from this 350-dimensional visual vocabulary;
(4) performing dictionary learning on the initial basis vector R using constrained kernel singular value decomposition (CKSVD) to obtain the maximum kernel function feature r;
(5) suppressing similar maximum kernel function features r by maximum eigenvalue extraction, extracting the kernel function feature values in descending order, and deleting the elements equal to the maximum value to obtain the feature vector G; computing the weighted sum of the image features G at the different image scales to obtain the feature G′ over all image scales:
$G' = G \times A_l$
where $A_l$ is the weight of the l-th image scale, l = 1, 2,
[formula given as an image in the original: Figure FDA00001765522300011]
$w_l = 1/p_l$, and p is the pixel size of the image scale at which the SURF feature points are extracted, p = {16, 25};
(6) storing the features G′ over all image scales, selecting from G′ the low-dimensional features h that approximately follow a Gaussian distribution, and using them as the SURF efficient matching kernel feature X of the final image;
(7) training a support vector machine (SVM) classifier on the resulting SURF efficient matching kernel features X to obtain the final classifier used for detection;
(8) inputting the image to be detected and using the obtained classifier to determine the final detection result.
2. The method according to claim 1, wherein the extraction of the SURF descriptor feature points F of all training images in step (2) is carried out as follows:
2a) dividing the j-th training image into a grid of 8 × 8 pixels and sampling each grid at image scales of 16 and 25 pixels to obtain the SURF (Speeded-Up Robust Features) feature points $F_j$ of the j-th training image;
2b) following step 2a), extracting the SURF descriptor feature points F of all training images, where $F = \{F_1, \ldots, F_j, \ldots, F_M\}$, j ∈ [1, M], and M is the number of training samples.
3. The method according to claim 1, wherein obtaining the 350-dimensional visual vocabulary of the whole training set in step (3) is carried out as follows:
3a) for each training sample image, on the 8 × 8 image grid and at the 16 and 25 pixel scales, randomly sampling 15 of the SURF feature points obtained in step (2), denoted by the symbol given as an image in the original (Figure FDA00001765522300021), where i denotes the i-th training image;
3b) repeating step 3a) to extract the SURF feature points of all training samples, denoted F′, defining 350 cluster centres, and clustering the similar SURF feature points in F′ with the k-means clustering method to obtain the 350-dimensional visual vocabulary of the whole training set.
4. The method according to claim 1, wherein inputting the image to be detected and using the obtained classifier to determine the final detection result in step (8) is carried out as follows:
(8a) inputting the image to be detected, taking the 128 × 64 pixel region in the upper-left corner of the image as the first scanning window, taking each translation by 8 pixels to the right or 16 pixels downward as a new scanning window, and feeding the resulting set of scanning windows to the classifier obtained in step (7) to obtain the classifier score of each scanning window;
(8b) judging from the classifier scores of the scanning windows whether the image under test contains a human body, and, if the classifier reports scanning windows containing a human body, selecting among them the scanning window with the highest classifier score as the main window;
(8c) testing the main window against the other human body windows for combination: when another human body window lies around the main window and overlaps it by more than 1/2, combining that window with the main window to obtain the combined human body window;
(8d) keeping the combined human body window and deleting the main window and all human body windows that took part in the combination;
(8e) if human body windows remain, again selecting the one with the highest classifier score as the main window and repeating steps (8b)-(8d);
(8f) marking all detection results on the image under test as its final human detection result, each detection being represented by a rectangular box that encloses the detected human body.
CN201210196526.2A 2012-06-14 2012-06-14 Human body detecting method based on SURF (Speed Up Robust Feature) efficient matching kernel Expired - Fee Related CN102810159B (en)

Priority Applications (1)

Application Number Priority Date Filing Date Title
CN201210196526.2A CN102810159B (en) 2012-06-14 2012-06-14 Human body detecting method based on SURF (Speed Up Robust Feature) efficient matching kernel

Publications (2)

Publication Number Publication Date
CN102810159A (en) 2012-12-05
CN102810159B (en) 2014-10-29

Family

ID=47233863

Family Applications (1)

Application Number Title Priority Date Filing Date
CN201210196526.2A Expired - Fee Related CN102810159B (en) 2012-06-14 2012-06-14 Human body detecting method based on SURF (Speed Up Robust Feature) efficient matching kernel

Country Status (1)

Country Link
CN (1) CN102810159B (en)

Citations (3)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
US20070237387A1 (en) * 2006-04-11 2007-10-11 Shmuel Avidan Method for detecting humans in images
CN101561867A (en) * 2009-05-19 2009-10-21 华中科技大学 Human body detection method based on Gauss shape feature
CN101930549A (en) * 2010-08-20 2010-12-29 西安电子科技大学 Second generation curvelet transform-based static human detection method

Cited By (5)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN103455826A (en) * 2013-09-08 2013-12-18 西安电子科技大学 Efficient matching kernel body detection method based on rapid robustness characteristics
CN106062715A (en) * 2013-12-24 2016-10-26 派尔高公司 Method and apparatus for intelligent video pruning
CN106062715B (en) * 2013-12-24 2019-06-04 派尔高公司 The method and apparatus deleted for intelligent video
CN106980864A (en) * 2017-03-31 2017-07-25 合肥工业大学 A kind of pedestrian's recognition methods again based on support sample indirect type
CN106980864B (en) * 2017-03-31 2019-07-19 合肥工业大学 A kind of pedestrian's recognition methods again based on support sample indirect type

Also Published As

Publication number Publication date
CN102810159B (en) 2014-10-29

Legal Events

Date Code Title Description
C06 Publication
PB01 Publication
C10 Entry into substantive examination
SE01 Entry into force of request for substantive examination
C14 Grant of patent or utility model
GR01 Patent grant
CF01 Termination of patent right due to non-payment of annual fee

Granted publication date: 20141029

Termination date: 20190614