CN108460336A - A kind of pedestrian detection method based on deep learning - Google Patents

A kind of pedestrian detection method based on deep learning Download PDF

Info

Publication number
CN108460336A
CN108460336A CN201810082310.0A CN201810082310A CN108460336A CN 108460336 A CN108460336 A CN 108460336A CN 201810082310 A CN201810082310 A CN 201810082310A CN 108460336 A CN108460336 A CN 108460336A
Authority
CN
China
Prior art keywords
pedestrian
layers
deep learning
candidate person
pedestrian detection
Prior art date
Legal status (The legal status is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the status listed.)
Pending
Application number
CN201810082310.0A
Other languages
Chinese (zh)
Inventor
孙炜程
朱松豪
荆晓远
代心灵
Current Assignee (The listed assignees may be inaccurate. Google has not performed a legal analysis and makes no representation or warranty as to the accuracy of the list.)
Nanjing Post and Telecommunication University
Nanjing University of Posts and Telecommunications
Original Assignee
Nanjing Post and Telecommunication University
Priority date (The priority date is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the date listed.)
Filing date
Publication date
Application filed by Nanjing Post and Telecommunication University filed Critical Nanjing Post and Telecommunication University
Priority to CN201810082310.0A priority Critical patent/CN108460336A/en
Publication of CN108460336A publication Critical patent/CN108460336A/en
Pending legal-status Critical Current

Links

Classifications

    • GPHYSICS
    • G06COMPUTING; CALCULATING OR COUNTING
    • G06VIMAGE OR VIDEO RECOGNITION OR UNDERSTANDING
    • G06V20/00Scenes; Scene-specific elements
    • G06V20/50Context or environment of the image
    • G06V20/56Context or environment of the image exterior to a vehicle by using sensors mounted on the vehicle
    • G06V20/58Recognition of moving objects or obstacles, e.g. vehicles or pedestrians; Recognition of traffic objects, e.g. traffic signs, traffic lights or roads
    • GPHYSICS
    • G06COMPUTING; CALCULATING OR COUNTING
    • G06FELECTRIC DIGITAL DATA PROCESSING
    • G06F18/00Pattern recognition
    • G06F18/20Analysing
    • G06F18/21Design or setup of recognition systems or techniques; Extraction of features in feature space; Blind source separation
    • G06F18/214Generating training patterns; Bootstrap methods, e.g. bagging or boosting
    • GPHYSICS
    • G06COMPUTING; CALCULATING OR COUNTING
    • G06NCOMPUTING ARRANGEMENTS BASED ON SPECIFIC COMPUTATIONAL MODELS
    • G06N3/00Computing arrangements based on biological models
    • G06N3/02Neural networks
    • G06N3/04Architecture, e.g. interconnection topology
    • G06N3/045Combinations of networks
    • GPHYSICS
    • G06COMPUTING; CALCULATING OR COUNTING
    • G06VIMAGE OR VIDEO RECOGNITION OR UNDERSTANDING
    • G06V40/00Recognition of biometric, human-related or animal-related patterns in image or video data
    • G06V40/10Human or animal bodies, e.g. vehicle occupants or pedestrians; Body parts, e.g. hands

Abstract

The present invention relates to a kind of pedestrian detection methods based on deep learning, and this approach includes the following steps:Video image input extraction network to be detected is generated into characteristic pattern first, then, the characteristic pattern input area that network generates will be extracted and propose network, propose that method detection is most likely to be the region of pedestrian using region, pedestrian candidate person and the corresponding score of pedestrian candidate person are generated, finally determines whether pedestrian candidate person is real pedestrian using trained decision Tree algorithms.It is an advantage of the invention that calculating simply, quickly, the accuracy rate of pedestrian detection can be significantly improved.

Description

A kind of pedestrian detection method based on deep learning
Technical field
The present invention relates to a kind of pedestrian detection method, especially a kind of pedestrian detection method based on deep learning belongs to Technical field of image processing.
Background technology
It is reported that pedestrian detection is an important subject of computer vision field.The purpose of pedestrian detection be for Accurately identify and position position of the pedestrian in image or video sequence.Currently, pedestrian detection vehicle DAS (Driver Assistant System), It is widely used in intelligent video monitoring and intelligent transportation.
Traditional pedestrian detection method is also referred to as the model of hand-designed, is the spy that pedestrian is indicated based on low-level image feature Sign, such as HOG features, Haar features, LBP features, LUV features, ICF features, Squares ChnFtrs features and LDCF are special Sign etc..At present mostly pedestrian detection is solved the problems, such as grader using support vector machines or decision tree.However, traditional row People's detection method needs the feature of engineer complex, needs a large amount of professional knowledge and there are one in terms of robustness Fixed limitation.
With the development of deep learning, based on the pedestrian detection method of deep learning in the case where not considering to calculate cost Achieve huge success.In general, the pedestrian detection method based on deep learning can be divided into two classes:One kind is to be based on area The pedestrian detection method that domain is suggested, such as R-CNN methods, SPP-Net methods, Faster R-CNN methods and R-FCN methods; Another kind of is the pedestrian detection method for being not based on region suggestion, such as YOLO methods and SSD methods.Although being not based on region to build The pedestrian detection method of view has some advantages in calculating speed, but this method cannot obtain very high precision.Therefore, right For most of pedestrian detection methods based on deep learning, the plan of pedestrian's candidate is often generated using region suggestion Slightly.
In addition, as deep learning is in the extensive use of pedestrian's detection field, convolutional neural networks are widely used in Pedestrian detection, such as AlexNet networks, VGG networks, ZF networks, Fast-RCNN networks, Faster-RCNN networks, R-CNN nets Network, MS-CNN networks and R-FCN networks.In R-CNN, Fast-RCNN and Faster R-CNN this series of method, Propose that strategy is used to improve target detection accuracy rate and calculating speed in region.For MS-CNN methods, one multiple dimensioned Propose that network is used to improve the accuracy of detection Small object in region.For R-FCN methods, carried using full convolutional network and region View network is combined to carry out pedestrian detection.Compared with Faster-RCNN methods, R-FCN methods substantially increase calculating speed And slightly improve the accuracy rate of pedestrian detection.Although the target detection technique development based on deep learning is very rapid, Either in terms of accuracy rate or speed, the method for pedestrian detection is still significantly improved space.
Invention content
It is an object of the invention to:In view of the defects existing in the prior art, a kind of new row based on study depth is proposed People's detection method, accuracy and rapidity for improving pedestrian detection.
In order to reach object above, the present invention provides a kind of pedestrian detection method based on deep learning, including it is following Step:
Video image input extraction network to be detected is generated characteristic pattern by the first step;
Second step will extract the characteristic pattern input area proposal network that network generates, and region is recycled to propose that method detection most has It may be the region of pedestrian, generate pedestrian candidate person and the corresponding score of pedestrian candidate person;
Third step determines whether pedestrian candidate person is real pedestrian using trained decision Tree algorithms.
Video image is input in designed pedestrian detection model by the present invention first, then using based on deep learning PVANet networks generate characteristic pattern, propose that network generates pedestrian candidate person and corresponding score, last profit followed by region Classified to the pedestrian candidate person of generation to find out real pedestrian with trained decision Tree algorithms.
Preferably, the extraction network uses PVANet networks, the PVANet networks to have 14 layers, and wherein three first layers are Convolutional layer, centre are two groups of initial layers, and every group of initial layers include the identical initial layers of four structures, and last three layers are full connection Layer;The output of the full articulamentum is the input that network and decision tree classifier are proposed in region.
It is further preferred that the structure of all initial layers is all identical, single initial layers are by first, second, third point Zhi Zucheng, first branch are made of one 1 × 1 convolutional layer, and second branch is by one 1 × 1 convolutional layer and one A 3 × 3 convolutional layer composition, the third branch are made of one 1 × 1 convolutional layer and two 3 × 3 convolutional layers.
Still further preferably, the specific method is as follows for single initial layers generation characteristic pattern:The characteristic pattern that last layer generates It is passed to the first, second, third, etc. three branches of initial layers respectively, the characteristic pattern then exported by these three branches is transmitted To an articulamentum, next layer is finally entered, becomes next layer of input feature vector figure, is more precisely obtained by initial layers in this way The target of small scale.Last layer is that the next layer of convolutional layer of single initial layers either front is single initial layers or subsequent Full articulamentum.
Preferably, in network is proposed in the region, for the input feature vector figure that PVANet networks generate, by a m*m The sliding window of size is used and generates multiple features connected entirely on each width characteristic pattern, and the feature each connected entirely includes two Branch, one of branch are scs layers, another branch is cds layers.Single sliding window can predict simultaneously different scale and The region motion of different aspect ratios.For example, when the region motion of sliding window four scales of prediction and four length-width ratios, will produce 4*4 region motion.That is, sliding window generates 4*4*4 output at cds layers, 2*4*4 output is generated at scs layers. Described cds layers is used for generating pedestrian candidate person, including pedestrian candidate person(Predict target)The coordinate of central point and the pedestrian Candidate(Predict target)Width and height;Described scs layers is used for generating the corresponding score of pedestrian candidate person, that is, predicts mesh Target corresponding scores, predict the corresponding scores of target be predict target be the estimated probability of pedestrian and be not pedestrian probability; The pedestrian candidate person generated by cds layers score corresponding with the pedestrian candidate person generated by scs layers is transported to decision tree classification Device is trained and detects.
Cost is calculated in order to be reduced in the case where not influencing pedestrian detection precision, in the training process, by PVANet nets The convolution feature that network generates is used as region and proposes network and detect the input of network.
The decision tree of the present invention uses tree, wherein a characteristic attribute is sentenced in the expression of each non-leaf nodes It is disconnected, characteristic attribute of each branching representation pair judge as a result, each leaf node represents a classification.In order to determine Plan, the characteristic attribute for first having to treat class object since root node are tested, and are then selected according to test result corresponding Branch, finally repeat the process until reaching a leaf node.The classification of the leaf node of arrival is exactly to predict The classification of the target to be classified.
Although the mutation of decision Tree algorithms has very much, such as ID3, C5.0 and CART algorithms, their basic thought All it is identical, and the use of the accuracy rate that decision Tree algorithms are classified is very high.Decision tree classifier it is basic Thought is the multiple Weak Classifiers of training on the same training set, then these Weak Classifiers are combined into final strong classification Device.These Weak Classifiers are respectively there are one weighting parameter β, that is, the ratio of sample number that grader is correctly classified.Therefore it needs One threshold value is set to determine whether sample is correctly classified.Successive ignition will be carried out in the training process to decision tree, If the classification accuracy of some Weak Classifier is very low during an iteration, the performance of the Weak Classifier is also meaned that very Difference, then the parameter of the Weak Classifier will be reduced.
Specifically, training the method for decision tree as follows using RealBoost algorithms:
1. given training set,
(x1, y1)...(xi, yi)...(xN, yN
Wherein, yiIt is feature vector, and i=1 ..., N;
2. in the starting stage, Weak Classifier is numbered and number is denoted as j, according to(1)Formula determines each Weak Classifier Weight,
(1)
Wherein, WjFor the weight of Weak Classifier, H is the number of Weak Classifier;
3. carrying out n times to Weak Classifier to train to obtain training data, training is numbered and number is denoted as n, then basis (2)Formula obtains a probability Estimation,
(2)
Wherein, Pn(y)For the probabilistic estimated value of Weak Classifier, N is the frequency of training of Weak Classifier;
4. basis(3)Formula calculates the true Distribution value of Weak Classifier,
(3)
Wherein, fn(y)For the actual value of Weak Classifier, R is set of real numbers;
5. in the training process, according to(4)Formula obtains the weight of Weak Classifier,
(4)
6. after each iteration, the weight of all Weak Classifiers is normalized again so that total weight of all Weak Classifiers Equal to 1, strong classifier is finally obtained, according to(5)Formula obtains strong classifier,
(5)
Wherein, N is the frequency of training of Weak Classifier.
The method for determining real pedestrian is as follows:First with trained decision Tree algorithms to the pedestrian candidate person of generation into Row classification, then according to pedestrian candidate person's classification preset threshold value, when the pedestrian candidate person in characteristic pattern is the probability of pedestrian When less than preset threshold value, then pedestrian candidate person classification is real pedestrian, and the otherwise classification is not real pedestrian.
It is an advantage of the invention that calculating simply, quickly, the accuracy rate of pedestrian detection can be significantly improved.
Description of the drawings
The present invention will be further described below with reference to the drawings.
Fig. 1 is the network model figure of the present invention.
Fig. 2 is the structural model figure of PVANet networks in the present invention.
Fig. 3 is the structural model figure of PVANet network initial layers in the present invention.
Fig. 4 is the structural model figure that network is proposed in region in the present invention.
Fig. 5(a)For the part training sample exemplary plot in the present invention on Caltech pedestrian detections data set.
Fig. 5(b)Sample instantiation figure is detected for the part in the present invention on Caltech pedestrian detections data set.
Fig. 6(a)For the part training sample exemplary plot in the present invention on INRIA pedestrian detections data set.
Fig. 6(b)Sample instantiation figure is detected for the part in the present invention on INRIA pedestrian detections data set.
Fig. 7(a)The part sample being detected on Caltech pedestrian detection data sets for pedestrian detection model in the present invention This exemplary plot.
Fig. 7(b)The part knot being detected on Caltech pedestrian detection data sets for pedestrian detection model in the present invention Fruit exemplary plot.
Fig. 8(a)The part sample being detected on INRIA pedestrian detection data sets for pedestrian detection model in the present invention Exemplary plot.
Fig. 8(b)The partial results being detected on INRIA pedestrian detection data sets for pedestrian detection model in the present invention Exemplary plot.
Specific implementation mode
Embodiment one
A kind of pedestrian detection method based on deep learning is present embodiments provided, the mentality of designing of this method is:By video figure As being input to designed pedestrian detection model(Model structure is shown in Fig. 1)In, it is given birth to using the PVANet networks based on deep learning Propose that network proposes that method generates pedestrian candidate person and row using region at characteristic pattern, then by the characteristic pattern input area of generation The corresponding score of people's candidate finally classifies to find out to the pedestrian candidate person of generation using trained decision Tree algorithms Real pedestrian.
PVANet networks share 14 layers, and wherein three first layers are convolutional layer, and centre is two groups of initial layers, every group of initial layers packet Containing the identical initial layers of four structures, last three layers are full articulamentum(It is shown in Table 1).As shown in Fig. 2, video image enters PVANet After network, pass through three first layers convolutional layer successively, the processing of intermediate eight layers of initial layers and rear three layers of full articulamentum generates the spy of output Sign figure, characteristic pattern are passed to region by full articulamentum and propose that network and decision tree classifier, the output of full articulamentum are proposed for region The input of network and decision tree classifier.The structure of all initial layers is all identical.
The structure for the PVANet networks that table 1. optimized
Convolutional layer 4*4_32
Convolutional layer 3*3_32
Convolutional layer 3*3_32
First group of initial layers 1*1_96 - 1*1_16_3*3_64 - 1*1_16_3*3_32_3*3_32
First group of initial layers 1*1_96 - 1*1_16_3*3_64 - 1*1_16_3*3_32_3*3_32
First group of initial layers 1*1_96 - 1*1_16_3*3_64 - 1*1_16_3*3_32_3*3_32
First group of initial layers 1*1_96 - 1*1_16_3*3_64 - 1*1_16_3*3_32_3*3_32
Second group of initial layers 1*1_128 - 1*1_32_3*3_96 - 1*1_16_3*3_32_3*3_32
Second group of initial layers 1*1_128 - 1*1_32_3*3_96 - 1*1_16_3*3_32_3*3_32
Second group of initial layers 1*1_128 - 1*1_32_3*3_96 - 1*1_16_3*3_32_3*3_32
Second group of initial layers 1*1_128 - 1*1_32_3*3_96 - 1*1_16_3*3_32_3*3_32
Full articulamentum 4096
Full articulamentum 4096
Full articulamentum 1000
Table 1 lists the structure of the PVANet networks optimized.In the entire network, l*l_M indicates that the convolution kernel of this layer is l*l And will export M characteristic pattern.In initial layers, branch different in initial layers is indicated using "-", last full articulamentum Parameter indicates the number for the neuron for including in full articulamentum.
As shown in figure 3, single initial layers are made of the first, second, third branch, the first branch is by one 1 × 1 convolution Layer composition, the second branch are made of one 1 × 1 convolutional layer and one 3 × 3 convolutional layer, and third branch is by one 1 × 1 Convolutional layer and two 3 × 3 convolutional layers composition.The characteristic pattern that last layer generates is passed to three branches of initial layers respectively, so The characteristic pattern exported afterwards by these three branches is transferred into an articulamentum, and the characteristic pattern which exports finally enters next Layer, becomes next layer of input feature vector figure.Last layer can be the convolutional layer that single initial layers can also be front, and next layer can To be that single initial layers can also subsequent full articulamentum.The convolution feature generated by PVANet networks is used as region and proposes net Network and the input for detecting network.
As shown in figure 4, in network is proposed in region, it is for the input feature vector figure that PVANet networks generate, a m*m is big Small sliding window is used and generates multiple features connected entirely on each width characteristic pattern, and the feature each connected entirely includes two points Branch, one of branch is scs layers, another branch is cds layers.Single sliding window can predict different scale and not simultaneously With the region motion of aspect ratio.Cds layers be used for generating the i.e. coordinates of prediction target's center point and the width of the prediction target and Highly, the scs layers of corresponding scores for being used for generating prediction target predict that target is the estimated probability of pedestrian and is not pedestrian Probability.The output finally generated by cds layers and by scs layers is sent to decision tree classifier and is trained and detects.
In decision tree classifier, the method for training decision tree is as follows:Given training set(x1, y1)...(xi, yi)...(xN, yN),, wherein yiIt is feature vector, and i=1 ..., N.In the starting stage, Weak Classifier is numbered simultaneously Number is denoted as j, according to(1)Formula isDetermine the weight of each Weak Classifier, wherein WjFor Weak Classifier Weight, H be Weak Classifier number.In the training process, n times first are carried out to Weak Classifier to train to obtain training data, is being instructed Practice and training is numbered before starting and number is denoted as n, further according to(2)Formula is Obtain a probability Estimation, wherein Pn(y)For the probabilistic estimated value of Weak Classifier, N is the frequency of training of Weak Classifier, then root According to(3)Formula isCalculate the true Distribution value of Weak Classifier, wherein fn(y)It is weak The actual value of grader, R are set of real numbers, last basis(4)Formula isIt obtains weak The weight of grader.After each iteration, the weight of all Weak Classifiers is normalized again so that all Weak Classifiers Total weight is equal to 1, finally obtains strong classifier
After decision tree trains, classified to the pedestrian candidate person of generation using trained decision Tree algorithms, so Afterwards according to pedestrian candidate person's classification preset threshold value, set in advance when the probability that the pedestrian candidate person in characteristic pattern is pedestrian is less than When fixed threshold value, then pedestrian candidate person classification is real pedestrian, and the otherwise classification is not real pedestrian.
Pedestrian detection model of the present embodiment based on deep learning is tested, to assess its performance.Specific assessment side Method is as follows:
Step A, the data set for assessment algorithm performance is introduced.
The experiment uses two pedestrian detection data sets, respectively Caltech pedestrian detections data set and INRIA pedestrian's inspection Measured data collection.Wherein, Caltech pedestrian detections data set is one bigger pedestrian's data set of current scale, the data Collection has 250000 frames to be marked from a video for being up to ten hours using vehicle-mounted camera shooting in the video, It includes 350000 rectangle frames and 2300 different pedestrians that entire video, which has altogether, and the data set is also to these rectangle frames Between hiding relation marked.This data set includes 11 small videos from entire video, each video For size all in 1G or so, mark all has been carried out in the first six video therein, this six videos include 192000 pedestrians altogether, 6100 positive samples and 61000 negative samples, are used as being trained pedestrian detection network;Five videos do not correspond to afterwards Markup information, this five videos altogether include 155000 pedestrians, 56000 positive samples and 65000 negative samples, be used to Detect the effect of pedestrian detection method.Fig. 5(a)And Fig. 5(b)Give one on the pedestrian detection data set of California Institute of Technology The example of a little pictures, wherein Fig. 5(a)Show the part training image of Caltech pedestrian detection data sets, Fig. 5(b)Exhibition What is shown is the partial test image of Caltech pedestrian detection data sets.From Fig. 5(a)And Fig. 5(b)In as can be seen that Caltech Video frame in pedestrian detection data set is very fuzzy, therefore it is one to carry out experiment on Caltech pedestrian detection data sets Challenging task.
INRIA pedestrian detection data sets are most common static pedestrian's Test databases, include original graph in the data set Piece and corresponding label.INRIA data sets provide two kinds of training and test sample, and it is different that one is photo resolutions Training and test sample, another kind are the identical training of photo resolution and test sample.The experiment of the present embodiment uses picture The different sample of resolution ratio, wherein training set include 614 positive samples and 1218 negative samples, and test set includes 288 positive samples Sheet and 453 negative samples.Fig. 6(a)And Fig. 6(b)Give the example of some pictures on INRIA data sets, Fig. 6(a)It is Train picture, including 6 positive samples and 6 negative samples, Fig. 6 in the part of INIRIA data sets(b)It is the portion of INIRIA data sets Divide test pictures, including 6 positive samples and 6 negative samples.From Fig. 6(a)And Fig. 6(b)As can be seen that in INRIA data sets Video definition is relatively high.
Step B, a kind of pedestrian detection method based on deep learning is given for the present embodiment pedestrian detection model.
First, using the characteristic pattern of the mobile pedestrian of PVANet networks extraction optimized(The knot of the PVANet networks of optimization Structure is as shown in table 1).Video image can generate 512 characteristic patterns by PVANet networks, preceding 128 features in these characteristic patterns Figure is used for region and proposes that network generates pedestrian candidate person and corresponding score.
Secondly, propose that network generates pedestrian candidate person and corresponding score using region.Propose to use in network in region Five kinds of scales and five kinds of length-width ratios generate 25 regions to each sliding window and suggest.For every frame picture, we, which only obtain, divides Highest 200 regions, which suggest being sent into decision tree classifier, is trained decision tree.
Finally, it trains decision tree classifier and the pedestrian candidate person of generation is divided using trained decision Tree algorithms Class is to find out real pedestrian.Purpose to decision tree classifier training is succinct and being capable of fine arranged row in order to obtain one The decision tree of people.In the most initial stage of training decision tree, by all positive samples, grab sample and identical as positive sample quantity Negative sample and a certain proportion of sample for being difficult to classify as training set, entire training is divided into six stages, by decision The threshold value of Tree Classifier is set as 0.7.Include 64 in the first stage of training decision tree to set, later the quantity of tree of each stage It is double, and addition certain proportion, the negative sample for being difficult to classify constitute new training set in old training set, ultimately generate one The decision tree classifier set with 2048, this decision tree classifier is the strong classifier finally used.
Step C, by contrast experiment, the performance of the put forward algorithm of the present embodiment is inquired into.
During the experiment, the threshold value of decision tree classifier is set as 0.7, to the pedestrian detection model of the present embodiment into Row experiment.After experiment, for pedestrian detection model experimental result time performance with it is state-of-the-art with some on miss rate Method is compared, including CompACT-Deep methods, CCF methods and LDCF methods.Final comparison result is in table 2 Middle display.From in final comparison result it can be found that the method for the present embodiment for these state-of-the-art methods not Advantage is only occupied in the processing of single frames picture and on miss rate again smaller than these methods.
Comparison result of the table 2. on time performance and miss rate
Method Time/per pictures (second) Miss rate %
PVANet+RPN+BF [the present embodiment method] 0.48 9
CompACT-Deep 0.5 12
LDCF 0.6 25
CCF 13 17
In addition, Fig. 7(a)And Fig. 7(b)Illustrate the sediment that pedestrian detection is carried out on Caltech pedestrian detection data sets Picture.Fig. 7(a)It is the original image on Caltech data sets, Fig. 7(b)It is that these original images are examined in the pedestrian of the present embodiment Survey corresponding testing result on model.From Fig. 7(a)And Fig. 7(b)Although can be seen that on Caltech pedestrian detection data sets Picture is very fuzzy, but the pedestrian detection model of the present embodiment remains to obtain the result of good pedestrian detection.
The pedestrian detection model that the present embodiment is proposed carried out on Caltech pedestrian detection data sets training and It is tested on INRIA data sets to verify the validity of model.Fig. 8(a)And Fig. 8(b)It illustrates in INRIA pedestrian detection numbers According to the sediment picture for carrying out pedestrian detection on collection.Fig. 8(a)It is the original image on INRIA data sets, Fig. 8(b)It is these Original image corresponding testing result on the pedestrian detection model of the present embodiment.From Fig. 8(a)And Fig. 8(b)As can be seen that the greatest extent The problem of pipe blocks is not resolved, but the model of the present embodiment obtains very well on INRIA pedestrian detection data sets Pedestrian detection result.
In short, by showing the scheme that the present embodiment proposes in the experiment of INRIA and Caltech pedestrian detection data sets It is really effective, the accuracy and rapidity of pedestrian detection can be significantly improved.
In addition to the implementation, the present invention can also have other embodiment.It is all to use equivalent substitution or equivalent transformation shape At technical solution, fall within the scope of protection required by the present invention.

Claims (10)

1. a kind of pedestrian detection method based on deep learning, which is characterized in that include the following steps:
Video image input extraction network to be detected is generated characteristic pattern by the first step;
Second step will extract the characteristic pattern input area proposal network that network generates, and region is recycled to propose that method detection most has It may be the region of pedestrian, generate pedestrian candidate person and the corresponding score of pedestrian candidate person;
Third step determines whether pedestrian candidate person is real pedestrian using trained decision Tree algorithms.
2. a kind of pedestrian detection method based on deep learning according to claim 1, it is characterised in that:The extraction net Network uses PVANet networks, the PVANet networks to have 14 layers, and wherein three first layers are convolutional layer, and centre is two groups of initial layers, Every group of initial layers include the identical initial layers of four structures, and last three layers are full articulamentum;The output of the full articulamentum is area Propose the input of network and decision tree classifier in domain.
3. a kind of pedestrian detection method based on deep learning according to claim 2, it is characterised in that:Single initial layers by First, second, third branch forms, and first branch is made of one 1 × 1 convolutional layer, and second branch is by one 1 The convolutional layer composition of × 1 convolutional layer and one 3 × 3, the third branch is by one 1 × 1 convolutional layer and two 3 × 3 Convolutional layer forms.
4. a kind of pedestrian detection method based on deep learning according to claim 3, which is characterized in that single initial layers Generating characteristic pattern, the specific method is as follows:Last layer generate characteristic pattern by respectively be passed to initial layers three branches, then by The characteristic pattern of these three branches output is transferred into an articulamentum, finally enters next layer, becomes next layer of input feature vector figure.
5. a kind of pedestrian detection method based on deep learning according to claim 4, it is characterised in that:It is carried in the region It discusses in network, for the input feature vector figure that PVANet networks generate, a sliding window is used and is generated on each width characteristic pattern Multiple features connected entirely, the feature each connected entirely include Liang Ge branches, and one of branch is scs layers, another branch It is cds layers;Described cds layers is used for generating pedestrian candidate person, including the coordinate of pedestrian candidate person's central point and the pedestrian candidate The width and height of person;Described scs layers is used for generating the corresponding score of pedestrian candidate person;The pedestrian candidate person generated by cds layers Score corresponding with the pedestrian candidate person generated by scs layers is transported to decision tree classifier and is trained and detects.
6. a kind of pedestrian detection method based on deep learning according to claim 5, it is characterised in that:Single sliding window The region motion of different scale and different aspect ratios can be predicted simultaneously.
7. a kind of pedestrian detection method based on deep learning according to claim 6, it is characterised in that:The sliding window When the region motion of four scales of prediction and four length-width ratios, 4*4 region motion will produce.
8. a kind of pedestrian detection method based on deep learning according to claim 7, it is characterised in that:The sliding window 4*4*4 output is generated at cds layers, and 2*4*4 output is generated at scs layers.
9. a kind of pedestrian detection method based on deep learning according to claim 8, it is characterised in that:Using RealBoost algorithms train the method for decision tree as follows:
1. given training set,
(x1, y1)...(xi, yi)...(xN, yN
Wherein, yiIt is feature vector, and i=1 ..., N;
2. in the starting stage, Weak Classifier is numbered and number is denoted as j, according to(1)Formula determines each Weak Classifier Weight,
(1)
Wherein, WjFor the weight of Weak Classifier, H is the number of Weak Classifier;
3. carrying out n times to Weak Classifier to train to obtain training data, training is numbered and number is denoted as n, then basis (2)Formula obtains a probability Estimation,
(2)
Wherein, Pn(y)For the probabilistic estimated value of Weak Classifier, N is the frequency of training of Weak Classifier;
4. basis(3)Formula calculates the true Distribution value of Weak Classifier,
(3)
Wherein, fn(y)For the actual value of Weak Classifier, R is set of real numbers;
5. in the training process, according to(4)Formula obtains the weight of Weak Classifier,
(4)
6. after each iteration, the weight of all Weak Classifiers is normalized again so that total weight of all Weak Classifiers Equal to 1, strong classifier is finally obtained, according to(5)Formula obtains strong classifier,
(5)
Wherein, N is the frequency of training of Weak Classifier.
10. a kind of pedestrian detection method based on deep learning according to claim 9, which is characterized in that determine real row The method of people is as follows:Classify to the pedestrian candidate person of generation first with trained decision Tree algorithms, then according to row People's candidate classification preset threshold value, when the probability that the pedestrian candidate person in characteristic pattern is pedestrian is less than preset threshold value When, then pedestrian candidate person classification is pedestrian, and the otherwise classification is not pedestrian.
CN201810082310.0A 2018-01-29 2018-01-29 A kind of pedestrian detection method based on deep learning Pending CN108460336A (en)

Priority Applications (1)

Application Number Priority Date Filing Date Title
CN201810082310.0A CN108460336A (en) 2018-01-29 2018-01-29 A kind of pedestrian detection method based on deep learning

Applications Claiming Priority (1)

Application Number Priority Date Filing Date Title
CN201810082310.0A CN108460336A (en) 2018-01-29 2018-01-29 A kind of pedestrian detection method based on deep learning

Publications (1)

Publication Number Publication Date
CN108460336A true CN108460336A (en) 2018-08-28

Family

ID=63239171

Family Applications (1)

Application Number Title Priority Date Filing Date
CN201810082310.0A Pending CN108460336A (en) 2018-01-29 2018-01-29 A kind of pedestrian detection method based on deep learning

Country Status (1)

Country Link
CN (1) CN108460336A (en)

Cited By (5)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN109409309A (en) * 2018-11-05 2019-03-01 电子科技大学 A kind of intelligent alarm system and method based on human testing
CN110263712A (en) * 2019-06-20 2019-09-20 江南大学 A kind of coarse-fine pedestrian detection method based on region candidate
CN110837769A (en) * 2019-08-13 2020-02-25 广州三木智能科技有限公司 Embedded far infrared pedestrian detection method based on image processing and deep learning
WO2020191668A1 (en) * 2019-03-27 2020-10-01 Hangzhou Fabu Technology Co. Ltd Proposal processing method and related products
CN111797769A (en) * 2020-07-06 2020-10-20 东北大学 Small target sensitive vehicle detection system

Citations (7)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN102103690A (en) * 2011-03-09 2011-06-22 南京邮电大学 Method for automatically portioning hair area
CN102496001A (en) * 2011-11-15 2012-06-13 无锡港湾网络科技有限公司 Method of video monitor object automatic detection and system thereof
CN105243356A (en) * 2015-09-10 2016-01-13 北京大学 Method of building pedestrian detection model and device and pedestrian detection method
CN105975929A (en) * 2016-05-04 2016-09-28 北京大学深圳研究生院 Fast pedestrian detection method based on aggregated channel features
CN106897738A (en) * 2017-01-22 2017-06-27 华南理工大学 A kind of pedestrian detection method based on semi-supervised learning
CN107240087A (en) * 2016-11-01 2017-10-10 北京深鉴智能科技有限公司 Object detection system and method
US20170358086A1 (en) * 2015-09-30 2017-12-14 Fotonation Limited Method and system for tracking an object

Patent Citations (7)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN102103690A (en) * 2011-03-09 2011-06-22 南京邮电大学 Method for automatically portioning hair area
CN102496001A (en) * 2011-11-15 2012-06-13 无锡港湾网络科技有限公司 Method of video monitor object automatic detection and system thereof
CN105243356A (en) * 2015-09-10 2016-01-13 北京大学 Method of building pedestrian detection model and device and pedestrian detection method
US20170358086A1 (en) * 2015-09-30 2017-12-14 Fotonation Limited Method and system for tracking an object
CN105975929A (en) * 2016-05-04 2016-09-28 北京大学深圳研究生院 Fast pedestrian detection method based on aggregated channel features
CN107240087A (en) * 2016-11-01 2017-10-10 北京深鉴智能科技有限公司 Object detection system and method
CN106897738A (en) * 2017-01-22 2017-06-27 华南理工大学 A kind of pedestrian detection method based on semi-supervised learning

Non-Patent Citations (6)

* Cited by examiner, † Cited by third party
Title
CORNELIU ZAHARIA 等: "PVANet optimization for person detection", 《2017 INTERNATIONAL CONFERENCE ON OPTIMIZATION OF ELECTRICAL AND ELECTRONIC EQUIPMENT》 *
JEROME FRIEDMAN 等: "Additive logistic regression: a statistical view of boosting", 《ANNALS OF STATISTICS》 *
SANGHOON HONG 等: "PVANet: Lightweight Deep Neural Networks for Real-time Object Detection", 《网络在线公开: HTTPS://ARXIV.ORG/ABS/1611.08588》 *
WEICHENG SUN 等: "Deep learning based pedestrian detection", 《2018 CHINESE CONTROL AND DECISION CONFERENCE》 *
孙炜程: "基于深度学习的上下文相关的目标跟踪", 《中国优秀硕士学位论文全文数据库 信息科技辑》 *
徐科 等: "《金属表面质量在线检测技术》", 30 October 2016, 冶金工业出版社 *

Cited By (9)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN109409309A (en) * 2018-11-05 2019-03-01 电子科技大学 A kind of intelligent alarm system and method based on human testing
WO2020191668A1 (en) * 2019-03-27 2020-10-01 Hangzhou Fabu Technology Co. Ltd Proposal processing method and related products
US10909417B2 (en) 2019-03-27 2021-02-02 Hangzhou Fabu Technology Co.. Ltd. Proposal processing method and related products
CN110263712A (en) * 2019-06-20 2019-09-20 江南大学 A kind of coarse-fine pedestrian detection method based on region candidate
CN110263712B (en) * 2019-06-20 2021-02-23 江南大学 Coarse and fine pedestrian detection method based on region candidates
CN110837769A (en) * 2019-08-13 2020-02-25 广州三木智能科技有限公司 Embedded far infrared pedestrian detection method based on image processing and deep learning
CN110837769B (en) * 2019-08-13 2023-08-29 中山市三卓智能科技有限公司 Image processing and deep learning embedded far infrared pedestrian detection method
CN111797769A (en) * 2020-07-06 2020-10-20 东北大学 Small target sensitive vehicle detection system
CN111797769B (en) * 2020-07-06 2023-06-30 东北大学 Small-target-sensitive vehicle detection system

Similar Documents

Publication Publication Date Title
JP6830707B1 (en) Person re-identification method that combines random batch mask and multi-scale expression learning
CN110599448B (en) Migratory learning lung lesion tissue detection system based on MaskScoring R-CNN network
CN109919108B (en) Remote sensing image rapid target detection method based on deep hash auxiliary network
CN108460336A (en) A kind of pedestrian detection method based on deep learning
CN109344736B (en) Static image crowd counting method based on joint learning
WO2020102988A1 (en) Feature fusion and dense connection based infrared plane target detection method
CN106778595B (en) Method for detecting abnormal behaviors in crowd based on Gaussian mixture model
CN107945153A (en) A kind of road surface crack detection method based on deep learning
CN103366602B (en) Method of determining parking lot occupancy from digital camera images
CN106940816A (en) Connect the CT image Lung neoplasm detecting systems of convolutional neural networks entirely based on 3D
CN108830188A (en) Vehicle checking method based on deep learning
CN110033473B (en) Moving target tracking method based on template matching and depth classification network
CN104537647B (en) A kind of object detection method and device
CN112287788A (en) Pedestrian detection method based on improved YOLOv3 and improved NMS
CN111160249A (en) Multi-class target detection method of optical remote sensing image based on cross-scale feature fusion
CN108492272A (en) Cardiovascular vulnerable plaque recognition methods based on attention model and multitask neural network and system
CN109241982A (en) Object detection method based on depth layer convolutional neural networks
CN106980858A (en) The language text detection of a kind of language text detection with alignment system and the application system and localization method
CN110826379B (en) Target detection method based on feature multiplexing and YOLOv3
CN108875624A (en) Method for detecting human face based on the multiple dimensioned dense Connection Neural Network of cascade
CN107507170A (en) A kind of airfield runway crack detection method based on multi-scale image information fusion
CN110716792B (en) Target detector and construction method and application thereof
CN110543906B (en) Automatic skin recognition method based on Mask R-CNN model
CN110008899B (en) Method for extracting and classifying candidate targets of visible light remote sensing image
CN110349167A (en) A kind of image instance dividing method and device

Legal Events

Date Code Title Description
PB01 Publication
PB01 Publication
SE01 Entry into force of request for substantive examination
SE01 Entry into force of request for substantive examination
RJ01 Rejection of invention patent application after publication

Application publication date: 20180828

RJ01 Rejection of invention patent application after publication