CN111832463A - Deep learning-based traffic sign detection method - Google Patents

Deep learning-based traffic sign detection method Download PDF

Info

Publication number
CN111832463A
CN111832463A CN202010646070.XA CN202010646070A CN111832463A CN 111832463 A CN111832463 A CN 111832463A CN 202010646070 A CN202010646070 A CN 202010646070A CN 111832463 A CN111832463 A CN 111832463A
Authority
CN
China
Prior art keywords
traffic sign
sample
yolov3
data set
image
Prior art date
Legal status (The legal status is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the status listed.)
Pending
Application number
CN202010646070.XA
Other languages
Chinese (zh)
Inventor
李东洁
孙富文
Current Assignee (The listed assignees may be inaccurate. Google has not performed a legal analysis and makes no representation or warranty as to the accuracy of the list.)
Harbin University of Science and Technology
Original Assignee
Harbin University of Science and Technology
Priority date (The priority date is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the date listed.)
Filing date
Publication date
Application filed by Harbin University of Science and Technology filed Critical Harbin University of Science and Technology
Priority to CN202010646070.XA priority Critical patent/CN111832463A/en
Publication of CN111832463A publication Critical patent/CN111832463A/en
Pending legal-status Critical Current

Links

Images

Classifications

    • GPHYSICS
    • G06COMPUTING; CALCULATING OR COUNTING
    • G06VIMAGE OR VIDEO RECOGNITION OR UNDERSTANDING
    • G06V20/00Scenes; Scene-specific elements
    • G06V20/50Context or environment of the image
    • G06V20/56Context or environment of the image exterior to a vehicle by using sensors mounted on the vehicle
    • G06V20/58Recognition of moving objects or obstacles, e.g. vehicles or pedestrians; Recognition of traffic objects, e.g. traffic signs, traffic lights or roads
    • G06V20/582Recognition of moving objects or obstacles, e.g. vehicles or pedestrians; Recognition of traffic objects, e.g. traffic signs, traffic lights or roads of traffic signs
    • GPHYSICS
    • G06COMPUTING; CALCULATING OR COUNTING
    • G06FELECTRIC DIGITAL DATA PROCESSING
    • G06F18/00Pattern recognition
    • G06F18/20Analysing
    • G06F18/23Clustering techniques
    • G06F18/232Non-hierarchical techniques
    • G06F18/2321Non-hierarchical techniques using statistics or function optimisation, e.g. modelling of probability density functions
    • G06F18/23213Non-hierarchical techniques using statistics or function optimisation, e.g. modelling of probability density functions with fixed number of clusters, e.g. K-means clustering
    • GPHYSICS
    • G06COMPUTING; CALCULATING OR COUNTING
    • G06VIMAGE OR VIDEO RECOGNITION OR UNDERSTANDING
    • G06V2201/00Indexing scheme relating to image or video recognition or understanding
    • G06V2201/07Target detection
    • GPHYSICS
    • G06COMPUTING; CALCULATING OR COUNTING
    • G06VIMAGE OR VIDEO RECOGNITION OR UNDERSTANDING
    • G06V2201/00Indexing scheme relating to image or video recognition or understanding
    • G06V2201/09Recognition of logos

Abstract

The invention discloses a traffic sign detection method based on deep learning, which comprises the following steps: (1) and acquiring an outdoor traffic sign image by using a digital camera, and denoising the image by using median filtering and Gaussian filtering. (2) And (3) making a traffic sign data set, performing rotation, translation and stretching transformation, expanding the data set, and dividing the data set into a training set and a verification set according to a ratio of 9: 1. (3) And comprehensively using the leaders cluster and the k-means cluster to determine the initial position of the anchor box of the target. (4) Using successive 1 x 1 convolutional layers in YOLOv3, shallow feature information for the image was extracted and a prediction layer for the shallow features was added. (5) The training set is input into a modified YOLOv3 network for training. (6) And testing the trained model by using a verification set, and verifying the detection and identification effects of the improved model on the traffic signs.

Description

Deep learning-based traffic sign detection method
Technical Field
The invention belongs to the field of artificial intelligence, and particularly relates to a deep learning-based traffic sign detection method.
Background
The traffic sign recognition is an important subsystem of the intelligent traffic system and is mainly used for processing images and videos of surrounding conditions of a vehicle, which are shot by a vehicle-mounted camera, detecting and recognizing the traffic sign, submitting the recognition result to a driver or other intelligent traffic systems, and further assisting the driver in driving or providing decision information for the intelligent traffic systems.
With the development of deep learning technology, many target detection algorithms are widely applied in the field of image recognition, wherein the algorithms are typified by Faster-Rcnn, SSD, YOLO. SSD and YOLO both belong to a one-stage target detection algorithm, have good expression on the detection speed and classification accuracy of targets in images, particularly the YOLOv3 algorithm in the YOLO series, and realize real-time performance on the recognition of vehicles and pedestrians in the field of automatic driving. Although YOLOv3 is a detection algorithm with better effect in object detection, the recognition accuracy is still lower for the small-sized objects such as traffic signs. Therefore, in order to solve the problem, a traffic sign detection method based on deep learning is provided, wherein YOLOv3 is selected as a basic model structure for traffic sign detection, a network structure is improved, the characteristic information of the traffic sign can be fully extracted, and the characteristics of a plurality of scales are fused to accurately predict the traffic sign category.
Disclosure of Invention
The invention aims to provide a traffic sign detection method based on deep learning so as to improve the detection speed and accuracy of traffic signs. The traffic sign detection method comprises the following steps.
A. Obtaining traffic sign images
B. A traffic sign data set is created and divided into a training set and a data set
C. K-means clustering method used by improved model YOLOv3
D. Network structure of improved target detection network YOLOv3
E. Training improved Yolov3 target detection network
F. Testing of the improved Yolov3 model with a validation set
Preferably, the step a includes acquiring an outdoor traffic sign image by using a digital camera, where the traffic sign mainly includes an indication sign, a prohibition sign, and a warning sign, and therefore, the acquired image should include the three types of traffic signs. And then, filtering and denoising the acquired traffic sign image.
Preferably, the traffic sign data set is produced in the step B, and the data set is divided into a positive sample and a negative sample, wherein the positive sample indicates that the sample label is consistent with the real type of the sample, and otherwise, the positive sample is the negative sample. In addition, the positive samples should be extended to enhance the classification capability of the model.
Preferably, the step C comprises the following steps
A. Preprocessing the original data set by adopting a leader clustering algorithm to generate a plurality of sample subsets
B. Sampling the generated sample subset
C. Clustering sampled sample subsets using a K-means clustering algorithm
D. Integrating the clustering result to determine the initial position of the anchor box
Preferably, the network structure of the modified YOLOv3 in the step D includes the following two aspects:
A. in the first few layers of the YOLOv3 network, the shallow information of the input image was extracted using successive 1 × 1 convolutional layers (one 1 × 1 convolutional layer after the 1 × 1 convolutional layer).
B. Adding a scale prediction layer
The invention has the beneficial effects that:
(1) due to the adoption of the continuous 1-by-1 convolution layer, the extraction capability of the YOLOv3 model on the shallow features of the input image is enhanced, and the classification capability of the model on the small target traffic signs is improved.
(2) Because a scale prediction layer is added to the model, the extracted shallow feature can be predicted, and the accuracy of the model for predicting the traffic sign category is improved.
(3) As the initial position of the anchor box is determined by comprehensively using leader clustering and k-means clustering algorithms, the model can more accurately position the position of the traffic sign.
Drawings
Fig. 1 is a diagram of the overall network architecture of the present invention.
FIG. 2 shows a continuous 1 x 1 convolutional stack structure used in the present invention
Detailed Description
The method comprises the steps of firstly, acquiring an outdoor traffic sign image by using a digital camera, wherein the outdoor traffic sign image comprises three types of prompting signs, forbidden signs and warning signs. Removing salt and pepper noise in the traffic sign image by using a media Blur median filtering algorithm in an opencv computer vision processing library, further removing noise by using Gaussian filtering, and finally enhancing the characteristics of the traffic sign image by using a histogram matching algorithm.
And step two, making a data set. And (3) carrying out rotation, translation and stretching transformation on the traffic sign image subjected to filtering and noise reduction in the step one, expanding a data set, and dividing the data set into a training set and a verification set according to a ratio of 9: 1.
Step three: the k-means clustering algorithm is improved, and the execution method is as follows:
(1) for dataset X ═ X1,x2,...xn) Using a non-linear mapping theta to map the samples xiMapping into high dimensional space
(2) K-means clustering is carried out in high-latitude sum space, and an objective function is optimized:
Figure BDA0002573158540000031
in the formula mkIs the mean value of the samples and is,
Figure BDA0002573158540000032
the formula for calculating the nuclear distance of two feature points in the nuclear space is defined as:
Figure BDA0002573158540000033
where K () is a kernel function
(3) After clustering each sample subset, all samples are merged, the sample set after merging contains K target classes, and the mean value of each class is calculated:
Figure BDA0002573158540000034
wherein n isiData volume, x, representing the categoryiIs the mean of the i-th class
(4) Calculating the distance between any two mean values
Setting the clustering center threshold value as M, and the distance formula between two class mean values as follows:
L=|xi-xj|2
and when the distance between the mean values of the two targets is smaller than M, merging the two targets into a class, continuing to calculate the class mean value distance according to the formula, and obtaining the final clustering result after merging and fusing the pathogenesis of the sample subset.
And step four, improving the network structure of YOLOv 3. The improved YOLOv3 network is specifically illustrated in connection with fig. 1. As shown in fig. 1, the feature map obtained by the convolution of the previous layer is processed using the consecutive 1 × 1 convolutional layers at the third convolutional layer of the network, and then processed again using one 3 × 3 convolutional layer.
The convolution structure shown in fig. 2 is used in the 7 th, 8 th and 9 th convolution layers of the network, then the convolution layers 10 th, 11 th and 12 th convolution layers continue to extract image shallow features by using the convolution and the structure shown in fig. 2, and finally the feature map at the scale is sent into the image type prediction network of YOLOv3 to predict the type of the target.
Step five: inputting the training set obtained in the step two into the improved YOLOv3 network in the step four for training, wherein the algebraic epoch of the training is 10000, a batch gradient descent method is adopted in each epoch, the input data volume of each batch is 256, and in the training process, the learning rate used by the model is dynamically adjusted by using the degraded learning rate, so that the model training is accelerated.
Step six: and testing the model trained in the step five by using the verification set in the step two, and verifying the detection and identification effect of the improved model on the traffic sign.

Claims (5)

1. A traffic sign detection method based on deep learning is characterized by comprising the following steps:
A. obtaining traffic sign images
B. A traffic sign data set is created and divided into a training set and a data set
C. K-means clustering method used by improved model YOLOv3
D. Network structure of improved target detection network YOLOv3
E. Training improved Yolov3 target detection network
F. The improved YOLOv3 model was tested using a validation set.
2. The method as claimed in claim 1, wherein the step a includes using a digital camera to obtain the outdoor traffic sign image, and the traffic signs mainly include an indication sign, a prohibition sign and a warning sign, so that the obtained image should include the three types of traffic signs. And then, filtering and denoising the acquired traffic sign image.
3. The method as claimed in claim 1, wherein the traffic sign data set is prepared in step B, and the data set is divided into two types, namely positive sample and negative sample, wherein the positive sample indicates that the sample label is consistent with the true category of the sample, and vice versa, the sample is a negative sample. In addition, the positive samples should be extended to enhance the classification capability of the model.
4. The deep learning-based traffic sign detection method according to claim 1, wherein the step C comprises the steps of:
A. preprocessing the original data set by adopting a leader clustering algorithm to generate a plurality of sample subsets
B. Sampling the generated sample subset
C. Clustering sampled sample subsets using a K-means clustering algorithm
D. And integrating the clustering results and determining the initial position of the anchor box.
5. The method as claimed in claim 1, wherein the step D of improving the network structure of YOLOv3 includes two aspects of
A. In the first few layers of the YOLOv3 network, the shallow information of the input image was extracted using successive 1 × 1 convolutional layers (one 1 × 1 convolutional layer after the 1 × 1 convolutional layer).
B. A scale prediction layer is added.
CN202010646070.XA 2020-07-07 2020-07-07 Deep learning-based traffic sign detection method Pending CN111832463A (en)

Priority Applications (1)

Application Number Priority Date Filing Date Title
CN202010646070.XA CN111832463A (en) 2020-07-07 2020-07-07 Deep learning-based traffic sign detection method

Applications Claiming Priority (1)

Application Number Priority Date Filing Date Title
CN202010646070.XA CN111832463A (en) 2020-07-07 2020-07-07 Deep learning-based traffic sign detection method

Publications (1)

Publication Number Publication Date
CN111832463A true CN111832463A (en) 2020-10-27

Family

ID=72900302

Family Applications (1)

Application Number Title Priority Date Filing Date
CN202010646070.XA Pending CN111832463A (en) 2020-07-07 2020-07-07 Deep learning-based traffic sign detection method

Country Status (1)

Country Link
CN (1) CN111832463A (en)

Cited By (2)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN112613387A (en) * 2020-12-18 2021-04-06 五邑大学 Traffic sign detection method based on YOLOv3
CN113609895A (en) * 2021-06-22 2021-11-05 上海中安电子信息科技有限公司 Road traffic information acquisition method based on improved Yolov3

Citations (5)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN110414421A (en) * 2019-07-25 2019-11-05 电子科技大学 A kind of Activity recognition method based on sequential frame image
CN111209962A (en) * 2020-01-06 2020-05-29 电子科技大学 Combined image classification method based on CNN (CNN) feature extraction network) and combined heat map feature regression
CN111274970A (en) * 2020-01-21 2020-06-12 南京航空航天大学 Traffic sign detection method based on improved YOLO v3 algorithm
CN111325152A (en) * 2020-02-19 2020-06-23 北京工业大学 Deep learning-based traffic sign identification method
CN111340695A (en) * 2020-02-10 2020-06-26 上海智幻软件科技有限公司 Super-resolution reconstruction method of dome screen video

Patent Citations (5)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN110414421A (en) * 2019-07-25 2019-11-05 电子科技大学 A kind of Activity recognition method based on sequential frame image
CN111209962A (en) * 2020-01-06 2020-05-29 电子科技大学 Combined image classification method based on CNN (CNN) feature extraction network) and combined heat map feature regression
CN111274970A (en) * 2020-01-21 2020-06-12 南京航空航天大学 Traffic sign detection method based on improved YOLO v3 algorithm
CN111340695A (en) * 2020-02-10 2020-06-26 上海智幻软件科技有限公司 Super-resolution reconstruction method of dome screen video
CN111325152A (en) * 2020-02-19 2020-06-23 北京工业大学 Deep learning-based traffic sign identification method

Non-Patent Citations (1)

* Cited by examiner, † Cited by third party
Title
葛雯 等: "改进YOLOV3算法在行人识别中的应用", 《计算机工程与应用》 *

Cited By (2)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN112613387A (en) * 2020-12-18 2021-04-06 五邑大学 Traffic sign detection method based on YOLOv3
CN113609895A (en) * 2021-06-22 2021-11-05 上海中安电子信息科技有限公司 Road traffic information acquisition method based on improved Yolov3

Similar Documents

Publication Publication Date Title
CN107563372B (en) License plate positioning method based on deep learning SSD frame
CN108510467B (en) SAR image target identification method based on depth deformable convolution neural network
CN112418117B (en) Small target detection method based on unmanned aerial vehicle image
CN106778595B (en) Method for detecting abnormal behaviors in crowd based on Gaussian mixture model
US20170200063A1 (en) Low- and high-fidelity classifiers applied to road-scene images
US20170206434A1 (en) Low- and high-fidelity classifiers applied to road-scene images
Yu et al. Vehicle detection from high-resolution remote sensing imagery using convolutional capsule networks
CN112016605B (en) Target detection method based on corner alignment and boundary matching of bounding box
CN113723377B (en) Traffic sign detection method based on LD-SSD network
CN111709313B (en) Pedestrian re-identification method based on local and channel combination characteristics
CN112738470B (en) Method for detecting parking in highway tunnel
CN110659601B (en) Depth full convolution network remote sensing image dense vehicle detection method based on central point
CN106845458B (en) Rapid traffic sign detection method based on nuclear overrun learning machine
CN112084890A (en) Multi-scale traffic signal sign identification method based on GMM and CQFL
CN112699967B (en) Remote airport target detection method based on improved deep neural network
CN111582339A (en) Vehicle detection and identification method based on deep learning
CN114913498A (en) Parallel multi-scale feature aggregation lane line detection method based on key point estimation
CN111915583A (en) Vehicle and pedestrian detection method based on vehicle-mounted thermal infrared imager in complex scene
CN110991374B (en) Fingerprint singular point detection method based on RCNN
Li et al. Pixel-level recognition of pavement distresses based on U-Net
CN111832463A (en) Deep learning-based traffic sign detection method
CN111950583A (en) Multi-scale traffic signal sign identification method based on GMM clustering
Asgarian Dehkordi et al. Vehicle type recognition based on dimension estimation and bag of word classification
CN115861756A (en) Earth background small target identification method based on cascade combination network
CN116416503A (en) Small sample target detection method, system and medium based on multi-mode fusion

Legal Events

Date Code Title Description
PB01 Publication
PB01 Publication
SE01 Entry into force of request for substantive examination
SE01 Entry into force of request for substantive examination
WD01 Invention patent application deemed withdrawn after publication
WD01 Invention patent application deemed withdrawn after publication

Application publication date: 20201027