CN107122701A - A traffic route sign recognition method based on image saliency and deep learning - Google Patents
A traffic route sign recognition method based on image saliency and deep learning
- Publication number
- CN107122701A CN107122701A CN201710122736.XA CN201710122736A CN107122701A CN 107122701 A CN107122701 A CN 107122701A CN 201710122736 A CN201710122736 A CN 201710122736A CN 107122701 A CN107122701 A CN 107122701A
- Authority
- CN
- China
- Prior art keywords
- image
- region
- traffic route
- Prior art date
- Legal status (The legal status is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the status listed.)
- Pending
Links
Classifications
-
- G—PHYSICS
- G06—COMPUTING; CALCULATING OR COUNTING
- G06V—IMAGE OR VIDEO RECOGNITION OR UNDERSTANDING
- G06V20/00—Scenes; Scene-specific elements
- G06V20/50—Context or environment of the image
- G06V20/56—Context or environment of the image exterior to a vehicle by using sensors mounted on the vehicle
- G06V20/58—Recognition of moving objects or obstacles, e.g. vehicles or pedestrians; Recognition of traffic objects, e.g. traffic signs, traffic lights or roads
- G06V20/582—Recognition of moving objects or obstacles, e.g. vehicles or pedestrians; Recognition of traffic objects, e.g. traffic signs, traffic lights or roads of traffic signs
-
- G—PHYSICS
- G06—COMPUTING; CALCULATING OR COUNTING
- G06N—COMPUTING ARRANGEMENTS BASED ON SPECIFIC COMPUTATIONAL MODELS
- G06N3/00—Computing arrangements based on biological models
- G06N3/02—Neural networks
- G06N3/06—Physical realisation, i.e. hardware implementation of neural networks, neurons or parts of neurons
- G06N3/061—Physical realisation, i.e. hardware implementation of neural networks, neurons or parts of neurons using biological neurons, e.g. biological neurons connected to an integrated circuit
-
- G—PHYSICS
- G06—COMPUTING; CALCULATING OR COUNTING
- G06N—COMPUTING ARRANGEMENTS BASED ON SPECIFIC COMPUTATIONAL MODELS
- G06N3/00—Computing arrangements based on biological models
- G06N3/02—Neural networks
- G06N3/08—Learning methods
Landscapes
- Engineering & Computer Science (AREA)
- Physics & Mathematics (AREA)
- Theoretical Computer Science (AREA)
- Health & Medical Sciences (AREA)
- Life Sciences & Earth Sciences (AREA)
- Biomedical Technology (AREA)
- Biophysics (AREA)
- Molecular Biology (AREA)
- General Physics & Mathematics (AREA)
- Computational Linguistics (AREA)
- Artificial Intelligence (AREA)
- Data Mining & Analysis (AREA)
- Evolutionary Computation (AREA)
- General Health & Medical Sciences (AREA)
- Computing Systems (AREA)
- General Engineering & Computer Science (AREA)
- Neurology (AREA)
- Mathematical Physics (AREA)
- Software Systems (AREA)
- Microelectronics & Electronic Packaging (AREA)
- Multimedia (AREA)
- Image Analysis (AREA)
Abstract
The invention discloses a traffic route sign recognition method based on image saliency and deep learning. The method first builds a convolutional neural network (CNN) of the AlexNet type, then obtains traffic route sign pictures from the ImageNet data set, normalizes and preprocesses the training pictures, feeds them into the network, and trains the AlexNet network. Test pictures are then processed with RC (region contrast) saliency extraction and input into the trained AlexNet convolutional neural network, which performs the detection and classification of the traffic route signs. The invention combines the efficiency of image saliency in extracting salient regions with the strength of deep learning in picture recognition, achieving accurate classification of traffic route sign pictures.
Description
Technical field
The invention belongs to the technical field of image recognition, and more particularly relates to an image recognition method for traffic route signs based on a convolutional neural network model.
Background technology
With economic development, modern transportation is very busy, and road traffic still occupies the leading position. However, traffic safety and traffic congestion have become increasingly serious social problems and also cause astonishing economic losses. Together with environmental pollution and energy problems, the solution of road traffic problems must resort to intelligent technology. The research field of intelligent transportation systems thus arose and has developed rapidly.
Traffic sign recognition is one of the still unsolved problems in the field of intelligent transportation system research, and is also one of the more difficult outdoor scene image recognition problems. Although research on traffic sign recognition has achieved certain results, many problems and defects remain, mainly:
1. Theoretical research is over-emphasized, while application-oriented research is lacking. Many theories and methods are built on standard images or on particular local conditions, without considering the demands of more practical applications.
2. Processing methods are relatively simple, few intelligent methods are used, and intelligent methods are seldom combined with other methods. Current processing methods mainly rely on intelligent techniques such as artificial neural networks and genetic algorithms; many methods are improvements or applications of conventional methods, but intelligent methods are rarely combined with others.
3. Most experiments are based on standard images, and research on realistic pictures is limited. In the discrimination stage gray-scale images are mostly used as the object, while color images are rarely used.
The content of the invention
It is a primary object of the present invention to overcome the shortcomings and deficiencies of the prior art by providing a traffic route sign recognition method based on image saliency and deep learning. The present invention processes the detection image with an image saliency extraction algorithm, efficiently extracts the salient region containing the traffic route sign, suppresses interference from non-salient regions, better highlights salient targets under complex backgrounds, improves processing efficiency and accuracy, offers better robustness, and is more suitable for practical intelligent road traffic systems.
In order to achieve the above object, the present invention adopts the following technical scheme.
The working process of the traffic route sign recognition method based on image saliency and deep learning according to the present invention comprises the following steps:
S1, download the training data set;
S2, preprocessing: preprocess the images in the data set of step S1;
S3, train the convolutional neural network model;
S4, saliency extraction: extract the road signs in the test pictures with the region contrast method;
S5, input the road sign pictures extracted in step S4 into the convolutional neural network model trained in step S3, and identify the class of the traffic route sign image with the convolutional neural network model.
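As a structural sketch only, the five steps above can be wired together as placeholder functions; every function name and the toy data are illustrative inventions, not from the patent:

```python
# Hedged sketch of steps S1-S5; all function bodies are illustrative stubs.

def download_training_set():          # S1: obtain labelled traffic-sign pictures
    return [("sign_picture", "warning")]

def preprocess(dataset):              # S2: normalize / crop each picture
    return [(pic, label) for pic, label in dataset]

def train_cnn(dataset):               # S3: train the convolutional network
    return {"trained": True, "classes": sorted({label for _, label in dataset})}

def extract_saliency(test_picture):   # S4: region-contrast (RC) saliency extraction
    return "salient_region_of_" + test_picture

def classify(model, region):          # S5: recognize the sign class with the model
    return model["classes"][0] if model["trained"] else None

model = train_cnn(preprocess(download_training_set()))
prediction = classify(model, extract_saliency("test_picture"))
print(prediction)  # prints: warning
```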
Preferably, in step S1 the training data set is downloaded from the Traffic Sign Recognition training set of the 2015 national fuzzy image processing and content analysis contest. The traffic signs involved fall into 7 major classes: warning signs, prohibition signs, indication signs, guide signs, tourist-area signs, road construction safety signs and auxiliary signs, 72 kinds of traffic signs in total. The sign names, such as "T-junction", come from the relevant standards of the state and the Ministry of Public Security. The quality of the signs in the images covers 5 types: clear, blurred, occluded, shaded and tilted.
Preferably, the step S2 preprocessing is as follows: each image is scaled to 256 × 256, then five 224 × 224 standard-size crops are taken at the top, bottom, left, right and centre of the image, and each standard-size image is saved in a pair with its class; one "standard-size image - class" pair serves as one data item.
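The resize-and-crop step can be sketched as coordinate arithmetic: from a 256 × 256 image, five 224 × 224 crop boxes are taken at the top, bottom, left, right and centre. The exact alignment of each crop is an assumption; the patent only names the five locations:

```python
def five_crops(size=256, crop=224):
    """Return five (left, top, right, bottom) boxes of crop x crop pixels."""
    m = (size - crop) // 2          # 16: offset that centres a crop
    e = size - crop                 # 32: offset that right/bottom-aligns a crop
    return {
        "top":    (m, 0, m + crop, crop),
        "bottom": (m, e, m + crop, size),
        "left":   (0, m, crop, m + crop),
        "right":  (e, m, size, m + crop),
        "center": (m, m, m + crop, m + crop),
    }

boxes = five_crops()
for name, (l, t, r, b) in boxes.items():
    assert r - l == 224 and b - t == 224   # every crop is 224 x 224
```

Each box could then be fed, for instance, to an image library's crop routine together with its class label.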
Preferably, the step S3 convolutional neural network workflow is as follows. The input layer consists of 32 × 32 sensing nodes and receives the original image A; computation then alternates between convolution and subsampling, as detailed below:
The first hidden layer performs convolution. It consists of 8 feature maps, each feature map consists of 28 × 28 neurons, and each neuron is assigned a 5 × 5 receptive field.
The second hidden layer performs subsampling and local averaging. It likewise consists of 8 feature maps, but each feature map consists of 14 × 14 neurons. Each neuron has a 2 × 2 receptive field, a trainable bias and a sigmoid activation function; the trainable coefficient and bias control the operating point of the neuron.
The third hidden layer performs a second convolution. It consists of 20 feature maps, each made up of 10 × 10 neurons. Each neuron in this hidden layer may have synaptic connections to several feature maps of the next hidden layer, and it operates in a manner similar to the first convolutional layer.
The fourth hidden layer performs a second round of subsampling and local averaging. It consists of 20 feature maps, but each feature map consists of 5 × 5 neurons, and it operates in a manner similar to the first subsampling layer.
The fifth hidden layer performs the final stage of convolution. It consists of 120 neurons, each assigned a 5 × 5 receptive field.
Finally, a fully connected layer produces the output vector.
With successive computational layers alternating between convolution and subsampling, we obtain a "bipyramid" effect: at each convolutional or subsampling layer, as the spatial resolution decreases, the number of feature maps increases compared with the corresponding previous layer. The idea of subsampling after convolution is inspired by the "simple" cells followed by "complex" cells in the animal visual system.
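The layer dimensions quoted above follow from plain convolution and subsampling arithmetic: a valid 5 × 5 convolution shrinks each side by 4, and 2 × 2 subsampling halves it. A quick check:

```python
def conv(n, k=5):      # valid convolution: output side = n - k + 1
    return n - k + 1

def pool(n, s=2):      # 2x2 subsampling halves each side
    return n // s

side = 32                               # input layer: 32 x 32 sensing nodes
side = conv(side); assert side == 28    # 1st hidden layer: 8 maps of 28 x 28
side = pool(side); assert side == 14    # 2nd hidden layer: 8 maps of 14 x 14
side = conv(side); assert side == 10    # 3rd hidden layer: 20 maps of 10 x 10
side = pool(side); assert side == 5     # 4th hidden layer: 20 maps of 5 x 5
side = conv(side); assert side == 1     # 5th hidden layer: 120 single neurons
```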
Preferably, in step S4 saliency processing is applied to the test pictures using the region contrast (RC) method, which incorporates spatial relationships into region-level contrast computation. The saliency extraction steps with the RC method are as follows:
S4.1, the input original image A is first divided into several target regions, and then the region-level color contrast is calculated. The input picture is divided into several regions with a graph-based image segmentation method, and a color histogram is built for each region. For a region rk, its saliency is calculated by measuring its color contrast against the other regions in the image, as follows:
S(rk) = Σ_{ri ≠ rk} w(ri) Dr(rk, ri)
where w(ri) is the weight of region ri and Dr(rk, ri) is the color distance measure between the two regions; using the number of pixels in ri as w(ri) emphasizes the color contrast with bigger regions. The color distance between regions r1 and r2 is defined as follows:
Dr(r1, r2) = Σ_{i=1}^{n1} Σ_{j=1}^{n2} f(c1,i) f(c2,j) D(c1,i, c2,j)
In order to better reflect the difference between a color and the dominant colors, we use the frequency with which a color appears in its region as the weight of that color, where f(ck,i) is the occurrence frequency of the i-th color ck,i among all nk colors in the k-th region rk.
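On toy color histograms the region-contrast saliency above can be computed directly. The three-region setup and the color-distance table below are invented purely for illustration:

```python
# Regions as (pixel_count, {color: frequency}) pairs -- illustrative values only.
regions = {
    "r1": (100, {"red": 1.0}),
    "r2": (300, {"red": 0.5, "gray": 0.5}),
    "r3": (600, {"gray": 1.0}),
}
color_dist = {("red", "red"): 0.0, ("gray", "gray"): 0.0,
              ("red", "gray"): 1.0, ("gray", "red"): 1.0}

def D_r(h1, h2):
    """Color distance between two regions: sum_i sum_j f1(i) f2(j) D(i, j)."""
    return sum(f1 * f2 * color_dist[(c1, c2)]
               for c1, f1 in h1.items() for c2, f2 in h2.items())

def saliency(k):
    """S(r_k) = sum over other regions of w(r_i) * D_r(r_k, r_i), w = pixel count."""
    _, hk = regions[k]
    return sum(w_i * D_r(hk, h_i)
               for name, (w_i, h_i) in regions.items() if name != k)

# The small red region stands out against a mostly gray image.
assert saliency("r1") > saliency("r3")
```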
S4.2, on the basis of step S4.1, a spatially weighted region contrast is introduced. A weight containing spatial information is introduced; the weights are set according to the spatial distance between regions: target regions farther away are given smaller weights and nearer target regions larger weights, thereby increasing the influence of nearby regions and reducing that of distant ones. For any region rk, the saliency based on spatially weighted region contrast is defined as follows:
S(rk) = Σ_{ri ≠ rk} exp(-Ds(rk, ri)/σs²) w(ri) Dr(rk, ri)
where Ds(rk, ri) is the spatial distance between regions rk and ri and σs controls the strength of the spatial weighting; a larger σs value reduces the influence of the spatial weights, so that farther regions contribute more to the saliency of the current region. After processing, a gray-level image B is obtained.
S4.3, the salient region of the test image is extracted, giving a gray-level image B containing the traffic route sign region; next, image B is binarized to obtain a binary image C.
S4.4, the holes in the binary image C obtained in step S4.3 are filled, giving an image D. Image D is then used as a mask: the parts of the original image A where the corresponding position of D is 1 are kept and the rest is set to 0, giving a saliency image containing the traffic route sign region.
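The final masking step, keeping original pixels where the filled mask D is 1 and zeroing the rest, is an elementwise operation; a minimal sketch on nested lists:

```python
def apply_mask(image, mask):
    """Keep image pixels where mask == 1; zero out the rest."""
    return [[px if m == 1 else 0 for px, m in zip(img_row, mask_row)]
            for img_row, mask_row in zip(image, mask)]

A = [[10, 20, 30],
     [40, 50, 60]]   # original image (toy gray values)
D = [[0, 1, 1],
     [0, 0, 1]]      # hole-filled binary mask
print(apply_mask(A, D))  # [[0, 20, 30], [0, 0, 60]]
```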
Preferably, the binarization in step S4.3 uses a global threshold algorithm, the Otsu method. Otsu thresholding is based on the statistical properties of the entire image and selects the threshold automatically. Suppose the image has M gray levels in the range 0 to M-1. A gray value t is chosen within this range, dividing the image into two groups G0 and G1: G0 contains the pixels with gray values in 0 to t, and G1 those in t+1 to M-1. Let N denote the total number of image pixels and ni the number of pixels with gray value i; then the probability pi of each gray value i is:
pi = ni / N
The occurrence probability and mean of class G0 are:
w0 = Σ_{i=0}^{t} pi,  u0 = Σ_{i=0}^{t} i·pi / w0
where w0 is the occurrence probability of class G0 and u0 its mean.
The occurrence probability and mean of class G1 are:
w1 = Σ_{i=t+1}^{M-1} pi,  u1 = Σ_{i=t+1}^{M-1} i·pi / w1
where w1 is the occurrence probability of class G1 and u1 its mean.
The between-class variance is then δ(t)² = w0·w1·(u0 - u1)².
The global threshold T is the value of t that maximizes the between-class variance:
T = argmax δ(t)², t ∈ [0, M-1]
The threshold T divides the entire image into foreground and background: pixels whose gray value exceeds the threshold are set to 255, and those below it to 0. The gray-level image is thus turned into a black-and-white binary image, finally giving the binary image C.
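Otsu's threshold search over a gray-level histogram can be sketched in a few lines of pure Python; the bimodal histogram below is a made-up toy:

```python
def otsu_threshold(hist):
    """Return t maximizing the between-class variance w0*w1*(u0-u1)^2."""
    N = sum(hist)
    total_mean = sum(i * h for i, h in enumerate(hist)) / N
    best_t, best_var = 0, -1.0
    w0 = cum0 = 0.0
    for t in range(len(hist) - 1):
        w0 += hist[t] / N                 # probability of class G0 (levels 0..t)
        cum0 += t * hist[t] / N           # unnormalized mean of G0
        w1 = 1.0 - w0
        if w0 == 0 or w1 == 0:
            continue
        u0, u1 = cum0 / w0, (total_mean - cum0) / w1
        var = w0 * w1 * (u0 - u1) ** 2    # between-class variance delta(t)^2
        if var > best_var:
            best_t, best_var = t, var
    return best_t

# Toy bimodal histogram: dark peak at levels 0-3, bright peak at 12-15.
hist = [9, 8, 7, 6, 0, 0, 0, 0, 0, 0, 0, 0, 6, 7, 8, 9]
t = otsu_threshold(hist)
assert 3 <= t <= 11   # the threshold falls between the two peaks
```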
Preferably, step S5 tests the image recognition performance with the trained model: the pictures containing the road sign region obtained by the saliency extraction in S4 are input into the convolutional neural network trained in S3 for the test experiments.
Compared with the prior art, the present invention has the following advantages and beneficial effects:
1. The present invention processes the detection image with an image saliency extraction algorithm, efficiently extracts the salient region containing the traffic route sign, suppresses interference from non-salient regions, better highlights salient targets under complex backgrounds, improves processing efficiency and accuracy, offers better robustness, and is more suitable for practical intelligent road traffic systems.
2. The present invention uses a convolutional neural network model. With its special structure of locally shared weights, a convolutional neural network has unique advantages in speech recognition and image processing; its layout is closer to an actual biological neural network, weight sharing reduces the complexity of the network, and in particular the fact that multi-dimensional input images can be fed directly into the network avoids the complexity of data reconstruction during feature extraction and classification.
3. The present invention breaks the prior-art limitation of single techniques and few intelligent methods by combining an image processing method with a deep learning method.
Brief description of the drawings
Fig. 1 is the detailed flow chart of the embodiment of the present invention;
Fig. 2 is the concrete structure of the convolutional neural network of the embodiment of the present invention;
Fig. 3 is the flow chart of saliency extraction of the traffic route sign in the embodiment of the present invention.
Embodiment
The present invention is described in further detail below with reference to the embodiment and the accompanying drawings, but the embodiments of the present invention are not limited thereto.
Embodiment
Fig. 1 shows the flow chart of the method of the present invention, comprising four parts: image saliency extraction, normalization, training the convolutional neural network model and testing the convolutional network model. The concrete steps are as follows:
1. Download the training data set. The data set is the Traffic Sign Recognition training set of the 2015 national fuzzy image processing and content analysis contest. The traffic signs involved fall into 7 major classes: warning signs, prohibition signs, indication signs, guide signs, tourist-area signs, road construction safety signs and auxiliary signs, 72 kinds of traffic signs in total. The sign names, such as "T-junction", come from the relevant standards of the state and the Ministry of Public Security. The quality of the signs in the images covers 5 types: clear, blurred, occluded, shaded and tilted.
2. Preprocessing. Each image is scaled to 256 × 256, then five 224 × 224 standard-size crops are taken at the top, bottom, left, right and centre of the image, and each standard-size image is saved in a pair with its class; one "standard-size image - class" pair serves as one data item.
3. Train the convolutional neural network model. The input layer of the convolutional neural network consists of 32 × 32 sensing nodes and receives the original image A; computation then alternates between convolution and subsampling. Fig. 2 shows the concrete structure of the convolutional neural network of this embodiment, described as follows:
The first hidden layer performs convolution. It consists of 8 feature maps, each feature map consists of 28 × 28 neurons, and each neuron is assigned a 5 × 5 receptive field.
The second hidden layer performs subsampling and local averaging. It likewise consists of 8 feature maps, but each feature map consists of 14 × 14 neurons. Each neuron has a 2 × 2 receptive field, a trainable bias and a sigmoid activation function; the trainable coefficient and bias control the operating point of the neuron.
The third hidden layer performs a second convolution. It consists of 20 feature maps, each made up of 10 × 10 neurons. Each neuron in this hidden layer may have synaptic connections to several feature maps of the next hidden layer, and it operates in a manner similar to the first convolutional layer.
The fourth hidden layer performs a second round of subsampling and local averaging. It consists of 20 feature maps, but each feature map consists of 5 × 5 neurons, and it operates in a manner similar to the first subsampling layer.
The fifth hidden layer performs the final stage of convolution. It consists of 120 neurons, each assigned a 5 × 5 receptive field.
Finally, a fully connected layer produces the output vector.
With successive computational layers alternating between convolution and subsampling, we obtain a "bipyramid" effect: at each convolutional or subsampling layer, as the spatial resolution decreases, the number of feature maps increases compared with the corresponding previous layer. The idea of subsampling after convolution is inspired by the "simple" cells followed by "complex" cells in the animal visual system.
4. Saliency extraction, using the contrast analysis method: region contrast (RC) incorporates spatial relationships into region-level contrast computation. Fig. 3 shows the flow chart of saliency extraction of the traffic route sign in this embodiment; the detailed saliency extraction process is as follows:
4.1. The input image A is first divided into several target regions, and then the region-level color contrast is calculated. The input picture is divided into several regions with a graph-based image segmentation method, and a color histogram is built for each region. For a region rk, its saliency is calculated by measuring its color contrast against the other regions in the image, as follows:
S(rk) = Σ_{ri ≠ rk} w(ri) Dr(rk, ri)
where S(rk) is the saliency value of region rk in the image, Dr(rk, ri) is the color distance measure between regions rk and ri in Lab space, and w(ri) is the number of pixels in ri, used to emphasize the color contrast with bigger regions. The color distance between regions r1 and r2 is defined as follows:
Dr(r1, r2) = Σ_{i=1}^{n1} Σ_{j=1}^{n2} f(c1,i) f(c2,j) D(c1,i, c2,j)
In order to better reflect the difference between a color and the dominant colors, we use the frequency with which a color appears in its region as the weight of that color, where f(ck,i) is the occurrence frequency of the i-th color ck,i among all nk colors in the k-th region rk. To improve region storage and computational efficiency, the present invention uses a sparse histogram representation for efficient storage and computation.
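The sparse-histogram storage mentioned above can be sketched with a dictionary that only stores the colors actually present in a region; the quantization scheme and the sample pixels are assumptions for illustration:

```python
from collections import Counter

def sparse_histogram(pixels, levels=4, channel_max=256):
    """Quantize each (r, g, b) pixel into levels^3 bins and store only occupied bins."""
    step = channel_max // levels
    counts = Counter((r // step, g // step, b // step) for r, g, b in pixels)
    n = len(pixels)
    return {bin_: c / n for bin_, c in counts.items()}   # bin -> frequency f(c)

region = [(250, 10, 10), (245, 12, 8), (30, 30, 200)]    # two reds, one blue
h = sparse_histogram(region)
assert len(h) == 2                     # only 2 of the 64 possible bins are stored
assert abs(sum(h.values()) - 1.0) < 1e-9
```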
4.2. On the basis of step 4.1, a spatially weighted region contrast is introduced. We introduce a weight containing spatial information; the weights are set according to the spatial distance between regions: target regions farther away are given smaller weights and nearer target regions larger weights, thereby increasing the influence of nearby regions and reducing that of distant ones. For any region rk, the saliency based on spatially weighted region contrast is defined as follows:
S(rk) = Σ_{ri ≠ rk} exp(-Ds(rk, ri)/σs²) w(ri) Dr(rk, ri)
where Ds(rk, ri) is the spatial distance between regions rk and ri, defined as the Euclidean distance between the centers of gravity of the respective regions, and σs controls the strength of the spatial weighting. A larger σs value reduces the influence of the spatial weights, so that farther regions contribute more to the saliency of the current region. After processing with the σs value used in the present invention, a gray-level image B is obtained.
4.3. The salient region of the test image is extracted, giving a gray-level image B containing the traffic route sign region; next, image B is binarized to obtain a binary image C. Specifically:
The binarization uses a global threshold algorithm, the Otsu method. Otsu thresholding is based on the statistical properties of the entire image and selects the threshold automatically. Suppose the image has M gray levels in the range 0 to M-1. A gray value t is chosen within this range, dividing the image into two groups G0 and G1: G0 contains the pixels with gray values in 0 to t, and G1 those in t+1 to M-1. Let N denote the total number of image pixels and ni the number of pixels with gray value i; then the probability pi of each gray value i is:
pi = ni / N
The occurrence probability and mean of class G0 are:
w0 = Σ_{i=0}^{t} pi,  u0 = Σ_{i=0}^{t} i·pi / w0
where w0 is the occurrence probability of class G0 and u0 its mean.
The occurrence probability and mean of class G1 are:
w1 = Σ_{i=t+1}^{M-1} pi,  u1 = Σ_{i=t+1}^{M-1} i·pi / w1
where w1 is the occurrence probability of class G1 and u1 its mean.
The between-class variance is then δ(t)² = w0·w1·(u0 - u1)².
The global threshold T is the value of t that maximizes the between-class variance:
T = argmax δ(t)², t ∈ [0, M-1]
The threshold T divides the entire image into foreground and background: pixels whose gray value exceeds the threshold are set to 255, and those below it to 0. The gray-level image is thus turned into a black-and-white binary image, finally giving the binary image C.
4.4th, the holes filling in the binary image C for obtaining step 4.3, obtains image D, then using image D as covering
Mould, retains the part that D correspondence positions are 1, other are set to 0, obtain including traffic route mark region in original image A
Saliency maps picture.
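Hole filling in a binary image can be done by flood-filling the background from the image border and then marking everything not reached as foreground. A minimal pure-Python sketch (the patent does not specify the algorithm, so this flood-fill approach is an assumption):

```python
from collections import deque

def fill_holes(img):
    """Set to 1 any 0-region not connected to the image border (4-connectivity)."""
    h, w = len(img), len(img[0])
    outside = [[False] * w for _ in range(h)]
    queue = deque((y, x) for y in range(h) for x in range(w)
                  if img[y][x] == 0 and (y in (0, h - 1) or x in (0, w - 1)))
    for y, x in queue:                     # border background seeds the flood fill
        outside[y][x] = True
    while queue:
        y, x = queue.popleft()
        for ny, nx in ((y - 1, x), (y + 1, x), (y, x - 1), (y, x + 1)):
            if 0 <= ny < h and 0 <= nx < w and img[ny][nx] == 0 and not outside[ny][nx]:
                outside[ny][nx] = True
                queue.append((ny, nx))
    return [[0 if outside[y][x] else 1 for x in range(w)] for y in range(h)]

C = [[0, 0, 0, 0, 0],
     [0, 1, 1, 1, 0],
     [0, 1, 0, 1, 0],     # a 1-ring with a hole in the middle
     [0, 1, 1, 1, 0],
     [0, 0, 0, 0, 0]]
D = fill_holes(C)
assert D[2][2] == 1       # the interior hole is filled
assert D[0][0] == 0       # border background stays background
```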
5. Test the image recognition performance with the trained model: the pictures containing the road sign region obtained by the saliency extraction in step 4 are input into the convolutional neural network trained in step 3 for the test experiments.
The above embodiment is a preferred embodiment of the present invention, but the embodiments of the present invention are not limited by it. Any other change, modification, substitution, combination or simplification made without departing from the spirit and principle of the present invention shall be an equivalent replacement and is included within the protection scope of the present invention.
Claims (7)
1. A traffic route sign recognition method based on image saliency and deep learning, characterized by comprising the following steps:
S1, download the training data set;
S2, preprocessing: preprocess the images in the data set of step S1;
S3, train the convolutional neural network model;
S4, saliency extraction: extract the road signs in the test pictures with the region contrast method;
S5, input the road sign pictures extracted in step S4 into the convolutional neural network model trained in step S3, and identify the class of the traffic route sign image with the convolutional neural network model.
2. The traffic route sign recognition method based on image saliency and deep learning according to claim 1, characterized in that in step S1 the training data set is downloaded from the Traffic Sign Recognition training set of the 2015 national fuzzy image processing and content analysis contest; the traffic signs involved fall into 7 major classes: warning signs, prohibition signs, indication signs, guide signs, tourist-area signs, road construction safety signs and auxiliary signs; the quality of the signs in the images covers 5 types: clear, blurred, occluded, shaded and tilted.
3. The traffic route sign recognition method based on image saliency and deep learning according to claim 1, characterized in that the detailed process of the step S2 preprocessing is as follows: each image is scaled to 256 × 256, then five 224 × 224 standard-size crops are taken at the top, bottom, left, right and centre of the image, and each standard-size image is saved in a pair with its class; one "standard-size image - class" pair serves as one data item.
4. The traffic route sign recognition method based on image saliency and deep learning according to claim 1, characterized in that the step S3 convolutional neural network workflow is as follows: the input layer consists of 32 × 32 sensing nodes and receives the original image A; computation then alternates between convolution and subsampling, as detailed below:
the first hidden layer performs convolution; it consists of 8 feature maps, each feature map consists of 28 × 28 neurons, and each neuron is assigned a 5 × 5 receptive field;
the second hidden layer performs subsampling and local averaging; it likewise consists of 8 feature maps, but each feature map consists of 14 × 14 neurons; each neuron has a 2 × 2 receptive field, a trainable bias and a sigmoid activation function; the trainable coefficient and bias control the operating point of the neuron;
the third hidden layer performs a second convolution; it consists of 20 feature maps, each made up of 10 × 10 neurons; each neuron in this hidden layer may have synaptic connections to several feature maps of the next hidden layer, and it operates in a manner similar to the first convolutional layer;
the fourth hidden layer performs a second round of subsampling and local averaging; it consists of 20 feature maps, but each feature map consists of 5 × 5 neurons, and it operates in a manner similar to the first subsampling layer;
the fifth hidden layer performs the final stage of convolution; it consists of 120 neurons, each assigned a 5 × 5 receptive field;
finally, a fully connected layer produces the output vector.
5. The traffic route sign recognition method based on image saliency and deep learning according to claim 1, characterized in that in step S4 saliency processing is applied to the test pictures using the region contrast (RC) method, which incorporates spatial relationships into region-level contrast computation; the saliency extraction steps with the RC method are as follows:
S4.1, the input original image A is first divided into several target regions, and then the region-level color contrast is calculated; the input picture is divided into several regions with a graph-based image segmentation method, and a color histogram is built for each region; for a region rk, its saliency is calculated by measuring its color contrast against the other regions in the image, as follows:
S(rk) = Σ_{ri ≠ rk} w(ri) Dr(rk, ri)
where w(ri) is the weight of region ri and Dr(rk, ri) is the color distance measure between the two regions; using the number of pixels in ri as w(ri) emphasizes the color contrast with bigger regions; the color distance between regions r1 and r2 is defined as follows:
Dr(r1, r2) = Σ_{i=1}^{n1} Σ_{j=1}^{n2} f(c1,i) f(c2,j) D(c1,i, c2,j)
in order to better reflect the difference between a color and the dominant colors, the frequency with which a color appears in its region is used as the weight of that color, where f(ck,i) is the occurrence frequency of the i-th color ck,i among all nk colors in the k-th region rk;
S4.2, on the basis of step S4.1, a spatially weighted region contrast is introduced; a weight containing spatial information is introduced, the weights being set according to the spatial distance between regions: target regions farther away are given smaller weights and nearer target regions larger weights, thereby increasing the influence of nearby regions and reducing that of distant ones; for any region rk, the saliency based on spatially weighted region contrast is defined as follows:
S(rk) = Σ_{ri ≠ rk} exp(-Ds(rk, ri)/σs²) w(ri) Dr(rk, ri)
where D_s(r_k, r_i) is the spatial distance between regions r_k and r_i, and σ_s controls the strength of the spatial weighting; a larger σ_s weakens the spatial weights, so that more distant regions contribute more to the saliency of the current region. After this processing the grayscale image B is obtained;
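The spatially weighted saliency S(r_k) above can be sketched directly from the formula. All parameter names below are illustrative assumptions (region centroids with normalized coordinates, per-region pixel counts as w(r_i), and a precomputed D_r matrix), not names from the patent.

```python
import numpy as np

def saliency_swrc(region_idx, centroids, pixel_counts, color_dists, sigma_s=0.4):
    """Spatially weighted region-contrast saliency S(r_k).

    centroids    : (R, 2) array of region centroids, coordinates in [0, 1]
    pixel_counts : (R,) array, w(r_i) = number of pixels in region i
    color_dists  : (R, R) array, color_dists[k, i] = D_r(r_k, r_i)
    sigma_s      : controls the strength of the spatial weighting
    """
    k = region_idx
    s = 0.0
    for i in range(len(centroids)):
        if i == k:
            continue
        d_s = float(np.linalg.norm(centroids[k] - centroids[i]))  # D_s(r_k, r_i)
        spatial_w = np.exp(-d_s / sigma_s**2)                     # exp(-D_s / sigma_s^2)
        s += spatial_w * pixel_counts[i] * color_dists[k, i]      # * w(r_i) * D_r(r_k, r_i)
    return s
```

Note the effect of σ_s described in the text: as σ_s grows, the exponential factor approaches 1 for all regions, so distant regions contribute nearly as much as nearby ones.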
S4.3: extract the salient region of the test image to obtain the grayscale image B containing the traffic road sign region, then binarize B to obtain the binary image C;
S4.4: fill the holes in the binary image C obtained in step S4.3 to obtain image D; then, using D as a mask, keep the parts of the original image A whose corresponding positions in D are 1 and set all others to 0, obtaining a saliency image containing the traffic road sign region.
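Steps S4.3–S4.4 (hole filling and masking) can be sketched as follows. The patent does not specify the hole-filling algorithm; the flood-fill-from-border approach below is one standard way to realize it, and the function names are illustrative.

```python
from collections import deque

import numpy as np

def fill_holes(binary):
    """Fill holes in a binary image: any background region not connected to
    the image border becomes foreground (morphological hole filling)."""
    h, w = binary.shape
    outside = np.zeros((h, w), dtype=bool)
    q = deque()
    # Seed the flood fill with every background pixel on the image border.
    for r in range(h):
        for c in range(w):
            if (r in (0, h - 1) or c in (0, w - 1)) and not binary[r, c]:
                outside[r, c] = True
                q.append((r, c))
    # 4-connected flood fill marks all background reachable from outside.
    while q:
        r, c = q.popleft()
        for dr, dc in ((1, 0), (-1, 0), (0, 1), (0, -1)):
            nr, nc = r + dr, c + dc
            if 0 <= nr < h and 0 <= nc < w and not binary[nr, nc] and not outside[nr, nc]:
                outside[nr, nc] = True
                q.append((nr, nc))
    # Holes are background pixels NOT reachable from outside.
    return binary | ~outside

def mask_salient_region(original_a, binary_c):
    """S4.4: use the hole-filled map D as a mask, keeping the pixels of the
    original image A where D is 1 and zeroing the rest."""
    d = fill_holes(binary_c.astype(bool))  # image D
    return np.where(d, original_a, 0)
```

For a ring-shaped blob in C, the enclosed interior is filled in D, so the masked output keeps the whole disk of original pixels.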
6. The traffic road sign recognition method based on image saliency and deep learning according to claim 5, characterized in that in step S4.3 the binarization uses a global-threshold algorithm, the Otsu method. Otsu's method selects the threshold automatically, based on the statistical properties of the entire image. Suppose the image has M gray levels, with values ranging from 0 to M-1. A gray value t chosen in this range divides the image into two groups G_0 and G_1: G_0 contains the pixels with gray values in 0 to t, and G_1 those with gray values in t+1 to M-1. Let N denote the total number of pixels in the image and n_i the number of pixels with gray value i; then the probability p_i of each gray value i occurring is:
p_i = \frac{n_i}{N};
The occurrence probability and mean of class G_0 are:
w_0 = \sum_{i=0}^{t} p_i, \qquad u_0 = \frac{1}{w_0} \sum_{i=0}^{t} i \, p_i
where w_0 is the occurrence probability of class G_0 and u_0 is its mean;
The occurrence probability and mean of class G_1 are:
w_1 = \sum_{i=t+1}^{M-1} p_i = 1 - w_0, \qquad u_1 = \frac{1}{w_1} \sum_{i=t+1}^{M-1} i \, p_i
where w_1 is the occurrence probability of class G_1 and u_1 is its mean;
The between-class variance is then δ(t)² = w_0 w_1 (u_0 - u_1)²;
The global threshold T is the value of t that maximizes the between-class variance:
T = argmax δ(t)², t ∈ [0, M-1]
The threshold T divides the entire image into foreground and background: every pixel whose gray value exceeds the threshold is set to 255, and every pixel at or below it is set to 0. The grayscale image is thus turned into a purely black-and-white image, finally yielding the binary image C.
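The Otsu procedure of claim 6 maps directly to code. The sketch below is a direct, unoptimized transcription of the formulas above (histogram probabilities p_i, class statistics w_0, u_0, w_1, u_1, and the variance-maximizing threshold); function names are illustrative.

```python
import numpy as np

def otsu_threshold(gray, m=256):
    """Global Otsu threshold: the t in [0, M-1] maximizing the between-class
    variance w0 * w1 * (u0 - u1)^2."""
    n = np.bincount(gray.ravel(), minlength=m).astype(float)
    p = n / n.sum()                                        # p_i = n_i / N
    best_t, best_var = 0, -1.0
    for t in range(m - 1):
        w0 = p[: t + 1].sum()                              # probability of G0
        w1 = 1.0 - w0                                      # probability of G1
        if w0 == 0.0 or w1 == 0.0:
            continue                                       # one class is empty
        u0 = (np.arange(t + 1) * p[: t + 1]).sum() / w0    # mean of G0
        u1 = (np.arange(t + 1, m) * p[t + 1:]).sum() / w1  # mean of G1
        var = w0 * w1 * (u0 - u1) ** 2                     # delta(t)^2
        if var > best_var:
            best_t, best_var = t, var
    return best_t

def binarize(gray, m=256):
    """Pixels above the threshold become 255, the rest 0 (binary image C)."""
    t = otsu_threshold(gray, m)
    return np.where(gray > t, 255, 0).astype(np.uint8)
```

In practice one would call an optimized library routine (e.g. scikit-image's `threshold_otsu` or OpenCV's `THRESH_OTSU`), which computes the same quantity.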
7. The traffic road sign recognition method based on image saliency and deep learning according to claim 1, characterized in that step S5 tests the recognition performance with the trained model: the images containing road sign regions produced by the saliency extraction of step S4 are fed into the convolutional neural network trained in step S3 for test experiments.
Priority Applications (1)
Application Number | Priority Date | Filing Date | Title |
---|---|---|---|
CN201710122736.XA CN107122701A (en) | 2017-03-03 | 2017-03-03 | A kind of traffic route sign based on saliency and deep learning |
Publications (1)
Publication Number | Publication Date |
---|---|
CN107122701A true CN107122701A (en) | 2017-09-01 |
Family
ID=59718198
Family Applications (1)
Application Number | Title | Priority Date | Filing Date |
---|---|---|---|
CN201710122736.XA Pending CN107122701A (en) | 2017-03-03 | 2017-03-03 | A kind of traffic route sign based on saliency and deep learning |
Country Status (1)
Country | Link |
---|---|
CN (1) | CN107122701A (en) |
Cited By (12)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
CN107703936A (en) * | 2017-09-22 | 2018-02-16 | 南京轻力舟智能科技有限公司 | Automatic Guided Vehicle system and dolly localization method based on convolutional neural networks |
CN108154102A (en) * | 2017-12-21 | 2018-06-12 | 安徽师范大学 | A kind of traffic sign recognition method |
CN108256467A (en) * | 2018-01-15 | 2018-07-06 | 河北科技大学 | A kind of method for traffic sign detection of view-based access control model attention mechanism and geometric properties |
CN109102493A (en) * | 2018-07-03 | 2018-12-28 | 柳州市木子科技有限公司 | A kind of automobile metal plate work flaw detection system based on CNN and LR |
CN109636881A (en) * | 2018-12-19 | 2019-04-16 | 沈阳天择智能交通工程有限公司 | Based on AI identification technology traffic accident situ sketch drafting method |
CN109711399A (en) * | 2018-11-05 | 2019-05-03 | 北京三快在线科技有限公司 | Shop recognition methods based on image, device, electronic equipment |
CN110060238A (en) * | 2019-04-01 | 2019-07-26 | 桂林电子科技大学 | Pcb board based on deep learning marks print quality inspection method |
CN110321905A (en) * | 2019-07-11 | 2019-10-11 | 广东工业大学 | Abnormal area detection method, system and associated component based on semantic segmentation |
CN111257326A (en) * | 2020-01-22 | 2020-06-09 | 重庆大学 | Metal processing area extraction method |
CN112734733A (en) * | 2021-01-12 | 2021-04-30 | 天津大学 | Non-reference image quality monitoring method based on channel recombination and feature fusion |
WO2022078216A1 (en) * | 2020-10-14 | 2022-04-21 | 华为云计算技术有限公司 | Target recognition method and device |
CN114973207A (en) * | 2022-08-01 | 2022-08-30 | 成都航空职业技术学院 | Road sign identification method based on target detection |
Citations (3)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
CN103955718A (en) * | 2014-05-15 | 2014-07-30 | 厦门美图之家科技有限公司 | Image subject recognition method |
CN105590319A (en) * | 2015-12-18 | 2016-05-18 | 华南理工大学 | Method for detecting image saliency region for deep learning |
CN105931255A (en) * | 2016-05-18 | 2016-09-07 | 天津工业大学 | Method for locating target in image based on obviousness and deep convolutional neural network |
Non-Patent Citations (4)
Title |
---|
L.M. RASDI RERE et al.: "Metaheuristic Algorithms for Convolution Neural Networks", arXiv *
JING Huifang: "Research on Deep Neural Networks and Their Application to Plant Leaf Image Recognition", China Masters' Theses Full-text Database, Information Science and Technology *
ZHU Chi et al.: "Saliency Region Detection Algorithm Based on Improved Graph Cuts", Computer Engineering and Design *
HUANG Lin: "Research on Traffic Sign Recognition Methods Based on Deep Neural Networks", Wanfang Data Knowledge Service Platform *
Cited By (18)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
CN107703936A (en) * | 2017-09-22 | 2018-02-16 | 南京轻力舟智能科技有限公司 | Automatic Guided Vehicle system and dolly localization method based on convolutional neural networks |
CN108154102A (en) * | 2017-12-21 | 2018-06-12 | 安徽师范大学 | A kind of traffic sign recognition method |
CN108256467B (en) * | 2018-01-15 | 2020-06-05 | 河北科技大学 | Traffic sign detection method based on visual attention mechanism and geometric features |
CN108256467A (en) * | 2018-01-15 | 2018-07-06 | 河北科技大学 | A kind of method for traffic sign detection of view-based access control model attention mechanism and geometric properties |
CN109102493A (en) * | 2018-07-03 | 2018-12-28 | 柳州市木子科技有限公司 | A kind of automobile metal plate work flaw detection system based on CNN and LR |
CN109711399A (en) * | 2018-11-05 | 2019-05-03 | 北京三快在线科技有限公司 | Shop recognition methods based on image, device, electronic equipment |
CN109711399B (en) * | 2018-11-05 | 2021-04-27 | 北京三快在线科技有限公司 | Shop identification method and device based on image and electronic equipment |
CN109636881A (en) * | 2018-12-19 | 2019-04-16 | 沈阳天择智能交通工程有限公司 | Based on AI identification technology traffic accident situ sketch drafting method |
CN110060238A (en) * | 2019-04-01 | 2019-07-26 | 桂林电子科技大学 | Pcb board based on deep learning marks print quality inspection method |
CN110060238B (en) * | 2019-04-01 | 2022-04-19 | 桂林电子科技大学 | PCB label printing quality detection method based on deep learning |
CN110321905A (en) * | 2019-07-11 | 2019-10-11 | 广东工业大学 | Abnormal area detection method, system and associated component based on semantic segmentation |
CN111257326A (en) * | 2020-01-22 | 2020-06-09 | 重庆大学 | Metal processing area extraction method |
CN111257326B (en) * | 2020-01-22 | 2021-02-26 | 重庆大学 | Metal processing area extraction method |
WO2022078216A1 (en) * | 2020-10-14 | 2022-04-21 | 华为云计算技术有限公司 | Target recognition method and device |
CN112734733A (en) * | 2021-01-12 | 2021-04-30 | 天津大学 | Non-reference image quality monitoring method based on channel recombination and feature fusion |
CN112734733B (en) * | 2021-01-12 | 2022-11-01 | 天津大学 | Non-reference image quality monitoring method based on channel recombination and feature fusion |
CN114973207A (en) * | 2022-08-01 | 2022-08-30 | 成都航空职业技术学院 | Road sign identification method based on target detection |
CN114973207B (en) * | 2022-08-01 | 2022-10-21 | 成都航空职业技术学院 | Road sign identification method based on target detection |
Similar Documents
Publication | Publication Date | Title |
---|---|---|
CN107122701A (en) | A kind of traffic route sign based on saliency and deep learning | |
CN110163187B (en) | F-RCNN-based remote traffic sign detection and identification method | |
CN109977812B (en) | Vehicle-mounted video target detection method based on deep learning | |
CN109165623B (en) | Rice disease spot detection method and system based on deep learning | |
CN109800736A (en) | A kind of method for extracting roads based on remote sensing image and deep learning | |
CN103034863B (en) | The remote sensing image road acquisition methods of a kind of syncaryon Fisher and multiple dimensioned extraction | |
CN110619282A (en) | Automatic extraction method for unmanned aerial vehicle orthoscopic image building | |
CN109949316A (en) | A kind of Weakly supervised example dividing method of grid equipment image based on RGB-T fusion | |
CN106326858A (en) | Road traffic sign automatic identification and management system based on deep learning | |
CN107609602A (en) | A kind of Driving Scene sorting technique based on convolutional neural networks | |
CN101814144B (en) | Water-free bridge target identification method in remote sensing image | |
CN107392130A (en) | Classification of Multispectral Images method based on threshold adaptive and convolutional neural networks | |
CN105975931A (en) | Convolutional neural network face recognition method based on multi-scale pooling | |
CN108491797A (en) | A kind of vehicle image precise search method based on big data | |
CN109598241A (en) | Satellite image marine vessel recognition methods based on Faster R-CNN | |
CN109214308A (en) | A kind of traffic abnormity image identification method based on focal loss function | |
CN109325438A (en) | The real-time identification method of live panorama traffic sign | |
CN107169985A (en) | A kind of moving target detecting method based on symmetrical inter-frame difference and context update | |
CN108154102A (en) | A kind of traffic sign recognition method | |
CN109543632A (en) | A kind of deep layer network pedestrian detection method based on the guidance of shallow-layer Fusion Features | |
CN109766936A (en) | Image change detection method based on information transmitting and attention mechanism | |
CN108629368B (en) | Multi-modal foundation cloud classification method based on joint depth fusion | |
CN104504389A (en) | Satellite cloud amount computing method based on convolution neural network | |
CN103984963B (en) | Method for classifying high-resolution remote sensing image scenes | |
CN107038416A (en) | A kind of pedestrian detection method based on bianry image modified HOG features |
Legal Events
Date | Code | Title | Description |
---|---|---|---|
PB01 | Publication | ||
SE01 | Entry into force of request for substantive examination | ||
RJ01 | Rejection of invention patent application after publication | Application publication date: 20170901 |