CN107563496A

CN107563496A - A kind of deep learning mode identification method of vectorial core convolutional neural networks

Info

Publication number: CN107563496A
Application number: CN201710665575.9A
Authority: CN
Inventors: 李玉鑑; 欧军
Original assignee: Beijing University of Technology
Current assignee: Beijing University of Technology
Priority date: 2017-08-07
Filing date: 2017-08-07
Publication date: 2018-01-09

Abstract

The invention discloses a kind of deep learning mode identification method of vectorial core convolutional neural networks, methods described includes：(1) structure of design vector core convolutional neural networks, including input layer, convolutional layer, at least one full articulamentum and soft maximum grader output layer；Wherein, input is the image that size is m × n, and the core of l layers is all p_lThe vector of × 1 size is all 1 × q_lThe vector of size, N_lRepresent the convolution kernel number of l layers；Rule of thumb select p_l(or q_l) and N_lValue, the convolution algorithm step-length of l layers is s_l(s_l>=1), activation primitive all elects correction linear unit as；(2) iterations of convolutional neural networks is set, cost function is selected, utilizes training sample { (x¹,y¹),…,(x^D,y^D), according to back-propagation algorithm, the parameter of vectorial core convolutional neural networks is learnt；(3) judge whether to complete iterations, if it is not complete, then continuing to train；If completing iterations, tested in the network model that test sample input is trained, obtain test result.

Description

A kind of deep learning mode identification method of vectorial core convolutional neural networks

Technical field

The invention belongs to the technical field in pattern-recognition and deep learning.Particular content is a kind of vectorial core convolutional Neural The deep learning mode identification method of network.

Background technology

Deep learning is one new research direction of artificial intelligence field, in recent years in image recognition, computer vision etc. The progress of making a breakthrough property in multiclass application.With the continuous development of deep learning, autocoder is generated (AutoEncoder, AE), degree of deeply convinceing neutral net (Deep Belief Networks, DBNs), convolutional neural networks Deep-neural-network models such as (Convolutional Neural Networks, CNN).Wherein because there are CNN weights to be total to Enjoy, local receptor field, dimensionality reduction the features such as, be widely used.But CNN weight data amount is larger, with the number of plies Increase, the training time is more, and consumption calculations resource is larger.

The content of the invention

The main object of the present invention is to provide a kind of deep learning mode identification method of vectorial core convolutional neural networks, purport The data volume of weights and training time are being reduced, is reducing the loss of computing resource.

To achieve the above object, the technical solution adopted by the present invention is a kind of deep learning of vectorial core convolutional neural networks Mode identification method, this method comprise the following steps：

(1) structure of design vector core convolutional neural networks, the structure include input layer L, 1≤L≤2000 convolutional layer, At least one full articulamentum and soft maximum (Softmax) grader output layer；Wherein, include input layer L inputs is that size is M × n, m, the image of n >=1,1≤l of l layers≤L core is all p_l× 1,1≤p_lThe vector of≤m sizes is all 1 × q_l, 1 ≤q_lThe vector of≤n sizes, N_lRepresent the convolution kernel number of l layers, 1≤N_l≤5000；Rule of thumb select p_lAnd N_lOr q_lWith N_lValue, the convolution algorithm step-length of l layers is s_l, s_l>=1, activation primitive all elects correction linear unit (Rectified as Linear Units, ReLU)；

(2) the iterations Epochs of convolutional neural networks is set, 1≤Epochs≤20000000, selects cost function, Utilize training sample { (x¹,y¹),…,(x^D,y^D), according to back-propagation algorithm, the parameter of vectorial core convolutional neural networks is entered Row study；

(3) judge whether to complete iterations, if it is not complete, then continuing to train；, will if completing iterations Tested in the network model that test sample input trains, obtain test result.

If the input of Softmax grader output layers is U=(u₁,…u_m)^T, export as O=(o₁,…o_m)^T, then O= Softmax (U), i.e.,

Wherein k is labeling number.Activation primitive ReLu expression formulas are：F (x)=max (0, x), wherein x are the defeated of each layer Go out value, f (x) represents the output valve of activation primitive.

The present invention has the following advantages that compared with prior art：

Present invention employs Vector convolution core, reduces data volume and the training time of weights, reduces computed losses, add Fast convergence rate.For the present invention in experiment test, the index of experimental results is better than existing convolutional neural networks, energy It is enough that preferable recognition effect is obtained in image recognition.

Brief description of the drawings

Fig. 1 is the network structure of the present invention.

Fig. 2 is the network training method flow chart of the present invention.

Embodiment

Below in conjunction with the accompanying drawings and specific implementation case the invention will be further described.It should be understood that following case study on implementation Described embodiment does not represent the consistent all embodiments of the present invention.In addition, it is to be understood that reading the present invention After the content of instruction, those skilled in the art can make various changes or modifications to the present invention, and these equivalent form of values equally fall In the application appended claims limited range.

The present invention provides a kind of deep learning mode identification method of vectorial core convolutional neural networks, applied to handwritten numeral Identify in case study on implementation, comprise the following steps：

(1) structure of design vector core convolutional neural networks, including input layer, individual convolutional layers of L (1≤L≤2000), at least One full articulamentum and soft maximum (Softmax) grader output layer；Wherein, input is that size is m × n (m, n >=1) Image, the core of l layers (1≤l≤L) is all p_l×1(1≤p_l≤ m) size vector or be all 1 × q_l(1≤q_l≤ n) it is big Small vector, N_l(1≤N_l≤ 5000) the convolution kernel number of l layers is represented；Rule of thumb select p_l(or q_l) and N_lValue, l The convolution algorithm step-length of layer is s_l(s_l>=1), activation primitive all elect as correction linear unit (Rectified Linear Units, ReLU)；

Specifically, the vectorial core convolutional neural networks of the implementation case design include：One input layer, five convolutional layers, One full articulamentum, a Softmax grader output layer.0th layer is input layer, and input is the gray-scale map that size is 28 × 28 Picture；1st layer of core size is all 6 × 1, and core number is 32, step-length 1；2nd layer of core size is all 1 × 6, and core number is 32, Step-length is 1；3rd layer of core size is all 1 × 5, and core number is 48, step-length 2；4th layer of core size is all 1 × 5, core number For 48, step-length 1；5th layer of core size is all 1 × 1, and core number is 64, step-length 2；If Softmax grader output layers Input as U=(u₁,…u_m)^T, export as O=(o₁,…o_m)^T, then O=Softmax (U), i.e.,

(2) the iterations Epochs, 1≤Epochs of setting convolutional neural networks≤20000000 select cost function, Utilize training sample { (x¹,y¹),…,(x^D,y^D), according to back-propagation algorithm, the parameter of vectorial core convolutional neural networks is entered Row study；

Specifically, the iterations in the implementation case is set to 10000 times；The implementation case uses MNIST data sets, it It is to be built by the CorinnaCortes in Google laboratories and the YannLeCun of Ke Lang research institutes of New York University, wherein 60000 Individual training sample, 10000 test samples.For the input picture of input layer as described in (1), output classification is 10 classes (digital 0-9 marks Know).The test environment of the present invention is windows7, and 64 bit manipulation systems, program code is to use python3.5+tensorflow Write；Hardware environment is NVIDIATeslaK40C GPU (two pieces), and interior save as is run in the environment of 128G.In network mould In type training process, cost function elects softmax_cross_entropy_with_logits () as, is calculated using backpropagation The stochastic gradient descent version of method is updated to weights and biasing；And in order to prevent over-fitting, used in full articulamentum Dropout methods abandon neuron according to 0.25 probability.

Emulation experiment shows：The present invention compares traditional convolutional neural networks, has the smaller (present invention of weight data amount： 928, CNN:3952), the training time less (present invention：0.69H,CNN:0.87H), the higher (present invention of discrimination：99.56%, CNN:99.55%) the characteristics of.

The experiment effect of the present invention is as shown in table 1：

Parameter and the Experimental comparison results of the implementation case network model of table 1 and traditional convolutional neural networks

Remarks：Numeral " 300 " in full articulamentum refers to neuron number, and " H " represents hour.

Claims

A kind of 1. deep learning mode identification method of vectorial core convolutional neural networks, it is characterised in that：This method includes following Step, the structure of (1) design vector core convolutional neural networks, the structure include input layer L, 1≤L≤2000 convolutional layer, extremely A few full articulamentum and soft maximum grader output layer；Wherein, input layer L inputs is that size is m × n, m, n >=1 Image, 1≤l of l layers≤L core is all p_l× 1,1≤p_lThe vector of≤m sizes is all 1 × q_l, 1≤q_l≤ n sizes Vector, N_lRepresent the convolution kernel number of l layers, 1≤N_l≤5000；Rule of thumb select p_lAnd N_lOr q_lAnd N_lValue, l layers Convolution algorithm step-length is s_l, s_l>=1, activation primitive all elects correction linear unit as；

(2) the iterations Epochs of convolutional neural networks is set, 1≤Epochs≤2000000, selects cost function, is utilized Training sample { (x¹,y¹),…,(x^D,y^D), according to back-propagation algorithm, to the parameter of vectorial core convolutional neural networks Practise；

(3) judge whether to complete iterations, if it is not complete, then continuing to train；, will test if completing iterations Tested in the network model that sample input trains, obtain test result.
2. a kind of deep learning mode identification method of vectorial core convolutional neural networks according to claim 1, its feature It is：If the input of Softmax grader output layers is U=(u₁,…u_m)^T, export as O=(o₁,…o_m)^T, then O= Softmax (U), i.e.,

<mrow> <mi>O</mi> <mo>=</mo> <mfrac> <mn>1</mn> <mrow> <msubsup> <mi>&Sigma;</mi> <mrow> <mi>j</mi> <mo>=</mo> <mn>1</mn> </mrow> <mi>k</mi> </msubsup> <msup> <mi>e</mi> <msub> <mi>u</mi> <mi>j</mi> </msub> </msup> </mrow> </mfrac> <mfenced open = "[" close = "]"> <mtable> <mtr> <mtd> <msup> <mi>e</mi> <msub> <mi>u</mi> <mn>1</mn> </msub> </msup> </mtd> </mtr> <mtr> <mtd> <msup> <mi>e</mi> <msub> <mi>u</mi> <mn>2</mn> </msub> </msup> </mtd> </mtr> <mtr> <mtd> <mo>.</mo> </mtd> </mtr> <mtr> <mtd> <mo>.</mo> </mtd> </mtr> <mtr> <mtd> <mo>.</mo> </mtd> </mtr> <mtr> <mtd> <msup> <mi>e</mi> <msub> <mi>u</mi> <mi>j</mi> </msub> </msup> </mtd> </mtr> </mtable> </mfenced> <mo>,</mo> </mrow>

Wherein k is labeling number；Activation primitive ReLu expression formulas are：F (x)=max (0, x), wherein x are the output of each layer Value, f (x) represent the output valve of activation primitive.