CN107480600A - Gesture recognition method based on a deep convolutional neural network - Google Patents

Gesture recognition method based on a deep convolutional neural network

Info

Publication number
CN107480600A
CN107480600A (application CN201710597440.3A)
Authority
CN
China
Prior art keywords
convolutional neural
neural networks
hand
training
Prior art date
Legal status (The legal status is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the status listed.)
Pending
Application number
CN201710597440.3A
Other languages
Chinese (zh)
Inventor
王修晖
Current Assignee (The listed assignees may be inaccurate. Google has not performed a legal analysis and makes no representation or warranty as to the accuracy of the list.)
China Jiliang University
Original Assignee
China Jiliang University
Priority date (The priority date is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the date listed.)
Filing date
Publication date
Application filed by China Jiliang University filed Critical China Jiliang University
Priority to CN201710597440.3A priority Critical patent/CN107480600A/en
Publication of CN107480600A publication Critical patent/CN107480600A/en
Pending legal-status Critical Current

Links

Classifications

    • GPHYSICS
    • G06COMPUTING; CALCULATING OR COUNTING
    • G06VIMAGE OR VIDEO RECOGNITION OR UNDERSTANDING
    • G06V40/00Recognition of biometric, human-related or animal-related patterns in image or video data
    • G06V40/10Human or animal bodies, e.g. vehicle occupants or pedestrians; Body parts, e.g. hands
    • G06V40/107Static hand or arm
    • G06V40/113Recognition of static hand signs
    • GPHYSICS
    • G06COMPUTING; CALCULATING OR COUNTING
    • G06FELECTRIC DIGITAL DATA PROCESSING
    • G06F18/00Pattern recognition
    • G06F18/20Analysing
    • G06F18/21Design or setup of recognition systems or techniques; Extraction of features in feature space; Blind source separation
    • G06F18/214Generating training patterns; Bootstrap methods, e.g. bagging or boosting
    • GPHYSICS
    • G06COMPUTING; CALCULATING OR COUNTING
    • G06NCOMPUTING ARRANGEMENTS BASED ON SPECIFIC COMPUTATIONAL MODELS
    • G06N3/00Computing arrangements based on biological models
    • G06N3/02Neural networks
    • G06N3/04Architecture, e.g. interconnection topology
    • G06N3/045Combinations of networks

Landscapes

  • Engineering & Computer Science (AREA)
  • Theoretical Computer Science (AREA)
  • Physics & Mathematics (AREA)
  • General Physics & Mathematics (AREA)
  • Data Mining & Analysis (AREA)
  • Evolutionary Computation (AREA)
  • Life Sciences & Earth Sciences (AREA)
  • Artificial Intelligence (AREA)
  • General Engineering & Computer Science (AREA)
  • Computing Systems (AREA)
  • Software Systems (AREA)
  • Molecular Biology (AREA)
  • Computational Linguistics (AREA)
  • Biophysics (AREA)
  • Biomedical Technology (AREA)
  • Mathematical Physics (AREA)
  • General Health & Medical Sciences (AREA)
  • Health & Medical Sciences (AREA)
  • Bioinformatics & Cheminformatics (AREA)
  • Bioinformatics & Computational Biology (AREA)
  • Computer Vision & Pattern Recognition (AREA)
  • Evolutionary Biology (AREA)
  • Human Computer Interaction (AREA)
  • Multimedia (AREA)
  • Image Analysis (AREA)

Abstract

The invention discloses a gesture recognition method based on a deep convolutional neural network, comprising: (1) performing edge-detection preprocessing on the training-set sample images and dividing the sample set; (2) building the deep convolutional neural network; (3) determining the activation function and the loss function; (4) training the deep neural network; (5) performing gesture recognition with the trained deep convolutional neural network, in the following steps: a) extract a hand-shape image from the gesture data to be recognized; b) apply edge detection and size normalization to the hand-shape image; c) feed the normalized hand-shape image into the deep convolutional neural network and decide the class of the current gesture from the output-layer values. The invention builds the deep convolutional neural network with a multiple down-sampling technique and trains the network with the hyperbolic tangent as the activation function, which improves both the efficiency and the accuracy of gesture recognition.

Description

Gesture recognition method based on a deep convolutional neural network
Technical field
The present invention relates to the field of biometric recognition, and in particular to a gesture recognition method based on a deep convolutional neural network.
Background technology
Biometric recognition is one of the key technologies in fields such as video surveillance and security authentication. Biometric traits fall into physiological traits and behavioral traits. Physiological traits mainly include the face, fingerprint and iris; behavioral traits include gait, gesture, and so on. Typical recognition methods based on physiological traits are fingerprint recognition, palm-shape and contour recognition, face recognition, and iris recognition. Fingerprint recognition is currently the most widely used biometric identification method; it is technically mature and inexpensive, but it requires contact, is perceived as intrusive, raises hygiene concerns, and fingerprints wear easily. Face recognition has been a very active research field in recent years; it is intuitive, convenient, friendly and easily accepted, and, being contactless and passive, it needs no active cooperation from the subject. Its drawback is sensitivity to illumination, viewing angle, occlusion, environment and expression, which makes recognition difficult. Iris recognition offers very high security and precision, but capturing iris features is extremely difficult. Common identity-recognition technologies based on behavioral traits are gait recognition and gesture recognition. The input to gait recognition is a video sequence of a person walking; the data volume is very large, so the computational complexity is high and processing is relatively difficult. Gesture recognition, by contrast, is an important component of contactless human-computer interaction. Most current researchers concentrate on the final recognition of the gesture: they simplify the gesture background, segment the gesture under a single background with a purpose-built algorithm, and then parse out the meaning expressed by the gesture with conventional recognition methods.
Summary of the invention
In view of the above deficiencies of the prior art, the object of the present invention is to provide a gesture recognition method based on a deep convolutional neural network.
The object of the present invention is achieved through the following technical solution: a gesture recognition method based on a deep neural network, comprising the following steps:
(1) Perform edge-detection preprocessing on the sample images of the training set and divide the sample set: first apply hand-shape detection and edge detection to the training gesture images, and adjust the extracted hand-shape images to a unified size; then divide the preprocessed data into a training sample set and a validation sample set;
(2) Build the deep convolutional neural network: let I and O be the input layer and output layer of the deep convolutional neural network, and let the hidden layers between I and O be H1, H2, ..., Hn. The input layer receives the hand-shape image obtained in step (1), the output layer is a gesture feature vector of length N, and the hidden layers use a multiple down-sampling technique that allows the down-sampling blocks to overlap;
(3) Determine the activation function and the loss function: select the nonlinear hyperbolic tangent shown in formula (1) as the neuron activation function, and the loss function shown in formula (2), where n is the number of samples in the training set, x is the hand-shape image, y is the gesture feature vector corresponding to x, and θ is the parameter vector;
(4) Train the deep neural network: draw m training samples from the training sample set and compute the gradient with the steepest-descent method; then validate with the validation sample set, and stop training when the accuracy exceeds the preset threshold of 99.5%, yielding a deep neural network with determined weights w and bias terms b;
(5) Perform gesture recognition with the trained deep convolutional neural network, in the following steps: a) extract the hand-shape image from the gesture data to be recognized; b) apply edge detection and size normalization to the hand-shape image; c) feed the hand-shape image into the deep convolutional neural network and decide the class of the current gesture from the output-layer values.
The beneficial effect of the invention is that the gesture recognition method based on a deep convolutional neural network builds the network with a multiple down-sampling technique and trains it with the hyperbolic tangent as the activation function, which improves both the efficiency and the accuracy of gesture recognition.
Brief description of the drawings
Fig. 1 shows the flow of the method;
Fig. 2 illustrates hand-shape image extraction;
Fig. 3 illustrates multiple down-sampling;
Fig. 4 shows the hyperbolic tangent curve;
Fig. 5 compares the error rates of the recognition methods.
Embodiment
The present invention is described in detail below with reference to the drawings.
As shown in Fig. 1, the gesture recognition method based on a deep convolutional neural network of the invention is divided into a training stage and a recognition stage.
In the training stage, the deep convolutional neural network is built and its weight and bias parameters are determined from the training-set data. This comprises the following sub-steps:
1. Extract the hand-shape image from the training data: for a frame of the gesture interaction, as shown in Fig. 2(a), the invention first extracts a rough hand-region image from skin-color features, as shown in Fig. 2(b); then the hand-region image is refined with filtering and fast morphological dilation and erosion, yielding the hand-shape image, as shown in Fig. 2(c);
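The skin-color extraction and morphological clean-up of this sub-step can be sketched in pure Python. This is only an illustration: the RGB thresholds and the 3×3 structuring element are assumptions, since the patent does not specify them.

```python
def skin_mask(pixels):
    """Binary hand mask from a rough RGB skin-color rule
    (illustrative thresholds; the patent does not give exact values)."""
    mask = []
    for row in pixels:
        out = []
        for (r, g, b) in row:
            skin = (r > 95 and g > 40 and b > 20
                    and r > g and r > b and (r - min(g, b)) > 15)
            out.append(1 if skin else 0)
        mask.append(out)
    return mask

def dilate(mask):
    """3x3 binary dilation: a pixel is set if any in-bounds 8-neighbour is set."""
    h, w = len(mask), len(mask[0])
    return [[int(any(mask[i + di][j + dj]
                     for di in (-1, 0, 1) for dj in (-1, 0, 1)
                     if 0 <= i + di < h and 0 <= j + dj < w))
             for j in range(w)] for i in range(h)]

def erode(mask):
    """3x3 binary erosion: a pixel survives only if all in-bounds 8-neighbours are set."""
    h, w = len(mask), len(mask[0])
    return [[int(all(mask[i + di][j + dj]
                     for di in (-1, 0, 1) for dj in (-1, 0, 1)
                     if 0 <= i + di < h and 0 <= j + dj < w))
             for j in range(w)] for i in range(h)]
```

In practice a dilation followed by an erosion (a morphological closing) fills small holes in the skin mask while preserving the hand outline.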
2. Apply edge detection and size normalization to the hand-shape image: perform edge detection and binarization on the hand-shape image obtained in step 1 to obtain a preliminary hand-shape contour image, as shown in Fig. 2(d); then trim and complete the hand-shape contour curve, and adjust it to a unified size to obtain the corresponding hand-shape image;
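The binarization, contour extraction and size normalization of this sub-step might look like the following minimal sketch. The threshold value, the 4-neighbour edge rule and nearest-neighbour resizing are assumptions for illustration; the patent does not name specific algorithms.

```python
def binarize(img, thresh=128):
    """Threshold a greyscale image (list of rows) into a 0/1 mask."""
    return [[1 if p >= thresh else 0 for p in row] for row in img]

def edges(mask):
    """Mark foreground pixels that touch the background (or the image border)
    with a 4-neighbour rule, giving a one-pixel-wide contour."""
    h, w = len(mask), len(mask[0])
    out = [[0] * w for _ in range(h)]
    for i in range(h):
        for j in range(w):
            if mask[i][j]:
                nb = [mask[i + di][j + dj]
                      for di, dj in ((-1, 0), (1, 0), (0, -1), (0, 1))
                      if 0 <= i + di < h and 0 <= j + dj < w]
                out[i][j] = int(0 in nb or len(nb) < 4)
    return out

def resize_nn(img, H, W):
    """Nearest-neighbour resize to the unified H x W network input size."""
    h, w = len(img), len(img[0])
    return [[img[i * h // H][j * w // W] for j in range(W)] for i in range(H)]
```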
3. Build the deep convolutional neural network: for the hand-shape image obtained in step 2, the invention constructs a deep convolutional neural network for gesture recognition. Let I and O be the input and output layers, and let the hidden layers between them be H1, H2, ..., Hn. The input layer receives the hand-shape image from step 2, the output layer is a gesture feature vector of length N, and the hidden layers use a multiple down-sampling technique that allows the down-sampling blocks to overlap, as shown in Fig. 3;
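Down-sampling with overlapping blocks amounts to pooling with a stride smaller than the window size. A minimal sketch follows; the window size, stride and the choice of max-pooling are illustrative assumptions, since the patent does not fix the pooling operator or its parameters.

```python
def pool2d(x, k=3, s=2, op=max):
    """Down-sample a 2D feature map with window k and stride s.
    Choosing s < k makes consecutive windows overlap, as the
    multiple down-sampling technique allows."""
    h, w = len(x), len(x[0])
    out = []
    for i in range(0, h - k + 1, s):
        row = []
        for j in range(0, w - k + 1, s):
            win = [x[i + a][j + b] for a in range(k) for b in range(k)]
            row.append(op(win))
        out.append(row)
    return out
```

Applying `pool2d` repeatedly (one application per hidden layer) realizes the "multiple" part of the technique: each layer halves the spatial resolution while its 3×2 window/stride pair keeps adjacent blocks overlapping by one pixel.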
4. Determine the activation function and the loss function: as shown in Fig. 4, the nonlinear hyperbolic tangent is selected as the neuron activation function, and the squared-error function shown in formula (2) as the loss function.
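Formulas (1) and (2) can be written down directly. The sketch below uses scalar outputs, so the vector norm of formula (2) reduces to a squared difference; the function names are ours, not the patent's.

```python
import math

def tanh(x):
    # Formula (1): tanh(x) = (1 - e^(-2x)) / (1 + e^(-2x))
    return (1 - math.exp(-2 * x)) / (1 + math.exp(-2 * x))

def squared_error_loss(samples, f, theta):
    # Formula (2): L = (1/n) * sum over samples of ||y - f(x, theta)||^2
    # (scalar outputs here, so the norm is a squared difference)
    return sum((y - f(x, theta)) ** 2 for x, y in samples) / len(samples)
```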
5. Train the deep convolutional neural network: draw m training samples from the training sample set, compute the gradient with the steepest-descent method, and iteratively optimize the parameters of each hidden layer according to the loss function; then validate with the validation sample set, and stop training when the accuracy exceeds the preset threshold of 99.5%, yielding a deep neural network with determined weights w and bias terms b.
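The training loop — draw m samples, take a steepest-descent step, stop once validation accuracy passes the preset threshold — can be sketched on a toy linear model. The model, learning rate, batch size and correctness tolerance are all illustrative assumptions; the patent applies the same loop to a deep CNN.

```python
import random

def train(train_set, val_set, lr=0.05, thresh=0.995, m=4, max_epochs=5000):
    """Steepest-descent training with an early stop on validation accuracy,
    shown on a toy scalar model y ~ w*x + b."""
    w, b = 0.0, 0.0
    rng = random.Random(0)  # fixed seed for reproducibility
    for _ in range(max_epochs):
        batch = rng.sample(train_set, m)  # choose m training samples
        # steepest descent: gradient of the mean squared error on the batch
        gw = sum(2 * (w * x + b - y) * x for x, y in batch) / m
        gb = sum(2 * (w * x + b - y) for x, y in batch) / m
        w -= lr * gw
        b -= lr * gb
        # validate: a prediction counts as correct if within a small tolerance
        acc = sum(abs(w * x + b - y) < 0.05 for x, y in val_set) / len(val_set)
        if acc > thresh:  # preset threshold, 99.5% in the patent
            break
    return w, b
```

On noise-free data generated from y = 2x + 1, the loop recovers w ≈ 2 and b ≈ 1 and stops as soon as every validation sample is predicted within tolerance.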
In the recognition stage, the hand-shape image is first extracted from the data to be recognized, and edge detection and size normalization are applied; the image is then fed into the trained deep convolutional neural network to decide the class of the current gesture. Finally, this method was compared with a data-glove-based method and the sequential similarity detection algorithm (SSDA); Fig. 5 gives the error-rate experimental data.
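Deciding the class of the current gesture from the output-layer values reduces, in the simplest reading, to taking the largest component of the length-N output vector. This is a sketch under that assumption; the patent does not state the decision rule explicitly.

```python
def classify(output, labels):
    """Pick the gesture class as the index of the largest output-layer value.
    `output` is the length-N gesture feature vector; `labels` names the classes."""
    best = max(range(len(output)), key=lambda i: output[i])
    return labels[best]
```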

Claims (1)

1. A gesture recognition method based on a deep convolutional neural network, comprising the following steps:
(1) Perform edge-detection preprocessing on the sample images of the training set and divide the sample set: first apply hand-shape detection and edge detection to the training gesture images, and adjust the extracted hand-shape images to a unified size; then divide the preprocessed data into a training sample set and a validation sample set;
(2) Build the deep convolutional neural network: let I and O be the input layer and output layer of the deep convolutional neural network, and let the hidden layers between I and O be H1, H2, ..., Hn. The input layer receives the hand-shape image obtained in step (1), the output layer is a gesture feature vector of length N, and the hidden layers use a multiple down-sampling technique that allows the down-sampling blocks to overlap;
(3) Determine the activation function and the loss function: select the nonlinear hyperbolic tangent shown in formula (1) as the neuron activation function, and the loss function shown in formula (2), where n is the number of samples in the training set, x is a point in the hand-shape image, y is the output value corresponding to x, and θ is the parameter vector;
\[ \tanh(x) = \frac{1 - e^{-2x}}{1 + e^{-2x}} \tag{1} \]
\[ L = \frac{1}{n} \sum_{x} \left\lVert y - f(x, \theta) \right\rVert^{2} \tag{2} \]
(4) Train the deep neural network: draw m training samples from the training sample set and compute the gradient with the steepest-descent method; then validate with the validation sample set, and stop training when the accuracy exceeds the preset threshold, yielding a deep neural network with determined weights w and bias terms b;
(5) Perform gesture recognition with the trained deep convolutional neural network, in the following steps: a) extract the hand-shape image from the gesture data to be recognized; b) apply edge detection and size normalization to the hand-shape image; c) feed the hand-shape image into the deep convolutional neural network and decide the class of the current gesture from the output-layer values.
CN201710597440.3A 2017-07-20 2017-07-20 A kind of gesture identification method based on depth convolutional neural networks Pending CN107480600A (en)

Priority Applications (1)

Application Number Priority Date Filing Date Title
CN201710597440.3A CN107480600A (en) 2017-07-20 2017-07-20 A kind of gesture identification method based on depth convolutional neural networks

Applications Claiming Priority (1)

Application Number Priority Date Filing Date Title
CN201710597440.3A CN107480600A (en) 2017-07-20 2017-07-20 A kind of gesture identification method based on depth convolutional neural networks

Publications (1)

Publication Number Publication Date
CN107480600A true CN107480600A (en) 2017-12-15

Family

ID=60595146

Family Applications (1)

Application Number Title Priority Date Filing Date
CN201710597440.3A Pending CN107480600A (en) 2017-07-20 2017-07-20 A kind of gesture identification method based on depth convolutional neural networks

Country Status (1)

Country Link
CN (1) CN107480600A (en)

Cited By (6)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN108334814A (en) * 2018-01-11 2018-07-27 浙江工业大学 A kind of AR system gesture identification methods based on convolutional neural networks combination user's habituation behavioural analysis
CN109117742A (en) * 2018-07-20 2019-01-01 百度在线网络技术(北京)有限公司 Gestures detection model treatment method, apparatus, equipment and storage medium
CN109890573A (en) * 2019-01-04 2019-06-14 珊口(上海)智能科技有限公司 Control method, device, mobile robot and the storage medium of mobile robot
CN111338470A (en) * 2020-02-10 2020-06-26 烟台持久钟表有限公司 Method for controlling big clock through gestures
CN112889075A (en) * 2018-10-29 2021-06-01 Sk电信有限公司 Improving prediction performance using asymmetric hyperbolic tangent activation function
CN113703581A (en) * 2021-09-03 2021-11-26 广州朗国电子科技股份有限公司 Window adjusting method based on gesture switching, electronic whiteboard and storage medium

Citations (2)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN106529470A (en) * 2016-11-09 2017-03-22 济南大学 Gesture recognition method based on multistage depth convolution neural network
US20170161607A1 (en) * 2015-12-04 2017-06-08 Pilot Ai Labs, Inc. System and method for improved gesture recognition using neural networks

Patent Citations (2)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
US20170161607A1 (en) * 2015-12-04 2017-06-08 Pilot Ai Labs, Inc. System and method for improved gesture recognition using neural networks
CN106529470A (en) * 2016-11-09 2017-03-22 济南大学 Gesture recognition method based on multistage depth convolution neural network

Non-Patent Citations (2)

* Cited by examiner, † Cited by third party
Title
LEE's Whiteboard (blog): "A First Look at Convolutional Neural Networks", Stanford deep-learning notes, https://my.oschina.net/findbill/blog/550565 *
CAI Juan: "Gesture Recognition Based on Convolutional Neural Networks", China Master's Theses Full-text Database, Information Science and Technology series *

Cited By (11)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN108334814A (en) * 2018-01-11 2018-07-27 浙江工业大学 A kind of AR system gesture identification methods based on convolutional neural networks combination user's habituation behavioural analysis
CN108334814B (en) * 2018-01-11 2020-10-30 浙江工业大学 Gesture recognition method of AR system
CN109117742A (en) * 2018-07-20 2019-01-01 百度在线网络技术(北京)有限公司 Gestures detection model treatment method, apparatus, equipment and storage medium
CN109117742B (en) * 2018-07-20 2022-12-27 百度在线网络技术(北京)有限公司 Gesture detection model processing method, device, equipment and storage medium
CN112889075A (en) * 2018-10-29 2021-06-01 Sk电信有限公司 Improving prediction performance using asymmetric hyperbolic tangent activation function
CN112889075B (en) * 2018-10-29 2024-01-26 Sk电信有限公司 Improved predictive performance using asymmetric hyperbolic tangent activation function
CN109890573A (en) * 2019-01-04 2019-06-14 珊口(上海)智能科技有限公司 Control method, device, mobile robot and the storage medium of mobile robot
CN109890573B (en) * 2019-01-04 2022-05-03 上海阿科伯特机器人有限公司 Control method and device for mobile robot, mobile robot and storage medium
CN111338470A (en) * 2020-02-10 2020-06-26 烟台持久钟表有限公司 Method for controlling big clock through gestures
CN111338470B (en) * 2020-02-10 2022-10-21 烟台持久钟表有限公司 Method for controlling big clock through gestures
CN113703581A (en) * 2021-09-03 2021-11-26 广州朗国电子科技股份有限公司 Window adjusting method based on gesture switching, electronic whiteboard and storage medium

Similar Documents

Publication Publication Date Title
CN107480600A (en) A kind of gesture identification method based on depth convolutional neural networks
CN106529468B (en) A kind of finger vein identification method and system based on convolutional neural networks
CN100492400C (en) Matching identification method by extracting characters of vein from finger
CN100395770C (en) Hand-characteristic mix-together identifying method based on characteristic relation measure
CN101777117B (en) Method for extracting finger vein feature for matching identification
CN108460318B (en) Identity authentication/identification method and equipment based on electrocardiosignals
CN104463100A (en) Intelligent wheelchair man-machine interaction system and method based on facial expression recognition mode
CN101281598A (en) Method for recognizing human face based on amalgamation of multicomponent and multiple characteristics
CN104346607A (en) Face recognition method based on convolutional neural network
CN104239769A (en) Identity recognition method and system based on finger vein characteristics
CN106529504B (en) A kind of bimodal video feeling recognition methods of compound space-time characteristic
CN102542281A (en) Non-contact biometric feature identification method and system
CN107392114A (en) A kind of finger vein identification method and system based on neural network model
CN101789075A (en) Finger vein identifying method based on characteristic value normalization and bidirectional weighting
CN110555380A (en) Finger vein identification method based on Center Loss function
Podder et al. An efficient iris segmentation model based on eyelids and eyelashes detection in iris recognition system
CN106778498A (en) A kind of method for strengthening Fingerprint recognition
Wang et al. Damaged fingerprint classification by Deep Learning with fuzzy feature points
Yeh et al. ResNeXt with cutout for finger vein analysis
Chen et al. A finger vein recognition algorithm based on deep learning
CN107977935A (en) A kind of vena metacarpea image texture filtering reinforcement method
CN103886291B (en) Palmprint image wrinkle detection method based on dynamic template
Gaxiola et al. Modular neural networks for person recognition using segmentation and the iris biometric measurement with image pre-processing
Thiyaneswaran et al. Iris Recognition using Left and Right Iris Feature of the Human Eye for Biometric Security System
CN114973307A (en) Finger vein identification method and system for generating countermeasure and cosine ternary loss function

Legal Events

Date Code Title Description
PB01 Publication
SE01 Entry into force of request for substantive examination
RJ01 Rejection of invention patent application after publication

Application publication date: 20171215