CN106228109A - A kind of action identification method based on skeleton motion track - Google Patents

A kind of action identification method based on skeleton motion track

Info

Publication number
CN106228109A
CN106228109A CN201610538399.8A
Authority
CN
China
Prior art keywords
skeleton
sequence
convolutional neural
neural networks
dimensional
Prior art date
Legal status (The legal status is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the status listed.)
Pending
Application number
CN201610538399.8A
Other languages
Chinese (zh)
Inventor
侯永宏
李照洋
董嘉蓉
叶秀峰
薛琳
李传坤
Current Assignee (The listed assignees may be inaccurate. Google has not performed a legal analysis and makes no representation or warranty as to the accuracy of the list.)
Tianjin University
Original Assignee
Tianjin University
Priority date (The priority date is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the date listed.)
Filing date
Publication date
Application filed by Tianjin University
Priority to CN201610538399.8A
Publication of CN106228109A
Legal status: Pending (current)

Classifications

    • G PHYSICS
    • G06 COMPUTING; CALCULATING OR COUNTING
    • G06V IMAGE OR VIDEO RECOGNITION OR UNDERSTANDING
    • G06V40/00 Recognition of biometric, human-related or animal-related patterns in image or video data
    • G06V40/20 Movements or behaviour, e.g. gesture recognition
    • G06V40/23 Recognition of whole body movements, e.g. for sport training
    • G PHYSICS
    • G06 COMPUTING; CALCULATING OR COUNTING
    • G06F ELECTRIC DIGITAL DATA PROCESSING
    • G06F18/00 Pattern recognition
    • G06F18/20 Analysing
    • G06F18/24 Classification techniques
    • G PHYSICS
    • G06 COMPUTING; CALCULATING OR COUNTING
    • G06V IMAGE OR VIDEO RECOGNITION OR UNDERSTANDING
    • G06V20/00 Scenes; Scene-specific elements
    • G06V20/40 Scenes; Scene-specific elements in video content
    • G06V20/46 Extracting features or characteristics from the video content, e.g. video fingerprints, representative shots or key frames

Abstract

The present invention relates to an action recognition method based on skeleton motion trajectories, comprising: selecting a number of human skeleton sequences, each containing a complete action, to form a training data set, where each human skeleton sequence contains the three-dimensional coordinates of every human skeleton node; processing each human skeleton sequence of the training data set; using convolutional neural networks, training one model on the skeleton trajectory maps of each of the three directions generated from the training data set, so that one convolutional neural network model is obtained per direction; for a test sample, after the same processing, obtaining the test skeleton trajectory maps in which color information represents the temporal information of the skeleton motion; inputting each test skeleton trajectory map into the corresponding trained convolutional neural network model to obtain the classification results of the convolutional neural networks; and combining these results into the final classification result. The present invention can recognize human actions accurately and reliably.

Description

A kind of action identification method based on skeleton motion track
Technical field
The invention belongs to the field of multimedia signal processing and relates to an action recognition method.
Background technology
Human motion detection and recognition methods have wide application in society, for example intelligent surveillance, human-computer interaction and somatosensory games, and video retrieval.
Human motion detection and recognition has moved from traditional RGB video sequences to the now popular RGB-D video sequences, and motion trajectories have always served as an important feature. The capture of traditional motion trajectories is usually based on feature-point detection algorithms, and different feature-point detection methods yield entirely different trajectories. At the same time, because feature-point retrieval across different frames is highly unstable and feature points are often discontinuous over the whole video sequence, feature-point trajectories are mostly handled with histogram-based statistical methods: after the whole video sequence has been computed and accumulated, a classifier such as a support vector machine is used for classification.
Various schemes for human action recognition based on three-dimensional skeleton motion trajectories have appeared in recent years. However, current methods that perform action recognition on the motion trajectory sequences of a three-dimensional skeleton mostly process the skeleton data directly, or convert it into histograms for further processing.
Summary of the invention
The object of the present invention is to provide an action recognition method based on skeleton motion trajectories with a higher recognition rate. The present invention maps three-dimensional skeleton motion trajectories to pictures, and uses convolutional neural networks, which have outstanding performance in picture classification, to perform machine learning and classification on the pictures of the three-dimensional skeleton motion trajectory maps. The technical scheme is as follows:
An action recognition method based on skeleton motion trajectories, comprising the following steps:
(1) Select a number of human skeleton sequences that each contain a complete action to form a training data set; each human skeleton sequence contains the three-dimensional coordinates of every human skeleton node;
(2) For each human skeleton sequence in the training data set, do the following processing:
1) For a three-dimensional skeleton sequence of a certain action that contains the three-dimensional coordinates of every human skeleton node, connect the three-dimensional coordinate positions of each skeleton node in temporal order, obtaining the three-dimensional skeleton motion trajectory of that node over the whole skeleton sequence.
2) In the Cartesian coordinate system, project the three-dimensional skeleton motion trajectory of the whole action onto the three coordinate planes respectively, thereby converting the three-dimensional skeleton motion trajectory into three two-dimensional skeleton motion trajectory maps, defined as skeleton trajectory maps;
3) On the three skeleton trajectory maps, assign different colors to the skeleton trajectory segments of different times, the value of the color depending on the temporal position of the trajectory segment within the whole skeleton sequence, thereby obtaining skeleton trajectory maps in which color information represents the temporal information of the skeleton motion;
(3) Using convolutional neural networks, train one model on the skeleton trajectory maps of each of the three directions generated from the training data set, obtaining one convolutional neural network model per direction;
(4) For a test sample, after the processing of step (2), obtain the test skeleton trajectory maps in which color information represents the temporal information of the skeleton motion;
(5) Input each test skeleton trajectory map into the corresponding trained convolutional neural network model to obtain the classification result of that convolutional neural network; each convolutional neural network outputs a Scores vector whose length equals the number of training classes;
(6) Sum the three Scores vectors and take the class label at the maximum as the final classification result.
Description of the drawings
Fig. 1 shows the overall human action recognition framework based on skeleton motion trajectories.
Detailed description of the invention
The skeleton sequences below are taken from the human skeleton sequences provided by data sets such as MSRC-12, G3D, and UTD-MHAD.
1) Assume that a complete human action skeleton sequence, denoted H, contains n frames, and that the human skeleton of each frame is composed of m nodes. The skeleton sequence H can be expressed by the formula H = {F_1, F_2, ..., F_n}, where F_i = {P_1^i, P_2^i, ..., P_m^i} represents the set of three-dimensional coordinates of the skeleton nodes of the i-th frame, and P_j^i represents the three-dimensional position of the j-th skeleton node. Thus, the three-dimensional skeleton motion trajectories T of all skeleton nodes in this skeleton sequence can be represented by the following formula:
T = {T_1, T_2, ..., T_i, ..., T_(n-1)}
where T_i = {t_1^i, t_2^i, ..., t_m^i} represents the three-dimensional motion trajectories of all skeleton nodes between two adjacent frames of the sequence, and t_k^i in this set represents the three-dimensional skeleton motion trajectory obtained for the k-th skeleton node between the two adjacent frames.
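As a minimal illustration of the notation above, the following Python sketch builds the per-frame trajectory segments T_i from a skeleton sequence H stored as an n x m x 3 array. The array layout and the function name are assumptions introduced for illustration; the patent does not specify an implementation.

```python
import numpy as np

def trajectory_segments(H):
    """Build the per-frame trajectory segments T_i from a skeleton sequence.

    H is assumed to be an array of shape (n, m, 3): n frames, m skeleton
    nodes, each with a 3-D coordinate.  The result is a list of n-1 items,
    where segments[i][k] holds the line segment traced by node k between
    frame i and frame i+1 (a pair of 3-D points).
    """
    H = np.asarray(H, dtype=float)
    n, m, _ = H.shape
    segments = []
    for i in range(n - 1):
        frame_segments = [(H[i, k], H[i + 1, k]) for k in range(m)]
        segments.append(frame_segments)
    return segments
```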
2) The three-dimensional skeleton motion trajectory of each skeleton node between two adjacent frames can be expressed as a line segment in the orthogonal Cartesian system, and can therefore be projected onto the three planes of the Cartesian orthogonal system respectively. After all three-dimensional skeleton motion trajectories of the whole skeleton sequence have been projected onto the Cartesian orthogonal planes and superimposed, the whole skeleton sequence can be represented by the skeleton trajectory maps of the three projection directions. Each skeleton trajectory map contains the distribution, in that projection direction, of all skeleton trajectories of the skeleton sequence, so the spatial information of the human action is fully described by the skeleton trajectory maps.
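A minimal sketch of this projection step, assuming the trajectory_segments helper above: each three-dimensional segment is dropped onto the xy, xz and yz planes by keeping two of its three coordinates. The plane naming and ordering are assumptions; the patent only states that the three Cartesian coordinate planes are used.

```python
def project_segments(segments):
    """Project 3-D trajectory segments onto the three Cartesian planes.

    Returns a dict with one list of 2-D segments per projection plane.
    Keeping coordinate pairs (0, 1), (0, 2) and (1, 2) corresponds to the
    xy, xz and yz planes respectively (assumed ordering).
    """
    planes = {"xy": (0, 1), "xz": (0, 2), "yz": (1, 2)}
    projected = {name: [] for name in planes}
    for frame_segments in segments:
        for p_start, p_end in frame_segments:
            for name, (a, b) in planes.items():
                projected[name].append(((p_start[a], p_start[b]),
                                        (p_end[a], p_end[b])))
    return projected
```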
3) In order to describe the action information more accurately, the present invention adds the temporal information of the human motion to the skeleton trajectory maps by giving different colors to the three-dimensional skeleton motion trajectories of different moments. The concrete operations are as follows:
First, a color gradient stripe of length C is established; its color change can be realized by varying H (hue) in the HSV color model. The present invention uses a color gradient stripe running from dark blue to red. In the color gradient stripe vector, let C_l denote the color at position l in the stripe C. For the three-dimensional skeleton motion trajectory t_k^i of any skeleton node between any two adjacent frames, the position l is obtained from a formula that maps the temporal position of the trajectory segment within the whole skeleton sequence to a position in the stripe, which gives the color C_l corresponding to this three-dimensional skeleton motion trajectory. The physical meaning of this operation is that the color of a skeleton motion trajectory is assigned according to its position in the whole skeleton sequence, so that in a skeleton trajectory map the skeleton motion trajectories of the same time period have the same color while those of different time periods are distinguished. At this point, the skeleton trajectory maps contain both the spatial information and the temporal information of the human action.
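One plausible realization of this time-to-color mapping is sketched below: the hue is swept in HSV space from dark blue to red, and the frame index of a trajectory segment is mapped linearly to a position l in a gradient stripe of length C. The hue endpoints and the linear mapping are assumptions; the patent states the idea but not these concrete values.

```python
import colorsys

def color_gradient(C):
    """Build a color gradient stripe of length C from dark blue to red.

    Hue is swept from 2/3 (blue) down to 0 (red) in HSV space; these
    endpoints are assumed, not taken from the patent.
    """
    stripe = []
    for l in range(C):
        hue = (2.0 / 3.0) * (1.0 - l / max(C - 1, 1))
        r, g, b = colorsys.hsv_to_rgb(hue, 1.0, 1.0)
        stripe.append((int(r * 255), int(g * 255), int(b * 255)))
    return stripe

def segment_color(stripe, i, n):
    """Color C_l for the trajectory segment between frames i and i+1.

    i ranges over 0 .. n-2 for a sequence of n frames; the linear mapping
    to a stripe position is an assumed concrete form of the patent's formula.
    """
    C = len(stripe)
    l = round(i * (C - 1) / max(n - 2, 1))
    return stripe[l]
```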
4) Exploiting the outstanding classification capability of convolutional neural networks on pictures and their high-level feature extraction ability, the present invention inputs the skeleton trajectory maps of the three different projection directions in the Cartesian coordinate system, generated from the skeleton sequences of the training data set, into three convolutional neural networks respectively for model training. The convolutional neural network structure in the present invention adopts the well-known AlexNet network structure. After training is complete, the three trained convolutional neural network models are saved respectively.
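The patent trains AlexNet models under the Caffe framework; purely as an illustrative substitute, the sketch below trains one AlexNet per projection direction with PyTorch/torchvision. The directory layout, data loader and hyperparameters are assumptions, not the authors' configuration.

```python
import torch
import torch.nn as nn
from torchvision import models, datasets, transforms

def train_direction_model(image_dir, num_classes, epochs=10):
    """Train one AlexNet classifier on the trajectory maps of one direction.

    image_dir is assumed to hold one sub-folder per action class (an
    ImageFolder layout); this mirrors the idea of one CNN per projection
    direction, not the authors' actual Caffe setup.
    """
    tf = transforms.Compose([transforms.Resize((224, 224)),
                             transforms.ToTensor()])
    data = datasets.ImageFolder(image_dir, transform=tf)
    loader = torch.utils.data.DataLoader(data, batch_size=32, shuffle=True)

    model = models.alexnet(num_classes=num_classes)
    optimizer = torch.optim.SGD(model.parameters(), lr=0.01, momentum=0.9)
    loss_fn = nn.CrossEntropyLoss()

    model.train()
    for _ in range(epochs):
        for images, labels in loader:
            optimizer.zero_grad()
            loss = loss_fn(model(images), labels)
            loss.backward()
            optimizer.step()
    return model

# One model per projection direction (directory names and the class count
# are hypothetical placeholders).
models_by_direction = {d: train_direction_model(f"trajectory_maps/{d}", 20)
                       for d in ("xy", "xz", "yz")}
```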
5) For each skeleton sequence to be tested, three skeleton trajectory maps are generated in the Cartesian orthogonal system according to the same skeleton trajectory map generation procedure. Using the trained convolutional neural network models, the three skeleton trajectory maps corresponding to the same test skeleton sequence are input into the corresponding convolutional neural network models respectively, yielding three Scores vectors. The length of each vector equals the number of training classes, and the values in the vector are the normalized probabilities of belonging to the corresponding classes.
6) The three Scores vectors obtained by passing the skeleton trajectory maps of a skeleton sequence through the convolutional neural networks represent the probabilities, learned and computed by the convolutional neural networks on the three projection directions, that the skeleton sequence belongs to the different classes. In the present invention, the Scores vectors of the three directions are directly summed class by class, and the class with the maximum value is taken as the final predicted class of this video sequence.
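A short sketch of this late score fusion, assuming each of the three trained models returns class scores over the same label set; the softmax normalization and the model interface are illustrative assumptions.

```python
import torch

def classify_sequence(models_by_direction, trajectory_maps):
    """Fuse the per-direction Scores vectors and pick the final class.

    trajectory_maps maps each direction name to a 1x3x224x224 image tensor
    of the corresponding test skeleton trajectory map (assumed layout).
    """
    total_scores = None
    for direction, model in models_by_direction.items():
        model.eval()
        with torch.no_grad():
            logits = model(trajectory_maps[direction])
            scores = torch.softmax(logits, dim=1)   # normalized class probabilities
        total_scores = scores if total_scores is None else total_scores + scores
    return int(total_scores.argmax(dim=1))          # class label at the maximum
```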
The following is a description of the experimental results of the present invention on the MSRC-12, G3D and UTD-MHAD data sets:
The algorithm was implemented in simulation on the Matlab-2013b platform, on which the skeleton trajectory maps (for the three directions) of the training data set and the test data set were computed. The widely used open-source deep learning framework Caffe was used under a Linux system, with an Nvidia GTX TITAN X graphics card accelerating the training of the convolutional neural networks. The final fusion of the Scores vectors was completed on the Matlab-2013b platform.
The method was tested on internationally recognized human action recognition data sets, with the training set and the test set split according to the Cross-Subject protocol. The test results are as follows: on the MSRC-12 data set containing 12 action classes, a recognition accuracy of 93.12% was obtained; on the G3D data set containing 20 action classes, a recognition accuracy of 94.24% was obtained; on the UTD-MHAD data set containing 27 action classes, a recognition accuracy of 85.81% was obtained.
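For reference, a minimal sketch of a Cross-Subject split, in which sequences are divided by performer rather than at random; the odd/even subject rule is only an illustrative stand-in, since each benchmark defines its own official split.

```python
def cross_subject_split(samples):
    """Split samples by performer, not by random sampling.

    samples is assumed to be a list of (subject_id, skeleton_sequence, label)
    tuples; here odd-numbered subjects train and even-numbered subjects test,
    which only stands in for each data set's official Cross-Subject split.
    """
    train = [s for s in samples if s[0] % 2 == 1]
    test = [s for s in samples if s[0] % 2 == 0]
    return train, test
```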

Claims (1)

1. An action recognition method based on skeleton motion trajectories, comprising the following steps:
(1) selecting a number of human skeleton sequences that each contain a complete action to form a training data set, each human skeleton sequence containing the three-dimensional coordinates of every human skeleton node;
(2) for each human skeleton sequence in the training data set, performing the following processing:
1) for a three-dimensional skeleton sequence of a certain action that contains the three-dimensional coordinates of every human skeleton node, connecting the three-dimensional coordinate positions of each skeleton node in temporal order, to obtain the three-dimensional skeleton motion trajectory of that node over the whole skeleton sequence;
2) in the Cartesian coordinate system, projecting the three-dimensional skeleton motion trajectory of the whole action onto the three coordinate planes respectively, thereby converting the three-dimensional skeleton motion trajectory into three two-dimensional skeleton motion trajectory maps, defined as skeleton trajectory maps;
3) on the three skeleton trajectory maps, assigning different colors to the skeleton trajectory segments of different times, the value of the color depending on the temporal position of the trajectory segment within the whole skeleton sequence, thereby obtaining skeleton trajectory maps in which color information represents the temporal information of the skeleton motion;
(3) using convolutional neural networks, training one model on the skeleton trajectory maps of each of the three directions generated from the training data set, to obtain one convolutional neural network model per direction;
(4) for a test sample, after the processing of step (2), obtaining the test skeleton trajectory maps in which color information represents the temporal information of the skeleton motion;
(5) inputting each test skeleton trajectory map into the corresponding trained convolutional neural network model to obtain the classification result of that convolutional neural network, each convolutional neural network outputting a Scores vector whose length equals the number of training classes;
(6) summing the three Scores vectors and taking the class label at the maximum as the final classification result.
CN201610538399.8A 2016-07-08 2016-07-08 A kind of action identification method based on skeleton motion track Pending CN106228109A (en)

Priority Applications (1)

Application Number Priority Date Filing Date Title
CN201610538399.8A CN106228109A (en) 2016-07-08 2016-07-08 A kind of action identification method based on skeleton motion track

Applications Claiming Priority (1)

Application Number Priority Date Filing Date Title
CN201610538399.8A CN106228109A (en) 2016-07-08 2016-07-08 A kind of action identification method based on skeleton motion track

Publications (1)

Publication Number Publication Date
CN106228109A true CN106228109A (en) 2016-12-14

Family

ID=57519497

Family Applications (1)

Application Number Title Priority Date Filing Date
CN201610538399.8A Pending CN106228109A (en) 2016-07-08 2016-07-08 A kind of action identification method based on skeleton motion track

Country Status (1)

Country Link
CN (1) CN106228109A (en)

Patent Citations (5)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN102117116A (en) * 2009-12-30 2011-07-06 微盟电子(昆山)有限公司 Moving object recognition method and instruction input method based on moving object recognition
CN103679154A (en) * 2013-12-26 2014-03-26 中国科学院自动化研究所 Three-dimensional gesture action recognition method based on depth images
CN104750397A (en) * 2015-04-09 2015-07-01 重庆邮电大学 Somatosensory-based natural interaction method for virtual mine
CN105069413A (en) * 2015-07-27 2015-11-18 电子科技大学 Human body gesture recognition method based on deep convolutional neural network
CN105160310A (en) * 2015-08-25 2015-12-16 西安电子科技大学 3D (three-dimensional) convolutional neural network based human body behavior recognition method

Non-Patent Citations (1)

* Cited by examiner, † Cited by third party
Title
ESHED OHN-BAR ET AL.: "Joint Angles Similarities and HOG2 for Action Recognition", 2013 IEEE Conference on Computer Vision and Pattern Recognition Workshops *

Cited By (16)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN108052896B (en) * 2017-12-12 2020-06-02 广东省智能制造研究所 Human body behavior identification method based on convolutional neural network and support vector machine
CN108052896A (en) * 2017-12-12 2018-05-18 广东省智能制造研究所 Human body behavior identification method based on convolutional neural network and support vector machine
CN108597578B (en) * 2018-04-27 2021-11-05 广东省智能制造研究所 Human motion assessment method based on two-dimensional skeleton sequence
CN108597578A (en) * 2018-04-27 2018-09-28 广东省智能制造研究所 Human motion assessment method based on two-dimensional skeleton sequence
CN110490034A (en) * 2018-05-14 2019-11-22 欧姆龙株式会社 Motion analysis device, motion analysis method, recording medium and motion analysis system
CN108846348A (en) * 2018-06-07 2018-11-20 四川大学 Human body behavior recognition method based on three-dimensional skeleton features
CN109584345A (en) * 2018-11-12 2019-04-05 大连大学 Human motion synthesis method based on convolutional neural network
CN109584345B (en) * 2018-11-12 2023-10-31 大连大学 Human motion synthesis method based on convolutional neural network
CN109670401A (en) * 2018-11-15 2019-04-23 天津大学 Action recognition method based on skeleton motion maps
CN110728183A (en) * 2019-09-09 2020-01-24 天津大学 Human body action recognition method based on attention mechanism neural network
CN110728183B (en) * 2019-09-09 2023-09-22 天津大学 Human body action recognition method of neural network based on attention mechanism
CN111223549A (en) * 2019-12-30 2020-06-02 华东师范大学 Mobile end system and method for disease prevention based on posture correction
CN112101262B (en) * 2020-09-22 2022-09-06 中国科学技术大学 Multi-feature fusion sign language recognition method and network model
CN112101262A (en) * 2020-09-22 2020-12-18 中国科学技术大学 Multi-feature fusion sign language recognition method and network model
CN112507940B (en) * 2020-12-17 2023-08-25 华南理工大学 Bone action recognition method based on differential guidance representation learning network
CN112507940A (en) * 2020-12-17 2021-03-16 华南理工大学 Skeleton action recognition method based on difference guidance representation learning network

Similar Documents

Publication Publication Date Title
CN106228109A (en) A kind of action identification method based on skeleton motion track
Song et al. Richly activated graph convolutional network for action recognition with incomplete skeletons
CN108932500B (en) A kind of dynamic gesture identification method and system based on deep neural network
Tao et al. An object detection system based on YOLO in traffic scene
CN104732208B (en) Video human Activity recognition method based on sparse subspace clustering
CN106778604B (en) Pedestrian re-identification method based on matching convolutional neural network
Lan et al. Action recognition by hierarchical mid-level action elements
CN104200237B (en) One kind being based on the High-Speed Automatic multi-object tracking method of coring correlation filtering
CN103186775B (en) Based on the human motion identification method of mix description
CN107292813A (en) A kind of multi-pose Face generation method based on generation confrontation network
CN103226835B (en) Based on method for tracking target and the system of online initialization gradient enhancement regression tree
CN105528794A (en) Moving object detection method based on Gaussian mixture model and superpixel segmentation
CN101470809B (en) Moving object detection method based on expansion mixed gauss model
CN107392131A (en) A kind of action identification method based on skeleton nodal distance
CN104021381B (en) Human movement recognition method based on multistage characteristics
Wei et al. P3D-CTN: Pseudo-3D convolutional tube network for spatio-temporal action detection in videos
CN105825233B (en) A kind of pedestrian detection method based on on-line study random fern classifier
CN105740915A (en) Cooperation segmentation method fusing perception information
CN110096961A (en) A kind of indoor scene semanteme marking method of super-pixel rank
CN104820711A (en) Video retrieval method for figure target in complex scene
CN115410119A (en) Violent movement detection method and system based on adaptive generation of training samples
Wu et al. Single shot multibox detector for vehicles and pedestrians detection and classification
CN103218829B (en) A kind of foreground extracting method being adapted to dynamic background
CN103020631B (en) Human movement identification method based on star model
CN104008374B (en) Miner's detection method based on condition random field in a kind of mine image

Legal Events

Date Code Title Description
C06 Publication
PB01 Publication
SE01 Entry into force of request for substantive examination
RJ01 Rejection of invention patent application after publication

Application publication date: 20161214