CN111797743A - Human behavior recognition method based on bidirectional LSTM - Google Patents

Human behavior recognition method based on bidirectional LSTM

Info

Publication number
CN111797743A
CN111797743A (application CN202010596167.4A)
Authority
CN
China
Prior art keywords: behavior, layer, behavior recognition, bidirectional LSTM, LSTM
Prior art date
Legal status (The legal status is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the status listed.)
Pending
Application number
CN202010596167.4A
Other languages
Chinese (zh)
Inventor
潘赟
肖沛文
朱怀宇
Current Assignee (The listed assignees may be inaccurate. Google has not performed a legal analysis and makes no representation or warranty as to the accuracy of the list.)
Zhejiang University ZJU
Original Assignee
Zhejiang University ZJU
Priority date (The priority date is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the date listed.)
Filing date
Publication date
Application filed by Zhejiang University ZJU filed Critical Zhejiang University ZJU
Priority to CN202010596167.4A priority Critical patent/CN111797743A/en
Publication of CN111797743A publication Critical patent/CN111797743A/en
Pending legal-status Critical Current

Classifications

    • GPHYSICS
    • G06COMPUTING; CALCULATING OR COUNTING
    • G06VIMAGE OR VIDEO RECOGNITION OR UNDERSTANDING
    • G06V40/00Recognition of biometric, human-related or animal-related patterns in image or video data
    • G06V40/20Movements or behaviour, e.g. gesture recognition
    • GPHYSICS
    • G06COMPUTING; CALCULATING OR COUNTING
    • G06FELECTRIC DIGITAL DATA PROCESSING
    • G06F18/00Pattern recognition
    • G06F18/20Analysing
    • G06F18/21Design or setup of recognition systems or techniques; Extraction of features in feature space; Blind source separation
    • G06F18/214Generating training patterns; Bootstrap methods, e.g. bagging or boosting
    • GPHYSICS
    • G06COMPUTING; CALCULATING OR COUNTING
    • G06FELECTRIC DIGITAL DATA PROCESSING
    • G06F18/00Pattern recognition
    • G06F18/20Analysing
    • G06F18/24Classification techniques
    • G06F18/241Classification techniques relating to the classification model, e.g. parametric or non-parametric approaches
    • GPHYSICS
    • G06COMPUTING; CALCULATING OR COUNTING
    • G06NCOMPUTING ARRANGEMENTS BASED ON SPECIFIC COMPUTATIONAL MODELS
    • G06N3/00Computing arrangements based on biological models
    • G06N3/02Neural networks
    • G06N3/04Architecture, e.g. interconnection topology
    • G06N3/045Combinations of networks
    • GPHYSICS
    • G06COMPUTING; CALCULATING OR COUNTING
    • G06NCOMPUTING ARRANGEMENTS BASED ON SPECIFIC COMPUTATIONAL MODELS
    • G06N3/00Computing arrangements based on biological models
    • G06N3/02Neural networks
    • G06N3/08Learning methods
    • GPHYSICS
    • G06COMPUTING; CALCULATING OR COUNTING
    • G06VIMAGE OR VIDEO RECOGNITION OR UNDERSTANDING
    • G06V10/00Arrangements for image or video recognition or understanding
    • G06V10/20Image preprocessing
    • G06V10/30Noise filtering

Landscapes

  • Engineering & Computer Science (AREA)
  • Theoretical Computer Science (AREA)
  • Physics & Mathematics (AREA)
  • General Physics & Mathematics (AREA)
  • Data Mining & Analysis (AREA)
  • Life Sciences & Earth Sciences (AREA)
  • Artificial Intelligence (AREA)
  • Evolutionary Computation (AREA)
  • General Engineering & Computer Science (AREA)
  • Health & Medical Sciences (AREA)
  • Computer Vision & Pattern Recognition (AREA)
  • General Health & Medical Sciences (AREA)
  • Computing Systems (AREA)
  • Bioinformatics & Computational Biology (AREA)
  • Computational Linguistics (AREA)
  • Mathematical Physics (AREA)
  • Software Systems (AREA)
  • Biophysics (AREA)
  • Bioinformatics & Cheminformatics (AREA)
  • Molecular Biology (AREA)
  • Biomedical Technology (AREA)
  • Evolutionary Biology (AREA)
  • Multimedia (AREA)
  • Psychiatry (AREA)
  • Social Psychology (AREA)
  • Human Computer Interaction (AREA)
  • Measurement Of The Respiration, Hearing Ability, Form, And Blood Characteristics Of Living Organisms (AREA)

Abstract

A human behavior recognition method based on bidirectional LSTM comprises a training stage and a recognition stage. In the training stage, raw three-axis acceleration data are collected with an accelerometer; the sensor data are then preprocessed by low-pass filtering, denoising, and segmentation; finally, the segmented data and the behavior labels recorded during collection are used to train and validate a bidirectional-LSTM-based behavior recognition network model, and the trained model with the highest recognition rate on the validation set is selected for use in the recognition stage. The method achieves high accuracy with few parameters and provides fast, accurate recognition results.

Description

Human behavior recognition method based on bidirectional LSTM
Technical Field
The invention belongs to the field of human behavior recognition, and relates to a human behavior recognition method based on deep learning and sensor data.
Background
Human behavior recognition technology can comprehensively reflect human behavior states and physiological information, and has broad application prospects in motion tracking, fitness exercise, daily monitoring, medical rehabilitation, human-computer interaction, virtual reality, intelligent environments, and other fields. In inertial-sensor-based behavior recognition, physical quantities generated by human motion, such as acceleration, angular velocity, and orientation, are collected with sensors such as accelerometers, gyroscopes, and orientation sensors and used to recognize the current human behavior.
The accuracy of traditional human behavior recognition methods depends to a great extent on feature extraction and selection: researchers must rely on professional knowledge and skills to extract time-domain, frequency-domain, and other features from signals, and apply feature engineering to reduce the feature set. A neural network, by contrast, can convert raw data into higher-order, more abstract, and more complex representations through simple nonlinear models, replacing the manual feature extraction process of behavior recognition with automatic feature extraction.
Disclosure of Invention
In order to overcome the defects of the prior art, the invention provides a human behavior recognition method based on a deep bidirectional LSTM, which can provide faster and more accurate recognition results.
The technical scheme adopted by the invention for solving the technical problems is as follows:
a human behavior recognition method based on bidirectional LSTM comprises the following steps:
step (1): the method comprises the steps of collecting sensor data under different human behavior categories, wherein the collected action categories are set as M ═ M1,m2,...,mJJ is a predefined number of behavior categories;
step (2): carrying out filtering and denoising processing on the sensor data by using a low-pass filtering method;
and (3): segmenting the filtered sensor data to obtain data segments, and for each data segment Pk=[A1,A2,...,AL]∈RN×LAnd L is the length of the fragment,
Figure BDA0002557483920000011
1, 2, L, N is the sensor data dimension, k is 1, 2,.., K and K are the number of data segments, the behavior class label of the data segment is marked as j, and j corresponds to the behavior class mj,mjE.m, J1, 2, J, all data segments together forming a sample set S, S ═ P1,P2,...,PK];
And (4): constructing a deep bidirectional LSTM behavior recognition network model based on bidirectional LSTM;
and (5): dividing a sample set S into a model training set STAnd model validation set SVUsing the model training set STTraining a deep bidirectional LSTM behavior recognition network model and performing verification on a verification set SVCarrying out model verification;
and (6): and (4) identifying the behavior by using the trained deep bidirectional LSTM network model to obtain an identification result.
Further, in the step (1), the collected sensor data includes data of three axes x, y, and z of the accelerometer, and the collected motion set M includes 6 types of walking, jogging, going upstairs, going downstairs, standing, and sitting.
Still further, in the step (3), the segmentation is performed by using a sliding window method with a window length of L and an overlap coverage of ρ%.
Further, in the step (4), the constructed deep bidirectional LSTM behavior recognition network model includes an input layer, a fully connected layer, a hidden layer, and an output layer. Each input data segment P_input = [A_1, A_2, ..., A_L] ∈ R^(N×L) is converted by the fully connected layer into [B_1, B_2, ..., B_L] ∈ R^(N_fc×L), where B_i ∈ R^(N_fc), i = 1, 2, ..., L, and N_fc is the number of nodes of the fully connected layer. The hidden layer comprises a forward propagation layer and a backward propagation layer, both of depth 3; each layer uses LSTM units, with 64 LSTM units per layer. The outputs of the forward and backward passes are concatenated, the concatenated sequence at the last time step is input into the output layer, the output layer has 6 nodes corresponding to the number of behavior categories, and the index of the maximum output value is taken as the final recognition category label.
Further, in the step (5), the deep bidirectional LSTM behavior recognition network model is trained with the training set S_T and the class label corresponding to each sample segment, validation is performed on the validation set S_V with accuracy as the evaluation criterion, and the model parameters from the iteration cycle with the highest accuracy on S_V are taken as the final deep bidirectional LSTM behavior recognition network model.
Further, in the step (6), a low-pass filtering method is adopted to perform filtering and denoising processing on accelerometer data acquired in real time, a sliding window method with a window length of L and an overlapping coverage rate of ρ% is adopted to perform segmentation to obtain a data segment at the current moment, and the data segment is input into a trained deep bidirectional LSTM behavior recognition network to recognize the current behavior category.
The invention has the following beneficial effects: the method provides fast recognition results with high accuracy.
Drawings
FIG. 1 is a flow chart of a behavior recognition method according to the present invention;
FIG. 2 is a deep bidirectional LSTM behavior recognition network model of the present invention.
Detailed Description
The invention is further described below with reference to the accompanying drawings.
Referring to fig. 1 and 2, a bidirectional LSTM-based human behavior recognition method includes the following steps:
step (1): the implementation method comprises the steps of collecting sensor data under different human behavior categories, using a WISDM public data set, placing a smart phone in a trouser pocket, collecting accelerometer sensor data, wherein the sampling frequency is 20Hz, the number of people participating in data collection is 36, the collected actions are 6 types, namely walking, going upstairs, going downstairs, jogging, sitting and standing, the data set comprises 1098209 pieces of sampling data, and each piece of sampling data comprises an experimenter number, behaviors and 3-axis acceleration data.
Step (2): the method comprises the steps of utilizing a low-pass filtering method to carry out filtering and denoising processing on sensor data, and adopting a moving average filter with the window length of 5 to carry out denoising processing on accelerometer sensor data.
And (3): the implementation method comprises the steps of firstly removing missing points in a data set, then carrying out data segmentation by using a sliding window method with the window length of 60 and the overlapping coverage rate of 50%, carrying out total segmentation to obtain 35998 samples, wherein each sample is 60 points long, contains a 3-axis acceleration time sequence, and marks a behavior class label of each sample.
And (4): a deep bidirectional LSTM behavior recognition network model is constructed based on bidirectional LSTM, and the bidirectional LSTM behavior recognition network model designed by the implementation method is shown in figure 2. The model comprises an input layer, a full connection layer, a hidden layer and an output layer, wherein the number of nodes of the full connection layer is 32, the hidden layer comprises a forward propagation layer and a backward propagation layer, the depth of the forward propagation layer and the depth of the backward propagation layer are 3, each layer adopts an LSTM unit, and the number of the LSTM units of each layer is 64; and the output of the forward transmission and the reverse transmission is spliced through splicing operation, the sequence of the last moment after splicing is input into an output layer, the number of nodes of the output layer is 6, the index of the maximum value output by the output layer is used as a final identification category label corresponding to the number of behavior categories.
And (5): training and verifying a deep bidirectional LSTM behavior recognition network model, dividing 70% of samples obtained in the step (3) into a training set, and dividing 30% of the samples into a verification set, namely 25198 samples and 10800 samples are respectively contained in the training set and the verification set, training the deep bidirectional LSTM behavior recognition network model by using the model training set, initially setting the learning rate of the network to be 0.0025, and reducing the learning rate along with the increase of iteration times, namely after 50 iterations are carried out, setting the learning rate to be 0.8 originally. In order to enhance the generalization capability of the network and relieve the over-fitting problem, L2 regularization is adopted to limit the weight, the regularization coefficient is 0.0015, an Adam optimizer is adopted to train the model parameters, and the loss function adopts a cross entropy loss function. Model validation and selection is performed on the validation set. The recognition accuracy of the finally selected model on the validation set was 98.5%. Table 1 shows the identification confusion matrix of the model on the test set. The vertical direction is the real category and the horizontal direction is the recognition category.
             Walking  Upstairs  Downstairs  Jogging  Sitting  Standing
Walking         4232        18          20        1        2         0
Upstairs          14      1135          27       10        0         1
Downstairs        13        30         882        8        0         0
Jogging            8         4           7     3361        0         0
Sitting            0         0           0        0      559         4
Standing           0         0           0        0        0       464

TABLE 1
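The training procedure in step (5) can be sketched as below. The hyperparameters (Adam, initial learning rate 0.0025, ×0.8 decay every 50 iterations, L2 coefficient 0.0015, cross-entropy loss, best-on-validation model selection) follow the description; realizing the L2 term through Adam's `weight_decay` and all function names are assumptions.

```python
import torch
import torch.nn as nn

def train(model, train_loader, val_loader, epochs=200):
    """Train with the stated hyperparameters and keep the weights that
    score the highest accuracy on the validation set."""
    opt = torch.optim.Adam(model.parameters(), lr=0.0025, weight_decay=0.0015)
    sched = torch.optim.lr_scheduler.StepLR(opt, step_size=50, gamma=0.8)
    loss_fn = nn.CrossEntropyLoss()
    best_acc, best_state = -1.0, None
    for _ in range(epochs):
        model.train()
        for x, y in train_loader:
            opt.zero_grad()
            loss_fn(model(x), y).backward()
            opt.step()
            sched.step()                  # decay per iteration, not per epoch
        model.eval()
        correct = total = 0
        with torch.no_grad():
            for x, y in val_loader:
                correct += (model(x).argmax(1) == y).sum().item()
                total += y.numel()
        acc = correct / total
        if acc > best_acc:                # remember the best validation model
            best_acc = acc
            best_state = {k: v.clone() for k, v in model.state_dict().items()}
    model.load_state_dict(best_state)
    return model, best_acc
```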
Step (6): recognizing behavior with the trained deep bidirectional LSTM network model to obtain the recognition result. A smartphone or other mobile device is placed in a trouser pocket to collect real-time accelerometer data; the data are denoised with a moving average filter of window length 5, segmented with a sliding window of length 60 and 50% overlap to obtain the data segment at the current moment, and this segment is input into the bidirectional LSTM behavior recognition network trained in step (5) to recognize the current behavior category.
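The recognition stage can be sketched end-to-end as below. The class name list, its ordering, and all identifiers are assumptions for illustration; `model` is any classifier mapping a (1, 60, 3) tensor to 6 logits, such as the network described in step (4).

```python
import numpy as np
import torch

# Assumed class ordering -- the patent does not fix the label order.
CLASSES = ["walking", "upstairs", "downstairs", "jogging", "sitting", "standing"]

def recognize_current(model, raw_xyz, window=60):
    """Smooth the latest raw 3-axis samples with a length-5 moving average,
    take the most recent length-`window` segment, and classify it.

    raw_xyz: (n_samples, 3) array of real-time accelerometer readings.
    """
    kernel = np.ones(5) / 5
    filtered = np.stack([np.convolve(raw_xyz[:, a], kernel, mode="same")
                         for a in range(raw_xyz.shape[1])], axis=1)
    segment = filtered[-window:]                       # current data segment
    x = torch.as_tensor(segment, dtype=torch.float32).unsqueeze(0)
    with torch.no_grad():
        idx = model(x).argmax(dim=1).item()            # index of max output
    return CLASSES[idx]
```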

Claims (6)

1. A human behavior recognition method based on bidirectional LSTM is characterized by comprising the following steps:
step (1): the method comprises the steps of collecting sensor data under different human behavior categories, wherein the collected action categories are set as M ═ M1,m2,...,mJJ is a predefined number of behavior categories;
step (2): carrying out filtering and denoising processing on the sensor data by using a low-pass filtering method;
and (3): segmenting the filtered sensor data to obtain data segments, and for each data segment Pk=[A1,A2,...,AL]∈RN×LAnd L is the length of the fragment,
Figure FDA0002557483910000011
Figure FDA0002557483910000012
n is the dimension of the sensor data, K is 1, 2, and K is the number of data segments, the behavior class label of the data segment is marked as j, and j corresponds to the behavior class mj,mjE.m, J1, 2, J, all data segments together forming a sample set S, S ═ P1,P2,...,PK];
And (4): constructing a deep bidirectional LSTM behavior recognition network model based on bidirectional LSTM;
and (5): dividing a sample set S into a model training set STAnd model validation set SVTraining by means of modelsCollection STTraining a deep bidirectional LSTM behavior recognition network model and performing verification on a verification set SVCarrying out model verification;
and (6): and (4) identifying the behavior by using the trained deep bidirectional LSTM network model to obtain an identification result.
2. The bi-directional LSTM based behavior recognition method of claim 1, wherein: in the step (1), the collected sensor data includes data of three axes x, y and z of the accelerometer, and the collected motion set M includes 6 types including walking, jogging, going upstairs, going downstairs, standing and sitting.
3. The bidirectional LSTM-based behavior recognition method according to claim 1 or 2, wherein in the step (3), the segmentation is performed by using a sliding window method with a window length of L and an overlapping coverage rate of ρ%.
4. The bidirectional LSTM-based behavior recognition method of claim 1 or 2, wherein in step (4), the constructed deep bidirectional LSTM behavior recognition network model comprises an input layer, a fully connected layer, a hidden layer, and an output layer; each input data segment P_input = [A_1, A_2, ..., A_L] ∈ R^(N×L) is converted by the fully connected layer into [B_1, B_2, ..., B_L] ∈ R^(N_fc×L), where B_i ∈ R^(N_fc), i = 1, 2, ..., L, and N_fc is the number of nodes of the fully connected layer; the hidden layer comprises a forward propagation layer and a backward propagation layer, both of depth 3, each layer using LSTM units with 64 LSTM units per layer; the outputs of the forward and backward passes are concatenated, the concatenated sequence at the last time step is input into the output layer, the output layer has 6 nodes corresponding to the number of behavior categories, and the index of the maximum output value is taken as the final recognition category label.
5. The bidirectional LSTM-based behavior recognition method of claim 1 or 2, wherein in step (5), the deep bidirectional LSTM behavior recognition network model is trained with the training set S_T and the class label corresponding to each sample segment, validation is performed on the validation set S_V with accuracy as the evaluation criterion, and the model parameters from the iteration cycle with the highest accuracy on S_V are taken as the final deep bidirectional LSTM behavior recognition network model.
6. The bidirectional LSTM-based behavior recognition method of claim 1 or 2, wherein in the step (6), the accelerometer data collected in real time are filtered and denoised by a low-pass filtering method, a data segment at the current moment is obtained by segmenting with a sliding window method with a window length of L and an overlapping coverage rate of ρ%, and the data segment is input into the trained deep bidirectional LSTM behavior recognition network to recognize the current behavior category.
CN202010596167.4A 2020-06-28 2020-06-28 Human behavior recognition method based on bidirectional LSTM Pending CN111797743A (en)

Priority Applications (1)

Application Number Priority Date Filing Date Title
CN202010596167.4A CN111797743A (en) 2020-06-28 2020-06-28 Human behavior recognition method based on bidirectional LSTM

Applications Claiming Priority (1)

Application Number Priority Date Filing Date Title
CN202010596167.4A CN111797743A (en) 2020-06-28 2020-06-28 Human behavior recognition method based on bidirectional LSTM

Publications (1)

Publication Number Publication Date
CN111797743A true CN111797743A (en) 2020-10-20

Family

ID=72804594

Family Applications (1)

Application Number Title Priority Date Filing Date
CN202010596167.4A Pending CN111797743A (en) 2020-06-28 2020-06-28 Human behavior recognition method based on bidirectional LSTM

Country Status (1)

Country Link
CN (1) CN111797743A (en)

Cited By (2)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN113742669A (en) * 2021-08-18 2021-12-03 浙江工业大学 User authentication method based on twin network
CN113742669B (en) * 2021-08-18 2024-05-14 浙江工业大学 User authentication method based on twin network

Similar Documents

Publication Publication Date Title
CN104143079B (en) The method and system of face character identification
CN106650687B (en) Posture correction method based on depth information and skeleton information
CN107219924B (en) A kind of aerial gesture identification method based on inertial sensor
CN106096662A (en) Human motion state identification based on acceleration transducer
CN105160310A (en) 3D (three-dimensional) convolutional neural network based human body behavior recognition method
CN109101938B (en) Multi-label age estimation method based on convolutional neural network
CN108229268A (en) Expression Recognition and convolutional neural networks model training method, device and electronic equipment
CN107462258A (en) A kind of step-recording method based on mobile phone 3-axis acceleration sensor
CN108257158A (en) A kind of target prediction and tracking based on Recognition with Recurrent Neural Network
CN106228200A (en) A kind of action identification method not relying on action message collecting device
CN105205449A (en) Sign language recognition method based on deep learning
CN111476161A (en) Somatosensory dynamic gesture recognition method fusing image and physiological signal dual channels
CN105943206A (en) Prosthetic hand control method based on MYO armlet
CN103400123A (en) Gait type identification method based on three-axis acceleration sensor and neural network
CN113723312B (en) Rice disease identification method based on visual transducer
CN103886030B (en) Cost-sensitive decision-making tree based physical information fusion system data classification method
CN110197224A (en) Aerial hand-written character track restoration methods based on the confrontation study of feature space depth
CN110363233A (en) A kind of the fine granularity image-recognizing method and system of the convolutional neural networks based on block detector and Fusion Features
CN112464738A (en) Improved naive Bayes algorithm user behavior identification method based on mobile phone sensor
CN109903053A (en) A kind of anti-fraud method carrying out Activity recognition based on sensing data
CN108960171B (en) Method for converting gesture recognition into identity recognition based on feature transfer learning
CN110501741A (en) A kind of useful signal detection method and system
CN111738178A (en) Wearing mask facial expression recognition method based on deep learning
CN107506781A (en) A kind of Human bodys' response method based on BP neural network
CN113642432A (en) Method for identifying human body posture by convolutional neural network based on covariance matrix transformation

Legal Events

Date Code Title Description
PB01 Publication
SE01 Entry into force of request for substantive examination