CN114782987A

CN114782987A - Millimeter wave radar attitude identification method based on depth camera supervision

Info

Publication number: CN114782987A
Application number: CN202210314377.9A
Authority: CN
Inventors: 苟先太; 周晨晨; 魏亚林; 黄毅凯; 苟瀚文; 姚一可
Original assignee: Sichuan Bawei Jiuzhang Technology Co ltd; Southwest Jiaotong University
Current assignee: Sichuan Bawei Jiuzhang Technology Co ltd
Priority date: 2022-03-28
Filing date: 2022-03-28
Publication date: 2022-07-22
Anticipated expiration: 2042-03-28
Also published as: CN114782987B

Abstract

The invention discloses a millimeter wave radar attitude identification method based on depth camera supervision, which adopts a result of identification of a depth camera and a trained first deep learning model as a tag of millimeter wave radar data, trains a second deep learning model by using a generated tagged data set, can acquire a test set from a millimeter wave radar in real time after the training is finished to test the effect of the second deep learning model, and considers that the second deep learning model is trained when the accuracy reaches a set threshold. The data that the degree of depth camera acquireed need not the storage, and whole artifical the participation of not having, does not have visual scene, and the degree of depth camera can be withdrawn from after the training of second degree of depth learning model is accomplished, can effectively solve user privacy problem. In addition, the method can also flexibly adjust the second deep learning model in a specific environment according to the specific requirements of specific objects, and the problem of single application of the model is solved.

Description

Millimeter wave radar attitude identification method based on depth camera supervision

Technical Field

The invention relates to the field of human body posture recognition, in particular to a millimeter wave radar posture recognition method based on depth camera supervision.

Background

With the development of society, people pay more and more attention to physical conditions, especially to the physical health of the elderly. The elderly cannot get timely intervention treatment when in sudden diseases or accidents, and serious consequences are easy to cause. Although the real-time state of the old people can be concerned by advanced wearable equipment or video monitoring equipment, the real-time state of the old people can be concerned by the advanced wearable equipment or video monitoring equipment, but the real-time state of the old people also faces the difficult-to-overcome problems, including that contact type equipment is expensive and inconvenient to wear, and the old people are easy to conflict with the contact type equipment; and privacy issues arising from video surveillance equipment, make such solutions difficult to implement on the ground. In addition, different old people have different health problems, different health states have different processing on detection information, and classification and judgment based on a single model also leads to single application scene of the traditional non-contact detection scheme.

The depth camera is widely applied in the field of gesture recognition, has a good recognition effect, still faces the problem that privacy is not protected, and cannot be directly used in daily nursing of old people.

The millimeter wave radar has the characteristics of all weather, non-contact and no imaging, and is just suitable for daily nursing of the old. However, the millimeter wave radar is greatly influenced by changes of the use environment, and due to the characteristic that the millimeter wave radar is not imaged, great difficulty is caused to labeling of a data set during gesture recognition learning, human gestures corresponding to point cloud data are difficult to judge through human eyes, and a large amount of labor and energy are consumed in the learning process.

Disclosure of Invention

Aiming at the defects in the prior art, the millimeter wave radar attitude identification method based on the depth camera supervision solves the problems that privacy is not protected and a millimeter wave radar scheme is difficult to label a data set in the conventional mode of directly identifying and monitoring the attitude by adopting the depth camera.

In order to achieve the purpose of the invention, the invention adopts the technical scheme that:

the millimeter wave radar attitude identification method based on depth camera supervision comprises the following steps:

s1, respectively constructing a first deep learning model and a second deep learning model; simultaneously carrying out data acquisition on an acquisition object by a depth camera and a millimeter wave radar; for a first group of collected objects, taking data collected by a depth camera as a first data set, and taking data collected by a millimeter wave radar as a second data set; for a second group of collected objects, taking data collected by the depth camera as a third data set, and taking data collected by the millimeter wave radar as a fourth data set;

s2, performing posture recognition on data at each moment in the first data set through a first deep learning model to obtain a first target posture set; extracting the characteristic parameters of each moment in the second data set to obtain a first characteristic parameter set;

s3, for data at the same moment, carrying out attitude marking on the second characteristic parameter set by adopting the first target attitude set to obtain marked data;

s4, training the second deep learning model by taking the data with the labels as a training set to obtain a pre-trained second deep learning model;

s5, performing posture recognition on the data at each moment in the third data set through the first deep learning model to obtain a second target posture set; extracting the characteristic parameter of each moment in the fourth data set to obtain a second characteristic parameter set;

s6, taking the second characteristic parameter set as a training set of the pre-trained second deep learning model, and comparing the output of the pre-trained second deep learning model with a second target posture set to obtain the posture identification accuracy of the pre-trained second deep learning model;

s7, judging whether the gesture recognition accuracy of the pre-trained second deep learning model reaches a threshold value, if so, taking the current pre-trained second deep learning model as a final gesture recognition model, and entering the step S8; otherwise, modifying the parameters of the pre-trained second deep learning model, and returning to the step S3;

and S8, adopting the millimeter wave radar as a data acquirer of the object to be recognized, adopting the final posture recognition model to perform posture recognition on the data acquired by the millimeter wave radar, and outputting a recognition result.

Further, the specific method for performing gesture recognition on the data at each time in the first data set in step S2 includes the following sub-steps:

s2-1, acquiring position information of key points of human body features for data at each moment in the first data set;

s2-2, calculating human posture characteristic parameters according to the position information of the human characteristic key points;

s2-3, taking the extracted human body posture characteristic parameters as input of the first deep learning model, and obtaining a posture label output by the first deep learning model to obtain a first target posture set.

Further, the key points of the human body characteristics in the step S2-1 include a neck, a right shoulder, a right elbow, a right wrist, a left shoulder, a left elbow, a left wrist, a right hip, a right knee, a right ankle, a left hip, a left knee, and a left ankle.

Further, the specific method of step S2-2 is:

according to the formula:

acquiring the height H of a human body; wherein H_topThe distance from the neck to the central point of the connecting line of the left hip and the right hip; h_bottomThe mean value of the lengths of the left leg and the right leg; h_LLeft leg length; h_RIs the length of the right leg; x is a radical of a fluorine atom_neck、y_neckAnd z_neckRespectively an x-axis coordinate value, a y-axis coordinate value and a z-axis coordinate value of the neck in the three-dimensional coordinate system; x is the number of_l-hip、y_l-hipAnd z_l-hipRespectively an x-axis coordinate value, a y-axis coordinate value and a z-axis coordinate value of the left hip in a three-dimensional coordinate system; x is the number of_r-hip、y_r-hipAnd z_r-hipRespectively an x-axis coordinate value, a y-axis coordinate value and a z-axis coordinate value of the right hip in a three-dimensional coordinate system; x is a radical of a fluorine atom_l-knee、y_l-kneeAnd z_l-kneeThe coordinate values of the x axis, the y axis and the z axis of the left knee in the three-dimensional coordinate system are respectively; x is the number of_r-knee、y_r-kneeAnd z_r-kneeRespectively an x-axis coordinate value, a y-axis coordinate value and a z-axis coordinate value of the right knee in a three-dimensional coordinate system; x is the number of_l-ank、y_l-ankAnd z_l-ankRespectively an x-axis coordinate value, a y-axis coordinate value and a z-axis coordinate value of the left ankle in a three-dimensional coordinate system; x is the number of_r-ank、y_r-ankAnd z_r-ankRespectively an x-axis coordinate value, a y-axis coordinate value and a z-axis coordinate value of the right ankle in a three-dimensional coordinate system; x is a radical of a fluorine atom_l-knee、y_l-kneeAnd z_l-kneeRespectively an x-axis coordinate value, a y-axis coordinate value and a z-axis coordinate value of the left knee in a three-dimensional coordinate system; x is a radical of a fluorine atom_r-knee、y_r-kneeAnd z_r-kneeThe coordinate values of the x axis, the y axis and the z axis of the right knee in the three-dimensional coordinate system are respectively;

according to the formula:

H_p＝max[(z_neck-z_l-ank),(z_neck-z_r-ank)]

obtaining the maximum value H of the height difference between the neck and the left ankle and the height difference between the neck and the right ankle_pAnd taking it as the current height; wherein max [. C]Is a function of taking the maximum value;

according to the formula:

obtaining the included angle theta between the connecting line of the neck part and the midpoints of the left and right buttocks and the horizontal direction_topAnd the upper half body is taken as the horizontal inclination angle; wherein tan (·) is a tangent function;

according to the formula:

obtaining the mean value theta of the included angle between the connecting line from the left hip to the left ankle and the horizontal direction and the included angle between the connecting line from the right hip to the right ankle and the horizontal direction_bottomAnd the lower half body is used as a horizontal inclination angle;

according to the formula:

obtaining the mean value theta of the included angle between the connecting line from the left hip to the left knee and the connecting line from the left knee to the left ankle and the included angle between the connecting line from the right hip to the right knee and the connecting line from the right knee to the right ankle_thigh-calfAnd the included angle between the thigh and the shank is taken as the included angle;

according to the formula:

obtaining the included angle theta between the connecting line from the neck to the midpoint of the left and right buttocks and the connecting line from the midpoint of the left and right buttocks to the midpoint of the left and right knees_top-thighAnd the included angle between the upper half body and the thigh is taken as the included angle;

according to the formula:

obtaining the midpoint of the connecting line of the left hip and the right hipCoordinate (x) of_g,y_g,z_g) And using the coordinate as the center coordinate of the human body;

namely the human body posture characteristic parameters comprise the height H of the human body and the current height H_pUpper half and horizontal inclination angle theta_topLower half and horizontal inclination angle theta_bottomThe included angle theta between thigh and shank_thigh-calfUpper half and thigh angle theta_top-thighAnd body center coordinates (x)_g,y_g,z_g)。

Further, in step S2-3, the first deep learning model includes an input layer, a hidden layer, and an output layer, which are connected in sequence; wherein:

the input of the input layer is a characteristic vector formed by human posture characteristic parameters;

number of nodes N of hidden layer_hiddenComprises the following steps:

wherein N is_inThe number of nodes of the input layer; n is a radical of hydrogen_outThe number of nodes of the output layer; con is [1,10 ]]A constant between;

the number of output nodes of the output layer is 5, that is, 5 gesture recognition results are included, which are respectively: standing, falling, sitting, squatting and walking.

Further, the specific method for extracting the feature parameters of each time in the second data set and the fourth data set is the same, and comprises the following steps:

according to the formula:

x_max＝max{R_i cosθsinα}i∈[1,m]

x_min＝min{R_i cosθsinα}i∈[1,m]

y_max＝max{R_i cosθcosα}i∈[1,m]

y_min＝min{R_i cosθcosα}i∈[1,m]

z_max＝max{R_i sinθ}i∈[1,m]

z_min＝min{R_i sinθ}i∈[1,m]

obtaining the maximum value x in the x direction_maxMinimum value in x directionx_minY-direction maximum value y_maxMinimum value y in y-direction_minAnd a maximum value z in the z direction_maxAnd z-direction minimum value z_min(ii) a Wherein max {. is a function of taking the maximum value; min {. is a function for taking the minimum value; r is_iThe distance parameter is the ith distance parameter in the second data set or the fourth data set, and m is the total number of the distance parameters in the second data set or the fourth data set; cos is a cosine function; sin is a sine function; θ is the pitch angle in the second data set or the fourth data set; α is the azimuth in the second data set or the fourth data set;

according to the formula:

obtaining x-direction velocity V_xY-direction velocity V_yAnd velocity V in z direction_z(ii) a Wherein V_iRepresenting the ith radial velocity in the second data set or the fourth data set;

according to the formula:

obtaining target center coordinates

Namely, the characteristic parameter corresponding to the data collected by the millimeter wave radar comprises the maximum value x in the x direction_maxThe minimum value x in the x direction_minY-direction maximum value y_maxMinimum value y in y-direction_minAnd a maximum value z in the z direction_maxMinimum value z in z direction_minX direction velocity V_xY-direction velocity V_yZ-direction velocity V_zAnd target center coordinates

Further, in step S4, the second deep learning model includes a first convolution layer, a first ReLU active layer, a first dropout layer, a second convolution layer, a second ReLU active layer, a second dropout layer, a first fully-connected layer, a second fully-connected layer, and a softmax layer, which are connected in sequence; wherein:

the first convolution layer and the second convolution layer are both 1-D convolution layers, and the expression is as follows:

wherein f is_conv(k) An output vector representing the 1-D convolution layer when the number of sliding steps is k; x (j) represents the j-th data in the input; n is the input data length; w (k) represents a convolution kernel when the number of sliding steps is k;

the dropout rates of the first dropout layer and the second dropout layer are both 0.25;

the softmax layer includes 5 output neurons, and the expression of the softmax layer is:

wherein a is_gG-th input signal representing softmax layer; y is_gRepresenting the output of the g output neuron of the softmax layer; e is a constant; a is a_qIs the q outputThe signal, the qth gesture.

Further, the threshold value in step S7 is 0.9.

The invention has the beneficial effects that: the method can overcome the difficulty of training set generation in the gesture recognition learning process, save a large amount of manpower, remove the depth camera assistance after the training is finished, solve the privacy problem, carry out specific learning according to specific scenes and specific personnel, have stronger applicability and improve the accuracy of the gesture recognition of the millimeter wave radar.

Drawings

FIG. 1 is a schematic flow diagram of the process.

Detailed Description

The following description of the embodiments of the present invention is provided to facilitate the understanding of the present invention by those skilled in the art, but it should be understood that the present invention is not limited to the scope of the embodiments, and it will be apparent to those skilled in the art that various changes may be made without departing from the spirit and scope of the invention as defined and defined by the appended claims, and all changes that can be made by the invention using the inventive concept are intended to be protected.

As shown in fig. 1, the millimeter wave radar attitude identification method based on depth camera supervision comprises the following steps:

s1, respectively constructing a first deep learning model and a second deep learning model; simultaneously carrying out data acquisition on an acquisition object by a depth camera and a millimeter wave radar; for a first group of collected objects, taking data collected by a depth camera as a first data set, and taking data collected by a millimeter wave radar as a second data set; for a second group of collected objects, taking data collected by the depth camera as a third data set, and taking data collected by the millimeter wave radar as a fourth data set; the depth camera is adjacent to the millimeter wave radar, and the visual fields of the depth camera and the millimeter wave radar are approximately the same by adjusting angles and the like;

s2, performing gesture recognition on the data at each moment in the first data set through a first deep learning model to obtain a first target gesture set; extracting the characteristic parameters of each moment in the second data set to obtain a first characteristic parameter set;

s4, training the second deep learning model by taking the labeled data as a training set to obtain a pre-trained second deep learning model;

s5, performing gesture recognition on the data at each moment in the third data set through the first deep learning model to obtain a second target gesture set; extracting the characteristic parameter of each moment in the fourth data set to obtain a second characteristic parameter set;

s6, taking the second characteristic parameter set as a training set of the pre-trained second deep learning model, and comparing the output of the pre-trained second deep learning model with a second target posture set to obtain the posture recognition accuracy of the pre-trained second deep learning model;

s7, judging whether the gesture recognition accuracy of the pre-trained second deep learning model reaches a threshold value, if so, taking the currently pre-trained second deep learning model as a final gesture recognition model, and entering S8; otherwise, modifying the pre-trained second deep learning model parameters, and returning to the step S3;

and S8, adopting the millimeter wave radar as a data acquirer of the object to be recognized, adopting the final attitude recognition model to perform attitude recognition on the data acquired by the millimeter wave radar, and outputting a recognition result.

The specific method for performing gesture recognition on the data at each moment in the first data set in step S2 includes the following sub-steps:

s2-1, acquiring the position information of the key points of the human body features for the data at each moment in the first data set;

The key points of the human body characteristics in the step S2-1 include a neck, a right shoulder, a right elbow, a right wrist, a left shoulder, a left elbow, a left wrist, a right hip, a right knee, a right ankle, a left hip, a left knee, and a left ankle.

The specific method of step S2-2 is:

according to the formula:

acquiring the height H of a human body; wherein H_topThe distance from the neck to the central point of the connecting line of the left hip and the right hip; h_bottomThe mean value of the lengths of the left leg and the right leg; h_LLeft leg length; h_RIs the right leg length; x is the number of_neck、y_neckAnd z_neckRespectively an x-axis coordinate value, a y-axis coordinate value and a z-axis coordinate value of the neck in a three-dimensional coordinate system; x is a radical of a fluorine atom_l-hip、y_l-hipAnd z_l-hipRespectively an x-axis coordinate value, a y-axis coordinate value and a z-axis coordinate value of the left hip in a three-dimensional coordinate system; x is the number of_r-hip、y_r-hipAnd z_r-hipRespectively an x-axis coordinate value, a y-axis coordinate value and a z-axis coordinate value of the right hip in a three-dimensional coordinate system; x is the number of_l-knee、y_l-kneeAnd z_l-kneeThe coordinate values of the x axis, the y axis and the z axis of the left knee in the three-dimensional coordinate system are respectively; x is the number of_r-knee、y_r-kneeAnd z_r-kneeRespectively an x-axis coordinate value, a y-axis coordinate value and a z-axis coordinate value of the right knee in a three-dimensional coordinate system; x is a radical of a fluorine atom_l-ank、y_l-ankAnd z_l-ankRespectively an x-axis coordinate value, a y-axis coordinate value and a z-axis coordinate value of the left ankle in a three-dimensional coordinate system; x is a radical of a fluorine atom_r-ank、y_r-ankAnd z_r-ankRespectively an x-axis coordinate value, a y-axis coordinate value and a z-axis coordinate value of the right ankle in a three-dimensional coordinate system; x is the number of_l-knee、y_l-kneeAnd z_l-kneeRespectively an x-axis coordinate value, a y-axis coordinate value and a z-axis coordinate value of the left knee in a three-dimensional coordinate system; x is a radical of a fluorine atom_r-knee、y_r-kneeAnd z_r-kneeThe coordinate values of the x axis, the y axis and the z axis of the right knee in the three-dimensional coordinate system are respectively;

according to the formula:

H_p＝max[(z_neck-z_l-ank),(z_neck-z_r-ank)]

according to the formula:

according to the formula:

according to the formula:

obtaining the mean value theta of the included angle between the connecting line from the left hip to the left knee and the connecting line from the left knee to the left ankle and the included angle between the connecting line from the right hip to the right knee and the connecting line from the right knee to the right ankle_thigh-calfAnd the angle between the thigh and the shank is taken as the angle between the thigh and the shank;

according to the formula:

obtaining the included angle theta between the connecting line from the neck to the midpoint of the left and right buttocks and the connecting line from the midpoint of the left and right buttocks to the midpoint of the left and right knees_top-thighAnd the angle between the upper half body and the thigh is taken as the angle;

according to the formula:

obtaining the coordinate (x) of the middle point of the connecting line of the left hip and the right hip_g,y_g,z_g) And using the coordinate as the center coordinate of the human body;

namely the human body posture characteristic parameters comprise the height H of the human body and the current height H_pUpper body and horizontal inclination angle theta_topLower half and horizontal inclination angle theta_bottomThe included angle theta between thigh and shank_thigh-calfUpper half and thigh angle theta_top-thighAnd the center coordinates (x) of the human body_g,y_g,z_g)。

In the step S2-3, the first deep learning model comprises an input layer, a hidden layer and an output layer which are connected in sequence; wherein:

number of nodes N of hidden layer_hiddenComprises the following steps:

wherein N is_inThe number of nodes of the input layer; n is a radical of_outThe number of nodes of the output layer; con is [1,10 ]]A constant between;

The specific method for extracting the characteristic parameters of each moment in the second data set and the fourth data set is the same, and comprises the following steps:

according to the formula:

x_max＝max{R_i cosθsinα}i∈[1,m]

x_min＝min{R_i cosθsinα}i∈[1,m]

y_max＝max{R_i cosθcosα}i∈[1,m]

y_min＝min{R_i cosθcosα}i∈[1,m]

z_max＝max{R_i sinθ}i∈[1,m]

z_min＝min{R_i sinθ}i∈[1,m]

obtaining the x-squareTo a maximum value x_maxThe minimum value x in the x direction_minY-direction maximum value y_maxMinimum value y in y-direction_minAnd a maximum value z in the z direction_maxAnd z-direction minimum value z_min(ii) a Wherein max {. is a function taking the maximum value; min {. is a function for taking the minimum value; r_iThe distance parameter is the ith distance parameter in the second data set or the fourth data set, and m is the total number of the distance parameters in the second data set or the fourth data set; cos is a cosine function; sin is a sine function; θ is the pitch angle in the second data set or the fourth data set; α is the azimuth in the second data set or the fourth data set;

according to the formula:

obtaining x-direction velocity V_xY-direction velocity V_yAnd z-direction velocity V_z(ii) a Wherein V_iRepresenting the ith radial velocity in the second data set or the fourth data set;

according to the formula:

obtaining target center coordinates

Namely, the characteristic parameter corresponding to the data collected by the millimeter wave radar comprises the maximum value x in the x direction_maxThe minimum value x in the x direction_minY-direction maximum value y_maxMinimum value y in y-direction_minAnd a maximum value z in the z direction_maxThe minimum value z in the z direction_minX direction velocity V_xY-direction velocity V_yZ-direction velocity V_zAnd target center coordinates

Forming the characteristic parameters of the same time into vectors

And combining the posture labels obtained by the first deep learning model to form a training set of a second deep learning model.

In the step S4, the second deep learning model includes a first convolution layer, a first ReLU active layer, a first dropout layer, a second convolution layer, a second ReLU active layer, a second dropout layer, a first full-link layer, a second full-link layer, and a softmax layer, which are connected in sequence; wherein:

the softmax layer comprises 5 output neurons, and the expression of the softmax layer is as follows:

wherein a is_gThe g-th input signal representing the softmax layer; y is_gRepresenting the output of the g output neuron of the softmax layer; e is a constant; a is a_qIs the qth output signal, i.e., the qth gesture.

In one embodiment of the invention, the threshold in step S7 is 0.9.

In summary, the model provided by the invention integrates data acquisition, data labeling, model training, model testing, model correction and model application. Firstly, a result of recognition of the depth camera and the trained first depth learning model is used as a label of millimeter wave radar data, a three-dimensional point cloud picture generated by the millimeter wave radar does not have visual attitude characteristics, the attitude result of the three-dimensional point cloud picture is difficult to judge manually through simple naked eyes, manpower is effectively liberated in the process, and the difficulty of manually marking the millimeter wave radar data is solved. And then training a second deep learning model by using the generated labeled data set, obtaining a test set from the millimeter wave radar in real time after the training is finished, testing the effect of the second deep learning model by taking the recognition result of the first deep learning model as a standard, and when the accuracy reaches a set threshold, considering that the training of the second deep learning model is finished, or automatically adjusting the parameters of the model and repeating the process until the training of the model is finished. The data that the degree of depth camera acquireed need not the storage, and whole artifical the participation, does not have visual scene, and the degree of depth camera can be withdrawn from after the training of second degree of depth learning model is accomplished, can effectively solve user privacy problem. In addition, the method can also flexibly adjust the second deep learning model in a specific environment according to specific requirements of specific objects, and the problem of single model application is solved.

Claims

1. A millimeter wave radar attitude identification method based on depth camera supervision is characterized by comprising the following steps:

s3, for the data at the same moment, adopting the first target attitude set to perform attitude marking on the second characteristic parameter set to obtain the data with the labels;

2. The depth camera surveillance-based millimeter wave radar attitude recognition method according to claim 1, wherein the specific method for performing attitude recognition on the data at each moment in the first data set in step S2 comprises the following sub-steps:

3. The millimeter wave radar posture recognition method based on depth camera surveillance as claimed in claim 2, wherein the key points of the human body features in step S2-1 comprise neck, right shoulder, right elbow, right wrist, left shoulder, left elbow, left wrist, right hip, right knee, right ankle, left hip, left knee and left ankle.

4. The millimeter wave radar attitude identification method based on depth camera supervision according to claim 3, characterized in that the specific method of step S2-2 is:

according to the formula:

acquiring the height H of a human body; wherein H_topThe distance from the neck to the central point of the connecting line of the left hip and the right hip; h_bottomThe mean value of the lengths of the left leg and the right leg; h_LLeft leg length; h_RIs the right leg length; x is the number of_neck、y_neckAnd z_neckRespectively an x-axis coordinate value, a y-axis coordinate value and a z-axis coordinate value of the neck in a three-dimensional coordinate system; x is the number of_l-hip、y_l-hipAnd z_l-hipRespectively is the left hipX-axis coordinate value, y-axis coordinate value and z-axis coordinate value in the dimensional coordinate system; x is a radical of a fluorine atom_r-hip、y_r-hipAnd z_r-hipRespectively an x-axis coordinate value, a y-axis coordinate value and a z-axis coordinate value of the right hip in a three-dimensional coordinate system; x is the number of_l-knee、y_l-kneeAnd z_l-kneeRespectively an x-axis coordinate value, a y-axis coordinate value and a z-axis coordinate value of the left knee in a three-dimensional coordinate system; x is the number of_r-knee、y_r-kneeAnd z_r-kneeRespectively an x-axis coordinate value, a y-axis coordinate value and a z-axis coordinate value of the right knee in a three-dimensional coordinate system; x is the number of_l-ank、y_l-ankAnd z_l-ankRespectively an x-axis coordinate value, a y-axis coordinate value and a z-axis coordinate value of the left ankle in a three-dimensional coordinate system; x is a radical of a fluorine atom_r-ank、y_r-ankAnd z_r-ankRespectively an x-axis coordinate value, a y-axis coordinate value and a z-axis coordinate value of the right ankle in a three-dimensional coordinate system; x is a radical of a fluorine atom_l-knee、y_l-kneeAnd z_l-kneeRespectively an x-axis coordinate value, a y-axis coordinate value and a z-axis coordinate value of the left knee in a three-dimensional coordinate system; x is the number of_r-knee、y_r-kneeAnd z_r-kneeRespectively an x-axis coordinate value, a y-axis coordinate value and a z-axis coordinate value of the right knee in a three-dimensional coordinate system;

according to the formula:

H_p＝max[(z_neck-z_l-ank),(z_neck-z_r-ank)]

according to the formula:

obtaining the included angle theta between the connecting line of the neck part and the midpoints of the left and right buttocks and the horizontal direction_topAnd the upper half body is used as a horizontal inclination angle; wherein tan (·) is a tangent function;

according to the formula:

obtaining the mean value theta of the included angle between the connecting line of the left hip and the left ankle and the horizontal direction and the included angle between the connecting line of the right hip and the right ankle and the horizontal direction_bottomAnd the lower half body is taken as the horizontal inclination angle;

according to the formula:

obtaining the mean value theta of the included angle between the connecting line from the left hip to the left knee and the connecting line from the left knee to the left ankle and the included angle between the connecting line from the right hip to the right knee and the connecting line from the right knee to the right ankle_thigh-calfAnd the thigh and the shank are used as a clampAn angle;

according to the formula:

according to the formula:

namely the human body posture characteristic parameters comprise the height H of the human body and the current height H_pUpper body and horizontal inclination angle theta_topLower body and horizontal inclination angle theta_bottomThe included angle theta between thigh and shank_thigh-calfUpper half and thigh angle theta_top-thighAnd the center coordinates (x) of the human body_g,y_g,z_g)。

5. The depth camera surveillance-based millimeter wave radar attitude recognition method according to claim 4, wherein the first depth learning model in step S2-3 comprises an input layer, a hidden layer and an output layer connected in sequence; wherein:

number of nodes N of hidden layer_hiddenComprises the following steps:

6. The millimeter wave radar attitude identification method based on depth camera surveillance as claimed in claim 1, wherein the specific method for extracting the feature parameters at each moment in the second data set and the fourth data set is the same as that of the first data set and the second data set, and is characterized in that:

according to the formula:

x_max＝max{R_i cosθsinα}i∈[1,m]

x_min＝min{R_i cosθsinα}i∈[1,m]

y_max＝max{R_i cosθcosα}i∈[1,m]

y_min＝min{R_i cosθcosα}i∈[1,m]

z_max＝max{R_i sinθ}i∈[1,m]

z_min＝min{R_i sinθ}i∈[1,m]

obtaining the maximum value x in the x direction_maxThe minimum value x in the x direction_minY-direction maximum value y_maxY-direction minimum value y_minZ-direction maximum value z_maxAnd z-direction minimum z_min(ii) a Wherein max {. is a function of taking the maximum value; min {. is a function for taking the minimum value; r is_iThe distance parameter is the ith distance parameter in the second data set or the fourth data set, and m is the total number of the distance parameters in the second data set or the fourth data set; cos isA cosine function; sin is a sine function; θ is the pitch angle in the second data set or the fourth data set; α is the azimuth in the second data set or the fourth data set;

according to the formula:

according to the formula:

obtaining target center coordinates

Namely, the characteristic parameter corresponding to the data collected by the millimeter wave radar comprises the maximum value x in the x direction_maxMinimum value in x directionx_minY-direction maximum value y_maxMinimum value y in y-direction_minAnd a maximum value z in the z direction_maxMinimum value z in z direction_minX direction velocity V_xY-direction velocity V_yZ-direction velocity V_zAnd target center coordinates

7. The millimeter wave radar attitude identification method based on depth camera supervision according to claim 6, wherein the second depth learning model in step S4 includes a first convolution layer, a first ReLU active layer, a first dropout layer, a second convolution layer, a second ReLU active layer, a second dropout layer, a first fully-connected layer, a second fully-connected layer and a softmax layer which are connected in sequence; wherein:

wherein f is_conv(k) An output vector representing the 1-D convolution layer when the number of sliding steps is k; x (j) represents the j-th data in the input; n is the input data length; w (k) represents a convolution kernel for a sliding step number of k;

wherein a is_gThe g-th input signal representing the softmax layer; y is_gRepresents the output of the g output neuron of the softmax layer; e is a constant; a is_qIs the qth output signal, i.e., the qth gesture.

8. The depth camera surveillance-based millimeter wave radar gesture recognition method of claim 1, wherein the threshold in step S7 is 0.9.