CN111938670A - Depression identification method and system based on human skeleton kinematics characteristic information - Google Patents

Depression identification method and system based on human skeleton kinematics characteristic information Download PDF

Info

Publication number
CN111938670A
CN111938670A CN202010608130.9A CN202010608130A CN111938670A CN 111938670 A CN111938670 A CN 111938670A CN 202010608130 A CN202010608130 A CN 202010608130A CN 111938670 A CN111938670 A CN 111938670A
Authority
CN
China
Prior art keywords
data
depression
neural network
human skeleton
kinematic
Prior art date
Legal status (The legal status is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the status listed.)
Granted
Application number
CN202010608130.9A
Other languages
Chinese (zh)
Other versions
CN111938670B (en
Inventor
王庆祥
李文涛
Current Assignee (The listed assignees may be inaccurate. Google has not performed a legal analysis and makes no representation or warranty as to the accuracy of the list.)
Qilu University of Technology
Original Assignee
Qilu University of Technology
Priority date (The priority date is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the date listed.)
Filing date
Publication date
Application filed by Qilu University of Technology filed Critical Qilu University of Technology
Priority to CN202010608130.9A priority Critical patent/CN111938670B/en
Publication of CN111938670A publication Critical patent/CN111938670A/en
Application granted granted Critical
Publication of CN111938670B publication Critical patent/CN111938670B/en
Active legal-status Critical Current
Anticipated expiration legal-status Critical

Links

Images

Classifications

    • AHUMAN NECESSITIES
    • A61MEDICAL OR VETERINARY SCIENCE; HYGIENE
    • A61BDIAGNOSIS; SURGERY; IDENTIFICATION
    • A61B5/00Measuring for diagnostic purposes; Identification of persons
    • A61B5/16Devices for psychotechnics; Testing reaction times ; Devices for evaluating the psychological state
    • A61B5/165Evaluating the state of mind, e.g. depression, anxiety
    • AHUMAN NECESSITIES
    • A61MEDICAL OR VETERINARY SCIENCE; HYGIENE
    • A61BDIAGNOSIS; SURGERY; IDENTIFICATION
    • A61B5/00Measuring for diagnostic purposes; Identification of persons
    • A61B5/103Detecting, measuring or recording devices for testing the shape, pattern, colour, size or movement of the body or parts thereof, for diagnostic purposes
    • A61B5/11Measuring movement of the entire body or parts thereof, e.g. head or hand tremor, mobility of a limb
    • A61B5/1126Measuring movement of the entire body or parts thereof, e.g. head or hand tremor, mobility of a limb using a particular sensing technique
    • A61B5/1128Measuring movement of the entire body or parts thereof, e.g. head or hand tremor, mobility of a limb using a particular sensing technique using image analysis
    • AHUMAN NECESSITIES
    • A61MEDICAL OR VETERINARY SCIENCE; HYGIENE
    • A61BDIAGNOSIS; SURGERY; IDENTIFICATION
    • A61B5/00Measuring for diagnostic purposes; Identification of persons
    • A61B5/72Signal processing specially adapted for physiological signals or for diagnostic purposes
    • A61B5/7235Details of waveform analysis
    • A61B5/7264Classification of physiological signals or data, e.g. using neural networks, statistical classifiers, expert systems or fuzzy systems
    • A61B5/7267Classification of physiological signals or data, e.g. using neural networks, statistical classifiers, expert systems or fuzzy systems involving training the classification device

Landscapes

  • Health & Medical Sciences (AREA)
  • Life Sciences & Earth Sciences (AREA)
  • Engineering & Computer Science (AREA)
  • Physics & Mathematics (AREA)
  • Public Health (AREA)
  • General Health & Medical Sciences (AREA)
  • Molecular Biology (AREA)
  • Animal Behavior & Ethology (AREA)
  • Veterinary Medicine (AREA)
  • Psychiatry (AREA)
  • Surgery (AREA)
  • Artificial Intelligence (AREA)
  • Biophysics (AREA)
  • Pathology (AREA)
  • Biomedical Technology (AREA)
  • Heart & Thoracic Surgery (AREA)
  • Medical Informatics (AREA)
  • Physiology (AREA)
  • Computer Vision & Pattern Recognition (AREA)
  • Oral & Maxillofacial Surgery (AREA)
  • Child & Adolescent Psychology (AREA)
  • Fuzzy Systems (AREA)
  • Evolutionary Computation (AREA)
  • Nuclear Medicine, Radiotherapy & Molecular Imaging (AREA)
  • Radiology & Medical Imaging (AREA)
  • Mathematical Physics (AREA)
  • Signal Processing (AREA)
  • Dentistry (AREA)
  • Developmental Disabilities (AREA)
  • Educational Technology (AREA)
  • Hospice & Palliative Care (AREA)
  • Psychology (AREA)
  • Social Psychology (AREA)
  • Image Analysis (AREA)

Abstract

The invention discloses a depression recognition method and system based on human skeleton kinematics characteristic information, belonging to the field of image processing and mode recognition, aiming at solving the technical problem of how to establish an objective and effective index, and diagnosing depression according to the prepared index parameters, and adopting the technical scheme that: the method comprises the following specific steps: s1, collecting human skeleton kinematic characteristic data; s2, extracting human skeleton kinematic characteristic data; s3, preprocessing the characteristic data and making a data set; s4, constructing and training a depression recognition neural network model; and S5, inputting the human skeleton kinematic data to be recognized into the trained depression recognition neural network model, completing the recognition and diagnosis of depression, obtaining a prediction result and calculating the accuracy. The system comprises a data collection module, a data extraction module, a preprocessing and data set making module, a model building and training module and a result prediction and accuracy calculation module.

Description

Depression identification method and system based on human skeleton kinematics characteristic information
Technical Field
The invention relates to the field of image processing and pattern recognition, in particular to a depression recognition method and system based on human skeleton kinematics characteristic information.
Background
Depression (depression) refers to a group of affective disorders characterized clinically by marked and persistent mood depression, hypomotility, and mental and cognitive retardation. Patients with depression have serious psychological disorder and bad mood, often lack sleep, listlessness and even suicide behavior in patients with severe depression.
At present, the diagnosis of depression is still in the stage of symptom deduction, and there is no objective physicochemical examination and diagnosis index. The existing depression identification method has more defects, which mainly comprise the following steps:
firstly, clinical diagnosis measures are judged in a subjective mode, objective and effective indexes are lacked, and misdiagnosis and missed diagnosis are easily caused;
(II) identifying the reference index to be single: the existing method for identifying the depression mainly takes hormone level and various biochemical indexes in a patient as identification reference indexes, and lacks of index parameters of bioelectricity signals, so that the depression identification has certain limitation.
Therefore, how to establish an objective and effective index is a technical problem to be solved urgently at present in the diagnosis of depression prepared according to the index parameters.
Disclosure of Invention
The technical task of the invention is to provide a depression identification method and system based on human skeleton kinematic characteristic information, so as to solve the problem of how to establish objective and effective indexes and prepare depression diagnosis according to index parameters.
The technical task of the invention is realized in the following way, the method for identifying the depression based on the human skeleton kinematic characteristic information comprises the steps of extracting the human joint kinematic characteristic information according to the human skeleton characteristic information captured by Kinect equipment, carrying out data preprocessing on the spatial position data of each joint based on time dimension, constructing a data set by utilizing the preprocessed data, and carrying out training and testing of a depression identification neural network model by using the data set; the method comprises the following specific steps:
s1, collecting human skeleton kinematic characteristic data;
s2, extracting human skeleton kinematic characteristic data;
s3, preprocessing the characteristic data and making a data set;
s4, constructing and training a depression recognition neural network model;
and S5, inputting the human skeleton kinematic data to be recognized into the trained depression recognition neural network model, completing the recognition and diagnosis of depression, obtaining a prediction result and calculating the accuracy.
Preferably, the collection of the human skeleton kinematic feature data in step S1 is specifically as follows:
s101, starting a Kinect device to collect kinematic characteristic data of a human skeleton;
s102, guiding the experimental subject to enter a designated position, and playing a pre-designed stimulation action task instruction; wherein the stimulation action comprises a total of five kinematic segments, respectively: lifting and resetting a left arm, lifting and resetting a right arm, lifting and resetting two arms, turning left and resetting, and turning right and resetting; the time interval between each kinematic segment is 5s, and the time required for completing the whole stimulation action is 60 s;
s103, opening Kinect studio V2 software to complete the recording work of human skeleton kinematic data;
s104, after the data recording is finished, the psychologist evaluates the depression state of the current experimental subject, and the evaluation process records the Hamilton depression scale (24 versions) scores of the experimental subject; wherein, the experimental subjects are divided into a depression group and a non-depression group;
and S105, screening effective experimental data when the score of the depression group is more than 20 and the score of the control group is less than 8 according to the Hamilton depression scale score standard.
Preferably, the extraction of the human skeleton kinematic feature data in step S2 is specifically as follows:
s201, traversing xef recording files in the folder, acquiring file names and obtaining recording file paths;
s202, executing a script command to run SkeletonExtractor.exe in batch to open xef record files and read original record data;
s203, extracting time series data based on the xef recorded files;
s204, extracting the spatial positions (x, y, z) of the human skeleton joint points, and eliminating noise data; the method specifically comprises the following steps: for any Kinect detected target i 1,2,3 …, each human body skeleton joint point detects a spatial position (x, y, z), and for any time t, the target object i to be locked has the following relation:
Figure BDA0002561502720000021
s205, extracting quaternion ((Rx, Ry, Rz), Rw) of the spatial coordinate system of the data subjected to the noise reduction processing, and the formula is as follows:
Figure BDA0002561502720000031
each captured human skeletal joint point is composed of (x, y, z) and ((Rx, Ry, Rz), Rw), that is, each human skeletal joint point n at the time t of the kinematic segment is composed of seven dimensional data expressing the spatial position relationship:
djt=[xjt,yjt,zjt,Rxjt,Ryjt,Rzjt,Rwjt];
and for 25 human skeleton joint points captured by the Kinect, the human skeleton data extracted within the kinematic fragment time T is as follows:
Figure BDA0002561502720000032
wherein t represents time; j represents a human skeletal joint point; t is more than or equal to 1 and less than or equal to T, j is more than or equal to 1 and less than or equal to 25;
s206, obtaining the extraction result (time, the space position coordinate value based on the time dimension of each joint and quaternion) of the experimental object, and saving the extraction result as a file in the csv format by the name of the experimental object.
Preferably, the characteristic data preprocessing and data set creation in step S3 are specifically as follows:
s301, reading the extracted csv file of the experimental object;
s302, reading the time sequence and rewriting the time sequence into a csv file with the same name in another folder;
s303, reading data of the csv file except the time sequence, and preprocessing the adopted standardized data, wherein the method specifically comprises the following steps: for human skeleton data time series x1,x2,...,xtThe linear transformation of the raw data using a dispersion normalization method, with the result mapped to [0,1]Interval, the formula is as follows:
Figure BDA0002561502720000033
where n denotes the sequence length, the new sequence y1,y2,...,yn∈[0,1]And is dimensionless;
s304, writing the standardized data into the csv file in the step S302;
s305, labeling the experimental subjects of a depressed group and a non-depressed group respectively, labeling the depressed group as 0, and labeling the non-depressed group as 1;
s306, making a data set through a new data sequence preprocessed by a pythonnnumpy library, and storing data extracted by all experimental objects in a binary data format as a npy format file;
and S307, adopting a random division mode for the data set, setting 70% of the data set as a training set, and using 30% of the data set as a test set.
Preferably, the construction and training of the depression recognition neural network model in step S4 are as follows:
s401, inputting training set data into a time Convolution Neural Network (Temporal Convolution Neural Network) to obtain an output result of the time Convolution Neural Network; the time convolution neural network comprises two time hole convolution residual blocks which are sequentially connected, each layer of each residual block is a one-dimensional hole convolution network, the random discarding rate is 0.5, the activation function is ReLU, and the output of each layer is directly used as the input of the next layer;
s402, inputting the obtained time convolution neural Network output result as characteristic information into a bottleneck convolution neural Network (BottleneckConvolationneural Network) to obtain an output result of the bottleneck convolution neural Network; wherein the bottleneck convolutional neural network comprises a bottleneck network residual block; the bottleneck network residual block comprises three layers of one-dimensional convolution networks, and the kernel function size K of the one-dimensional convolution networks is 1,3 and 1;
and S403, converting the output result of the bottleneck convolutional neural network into a one-dimensional vector, connecting the one-dimensional vector and the one-dimensional vector, inputting the one-dimensional vector into a full connection layer, and then identifying the depression through a softmax classifier.
Preferably, the identification and diagnosis of the depression are completed in step S5, and the prediction result is obtained and the calculation accuracy is as follows:
s501, identifying and diagnosing the depression by using softmax as a classifier, wherein the formula is as follows:
Figure BDA0002561502720000041
wherein S isiRepresenting a softmax predicted value of the ith class; i and j are category serial numbers;
s502, inputting the human skeleton kinematic feature data into a depression recognition neural network model, wherein when depression category prediction is carried out, each category (depression and non-depression) corresponds to a calculation value smaller than 1, the sum is 1, and the category corresponding to the maximum calculation value is a prediction category;
s503, comparing the prediction categories with the real categories, calculating the proportion of the number of the depression categories of the correct prediction categories in the training data set to the total number of the data, and outputting the accuracy of the depression recognition neural network model;
s504, calculating the Loss of the prediction error category by adopting a Loss function, setting the Loss function of the depression recognition neural network model as a cross entropy function Loss, and adopting the following formula:
Figure BDA0002561502720000051
wherein M represents the number of categories; c represents a category number; y iscRepresenting a real tag; p is a radical ofcIndicating the output of softmax.
A depression recognition system based on human skeleton kinematics characteristic information comprises,
the data collection module is used for collecting human skeleton kinematic characteristic data;
the data extraction module is used for extracting human skeleton kinematic characteristic data;
the preprocessing and data set making module is used for preprocessing the characteristic data and making a data set;
the model building and training module is used for building and training a depression recognition neural network model;
and the result prediction and accuracy rate calculation module is used for inputting the human skeleton kinematic data to be recognized into the trained depression recognition neural network model to complete the recognition diagnosis of the depression, obtain the prediction result and calculate the accuracy rate.
Preferably, the data collection module includes,
the device starting and data collecting submodule is used for starting the Kinect device to collect the kinematic characteristic data of the human skeleton;
the instruction playing submodule is used for guiding the experimental subject to enter a designated position and playing a pre-designed stimulation action task instruction; wherein the stimulation action comprises a total of five kinematic segments, respectively: lifting and resetting a left arm, lifting and resetting a right arm, lifting and resetting two arms, turning left and resetting, and turning right and resetting; the time interval between each kinematic segment is 5s, and the time required for completing the whole stimulation action is 60 s;
the data recording submodule is used for opening Kinect studio V2 software to finish the recording work of human skeleton kinematic data;
a depression state evaluation submodule for the psychiatrist to evaluate the depression state of the current subject after completing the data recording, wherein the evaluation process records the Hamilton depression scale (24 edition) scores of the subject; wherein, the experimental subjects are divided into a depression group and a non-depression group;
the screening submodule is used for screening effective experimental data when the score of a depression group is larger than 20 and the score of a control group is smaller than 8 according to the Hamilton depression scale scoring standard;
the data extraction module comprises a data extraction module and a data extraction module,
the file path recording submodule is used for traversing xef recorded files in the folder, acquiring file names and obtaining recording file paths;
an original record data reading submodule, configured to execute a script command to run skeletonextractor in batch to open.exe, xef record a file, and read original record data;
a time-series data extraction sub-module for extracting time-series data based on the xef record file;
the noise data removing submodule is used for extracting the spatial positions (x, y, z) of the human skeleton joint points and removing the noise data; the method specifically comprises the following steps: for any Kinect detected target i 1,2,3 …, each human body skeleton joint point detects a spatial position (x, y, z), and for any time t, the target object i to be locked has the following relation:
Figure BDA0002561502720000061
and the quaternion extraction submodule is used for extracting quaternion ((Rx, Ry, Rz), Rw) of a space coordinate system of the data subjected to the noise reduction processing, and the formula is as follows:
Figure BDA0002561502720000062
each captured human skeletal joint point is composed of (x, y, z) and ((Rx, Ry, Rz), Rw), that is, each human skeletal joint point n at the time t of the kinematic segment is composed of seven dimensional data expressing the spatial position relationship:
djt=[xjt,yjt,zjt,Rxjt,Ryjt,Rzjt,Rwjt];
and for 25 human skeleton joint points captured by the Kinect, the human skeleton data extracted within the kinematic fragment time T is as follows:
Figure BDA0002561502720000063
wherein t represents time; j represents a human skeletal joint point; t is more than or equal to 1 and less than or equal to T, j is more than or equal to 1 and less than or equal to 25;
the extraction result obtaining submodule is used for obtaining the extraction result (time, the space position coordinate value and the quaternion of each joint based on the time dimension) of the experimental object and saving the extraction result as a file in the csv format by the name of the experimental object;
the preprocessing and data set creation module includes,
the file reading submodule is used for reading the extracted csv file of the experimental object;
the time sequence reading submodule is used for reading the time sequence and rewriting the time sequence into a csv file with the same name in another folder;
the standardized data preprocessing submodule is used for reading the data of the csv file except the time sequence and preprocessing the adopted standardized data, and specifically comprises the following steps: for human skeleton data time series x1,x2,...,xtThe linear transformation of the raw data using a dispersion normalization method, with the result mapped to [0,1]Interval, the formula is as follows:
Figure BDA0002561502720000071
where n denotes the sequence length, the new sequence y1,y2,...,yn∈[0,1]And is dimensionless;
the file writing submodule is used for writing the standardized data into the csv file;
a subject labeling submodule for labeling the experimental subjects of the depressed group and the non-depressed group, respectively, labeling the depressed group as 0 and labeling the non-depressed group as 1;
the data set making submodule is used for making a data set through a new data sequence preprocessed by a pythonnnumpy library, and the data extracted by all experimental objects are stored as a. npy format file in a binary data format;
the data set dividing submodule is used for adopting a random dividing mode for the data set, setting 70% of the data set as a training set, and using 30% of the data set as a testing set;
the model building and training module comprises a model building and training module,
the time Convolution Neural Network result output submodule is used for inputting the training set data into a time Convolution Neural Network (Temporal Convolution Neural Network) to obtain an output result of the time Convolution Neural Network; the time convolution neural network comprises two time hole convolution residual blocks which are sequentially connected, each layer of each residual block is a one-dimensional hole convolution network, the random discarding rate is 0.5, the activation function is ReLU, and the output of each layer is directly used as the input of the next layer;
a bottleneck convolutional neural Network result output submodule, configured to input the obtained time convolutional neural Network output result as feature information into a bottleneck convolutional neural Network (bottleneckconvolutional convolutional neural Network), so as to obtain an output result of the bottleneck convolutional neural Network; wherein the bottleneck convolutional neural network comprises a bottleneck network residual block; the bottleneck network residual block comprises three layers of one-dimensional convolution networks, and the kernel function size K of the one-dimensional convolution networks is 1,3 and 1;
the depression recognition submodule is used for converting an output result of the bottleneck convolutional neural network into a one-dimensional vector, connecting the one-dimensional vector and the one-dimensional vector, inputting the one-dimensional vector into the full connection layer, and then recognizing the depression through a softmax classifier;
the result prediction and accuracy calculation module comprises,
the identification and diagnosis submodule is used for identifying and diagnosing the depression by adopting softmax as a classifier, and the formula is as follows:
Figure BDA0002561502720000081
wherein S isiSoftmax pre-representing class iMeasuring; i and j are category serial numbers;
the prediction class acquisition submodule is used for inputting the human skeleton kinematic feature data into the depression recognition neural network model, when depression class prediction is carried out, each class (depression and non-depression) corresponds to a calculation value smaller than 1, the sum is 1, and the class corresponding to the maximum calculation value is a prediction class;
the comparison submodule is used for comparing the prediction category with the real category, calculating the proportion of the number of depression categories of the correct prediction category in the training data set to the total number of data, and outputting the accuracy of the depression recognition neural network model;
the Loss calculation submodule is used for calculating the Loss of the prediction error category by adopting a Loss function, setting the Loss function of the depression recognition neural network model as a cross entropy function Loss, and the formula is as follows:
Figure BDA0002561502720000082
wherein M represents the number of categories; c represents a category number; y iscRepresenting a real tag; p is a radical ofcIndicating the output of softmax.
An electronic device, comprising: a memory and a processor;
wherein the memory stores computer-executable instructions;
the one processor executes the computer-executable instructions stored by the memory, so that the one processor executes the depression identification method based on the human skeleton kinematic feature information.
A computer readable storage medium, wherein computer executable instructions are stored in the computer readable storage medium, and when a processor executes the computer, the depression identification method based on human skeleton kinematic feature information is realized.
The depression identification method and system based on the human skeleton kinematics characteristic information have the following advantages:
according to the clinical manifestations of hypokinesia, thought and cognitive function retardation of depression patients, a high-efficiency depression identification method is constructed by designing specific instruction actions and using human skeleton kinematic characteristic information acquired by Kinect equipment, and a convenient and reliable screening means is provided for early identification and auxiliary clinical diagnosis and treatment of depression;
the method comprises the following steps of (II) completing capture of human body kinematic characteristic skeleton information through Kinect equipment, combining with data extraction and preprocessing technologies, and constructing a depression recognition model based on deep learning related theories to recognize depression, so that the accuracy of depression recognition is improved;
the invention is based on the time convolution bottleneck network recognition method, construct human skeleton kinematic characteristic learning network model, withdraw the key kinematic characteristic information, has avoided the interference that the irrelevant characteristic discerns depression effectively, improve the reliability that the depression discerns, the test set recognition rate accuracy rate to depression of the invention is up to more than 75%;
the human skeleton recognition task is completed by using Kinect equipment, and depression recognition work is performed on the basis of human body kinematic characteristic information of time dimension and space position; the stimulation task provided by the invention is originally designed according to the suggestion of psychiatrists, and a noise data removing and kinematic segment dividing and data extracting method is provided according to the action of the set stimulation task and by combining with experimental design; meanwhile, the identification of the depression by the time convolution and bottleneck neural network model more suitable for the method is improved based on the original time convolution neural network model.
Drawings
The invention is further described below with reference to the accompanying drawings.
FIG. 1 is a flow chart of a depression identification method based on human skeleton kinematic feature information;
FIG. 2 is a diagram of a human skeleton action key action frame;
FIG. 3 is a schematic diagram of an experimental environment layout of a Kinect device;
FIG. 4 is a schematic diagram of a neural network model structure;
fig. 5 is a schematic diagram of a confusion matrix of depression recognition results.
Detailed Description
The depression identification method and system based on human skeleton kinematics characteristic information of the invention are explained in detail with reference to the drawings and the specific embodiments in the specification.
Example 1:
as shown in the attached drawing 1, the method for identifying depression based on human skeleton kinematic feature information extracts 25 joint kinematic feature information of a human body according to the human skeleton feature information captured by a Kinect device, performs data preprocessing on spatial position data of each joint based on time dimension, constructs a data set by using the preprocessed data, and performs training and testing of a depression identification neural network model by using the data set; the method comprises the following specific steps:
s1, collecting human skeleton kinematic characteristic data; the method comprises the following specific steps:
s101, starting a Kinect device to collect kinematic characteristic data of a human skeleton;
s102, guiding the experimental subject to enter a designated position, and playing a pre-designed stimulation action task instruction, as shown in the attached figure 3;
and (3) stimulation action design: the invention combines the actual physical state of the depression patients according to the suggestion of psychiatrists, and simultaneously reduces the difference of the perception and action abilities of the depression patients caused by non-critical factors such as gender, occupation and the like; therefore, a set of stimulation task actions which are simple, relatively obvious in motion amplitude and convenient to capture by the Kinect is designed.
As shown in fig. 2, the stimulation action comprises a total of five kinematic segments, respectively: lifting and resetting a left arm, lifting and resetting a right arm, lifting and resetting two arms, turning left and resetting, and turning right and resetting; the time interval between each kinematic segment is 5s, and the time required for completing the whole stimulation action is 60 s;
s103, opening Kinect studio V2 software to complete the recording work of human skeleton kinematic data;
s104, after the data recording is finished, the psychologist evaluates the depression state of the current experimental subject, the evaluation process records the Hamilton depression scale (24 versions) scores of the experimental subject, and reference basis is provided for screening depression group data and non-depression group data and labeling data of subsequent experiments; wherein, the experimental subjects are divided into a depression group and a non-depression group;
and S105, screening effective experimental data when the score of the depression group is more than 20 and the score of the control group is less than 8 according to the Hamilton depression scale score standard.
S2, extracting human skeleton kinematic characteristic data; the method comprises the following specific steps:
s201, traversing xef recording files in the folder, acquiring file names and obtaining recording file paths;
s202, executing a script command to run SkeletonExtractor.exe in batch to open xef record files and read original record data; among them, the. xef record file is based on the. net core3.0 framework, which has developed for Kinect V2.0 records;
s203, extracting time series data based on the xef recorded files;
s204, extracting the spatial positions (x, y, z) of the human skeleton joint points, and eliminating noise data; since there may be non-target objects appearing in the video recording environment during the process of recording the target object by the Kinect V2, noise data such as non-experimental objects need to be eliminated and captured possibly. Kinect can detect out 6 human skeleton targets at most simultaneously, and according to experimental environment design, target object and amazing action home range all are in fixed detection area, specifically are: for any Kinect detected target i 1,2,3 …, each human body skeleton joint point detects a spatial position (x, y, z), and for any time t, the target object i to be locked has the following relation:
Figure BDA0002561502720000101
s205, extracting quaternion ((Rx, Ry, Rz), Rw) of the spatial coordinate system of the data subjected to the noise reduction processing, and the formula is as follows:
Figure BDA0002561502720000111
each captured human skeletal joint point is composed of (x, y, z) and ((Rx, Ry, Rz), Rw), that is, each human skeletal joint point n at the time t of the kinematic segment is composed of seven dimensional data expressing the spatial position relationship:
djt=[xjt,yjt,zjt,Rxjt,Ryjt,Rzjt,Rwjt];
and for 25 human skeleton joint points captured by the Kinect, the human skeleton data extracted within the kinematic fragment time T is as follows:
Figure BDA0002561502720000112
wherein t represents time; j represents a human skeletal joint point; t is more than or equal to 1 and less than or equal to T, j is more than or equal to 1 and less than or equal to 25;
s206, obtaining the extraction result (time, the space position coordinate value based on the time dimension of each joint and quaternion) of the experimental object, and saving the extraction result as a file in the csv format by the name of the experimental object.
S3, preprocessing the characteristic data and making a data set; the method comprises the following specific steps:
s301, reading the extracted csv file of the experimental object;
s302, reading the time sequence and rewriting the time sequence into a csv file with the same name in another folder;
s303, reading data of the csv file except the time sequence, and preprocessing the adopted standardized data, wherein the method specifically comprises the following steps: for human skeleton data time series x1,x2,...,xtThe linear transformation of the raw data using a dispersion normalization method, with the result mapped to [0,1]Interval, the formula is as follows:
Figure BDA0002561502720000113
where n denotes the sequence length, the new sequence y1,y2,...,yn∈[0,1]And is dimensionless;
and the depressed group data and the non-depressed group data are respectively positioned in the two folders, the two folders are traversed, and the csv files in the folders are read.
S304, writing the standardized data into the csv file in the step S302;
s305, labeling the experimental subjects of a depressed group and a non-depressed group respectively, labeling the depressed group as 0, and labeling the non-depressed group as 1;
s306, in order to improve the loading efficiency of experimental data, data set production is carried out through a new data sequence preprocessed by a pythonnnumpy library, and data extracted by all experimental objects are stored as a npy format file in a binary data format;
and S307, adopting a random division mode for the data set, setting 70% of the data set as a training set, and using 30% of the data set as a test set.
S4, constructing and training a depression recognition neural network model; as shown in fig. 4, the following is detailed:
s401, inputting training set data into a time Convolution Neural Network (Temporal Convolution Neural Network) to obtain an output result of the time Convolution Neural Network; the time convolution neural network comprises two time hole convolution residual blocks which are sequentially connected, each layer of each residual block is a one-dimensional hole convolution network, the random discarding rate is 0.5, the activation function is ReLU, and the output of each layer is directly used as the input of the next layer;
s402, inputting the obtained time convolution neural Network output result as characteristic information into a bottleneck convolution neural Network (BottleneckConvolationneural Network) to obtain an output result of the bottleneck convolution neural Network; wherein the bottleneck convolutional neural network comprises a bottleneck network residual block; the bottleneck network residual block comprises three layers of one-dimensional convolution networks, and the kernel function size K of the one-dimensional convolution networks is 1,3 and 1;
and S403, converting the output result of the bottleneck convolutional neural network into a one-dimensional vector, connecting the one-dimensional vector and the one-dimensional vector, inputting the one-dimensional vector into a full connection layer, and then identifying the depression through a softmax classifier.
The human skeleton kinematics feature learning network comprises a time convolution network, a bottleneck network, a full connection layer and a softmax classifier. Based on the existing time convolution neural network, the advantages of various networks are fully utilized, a learning network with efficient depression recognition is constructed, the kinematic skeleton characteristics of a human body are extracted, the characteristics output by the network are aggregated and then input into a full connection layer for characteristic fusion, and depression recognition is carried out based on a classical softmax classifier after the full connection layer.
For the kinematics skeleton characteristic learning, the invention designs a standard convolution filter to fix time understanding, and can ideally model the whole kinematics skeleton characteristic sequence by modeling a proper time window. Important kinematic characteristics are learned, and the scheme uses extended convolution to realize exponential time sequence scale on different layers of the network. For input event time index length and filter {0,1, … n } for some filters, the expanding convolution operation of the elements of the skeleton sequence is:
Figure BDA0002561502720000121
wherein d represents an expansion coefficient; n denotes an index length.
The bottleneck network adopts a one-dimensional convolution bottle network structure, the size of the three-layer kernel filter is matched to be K which is 1,3 and 1, and the compatibility of the length between layers is ensured. The bottleneck feature may be generated from a multi-layer perceptron, with an internal layer having a small number of hidden elements, relative to the relative sizes of other layers. Once the inner layer estimate and the fully connected layer estimate are available, the classifier block can track the exact source of relevant information through compressed sensing. Due to the forward augmented time convolution residual block feed, the bottleneck structure can better identify the relevant body event index sequence and key body joint movement mechanics characteristics.
S5, inputting the human skeleton kinematic data to be recognized into the trained depression recognition neural network model to complete the recognition and diagnosis of depression, obtaining a prediction result and calculating the accuracy; the method comprises the following specific steps:
s501, identifying and diagnosing the depression by using softmax as a classifier, wherein the formula is as follows:
Figure BDA0002561502720000131
wherein S isiRepresenting a softmax predicted value of the ith class; i and j are category serial numbers; the invention provides a human skeleton kinematic feature sequence model with efficient calculation on the basis of a given group of sequence data.
S502, inputting the human skeleton kinematic feature data into a depression recognition neural network model, wherein when depression category prediction is carried out, each category (depression and non-depression) corresponds to a calculation value smaller than 1, the sum is 1, and the category corresponding to the maximum calculation value is a prediction category;
s503, comparing the prediction categories with the real categories, calculating the proportion of the number of the depression categories of the correct prediction categories in the training data set to the total number of the data, and outputting the accuracy of the depression recognition neural network model;
s504, calculating the Loss of the prediction error category by adopting a Loss function, setting the Loss function of the depression recognition neural network model as a cross entropy function Loss, and adopting the following formula:
Figure BDA0002561502720000132
wherein M represents the number of categories; c represents a category number; y iscRepresenting a real tag; p is a radical ofcIndicating the output of softmax.
For example, as shown in fig. 5, in the confusion matrix, the total number of samples contained in the test set is 62, 23 samples in the depression group, and 39 samples in the normal group; wherein the number of predicted depressed groups is 16 and the number of predicted normal groups is 7; the number of normal groups predicted to be normal groups was 31, and the number of predicted depressed groups was 8; the confusion matrix shows 75.8% accuracy as (31+ 16)/62.
Example 2:
the invention relates to a depression recognition system based on human skeleton kinematics characteristic information, which comprises,
the data collection module is used for collecting human skeleton kinematic characteristic data; the data collection module comprises a data acquisition module,
the device starting and data collecting submodule is used for starting the Kinect device to collect the kinematic characteristic data of the human skeleton;
the instruction playing submodule is used for guiding the experimental subject to enter a designated position and playing a pre-designed stimulation action task instruction; wherein the stimulation action comprises a total of five kinematic segments, respectively: lifting and resetting a left arm, lifting and resetting a right arm, lifting and resetting two arms, turning left and resetting, and turning right and resetting; the time interval between each kinematic segment is 5s, and the time required for completing the whole stimulation action is 60 s;
the data recording submodule is used for opening Kinect studio V2 software to finish the recording work of human skeleton kinematic data;
a depression state evaluation submodule for the psychiatrist to evaluate the depression state of the current subject after completing the data recording, wherein the evaluation process records the Hamilton depression scale (24 edition) scores of the subject; wherein, the experimental subjects are divided into a depression group and a non-depression group;
and the screening submodule is used for screening effective experimental data when the score of the depression group is more than 20 and the score of the control group is less than 8 according to the Hamilton depression scale scoring standard.
The data extraction module is used for extracting human skeleton kinematic characteristic data; the data extraction module comprises a data extraction module,
the file path recording submodule is used for traversing xef recorded files in the folder, acquiring file names and obtaining recording file paths;
an original record data reading submodule, configured to execute a script command to run skeletonextractor in batch to open.exe, xef record a file, and read original record data;
a time-series data extraction sub-module for extracting time-series data based on the xef record file;
the noise data removing submodule is used for extracting the spatial positions (x, y, z) of the human skeleton joint points and removing the noise data; the method specifically comprises the following steps: for any Kinect detected target i 1,2,3 …, each human body skeleton joint point detects a spatial position (x, y, z), and for any time t, the target object i to be locked has the following relation:
Figure BDA0002561502720000141
and the quaternion extraction submodule is used for extracting quaternion ((Rx, Ry, Rz), Rw) of a space coordinate system of the data subjected to the noise reduction processing, and the formula is as follows:
Figure BDA0002561502720000151
each captured human skeletal joint point is composed of (x, y, z) and ((Rx, Ry, Rz), Rw), that is, each human skeletal joint point n at the time t of the kinematic segment is composed of seven dimensional data expressing the spatial position relationship:
djt=[xjt,yjt,zjt,Rxjt,Ryjt,Rzjt,Rwjt];
and for 25 human skeleton joint points captured by the Kinect, the human skeleton data extracted within the kinematic fragment time T is as follows:
Figure BDA0002561502720000152
wherein t represents time; j represents a human skeletal joint point; t is more than or equal to 1 and less than or equal to T, j is more than or equal to 1 and less than or equal to 25;
and the extraction result obtaining submodule is used for obtaining the extraction result (time, the space position coordinate value based on the time dimension of each joint and the quaternion) of the experimental object and saving the extraction result as a file in the csv format by the name of the experimental object.
The preprocessing and data set making module is used for preprocessing the characteristic data and making a data set; the preprocessing and data set generation module comprises a preprocessing module,
the file reading submodule is used for reading the extracted csv file of the experimental object;
the time sequence reading submodule is used for reading the time sequence and rewriting the time sequence into a csv file with the same name in another folder;
the standardized data preprocessing submodule is used for reading the data of the csv file except the time sequence and preprocessing the adopted standardized data, and specifically comprises the following steps: for human skeleton data time series x1,x2,...,xtThe linear transformation of the raw data using a dispersion normalization method, with the result mapped to [0,1]Interval, the formula is as follows:
Figure BDA0002561502720000161
where n denotes the sequence length, the new sequence y1,y2,...,yn∈[0,1]And is dimensionless;
the file writing submodule is used for writing the standardized data into the csv file;
a subject labeling submodule for labeling the experimental subjects of the depressed group and the non-depressed group, respectively, labeling the depressed group as 0 and labeling the non-depressed group as 1;
the data set making submodule is used for making a data set through a new data sequence preprocessed by a pythonnnumpy library, and the data extracted by all experimental objects are stored as a. npy format file in a binary data format;
and the data set division submodule is used for adopting a random division mode on the data set, setting 70% of the data set as a training set, and using 30% of the data set as a test set.
The model building and training module is used for building and training a depression recognition neural network model; the model building and training module comprises a model building and training module,
the time Convolution Neural Network result output submodule is used for inputting the training set data into a time Convolution Neural Network (Temporal Convolution Neural Network) to obtain an output result of the time Convolution Neural Network; the time convolution neural network comprises two time hole convolution residual blocks which are sequentially connected, each layer of each residual block is a one-dimensional hole convolution network, the random discarding rate is 0.5, the activation function is ReLU, and the output of each layer is directly used as the input of the next layer;
a bottleneck convolutional neural Network result output submodule, configured to input the obtained time convolutional neural Network output result as feature information into a bottleneck convolutional neural Network (bottleneckconvolutional convolutional neural Network), so as to obtain an output result of the bottleneck convolutional neural Network; wherein the bottleneck convolutional neural network comprises a bottleneck network residual block; the bottleneck network residual block comprises three layers of one-dimensional convolution networks, and the kernel function size K of the one-dimensional convolution networks is 1,3 and 1;
and the depression identification submodule is used for converting the output result of the bottleneck convolutional neural network into a one-dimensional vector, connecting the one-dimensional vector and the one-dimensional vector, inputting the one-dimensional vector into the full connection layer, and then identifying the depression through a softmax classifier.
The result prediction and accuracy rate calculation module is used for inputting the human skeleton kinematic data to be recognized into the trained depression recognition neural network model to complete the recognition diagnosis of the depression, obtain the prediction result and calculate the accuracy rate; the result prediction and accuracy calculation module comprises,
the identification and diagnosis submodule is used for identifying and diagnosing the depression by adopting softmax as a classifier, and the formula is as follows:
Figure BDA0002561502720000171
wherein S isiRepresenting a softmax predicted value of the ith class; i and j are category serial numbers;
the prediction class acquisition submodule is used for inputting the human skeleton kinematic feature data into the depression recognition neural network model, when depression class prediction is carried out, each class (depression and non-depression) corresponds to a calculation value smaller than 1, the sum is 1, and the class corresponding to the maximum calculation value is a prediction class;
the comparison submodule is used for comparing the prediction category with the real category, calculating the proportion of the number of depression categories of the correct prediction category in the training data set to the total number of data, and outputting the accuracy of the depression recognition neural network model;
the Loss calculation submodule is used for calculating the Loss of the prediction error category by adopting a Loss function, setting the Loss function of the depression recognition neural network model as a cross entropy function Loss, and the formula is as follows:
Figure BDA0002561502720000172
wherein M represents the number of categories; c represents a category number; y iscRepresenting a real tag; p is a radical ofcIndicating the output of softmax.
Example 3:
an embodiment of the present invention further provides an electronic device, including: a memory and at least one processor;
wherein the memory stores computer-executable instructions;
the at least one processor executing the computer-executable instructions stored by the memory causes the at least one processor to perform a depression identification method based on human skeletal kinematics characteristic information as in embodiment 1.
Example 4:
the embodiment of the invention also provides a computer-readable storage medium, wherein a plurality of instructions are stored, and the instructions are loaded by a processor, so that the processor executes the depression identification method based on the human skeleton kinematic feature information in any embodiment of the invention. Specifically, a system or an apparatus equipped with a storage medium on which software program codes that realize the functions of any of the above-described embodiments are stored may be provided, and a computer (or a CPU or MPU) of the system or the apparatus is caused to read out and execute the program codes stored in the storage medium.
In this case, the program code itself read from the storage medium can realize the functions of any of the above-described embodiments, and thus the program code and the storage medium storing the program code constitute a part of the present invention.
Examples of the storage medium for supplying the program code include a floppy disk, a hard disk, a magneto-optical disk, an optical disk (e.g., CD-ROM, CD-R, CD-RW, DVD-ROM, DVD-R depression recognition method based on human skeleton kinematics characteristic information and system M, DVD-RW, DVD + RW), a magnetic tape, a nonvolatile memory card, and a ROM. Alternatively, the program code may be downloaded from a server computer via a communications network.
Further, it should be clear that the functions of any one of the above-described embodiments may be implemented not only by executing the program code read out by the computer, but also by causing an operating system or the like operating on the computer to perform a part or all of the actual operations based on instructions of the program code.
Further, it is to be understood that the program code read out from the storage medium is written to a memory provided in an expansion board inserted into the computer or to a memory provided in an expansion unit connected to the computer, and then causes a CPU or the like mounted on the expansion board or the expansion unit to perform part or all of the actual operations based on instructions of the program code, thereby realizing the functions of any of the above-described embodiments.
Finally, it should be noted that: the above embodiments are only used to illustrate the technical solution of the present invention, and not to limit the same; while the invention has been described in detail and with reference to the foregoing embodiments, it will be understood by those skilled in the art that: the technical solutions described in the foregoing embodiments may still be modified, or some or all of the technical features may be equivalently replaced; and the modifications or the substitutions do not make the essence of the corresponding technical solutions depart from the scope of the technical solutions of the embodiments of the present invention.

Claims (10)

1. A depression recognition method based on human skeleton kinematic feature information is characterized in that the method comprises the steps of extracting human joint kinematic feature information according to the human skeleton feature information captured by Kinect equipment, carrying out data preprocessing on spatial position data of each joint based on time dimension, constructing a data set by utilizing the preprocessed data, and carrying out training and testing on a depression recognition neural network model by using the data set; the method comprises the following specific steps:
s1, collecting human skeleton kinematic characteristic data;
s2, extracting human skeleton kinematic characteristic data;
s3, preprocessing the characteristic data and making a data set;
s4, constructing and training a depression recognition neural network model;
and S5, inputting the human skeleton kinematic data to be recognized into the trained depression recognition neural network model, completing the recognition and diagnosis of depression, obtaining a prediction result and calculating the accuracy.
2. The method for identifying depression based on human skeleton kinematic feature information according to claim 1, wherein the step S1 is to collect human skeleton kinematic feature data as follows:
s101, starting a Kinect device to collect kinematic characteristic data of a human skeleton;
s102, guiding the experimental subject to enter a designated position, and playing a pre-designed stimulation action task instruction; wherein the stimulation action comprises a total of five kinematic segments, respectively: lifting and resetting a left arm, lifting and resetting a right arm, lifting and resetting two arms, turning left and resetting, and turning right and resetting; the time interval between each kinematic segment is 5s, and the time required for completing the whole stimulation action is 60 s;
s103, opening Kinect studio V2 software to complete the recording work of human skeleton kinematic data;
s104, after the data recording is finished, the psychologist evaluates the depression state of the current experimental subject, and the evaluation process records the Hamilton depression scale score of the experimental subject; wherein, the experimental subjects are divided into a depression group and a non-depression group;
and S105, screening effective experimental data when the score of the depression group is more than 20 and the score of the control group is less than 8 according to the Hamilton depression scale score standard.
3. The method for identifying depression based on human skeleton kinematic feature information according to claim 1, wherein the step S2 is to extract human skeleton kinematic feature data as follows:
s201, traversing xef recording files in the folder, acquiring file names and obtaining recording file paths;
s202, executing a script command to run SkeletonExtractor.exe in batch to open xef record files and read original record data;
s203, extracting time series data based on the xef recorded files;
s204, extracting the spatial positions (x, y, z) of the human skeleton joint points, and eliminating noise data; the method specifically comprises the following steps: for any Kinect detected target i 1,2,3 …, each human body skeleton joint point detects a spatial position (x, y, z), and for any time t, the target object i to be locked has the following relation:
Figure FDA0002561502710000021
s205, extracting quaternion ((Rx, Ry, Rz), Rw) of the spatial coordinate system of the data subjected to the noise reduction processing, and the formula is as follows:
Figure FDA0002561502710000022
each captured human skeletal joint point is composed of (x, y, z) and ((Rx, Ry, Rz), Rw), that is, each human skeletal joint point n at the time t of the kinematic segment is composed of seven dimensional data expressing the spatial position relationship:
djt=[xjt,yjt,zjt,Rxjt,Ryjt,Rzjt,Rwjt];
and for the human skeleton joint points captured by the Kinect, the human skeleton data extracted within the kinematic fragment time T is as follows:
Figure FDA0002561502710000031
wherein t represents time; j represents a human skeletal joint point; t is more than or equal to 1 and less than or equal to T, j is more than or equal to 1 and less than or equal to 25;
s206, obtaining the extraction result of the experimental object, and saving the extraction result as a file in the csv format by the name of the experimental object.
4. The method for identifying depression based on human skeleton kinematic feature information as claimed in claim 1, wherein the feature data preprocessing and making data set in step S3 are as follows:
s301, reading the extracted csv file of the experimental object;
s302, reading the time sequence and rewriting the time sequence into a csv file with the same name in another folder;
s303, reading data of the csv file except the time sequence, and preprocessing the adopted standardized data, wherein the method specifically comprises the following steps: for human skeleton data time series x1,x2,...,xtThe linear transformation of the raw data using a dispersion normalization method, with the result mapped to [0,1]Interval, the formula is as follows:
Figure FDA0002561502710000032
where n denotes the sequence length, the new sequence y1,y2,...,yn∈[0,1]And is dimensionless;
s304, writing the standardized data into the csv file in the step S302;
s305, labeling the experimental subjects of a depressed group and a non-depressed group respectively, labeling the depressed group as 0, and labeling the non-depressed group as 1;
s306, making a data set through a new data sequence preprocessed by a pythonnnumpy library, and storing data extracted by all experimental objects in a binary data format as a npy format file;
and S307, adopting a random division mode for the data set, setting 70% of the data set as a training set, and using 30% of the data set as a test set.
5. The method for identifying depression based on human skeleton kinematic feature information of claim 1, wherein the step S4 of constructing and training a depression identification neural network model is as follows:
s401, inputting training set data into a time convolution neural network to obtain a time convolution neural network output result; the time convolution neural network comprises two time hole convolution residual blocks which are sequentially connected, each layer of each residual block is a one-dimensional hole convolution network, the random discarding rate is 0.5, the activation function is ReLU, and the output of each layer is directly used as the input of the next layer;
s402, inputting the obtained time convolution neural network output result into a bottleneck convolution neural network as characteristic information to obtain an output result of the bottleneck convolution neural network; wherein the bottleneck convolutional neural network comprises a bottleneck network residual block; the bottleneck network residual block comprises three layers of one-dimensional convolution networks, and the kernel function size K of the one-dimensional convolution networks is 1,3 and 1;
and S403, converting the output result of the bottleneck convolutional neural network into a one-dimensional vector, connecting the one-dimensional vector and the one-dimensional vector, inputting the one-dimensional vector into a full connection layer, and then identifying the depression through a softmax classifier.
6. The method for identifying depression based on human skeleton kinematic feature information according to any one of claims 1 to 5, wherein the step S5 is implemented to identify and diagnose depression, obtain the prediction result and calculate the accuracy specifically as follows:
s501, identifying and diagnosing the depression by using softmax as a classifier, wherein the formula is as follows:
Figure FDA0002561502710000041
wherein S isiRepresenting a softmax predicted value of the ith class; i and j are category serial numbers;
s502, inputting human skeleton kinematic feature data into a depression recognition neural network model, wherein each category corresponds to a calculation value smaller than 1 and the sum is 1 when depression category prediction is carried out, and the category corresponding to the maximum calculation value is a prediction category;
s503, comparing the prediction categories with the real categories, calculating the proportion of the number of the depression categories of the correct prediction categories in the training data set to the total number of the data, and outputting the accuracy of the depression recognition neural network model;
s504, calculating the Loss of the prediction error category by adopting a Loss function, setting the Loss function of the depression recognition neural network model as a cross entropy function Loss, and adopting the following formula:
Figure FDA0002561502710000051
wherein M represents the number of categories; c represents a category number; y iscRepresenting a real tag; p is a radical ofcIndicating the output of softmax.
7. A depression recognition system based on human skeleton kinematics characteristic information is characterized by comprising,
the data collection module is used for collecting human skeleton kinematic characteristic data;
the data extraction module is used for extracting human skeleton kinematic characteristic data;
the preprocessing and data set making module is used for preprocessing the characteristic data and making a data set;
the model building and training module is used for building and training a depression recognition neural network model;
and the result prediction and accuracy rate calculation module is used for inputting the human skeleton kinematic data to be recognized into the trained depression recognition neural network model to complete the recognition diagnosis of the depression, obtain the prediction result and calculate the accuracy rate.
8. The system for identifying depression according to claim 7, wherein the data collecting module comprises,
the device starting and data collecting submodule is used for starting the Kinect device to collect the kinematic characteristic data of the human skeleton;
the instruction playing submodule is used for guiding the experimental subject to enter a designated position and playing a pre-designed stimulation action task instruction; wherein the stimulation action comprises a total of five kinematic segments, respectively: lifting and resetting a left arm, lifting and resetting a right arm, lifting and resetting two arms, turning left and resetting, and turning right and resetting; the time interval between each kinematic segment is 5s, and the time required for completing the whole stimulation action is 60 s;
the data recording submodule is used for opening Kinect studio V2 software to finish the recording work of human skeleton kinematic data;
the depression state evaluation submodule is used for evaluating the depression state of the current experimental subject by a psychologist after data recording is finished, and the evaluation process records the Hamilton depression scale score of the experimental subject; wherein, the experimental subjects are divided into a depression group and a non-depression group;
the screening submodule is used for screening effective experimental data when the score of a depression group is larger than 20 and the score of a control group is smaller than 8 according to the Hamilton depression scale scoring standard;
the data extraction module comprises a data extraction module and a data extraction module,
the file path recording submodule is used for traversing xef recorded files in the folder, acquiring file names and obtaining recording file paths;
an original record data reading submodule, configured to execute a script command to run skeletonextractor.exe in batch to open a xef record file, and read original record data;
a time-series data extraction sub-module for extracting time-series data based on the xef record file;
the noise data removing submodule is used for extracting the spatial positions (x, y, z) of the human skeleton joint points and removing the noise data; the method specifically comprises the following steps: for any Kinect detected target i 1,2,3 …, each human body skeleton joint point detects a spatial position (x, y, z), and for any time t, the target object i to be locked has the following relation:
Figure FDA0002561502710000061
and the quaternion extraction submodule is used for extracting quaternion ((Rx, Ry, Rz), Rw) of a space coordinate system of the data subjected to the noise reduction processing, and the formula is as follows:
Figure FDA0002561502710000071
each captured human skeletal joint point is composed of (x, y, z) and ((Rx, Ry, Rz), Rw), that is, each human skeletal joint point n at the time t of the kinematic segment is composed of seven dimensional data expressing the spatial position relationship:
djt=[xjt,yjt,zjt,Rxjt,Ryjt,Rzjt,Rwjt];
and for the human skeleton joint points captured by the Kinect, the human skeleton data extracted within the kinematic fragment time T is as follows:
Figure FDA0002561502710000072
wherein t represents time; j represents a human skeletal joint point; t is more than or equal to 1 and less than or equal to T, j is more than or equal to 1 and less than or equal to 25;
the extraction result obtaining submodule is used for obtaining the extraction result of the experimental object and saving the extraction result as a file in a csv format by the name of the experimental object;
the preprocessing and data set creation module includes,
the file reading submodule is used for reading the extracted csv file of the experimental object;
the time sequence reading submodule is used for reading the time sequence and rewriting the time sequence into a csv file with the same name in another folder;
the standardized data preprocessing submodule is used for reading the data of the csv file except the time sequence and preprocessing the adopted standardized data, and specifically comprises the following steps: for human skeleton data time series x1,x2,...,xtThe linear transformation of the raw data using a dispersion normalization method, with the result mapped to [0,1]Interval, the formula is as follows:
Figure FDA0002561502710000081
where n denotes the sequence length, the new sequence y1,y2,...,yn∈[0,1]And is dimensionless;
the file writing submodule is used for writing the standardized data into the csv file;
a subject labeling submodule for labeling the experimental subjects of the depressed group and the non-depressed group, respectively, labeling the depressed group as 0 and labeling the non-depressed group as 1;
the data set making submodule is used for making a data set through a new data sequence preprocessed by a pythonnnumpy library, and the data extracted by all experimental objects are stored as a. npy format file in a binary data format;
the data set dividing submodule is used for adopting a random dividing mode for the data set, setting 70% of the data set as a training set, and using 30% of the data set as a testing set;
the model building and training module comprises a model building and training module,
the time convolution neural network result output submodule is used for inputting the training set data to the time convolution neural network to obtain an output result of the time convolution neural network; the time convolution neural network comprises two time hole convolution residual blocks which are sequentially connected, each layer of each residual block is a one-dimensional hole convolution network, the random discarding rate is 0.5, the activation function is ReLU, and the output of each layer is directly used as the input of the next layer;
the bottleneck convolutional neural network result output submodule is used for inputting the obtained time convolutional neural network output result into the bottleneck convolutional neural network as characteristic information to obtain an output result of the bottleneck convolutional neural network; wherein the bottleneck convolutional neural network comprises a bottleneck network residual block; the bottleneck network residual block comprises three layers of one-dimensional convolution networks, and the kernel function size K of the one-dimensional convolution networks is 1,3 and 1;
the depression recognition submodule is used for converting an output result of the bottleneck convolutional neural network into a one-dimensional vector, connecting the one-dimensional vector and the one-dimensional vector, inputting the one-dimensional vector into the full connection layer, and then recognizing the depression through a softmax classifier;
the result prediction and accuracy calculation module comprises,
the identification and diagnosis submodule is used for identifying and diagnosing the depression by adopting softmax as a classifier, and the formula is as follows:
Figure FDA0002561502710000091
wherein S isiRepresenting a softmax predicted value of the ith class; i and j are category serial numbers;
the prediction class acquisition submodule is used for inputting the human skeleton kinematic feature data into the depression recognition neural network model, when depression class prediction is carried out, each class corresponds to a calculation value smaller than 1, the sum is 1, and the class corresponding to the maximum calculation value is a prediction class;
the comparison submodule is used for comparing the prediction category with the real category, calculating the proportion of the number of depression categories of the correct prediction category in the training data set to the total number of data, and outputting the accuracy of the depression recognition neural network model;
the Loss calculation submodule is used for calculating the Loss of the prediction error category by adopting a Loss function, setting the Loss function of the depression recognition neural network model as a cross entropy function Loss, and the formula is as follows:
Figure FDA0002561502710000092
wherein M represents the number of categories; c represents a category number; y iscRepresenting a real tag; p is a radical ofcIndicating the output of softmax.
9. An electronic device, comprising: a memory and at least one processor;
wherein the memory stores computer-executable instructions;
the at least one processor executing the computer-executable instructions stored by the memory causes the at least one processor to perform the method for identifying depression based on human skeletal kinematics feature information according to any of claims 1 to 6.
10. A computer-readable storage medium having stored thereon computer-executable instructions, which when executed by a processor, implement the method for identifying depression based on human skeletal kinematics characteristic information as claimed in claims 1 to 6.
CN202010608130.9A 2020-06-30 2020-06-30 Depression identification method and system based on human skeleton kinematics characteristic information Active CN111938670B (en)

Priority Applications (1)

Application Number Priority Date Filing Date Title
CN202010608130.9A CN111938670B (en) 2020-06-30 2020-06-30 Depression identification method and system based on human skeleton kinematics characteristic information

Applications Claiming Priority (1)

Application Number Priority Date Filing Date Title
CN202010608130.9A CN111938670B (en) 2020-06-30 2020-06-30 Depression identification method and system based on human skeleton kinematics characteristic information

Publications (2)

Publication Number Publication Date
CN111938670A true CN111938670A (en) 2020-11-17
CN111938670B CN111938670B (en) 2022-07-05

Family

ID=73337907

Family Applications (1)

Application Number Title Priority Date Filing Date
CN202010608130.9A Active CN111938670B (en) 2020-06-30 2020-06-30 Depression identification method and system based on human skeleton kinematics characteristic information

Country Status (1)

Country Link
CN (1) CN111938670B (en)

Cited By (5)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN113111760A (en) * 2021-04-07 2021-07-13 同济大学 Lightweight graph convolution human skeleton action identification method based on channel attention
CN113793662A (en) * 2021-08-16 2021-12-14 北京百度网讯科技有限公司 Correlation identification method, correlation identification device, electronic equipment and computer-readable storage medium
CN113827243A (en) * 2021-11-29 2021-12-24 江苏瑞脑启智医疗科技有限公司 Attention assessment method and system
CN116434908A (en) * 2023-06-14 2023-07-14 中国科学院自动化研究所 Method and device for quantitatively and hierarchically evaluating bradykinesia based on time convolution network
CN117530691A (en) * 2024-01-09 2024-02-09 南通大学 Depression tendency detection system, method and related equipment based on indoor network

Citations (6)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
JP2012075708A (en) * 2010-10-01 2012-04-19 Sharp Corp Stress state estimation device, stress state estimation method, program, and recording medium
CN104615983A (en) * 2015-01-28 2015-05-13 中国科学院自动化研究所 Behavior identification method based on recurrent neural network and human skeleton movement sequences
CN106175800A (en) * 2016-07-10 2016-12-07 北京工业大学 A kind of Mood State quantitative estimation method based on human motion state data
US20170251985A1 (en) * 2016-02-12 2017-09-07 Newton Howard Detection Of Disease Conditions And Comorbidities
US20180039745A1 (en) * 2016-08-02 2018-02-08 Atlas5D, Inc. Systems and methods to identify persons and/or identify and quantify pain, fatigue, mood, and intent with protection of privacy
CN111178312A (en) * 2020-01-02 2020-05-19 西北工业大学 Face expression recognition method based on multi-task feature learning network

Patent Citations (6)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
JP2012075708A (en) * 2010-10-01 2012-04-19 Sharp Corp Stress state estimation device, stress state estimation method, program, and recording medium
CN104615983A (en) * 2015-01-28 2015-05-13 中国科学院自动化研究所 Behavior identification method based on recurrent neural network and human skeleton movement sequences
US20170251985A1 (en) * 2016-02-12 2017-09-07 Newton Howard Detection Of Disease Conditions And Comorbidities
CN106175800A (en) * 2016-07-10 2016-12-07 北京工业大学 A kind of Mood State quantitative estimation method based on human motion state data
US20180039745A1 (en) * 2016-08-02 2018-02-08 Atlas5D, Inc. Systems and methods to identify persons and/or identify and quantify pain, fatigue, mood, and intent with protection of privacy
CN111178312A (en) * 2020-01-02 2020-05-19 西北工业大学 Face expression recognition method based on multi-task feature learning network

Non-Patent Citations (1)

* Cited by examiner, † Cited by third party
Title
JOSHI JYOTI,ET AL: "Relative Body Parts Movement for Automatic Depression Analysis", 《2013 HUMAINE ASSOCIATION CONFERENCE ON AFFECTIVE COMPUTING AND INTELLIGENT INTERACTION》 *

Cited By (7)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN113111760A (en) * 2021-04-07 2021-07-13 同济大学 Lightweight graph convolution human skeleton action identification method based on channel attention
CN113793662A (en) * 2021-08-16 2021-12-14 北京百度网讯科技有限公司 Correlation identification method, correlation identification device, electronic equipment and computer-readable storage medium
CN113793662B (en) * 2021-08-16 2023-07-28 北京百度网讯科技有限公司 Correlation identification method, apparatus, electronic device, and computer-readable storage medium
CN113827243A (en) * 2021-11-29 2021-12-24 江苏瑞脑启智医疗科技有限公司 Attention assessment method and system
CN116434908A (en) * 2023-06-14 2023-07-14 中国科学院自动化研究所 Method and device for quantitatively and hierarchically evaluating bradykinesia based on time convolution network
CN117530691A (en) * 2024-01-09 2024-02-09 南通大学 Depression tendency detection system, method and related equipment based on indoor network
CN117530691B (en) * 2024-01-09 2024-04-09 南通大学 Depression tendency detection system, method and related equipment based on indoor network

Also Published As

Publication number Publication date
CN111938670B (en) 2022-07-05

Similar Documents

Publication Publication Date Title
CN111938670B (en) Depression identification method and system based on human skeleton kinematics characteristic information
CN111160367B (en) Image classification method, apparatus, computer device, and readable storage medium
US20220051405A1 (en) Image processing method and apparatus, server, medical image processing device and storage medium
US11715198B2 (en) Medical use artificial neural network-based medical image analysis apparatus and method for evaluating analysis results of medical use artificial neural network
CN103177179A (en) Diagnostic factor set determination apparatus and method
Garcia Caballero et al. V‐awake: a visual analytics approach for correcting sleep predictions from deep learning models
CN116864140A (en) Intracardiac branch of academic or vocational study postoperative care monitoring data processing method and system thereof
Amiri et al. Feature selection for bleeding detection in capsule endoscopy images using genetic algorithm
CN116452592B (en) Method, device and system for constructing brain vascular disease AI cognitive function evaluation model
CN113658697A (en) Psychological assessment system based on video fixation difference
US11817216B2 (en) Search method and information processing system
CN111081374B (en) Autism auxiliary diagnosis device based on common attention paradigm
CN113888663B (en) Reconstruction model training method, anomaly detection method, device, equipment and medium
US11571156B2 (en) Renal function assessment method, renal function assessment system and kidney care device
WO2022047043A1 (en) Surgical phase recognition with sufficient statistical model
Yoo et al. Residual one-dimensional convolutional neural network for neuromuscular disorder classification from needle electromyography signals with explainability
Huang et al. Parkinson's severity diagnosis explainable model based on 3D multi-head attention residual network
JP7506943B2 (en) Signal processing method, signal processing device, and signal processing program
TWI764233B (en) Alzheimer's disease assessment system
CN113393940B (en) Control method and medical system
CN117893528B (en) Method and device for constructing cardiovascular and cerebrovascular disease classification model
CN117831789B (en) Cancer treatment response prediction system and control method thereof
CN117503056A (en) Multi-scale respiratory event automatic positioning method and device based on bimodal feature learning
EP3926540A1 (en) Method for filtering normal medical image, method for interpreting medical image, and computing device implementing the methods
Irfan et al. Machine learning in neurological disorders: A multivariate LSTM and AdaBoost approach to Alzheimer's disease time series analysis

Legal Events

Date Code Title Description
PB01 Publication
PB01 Publication
SE01 Entry into force of request for substantive examination
SE01 Entry into force of request for substantive examination
GR01 Patent grant
GR01 Patent grant
EE01 Entry into force of recordation of patent licensing contract

Application publication date: 20201117

Assignee: Jinan Hualang Electronic Technology Co.,Ltd.

Assignor: Qilu University of Technology

Contract record no.: X2023980048589

Denomination of invention: A Depression Recognition Method and System Based on Human Skeleton Kinematic Feature Information

Granted publication date: 20220705

License type: Common License

Record date: 20231204

EE01 Entry into force of recordation of patent licensing contract