CN109376663A - A kind of human posture recognition method and relevant apparatus - Google Patents

A kind of human posture recognition method and relevant apparatus Download PDF

Info

Publication number
CN109376663A
CN109376663A CN201811268507.XA CN201811268507A CN109376663A CN 109376663 A CN109376663 A CN 109376663A CN 201811268507 A CN201811268507 A CN 201811268507A CN 109376663 A CN109376663 A CN 109376663A
Authority
CN
China
Prior art keywords
human body
obtains
model
human
neural network
Prior art date
Legal status (The legal status is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the status listed.)
Pending
Application number
CN201811268507.XA
Other languages
Chinese (zh)
Inventor
丁鹏程
成思源
Current Assignee (The listed assignees may be inaccurate. Google has not performed a legal analysis and makes no representation or warranty as to the accuracy of the list.)
Guangdong University of Technology
Original Assignee
Guangdong University of Technology
Priority date (The priority date is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the date listed.)
Filing date
Publication date
Application filed by Guangdong University of Technology filed Critical Guangdong University of Technology
Priority to CN201811268507.XA priority Critical patent/CN109376663A/en
Publication of CN109376663A publication Critical patent/CN109376663A/en
Pending legal-status Critical Current

Links

Classifications

    • GPHYSICS
    • G06COMPUTING; CALCULATING OR COUNTING
    • G06VIMAGE OR VIDEO RECOGNITION OR UNDERSTANDING
    • G06V40/00Recognition of biometric, human-related or animal-related patterns in image or video data
    • G06V40/10Human or animal bodies, e.g. vehicle occupants or pedestrians; Body parts, e.g. hands
    • G06V40/103Static body considered as a whole, e.g. static pedestrian or occupant recognition
    • GPHYSICS
    • G06COMPUTING; CALCULATING OR COUNTING
    • G06NCOMPUTING ARRANGEMENTS BASED ON SPECIFIC COMPUTATIONAL MODELS
    • G06N3/00Computing arrangements based on biological models
    • G06N3/02Neural networks
    • G06N3/04Architecture, e.g. interconnection topology
    • G06N3/045Combinations of networks
    • GPHYSICS
    • G06COMPUTING; CALCULATING OR COUNTING
    • G06NCOMPUTING ARRANGEMENTS BASED ON SPECIFIC COMPUTATIONAL MODELS
    • G06N3/00Computing arrangements based on biological models
    • G06N3/02Neural networks
    • G06N3/08Learning methods
    • G06N3/084Backpropagation, e.g. using gradient descent

Landscapes

  • Engineering & Computer Science (AREA)
  • Physics & Mathematics (AREA)
  • Theoretical Computer Science (AREA)
  • General Physics & Mathematics (AREA)
  • General Health & Medical Sciences (AREA)
  • General Engineering & Computer Science (AREA)
  • Biophysics (AREA)
  • Computational Linguistics (AREA)
  • Data Mining & Analysis (AREA)
  • Evolutionary Computation (AREA)
  • Artificial Intelligence (AREA)
  • Molecular Biology (AREA)
  • Computing Systems (AREA)
  • Biomedical Technology (AREA)
  • Life Sciences & Earth Sciences (AREA)
  • Mathematical Physics (AREA)
  • Software Systems (AREA)
  • Health & Medical Sciences (AREA)
  • Human Computer Interaction (AREA)
  • Multimedia (AREA)
  • Image Analysis (AREA)

Abstract

This application discloses a kind of human posture recognition methods, comprising: carries out human skeleton feature extraction processing to the human body image of preset quantity, obtains training set data;Neural metwork training is carried out using the training set data, obtains gesture recognition model;Testing and debugging processing is carried out to the gesture recognition model according to the test set data of acquisition, obtains optimum attitude identification model;Human body image to be identified is identified according to the optimum attitude identification model, obtains recognition result.It by reducing characteristic amount using framework characteristic as characteristic, and is identified using neural network, improves the efficiency of human body attitude estimation.Disclosed herein as well is a kind of human body attitude identifying system, server and computer readable storage mediums, have the above beneficial effect.

Description

A kind of human posture recognition method and relevant apparatus
Technical field
This application involves field of computer technology, in particular to a kind of human posture recognition method, human body attitude identification system System, server and computer readable storage medium.
Background technique
With the continuous development of computer vision technique, human body attitude estimation has been supplied in intelligent video monitoring, intelligent family The fields such as residence, motion analysis, human-computer interaction and medical rehabilitation.Wherein, human body attitude estimation is mainly relevant by human body attitude Image data is as input data, then is estimated by related algorithm input data to obtain the probable value of human body attitude, It is exactly the estimated value of human body attitude.
Currently, the method for feature extraction is carried out in order to retain effective human action information to image data, it is dynamic to human body The dimension for making to extract data is more and more.Although increasing the information content of action description, guarantee the accuracy of posture judgement, More and more characteristic dimensions, can make redundant computation amount more and more, reduce speed when data calculate, reduce data The real-time of calculating.Also, human action information is judged generally by the method that mode compares in the prior art, is sentenced Disconnected accuracy rate is lower, as a result inaccurate, and human body attitude estimation procedure is caused to obtain the inefficiency of correct result.
Therefore, how to improve the efficiency of human body attitude estimation is the Important Problems of those skilled in the art's concern.
Summary of the invention
The purpose of the application is to provide a kind of human posture recognition method, human body attitude identifying system, server and meter Calculation machine readable storage medium storing program for executing by reducing characteristic amount using framework characteristic as characteristic, and uses neural network It is identified, improves the efficiency of human body attitude estimation.
In order to solve the above technical problems, the application provides a kind of human posture recognition method, comprising:
Human skeleton feature extraction processing is carried out to the human body image of preset quantity, obtains training set data;
Neural metwork training is carried out using the training set data, obtains gesture recognition model;
Testing and debugging processing is carried out to the gesture recognition model according to the test set data of acquisition, obtains optimum attitude knowledge Other model;
Human body image to be identified is identified according to the optimum attitude identification model, obtains recognition result.
Optionally, human skeleton feature extraction processing is carried out to the human body image of preset quantity, obtains training set data, wrapped It includes:
Median filtering is carried out to the human body image of the preset quantity, obtains multiple filtering human body images;
The extraction of human skeleton data is carried out to all filtering human body images, obtains multiple human skeleton pose presentations;
All human body skeleton pose images are normalized, processing is obtained into image data as the training set Data.
Optionally, neural metwork training is carried out using the training set data, obtains gesture recognition model, comprising:
The training set data is input in convolutional neural networks and long Memory Neural Networks in short-term, is constructed to obtain Deep neural network model;
Loss function is calculated to the deep neural network model according to propagated forward algorithm, obtains loss function value;
Judge whether the deep neural network model is convergence state according to the loss function value;
If so, using the deep neural network model as the gesture recognition model;
If it is not, the parameter of the deep neural network model is then adjusted according to back-propagation algorithm, until depth mind It is convergence state through network model, using the deep neural network model as the gesture recognition model.
Optionally, testing and debugging processing is carried out to the gesture recognition model according to the test set data of acquisition, obtained most Yogci state identification model, comprising:
The gesture recognition model is tested according to the test set data, obtains test result;
The gesture recognition model is adjusted according to the test result, obtains optimum attitude identification model.
Optionally, human body image to be identified is identified according to the optimum attitude identification model, obtains recognition result, Include:
The human body image to be identified is subjected to human skeleton feature extraction processing, obtains human skeleton image to be identified;
The human skeleton image to be identified is input to convolutional neural networks to handle, obtains characteristic spectrum;
The characteristic spectrum is input to the length, and Memory Neural Networks are handled in short-term, obtain long short-term memory nerve Web results;
By the length, Memory Neural Networks result is input to Softmax classifier and handles in short-term, obtains multiple generics Probability;
Using all generic probability as the recognition result.
The application also provides a kind of human body attitude identifying system, includes:
Training data obtains module, carries out human skeleton feature extraction processing for the human body image to preset quantity, obtains To training set data;
Neural metwork training module obtains gesture recognition for carrying out neural metwork training using the training set data Model;
Neural network test module, for carrying out test tune to the gesture recognition model according to the test set data of acquisition Whole processing obtains optimum attitude identification model;
Identification module is known for being identified according to the optimum attitude identification model to human body image to be identified Other result.
Optionally, the training data obtains module, comprising:
Filter unit carries out median filtering for the human body image to the preset quantity, obtains multiple filtering human figures Picture;
Skeleton data acquiring unit obtains multiple for carrying out the extraction of human skeleton data to all filtering human body images Human skeleton pose presentation;
Processing is obtained figure for all human body skeleton pose images to be normalized by normalized unit As data are as the training set data.
Optionally, the neural metwork training module, comprising:
Network struction unit, for the training set data to be input to convolutional neural networks and long short-term memory nerve net In network, constructed to obtain deep neural network model;
Loss function computing unit, for calculating loss letter to the deep neural network model according to propagated forward algorithm Number, obtains loss function value;
Judging unit is restrained, for judging whether the deep neural network model is convergence according to the loss function value State;
Identification model acquiring unit is used for when the deep neural network model is convergence state, by the depth mind Through network model as the gesture recognition model;
Model parameter adjusts unit, for being passed according to reversed when the deep neural network model is not convergence state The parameter that algorithm adjusts the deep neural network model is broadcast, until the deep neural network model is convergence state, by institute Deep neural network model is stated as the gesture recognition model.
The application also provides a kind of server, comprising:
Memory, for storing computer program;
Processor, the step of human posture recognition method as described above is realized when for executing the computer program.
The application also provides a kind of computer readable storage medium, and calculating is stored on the computer readable storage medium The step of machine program, the computer program realizes human posture recognition method as described above when being executed by processor.
A kind of human posture recognition method provided herein, comprising: human body is carried out to the human body image of preset quantity Framework characteristic extraction process, obtains training set data;Neural metwork training is carried out using the training set data, obtains posture knowledge Other model;Testing and debugging processing is carried out to the gesture recognition model according to the test set data of acquisition, obtains optimum attitude knowledge Other model;Human body image to be identified is identified according to the optimum attitude identification model, obtains recognition result.
Characteristic by using human skeleton feature as neural metwork training effectively reduces the number of characteristic According to amount, reduce calculation amount, accelerate computational efficiency, and framework characteristic can completely reflect the characteristic of human body attitude, It ensure that the accuracy of identification process, while being identified using the gesture recognition model that neural network model training obtains, mentioned The high accuracy rate and precision of gesture recognition, also improves the Generalization Capability of identification model, increases human body attitude identification process In robustness, reduce block, the influence of background, illumination, multi-angle of view and multi-angle to recognition result, improve human body appearance The reliability of state identification.
The application also provides a kind of human body attitude identifying system, server and computer readable storage medium, have with Upper beneficial effect, details are not described herein.
Detailed description of the invention
In order to illustrate the technical solutions in the embodiments of the present application or in the prior art more clearly, to embodiment or will show below There is attached drawing needed in technical description to be briefly described, it should be apparent that, the accompanying drawings in the following description is only this The embodiment of application for those of ordinary skill in the art without creative efforts, can also basis The attached drawing of offer obtains other attached drawings.
Fig. 1 is a kind of flow chart of human posture recognition method provided by the embodiment of the present application;
Fig. 2 is the process of human skeleton feature extracting method in human posture recognition method provided by the embodiment of the present application Figure;
Fig. 3 is the flow chart of neural network training method in human posture recognition method provided by the embodiment of the present application;
Fig. 4 is a kind of structural schematic diagram of human body attitude identifying system provided by the embodiment of the present application.
Specific embodiment
The core of the application is to provide a kind of human posture recognition method, human body attitude identifying system, server and meter Calculation machine readable storage medium storing program for executing by reducing characteristic amount using framework characteristic as characteristic, and uses neural network It is identified, improves the efficiency of human body attitude estimation.
To keep the purposes, technical schemes and advantages of the embodiment of the present application clearer, below in conjunction with the embodiment of the present application In attached drawing, the technical scheme in the embodiment of the application is clearly and completely described, it is clear that described embodiment is Some embodiments of the present application, instead of all the embodiments.Based on the embodiment in the application, those of ordinary skill in the art Every other embodiment obtained without making creative work, shall fall in the protection scope of this application.
In current techniques, to image data carry out feature extraction method in order to retain effective human action information, it is right The dimension that human action extracts data is more and more.Although increasing the information content of action description, guarantee the accurate of posture judgement Property, but more and more characteristic dimensions, redundant computation amount can be made more and more, reduce speed when data calculate, Reduce the real-time that data calculate.Also, in the prior art generally by mode comparison method to human action information into Row judgement, the accuracy rate of judgement is lower, as a result inaccurate, and human body attitude estimation procedure is caused to obtain the low efficiency of correct result Under.
Therefore, the embodiment of the present application provides a kind of human posture recognition method, by using human skeleton feature as mind Characteristic through network training effectively reduces the data volume of characteristic, reduces calculation amount, accelerates computational efficiency, And framework characteristic can completely reflect the characteristic of human body attitude, ensure that the accuracy of identification process, while using mind The gesture recognition model obtained through network model training is identified, is improved the accuracy rate and precision of gesture recognition, is also improved The Generalization Capability of identification model, increases the robustness in human body attitude identification process, reduces and blocks, is background, illumination, more The influence of visual angle and multi-angle to recognition result improves the reliability of human body attitude identification.
Referring to FIG. 1, Fig. 1 is a kind of flow chart of human posture recognition method provided by the embodiment of the present application.
This method may include:
S101 carries out human skeleton feature extraction processing to the human body image of preset quantity, obtains training set data;
This step is intended to carry out human skeleton feature extraction processing to human body image, obtains training set data.
During general neural metwork training, also needs to carry out feature extraction to original data, be closed Suitable characteristic.Currently, during for human body attitude identification, the profile information of human body is mainly got, although obtaining Taken complete information that can carry out relatively accurate posture judgement, but the data volume of profile information is larger, data dimension compared with More, redundant computation amount is increasing, and the speed of training process can be made lower, equally also will affect the speed of identification process, unfavorable In the real-time of identification process.
Therefore, human skeleton feature is mainly extracted in original human body image in this step, by multiple groups human body bone Frame characteristic is as training set data.Since human skeleton is made of multiple bones with certain length, that is, only The beginning and end for describing the bone can be obtained by the position of the direction or the bone of the bone in space, it is possible to reduce The case where carrying out the data volume of feature description, and can completely showing human body attitude by framework information, guarantees feature The accuracy of data.So selecting human skeleton characteristic as training data in the present embodiment, reduction is trained and knows Other data volume improves the speed of training and identification in the case where keeping accuracy.
Wherein, characteristic relevant to skeleton mainly is got for human skeleton feature extraction processing, specifically, Since the mode of description skeleton is different, the characteristic of available whole skeletons, the also spy of available part main skeleton Data are levied, no matter special using any skeleton any one the framework characteristic acquisition methods that can also be provided using the prior art are The method for levying data acquisition, the data volume and data dimension of the characteristic finally obtained can be reduced.
Wherein, the number of preset quantity will affect the data volume of training set data, if the effect of the more training of quantity Can be better, the time of negligible amounts training will be shorter, and therefore, preset quantity can be 1000 quantity less in this way, It can be 50,000, suitable quantity can also be selected according to actual applicable cases, be not specifically limited herein.
S102 carries out neural metwork training using training set data, obtains gesture recognition model;
On the basis of this step S101, this step be intended to use more than training set data carry out neural metwork training, Obtain gesture recognition model.
Neural network training method conducted in this step can be general neural network training method, can also be Any one neural network training method provided in the prior art is also possible to the instruction of neural network provided by following embodiment Practice method, it is seen then that neural network training method is not unique, is not specifically limited herein.
Wherein, the gesture recognition model obtained is the model judged the characteristic that human body image extracts, can To export appearance probability of state.For example, being mentioned now with a specific human body image according to the human skeleton feature in previous step It takes processing to carry out feature extraction, obtains framework characteristic, then carry out identification by the gesture recognition model to export accordingly Generic probability, if the posture classification of setting has station, sits, lies, walking, jumping this five kinds, then every kind of appearance probability of state is just exported, The probability 80% stood, the probability 10% of seat, the probability 5% lain, the probability 5% walked, the probability 20% of jump.
S103 carries out testing and debugging processing to gesture recognition model according to the test set data of acquisition, obtains optimum attitude Identification model;
On the basis of step S102, this step is intended to the gesture recognition mould obtained using test set data to previous step Type optimizes, that is, testing and debugging processing, is specifically first tested, is adjusted further according to test result, obtained most Yogci state identification model.
Wherein, test set data are the characteristic data sets of known posture, thus can by known posture and identification posture into Row comparison, to be adjusted according to comparing result.The data volume of test set data will affect that testing and debugging is handled as a result, more More test results are more accurate, and fewer test speed is faster, can be selected according to the actual situation.
S104 identifies human body image to be identified according to optimum attitude identification model, obtains recognition result.
On the basis of step S103, this step is intended for human body attitude identification, that is, is identified using optimum attitude Model identifies human body image to be identified, obtains recognition result.Wherein, including to human body image to be identified human body bone is carried out Frame feature extraction processing is obtained framework characteristic to be identified, then is carried out using optimum attitude identification model to framework characteristic to be identified Identification, obtains recognition result.
Wherein, it carries out knowing any one recognition methods that method for distinguishing can be provided using the prior art, can also use The recognition methods that following optinal plan provides.
Optionally, the S103 in this step may include:
Step 1, gesture recognition model is tested according to test set data, obtains test result;
Step 2, gesture recognition model is adjusted according to test result, obtains optimum attitude identification model.
This optinal plan mainly introduces a kind of specific testing and debugging processing method, first according to test set data to posture Identification model is tested, and test result is obtained, and is adjusted further according to test result to gesture recognition model, is obtained most Yogci State identification model.
Specifically, first passing through camera collects 300 human body images, classifies to these human body images, indicate it Specific human body attitude, such as stand, sit, lie, walk, jump.Then it pre-processes, extracts pair to all people's body image The characteristic answered, then these characteristics are identified by gesture recognition model, obtains recognition result, then by the identification As a result it is compared with known classification, determines that model is that correctly, then output is corresponding right for the identification of which image Than as a result, as test result.Finally gesture recognition model is adjusted according to the test result, obtains optimum attitude identification Model.Wherein, it is adjusted the parameter for usually adjusting neural network.
Wherein, multiple human body images can be tested simultaneously, is adjusted according to test result, it can also will be more Human body image obtains test result in batches and is adjusted again, it is seen then that the execution sequence of the specific steps in this optinal plan has A variety of sequences and a variety of numbers can be selected according to practical situation, be not specifically limited herein.
Optionally, the S104 in this step may include:
Step 1, human body image to be identified is subjected to human skeleton feature extraction processing, obtains human skeleton figure to be identified Picture;
Step 2, human skeleton image to be identified is input to convolutional neural networks to handle, obtains characteristic spectrum;
Step 3, characteristic spectrum is input to long Memory Neural Networks in short-term to handle, obtains long short-term memory nerve net Network result;
Step 4, long Memory Neural Networks result in short-term is input to Softmax classifier to handle, obtains multiple classes Belong to probability;
Step 5, using all generic probability as recognition result.
Wherein, the convolution kernel size in convolutional neural networks is respectively 3 × 3, and down-sampling layer is reduced using maximum pond Convolution exports dimension, to handle in image redundant data, while can reduce motion frequency variation and cause ground to accuracy of identification It influences, pond area size is 2 × 2.Each convolutional layer is followed by a pond layer in convolutional neural networks, and pond technology can improve The translation invariance of algorithm, all convolution, the setting of the step-length of pond operation are all 1.
By place in CNN (convolutional neural networks) treated characteristic spectrum input LSTM (long Memory Neural Networks in short-term) Reason, to prevent long-time data from inputting excessive the problem of causing gradient to be exploded.
Full articulamentum is set as 1 layer in this optinal plan, neuron number 1024, and using feed forward type neural network Connection type.Classify layer using Softmax classifier, is normalized by Softmax layers, determine that the generic of human skeleton is general Rate.
Wherein, Softmax function is as follows:
Wherein, p (Ck) indicate generic probability;ok、oiRespectively indicate the kth and i-th of element of full articulamentum output;C is indicated Human body attitude classification sum.
The all hyperbolic tangent function tanh of the activation primitive of convolutional neural networks in this optinal plan.
Formula is as follows:
Wherein, x is input variable.
To sum up, characteristic of the present embodiment by using human skeleton feature as neural metwork training, is effectively reduced The data volume of characteristic, reduces calculation amount, accelerates computational efficiency, and framework characteristic can completely reflect people The characteristic of body posture ensure that the accuracy of identification process, while the gesture recognition mould obtained using neural network model training Type is identified, is improved the accuracy rate and precision of gesture recognition, is also improved the Generalization Capability of identification model, increase human body Robustness during gesture recognition, reduce block, the influence of background, illumination, multi-angle of view and multi-angle to recognition result, Improve the reliability of human body attitude identification.
On the basis of a upper embodiment, the present embodiment mainly introduces a kind of extracting method of human skeleton feature.Upper one Human skeleton feature extracting method conducted in embodiment can be using any one feature extraction provided in the prior art Method can use the extracting method of human skeleton feature provided in this embodiment, other parts and a upper embodiment substantially phase Together, same section can refer to a upper embodiment, and this will not be repeated here.
Referring to FIG. 2, Fig. 2 is human skeleton feature extraction in human posture recognition method provided by the embodiment of the present application The flow chart of method.
This method may include:
S201 carries out median filtering to the human body image of preset quantity, obtains multiple filtering human body images;
This step is mainly to carry out median filtering to the human body image of preset quantity first, obtains multiple filtering human figures Picture.The human body image for obtaining preset quantity is original image, and original image exists due to the difference of acquiring way and acquisition modes Different degrees of clutter noise point and noise fritter, noise therein can impact the accuracy of feature extraction.Therefore, originally Median filtering is carried out to multiple human body images of acquisition in step, to remove the clutter noise point and noise fritter in image, Reduce noise data amount.
Wherein, the method for the median filtering of progress can select 3 × 3 filtering templates to be handled.
S202 carries out the extraction of human skeleton data to all filtering human body images, obtains multiple human skeleton pose presentations;
On the basis of previous step, this step is intended to obtaining all filtering human body images and carry out human skeleton data to mention It takes, obtains human skeleton pose presentation.This step is exactly to analyze the human body in filtering human body image, obtains corresponding people Body skeleton data is the equal of that corresponding skeleton pose image is partitioned into from human body image.It is conceivable that having compared people The image of external shape wherein the data volume for describing skeleton pose also can be fewer, therefore passes through this due to being skeleton pose image The characteristic of sample carries out the calculation amount that neural metwork training can be significantly reduced data operation, reduces computing redundancy, improves Speed.
Wherein, any one human skeleton data that the method that human skeleton data are extracted can be provided using the prior art The method of extraction, is not specifically limited herein.
All human body skeleton pose images are normalized in S203, and processing is obtained image data as training Collect data.
On the basis of step S202, this step is intended to that all people's body skeleton pose image is normalized, Multiple images data are obtained, then using all image datas as training set data.
Wherein, the figure that image data is mainly all adjusted to unified size specification is specifically normalized Data, wherein size specification can select suitable size specification according to practical situation, can also set size specification to 256×256。
It is less that data volume can be extracted through this embodiment, and the less characteristic of computing redundancy amount improves subsequent step Calculating speed in rapid.
On the basis of all of above embodiment, the present embodiment mainly does one specifically to the method for neural metwork training Illustrate, the corresponding portion of other parts and all of above embodiment is substantially the same, and same section can refer to all of above implementation Example, this will not be repeated here.
Referring to FIG. 3, Fig. 3 is neural network training method in human posture recognition method provided by the embodiment of the present application Flow chart.
This method may include:
Training set data is input in convolutional neural networks and long Memory Neural Networks in short-term, construct by S301 To deep neural network model;
S302 calculates loss function to deep neural network model according to propagated forward algorithm, obtains loss function value;
S303 judges whether deep neural network model is convergence state according to loss function value;
S304, if so, using deep neural network model as gesture recognition model;
S305, if it is not, the parameter of deep neural network model is then adjusted according to back-propagation algorithm, until depth nerve net Network model is convergence state, using deep neural network model as gesture recognition model.
Wherein, for loss function using entropy function is intersected, formula is as follows:
Wherein, p (x) is sample label, indicates true sample distribution state, and it is to pass through number that q (x), which is estimating for model, According to the estimated probability being calculated, the distribution of model is indicated.
Backpropagation (BP Backpropagation) algorithm regulating networks parameter is used in the present embodiment, using boarding steps It spends descent method (SGD Stochastic gradient descent) and carries out parameter optimization, learning rate when training is 0.001, weight attenuation coefficient is 0.0001.
It jumps out and follows it can be found that the step in the present embodiment is that circulation carries out by the S303 to S305 in this implementation The judgement of ring is to judge whether deep neural network model is convergence state.If being unsatisfactory for S303 to continue to execute S301, directly To S303 is met, S304 is executed, gesture recognition model is obtained.
The embodiment of the present application provides neural network training method, the gesture recognition obtained by neural network model training Model is identified, is improved the accuracy rate and precision of gesture recognition, is also improved the Generalization Capability of identification model, increase people Robustness during body gesture recognition, reduce block, background, illumination, multi-angle of view and multi-angle be to the shadow of recognition result It rings, improves the reliability of human body attitude identification.
A kind of human body attitude identifying system provided by the embodiments of the present application is introduced below, a kind of people described below Body gesture recognition system can correspond to each other reference with a kind of above-described human posture recognition method.
Referring to FIG. 4, Fig. 4 is a kind of structural schematic diagram of human body attitude identifying system provided by the embodiment of the present application.
The system may include:
Training data obtains module 100, carries out human skeleton feature extraction processing for the human body image to preset quantity, Obtain training set data;
Neural metwork training module 200 obtains gesture recognition mould for carrying out neural metwork training using training set data Type;
Neural network test module 300, for carrying out test tune to gesture recognition model according to the test set data of acquisition Whole processing obtains optimum attitude identification model;
Identification module 400 is identified for being identified according to optimum attitude identification model to human body image to be identified As a result.
Optionally, which obtains module 100, may include:
Filter unit carries out median filtering for the human body image to preset quantity, obtains multiple filtering human body images;
Skeleton data acquiring unit obtains multiple for carrying out the extraction of human skeleton data to all filtering human body images Human skeleton pose presentation;
Processing is obtained figure for all human body skeleton pose images to be normalized by normalized unit As data are as training set data.
Optionally, the neural metwork training module 200 may include:
Network struction unit, for training set data to be input to convolutional neural networks and long Memory Neural Networks in short-term In, it is constructed to obtain deep neural network model;
Loss function computing unit, for calculating loss function to deep neural network model according to propagated forward algorithm, Obtain loss function value;
Judging unit is restrained, for judging whether deep neural network model is convergence state according to loss function value;
Identification model acquiring unit is used for when deep neural network model is convergence state, by deep neural network mould Type is as gesture recognition model;
Model parameter adjusts unit, for being calculated according to backpropagation when deep neural network model is not convergence state Method adjusts the parameter of deep neural network model, until deep neural network model is convergence state, by deep neural network mould Type is as gesture recognition model.
The embodiment of the present application can also provide a kind of server, comprising:
Memory, for storing computer program;
Processor realizes human posture recognition method as described above in Example when for executing the computer program The step of.
The embodiment of the present application can also provide a kind of computer readable storage medium, which is characterized in that the computer can It reads to be stored with computer program on storage medium, be realized when the computer program is executed by processor as described above in Example Human posture recognition method the step of.
The computer readable storage medium may include: USB flash disk, mobile hard disk, read-only memory (Read-Only Memory, ROM), random access memory (RandomAccess Memory, RAM), magnetic or disk etc. is various to deposit Store up the medium of program code.
Each embodiment is described in a progressive manner in specification, the highlights of each of the examples are with other realities The difference of example is applied, the same or similar parts in each embodiment may refer to each other.For device disclosed in embodiment Speech, since it is corresponded to the methods disclosed in the examples, so being described relatively simple, related place is referring to method part illustration ?.
Professional further appreciates that, unit described in conjunction with the examples disclosed in the embodiments of the present disclosure And algorithm steps, can be realized with electronic hardware, computer software, or a combination of the two, in order to clearly demonstrate hardware and The interchangeability of software generally describes each exemplary composition and step according to function in the above description.These Function is implemented in hardware or software actually, the specific application and design constraint depending on technical solution.Profession Technical staff can use different methods to achieve the described function each specific application, but this realization is not answered Think beyond scope of the present application.
The step of method described in conjunction with the examples disclosed in this document or algorithm, can directly be held with hardware, processor The combination of capable software module or the two is implemented.Software module can be placed in random access memory (RAM), memory, read-only deposit Reservoir (ROM), electrically programmable ROM, electrically erasable ROM, register, hard disk, moveable magnetic disc, CD-ROM or technology In any other form of storage medium well known in field.
Above to a kind of human posture recognition method provided herein, human body attitude identifying system, server and Computer readable storage medium is described in detail.Principle and embodiment of the specific case to the application used herein It is expounded, the description of the example is only used to help understand the method for the present application and its core ideas.It should be pointed out that For those skilled in the art, under the premise of not departing from the application principle, can also to the application into Row some improvements and modifications, these improvement and modification are also fallen into the protection scope of the claim of this application.

Claims (10)

1. a kind of human posture recognition method characterized by comprising
Human skeleton feature extraction processing is carried out to the human body image of preset quantity, obtains training set data;
Neural metwork training is carried out using the training set data, obtains gesture recognition model;
Testing and debugging processing is carried out to the gesture recognition model according to the test set data of acquisition, obtains optimum attitude identification mould Type;
Human body image to be identified is identified according to the optimum attitude identification model, obtains recognition result.
2. human posture recognition method according to claim 1, which is characterized in that carried out to the human body image of preset quantity Human skeleton feature extraction processing, obtains training set data, comprising:
Median filtering is carried out to the human body image of the preset quantity, obtains multiple filtering human body images;
The extraction of human skeleton data is carried out to all filtering human body images, obtains multiple human skeleton pose presentations;
All human body skeleton pose images are normalized, processing is obtained into image data as the training set number According to.
3. human posture recognition method according to claim 1, which is characterized in that carry out mind using the training set data Through network training, gesture recognition model is obtained, comprising:
The training set data is input in convolutional neural networks and long Memory Neural Networks in short-term, is constructed to obtain depth Neural network model;
Loss function is calculated to the deep neural network model according to propagated forward algorithm, obtains loss function value;
Judge whether the deep neural network model is convergence state according to the loss function value;
If so, using the deep neural network model as the gesture recognition model;
If it is not, the parameter of the deep neural network model is then adjusted according to back-propagation algorithm, until the depth nerve net Network model is convergence state, using the deep neural network model as the gesture recognition model.
4. human posture recognition method according to claim 1, which is characterized in that according to the test set data of acquisition to institute It states gesture recognition model and carries out testing and debugging processing, obtain optimum attitude identification model, comprising:
The gesture recognition model is tested according to the test set data, obtains test result;
The gesture recognition model is adjusted according to the test result, obtains optimum attitude identification model.
5. human posture recognition method according to claim 1, which is characterized in that according to the optimum attitude identification model Human body image to be identified is identified, recognition result is obtained, comprising:
The human body image to be identified is subjected to human skeleton feature extraction processing, obtains human skeleton image to be identified;
The human skeleton image to be identified is input to convolutional neural networks to handle, obtains characteristic spectrum;
The characteristic spectrum is input to the length, and Memory Neural Networks are handled in short-term, obtain long Memory Neural Networks in short-term As a result;
By the length, Memory Neural Networks result is input to Softmax classifier and handles in short-term, obtains multiple generic probability;
Using all generic probability as the recognition result.
6. a kind of human body attitude identifying system characterized by comprising
Training data obtains module, carries out human skeleton feature extraction processing for the human body image to preset quantity, is instructed Practice collection data;
Neural metwork training module obtains gesture recognition model for carrying out neural metwork training using the training set data;
Neural network test module, for being carried out at testing and debugging according to the test set data of acquisition to the gesture recognition model Reason, obtains optimum attitude identification model;
Identification module obtains identification knot for identifying according to the optimum attitude identification model to human body image to be identified Fruit.
7. human body attitude identifying system according to claim 6, which is characterized in that the training data obtains module, packet It includes:
Filter unit carries out median filtering for the human body image to the preset quantity, obtains multiple filtering human body images;
Skeleton data acquiring unit obtains multiple human bodies for carrying out the extraction of human skeleton data to all filtering human body images Skeleton pose image;
Processing is obtained picture number for all human body skeleton pose images to be normalized by normalized unit According to as the training set data.
8. human body attitude identifying system according to claim 6, which is characterized in that the neural metwork training module, packet It includes:
Network struction unit, for the training set data to be input to convolutional neural networks and long Memory Neural Networks in short-term In, it is constructed to obtain deep neural network model;
Loss function computing unit, for calculating loss function to the deep neural network model according to propagated forward algorithm, Obtain loss function value;
Judging unit is restrained, for judging whether the deep neural network model is convergence shape according to the loss function value State;
Identification model acquiring unit is used for when the deep neural network model is convergence state, by the depth nerve net Network model is as the gesture recognition model;
Model parameter adjusts unit, for being calculated according to backpropagation when the deep neural network model is not convergence state Method adjusts the parameter of the deep neural network model, until the deep neural network model is convergence state, by the depth Neural network model is spent as the gesture recognition model.
9. a kind of server characterized by comprising
Memory, for storing computer program;
Processor realizes that human body attitude described in any one of claim 1 to 5 such as identifies when for executing the computer program The step of method.
10. a kind of computer readable storage medium, which is characterized in that be stored with computer on the computer readable storage medium Program realizes such as human body attitude identification side described in any one of claim 1 to 5 when the computer program is executed by processor The step of method.
CN201811268507.XA 2018-10-29 2018-10-29 A kind of human posture recognition method and relevant apparatus Pending CN109376663A (en)

Priority Applications (1)

Application Number Priority Date Filing Date Title
CN201811268507.XA CN109376663A (en) 2018-10-29 2018-10-29 A kind of human posture recognition method and relevant apparatus

Applications Claiming Priority (1)

Application Number Priority Date Filing Date Title
CN201811268507.XA CN109376663A (en) 2018-10-29 2018-10-29 A kind of human posture recognition method and relevant apparatus

Publications (1)

Publication Number Publication Date
CN109376663A true CN109376663A (en) 2019-02-22

Family

ID=65390435

Family Applications (1)

Application Number Title Priority Date Filing Date
CN201811268507.XA Pending CN109376663A (en) 2018-10-29 2018-10-29 A kind of human posture recognition method and relevant apparatus

Country Status (1)

Country Link
CN (1) CN109376663A (en)

Cited By (12)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN110135304A (en) * 2019-04-30 2019-08-16 北京地平线机器人技术研发有限公司 Human body method for recognizing position and attitude and device
CN110310350A (en) * 2019-06-24 2019-10-08 清华大学 Action prediction generation method and device based on animation
CN110490109A (en) * 2019-08-09 2019-11-22 郑州大学 A kind of online human body recovery action identification method based on monocular vision
CN110728209A (en) * 2019-09-24 2020-01-24 腾讯科技(深圳)有限公司 Gesture recognition method and device, electronic equipment and storage medium
CN110826401A (en) * 2019-09-26 2020-02-21 广州视觉风科技有限公司 Human body limb language identification method and system
CN111723729A (en) * 2020-06-18 2020-09-29 成都颜禾曦科技有限公司 Intelligent identification method for dog posture and behavior of surveillance video based on knowledge graph
CN112364922A (en) * 2020-11-13 2021-02-12 苏州浪潮智能科技有限公司 Method and system for predicting human skeleton motion in machine room environment
CN112733722A (en) * 2021-01-11 2021-04-30 深圳力维智联技术有限公司 Gesture recognition method, device and system and computer readable storage medium
CN113111939A (en) * 2021-04-12 2021-07-13 中国人民解放军海军航空大学航空作战勤务学院 Aircraft flight action identification method and device
CN113688790A (en) * 2021-09-22 2021-11-23 武汉工程大学 Human body action early warning method and system based on image recognition
CN113705542A (en) * 2021-10-27 2021-11-26 北京理工大学 Pedestrian behavior state identification method and system
WO2024036825A1 (en) * 2022-08-16 2024-02-22 深圳先进技术研究院 Attitude processing method, apparatus and system, and storage medium

Citations (6)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN104615983A (en) * 2015-01-28 2015-05-13 中国科学院自动化研究所 Behavior identification method based on recurrent neural network and human skeleton movement sequences
CN105912991A (en) * 2016-04-05 2016-08-31 湖南大学 Behavior identification method based on 3D point cloud and key bone nodes
CN105930767A (en) * 2016-04-06 2016-09-07 南京华捷艾米软件科技有限公司 Human body skeleton-based action recognition method
CN106897670A (en) * 2017-01-19 2017-06-27 南京邮电大学 A kind of express delivery violence sorting recognition methods based on computer vision
CN107492121A (en) * 2017-07-03 2017-12-19 广州新节奏智能科技股份有限公司 A kind of two-dimension human body bone independent positioning method of monocular depth video
CN108537145A (en) * 2018-03-21 2018-09-14 东北电力大学 Human bodys' response method based on space-time skeleton character and depth belief network

Patent Citations (6)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN104615983A (en) * 2015-01-28 2015-05-13 中国科学院自动化研究所 Behavior identification method based on recurrent neural network and human skeleton movement sequences
CN105912991A (en) * 2016-04-05 2016-08-31 湖南大学 Behavior identification method based on 3D point cloud and key bone nodes
CN105930767A (en) * 2016-04-06 2016-09-07 南京华捷艾米软件科技有限公司 Human body skeleton-based action recognition method
CN106897670A (en) * 2017-01-19 2017-06-27 南京邮电大学 A kind of express delivery violence sorting recognition methods based on computer vision
CN107492121A (en) * 2017-07-03 2017-12-19 广州新节奏智能科技股份有限公司 A kind of two-dimension human body bone independent positioning method of monocular depth video
CN108537145A (en) * 2018-03-21 2018-09-14 东北电力大学 Human bodys' response method based on space-time skeleton character and depth belief network

Non-Patent Citations (8)

* Cited by examiner, † Cited by third party
Title
JEFF DONAHUE等: "Long-term Recurrent Convolutional Networks for Visual Recognition and Description", 《ARXIV:1411.4389V4 [CS.CV] 》 *
WEI SHEN等: "DeepSkeleton: Learning Multi-Task Scale-Associated Deep Side Outputs for Object Skeleton Extraction in Natural Images", 《IEEE TRANSACTIONS ON IMAGE PROCESSING》 *
刘鹏等: "基于深度学习的园林智能浇灌系统", 《湖北汽车工业学院学报》 *
吕耀坤: "基于卷积神经网络的实景交通标志识别", 《物联网技术》 *
秦阳等: "3D CNNs与LSTMs在行为识别中的组合及其应用", 《测控技术》 *
董军: "《"心迹"的计算——隐性知识的人工智能途径》", 31 December 2016 *
郑攀海等: "基于TensorFlow的卷积神经网络的研究与实现", 《电子技术与工程》 *
黄丁: "基于骨架特征的人体姿态识别研究", 《万方数据知识服务平台》 *

Cited By (17)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN110135304A (en) * 2019-04-30 2019-08-16 北京地平线机器人技术研发有限公司 Human body method for recognizing position and attitude and device
CN110310350A (en) * 2019-06-24 2019-10-08 清华大学 Action prediction generation method and device based on animation
CN110490109B (en) * 2019-08-09 2022-03-25 郑州大学 Monocular vision-based online human body rehabilitation action recognition method
CN110490109A (en) * 2019-08-09 2019-11-22 郑州大学 A kind of online human body recovery action identification method based on monocular vision
CN110728209A (en) * 2019-09-24 2020-01-24 腾讯科技(深圳)有限公司 Gesture recognition method and device, electronic equipment and storage medium
CN110728209B (en) * 2019-09-24 2023-08-08 腾讯科技(深圳)有限公司 Gesture recognition method and device, electronic equipment and storage medium
CN110826401A (en) * 2019-09-26 2020-02-21 广州视觉风科技有限公司 Human body limb language identification method and system
CN110826401B (en) * 2019-09-26 2023-12-26 广州视觉风科技有限公司 Human body limb language identification method and system
CN111723729B (en) * 2020-06-18 2022-08-05 四川千图禾科技有限公司 Intelligent identification method for dog posture and behavior of surveillance video based on knowledge graph
CN111723729A (en) * 2020-06-18 2020-09-29 成都颜禾曦科技有限公司 Intelligent identification method for dog posture and behavior of surveillance video based on knowledge graph
CN112364922B (en) * 2020-11-13 2023-01-10 苏州浪潮智能科技有限公司 Method and system for predicting human skeleton motion in machine room environment
CN112364922A (en) * 2020-11-13 2021-02-12 苏州浪潮智能科技有限公司 Method and system for predicting human skeleton motion in machine room environment
CN112733722A (en) * 2021-01-11 2021-04-30 深圳力维智联技术有限公司 Gesture recognition method, device and system and computer readable storage medium
CN113111939A (en) * 2021-04-12 2021-07-13 中国人民解放军海军航空大学航空作战勤务学院 Aircraft flight action identification method and device
CN113688790A (en) * 2021-09-22 2021-11-23 武汉工程大学 Human body action early warning method and system based on image recognition
CN113705542A (en) * 2021-10-27 2021-11-26 北京理工大学 Pedestrian behavior state identification method and system
WO2024036825A1 (en) * 2022-08-16 2024-02-22 深圳先进技术研究院 Attitude processing method, apparatus and system, and storage medium

Similar Documents

Publication Publication Date Title
CN109376663A (en) A kind of human posture recognition method and relevant apparatus
CN109948647B (en) Electrocardiogram classification method and system based on depth residual error network
CN106778854B (en) Behavior identification method based on trajectory and convolutional neural network feature extraction
CN106709461B (en) Activity recognition method and device based on video
Liu et al. Learning spatio-temporal representations for action recognition: A genetic programming approach
US20220004744A1 (en) Human posture detection method and apparatus, device and storage medium
CN109215013A (en) Automatic stone age prediction technique, system, computer equipment and storage medium
CN109919928A (en) Detection method, device and the storage medium of medical image
CN109785928A (en) Diagnosis and treatment proposal recommending method, device and storage medium
CN116051574A (en) Semi-supervised segmentation model construction and image analysis method, device and system
CN110084313A (en) A method of generating object detection model
CN110580445A (en) Face key point detection method based on GIoU and weighted NMS improvement
CN111401106B (en) Behavior identification method, device and equipment
CN110543895B (en) Image classification method based on VGGNet and ResNet
CN111832516A (en) Video behavior identification method based on unsupervised video representation learning
CN109583331B (en) Deep learning-based accurate positioning method for positions of wrist vein and mouth of person
CN112836602B (en) Behavior recognition method, device, equipment and medium based on space-time feature fusion
CN109284761A (en) A kind of image characteristic extracting method, device, equipment and readable storage medium storing program for executing
CN113569805A (en) Action recognition method and device, electronic equipment and storage medium
CN109766873A (en) pedestrian re-identification method based on hybrid deformable convolution
CN111428761A (en) Image feature visualization method, image feature visualization device and electronic equipment
CN112949654A (en) Image detection method and related device and equipment
CN111429414B (en) Artificial intelligence-based focus image sample determination method and related device
WO2021179198A1 (en) Image feature visualization method, image feature visualization apparatus, and electronic device
CN109284700A (en) The method of multiple Face datections, storage medium, equipment and system in image

Legal Events

Date Code Title Description
PB01 Publication
PB01 Publication
SE01 Entry into force of request for substantive examination
SE01 Entry into force of request for substantive examination
RJ01 Rejection of invention patent application after publication
RJ01 Rejection of invention patent application after publication

Application publication date: 20190222