CN101989326B - Human posture recognition method and device - Google Patents
- Publication number: CN101989326B
- Application number: CN200910161452.7A
- Authority
- CN
- China
- Legal status (The legal status is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the status listed.)
- Expired - Fee Related
Classifications
-
- G—PHYSICS
- G06—COMPUTING; CALCULATING OR COUNTING
- G06T—IMAGE DATA PROCESSING OR GENERATION, IN GENERAL
- G06T7/00—Image analysis
- G06T7/30—Determination of transform parameters for the alignment of images, i.e. image registration
- G06T7/35—Determination of transform parameters for the alignment of images, i.e. image registration using statistical methods
-
- G—PHYSICS
- G06—COMPUTING; CALCULATING OR COUNTING
- G06V—IMAGE OR VIDEO RECOGNITION OR UNDERSTANDING
- G06V40/00—Recognition of biometric, human-related or animal-related patterns in image or video data
- G06V40/10—Human or animal bodies, e.g. vehicle occupants or pedestrians; Body parts, e.g. hands
- G06V40/103—Static body considered as a whole, e.g. static pedestrian or occupant recognition
-
- G—PHYSICS
- G06—COMPUTING; CALCULATING OR COUNTING
- G06T—IMAGE DATA PROCESSING OR GENERATION, IN GENERAL
- G06T7/00—Image analysis
- G06T7/20—Analysis of motion
- G06T7/262—Analysis of motion using transform domain methods, e.g. Fourier domain methods
-
- G—PHYSICS
- G06—COMPUTING; CALCULATING OR COUNTING
- G06T—IMAGE DATA PROCESSING OR GENERATION, IN GENERAL
- G06T7/00—Image analysis
- G06T7/30—Determination of transform parameters for the alignment of images, i.e. image registration
- G06T7/37—Determination of transform parameters for the alignment of images, i.e. image registration using transform domain methods
-
- G—PHYSICS
- G06—COMPUTING; CALCULATING OR COUNTING
- G06T—IMAGE DATA PROCESSING OR GENERATION, IN GENERAL
- G06T7/00—Image analysis
- G06T7/70—Determining position or orientation of objects or cameras
- G06T7/73—Determining position or orientation of objects or cameras using feature-based methods
- G06T7/74—Determining position or orientation of objects or cameras using feature-based methods involving reference images or patches
-
- G—PHYSICS
- G06—COMPUTING; CALCULATING OR COUNTING
- G06V—IMAGE OR VIDEO RECOGNITION OR UNDERSTANDING
- G06V10/00—Arrangements for image or video recognition or understanding
- G06V10/40—Extraction of image or video features
Abstract
The invention provides a human posture recognition method and device. The device comprises an input module, a preprocessing module, a training module, a feature extraction module, a template database building module, a search module and an output module. The input module captures a human posture and forms an input image. The preprocessing module normalizes the input image to a fixed size and generates shape-independent samples. The training module reduces the dimensionality of the sample data with a statistical learning method in the training phase to obtain a projection transformation matrix and build a nearest neighbor classifier. The feature extraction module extracts discriminative posture features from the sample data according to the projection transformation matrix, in both the training phase and the posture recognition phase. The template database building module builds a posture template database from the discriminative posture features extracted in the training phase. The search module compares the discriminative posture features extracted in the posture recognition phase with the posture templates in the database using the nearest neighbor classifier, to perform posture matching. The output module outputs the best matching posture and repositions a virtual human model accordingly.
Description
Technical field
The present invention relates generally to computer vision and, more particularly, to real-time human posture recognition and motion analysis.
Background art
Human motion analysis and human posture recognition are important technologies that use meaningful human postures to support human-computer interaction, virtual three-dimensional (3D) interactive games, 3D gesture recognition, and so on. In recent years, research on human motion capture has received increasing attention because of its promising academic and commercial value.
Various schemes currently exist for human motion analysis. Some require specific markers to be attached to the subject or require dedicated motion capture equipment. In general environments (such as home entertainment or 3D interactive games), these requirements are inconvenient for users and limit the applicability of such schemes. For practical applications, considerable effort has been made to reduce the markers needed for human motion analysis. Existing methods fall mainly into two classes: methods based on human body analysis and methods based on samples. From another perspective, existing methods can also be divided into color-image-based methods and methods assisted by 3D laser-scanned human models.
As is well known, a color image provides only two-dimensional (2D) information, such as color, texture and shape. Posture ambiguity in 2D information is therefore unavoidable. For example, if some parts of the body are self-occluded, color-image-based methods cannot correctly recognize the human posture because of this ambiguity. Even with more advanced pose estimation methods, the ambiguity of color information slows down processing and leads to inaccurate pose inference. Moreover, color information is unstable (not robust) under changes in seasonal clothing and ambient lighting, so in complex environments human posture recognition methods based on color information cannot meet the requirements. For this reason, some researchers and engineers use laser-scanned 3D models to obtain more accurate results. However, because of the high cost and large volume of the acquisition equipment, laser scanners are impractical in real environments (such as home entertainment or 3D interactive games). To solve this problem, a method and apparatus for real-time human posture recognition in cluttered environments is needed.
Summary of the invention
The present invention still focuses on marker-free human posture recognition and human motion analysis, but solves the problems of the prior art in a new way. First, the invention combines a TOF depth camera (which simultaneously provides a depth image and an intensity image) with a color camera (which provides a color image). Second, the invention provides a method and apparatus for recognizing human postures in complex environments that can effectively exploit both depth information and color information.
According to one aspect of the invention, a human posture recognition device is provided, comprising: an input module, comprising a depth camera and a color camera, for capturing a human posture simultaneously with both cameras to form an input image; a preprocessing module, for preprocessing the input image into a suitable form, normalizing the image to a fixed size, and producing shape-independent posture samples to form the sample data; a training module, for reducing the dimensionality of the sample data with a statistical learning method in the training phase to obtain a projection transformation matrix from the original image space to the feature space, and for building a nearest neighbor classifier; a feature extraction module, for extracting discriminative posture features from the sample data according to the projection transformation matrix, in the training phase and the posture recognition phase respectively; a template database building module, for building a posture template database from the discriminative posture features extracted by the feature extraction module in the training phase; a search module, for comparing the discriminative posture features extracted by the feature extraction module in the posture recognition phase with the posture templates in the posture template database using the nearest neighbor classifier, to perform posture matching; and an output module, for outputting the best matching posture and repositioning a virtual human model based on it.
According to another aspect of the invention, a human posture recognition method is provided, comprising: (a) capturing a human posture simultaneously with a depth camera and a color camera to form an input image; (b) preprocessing the input image into a suitable form, normalizing the image to a fixed size, and producing shape-independent posture samples to form the sample data; (c) reducing the dimensionality of the sample data with a statistical learning method in the training phase to obtain a projection transformation matrix from the original image space to the feature space, and building a nearest neighbor classifier; (d) extracting discriminative posture features from the sample data according to the projection transformation matrix, in the training phase and the posture recognition phase respectively; (e) building a posture template database from the discriminative posture features extracted in the training phase; (f) comparing the discriminative posture features extracted in the posture recognition phase with the posture templates in the posture template database using the nearest neighbor classifier, to perform posture matching; and (g) outputting the best matching posture and repositioning a virtual human model based on it.
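The training, template-building and matching steps of the method can be sketched end to end. The following is a minimal illustration only, using synthetic data, PCA via SVD as the statistical learning method, and an L1 nearest-neighbor search; none of the array shapes or variable names come from the patent itself.

```python
import numpy as np

# Sketch of the training/matching pipeline with synthetic data:
# dimensionality reduction -> projection matrix W -> template DB -> NN match.
rng = np.random.default_rng(0)

# Training samples: 40 flattened 80x48 posture images (synthetic)
train = rng.normal(size=(40, 80 * 48))
mean = train.mean(axis=0)
centered = train - mean

# PCA projection matrix W (N x M) from the top-M right singular vectors
M = 10
_, _, vt = np.linalg.svd(centered, full_matrices=False)
W = vt[:M].T                      # shape (3840, 10)

# Template DB: project every training sample into feature space
templates = centered @ W          # shape (40, 10)

# Match a query with an L1 nearest-neighbor search
query = train[7]                  # pretend this arrives at recognition time
v0 = (query - mean) @ W
dists = np.abs(templates - v0).sum(axis=1)
best = int(np.argmin(dists))
print(best)                       # the query matches its own template
```

Since the query is one of the training samples, its distance to its own template is zero and the classifier returns index 7.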
Brief description of the drawings
These and/or other aspects and advantages of the invention will become clear and easier to understand from the following description of embodiments, taken in conjunction with the accompanying drawings, in which:
Fig. 1 is a block diagram of a human posture recognition device according to an embodiment of the present invention;
Fig. 2 shows sample images captured by the input module according to an embodiment of the present invention;
Fig. 3 is a flowchart of a human posture recognition method according to an embodiment of the present invention;
Fig. 4 shows the image preprocessing process of the preprocessing module according to an embodiment of the present invention;
Fig. 5 shows an example of locating the shoulder points according to an embodiment of the present invention;
Fig. 6 shows the classifier training process of the training module according to an embodiment of the present invention;
Fig. 7 shows the template database building process of the template database building module according to an embodiment of the present invention;
Fig. 8 shows the feature extraction process of the feature extraction module according to an embodiment of the present invention;
Fig. 9 shows the feature matching and posture output process of the search module and the output module according to an embodiment of the present invention;
Fig. 10 to Fig. 13 show experiment 1 and experiment 2 carried out according to the present invention.
Detailed description of embodiments
Hereinafter, embodiments of the present invention are described in detail with reference to the accompanying drawings.
Fig. 1 is a block diagram of a human posture recognition device according to an embodiment of the present invention. As shown in Fig. 1, the device comprises an input module 101, a preprocessing module 102, a training module 103, a template database (DB) building module 104, a feature extraction module 105, a search module 106 and an output module 107.
The input module 101 comprises two cameras: a depth camera and a color camera. The depth camera may be, for example, a TOF (Time of Flight) depth camera. The TOF depth camera and the color camera capture the human posture simultaneously to form the input image. The preprocessing module 102 preprocesses the input image into a suitable form, normalizes the image to a fixed size, and produces shape-independent samples. The raw data of the normalized samples is high-dimensional. After preprocessing, the training module 103 uses a statistical learning method, such as PCA (principal component analysis) or LLE (locally linear embedding), in the training phase (i.e., the learning phase) to reduce the dimensionality of the sample data, obtaining a projection transformation matrix from the original image space to the feature space (i.e., a feature selection mechanism for feature extraction), and builds a nearest neighbor classifier. To recognize human postures, the template DB building module 104 builds the posture template database offline in advance; in this module, the different human postures are labeled manually. The feature extraction module 105 then extracts discriminative posture features from the sample data according to the projection transformation matrix in the training phase, so that the template DB building module 104 can finally establish the correspondence between the posture features and the associated postures.
In the online posture recognition phase, the feature extraction module 105 only extracts discriminative posture features according to the projection transformation matrix. The search module 106 receives these features and compares them with the posture templates in the posture template database using the nearest neighbor classifier, to perform posture matching. The output module 107 then provides the best matching posture and repositions the virtual human model accordingly, completing the whole posture recognition process.
In the present invention, two cameras capture the same scene simultaneously. One is a TOF depth camera and the other is a color camera. The color camera can be a conventional CCD/CMOS camera and provides a color image. The TOF depth camera provides a depth image and an intensity image: the depth image represents the distance between the captured object and the TOF depth camera, and the intensity image represents the light intensity received by the TOF depth camera.
Fig. 2 shows sample images captured by the input module 101 according to an embodiment of the present invention. As can be seen from Fig. 2, the intensity image provides a clean background image that is well suited for foreground extraction and silhouette segmentation. Intuitively, the clean background intensity image can easily be used to locate the head and torso of the human body. However, when reflections from the glasses a person wears are severe, the intensity image may not be the best choice for locating the eye positions; in that case the color image can be used instead, and various methods exist for locating eyes in color images. In addition, in some cases the color image and the silhouette image are ambiguous for posture analysis, so the depth image can be fully exploited to resolve the ambiguity of the human posture.
After the three types of input image (color image, depth image and intensity image) have been obtained, they need to be preprocessed into a suitable form. Image preprocessing uses all three types of input image.
Fig. 3 is a flowchart of a human posture recognition method according to an embodiment of the present invention.
Referring to Fig. 3, in operation 301 the depth camera and the color camera in the input module 101 capture the human posture simultaneously to form the input image. In operation 302, the preprocessing module 102 preprocesses the input image into a suitable form, normalizes the image to a fixed size, and produces shape-independent samples. In operation 303, the training module 103 reduces the dimensionality of the sample data with a statistical learning method in the training phase, to obtain the projection transformation matrix from the original image space to the feature space, and builds the nearest neighbor classifier. In operation 304, the feature extraction module 105 extracts discriminative posture features from the sample data according to the projection transformation matrix, in the training phase and the posture recognition phase respectively. In operation 305, the template database (DB) building module builds the posture template database from the discriminative posture features extracted in the training phase. In operation 306, the search module 106 compares the discriminative posture features extracted by the feature extraction module 105 in the posture recognition phase with the posture templates in the posture template database using the nearest neighbor classifier, to perform posture matching. In operation 307, the output module 107 outputs the best matching posture and repositions the virtual human model based on it.
Image preprocessing according to the present invention is described below with reference to Fig. 4 and Fig. 5. Fig. 4 shows the image preprocessing process of the preprocessing module 102 according to an embodiment of the present invention.
Referring to Fig. 4, in operation 401 the preprocessing module 102 segments the human region and extracts the silhouette using the intensity image; a threshold segmentation method can be used in this step. In operation 402, the preprocessing module 102 uses the segmented human region as a mask on the color image in order to detect the head and torso. For head and torso detection, the preprocessing module 102 can use a detector trained with the existing AdaBoost algorithm and local features. Because the preprocessing module 102 normalizes the image to a fixed size, some reference points are needed. In operation 403, the preprocessing module 102 selects the eye positions and the shoulder positions as reference points, because for a frontal view of the human body the eye positions are strong reference points in the head region and the shoulder positions are strong reference points in the torso region. To locate the eye positions reliably, the preprocessing module 102 can use an existing trained eye detector, which may likewise be trained with the AdaBoost algorithm and local feature methods. To locate the shoulder positions reliably (comprising the left shoulder point P_LS and the right shoulder point P_RS), the preprocessing module 102 adopts a simple method that exploits the masked depth image shown in Fig. 4: it detects the bending points in the horizontal and vertical projections of the torso region as the shoulder points.
After the eye and shoulder positions have been located, in operation 404 the preprocessing module 102 performs shape normalization. The goal of shape normalization is to produce shape-independent samples. Let P_1 denote the center between the left and right eyes, P_2 the center between the left shoulder point P_LS and the right shoulder point P_RS, D_1 the distance between P_1 and P_2, and D_2 the distance between P_LS and P_RS. D_1 is used as the reference length for the sample height h, and D_2 as the reference length for the sample width w. The preprocessing module 102 crops and normalizes the sample to a size of 80×48 using the following relations: D_2/D_1 = 5:2 (the ratio used to normalize the shape); w = 4×D_2 and h = 6×D_1 (the sample region size). For punching actions, the preprocessing module 102 crops and normalizes the sample to a size of 80×80 and sets w = h = 6×D_1, because the collected images do not contain complex punching actions.
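The normalization rules above can be expressed directly in code. The following is a minimal sketch, assuming the eye and shoulder reference points have already been located; the coordinates and the helper name are illustrative, not from the patent.

```python
# Shape-normalization crop size from reference points (illustrative sketch).
def sample_size(p1, p2, p_ls, p_rs, punch=False):
    """Return (w, h) of the crop region.

    p1: center between the eyes; p2: center between the shoulder points;
    p_ls/p_rs: left/right shoulder points, all as (x, y) tuples.
    """
    d1 = ((p1[0] - p2[0]) ** 2 + (p1[1] - p2[1]) ** 2) ** 0.5
    d2 = ((p_ls[0] - p_rs[0]) ** 2 + (p_ls[1] - p_rs[1]) ** 2) ** 0.5
    if punch:                       # punching actions use a square crop
        return 6 * d1, 6 * d1       # w = h = 6*D1
    return 4 * d2, 6 * d1           # w = 4*D2, h = 6*D1

# With D2/D1 = 5:2 (e.g. D1 = 20, D2 = 50) the crop is 200 x 120, which
# has the same 5:3 aspect ratio as the 80 x 48 normalized sample.
w, h = sample_size((100, 50), (100, 70), (75, 70), (125, 70))
print(w, h)  # 200.0 120.0
```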
Fig. 5 shows an example of locating the shoulder points according to an embodiment of the present invention. Specifically, (a) in Fig. 5 is the silhouette of the human foreground region. (b) in Fig. 5 is the histogram of this image (the silhouette) in the vertical direction: the abscissa represents the horizontal position in the image (i.e., the column coordinate, ranging from 0 to the image width), and the ordinate represents, for a given column coordinate, the accumulated value of all pixel values in that column (i.e., the vertical projection value of that column). (c) in Fig. 5 is the histogram of the image in the horizontal direction: the abscissa represents the vertical position in the image (i.e., the row coordinate, ranging from 0 to the image height), and the ordinate represents, for a given row coordinate, the accumulated value of all pixel values in that row (i.e., the horizontal projection value of that row). (d) in Fig. 5 is the result of locating the shoulder points (region detection).
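The two projection profiles of Fig. 5 are simple row and column sums of the silhouette mask. The toy example below shows how they are computed on a tiny synthetic mask (the shapes and values are illustrative; the actual bending-point detection on the profiles is not shown).

```python
import numpy as np

# Toy silhouette mask: a small "head" above a wider "torso".
mask = np.zeros((8, 10), dtype=np.uint8)
mask[1:3, 4:6] = 1        # head
mask[3:8, 2:8] = 1        # torso

col_proj = mask.sum(axis=0)   # vertical-direction projection (per column)
row_proj = mask.sum(axis=1)   # horizontal-direction projection (per row)

print(col_proj.tolist())  # profile widens where the torso starts
print(row_proj.tolist())
```

The columns where `col_proj` jumps from the narrow head value to the wide torso value correspond to the shoulder region in this toy setup.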
Classifier training according to the present invention is described below with reference to Fig. 6. Fig. 6 shows the classifier training process of the training module 103 according to an embodiment of the present invention.
The training module 103 uses the PCA (principal component analysis) and LLE (locally linear embedding) learning methods to obtain the projection transformation matrix from the original image space to the feature space.
Referring to Fig. 6, in operation 601 the training module 103 creates the training data set. The selection criterion for the training data set is that the training samples (i.e., the posture samples in the training phase) should be diverse and representative, so that the data set covers as many human actions as possible. The training module 103 mainly selects diverse training samples according to the different punching actions, so that the training samples are evenly distributed in the image space. Then, in operation 602, the training module 103 transforms the training sample data into suitable input vectors for learning; that is, it directly unrolls the 2D data into one-dimensional (1D) vectors. In operation 603, the training module 103 applies statistical learning methods such as PCA and LLE for dimensionality reduction, to obtain the projection transformation matrix. Detailed introductions to PCA and LLE are available in the prior art to those skilled in the art, so they are not described in further detail here. Finally, in operation 604, the training module 103 builds an NN (nearest neighbor) classifier with an L1 distance (similarity measure); L1 is defined below.
Template DB building according to the present invention is described below with reference to Fig. 7. Fig. 7 shows the template DB building process of the template DB building module 104 according to an embodiment of the present invention. Template DB building is an essential part of sample-based motion analysis.
Referring to Fig. 7, in operation 701 the template DB building module 104 selects different posture samples. In operation 702, the template DB building module 104 manually labels the posture sample images. Preferably, the template DB building module 104 uses a marker-based motion capture system or suitable computer graphics software to produce the labeled data set. Because of the limitations of the present equipment and setup, the punching-action postures of 8 types are collected in the present invention and the labeling step is omitted. The feature extraction module 105 extracts low-dimensional discriminative features from the samples according to the projection transformation matrix obtained by the training module 103. Then, in operation 703, the template DB building module 104 establishes the correspondence between the discriminative features and the postures (skeletons) based on the extracted features; in the present invention, the correspondence between the discriminative features and the indices of the 8 types of punching action is established. Finally, in operation 704, the template DB building module 104 produces templates comprising the feature vectors and the associated skeleton (or action) indices, based on the established correspondence.
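Operations 703-704 amount to pairing each projected feature vector with its manually assigned action index. The sketch below is purely hypothetical: the 8 action labels mirror the patent's 8 punching actions, but the features are synthetic and the record structure is an assumption.

```python
import numpy as np

# Hypothetical template DB: (feature_vector, action_index) records, so a
# matched feature can be mapped back to a skeleton/action at search time.
rng = np.random.default_rng(1)
NUM_ACTIONS = 8                              # 8 types of punching action
features = rng.normal(size=(24, 10))         # 24 projected samples, M = 10
labels = np.arange(24) % NUM_ACTIONS         # manual labels, 3 per action

template_db = [(features[i], int(labels[i])) for i in range(len(features))]

print(len(template_db), template_db[0][1], template_db[9][1])  # 24 0 1
```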
Online posture recognition according to the present invention is described below with reference to Fig. 8 and Fig. 9. After the classifier has been trained and a suitable template DB has been built, online posture recognition can be performed. As in the training phase, the input image is first preprocessed; the subsequent steps comprise feature extraction, feature matching and posture output.
Fig. 8 shows the feature extraction process of the feature extraction module 105 according to an embodiment of the present invention, and Fig. 9 shows the feature matching and posture output process of the search module 106 and the output module 107 according to an embodiment of the present invention.
The goal of feature extraction is to obtain discriminative features for matching. Referring to Fig. 8, in operation 801 the feature extraction module 105 transforms the depth data of the input image into a suitable image vector, that is, it directly unrolls the 2D data into a 1D vector. Then, in operation 802, the feature extraction module 105 projects the data from the image space to the feature space using the projection transformation matrix obtained in the training phase. In the present invention, the trained PCA and LLE projection transformation matrices can be used.
Let X = {x_1, x_2, ..., x_N} denote the 1D input image data (where N = w×h, w is the sample width and h is the sample height), and let W denote the trained PCA/LLE projection transformation matrix (the dimension of W is N×M, with M << N). Then, in operation 803, the feature extraction module 105 obtains the feature vector V = W^T X; the dimension of V is M.
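The dimension bookkeeping of operation 803 can be checked with a few lines of code. This is a sketch only: the projection matrix here is random, standing in for the trained PCA/LLE matrix, and M = 100 is taken from the experiments described later.

```python
import numpy as np

# Project an unrolled 80x48 sample (N = 48*80 = 3840) to M dimensions.
rng = np.random.default_rng(2)
w_px, h_px = 48, 80
N = w_px * h_px                 # 3840
M = 100                         # M << N

X = rng.normal(size=N)          # 1D input image data
W = rng.normal(size=(N, M))     # stand-in for the trained PCA/LLE matrix

V = W.T @ X                     # feature vector V = W^T X
print(V.shape)                  # (100,)
```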
After feature extraction, the NN (nearest neighbor) classifier retrieves the top-n best matching postures from the template database. That is, the search module 106 compares the discriminative posture features extracted in the posture recognition phase with the posture templates in the posture template database 104 using the nearest neighbor classifier, to perform posture matching. Specifically, referring to Fig. 9, in operation 901 the search module 106 uses the nearest neighbor classifier to compute the distances between the current feature vector and the feature vectors in the template database. Let V_0 denote the current (input) feature vector, V_i (i = 1, ..., N) the feature vectors in the template DB, and S_i (i = 1, ..., N) the associated skeleton (posture) indices. Using the distance measure L_1 = |V_0 - V_i| (i = 1, ..., N), the input feature vector V_0 is matched against all N templates V_i in the template DB, yielding a series of L_1 similarity values. In operation 902, the search module 106 obtains the indices of the top-n best matches in the template database based on these L_1 distances. In operation 903, the output module 107 retrieves the best matching posture (skeleton) from the template database using the best-match index. Then, in operation 904, the output module 107 repositions the virtual human model based on the best matching posture (skeleton).
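Operations 901-903 reduce to an L1 distance computation followed by a sort. The following minimal sketch uses synthetic template vectors and skeleton indices; only the L1/top-n logic reflects the description above.

```python
import numpy as np

# Top-n L1 nearest-neighbor search over template feature vectors.
rng = np.random.default_rng(3)
templates = rng.normal(size=(20, 5))        # V_i, i = 1..N (N = 20, M = 5)
skeletons = np.arange(20)                   # S_i, associated posture indices

v0 = templates[4] + 0.01                    # query close to template 4

dists = np.abs(templates - v0).sum(axis=1)  # L1 = |V0 - V_i|
top_n = 3
best = np.argsort(dists)[:top_n]            # indices of the top-n matches

print(int(skeletons[best[0]]))              # 4, the nearest template
```

With n = 1 this degenerates to returning the single closest posture, as in the taijiquan example below.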
For example, suppose the posture template database 104 is built in the offline learning phase and comprises a library of taijiquan actions, with images of 500 actions. When the database is built, the feature vector of each human action is extracted and the position of each joint point is labeled (so that the output module 107 can conveniently drive the display of a virtual character). In the actual online action recognition phase, the user performs an action; the image of the action is captured and preprocessed by the preprocessing module 102; the feature extraction module 105 extracts the discriminative posture features to obtain the feature vector of the action; the search module 106 compares this feature vector with the 500 feature vectors in the posture template database 104 using the nearest neighbor classifier, computes the similarities, and finds the n most similar actions — this is the process of top-n nearest neighbor classification; if n = 1, the single closest action is found. The output module 107 then outputs the joint point information corresponding to this action, to drive or display the virtual character.
Experiment 1 and experiment 2, carried out according to the present invention, are described below with reference to Fig. 10 to Fig. 13.
With reference to Fig. 10, experiment 1 is directed to a specific person: the posture data of the tested persons is included in the training data. In the training stage, 4 persons are involved, with punch actions of 8 postures and 1079 samples (each sample of size 80 × 80), and the human model is repositioned according to 100 dimensions. In the test stage, the same 4 persons as in the training stage are involved, with punch actions of 8 postures, and 1079 samples are tested.
Fig. 11 shows the results of experiment 1. Part (a) of Fig. 11 shows the search results obtained with the LLE method, and part (b) of Fig. 11 shows the search results obtained with the PCA method. In both (a) and (b) of Fig. 11, the image in the upper-left corner is the input query, and the other images are the returned results.
With reference to Fig. 12, experiment 2 is directed to an unspecific person: the posture data of the tested persons is not included in the training data. In the training stage, 4 persons are involved, with punch actions of 8 postures and 1079 samples, and the human model is repositioned according to 100 dimensions. In the test stage, 2 persons different from those in the training stage are involved, with punch actions of 8 postures, and 494 samples are tested.
Fig. 13 shows the results of experiment 2. Part (a) of Fig. 13 shows the search results obtained with the LLE method, and part (b) of Fig. 13 shows the search results obtained with the PCA method. In both (a) and (b) of Fig. 13, the image in the upper-left corner is the input query, and the other images are the returned results.
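The projection used in both experiments (reducing 80 × 80 samples to 100 dimensions) can be sketched with plain NumPy as below; this shows the PCA variant, while the LLE variant would replace the linear projection with a neighborhood-preserving nonlinear embedding. The function name `pca_projection` and the toy data are assumptions for illustration:

```python
import numpy as np

def pca_projection(samples, k):
    """Learn a k-dimensional PCA projection matrix from flattened pose
    samples; return (projection_matrix, projected_features)."""
    mean = samples.mean(axis=0)
    centered = samples - mean
    # SVD of the centered data: rows of vt are the principal directions.
    _, _, vt = np.linalg.svd(centered, full_matrices=False)
    w = vt[:k].T                 # maps image space -> feature space
    return w, centered @ w

rng = np.random.default_rng(1)
samples = rng.random((150, 400))  # toy stand-in for flattened pose images
w, features = pca_projection(samples, 100)
# w.shape -> (400, 100); features.shape -> (150, 100)
```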
Therefore, compared with traditional methods based on color images, the present invention can resolve ambiguities inside the silhouette owing to its use of depth data. The present invention utilizes both depth information and color information and provides a shape normalization method that achieves shape-independent posture recognition. In addition, the present invention adopts a statistical learning method and a fast search method, making the structure of the human posture recognition apparatus simple and more effective.
Although the present invention has been described and shown in detail with reference to its exemplary embodiments, it will be understood by those skilled in the art that various changes in form and detail may be made therein without departing from the spirit and scope of the present invention as defined by the claims.
Claims (14)
1. A human posture recognition apparatus, comprising:
an input module, comprising a depth camera and a color camera, for simultaneously capturing a human posture to form an input image;
a preprocessing module, for preprocessing the input image into a suitable form, normalizing the image to a fixed size, and producing shape-independent posture samples to form sampled data;
a training module, for reducing the dimensionality of the sampled data in a training stage using a statistical learning method, so as to obtain a projection transformation matrix from the original image space to a feature space, and for building a nearest neighbor classifier;
a feature extraction module, for extracting distinguishing posture features from the sampled data according to said projection transformation matrix in the training stage and in a human posture recognition stage, respectively;
a template database building module, for building a posture template database according to the distinguishing posture features extracted by the feature extraction module in the training stage;
a search module, for comparing, by the nearest neighbor classifier, the distinguishing posture features extracted by the feature extraction module in the human posture recognition stage with the posture templates in the posture template database, so as to perform human posture matching; and
an output module, for outputting the best-matching posture and repositioning a virtual human model based on the best-matching posture,
wherein the training module creates a training data set in which the posture samples are uniformly distributed in image space, transforms the sampled data into input vectors, and reduces the dimensionality of the sampled data with the statistical learning method to obtain said projection transformation matrix,
wherein the feature extraction module transforms the depth data of the input image into a one-dimensional data vector and uses the projection transformation matrix obtained in the training stage to project the data from image space to feature space, so as to obtain a feature vector.
2. The human posture recognition apparatus of claim 1, wherein the depth camera forms a depth image and an intensity image of the human posture, and the color camera forms a color image of the human posture.
3. The human posture recognition apparatus of claim 2, wherein the preprocessing module uses the intensity image to segment the human posture and extract the silhouette, uses the head and trunk of the segmented human region, and selects the eye positions and the shoulder positions as reference points for shape normalization, thereby producing shape-independent posture samples.
4. The human posture recognition apparatus of claim 3, wherein said statistical learning method comprises a principal component analysis method or a locally linear embedding method.
5. The human posture recognition apparatus of claim 4, wherein the template database building module selects different posture samples and manually marks the posture sample images; the feature extraction module extracts low-dimensional distinguishing features from the posture samples according to said projection transformation matrix; and the template database building module establishes correspondences between said distinguishing features and postures based on the extracted distinguishing features, and produces templates comprising the feature vectors and the associated posture indices based on the established correspondences, so as to build the posture template database.
6. The human posture recognition apparatus of claim 5, wherein the search module calculates, by the nearest neighbor classifier, the distances between the current feature vector and the feature vectors in the posture template database, and obtains the index of the best match in the posture template database based on said distances.
7. The human posture recognition apparatus of claim 6, wherein the output module obtains the best-matching posture from the posture template database according to the index of the best match, and repositions the virtual human model based on the best-matching posture.
8. A human posture recognition method, comprising the following steps:
(a) simultaneously capturing a human posture with a depth camera and a color camera to form an input image;
(b) preprocessing the input image into a suitable form, normalizing the image to a fixed size, and producing shape-independent posture samples to form sampled data;
(c) reducing the dimensionality of the sampled data in a training stage using a statistical learning method, so as to obtain a projection transformation matrix from the original image space to a feature space, and building a nearest neighbor classifier;
(d) extracting distinguishing posture features from the sampled data according to said projection transformation matrix in the training stage and in a human posture recognition stage, respectively;
(e) building a posture template database according to the distinguishing posture features extracted in the training stage;
(f) comparing, by the nearest neighbor classifier, the distinguishing posture features extracted in the human posture recognition stage with the posture templates in the posture template database, so as to perform human posture matching; and
(g) outputting the best-matching posture and repositioning a virtual human model based on the best-matching posture,
wherein step (c) comprises: creating a training data set in which the posture samples are uniformly distributed in image space; transforming the sampled data into input vectors; and reducing the dimensionality of the sampled data with the statistical learning method to obtain said projection transformation matrix,
wherein step (d) comprises: transforming the depth data of the input image into a one-dimensional data vector; and using the projection transformation matrix obtained in the training stage to project the data from image space to feature space, so as to obtain a feature vector.
9. The human posture recognition method of claim 8, wherein the depth camera forms a depth image and an intensity image of the human posture, and the color camera forms a color image of the human posture.
10. The human posture recognition method of claim 9, wherein step (b) comprises:
segmenting the human posture using the intensity image and extracting the silhouette;
using the head and trunk of the segmented human region; and
selecting the eye positions and the shoulder positions as reference points for shape normalization, thereby producing shape-independent posture samples.
11. The human posture recognition method of claim 10, wherein said statistical learning method comprises a principal component analysis method or a locally linear embedding method.
12. The human posture recognition method of claim 11, wherein step (e) comprises:
selecting different posture samples and manually marking the posture sample images;
establishing correspondences between said distinguishing features and postures based on the distinguishing features extracted in the training stage; and
producing templates comprising the feature vectors and the associated posture indices based on the established correspondences, so as to build the posture template database.
13. The human posture recognition method of claim 12, wherein step (f) comprises:
calculating, by the nearest neighbor classifier, the distances between the current feature vector and the feature vectors in the posture template database; and
obtaining the index of the best match in the posture template database based on said distances.
14. The human posture recognition method of claim 13, wherein step (g) comprises:
obtaining the best-matching posture from the posture template database according to the index of the best match; and
repositioning the virtual human model based on the best-matching posture.
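Steps (c) through (g) of the claimed method can be sketched end to end as follows. This is an illustrative toy, not the patented implementation: PCA stands in for the statistical learning method, and the class and variable names are hypothetical:

```python
import numpy as np

class PoseRecognizer:
    """Minimal sketch of steps (c)-(g): learn a projection in the training
    stage, build a template database, then match a query pose and return
    the best-matching skeleton label."""

    def __init__(self, k=100):
        self.k = k

    def train(self, samples, skeletons):
        # (c) statistical learning: PCA projection image space -> feature space
        self.mean = samples.mean(axis=0)
        _, _, vt = np.linalg.svd(samples - self.mean, full_matrices=False)
        self.w = vt[:self.k].T
        # (d)/(e) extract features and store them as the template database
        self.templates = (samples - self.mean) @ self.w
        self.skeletons = skeletons

    def extract(self, image):
        # (d) flatten the depth image to a 1-D vector and project it
        return (image.reshape(-1) - self.mean) @ self.w

    def recognize(self, image):
        # (f)/(g) L1 nearest-neighbor match; return the best skeleton label
        q = self.extract(image)
        dists = np.abs(self.templates - q).sum(axis=1)
        return self.skeletons[int(np.argmin(dists))]

rng = np.random.default_rng(2)
db = rng.random((50, 80 * 80))            # toy flattened depth samples
rec = PoseRecognizer(k=20)
rec.train(db, [f"pose_{i}" for i in range(50)])
result = rec.recognize(db[7].reshape(80, 80))  # -> "pose_7"
```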
Priority Applications (3)
Application Number | Priority Date | Filing Date | Title |
---|---|---|---|
CN200910161452.7A CN101989326B (en) | 2009-07-31 | 2009-07-31 | Human posture recognition method and device |
KR1020100036589A KR20110013200A (en) | 2009-07-31 | 2010-04-20 | Identifying method of human attitude and apparatus of the same |
US12/805,457 US20110025834A1 (en) | 2009-07-31 | 2010-07-30 | Method and apparatus of identifying human body posture |
Applications Claiming Priority (1)
Application Number | Priority Date | Filing Date | Title |
---|---|---|---|
CN200910161452.7A CN101989326B (en) | 2009-07-31 | 2009-07-31 | Human posture recognition method and device |
Publications (2)
Publication Number | Publication Date |
---|---|
CN101989326A CN101989326A (en) | 2011-03-23 |
CN101989326B true CN101989326B (en) | 2015-04-01 |
Family
ID=43745858
Family Applications (1)
Application Number | Title | Priority Date | Filing Date |
---|---|---|---|
CN200910161452.7A Expired - Fee Related CN101989326B (en) | 2009-07-31 | 2009-07-31 | Human posture recognition method and device |
Country Status (2)
Country | Link |
---|---|
KR (1) | KR20110013200A (en) |
CN (1) | CN101989326B (en) |
Families Citing this family (75)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
US8908995B2 (en) | 2009-01-12 | 2014-12-09 | Intermec Ip Corp. | Semi-automatic dimensioning with imager on a portable device |
CN102184008A (en) * | 2011-05-03 | 2011-09-14 | 北京天盛世纪科技发展有限公司 | Interactive projection system and method |
US9208571B2 (en) * | 2011-06-06 | 2015-12-08 | Microsoft Technology Licensing, Llc | Object digitization |
KR101815975B1 (en) * | 2011-07-27 | 2018-01-09 | 삼성전자주식회사 | Apparatus and Method for Detecting Object Pose |
CN102436301B (en) * | 2011-08-20 | 2015-04-15 | Tcl集团股份有限公司 | Human-machine interaction method and system based on reference region and time domain information |
CN102324041B (en) * | 2011-09-09 | 2014-12-03 | 深圳泰山在线科技有限公司 | Pixel classification method, joint body gesture recognition method and mouse instruction generating method |
CN102509074B (en) * | 2011-10-18 | 2014-01-29 | Tcl集团股份有限公司 | Target identification method and device |
KR101919831B1 (en) | 2012-01-11 | 2018-11-19 | 삼성전자주식회사 | Object Recognition Apparatus, Classification Tree Learning Apparatus and Method thereof |
KR101908284B1 (en) | 2012-01-13 | 2018-10-16 | 삼성전자주식회사 | Apparatus and method for analysising body parts association |
US9779546B2 (en) * | 2012-05-04 | 2017-10-03 | Intermec Ip Corp. | Volume dimensioning systems and methods |
US9007368B2 (en) | 2012-05-07 | 2015-04-14 | Intermec Ip Corp. | Dimensioning system calibration systems and methods |
CN103390174A (en) * | 2012-05-07 | 2013-11-13 | 深圳泰山在线科技有限公司 | Physical education assisting system and method based on human body posture recognition |
CN103390150B (en) * | 2012-05-08 | 2019-01-08 | 北京三星通信技术研究有限公司 | human body part detection method and device |
US10007858B2 (en) | 2012-05-15 | 2018-06-26 | Honeywell International Inc. | Terminals and methods for dimensioning objects |
US10321127B2 (en) | 2012-08-20 | 2019-06-11 | Intermec Ip Corp. | Volume dimensioning system calibration systems and methods |
US9939259B2 (en) | 2012-10-04 | 2018-04-10 | Hand Held Products, Inc. | Measuring object dimensions using mobile computer |
US9841311B2 (en) | 2012-10-16 | 2017-12-12 | Hand Held Products, Inc. | Dimensioning system |
US9080856B2 (en) | 2013-03-13 | 2015-07-14 | Intermec Ip Corp. | Systems and methods for enhancing dimensioning, for example volume dimensioning |
CN103310193B (en) * | 2013-06-06 | 2016-05-25 | 温州聚创电气科技有限公司 | A kind of method that records sportsman's important technology action moment in gymnastics video |
US10228452B2 (en) | 2013-06-07 | 2019-03-12 | Hand Held Products, Inc. | Method of error correction for 3D imaging device |
CN103366160A (en) * | 2013-06-28 | 2013-10-23 | 西安交通大学 | Objectionable image distinguishing method integrating skin color, face and sensitive position detection |
US9239950B2 (en) | 2013-07-01 | 2016-01-19 | Hand Held Products, Inc. | Dimensioning system |
US9464885B2 (en) | 2013-08-30 | 2016-10-11 | Hand Held Products, Inc. | System and method for package dimensioning |
CN104573612B (en) * | 2013-10-16 | 2019-10-22 | 北京三星通信技术研究有限公司 | The device and method of the posture for the multiple human objects being overlapped in estimating depth image |
CN104463089A (en) * | 2013-11-25 | 2015-03-25 | 安徽寰智信息科技股份有限公司 | Human body posture recognizing device |
CN104748750B (en) * | 2013-12-28 | 2015-12-02 | 华中科技大学 | A kind of model constrained under the Attitude estimation of Three dimensional Targets in-orbit method and system |
CN103778436B (en) * | 2014-01-20 | 2017-04-05 | 电子科技大学 | A kind of pedestrian's attitude detecting method based on image procossing |
CN104125396B (en) * | 2014-06-24 | 2018-06-08 | 小米科技有限责任公司 | Image capturing method and device |
US9823059B2 (en) | 2014-08-06 | 2017-11-21 | Hand Held Products, Inc. | Dimensioning system with guided alignment |
US10775165B2 (en) | 2014-10-10 | 2020-09-15 | Hand Held Products, Inc. | Methods for improving the accuracy of dimensioning-system measurements |
US9779276B2 (en) | 2014-10-10 | 2017-10-03 | Hand Held Products, Inc. | Depth sensor based auto-focus system for an indicia scanner |
US10810715B2 (en) | 2014-10-10 | 2020-10-20 | Hand Held Products, Inc | System and method for picking validation |
US9762793B2 (en) | 2014-10-21 | 2017-09-12 | Hand Held Products, Inc. | System and method for dimensioning |
US10060729B2 (en) | 2014-10-21 | 2018-08-28 | Hand Held Products, Inc. | Handheld dimensioner with data-quality indication |
US9752864B2 (en) | 2014-10-21 | 2017-09-05 | Hand Held Products, Inc. | Handheld dimensioning system with feedback |
US9557166B2 (en) | 2014-10-21 | 2017-01-31 | Hand Held Products, Inc. | Dimensioning system with multipath interference mitigation |
US9897434B2 (en) | 2014-10-21 | 2018-02-20 | Hand Held Products, Inc. | Handheld dimensioning system with measurement-conformance feedback |
US9786101B2 (en) | 2015-05-19 | 2017-10-10 | Hand Held Products, Inc. | Evaluating image values |
US10066982B2 (en) | 2015-06-16 | 2018-09-04 | Hand Held Products, Inc. | Calibrating a volume dimensioner |
US20160377414A1 (en) | 2015-06-23 | 2016-12-29 | Hand Held Products, Inc. | Optical pattern projector |
US9857167B2 (en) | 2015-06-23 | 2018-01-02 | Hand Held Products, Inc. | Dual-projector three-dimensional scanner |
US9835486B2 (en) | 2015-07-07 | 2017-12-05 | Hand Held Products, Inc. | Mobile dimensioner apparatus for use in commerce |
CN104952221B (en) * | 2015-07-09 | 2017-06-13 | 深圳大学 | Myopia-proof intelligent desk lamp |
EP3396313B1 (en) | 2015-07-15 | 2020-10-21 | Hand Held Products, Inc. | Mobile dimensioning method and device with dynamic accuracy compatible with nist standard |
US20170017301A1 (en) | 2015-07-16 | 2017-01-19 | Hand Held Products, Inc. | Adjusting dimensioning results using augmented reality |
US10094650B2 (en) | 2015-07-16 | 2018-10-09 | Hand Held Products, Inc. | Dimensioning and imaging items |
US10249030B2 (en) | 2015-10-30 | 2019-04-02 | Hand Held Products, Inc. | Image transformation for indicia reading |
US10225544B2 (en) | 2015-11-19 | 2019-03-05 | Hand Held Products, Inc. | High resolution dot pattern |
CN105631861B (en) * | 2015-12-21 | 2019-10-01 | 浙江大学 | Restore the method for 3 D human body posture from unmarked monocular image in conjunction with height map |
US10025314B2 (en) | 2016-01-27 | 2018-07-17 | Hand Held Products, Inc. | Vehicle positioning and object avoidance |
CN105997094B (en) * | 2016-05-09 | 2019-03-29 | 北京科技大学 | A kind of gesture recognition device and method |
US10339352B2 (en) | 2016-06-03 | 2019-07-02 | Hand Held Products, Inc. | Wearable metrological apparatus |
US9940721B2 (en) | 2016-06-10 | 2018-04-10 | Hand Held Products, Inc. | Scene change detection in a dimensioner |
US10163216B2 (en) | 2016-06-15 | 2018-12-25 | Hand Held Products, Inc. | Automatic mode switching in a volume dimensioner |
CN106780569A (en) * | 2016-11-18 | 2017-05-31 | 深圳市唯特视科技有限公司 | A kind of human body attitude estimates behavior analysis method |
CN108154161A (en) * | 2016-12-05 | 2018-06-12 | 上海西门子医疗器械有限公司 | The method of training grader, the method and medical instrument for determining detected object position |
US10909708B2 (en) | 2016-12-09 | 2021-02-02 | Hand Held Products, Inc. | Calibrating a dimensioner using ratios of measurable parameters of optic ally-perceptible geometric elements |
CN106780530B (en) * | 2016-12-15 | 2019-06-14 | 广州视源电子科技股份有限公司 | A kind of figure prediction technique and equipment |
CN108629345B (en) | 2017-03-17 | 2021-07-30 | 北京京东尚科信息技术有限公司 | High-dimensional image feature matching method and device |
US11047672B2 (en) | 2017-03-28 | 2021-06-29 | Hand Held Products, Inc. | System for optically dimensioning |
CN107358149B (en) * | 2017-05-27 | 2020-09-22 | 深圳市深网视界科技有限公司 | Human body posture detection method and device |
US10733748B2 (en) | 2017-07-24 | 2020-08-04 | Hand Held Products, Inc. | Dual-pattern optical 3D dimensioning |
CN108121963B (en) * | 2017-12-21 | 2021-08-24 | 北京奇虎科技有限公司 | Video data processing method and device and computing equipment |
CN108345869B (en) * | 2018-03-09 | 2022-04-08 | 南京理工大学 | Driver posture recognition method based on depth image and virtual data |
US10584962B2 (en) | 2018-05-01 | 2020-03-10 | Hand Held Products, Inc | System and method for validating physical-item security |
CN109101866B (en) * | 2018-06-05 | 2020-12-15 | 中国科学院自动化研究所 | Pedestrian re-identification method and system based on segmentation silhouette |
CN110362843B (en) * | 2018-11-20 | 2022-06-17 | 莆田学院 | Virtual human overall attitude approximate generation method based on typical attitude |
TWI707249B (en) * | 2018-11-27 | 2020-10-11 | 美律實業股份有限公司 | System and method for generating label data |
CN109657631B (en) * | 2018-12-25 | 2020-08-11 | 上海智臻智能网络科技股份有限公司 | Human body posture recognition method and device |
CN110020630B (en) * | 2019-04-11 | 2020-12-18 | 成都乐动信息技术有限公司 | Method and device for evaluating action completion degree, storage medium and electronic equipment |
CN112288798A (en) * | 2019-07-24 | 2021-01-29 | 鲁班嫡系机器人(深圳)有限公司 | Posture recognition and training method, device and system |
US11639846B2 (en) | 2019-09-27 | 2023-05-02 | Honeywell International Inc. | Dual-pattern optical 3D dimensioning |
US11564651B2 (en) | 2020-01-14 | 2023-01-31 | GE Precision Healthcare LLC | Method and systems for anatomy/view classification in x-ray imaging |
CN111353543B (en) * | 2020-03-04 | 2020-09-11 | 镇江傲游网络科技有限公司 | Motion capture data similarity measurement method, device and system |
CN115294375B (en) * | 2022-10-10 | 2022-12-13 | 南昌虚拟现实研究院股份有限公司 | Speckle depth estimation method and system, electronic device and storage medium |
Citations (2)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
CN101079103A (en) * | 2007-06-14 | 2007-11-28 | 上海交通大学 | Human face posture identification method based on sparse Bayesian regression |
CN101332362A (en) * | 2008-08-05 | 2008-12-31 | 北京中星微电子有限公司 | Interactive delight system based on human posture recognition and implement method thereof |
Family Cites Families (1)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
GB0308943D0 (en) * | 2003-04-17 | 2003-05-28 | Univ Dundee | A system for determining the body pose of a person from images |
- 2009-07-31 CN CN200910161452.7A patent/CN101989326B/en not_active Expired - Fee Related
- 2010-04-20 KR KR1020100036589A patent/KR20110013200A/en not_active Application Discontinuation
Patent Citations (2)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
CN101079103A (en) * | 2007-06-14 | 2007-11-28 | 上海交通大学 | Human face posture identification method based on sparse Bayesian regression |
CN101332362A (en) * | 2008-08-05 | 2008-12-31 | 北京中星微电子有限公司 | Interactive delight system based on human posture recognition and implement method thereof |
Also Published As
Publication number | Publication date |
---|---|
CN101989326A (en) | 2011-03-23 |
KR20110013200A (en) | 2011-02-09 |
Similar Documents
Publication | Publication Date | Title |
---|---|---|
CN101989326B (en) | Human posture recognition method and device | |
CN107688391B (en) | Gesture recognition method and device based on monocular vision | |
US7831087B2 (en) | Method for visual-based recognition of an object | |
Liu et al. | Hand gesture recognition using depth data | |
Xu et al. | Online dynamic gesture recognition for human robot interaction | |
US20110025834A1 (en) | Method and apparatus of identifying human body posture | |
CN108898063A (en) | A kind of human body attitude identification device and method based on full convolutional neural networks | |
US20050271279A1 (en) | Sign based human-machine interaction | |
CN103839040A (en) | Gesture identification method and device based on depth images | |
Choi et al. | Human body orientation estimation using convolutional neural network | |
CN104102904B (en) | A kind of static gesture identification method | |
Wang et al. | Head pose estimation with combined 2D SIFT and 3D HOG features | |
Munaro et al. | 3D flow estimation for human action recognition from colored point clouds | |
CN110751097B (en) | Semi-supervised three-dimensional point cloud gesture key point detection method | |
Zhang et al. | A survey on human pose estimation | |
CN109325408A (en) | A kind of gesture judging method and storage medium | |
CN112101208A (en) | Feature series fusion gesture recognition method and device for elderly people | |
Chen et al. | Silhouette-based object phenotype recognition using 3D shape priors | |
Sapp et al. | A Fast Data Collection and Augmentation Procedure for Object Recognition. | |
Tian et al. | Human Detection using HOG Features of Head and Shoulder Based on Depth Map. | |
Liang et al. | 3D motion trail model based pyramid histograms of oriented gradient for action recognition | |
Kanaujia et al. | Part segmentation of visual hull for 3d human pose estimation | |
Liang et al. | Egocentric hand pose estimation and distance recovery in a single RGB image | |
CN113971801A (en) | Target multi-dimensional detection method based on four-type multi-modal data fusion | |
Weber et al. | Leveraging Saliency-Aware Gaze Heatmaps for Multiperspective Teaching of Unknown Objects |
Legal Events
Date | Code | Title | Description |
---|---|---|---|
C06 | Publication | ||
PB01 | Publication | ||
C10 | Entry into substantive examination | ||
SE01 | Entry into force of request for substantive examination | ||
C14 | Grant of patent or utility model | ||
GR01 | Patent grant | ||
CF01 | Termination of patent right due to non-payment of annual fee | ||
Granted publication date: 2015-04-01; Termination date: 2017-07-31