CN106886741A

CN106886741A - A kind of gesture identification method of base finger identification

Info

Publication number: CN106886741A
Application number: CN201510943700.9A
Authority: CN
Inventors: 祝铭明
Original assignee: Yutou Technology Hangzhou Co Ltd
Current assignee: Yutou Technology Hangzhou Co Ltd
Priority date: 2015-12-16
Filing date: 2015-12-16
Publication date: 2017-06-23

Abstract

The invention discloses a kind of gesture identification method of base finger identification, belong to technical field of hand gesture recognition；Method includes：The video data stream for being associated with user's whole body is obtained by an image collecting device, and treatment obtains skeleton point information；According to skeleton point information, determine palm of the hand positional information with pushing information；Judge that whether height of the palm of the hand of user apart from ground is more than a default height threshold, and step is continued executing with when being according to palm of the hand positional information；Judgement obtains the image of palm area, and image to palm area carries out partition, cutting and pre-processes, and obtains corresponding hand mask and exports；According to result, the fingertip area of hand is identified, and the gesture of user is identified according to the geometrical relationship of fingertip area.The beneficial effect of above-mentioned technical proposal is：Eliminate the figure viewed from behind to ring, it is to avoid some invalid gestures are mistaken as user and carry out the situation generation of gesture instruction input, improve the accuracy of gesture identification.

Description

A kind of gesture identification method of base finger identification

Technical field

The present invention relates to technical field of hand gesture recognition, more particularly to a kind of gesture identification method.

Background technology

Current gesture identification method both domestic and external is roughly divided into 2 classes, based on wearing kind equipment and based on Conventional visual.Gesture identification based on wearable device is to obtain finger motion characteristic from sensors such as data glove, position trackers, and the analysis that incoming computer carries out joint data using neutral net simultaneously reaches man-machine interaction so as to obtain gesture.Major advantage can be the posture and gesture for determining finger, but comparatively costly, be unfavorable for large-scale popularization application.Method for distinguishing is known based on Conventional visual and gathers gesture video or image information using common camera, then be identified treatment.Although which brings good man-machine interaction to user, but for the extraction for improving the robustness of system and effectively carry out hand position, hand, finger orientation etc. and be characterized, identification people needs to wear coloured gloves, it is installed with the clothes of particular requirement, and recognizing the background of people needs unified color, therefore is easily influenceed by environmental factors such as background, light, the positions of camera based on Conventional visual knowledge method for distinguishing.

The content of the invention

According to the above-mentioned problems in the prior art, a kind of technical scheme of the finger gesture recognition methods based on depth image is now provided, specifically included：

A kind of gesture identification method, wherein, comprise the following steps：

Step S1, the video data stream for being associated with user's whole body is obtained by an image collecting device, and process the skeleton point information of each skeleton point for obtaining being associated with the user；

Step S2, according to the skeleton point information, it is determined that representing the palm of the hand positional information and the pushing pushing information for representing the user of the palm of the hand position of the user；

Step S3, judges whether height of the palm of the hand of the user apart from ground is more than a default height threshold according to the palm of the hand positional information：

If so, then continuing executing with the step S4；

If it is not, then exiting；

Step S4, judgement obtains the image of palm area, and image to the palm area carries out partition, cutting and pre-processes, and obtains corresponding hand mask and exports；

Step S5, according to the result, identifies the fingertip area of hand, and the gesture of the user is identified according to the geometrical relationship of the fingertip area.

Preferably, the gesture identification method, wherein, in the step S1, described image harvester is depth of field camera；

The video data is the depth of field video data of the whole body for being associated with the user.

Preferably, the gesture identification method, wherein, the step S1 includes：

Step S11, the video data stream of the depth image of the whole body of background and the user is included using the collection of described image harvester；

Step S12, the three-dimensional information of the pixel of the depth image of every frame that the video data stream is included carries out spatial alternation, to obtain corresponding cloud information in real space；

Step S13, according to the corresponding described cloud information of each described pixel, obtains the distance between each described pixel and described depth of field camera；

Step S14, respectively according to the corresponding distance of each described pixel, treatment obtains the skeleton point information.

Preferably, the gesture identification method, wherein, the step S2 includes：

Step S21, the skeleton point information of each skeleton point for being associated with the user obtained according to treatment obtains the palm of the hand positional information of the user；

Step S22, the skeleton point information of each skeleton point for being associated with the user obtained according to treatment, the height information of the user is calculated according to following formula：

Wherein, H₁Represent the height values of the user, H₂Represent the pixels tall numerical value of background, H₃Represent pixels tall numerical value of the user in collected video image, d represents the distance between the user and depth of field camera numerical value, θ represents depth of field camera vertical angle numerical value in the horizontal direction；

Step S23, according to the corresponding relation of default Human Height and human body between pushing, obtains the described pushing information of the user.

Preferably, the gesture identification method, wherein, the step S4 includes：

Step S41, according to the palm of the hand positional information and the pushing information, the information of the distance more than the pixel of the pushing half of all and palm of the hand position that the hand of the user includes is removed, and the information of all described pixel included according to the hand after removal obtains hand data；

The hand data that treatment is obtained are carried out clustering processing by step S42 by K mean cluster algorithm, are obtained by the hand data after clustering processing；

Step S43, sets min cluster number, is excluded with the filtering that the hand data are carried out with noise jamming pixel clusters, so as to obtain being associated with the hand mask of the hand data and export.

Preferably, the gesture identification method, wherein, the hand data are included in a spheric region with the described pushing half of the user as radius and with the palm of the hand position of the user as the center of circle.

Preferably, the gesture identification method, wherein, the step S5 includes：

Step S51, the edge contour of the hand mask is obtained using the detection of Moore neighborhoods contour following algorithm, and obtains first chain set of all profile points included on the edge contour；

Step S52, the convex closure collection on the hand profile of the hand mask is obtained using the detection of Graham scanning algorithms, and acquisition includes the second point chain set of all convex closures；

Step S53, using profile maximum depression points scanning algorithm, maximum depression points between detection on the edge contour of the hand mask and the convex closure collection of the hand profile obtains all salient points, and obtain the thirdly chain set of the sags and crests included on the hand profile；

Step S54, using concavo-convex angle recognizer, obtains including the 4th chain set of all finger tip points of hand according to the thirdly chain process of aggregation for being associated with the hand profile；

Step S55, each finger of hand is obtained according to finger tip point identification, then performs gesture identification operation.

Preferably, the gesture identification method, wherein, in the step S55, perform the step of gesture identification is operated and specifically include：

Step S551, identification obtains the number of all described finger of hand；

Step S552, judged to obtain the angle between the every title of the finger, direction vector and the adjacent finger and export according to presupposed information；

Step S553, forms one or three layers of decision tree, and gesture is identified according to three layers of decision tree according to the information exported in the step S552.

Preferably, the gesture identification method, wherein, in the step S42, the K mean cluster is calculated K values in method are set as fixed numbers 2.

The beneficial effect of above-mentioned technical proposal is to provide a kind of gesture identification method, can eliminate the influence of extraneous background, and avoid some invalid gestures be mistaken as user carry out gesture instruction input situation occur, improve the accuracy of gesture identification.

Brief description of the drawings

During Fig. 1 is preferred embodiment of the invention, a kind of overall procedure schematic diagram of gesture identification method；

During Fig. 2 is preferred embodiment of the invention, the schematic flow sheet of the skeleton point information for obtaining user is gathered and processed；

During Fig. 3 is preferred embodiment of the invention, treatment obtains the schematic flow sheet of palm of the hand positional information and pushing information；

During Fig. 4 is preferred embodiment of the invention, treatment obtains the schematic flow sheet of hand mask；

During Fig. 5 is preferred embodiment of the invention, the schematic flow sheet being identified to gesture；

During Fig. 6 is preferred embodiment of the invention, the schematic flow sheet of profile maximum depression points scanning algorithm；

During Fig. 7 is preferred embodiment of the invention, the schematic flow sheet of concavo-convex angle recognizer；

Specific embodiment

Below in conjunction with the accompanying drawing in the embodiment of the present invention, the technical scheme in the embodiment of the present invention is clearly and completely described, it is clear that described embodiment is only a part of embodiment of the invention, rather than whole embodiments.Based on the embodiment in the present invention, the every other embodiment that those of ordinary skill in the art are obtained on the premise of creative work is not made belongs to the scope of protection of the invention.

It should be noted that in the case where not conflicting, the embodiment in the present invention and the feature in embodiment can be mutually combined.

The invention will be further described with specific embodiment below in conjunction with the accompanying drawings, but not as limiting to the invention.

In preferred embodiment of the invention, there is provided a kind of gesture identification method, the overall procedure of the method is as shown in figure 1, comprise the steps：

Step S1, the video data stream for being associated with user's whole body is obtained by an image collecting device, and process the skeleton point information of each skeleton point for obtaining being associated with user；

Specifically, as shown in Fig. 2 above-mentioned steps S1 comprises the steps：

Step S11, the video data stream of the depth image of the whole body of background and user is included using image acquisition device；

In preferred embodiment of the invention, above-mentioned image collecting device can be the camera being installed on the intelligent terminal for supporting gesture instruction interaction, depth of field camera is can preferably be, that is, the camera of the function of supporting to be capable of blur-free imaging in the range of the longitudinal separation of imaging object.

Then in above-mentioned steps S11, the video flowing of the whole body depth image of the background and user of picture where user is directly shot using above-mentioned depth of field camera, eventually form above-mentioned video data stream and export.

Step S12, the three-dimensional information of the pixel of the depth image of every frame that video data stream is included carries out spatial alternation, to obtain corresponding cloud information in real space；

In preferred embodiment of the invention, in above-mentioned steps S12, the voxel information of each pixel respectively obtains its corresponding cloud information in real space after carrying out spatial alternation in the depth image of acquired every frame in the video data stream that will be photographed.

Step S13, according to corresponding cloud information of each pixel, obtains the distance between each pixel and depth of field camera；

By the point cloud information obtained in above-mentioned steps S12, the distance obtained between each pixel of correspondence and depth of field camera can be further processed in above-mentioned steps S13.

Step S14, respectively according to the corresponding distance of each pixel, treatment obtains skeleton point information.

In above-mentioned steps S14, the skeleton point information for obtaining user finally can be processed according to the distance between each pixel and above-mentioned depth of field camera respectively.So-called skeleton point, it is believed that be a kind of human body markup model, multiple skeleton points that can be used to mark human body different parts are included in the markup model, and different skeleton points may be respectively used for marking each joint of human body.Human visual's model that for example certain class multiple skeleton point is formed, the skeleton state that human body is under standing state is represented there is defined 20 skeleton points, and each skeleton point is an artis.In other words, before above-mentioned gesture identification method is performed, it is necessary first to a pre-defined physically weak analog model of people for including multiple skeleton points, there is the more technical scheme that can realize presetting above-mentioned human visual's model in the prior art, in this not go into detail.

Then in preferred embodiment of the invention, spatial alternation is carried out per the voxel information of frame depth image for the detailed process of the point cloud information in real space can be realized according to related software, only need to be called when realizing the api interface of related software code, will not be repeated here.

Step S2, according to skeleton point information, it is determined that representing the palm of the hand positional information and the pushing pushing information for representing user of the palm of the hand position of user；

In preferred embodiment of the invention, above-mentioned palm of the hand positional information is used to indicate the palm of the hand position of user, and further, palm of the hand positional information can serve to indicate that the position of the hand of user.

In preferred embodiment of the invention, pushing information can serve to indicate that the hand length of user.The pushing information is usually default, and the Human Height and pushing ratio for for example being obtained by training in advance are calculated, and above-mentioned calculating process can hereinafter be described in detail.

Then in preferred embodiment of the invention, as shown in figure 3, above-mentioned steps S2 is further included：

Step S21, the skeleton point information of each skeleton point for being associated with user obtained according to treatment obtains the palm of the hand positional information of user；

Step S22, the skeleton point information of each skeleton point for being associated with user obtained according to treatment, the height information of user is calculated according to following formula：

Wherein, H₁Represent the height values of user, H₂Represent the pixels tall numerical value of background, H₃Pixels tall numerical value of the user in collected video image is represented, d represents the distance between user and depth of field camera numerical value, and θ represents depth of field camera vertical angle numerical value in the horizontal direction.Then above-mentioned H₂Numerical value can preset, for example value be 240, similarly θ can also preset, be for example set as 21.5 °.

Step S23, according to the corresponding relation of default Human Height and human body between pushing, obtains the pushing information of user.

In preferred embodiment of the invention, above-mentioned Human Height and pushing corresponding relation can be according to substantial amounts of human body related datas, and by way of big data is counted carrying out multiple linear regression analysis obtains.

Step S3, judges whether height of the palm of the hand of user apart from ground is more than a default height threshold according to palm of the hand positional information：

If so, then continuing executing with step S4；

If it is not, then exiting；

In the prior art, generally there is such a case in the intelligent terminal for supporting gesture identification：Although user stands in the range of the picture catching of depth of field camera, it is not intended to carry out gesture operation to corresponding intelligent terminal.Then now, user may unconsciously brandish arm when some other affairs (such as being talked with other people) is carried out, this a series of actions is possible to that intelligent terminal can be caused to misread gesture motion, will user do not feel in some gesture motions for doing be identified as the gesture motion for needing to be controlled intelligent terminal.

Then in preferred embodiment of the invention, the situation for misreading gesture motion in order to avoid above-mentioned occurs, and presets a height threshold first before gesture identification, when the height threshold does the gesture motion of standard for user, the calibrated altitude that hand is liftoff.In other words, as long as the liftoff height of the hand of user is higher than above-mentioned height threshold, then may indicate that user is current and attempting being input into gesture instruction to intelligent terminal.Conversely, then it is considered that user does not have a mind to control intelligent terminal by gesture motion.

Then in above-mentioned steps S3, the liftoff distance of hand can be determined by the image of palm of the hand position and user's whole body first.During the pre-setting of above-mentioned height threshold, the height distance that the center (i.e. the palm of the hand) of the corresponding hand of gesture motion that height threshold directly can be set into standard is liftoff, so in actual calculating process, terrain clearance directly can be calculated using above-mentioned palm of the hand position and be contrasted with default height threshold.Similarly, above-mentioned height threshold is it can also be provided that the liftoff height distance of the corresponding hand bottom/top edge of the gesture motion of standard, in this case, in actual calculating process, the marginal position on the bottom/top for being accomplished by extrapolating hand substantially according to palm of the hand positional information first, and and then be calculated the actual terrain clearance of hand, and contrasted with default height threshold.

Step S4, judgement obtains the image of palm area, and image to palm area carries out partition, cutting and pre-processes, and obtains corresponding hand mask and exports；

In preferred embodiment of the invention, above-mentioned steps S4 is specific as shown in figure 4, comprising the steps：

Step S41, according to palm of the hand positional information and pushing information, removes the information of the distance more than the pixel of pushing half of all and palm of the hand position that the hand of user includes, and the information of all pixels point included according to the hand after removal obtains hand data；

In preferred embodiment of the invention, based on the filter algorithm apart from depth, the data of all pixels for being apart more than pushing half with above-mentioned palm of the hand position are removed such that it is able to quick obtaining hand data.In other words, remain is a spheric region with pushing half as radius with palm of the hand position as the center of circle after eventually passing through filtering, and all pixels point in the spheric region is retained, as the pixel of hand data.

Therefore, in preferred embodiment of the invention, the hand data of user are comprised in a spheric region with the pushing half of user as radius and with the palm of the hand position of user as the center of circle.

Specifically, in preferred embodiment of the invention, in above-mentioned steps S41, the set of the pixel in above-mentioned spheric region is calculated according to following formula, obtains final product hand data：

Wherein, p₀Represent the set of the pixel in above-mentioned spheric region, the pixel that p includes for the hand of user, p (x, y, z) it is used for denotation coordination for (x, y, z) pixel p, it is (x0 that p (x0, y0, z0) is used for denotation coordination, y0, z0), i.e., for representing palm of the hand position where pixel, H₄Numerical value for representing pushing information.

In preferred embodiment of the invention, the K values in K mean cluster algorithm in above-mentioned steps S42 can be specified the number of class by developer, and in a preferred embodiment of the invention, K values take fixed numerical value 2.

Step S43, sets min cluster number, is excluded with the filtering that hand data are carried out with noise jamming pixel clusters, so as to obtain being associated with the hand mask of hand data and export.

In preferred embodiment of the invention, above-mentioned hand mask can be a binary picture being made up of 0 and 1.Then in a preferred embodiment of the invention, in above-mentioned steps S43, the min cluster number (minimum cluster numbers pixel threshold) of setting is 50 pixels.

Step S5, according to result, identifies the fingertip area of hand, and the gesture of user is identified according to the geometrical relationship of fingertip area.

In preferred embodiment of the invention, the characteristic of the finger tip detection method combination depth image of contour curvature can be based on, propose a kind of sags and crests angle recognizer, this algorithm overcomes 3 points of conventional alignment methods and (for example lacks relative consistency to the deficiency of finger tip detection, there is requirement higher to the distance between image and camera, and the operand of program can be increased etc.).And on the basis of above-mentioned sags and crests angle recognizer, each finger of hand is recognized using the spatial relation of human body and hand.Finally can be by three layers of decision tree of formation, the fingertip area for relying on each finger is analyzed treatment to gesture, so as to recognize the gesture motion of user.

Specifically, in preferred embodiment of the invention, as shown in figure 5, above-mentioned steps S5 includes：

Step S51, the edge contour of hand mask is obtained using the detection of Moore neighborhoods contour following algorithm, and obtains first chain set of all profile points included on edge contour；

Moore neighborhoods contour following algorithm is the algorithm for detecting profile relatively more classical present in prior art, be will not be repeated here.

Step S52, the convex closure collection on the hand profile of hand mask is obtained using the detection of Graham scanning algorithms, and acquisition includes the second point chain set of all convex closures；

The Graham also classical algorithm for monitoring profile, also repeats no more herein.

Step S53, using profile maximum depression points scanning algorithm, the maximum depression points between detection on the edge contour of hand mask and the convex closure collection of hand profile obtains all salient points, and obtains the thirdly chain set of the sags and crests included on hand profile；

Further, in preferred embodiment of the invention, as shown in fig. 6, in above-mentioned steps S53, so-called profile maximum depression points scanning algorithm is specifically included：

Step S531, using the second point chain set on hand profile as initial thirdly chain set.

Step S532, successively to each hand outline concave point between 2 salient points adjacent before and after second point chain set, thirdly in chain set with point to line a range formula, detecting the depression points of its hand profile has the concave point of ultimate range to the connection straight line between above-mentioned 2 adjacent salient points.

Step S533, the above-mentioned concave point with ultimate range is inserted into the set of above-mentioned thirdly chain between above-mentioned 2 adjacent salient points.

Step S534, repeats above-mentioned steps S532 to S533, is finished until the point in the set of above-mentioned thirdly chain is all detected.

Step S535, the point of its maximum is obtained by iteration, then be maximum depression points, and generates the thirdly chain set on orderly hand profile.

Step S54, using concavo-convex angle recognizer, obtains including the 4th chain set of all finger tip points of hand according to the thirdly chain process of aggregation for being associated with hand profile；

Specifically, in preferred embodiment of the invention, as shown in fig. 7, in above-mentioned steps S54, so-called concavo-convex angle recognizer includes：

Step S541, from top to bottom finds a salient point P1 in thirdly chain set sequentially on hand profile, and from its front and rear 2 direction chooses adjacent concave point P2 and concave point P3 respectively.

Step S542, from concave point P2 to salient point P1, salient point P1 to concave point P2 make 2 vectors, calculate its angle in salient point P1 points, if threshold value of its angle less than setting, salient point P1 points are identified as fingertip area and are stored in above-mentioned 4th chain set.

Step S543, if the thirdly chain set on hand profile has not detected that repeat the above steps S541 also, to detect next candidate's salient point；Otherwise terminate.

In preferred embodiment of the invention, in above-mentioned steps S55, the distance of every 2 adjacent and non-adjacent finger tip points in above-mentioned 4th chain set can be successively calculated, and the different corresponding fingers of fingertip area are determined according to distance.

Specifically, in a preferred embodiment of the invention, adjacent 2 finger tip points can be defined as thumb apart from maximum and non-adjacent 2 finger tip points most big-and-middle public finger tip point of distance, adjacent with thumb and distance maximum finger tip point is defined as forefinger, non-adjacent with thumb and distance maximum finger tip point is defined as little finger, and the finger tip point nearest with forefinger is defined as middle finger；Left finger tip point is defined as the third finger.

In a preferred embodiment of the invention, the default threshold value of above-mentioned sags and crests angle can be set to 40 °, then be can effectively solve the problem that using technical solution of the present invention and judge problem by accident present in traditional finger tip detection, while reducing amount of calculation.

In preferred embodiment of the invention, for an identification for gesture, the number of finger is identified according to above-mentioned steps first, and obtain the title of finger, and the direction vector and the angle between them of each finger, and three layers of decision tree are formed with above three condition, the identification of gesture motion is finally realized according to above-mentioned three layers of decision tree.

In preferred embodiment of the invention, above-mentioned three layers of decision tree is a kind of sorting technique then classified to new data according to decision tree or rule by carrying out inductive learning to sample, generating corresponding decision tree or decision rule, in various sorting algorithms, decision tree is most intuitively a kind of.Three layers of decision tree are exactly respectively as one layer of classification foundation of decision node in tree, so as to reach classification purpose by above three condition.

The processing procedure of hand detection in the present invention and finger identification is carried out when having depth image data input each time, if same object still exists in next frame depth image, and profile is when having deformed with previous frame image, then all of object properties will continue to quote the characteristic point that old depth image frame analysis draws, program work amount can be thus reduced, efficiency is improved.

In a preferred embodiment of the invention, gesture and gesture that the process being identified to gesture according to above-mentioned three layers of decision tree is for example gesticulated in identification numeral " I love you "：

Identify that the current gesture motion of user is related to three fingers first, and obtain corresponding finger name further to be recognized.

Can be known by advance training, thumb, forefinger and little finger of toe have been used in gesture " I love you ", and gesticulate when numeral for example gesticulates Arabic numerals " 3 " and used forefinger, middle finger and the third finger, therefore directly can carry out the differentiation of gesture motion by having used which root finger in the gesture motion.

Again for example, the gesture motion for equally carrying out numeral is gesticulated, for example, gesticulate Arabic numerals " 2 " and gesticulate Chinese figure " seven ", the finger number and finger name that two gestures are used are all identical, then can be distinguished by two vector angles of gesture：

For Arabic numerals " 2 ", when gesticulating, the direction vector angle of its two fingers must be an acute angle to user, and can realize a default threshold value less than us, now can just allow computer to identify that this is Arabic numerals " 2 ".

Correspondingly, for Chinese gesture " seven ", when gesticulating, the direction vector angle of two finger is more than angle when gesticulating Arabic numerals " 2 " to user, when can then work as angle more than above-mentioned default threshold value, current gesture motion is identified as " seven ".

In preferred embodiment of the invention; it is above-mentioned that gesture motion is identified to include various specific embodiments using three layers of decision tree; it is numerous to list herein, as long as being that the three layers of decision tree formed using above three condition are included in protection scope of the present invention to the mode that gesture is identified.

The foregoing is only preferred embodiments of the present invention; not thereby embodiments of the present invention and protection domain are limited; to those skilled in the art; the scheme obtained by all utilization description of the invention and the equivalent done by diagramatic content and obvious change should be can appreciate that, should be included in protection scope of the present invention.

Claims

1. a kind of gesture identification method, it is characterised in that comprise the following steps：

Step S1, the video data stream for being associated with user's whole body is obtained by an image collecting device, and Treatment obtains the skeleton point information of each skeleton point for being associated with the user；

Step S2, according to the skeleton point information, it is determined that representing the palm of the hand of the palm of the hand position of the user Positional information and the pushing pushing information for representing the user；

Step S3, the height of the palm of the hand apart from ground of the user is judged according to the palm of the hand positional information Whether a default height threshold is more than：

If so, then continuing executing with the step S4；

If it is not, then exiting；

Step S4, judgement obtains the image of palm area, and image to the palm area is split Cut and pre-processed, obtain corresponding hand mask and export；

Step S5, according to the result, identifies the fingertip area of hand, and according to the finger tip The geometrical relationship in region is identified to the gesture of the user.

2. gesture identification method as claimed in claim 1, it is characterised in that in the step S1, Described image harvester is depth of field camera；

3. gesture identification method as claimed in claim 2, it is characterised in that the step S1 includes：

Step S11, the whole body of background and the user is included using the collection of described image harvester Depth image video data stream；

Step S12, the three of the pixel of the depth image of every frame that the video data stream is included Dimension information carries out spatial alternation, to obtain corresponding cloud information in real space；

Step S13, according to the corresponding described cloud information of each described pixel, obtains each described pixel The distance between with the depth of field camera；

Step S14, respectively according to the corresponding distance of each described pixel, treatment obtains the bone Point information.

4. gesture identification method as claimed in claim 1, it is characterised in that the step S2 includes：

Step S21, according to each described skeleton point for being associated with the user that treatment is obtained Skeleton point information, obtains the palm of the hand positional information of the user；

Step S22, according to each described skeleton point for being associated with the user that treatment is obtained Skeleton point information, the height information of the user is calculated according to following formula：

H_{1} = \frac{2 * d * t a n θ * H_{3}}{H_{2}}

Wherein, H₁Represent the height values of the user, H₂The pixels tall numerical value of background is represented, H₃Pixels tall numerical value of the user in collected video image is represented, d represents described and uses The distance between person and depth of field camera numerical value, θ represents the depth of field camera in the horizontal direction Vertical angle numerical value；

Step S23, according to the corresponding relation of default Human Height and human body between pushing, obtains described The described pushing information of user.

5. gesture identification method as claimed in claim 1, it is characterised in that the step S4 includes：

Step S41, according to the palm of the hand positional information and the pushing information, removes the user's Letter of the distance of all and palm of the hand position that hand includes more than the pixel of the pushing half Breath, and the information of all described pixel included according to the hand after removal obtains hand data；

The hand data that treatment is obtained are carried out at cluster by step S42 by K mean cluster algorithm Reason, obtains by the hand data after clustering processing；

Step S43, sets min cluster number, and noise jamming pixel clusters are carried out with to the hand data Filtering is excluded, so as to obtain being associated with the hand mask of the hand data and export.

6. gesture identification method as claimed in claim 5, it is characterised in that the hand packet contains With the described pushing half of the user as radius and with the palm of the hand position of the user For in a spheric region in the center of circle.

7. gesture identification method as claimed in claim 1, it is characterised in that the step S5 includes：

Step S51, the edge of the hand mask is obtained using the detection of Moore neighborhoods contour following algorithm Profile, and obtain first chain set of all profile points included on the edge contour；

Step S52, is obtained on the hand profile of the hand mask using the detection of Graham scanning algorithms Convex closure collection, and acquisition include the second point chain set of all convex closures；

Step S53, using profile maximum depression points scanning algorithm, in the edge of the hand mask Detection obtains the maximum depression between all salient points on the convex closure collection of profile and the hand profile Point, and obtain the thirdly chain set of the sags and crests included on the hand profile；

Step S54, using concavo-convex angle recognizer, according to being associated with described the of the hand profile 3 chain process of aggregation obtain including the 4th chain set of all finger tip points of hand；

Step S55, each finger of hand is obtained according to finger tip point identification, is then performed gesture and is known Do not operate.

8. gesture identification method as claimed in claim 7, it is characterised in that in the step S55, Perform the step of gesture identification is operated and specifically include：

Step S551, identification obtains the number of all described finger of hand；

Step S552, according to presupposed information judge obtain the every title of the finger, direction vector with And angle between the adjacent finger and export；

Step S553, one or three layers of decision tree, and root are formed according to the information exported in the step S552 Gesture is identified according to three layers of decision tree.

9. gesture identification method as claimed in claim 5, it is characterised in that in the step S42, K values in the K mean cluster algorithm are set as fixed numbers 2.