CN106648068A - Method for recognizing three-dimensional dynamic gesture by two hands - Google Patents
Method for recognizing three-dimensional dynamic gesture by two hands Download PDFInfo
- Publication number
- CN106648068A CN106648068A CN201610998758.8A CN201610998758A CN106648068A CN 106648068 A CN106648068 A CN 106648068A CN 201610998758 A CN201610998758 A CN 201610998758A CN 106648068 A CN106648068 A CN 106648068A
- Authority
- CN
- China
- Prior art keywords
- mobile terminal
- gesture
- data
- slave
- master
- Prior art date
- Legal status (The legal status is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the status listed.)
- Pending
Links
- 238000000034 method Methods 0.000 title claims abstract description 66
- 238000012545 processing Methods 0.000 claims abstract description 10
- 230000008569 process Effects 0.000 claims description 15
- 230000001133 acceleration Effects 0.000 claims description 12
- 230000009471 action Effects 0.000 claims description 9
- 230000033001 locomotion Effects 0.000 claims description 8
- 238000001514 detection method Methods 0.000 claims description 5
- 230000003993 interaction Effects 0.000 abstract description 28
- 241000282414 Homo sapiens Species 0.000 abstract description 8
- 238000005516 engineering process Methods 0.000 description 15
- 238000013528 artificial neural network Methods 0.000 description 9
- 238000004422 calculation algorithm Methods 0.000 description 8
- 230000006870 function Effects 0.000 description 8
- 238000004891 communication Methods 0.000 description 7
- 230000000694 effects Effects 0.000 description 6
- 230000008901 benefit Effects 0.000 description 5
- 238000012549 training Methods 0.000 description 5
- 239000013598 vector Substances 0.000 description 5
- 210000004556 brain Anatomy 0.000 description 4
- 238000004364 calculation method Methods 0.000 description 4
- 230000008859 change Effects 0.000 description 4
- 238000011161 development Methods 0.000 description 4
- 238000010586 diagram Methods 0.000 description 4
- 230000007704 transition Effects 0.000 description 4
- 238000012706 support-vector machine Methods 0.000 description 3
- 238000012360 testing method Methods 0.000 description 3
- RZVHIXYEVGDQDX-UHFFFAOYSA-N 9,10-anthraquinone Chemical compound C1=CC=C2C(=O)C3=CC=CC=C3C(=O)C2=C1 RZVHIXYEVGDQDX-UHFFFAOYSA-N 0.000 description 2
- 238000004458 analytical method Methods 0.000 description 2
- 238000006243 chemical reaction Methods 0.000 description 2
- 238000013480 data collection Methods 0.000 description 2
- 230000007613 environmental effect Effects 0.000 description 2
- 238000010801 machine learning Methods 0.000 description 2
- 238000003825 pressing Methods 0.000 description 2
- 230000003238 somatosensory effect Effects 0.000 description 2
- 230000006641 stabilisation Effects 0.000 description 2
- 238000011105 stabilization Methods 0.000 description 2
- 241000238558 Eucarida Species 0.000 description 1
- 101000739754 Homo sapiens Semenogelin-1 Proteins 0.000 description 1
- 102100037550 Semenogelin-1 Human genes 0.000 description 1
- 230000003321 amplification Effects 0.000 description 1
- 230000009286 beneficial effect Effects 0.000 description 1
- 230000005540 biological transmission Effects 0.000 description 1
- 210000005252 bulbus oculi Anatomy 0.000 description 1
- 230000002860 competitive effect Effects 0.000 description 1
- 238000009795 derivation Methods 0.000 description 1
- 238000013461 design Methods 0.000 description 1
- 238000005265 energy consumption Methods 0.000 description 1
- 238000000605 extraction Methods 0.000 description 1
- 210000001508 eye Anatomy 0.000 description 1
- 238000001914 filtration Methods 0.000 description 1
- 238000009499 grossing Methods 0.000 description 1
- 230000006698 induction Effects 0.000 description 1
- 230000010365 information processing Effects 0.000 description 1
- 230000010354 integration Effects 0.000 description 1
- 230000002452 interceptive effect Effects 0.000 description 1
- 239000004973 liquid crystal related substance Substances 0.000 description 1
- 230000007246 mechanism Effects 0.000 description 1
- 230000006996 mental state Effects 0.000 description 1
- 210000003205 muscle Anatomy 0.000 description 1
- 210000002569 neuron Anatomy 0.000 description 1
- 238000010606 normalization Methods 0.000 description 1
- 238000003199 nucleic acid amplification method Methods 0.000 description 1
- 238000005457 optimization Methods 0.000 description 1
- 239000011241 protective layer Substances 0.000 description 1
- 238000011002 quantification Methods 0.000 description 1
- 238000011160 research Methods 0.000 description 1
- 230000004044 response Effects 0.000 description 1
- 238000010079 rubber tapping Methods 0.000 description 1
- 238000005070 sampling Methods 0.000 description 1
- 230000035945 sensitivity Effects 0.000 description 1
- 238000005309 stochastic process Methods 0.000 description 1
- 238000005406 washing Methods 0.000 description 1
Classifications
-
- G—PHYSICS
- G06—COMPUTING; CALCULATING OR COUNTING
- G06F—ELECTRIC DIGITAL DATA PROCESSING
- G06F3/00—Input arrangements for transferring data to be processed into a form capable of being handled by the computer; Output arrangements for transferring data from processing unit to output unit, e.g. interface arrangements
- G06F3/01—Input arrangements or combined input and output arrangements for interaction between user and computer
- G06F3/017—Gesture based interaction, e.g. based on a set of recognized hand gestures
-
- G—PHYSICS
- G06—COMPUTING; CALCULATING OR COUNTING
- G06F—ELECTRIC DIGITAL DATA PROCESSING
- G06F3/00—Input arrangements for transferring data to be processed into a form capable of being handled by the computer; Output arrangements for transferring data from processing unit to output unit, e.g. interface arrangements
- G06F3/01—Input arrangements or combined input and output arrangements for interaction between user and computer
- G06F3/03—Arrangements for converting the position or the displacement of a member into a coded form
- G06F3/033—Pointing devices displaced or positioned by the user, e.g. mice, trackballs, pens or joysticks; Accessories therefor
- G06F3/0346—Pointing devices displaced or positioned by the user, e.g. mice, trackballs, pens or joysticks; Accessories therefor with detection of the device orientation or free movement in a 3D space, e.g. 3D mice, 6-DOF [six degrees of freedom] pointers using gyroscopes, accelerometers or tilt-sensors
Landscapes
- Engineering & Computer Science (AREA)
- General Engineering & Computer Science (AREA)
- Theoretical Computer Science (AREA)
- Human Computer Interaction (AREA)
- Physics & Mathematics (AREA)
- General Physics & Mathematics (AREA)
- User Interface Of Digital Computer (AREA)
Abstract
本发明提出了一种双手三维动态手势识别方法及系统,利用两台移动终端,两台移动终端设备采用主从模式,将其中一台作为主移动终端,另一台作为从移动终端;两台移动终端分别跟踪左手和右手的手势;主移动终端向外界扫描可连接的外部设备,与从移动终端建立无线数据连接;主移动终端采集手势数据;从移动终端采集手势数据,并通过无线数据连接将手势数据实时的发送给主移动终端进行数据处理、同步及双手手势识别;主移动终端将主移动终端与从移动终端获得的手势数据所识别的结果发送给待控制设备。本发明直接以人手作为对智能设备的输入端,人机间的交互不再需要其他的媒介,提高了交互的效率并避免了因中间媒介所导致的各种误识别。
The present invention proposes a two-hands three-dimensional dynamic gesture recognition method and system, using two mobile terminals, the two mobile terminal devices adopt the master-slave mode, one of them is used as the master mobile terminal, and the other is used as the slave mobile terminal; The mobile terminal tracks the gestures of the left hand and the right hand respectively; the master mobile terminal scans the external devices that can be connected to the outside world, and establishes a wireless data connection with the slave mobile terminal; the master mobile terminal collects gesture data; The gesture data is sent to the main mobile terminal in real time for data processing, synchronization and two-hand gesture recognition; the main mobile terminal sends the recognition result between the main mobile terminal and the gesture data obtained from the mobile terminal to the device to be controlled. The present invention directly uses the human hand as the input terminal to the smart device, and the interaction between the human and the machine does not need other media, which improves the efficiency of the interaction and avoids various misidentifications caused by the intermediate media.
Description
技术领域technical field
本发明涉及人机交互技术领域,尤其涉及一种动态手势识别方法及系统。The invention relates to the technical field of human-computer interaction, in particular to a dynamic gesture recognition method and system.
背景技术Background technique
随着各方面技术的进步,人们希望得到更好的人机交互体验,对交互的实时性、识别率等硬性指标的要求也越来越苛刻,甚至开始希望通过一些自身的微小信息,例如手指信息、声波信息、眼珠信息、脑电波信息、肌肉信息等来实现与机器的沟通。这促使许多专家学者不断地寻找挖掘更便捷、更快速、更准确、更自然的人机交互方式,并开始关注利用人类自身的身体语言来实现机器控制,而其中手势作为变化方式最多、最能够表达人们意愿的方式之一,成为了关注的重点。在脑电波控制世界的时代来临之前,手势是最有可能改变传统人机交互方式的研究切入点。因为双手是身体姿态最多变、最具有表达性的一项,它可以最大可能地表达人们的想法和意愿。不通过任何媒介,直接利用一个简单的手势动作就可以操控生活,是目前人们所追求的理想人机交互方式。这种全新的操作方式,使得人类在这个物物相连的互联智能时代,真正实现用自己的手来掌控自己的生活。手势也会将人机交互方式从传统的平面二维识别技术扩展到了空间三维识别技术,实现更多技术发展的可能性。本发明所要做的基于IOS移动端的双手三维动态手势识别技术,利用人们生活中越来越普及的IOS智能移动端直接作为手势采集和识别的工具,来与PC端或其他终端实现友好的人机交互方式。With the advancement of technology in all aspects, people hope to get a better human-computer interaction experience, and the requirements for hard indicators such as real-time interaction and recognition rate are becoming more and more stringent. Information, sound wave information, eyeball information, brain wave information, muscle information, etc. to achieve communication with the machine. This has prompted many experts and scholars to constantly search for more convenient, faster, more accurate, and more natural human-computer interaction methods, and began to pay attention to the use of human body language to achieve machine control, among which gestures are the most varied and most capable. One of the ways to express people's wishes has become the focus of attention. Before the era of brain waves controlling the world, gestures are the research entry point that is most likely to change the traditional way of human-computer interaction. Because the hands are the most changeable and expressive item of body posture, they can express people's thoughts and wishes to the greatest extent possible. Directly using a simple gesture to control life without any medium is the ideal way of human-computer interaction that people are currently pursuing. This brand-new operation method enables human beings to truly control their own lives with their own hands in this era of interconnected intelligence where everything is connected. Gestures will also expand the human-computer interaction method from the traditional two-dimensional recognition technology to three-dimensional spatial recognition technology, realizing more possibilities for technological development. The two-hands three-dimensional dynamic gesture recognition technology based on the IOS mobile terminal to be done in the present invention uses the IOS intelligent mobile terminal which is more and more popular in people's life directly as a tool for gesture collection and recognition to realize friendly human-computer interaction with the PC terminal or other terminals Way.
手势识别技术的发展过程中产生过几种主流的解决方案,例如:Several mainstream solutions have been produced during the development of gesture recognition technology, such as:
1)基于数据手套的方案:起步最早,也是最直接处理问题的一种方案。该方案具有数据量小、速度高、受环境影响较小和能够直接获取手势数据等诸多优点。但是要求用户穿戴复杂的手套,大大降低了用户体验,并且对附着其上的传感器披覆较高硬度的保护层在技术上有一定难度,而相对脆弱的设备在携带上也存在不方便。1) The solution based on data gloves: the earliest start and the most direct solution to the problem. This scheme has many advantages, such as small amount of data, high speed, less affected by the environment, and the ability to directly acquire gesture data. However, users are required to wear complicated gloves, which greatly reduces the user experience, and it is technically difficult to coat the sensors attached to them with a relatively hard protective layer, and relatively fragile devices are also inconvenient to carry.
2)基于机器视觉的处理方案:当前最热门的手势识别技术解决方案,也就是摄像头图像处理,比如说微软的Kinect。最大的优点是用户无需佩戴任何设备,人机交互性好。但是这种技术对于空间、环境有比较大的限制性,因此除了一些大型的体感游戏使用,在生活中的推广度并不高。2) Processing solutions based on machine vision: the most popular gesture recognition technology solutions, that is, camera image processing, such as Microsoft's Kinect. The biggest advantage is that the user does not need to wear any equipment, and the human-computer interaction is good. However, this technology has relatively large restrictions on space and environment, so except for some large-scale somatosensory games, the promotion degree in life is not high.
3)基于SEMG的方案:处在多学科交叉的领域,除了和数据手套方案一样需要用户佩戴电极外,和机器视觉的方案类似,该方案也容易受外部环境和用户等因素的干扰。外部环境的干扰包括电磁环境的干扰和环境湿度变化造成的测量电极接触电阻的变化。同时,用户的身体和精神状态在不同的情况下具有不同的基准值,导致系统难以标定。3) The SEMG-based solution: In a multidisciplinary field, in addition to requiring the user to wear electrodes like the data glove solution, similar to the machine vision solution, this solution is also susceptible to interference from factors such as the external environment and users. The interference of the external environment includes the interference of the electromagnetic environment and the change of the contact resistance of the measuring electrode caused by the change of the environmental humidity. At the same time, the user's physical and mental state has different benchmark values in different situations, making it difficult to calibrate the system.
现有的人机交互技术存在以下缺点:The existing human-computer interaction technology has the following disadvantages:
(1)机械键盘交互是最成熟,但同时也是最低效的交互方式,因其成本、损耗、低效等各种原因,正在被移动设备慢慢淘汰;(1) Mechanical keyboard interaction is the most mature, but at the same time, the least efficient way of interaction. Due to various reasons such as cost, loss, and inefficiency, it is being gradually eliminated by mobile devices;
(2)触摸屏操作作为近几年新兴的人机交互方式,以其操作简洁、符合用户习惯、学习成本低、可多点触控等优点,成为新一代移动设备有效的交互方式。但是,触控技术不可避免的需要用户在触模板上进行操作,限制了触控技术的应用场景和范围。(2) Touch screen operation, as an emerging human-computer interaction method in recent years, has become an effective interaction method for a new generation of mobile devices due to its advantages of simple operation, user habit, low learning cost, and multi-touch. However, the touch technology inevitably requires the user to operate on the touch panel, which limits the application scenarios and scope of the touch technology.
(3)语音识别目前也是一种非常成熟的交互方式,识别率较高,尤其是随着iPhone中Siri的出现,基于自然语音的识别技术使得语音交互方式迈上了一个新的台阶。然而,语音交互方式会受到地区语言差异以及个人语音差异的影响,同时,自然语音识别需要时刻联网、能耗高的缺点,大大限制了语音技术在可穿戴设备中的应用。(3) Speech recognition is currently a very mature interaction method with a high recognition rate, especially with the emergence of Siri in the iPhone, the recognition technology based on natural speech has brought the speech interaction method to a new level. However, the voice interaction method will be affected by regional language differences and personal voice differences. At the same time, natural voice recognition needs to be connected to the Internet at all times and has high energy consumption, which greatly limits the application of voice technology in wearable devices.
鉴于上述问题,本案的发明人在专利文献1(中国专利公开号CN105929940A)中提出了一种基于细分特征值法的快速三维动态手势识别方法及系统,针对现有技术中存在的问题,设计了一个外观小巧的硬件平台,用加速度计来采集用户手势的3轴加速度,陀螺仪来采集用户手势的3轴角速度,磁力计来采集用户手势的3轴磁感应强度,使用滤波算法来消除数据误差之后再进行姿态解算,求出实时的三个姿态角(航向角、横滚角、俯仰角),然后将这些信息数据通过所提出的特征分析算法,在确保一定的精确度和实时性的基础上,分别对移动、旋转、打钩、画叉、晃动、敲击等自身具有一定区分度的三维动态手势进行识别,从而实现相应的应用。但是,专利文献1后仅针对单手手势识别,因此,亟需一种双手三维动态手势识别方法。In view of the above problems, the inventor of this case proposed a fast three-dimensional dynamic gesture recognition method and system based on the subdivision eigenvalue method in Patent Document 1 (Chinese Patent Publication No. CN105929940A), aiming at the problems existing in the prior art, the design A small-looking hardware platform is built, using an accelerometer to collect the 3-axis acceleration of the user gesture, a gyroscope to collect the 3-axis angular velocity of the user gesture, a magnetometer to collect the 3-axis magnetic induction intensity of the user gesture, and a filtering algorithm to eliminate data errors Afterwards, the attitude calculation is carried out to obtain the real-time three attitude angles (heading angle, roll angle, and pitch angle), and then these information data are passed through the proposed feature analysis algorithm to ensure certain accuracy and real-time performance. On the basis, three-dimensional dynamic gestures with a certain degree of discrimination, such as moving, rotating, ticking, crossing, shaking, and tapping, are recognized respectively, so as to realize corresponding applications. However, since Patent Document 1 is only for single-handed gesture recognition, there is an urgent need for a two-handed three-dimensional dynamic gesture recognition method.
发明内容Contents of the invention
本发明的目的在于克服双手手势识别不准确、不方便等问题,提供一种双手三维动态手势识别方法及系统。The object of the present invention is to overcome the problems of inaccurate and inconvenient recognition of two-hand gestures, and provide a method and system for two-hand three-dimensional dynamic gesture recognition.
为达上述目的,本发明通过以下技术方案实现:For reaching above-mentioned object, the present invention realizes by following technical scheme:
一种双手三维动态手势识别方法,利用两台移动终端,两台移动终端设备采用主从模式,将其中一台移动终端作为主机,称之为主移动终端,另一台移动终端作为从机,称之为从移动终端;两台移动终端分别跟踪左手和右手的手势;主移动终端向外界扫描可连接的外部设备,与从移动终端建立无线数据连接;主移动终端采集手势数据;从移动终端采集手势数据,并通过无线数据连接将手势数据实时的发送给主移动终端进行数据处理、同步及双手手势识别;主移动终端将主移动终端与从移动终端获得的手势数据所识别的结果发送给待控制设备。A two-hands three-dimensional dynamic gesture recognition method, using two mobile terminals, the two mobile terminal devices adopt a master-slave mode, and one of the mobile terminals is used as the master, which is called the master mobile terminal, and the other mobile terminal is used as the slave. It is called the slave mobile terminal; the two mobile terminals track the gestures of the left hand and the right hand respectively; the master mobile terminal scans the external devices that can be connected to the outside world, and establishes a wireless data connection with the slave mobile terminal; the master mobile terminal collects gesture data; the slave mobile terminal Collect gesture data, and send the gesture data to the main mobile terminal in real time through the wireless data connection for data processing, synchronization and two-hand gesture recognition; the main mobile terminal sends the recognition result of the main mobile terminal and the gesture data obtained from the mobile terminal to the The device to be controlled.
进一步地,所述移动终端设备上集成了9轴传感器模块,其中,3轴加速度计测量加速度、3轴陀螺仪测量角速度和3轴磁力计测量磁场。Further, the mobile terminal device is integrated with a 9-axis sensor module, wherein a 3-axis accelerometer measures acceleration, a 3-axis gyroscope measures angular velocity, and a 3-axis magnetometer measures magnetic field.
进一步地,所述同步的过程包括:利用移动端操作系统的键值检测KVO,监听两台移动终端设备是否完成手势的采集和识别,若一方完成,进入等待状态,外界操作对其无效;直到另一方也完成采集识别过程时,将数据指令打包发送给待控制设备,接着使两台移动终端设备设备进入就绪状态,等待下一次的双手手势。Further, the synchronization process includes: using the key value detection KVO of the mobile terminal operating system to monitor whether the two mobile terminal devices have completed the collection and recognition of gestures. If one of them is completed, it enters a waiting state, and external operations are invalid for it; until When the other party also completes the collection and identification process, it packs the data instructions and sends them to the device to be controlled, and then makes the two mobile terminal devices enter the ready state, waiting for the next gesture with both hands.
进一步地,在手势截取时,通过一个较强的阀值Ats来检测手势的运动,在此基础之上通过两个较小值的阀值Atb、Atf来判断手势的开始和结束,其中,Ats>Atf、Ats>Atb;需要限定Ats与Atf、Atb与两者时间差的大小,从而避免将一个手势的连续动作划分为多个手势;此外,需要限定Atf与Atb时间差的大小,避免将用户手部偶然的抖动识别为手势的开始;为了保证手势数据的完整性,Atb和Atf两处截取的时间序列数据需要往两端适当扩展时间长度Te。Further, when the gesture is intercepted, a stronger threshold Ats is used to detect the movement of the gesture, and on this basis, two smaller thresholds Atb and Atf are used to judge the start and end of the gesture, where Ats >Atf, Ats>Atb; it is necessary to limit the time difference between Ats and Atf, Atb and the two, so as to avoid dividing the continuous action of a gesture into multiple gestures; in addition, it is necessary to limit the time difference between Atf and Atb to avoid the user’s hand Part of the occasional jitter is recognized as the beginning of the gesture; in order to ensure the integrity of the gesture data, the time series data intercepted at Atb and Atf need to be appropriately extended to both ends of the time length Te.
具体地,首先检测到Ats的时间序列点,然后在Ats的基础之上通过向前、向后搜索,找到Atf和Atb的时间序列点,在此之上,适当扩展时间Te长度,获得手势的完整加速度数据。Specifically, first detect the time series points of Ats, and then search forward and backward on the basis of Ats to find the time series points of Atf and Atb. On top of this, appropriately expand the length of time Te to obtain the Complete acceleration data.
另一方面,本发明还提出了一种双手三维动态手势识别系统,所述系统包括两台移动终端、以及待控制设备;所述移动终端的每一台都包括手势数据采集单元、无线数据通信单元,两台移动终端设备采用主从模式,将其中一台移动终端作为主机,称之为主移动终端,另一台移动终端作为从机,称之为从移动终端;两台移动终端分别跟踪左手和右手的手势;主移动终端向外界扫描可连接的外部设备,与从移动终端建立无线数据连接;主移动终端的手势数据采集单元采集手势数据;从移动终端的手势数据采集单元采集手势数据,并通过无线数据通信单元将手势数据实时的发送给主移动终端进行数据处理、同步及双手手势识别;主移动终端通过无线数据通信单元将主移动终端与从移动终端获得的手势数据所识别的结果发送给待控制设备。On the other hand, the present invention also proposes a two-hands three-dimensional dynamic gesture recognition system, the system includes two mobile terminals, and a device to be controlled; each of the mobile terminals includes a gesture data acquisition unit, a wireless data communication Unit, two mobile terminal devices adopt the master-slave mode, one of the mobile terminals is used as the master, which is called the master mobile terminal, and the other mobile terminal is used as the slave, which is called the slave mobile terminal; the two mobile terminals track respectively Gestures of the left hand and right hand; the master mobile terminal scans external devices that can be connected to the outside world, and establishes a wireless data connection with the slave mobile terminal; the gesture data collection unit of the master mobile terminal collects gesture data; the gesture data collection unit of the slave mobile terminal collects gesture data , and send the gesture data to the main mobile terminal in real time through the wireless data communication unit for data processing, synchronization and gesture recognition with both hands; The result is sent to the device to be controlled.
进一步地,所述无线数据通信单元为蓝牙模块、WiFi模块或者RF模块。Further, the wireless data communication unit is a Bluetooth module, a WiFi module or an RF module.
进一步地,所述待控制设备为PC机、游戏机、无人机或者VR头盔。Further, the device to be controlled is a PC, a game console, a drone or a VR helmet.
进一步地,所述主移动终端为智能手机,所述从移动终端为可穿戴设备。或者,所述主移动终端和从移动终端都为智能手机。或者,所述主移动终端和从移动终端都为可穿戴设备。Further, the master mobile terminal is a smart phone, and the slave mobile terminal is a wearable device. Alternatively, both the master mobile terminal and the slave mobile terminal are smart phones. Alternatively, both the master mobile terminal and the slave mobile terminal are wearable devices.
本发明的有益效果是:本发明的双手手势识别方法直接以人手作为对智能设备的输入端,人机间的交互不再需要其他的媒介,提高了交互的效率并避免了因中间媒介所导致的各种误识别;扩展了智能设备的应用场合,使得人机交互不再局限于设备的特定部位,可以让用户在不方便讲话或不方便拿出设备的情况下可以进行一定的操作,比如在洗衣服时通过一定手势来接听手机来电;丰富了智能设备的应用场景,比如可以将双手手势操作与虚拟场景相结合,丰富了手持设备的操作性、娱乐性;双手手势输入提供了一种机器人示范学习的新思路,手势输入中重要的一环就是用户自定义手势;IOS智能移动端中的9轴传感器有效地增加了识别的精准度,丰富了实现的功能;采用蓝牙等无线通信技术进行数据传输,功耗低,并支持多用户同时使用。The beneficial effects of the present invention are: the two-handed gesture recognition method of the present invention directly uses human hands as the input terminal to the smart device, and the interaction between man and machine does not need other media, which improves the efficiency of interaction and avoids the problems caused by intermediate media. Various misidentifications; expand the application occasions of smart devices, so that human-computer interaction is no longer limited to specific parts of the device, allowing users to perform certain operations when it is inconvenient to speak or take out the device, such as Use certain gestures to answer mobile phone calls while washing clothes; enrich the application scenarios of smart devices, for example, the combination of two-hand gesture operation and virtual scene can enrich the operability and entertainment of handheld devices; two-hand gesture input provides a A new idea of robot demonstration learning, an important part of gesture input is user-defined gestures; the 9-axis sensor in the IOS smart mobile terminal effectively increases the accuracy of recognition and enriches the realized functions; it adopts wireless communication technologies such as Bluetooth For data transmission, low power consumption, and supports simultaneous use by multiple users.
附图说明Description of drawings
图1是本发明的方法所基于的双手势识别系统的硬件框图;Fig. 1 is the hardware block diagram of the double gesture recognition system that the method of the present invention is based on;
图2是双手手势数据同步示意图;Fig. 2 is a schematic diagram of hand gesture data synchronization;
图3是移动设备的手势识别系统实现方案流程图;Fig. 3 is the flowchart of the implementation scheme of the gesture recognition system of the mobile device;
图4(a)是基于FBGD的手势数据截取示意图;Figure 4(a) is a schematic diagram of gesture data interception based on FBGD;
图4(b)是基于FBGD的手势数据截取后的数据。Figure 4(b) is the intercepted data based on FBGD gesture data.
具体实施方案specific implementation plan
下面通过具体实施方式结合附图对本发明作进一步详细说明。The present invention will be further described in detail below through specific embodiments in conjunction with the accompanying drawings.
智能移动端设备已经全面普及,基于移动终端平台开发手势识别,将会让更多的人使用上手势识别的便利。移动终端设备上集成了9轴传感器模块(3轴加速度计测量加速度、3轴陀螺仪测量角速度和3轴磁力计测量磁场),并且拥有较高的识别精度,可以实现对于复杂手势的有效识别。本发明利用手机与手表或手环二者配合交互,实现数据的交互,实现双手手势交互,极大的丰富了单一手势的功能。Smart mobile devices have been fully popularized, and the development of gesture recognition based on mobile terminal platforms will allow more people to use the convenience of gesture recognition. The mobile terminal device integrates a 9-axis sensor module (3-axis accelerometer to measure acceleration, 3-axis gyroscope to measure angular velocity, and 3-axis magnetometer to measure magnetic field), and has high recognition accuracy, which can realize effective recognition of complex gestures. The present invention utilizes the cooperation and interaction of the mobile phone and the watch or bracelet to realize data interaction and gesture interaction with both hands, which greatly enriches the function of a single gesture.
本发明的方法所基于的双手势识别系统的硬件框图如附图1所示,包括两个移动终端设备、以及待控制设备,如PC机、游戏机、无人机等。三方之间的通信通过蓝牙技术实现。两个移动终端设备采用主从模式,将其中一台移动终端(如iPhone)作为主机,称之为主移动终端,另一台移动终端(如iWatch)作为从机,称之为从移动终端;两台移动终端分别跟踪左手和右手的手势(例如,左手手握iPhone,右手佩戴iWatch)。主移动终端向外界扫描可连接的外部设备,与从移动终端建立蓝牙数据连接;从移动终端采集手势数据,并通过蓝牙将手势数据实时的发送给主移动终端进行数据处理、同步及双手手势识别;接着,将主移动终端与从移动终端获得的手势数据所识别的结果发送给待控制设备的蓝牙接收模块;最后,待控制设备根据所述识别结果进行对应的控制操作。The hardware block diagram of the dual-gesture recognition system based on the method of the present invention is shown in Figure 1, including two mobile terminal devices and devices to be controlled, such as PCs, game consoles, and drones. The communication between the three parties is realized through Bluetooth technology. The two mobile terminal devices adopt the master-slave mode, and one of the mobile terminals (such as iPhone) is used as the master, which is called the master mobile terminal, and the other mobile terminal (such as iWatch) is used as the slave, which is called the slave mobile terminal; The two mobile terminals track the gestures of the left hand and the right hand respectively (for example, the left hand holds the iPhone and the right hand wears the iWatch). The master mobile terminal scans the external devices that can be connected to the outside world, establishes a Bluetooth data connection with the slave mobile terminal; collects gesture data from the mobile terminal, and sends the gesture data to the master mobile terminal in real time through Bluetooth for data processing, synchronization and two-hand gesture recognition ; Next, send the recognition result of the master mobile terminal and the gesture data obtained from the mobile terminal to the Bluetooth receiving module of the device to be controlled; finally, the device to be controlled performs corresponding control operations according to the recognition result.
由于双手手势存在动作先后的顺序,当一方采集到手势时,也许另一方还未开始,或者也还在识别中。这样就造成了双手数据的不同步的问题,因此必须制定一套数据完成协议来控制手势的异步问题。本发明提出利用移动端操作系统的键值检测(KVO),监听两台设备是否完成手势的采集和识别。若一方完成,进入等待状态,外界操作对其无效。直到另一方也完成采集识别过程时,将数据指令打包发送给待控制设备,接着使设备进入就绪状态,等待下一次的双手手势。其运行原理如附图2所示。Since there is a sequence of actions in the gestures of both hands, when one party collects gestures, the other party may not have started or is still in the process of recognition. This has caused the problem of asynchronous data of both hands, so a set of data completion protocol must be developed to control the asynchronous problem of gestures. The present invention proposes to use the key-value detection (KVO) of the operating system of the mobile terminal to monitor whether the two devices have completed the collection and recognition of gestures. If one party completes, it enters the waiting state, and external operations are invalid for it. When the other party also completes the collection and identification process, the data command is packaged and sent to the device to be controlled, and then the device enters the ready state, waiting for the next gesture of both hands. Its operating principle is shown in Figure 2.
手势识别的本质是根据手势模型利用手势识别算法将手势动作分类。手势识别方法的优劣直接关系到手势识别的效率和精度。常用的手势识别方法包括:The essence of gesture recognition is to use gesture recognition algorithms to classify gesture actions according to the gesture model. The pros and cons of gesture recognition methods are directly related to the efficiency and accuracy of gesture recognition. Common gesture recognition methods include:
(1)DTW算法是把时间规整和距离测度计算结合起来的一种非线性规整技术,具有非线性时间归一化效果。使用某种指定属性的非线性规整函数对时间轴上的波动近似建模,通过伸缩其中一个模式的时间轴使之跟另一个模式达到最大程度的重叠,使残差距离最小,从而消除两个时空表示模式之间的时间差别。实际上它是,隐马尔可夫模型的简化,对于比较简单的时间序列,他们两者是等价的。方法允许在测试模式和参考模式之间有充分的弹性,从而实现分类。(1) The DTW algorithm is a nonlinear warping technique that combines time warping and distance measure calculation, and has a nonlinear time normalization effect. Using a nonlinear regularization function with specified properties to approximate the fluctuations on the time axis, by stretching the time axis of one of the modes to maximize the overlap with the other mode, the residual distance is minimized, thereby eliminating the two Spatiotemporal representations of time differences between patterns. In fact, it is a simplification of the hidden Markov model. For simpler time series, the two of them are equivalent. Methods allow sufficient flexibility between test and reference patterns to enable classification.
(2)隐马尔可夫模型是马尔可夫模型的扩展。马尔可夫模型描述的是一个随机过程与状态之间的转移。隐马尔可夫模型描述两个随机过程一个随机过程描述输出与状态间的概率关系,即输出是状态的随机过程函数另一个随机过程描述状态之间的转移关系。观察者能够看到输出,却不能看到状态之间的转移,也就是状态之间的转移是隐含的。由于隐马尔可夫模型拓扑结构的特殊性,导致其在分析手势信号时过于复杂,使得训练和识别的计算量大,尤其是在连续的隐马尔可夫模型中,需要计算大量的状态概率密度,需要估计的参数个数较多,使得训练及识别的速度相对较慢。(2) Hidden Markov model is an extension of Markov model. The Markov model describes a random process and transitions between states. Hidden Markov model describes two random processes, one random process describes the probability relationship between the output and the state, that is, the output is a random process function of the state, and the other stochastic process describes the transition relationship between the states. The observer can see the output, but cannot see the transition between states, that is, the transition between states is implicit. Due to the particularity of the hidden Markov model topology, it is too complicated when analyzing gesture signals, which makes the calculation of training and recognition large, especially in the continuous hidden Markov model, it is necessary to calculate a large number of state probability densities , the number of parameters to be estimated is large, making the speed of training and recognition relatively slow.
(3)人工神经网络,在手势识别领域,人工神经网络是一种应用非常广泛的工具。人工神经网络具有自组织和自学习能力,抗噪声能力强,具有很强的容错性和鲁棒性。人工神经网络是通过把大量的简单处理单元广泛连接起来而构成一种复杂的信息处理网络,其中处理单元及其相互连接模式是借鉴人脑神经元的结构及连接机制设计的。这种网络具有与人脑相类似的学习记忆能力、知识概括和输入信息特征抽取能力。经过多年的发展,人工神经网络已经具有很多模型,例如模糊神经网络和BP神经网络。目前应用比较广泛的是以反向传播学习算法为基础的BP神经网络。(3) Artificial neural network. In the field of gesture recognition, artificial neural network is a very widely used tool. Artificial neural network has self-organization and self-learning ability, strong anti-noise ability, strong fault tolerance and robustness. The artificial neural network forms a complex information processing network by extensively connecting a large number of simple processing units. The processing units and their interconnection modes are designed by referring to the structure and connection mechanism of human brain neurons. This kind of network has the ability of learning and memory, knowledge generalization and input information feature extraction similar to the human brain. After years of development, artificial neural networks already have many models, such as fuzzy neural network and BP neural network. The BP neural network based on the backpropagation learning algorithm is widely used at present.
(4)机器学习,常见的基于机器学习的分类方法有K_最邻近方法和支持向量机的方法。K_最邻近方法的基本思想是根据传统的向量空间模型,将文本内容形式化为特征空间中的加权特征向量,对于一个测试对象,计算其与训练样本集中的每个样本相似度,找出个K最相似的文本,根据加权距离判断测试对象所属的类别。支持向量机以训练误差作为优化问题的约束条件,以置信范围值最小化作为目标,是一种基于结构风险最小化准则的学习方法。支持向量机的计算复杂度取决于支持向量的数目,而不是样本空间的维数,这在某种意义上避免了“维数空间”。而且在该方法中,增删非支持向量样本对模型没有影响,对函数的选择也不是很敏感。此外,支持向量样本集有一定的鲁棒性。(4) Machine learning, common classification methods based on machine learning include K_nearest neighbor method and support vector machine method. The basic idea of the K_nearest neighbor method is to formalize the text content into a weighted feature vector in the feature space according to the traditional vector space model. For a test object, calculate its similarity with each sample in the training sample set, and find out K most similar texts, according to the weighted distance to judge the category of the test object. The support vector machine takes the training error as the constraint condition of the optimization problem, and takes the minimization of the confidence range value as the goal. It is a learning method based on the criterion of structural risk minimization. The computational complexity of SVM depends on the number of support vectors, not the dimensionality of the sample space, which in a sense avoids the "dimensional space". Moreover, in this method, adding and deleting non-support vector samples has no effect on the model, and it is not very sensitive to the choice of function. In addition, the support vector sample set has a certain robustness.
除此之外,还有HMM和阀值比较结合的方法,HMM与神经网络相结合的方法,贝叶斯网络和支持向量机相结合的方法,利用手势倾斜特性等特征的识别方法。In addition, there are methods of combining HMM and threshold comparison, methods of combining HMM and neural network, methods of combining Bayesian network and support vector machine, and recognition methods using features such as gesture tilt characteristics.
因为对手势识别速度要求较高,本发明采用E_DTW动态时间模板匹配法与特征粗分类相互结合的方法。因此在选择手势动作时,需要对手势进行大量的样本观察和分析,通过样本采集和波形观察,获取手势识别的标准模板,同时构建自定义手势识别数据库实现,实现动态的手势模板匹配。目前暂时采用以下的8类手势作为系统默认的手势。如表1所示:Because the speed of gesture recognition is high, the present invention adopts the method of combining the E_DTW dynamic time template matching method and feature rough classification. Therefore, when selecting a gesture action, it is necessary to observe and analyze a large number of samples of the gesture. Through sample collection and waveform observation, the standard template for gesture recognition is obtained, and a custom gesture recognition database is constructed to realize dynamic gesture template matching. Currently, the following 8 types of gestures are temporarily used as the default gestures of the system. As shown in Table 1:
表1手势集定义Table 1 Gesture set definition
上述的手势动作之间具有一定的区分度,它们各自的传感器信息所构成的特征量之间具有比较高的可识别性,因此可以根据不同手势的特征值建立不同的分类器。首先根据计算出来的特征值对八大类手势进行预分成3类,识别出其所在的类别,再设计算法针对旋转类和移动类动作进行细划分,识别出运动的方向,从而是想利用细分特征值分析法快速高效地实现手势的分类和识别。The above-mentioned gestures have a certain degree of discrimination, and the feature quantities formed by their respective sensor information have relatively high recognizability, so different classifiers can be established according to the feature values of different gestures. First, the eight categories of gestures are pre-divided into 3 categories according to the calculated eigenvalues, and the categories are identified, and then the algorithm is designed to subdivide the rotation and movement movements, and identify the direction of motion, so that we want to use subdivision The eigenvalue analysis method realizes the classification and recognition of gestures quickly and efficiently.
移动设备的手势识别系统实现方案流程如附图3所示。The process flow of the gesture recognition system implementation scheme for mobile devices is shown in Figure 3.
DTW模板匹配识别技术,其核心思想是将输入的原始数据与预先存储的模板进行匹配,通过测量两个模板之间的相似度来完成识别任务。常用的距离相似度计算方法有加权欧氏距离法、相关系数法以及对数距离法。模板匹配需要解决输入数据与预先存储模板时间序列长度不一致的问题,因为即使是同一个手势,其持续时间都会随机的改变。为了解决这个时间校准问题,典型的时序模板匹配是DTW。DTW template matching recognition technology, its core idea is to match the input raw data with the pre-stored template, and complete the recognition task by measuring the similarity between the two templates. Commonly used distance similarity calculation methods include weighted Euclidean distance method, correlation coefficient method and logarithmic distance method. Template matching needs to solve the problem that the input data is inconsistent with the length of the pre-stored template time series, because even for the same gesture, its duration will change randomly. To solve this time alignment problem, a typical timing template matching is DTW.
针对手势数据的长度特征进行预分类,依据不同手势的长度和其能量的大小,来实现对于差异较大的手势进行预分类。因此该前提是对手势数据有效的截取,即从所有读取到的原始数据中截取从手势的起点到终点的段落,然后再对截取得来的数据段进行坐标系的转换。进行了切割和坐标系转换之后的手势数据段才能用来提取特征进行手势识别,接下来,将对手势切割、坐标系转换和特征识别法进行详细介绍。Pre-classify the length characteristics of gesture data, and realize the pre-classification of gestures with large differences according to the length and energy of different gestures. Therefore, the premise is to effectively intercept the gesture data, that is, intercept the paragraph from the start point to the end point of the gesture from all the read raw data, and then convert the coordinate system of the intercepted data segment. The gesture data segment after cutting and coordinate system conversion can be used to extract features for gesture recognition. Next, the gesture cutting, coordinate system conversion and feature recognition methods will be introduced in detail.
由于用户移动或者在手势过程中手不可避免的抖动以及传感器本身精度的影响,加速度传感器采集的手势动作数据不可避免地受到噪声干扰,出现上下抖动的情况。平稳去噪主要是对环境噪声等干扰数据进行处理,尽量将影响降至最低。我采用,简单移动平均线滤波器的方法对获取的加速度数据进行平稳去噪处理。SMA能在保持快速响应的基础上滤去随机噪声。SMA滤波器的推导公式如下所示:Due to the inevitable shaking of the hand during the user's movement or gestures and the influence of the accuracy of the sensor itself, the gesture data collected by the acceleration sensor is inevitably disturbed by noise, resulting in up and down jitter. Smooth denoising is mainly to process the interference data such as environmental noise, and minimize the impact as much as possible. I use a simple moving average filter method to smooth and denoise the acquired acceleration data. SMA can filter out random noise while maintaining fast response. The derivation formula of the SMA filter is as follows:
SMAnow=(Xi+Xi-1+....+Xi-n+1)/n;n=1,2,3,4,.......SMA now =(X i +X i-1 +....+X i-n+1 )/n; n=1,2,3,4,....
上式中n表示数据序列的长度。n的大小关系到平滑的效果。n太小,平稳效果不明显n过大,平稳效果好,但是容易导致手势信息丢失。参照经验,视不同的情况,n一般取5至15。In the above formula, n represents the length of the data sequence. The size of n is related to the smoothing effect. If n is too small, the stabilization effect is not obvious. If n is too large, the stabilization effect is good, but it is easy to cause gesture information loss. Referring to experience, depending on different situations, n generally takes 5 to 15.
计算的时候,可以直接使用下式When calculating, you can directly use the following formula
SMAnow=SMAprevious-Xi-n/n+Xi/n。SMA now = SMA previous -X in /n+X i /n.
手势动作的开始和结束的判定在手势识别中起着至关重要的作用。本文前述部分已经提及大多数基于加速度传感器的手势识别在手势执行的过程中需要按钮等额外的操作来告知手势识别系统手势的开始和结束。通过按按钮这种方式来判定手势的开始和结束,确实可以保证手势识别系统对手势动作准确采样,但是其存在以下不足一是用户在按按钮的过程中,由于手部的抖动,必然会对手势数据带来噪声干扰二是用户在按按钮的过程中,需要通过眼睛来确定按钮的位置,分散了用户的注意力,降低了用户的体验度,不利于更加自由、自然的人机交互三是对于盲哑人或者手指不灵活的人,按按钮等操作对其比较困难,不利于手势识别系统的推广普及。The determination of the start and end of a gesture action plays a crucial role in gesture recognition. The previous part of this article has mentioned that most gesture recognition based on acceleration sensors require additional operations such as buttons to inform the gesture recognition system of the start and end of the gesture during the gesture execution process. By pressing the button to determine the start and end of the gesture, it is indeed possible to ensure that the gesture recognition system accurately samples the gesture action, but it has the following shortcomings. Gesture data brings noise interference. The second is that the user needs to determine the position of the button through the eyes when pressing the button, which distracts the user's attention, reduces the user experience, and is not conducive to a more free and natural human-computer interaction. Especially for blind people or people with inflexible fingers, it is more difficult to press buttons and other operations, which is not conducive to the promotion and popularization of gesture recognition systems.
在基于门限值的手势动作判定中,阀值的设置很大程度上决定了判定的准确性。如果阀值过低,用户不经意之间的细微动作都会判定为手势的开始,引起误判如果阀值过高,则会错过对正常手势的判定。为此,结合手势动作的特点,发明采用FBGD的手势判定方法。通过一个较强的阀值Ats来检测手势的运动,在此基础之上通过两个较小值的阀值Atf、Atb来判断手势的开始和结束,Ats、Atf、Atb为时间T内,手势加速度数据序列的最大值和最小值之差的阀值。同时,我们限定Ats与Atf、Atb与两者时间差的大小,从而避免将一个手势的连续动作划分为多个手势。此外,需要限定Atf与Atb时间差的大小,避免将用户手部偶然的抖动识别为手势的开始。为了保证手势数据的完整性,Atb和Atf两处截取的时间序列数据需要往两端适当扩展时间长度Te。如附图4(a)和附图4(b)所示,当手势的数据经平滑后,送入手势检测时,首先检测到Ats的时间序列点,然后在Ats的基础之上通过向前、向后搜索,找到Atf和Atb的时间序列点,在此之上,适当扩展时间Te长度,获得手势的完整加速度数据。In threshold-based gesture judgment, the setting of the threshold largely determines the accuracy of the judgment. If the threshold is too low, the user's inadvertent subtle movements will be judged as the beginning of the gesture, causing misjudgment. If the threshold is too high, the judgment of normal gestures will be missed. Therefore, in combination with the characteristics of gesture actions, a gesture determination method using FBGD is invented. A strong threshold Ats is used to detect the motion of the gesture, and on this basis, two smaller thresholds Atf and Atb are used to judge the start and end of the gesture. Ats, Atf and Atb are within the time T. Threshold for the difference between the maximum and minimum values of the acceleration data series. At the same time, we limit the size of the time difference between Ats and Atf, Atb and the two, so as to avoid dividing the continuous action of one gesture into multiple gestures. In addition, it is necessary to limit the size of the time difference between Atf and Atb, so as to avoid recognizing the occasional shaking of the user's hand as the beginning of the gesture. In order to ensure the integrity of the gesture data, the time series data intercepted at Atb and Atf need to be appropriately extended to both ends of the time length Te. As shown in Figure 4(a) and Figure 4(b), when the gesture data is smoothed and sent to gesture detection, the time series point of Ats is first detected, and then based on Ats, it passes forward , and search backward to find the time series points of Atf and Atb, above which, the length of time Te is appropriately extended to obtain the complete acceleration data of the gesture.
不同手势数据数据长度各不相同,幅度也不相同。即便是同一个人的同一种手势间,也存在差异,因此必须将数据归一化到同样幅度范围,采样到相同的长度,这样的优点有:1.统一手势量化标准,避免幅度过大或者过小的手势被误判;2.在保证波形不失真前提下,固定采样点,实际上是减少了运算复杂度,提高识别速度。Different gesture data have different data lengths and different amplitudes. Even among the same gestures of the same person, there are differences. Therefore, the data must be normalized to the same amplitude range and sampled to the same length. The advantages of this are: 1. Unify the gesture quantification standard to avoid excessive or excessive amplitude. Small gestures are misjudged; 2. On the premise of ensuring that the waveform is not distorted, the fixed sampling point actually reduces the computational complexity and improves the recognition speed.
本发明所采用的手势识别算法包括但不限于上述算法,也可以采用专利文献1中记载的基于细分特征值法的快速三维动态手势识别方法。The gesture recognition algorithm used in the present invention includes but is not limited to the above-mentioned algorithm, and the fast three-dimensional dynamic gesture recognition method based on the subdivision eigenvalue method described in Patent Document 1 can also be used.
实施例1Example 1
传统的无人机飞行控制,一般是采用RF射频遥控器。它外部拥有一个长的天线,遥控指令都是通过机壳外部的控制开关和按钮,经过内部电路的调制、编码,再通过高频信号放大电路由天线将电磁波发射出去。目前常用的遥控发射机有三种类型:盒式按键手持式、便携杆式遥控、手持枪式遥控。例如遥控杆式发射机有两个操纵杆,左边杆用来控制无人机的上升和下降,右边杆控制无人机飞行方向。中央配有液晶显示屏幕,显示工作状态和功能。其优点是通信距离远,功能集成度高,能适应无人机在不同环境下的飞行控制。但其对人员操作要求较高,一般需要进行专业培训。Traditional UAV flight control generally uses RF radio frequency remote control. It has a long antenna outside, and the remote control commands are all through the control switches and buttons outside the casing, modulated and coded by the internal circuit, and then the electromagnetic waves are emitted by the antenna through the high-frequency signal amplification circuit. There are three types of remote control transmitters commonly used at present: box-type button-type handheld, portable rod-type remote control, and hand-held gun-type remote control. For example, the remote control stick transmitter has two joysticks, the left stick is used to control the ascent and descent of the drone, and the right stick controls the flight direction of the drone. The center is equipped with a liquid crystal display screen to display the working status and functions. Its advantages are long communication distance and high functional integration, which can adapt to the flight control of UAVs in different environments. However, it has high requirements for personnel operation, and generally requires professional training.
随着手势识别的深入研究,可将手势识别与无人机控制相结合,实现手势控制无人机飞行轨迹。通过移动端设备对空间手势轨迹进行识别,将识别结果通过无线设备向无人机发送控制指令。在保障无人机稳定安全飞行的前提下,用户可以自定义无人机的飞行轨迹。本发明是基于MEMS传感器的动态手势轨迹识别。对有效的轨迹样本进行训练,并对输入手势的质心轨迹进行识别,从而达到控制无人机飞行轨迹的需求。将手势与飞行器的系统进行整合,在系统中实现了对无人机飞行轨迹的手势控制。With the in-depth study of gesture recognition, gesture recognition can be combined with UAV control to realize gesture control of UAV flight trajectory. The space gesture trajectory is recognized through the mobile terminal device, and the recognition result is sent to the drone through the wireless device to control the command. Under the premise of ensuring the stable and safe flight of the UAV, users can customize the flight trajectory of the UAV. The invention is based on the dynamic gesture trajectory recognition of the MEMS sensor. Train the valid trajectory samples and recognize the centroid trajectory of the input gesture, so as to meet the needs of controlling the flight trajectory of the drone. The gesture is integrated with the aircraft system, and the gesture control of the drone's flight trajectory is realized in the system.
实施例2Example 2
第一人称视角的游戏中,结合手势识别,将手势识别设备平台当做游戏手柄来使用。此外,目前市面上流行的基于机器视觉的体感游戏灵敏度仍不够,体验效果不佳。如果能在智能终端上集成摄像头利用视觉和硬件平台,相结合,就能确定硬件平台的位置,再进一步通过硬件平台上的陀螺仪等数据进行精细定位,在很大程度上提高了识别精度和灵敏度。由于双手的识别,能运用于许多的大型竞技格斗的游戏中,提供更加良好的人机交互体验。In the first-person perspective game, combined with gesture recognition, the gesture recognition device platform is used as a gamepad. In addition, the currently popular somatosensory games based on machine vision on the market are still not sensitive enough, and the experience effect is not good. If the camera can be integrated on the smart terminal and combined with the vision and hardware platform, the position of the hardware platform can be determined, and then fine positioning can be carried out through the gyroscope and other data on the hardware platform, which greatly improves the recognition accuracy and sensitivity. Due to the recognition of both hands, it can be used in many large-scale competitive fighting games to provide a better human-computer interaction experience.
实施例3Example 3
双手设备,结合VR头盔,实现沉浸式的用户游戏体验,利用头盔投影出3D影像,结合移动设备,就可以实现身历其境的游戏体验。因为双手的扩展性,能实现众多的交互方式,能给用户一般的操作手柄所不能提供的丰富功能。Two-handed devices, combined with VR helmets, can realize immersive user gaming experience, use the helmets to project 3D images, and combine with mobile devices, you can realize immersive gaming experience. Because of the expansibility of both hands, it can realize many interactive modes, and can provide users with rich functions that ordinary operating handles cannot provide.
实施例4Example 4
PC端广泛存在需要方向性控制的操作,如视频的播放、图片的切换、网页的上下滑动等,传统的鼠标控制在距离上限制较大,操控性低,影响人与PC端的友好交互。There are many operations that require directional control on the PC side, such as video playback, picture switching, and web page sliding up and down. Traditional mouse control has a relatively large distance limit and low maneuverability, which affects the friendly interaction between people and the PC side.
目前PC端看视频时快进和后退,基本上是通过鼠标和键盘实现,然而对于使用者来说并不是十分的便利,因此提出使用,便携式的移动设备,利用手势识别,来控制视频的播放,或者图片查看器的缩放,切换的功能。可以定义了四种手势,分别与这四种命令相对应,最终通过手势就达到了控制播放器播放的目的,且由于手势识别设备的普及性,进一步满足大多数的用户。At present, fast forward and rewind when watching video on the PC end are basically realized through the mouse and keyboard, but it is not very convenient for users. Therefore, it is proposed to use a portable mobile device to use gesture recognition to control video playback. , or the zooming and switching functions of the picture viewer. Four kinds of gestures can be defined, which correspond to these four kinds of commands respectively. Finally, the purpose of controlling the playback of the player is achieved through gestures, and due to the popularity of gesture recognition devices, it further satisfies most users.
以上内容是结合具体的优选实施方式对本发明所作的进一步详细说明,不能认定本发明的具体实施只局限于这些说明。对于本发明所属技术领域的普通技术人员来说,在不脱离本发明构思的前提下,还可以做出若干简单推演或替换,都应当视为属于本发明的保护范围。The above content is a further detailed description of the present invention in conjunction with specific preferred embodiments, and it cannot be assumed that the specific implementation of the present invention is limited to these descriptions. For those of ordinary skill in the technical field of the present invention, without departing from the concept of the present invention, some simple deduction or replacement can be made, which should be regarded as belonging to the protection scope of the present invention.
Claims (10)
Priority Applications (1)
Application Number | Priority Date | Filing Date | Title |
---|---|---|---|
CN201610998758.8A CN106648068A (en) | 2016-11-11 | 2016-11-11 | Method for recognizing three-dimensional dynamic gesture by two hands |
Applications Claiming Priority (1)
Application Number | Priority Date | Filing Date | Title |
---|---|---|---|
CN201610998758.8A CN106648068A (en) | 2016-11-11 | 2016-11-11 | Method for recognizing three-dimensional dynamic gesture by two hands |
Publications (1)
Publication Number | Publication Date |
---|---|
CN106648068A true CN106648068A (en) | 2017-05-10 |
Family
ID=58806616
Family Applications (1)
Application Number | Title | Priority Date | Filing Date |
---|---|---|---|
CN201610998758.8A Pending CN106648068A (en) | 2016-11-11 | 2016-11-11 | Method for recognizing three-dimensional dynamic gesture by two hands |
Country Status (1)
Country | Link |
---|---|
CN (1) | CN106648068A (en) |
Cited By (9)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
CN107346207A (en) * | 2017-06-30 | 2017-11-14 | 广州幻境科技有限公司 | A kind of dynamic gesture cutting recognition methods based on HMM |
CN107592422A (en) * | 2017-09-20 | 2018-01-16 | 上海交通大学 | A kind of identity identifying method and system based on gesture feature |
CN108052237A (en) * | 2018-01-05 | 2018-05-18 | 上海昶音通讯科技有限公司 | 3D projection touch device and touch method thereof |
CN108375913A (en) * | 2018-03-28 | 2018-08-07 | 山东大学 | It is a kind of based on the smart home things system and its operation method of NAO robots and application |
CN109313455A (en) * | 2017-11-16 | 2019-02-05 | 深圳市大疆创新科技有限公司 | Intelligent glasses and its method, holder, control method and the unmanned plane for controlling holder |
CN109445464A (en) * | 2019-01-08 | 2019-03-08 | 深圳市道通智能航空技术有限公司 | A kind of flight control method and flight control system |
CN113268143A (en) * | 2020-09-29 | 2021-08-17 | 中国人民解放军军事科学院国防科技创新研究院 | Multimodal man-machine interaction method based on reinforcement learning |
CN113574496A (en) * | 2019-07-12 | 2021-10-29 | Oppo广东移动通信有限公司 | Method for recognizing gesture for operating terminal device and terminal device |
CN114115537A (en) * | 2021-11-22 | 2022-03-01 | 中国电子科技集团公司第五十四研究所 | Gesture control method and system for unmanned system |
Citations (4)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
CN104134060A (en) * | 2014-08-03 | 2014-11-05 | 上海威璞电子科技有限公司 | Sign language interpreting, displaying and sound producing system based on electromyographic signals and motion sensors |
CN104601795A (en) * | 2014-11-03 | 2015-05-06 | 中国科学技术大学苏州研究院 | Convenient and efficient smartphone user hand identification method |
CN105160323A (en) * | 2015-09-07 | 2015-12-16 | 哈尔滨市一舍科技有限公司 | Gesture identification method |
CN105929940A (en) * | 2016-04-13 | 2016-09-07 | 哈尔滨工业大学深圳研究生院 | Rapid three-dimensional dynamic gesture recognition method and system based on character value subdivision method |
-
2016
- 2016-11-11 CN CN201610998758.8A patent/CN106648068A/en active Pending
Patent Citations (4)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
CN104134060A (en) * | 2014-08-03 | 2014-11-05 | 上海威璞电子科技有限公司 | Sign language interpreting, displaying and sound producing system based on electromyographic signals and motion sensors |
CN104601795A (en) * | 2014-11-03 | 2015-05-06 | 中国科学技术大学苏州研究院 | Convenient and efficient smartphone user hand identification method |
CN105160323A (en) * | 2015-09-07 | 2015-12-16 | 哈尔滨市一舍科技有限公司 | Gesture identification method |
CN105929940A (en) * | 2016-04-13 | 2016-09-07 | 哈尔滨工业大学深圳研究生院 | Rapid three-dimensional dynamic gesture recognition method and system based on character value subdivision method |
Cited By (13)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
CN107346207A (en) * | 2017-06-30 | 2017-11-14 | 广州幻境科技有限公司 | A kind of dynamic gesture cutting recognition methods based on HMM |
CN107346207B (en) * | 2017-06-30 | 2019-12-20 | 广州幻境科技有限公司 | Dynamic gesture segmentation recognition method based on hidden Markov model |
CN107592422B (en) * | 2017-09-20 | 2019-07-02 | 上海交通大学 | A method and system for identity authentication based on gesture features |
CN107592422A (en) * | 2017-09-20 | 2018-01-16 | 上海交通大学 | A kind of identity identifying method and system based on gesture feature |
CN109313455B (en) * | 2017-11-16 | 2021-09-28 | 深圳市大疆创新科技有限公司 | Intelligent glasses, method for controlling holder of intelligent glasses, holder, control method and unmanned aerial vehicle |
CN109313455A (en) * | 2017-11-16 | 2019-02-05 | 深圳市大疆创新科技有限公司 | Intelligent glasses and its method, holder, control method and the unmanned plane for controlling holder |
CN108052237A (en) * | 2018-01-05 | 2018-05-18 | 上海昶音通讯科技有限公司 | 3D projection touch device and touch method thereof |
CN108052237B (en) * | 2018-01-05 | 2022-01-14 | 上海昶音通讯科技有限公司 | 3D projection touch device and touch method thereof |
CN108375913A (en) * | 2018-03-28 | 2018-08-07 | 山东大学 | It is a kind of based on the smart home things system and its operation method of NAO robots and application |
CN109445464A (en) * | 2019-01-08 | 2019-03-08 | 深圳市道通智能航空技术有限公司 | A kind of flight control method and flight control system |
CN113574496A (en) * | 2019-07-12 | 2021-10-29 | Oppo广东移动通信有限公司 | Method for recognizing gesture for operating terminal device and terminal device |
CN113268143A (en) * | 2020-09-29 | 2021-08-17 | 中国人民解放军军事科学院国防科技创新研究院 | Multimodal man-machine interaction method based on reinforcement learning |
CN114115537A (en) * | 2021-11-22 | 2022-03-01 | 中国电子科技集团公司第五十四研究所 | Gesture control method and system for unmanned system |
Similar Documents
Publication | Publication Date | Title |
---|---|---|
CN106648068A (en) | Method for recognizing three-dimensional dynamic gesture by two hands | |
CN105929940B (en) | Fast 3D dynamic gesture recognition method and system based on subdivision eigenvalue method | |
US11383166B2 (en) | Interaction method of application scene, mobile terminal, and storage medium | |
Yang et al. | Gesture interaction in virtual reality | |
KR102792655B1 (en) | Gesture control system using biopotentials and vectors | |
KR102531542B1 (en) | Virual user input controls in a mixed reality environment | |
WO2021082749A1 (en) | Action identification method based on artificial intelligence and related apparatus | |
Gong et al. | Pyro: Thumb-tip gesture recognition using pyroelectric infrared sensing | |
CN205721628U (en) | A kind of quick three-dimensional dynamic hand gesture recognition system and gesture data collecting device | |
CN102789313B (en) | User interaction system and method | |
US11822729B2 (en) | Systems and methods for gesture-based control | |
LaViola Jr | 3d gestural interaction: The state of the field | |
CN117178247A (en) | Gestures for animating and controlling virtual and graphical elements | |
US20130010207A1 (en) | Gesture based interactive control of electronic equipment | |
CN118295526A (en) | Systems, methods, and interfaces for performing input based on neuromuscular control | |
Moazen et al. | AirDraw: Leveraging smart watch motion sensors for mobile human computer interactions | |
CN104731307B (en) | A kind of body-sensing action identification method and human-computer interaction device | |
WO2010011923A1 (en) | Enhanced detection of circular engagement gesture | |
CN104780217A (en) | Method, system and client terminal for detecting working efficiency of user | |
Luo et al. | Wearable air-writing recognition system employing dynamic time warping | |
Li et al. | Hand gesture recognition and real-time game control based on a wearable band with 6-axis sensors | |
Linqin et al. | Dynamic hand gesture recognition using RGB-D data for natural human-computer interaction | |
CN111461059A (en) | A multi-region multi-class scalable gesture recognition control device and control method | |
CN107390867A (en) | A kind of man-machine interactive system based on Android wrist-watch | |
Li et al. | Hand gesture tracking and recognition based human-computer interaction system and its applications |
Legal Events
Date | Code | Title | Description |
---|---|---|---|
PB01 | Publication | ||
PB01 | Publication | ||
SE01 | Entry into force of request for substantive examination | ||
SE01 | Entry into force of request for substantive examination | ||
RJ01 | Rejection of invention patent application after publication |
Application publication date: 20170510 |
|
RJ01 | Rejection of invention patent application after publication |