WO2019105376A1 - Gesture recognition method, terminal and storage medium - Google Patents

Gesture recognition method, terminal and storage medium Download PDF

Info

Publication number
WO2019105376A1
WO2019105376A1 PCT/CN2018/117864 CN2018117864W WO2019105376A1 WO 2019105376 A1 WO2019105376 A1 WO 2019105376A1 CN 2018117864 W CN2018117864 W CN 2018117864W WO 2019105376 A1 WO2019105376 A1 WO 2019105376A1
Authority
WO
WIPO (PCT)
Prior art keywords
gesture recognition
terminal
gesture
ultrasonic wave
application
Prior art date
Application number
PCT/CN2018/117864
Other languages
French (fr)
Chinese (zh)
Inventor
刘永霞
史波
易科
李�瑞
刘杉
Original Assignee
腾讯科技(深圳)有限公司
Priority date (The priority date is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the date listed.)
Filing date
Publication date
Application filed by 腾讯科技(深圳)有限公司 filed Critical 腾讯科技(深圳)有限公司
Publication of WO2019105376A1 publication Critical patent/WO2019105376A1/en

Links

Images

Classifications

    • GPHYSICS
    • G06COMPUTING; CALCULATING OR COUNTING
    • G06FELECTRIC DIGITAL DATA PROCESSING
    • G06F3/00Input arrangements for transferring data to be processed into a form capable of being handled by the computer; Output arrangements for transferring data from processing unit to output unit, e.g. interface arrangements
    • G06F3/01Input arrangements or combined input and output arrangements for interaction between user and computer
    • GPHYSICS
    • G08SIGNALLING
    • G08CTRANSMISSION SYSTEMS FOR MEASURED VALUES, CONTROL OR SIMILAR SIGNALS
    • G08C23/00Non-electrical signal transmission systems, e.g. optical systems
    • G08C23/02Non-electrical signal transmission systems, e.g. optical systems using infrasonic, sonic or ultrasonic waves

Definitions

  • the present application relates to the field of computer technologies, and in particular, to a gesture recognition method, a terminal, and a storage medium.
  • the input mode of the user on the mobile phone browser is mainly contact input, that is, the user needs to manually input on the touch screen or physical button of the mobile phone terminal.
  • gesture recognition Users can also use non-contact input methods for input, such as gesture recognition, which is convenient for users.
  • the gesture refers to various actions made by the human hand under the control of the human consciousness, such as finger bending, stretching and hand movement in space, etc., which may be performing a certain task or communicating with people to express a certain Meaning or intent.
  • gesture recognition Based on three-dimensional interactive input technology of gesture recognition, there are commonly used data glove-based and vision-based (such as camera) gesture recognition.
  • An example of the present application provides a gesture recognition method, where the method is applied to a terminal, where a speaker and a microphone are disposed, and an application is installed on the terminal, and the method includes:
  • Corresponding gesture instructions are executed in the application according to the gesture recognition result.
  • the application example further provides a terminal, where the terminal is configured with a speaker and a microphone, and the terminal is installed with an application program, and the terminal includes:
  • a mode determining module configured to determine, according to an input control indication of the application, whether the terminal turns on a gesture recognition mode
  • An ultrasonic transmitting module configured to: when the terminal determines to enable the gesture recognition mode, play the first ultrasonic wave through the speaker;
  • An ultrasonic acquisition module configured to receive a second ultrasonic wave by using the microphone, where the second ultrasonic wave is waveform data obtained after collecting the first ultrasonic wave;
  • a gesture recognition module configured to perform gesture recognition according to the received second ultrasonic wave, to obtain a gesture recognition result
  • an instruction execution module configured to execute a corresponding gesture instruction in the application according to the gesture recognition result.
  • the application example further provides a terminal, where the terminal includes: a speaker, a microphone, a processor, and a memory;
  • the processor and the speaker, the microphone, and the memory communicate with each other;
  • the memory is for storing instructions
  • the speaker for playing a first ultrasonic wave under the control of the processor
  • the microphone for receiving a second ultrasonic wave under the control of the processor
  • the processor is operative to execute the instructions in the memory, performing the method of any of the preceding aspects.
  • the present application examples provide a computer readable storage medium having instructions stored therein that, when executed on a computer, cause the computer to perform the methods described in the various aspects above.
  • 1a is a schematic structural diagram of a system involved in an example of the present application.
  • 1b is a schematic block diagram of a gesture recognition method provided by an example of the present application.
  • 1c is a schematic structural diagram of a system involved in an example of the present application.
  • FIG. 3 is a schematic flow chart of ultrasonic receiving provided by an example of the present application.
  • FIG. 4 is a schematic flowchart of a one-dimensional gesture recognition process provided by an example of the present application.
  • FIG. 5 is a schematic flowchart of a two-dimensional gesture recognition process provided by an example of the present application.
  • FIG. 6 is a schematic flowchart of gesture recognition action management provided by an example of the present application.
  • 7-a is a schematic diagram of the action of gesture recognition provided by the example of the present application.
  • 7-b is a schematic diagram of the end of the action of gesture recognition provided by the example of the present application.
  • FIG. 8 is a schematic diagram of a basic operation of gesture recognition provided by an example of the present application.
  • 9-a is a schematic structural diagram of a terminal provided by an example of the present application.
  • 9-b is a schematic structural diagram of a mode determining module provided by an example of the present application.
  • 9-c is a schematic structural diagram of another terminal provided by an example of the present application.
  • 9-d is a schematic structural diagram of another terminal provided by an example of the present application.
  • FIG. 10 is a schematic structural diagram of a gesture recognition method provided by an example of the present application applied to a terminal;
  • FIG. 11 is a schematic structural diagram of another terminal according to an example of the present application.
  • the application example provides a gesture recognition method and a terminal for reducing the complexity of gesture recognition on an application of the terminal and reducing the computing performance requirement of the terminal.
  • the terminal can recognize the gesture of the user based on the image of the gesture made by the user.
  • the image recognition algorithm is complex, computationally intensive, high in power consumption, and susceptible to light, and has high performance requirements on the terminal device. Therefore, this method cannot achieve accurate recognition on the terminal.
  • the present application provides a gesture recognition method.
  • the method may be specifically applied to a gesture recognition scene of a terminal to a user.
  • the gesture recognition method provided by the example of the present application is applied to the terminal 101.
  • the terminal 101 is configured with a speaker 102 and a microphone 103.
  • the terminal 101 also has an application 104 installed thereon.
  • the application program installed on the terminal includes : Browser APP, or office software APP, or game app, etc., is not limited here.
  • FIG. 1b is a schematic flow chart of a gesture recognition method according to an illustrative example of the present application.
  • the gesture recognition method includes the following steps:
  • S101 Determine, according to an input control indication of the application, whether the terminal turns on the gesture recognition mode.
  • an application is installed on the terminal, and the user can use the application, for example, the user can use the browser APP to query the webpage.
  • the input control instruction may be specifically set in the application's setting menu, or may be set in the application's configuration file.
  • the input control indication may be set in the installation configuration file, and configured by the user when the application is installed on the terminal. This input control indication.
  • the open page of the gesture recognition mode can be set. If the user selects to enable the gesture recognition mode, it can be determined that the gesture recognition mode is enabled on the terminal, and if the user does not enable the gesture recognition mode, It is determined that the terminal has turned off the gesture recognition mode.
  • the terminal determines to enable the gesture recognition mode, the subsequent steps are triggered, otherwise the gesture recognition method in the example of the present application is ended.
  • the input control indication of the application can control whether the terminal enables the gesture recognition mode.
  • step S101 determines whether the terminal turns on the gesture recognition mode according to the input control indication of the application, including:
  • the terminal turns on the gesture recognition mode by default, it is determined that the gesture recognition mode is enabled in the terminal.
  • the input control indication of the application can be configured as the default open gesture recognition mode, and the gesture recognition mode can be automatically turned on after the application runs on the terminal, thereby eliminating the trouble of whether the user resets or not.
  • the gesture recognition mode can be automatically turned on after the application runs on the terminal, thereby eliminating the trouble of whether the user resets or not.
  • step S101 determines whether the terminal turns on the gesture recognition mode according to the input control indication of the application, including:
  • the opening and closing of the gesture recognition mode in the example of the present application may also be determined by some preset gesture actions.
  • some two-dimensional gestures may be preset, and the preset two-dimensional gesture is used by the user to control whether the terminal turns on the gesture. Identify the mode to meet the user's real-time control needs for the terminal. For example, determining that the input control indication of the application may open the gesture recognition mode for the first two-dimensional gesture according to the configuration file of the application, the terminal may transmit the ultrasonic wave through the speaker, and acquire the ultrasonic wave through the microphone, thereby identifying whether the user has made a preset.
  • One of the gesture actions (for example, the first two-dimensional gesture) can be turned on only when the preset gesture motion is detected.
  • the terminal can transmit the ultrasonic wave through the speaker, and acquire the ultrasonic wave through the microphone, thereby identifying whether the user has made a preset.
  • One of the gesture actions eg, the second two-dimensional gesture
  • the preset gesture action can be turned off only when the preset gesture action is detected.
  • the gesture recognition method provided by the example of the present application may further include the following steps:
  • the waveform data of the specified gesture action is recorded through the microphone
  • the gesture recognition result corresponding to the specified gesture action is determined according to the gesture waveform template library.
  • the gesture waveform is established in the example of the present application.
  • Template library For example, in order to solve the problem of insufficient computing power of the terminal, a cloud server-based gesture waveform template library can also be established. That is, in some examples, the gesture recognition method provided by the examples of the present application can also be applied to the system architecture as shown in FIG. 1c.
  • the system architecture includes a terminal 101 and a cloud server 105 that interact via one or more Internet 106.
  • the user records the waveform data of the specified gesture action through the terminal 101, and the cloud server 105 can establish a gesture waveform template library based on the above interaction, and adopts a machine learning method to perform gesture recognition, thereby improving the accuracy of gesture matching.
  • cloud server 105 can include two components: one is offline gesture waveform template library training and the other is online gesture recognition.
  • the terminal 101 may extract the received second ultrasonic wave to the ultrasonic data, and then send the ultrasonic data to the cloud server 105, and the cloud server 105 may acquire the gesture recognition result through the gesture waveform template library by using online cloud recognition, and then the cloud The server 105 sends the gesture recognition result to the terminal 101, so that different terminal types can obtain the gesture recognition result applicable to the terminal type.
  • the type information of the terminal and the configuration information of the terminal can be obtained through the configuration file of the terminal.
  • the system type of the terminal, the operating system version, the number of processors used by the terminal, and the size of the content space thereby entering the gesture waveform template library set for the terminal according to the type information of the terminal and the configuration information of the terminal, thereby improving gesture matching. Accuracy.
  • the terminal has a built-in speaker, and the terminal can first generate an ultrasonic signal, and then play the ultrasonic wave through the speaker, and define the ultrasonic wave played by the speaker as the “first ultrasonic wave”. After the first ultrasonic wave is emitted from the position where the terminal is located, the first ultrasonic wave hits the target obstacle and reflects to the position where the terminal is located.
  • the target obstacle is mainly a user's hand, such as a single or multiple fingers of the user, or one or two palms of the user.
  • the terminal can use the built-in ultrasonic generator to generate ultrasonic waves, and then transmit the ultrasonic waves through the built-in speakers of the terminal.
  • the ultrasonic generator generates ultrasonic waves by mechanical vibration, and generally propagates in an elastic medium in a longitudinal wave manner, which is a form of energy propagation.
  • the ultrasonic wave has a short wavelength and good directivity.
  • the ultrasonic generator generates ultrasonic waves with a frequency higher than 20,000 Hz, good directionality, strong penetrating power, and easy to obtain concentrated sound energy.
  • the gesture recognition method provided by the example of the present application further includes:
  • step S102 playing the first ultrasonic wave through the speaker
  • the trigger performs the following step S103: receiving the second ultrasonic wave through the microphone.
  • abnormality detection may be performed on the playing data, for example, the signal waveform of the first ultrasonic wave is detected from multiple dimensions such as time domain and frequency domain.
  • the terminal can detect whether the played data has dropped frames, for example, whether the sampled frequency points are the same as the frequency points played by the speaker in one second.
  • the terminal can check whether the energy of the signal is normal in the frequency domain, for example, the ultrasonic signal played is 20khz, and it is determined whether the received ultrasonic energy is 20khz.
  • the terminal when the terminal plays an ultrasonic wave, it can also detect whether there is a frame loss by playing music with fast rhythm.
  • the success rate of the received ultrasonic wave can be improved.
  • S103 Receive a second ultrasonic wave through a microphone, and the second ultrasonic wave is waveform data obtained by collecting the first ultrasonic wave.
  • the sound wave signal can be received through the microphone built in the terminal, and the sound wave signal received by the microphone is defined as the “second ultrasonic wave”.
  • the microphone built in the terminal may be one or more, which is not limited herein.
  • the second ultrasonic wave normally received from the surroundings of the terminal is a mixed signal mixed with ultrasonic waves reflected by the target obstacle, and ultrasonic waves emitted from the speaker, and ultrasonic waves emitted from the vicinity of the terminal.
  • the gesture recognition method provided by the example of the present application further includes:
  • the trigger performs the following steps: performing gesture recognition according to the received second ultrasonic wave.
  • the terminal After the terminal acquires the second ultrasonic wave, it can also determine whether there is an abnormality in the recorded signal waveform, for example, whether the received signal waveform is complete, that is, whether there is a frame loss. For example, the detection can be performed from multiple dimensions such as frequency domain and time domain, otherwise the subsequent gesture recognition result is affected. There are various methods of detecting, for example, determining whether the received signal waveform energy is equal to the played signal waveform energy. Another example is whether the frequency points recorded per second are the same as the sampling frequency. Another example is the energy of the time-frequency diagram of the signal.
  • the gesture recognition is performed only when there is no abnormality in the received second ultrasonic wave, thereby improving the accuracy of the gesture recognition.
  • step S103 receives the second ultrasonic wave through the microphone, including:
  • the recorded sound wave signal is stored in the recording buffer of the terminal, wherein the sound wave signal stored in the recording buffer is the second ultrasonic wave.
  • the terminal preset recording parameters can be various, such as mono or multi-channel.
  • the terminal can select mono, multi-channel, etc. according to the needs.
  • the recording parameter may further include a sampling frequency, which is the same as the transmission frequency of the ultrasonic wave.
  • the terminal can store the acoustic signal in the recording buffer.
  • the size of the recording buffer can be set as follows. If the recording buffer is set too small, it may cause frame loss. Generally, the minimum buffer size is more than 2 times.
  • the received second ultrasonic wave may be gesture-recognized. Since the first ultrasonic wave emitted by the speaker is blocked by the user's hand, the microphone collects the human hand. The blocked ultrasonic wave, so by analyzing the receiving position, waveform energy, and the like of the second ultrasonic wave received by the microphone, the gesture made by the user can be recognized, and the gesture recognition result is obtained.
  • the step S104 performs gesture recognition according to the received second ultrasonic wave, and after the gesture recognition result is obtained, the gesture recognition method provided by the example of the present application may further include the following steps:
  • the prompt information for successful gesture recognition is displayed through the display interface of the application.
  • the display interface of the application of the terminal in the example of the present application can also implement real-time interaction with the user, and the second ultrasonic wave received by the step S104 performs gesture recognition to obtain a gesture recognition result, indicating that the current gesture recognition is successful, in order to avoid
  • the user repeatedly performs multiple identical actions and the display interface of the application of the terminal can also display prompt information, for example, by using an animation displayed on the display interface to inform the user of the current gesture recognition program, or by prompting the user by text or sound, here is not Make a limit.
  • the terminal may respond to the gesture of the user, and may execute a corresponding gesture instruction according to the gesture recognition result, for example, operating the application of the terminal in response to the gesture instruction of the user.
  • the gesture recognition result is a circle
  • the terminal executes a gesture instruction corresponding to the gesture recognition result on the application: opening the application or operating a certain menu of the application.
  • the correspondence between the gesture recognition result and the gesture instruction may be determined according to a pre-configured list of the user on the terminal.
  • the terminal recognizes the gesture of the user to obtain a gesture recognition result, and the terminal may execute a gesture instruction of the user on the browser application, for example, inputting a text in a search box of the browser.
  • step S102 plays the first ultrasonic wave through the speaker, including:
  • N being a positive integer greater than or equal to 1;
  • step S103 receives the second ultrasonic wave through the microphone, including:
  • a second ultrasonic wave of N frequencies is collected from the surroundings of the terminal through a microphone.
  • the terminal can transmit ultrasonic waves of one or more frequencies, for example, transmitting a plurality of ultrasonic waves of different frequencies, and the ultrasonic waves can be separated by the same distance.
  • the terminal can separately acquire ultrasonic waves of one or more frequencies played by the speaker through the microphone.
  • step S104 performs gesture recognition according to the received second ultrasonic wave to obtain a gesture recognition result, including:
  • the one-dimensional gesture recognition is performed according to the waveform energy of the second ultrasonic wave, and the first gesture recognition result is obtained.
  • the first gesture recognition result includes: the gesture is close to the terminal, or is away from the terminal.
  • the terminal can transmit an ultrasonic wave through the speaker, and the terminal can collect the ultrasonic wave through the microphone, and the terminal can perform coarse-grained one-dimensional gesture recognition on the received one ultrasonic wave, for example, the first gesture is obtained by Doppler waveform energy change.
  • the result of the recognition, the first gesture recognition result includes: the gesture is close to the terminal, or is away from the terminal.
  • step S104 performs gesture recognition according to the received second ultrasonic wave to obtain a gesture recognition result, including:
  • the moving distance of the gesture is calculated for each of the N second ultrasonic waves
  • the gesture recognition result includes: the gesture is close to the terminal, or is away from the terminal, or Moving to the left relative to the terminal, or moving to the right relative to the terminal, or moving up relative to the terminal, or moving downward relative to the terminal.
  • the terminal can transmit a plurality of ultrasonic waves through the speaker, and the terminal can separately collect each ultrasonic wave through the microphone, and the terminal can perform fine-grained one-dimensional gesture recognition on the received plurality of ultrasonic waves. For example, using N-frequency ultrasonic phase changes to calculate the distance, the response is more sensitive. N ultrasonic waves in the middle of 17500HZ-23000HZ are emitted. The distance between adjacent ultrasonic waves is the same. The sampling frequency is 48000Hz. 512 points can be sampled to calculate the distance change in real time. The reaction time is 10.7ms. In the process of sound transmission, there will be some multipath effects, and some background static objects.
  • the dynamic signal can be subtracted from the background interference, and N distances can be calculated by N frequencies.
  • the least squares method is used to calculate the deviation, and the frequency distance with large deviation is removed, and the deviation is small. By excluding the abnormal distance, the calculation accuracy of the distance can be improved.
  • step S103 receives the second ultrasonic wave through the microphone, including:
  • the second ultrasonic wave is separately collected by the two microphones of the terminal.
  • step S104 performs gesture recognition according to the received second ultrasonic wave to obtain a gesture recognition result, including:
  • the two-dimensional gesture recognition is performed according to the calculated relative position and the initial position, and the second gesture recognition result is obtained, and the second gesture recognition result includes: two-dimensional coordinates of the gesture change.
  • At least two microphones may be disposed in the terminal to separately acquire the second ultrasonic waves. Then, for the second ultrasonic wave received by each microphone, the relative position and initial position of the gesture can be calculated.
  • the relative position of the gesture refers to the position of the gesture relative to the terminal based on the phase measurement of the second ultrasonic wave
  • the initial position of the gesture refers to the position of the user gesture at the time of initial recognition.
  • the description of the present application shows that a speaker and a microphone are disposed in the terminal, and an application program is installed on the terminal. Firstly, according to the input control indication of the application, determining whether the terminal turns on the gesture recognition mode, when the terminal determines to turn on the gesture recognition mode, playing the first ultrasonic wave through the speaker, and then receiving the second ultrasonic wave through the microphone, and the second ultrasonic wave is for the first ultrasonic wave. The waveform data obtained after the acquisition is then subjected to gesture recognition according to the received second ultrasonic wave to obtain a gesture recognition result, and finally a corresponding gesture instruction is executed in the application according to the gesture recognition result.
  • the gesture recognition can be realized by using the built-in speaker and microphone of the terminal, that is, the motion of the user can be recognized through the transmission and detection of the ultrasonic wave, the complexity of the gesture recognition is reduced, and the calculation performance requirement of the terminal is reduced. And implements gesture control of the user on the application. Further, by calculating the second ultrasonic wave, the moving distance of the gesture or the two-dimensional change of the gesture is obtained, and the accuracy of the gesture recognition is improved.
  • the application example can be applied to a mobile phone browser and a game application (Application, APP).
  • the application example can realize high-precision, low-delay, no-peripheral, non-contact one-dimensional and two-dimensional gesture recognition.
  • the example of the present application performs ultrasonic gesture detection based on the speaker and microphone of the terminal, and does not require any external equipment. For example, in the example of the present application, the user does not need to manually operate the touch screen of the terminal, and does not need to input any command from the keyboard, and the user only needs to make a gesture action.
  • the terminal in the example of the present application can detect by sending and receiving ultrasonic waves.
  • the gesture action made by the user is performed to execute the gesture instruction corresponding to the gesture action on the application of the terminal, for example, the user operates the game application by making a gesture, and the character movement in the game application can be controlled, and the game application is opened. With off and so on.
  • the user can operate basic browsing scenarios such as upper, lower, left, and right.
  • the terminal can adopt multiple playback modes such as static and streaming, and can be set according to user requirements.
  • Ultrasonic playback requirements are strict. If there is abnormal playback or frame loss, it will lead to calculation errors or errors. Therefore, the data acquisition process of the ultrasonic playback process needs to be prepared in advance. The playback process logic needs to ensure that the waveform data played is not dropped. That is, the waveform data that needs to be played is complete.
  • the terminal can detect whether the played data is abnormal, and can detect from multiple dimensions such as time domain and frequency domain. There are many ways to detect. For example, if the data being played is detected, there is no frame loss, and it is not the number of points for playing the sampling frequency in 1 second. Another example is to detect whether the playback module has reported an exception. Another example is to check whether the energy of the signal is normal in the frequency domain. For example, the frequency of the broadcast is 20khz, and the frequency of the received signal is 20khz. In addition, the playback process can also detect the loss of frames with fast-paced music playback.
  • the sampling frequency is twice the transmission frequency of the ultrasonic wave. According to Nyquist's sampling law, the sampling frequency is compared with the playback frequency, and the sampling frequency can be twice the playback frequency. If the recording buffer is set too small, it may cause frame loss. Generally, the size of the minimum buffer is more than 2 times.
  • the logic of the recording thread is as light as possible, ensuring that the sampling frequency is received in one second.
  • the recorded signal waveform is complete, whether there is no frame loss, need to detect from multiple dimensions such as frequency domain and time domain, otherwise it will affect the subsequent gesture recognition result.
  • There are many ways to detect such as whether the received signal energy is the energy of the frequency of playback.
  • the number of points recorded per second is the same as the sampling frequency.
  • the ultrasonic processing may be specifically a one-dimensional gesture recognition process or a two-dimensional gesture recognition process, which is respectively illustrated by way of example.
  • the one-dimensional gesture recognition processing will be described.
  • the one-dimensional gesture recognition processing can implement two schemes.
  • One of the coarse-grained gesture recognitions mainly includes:
  • Modeling a multi-dimensional feature such as a velocity, an acceleration, and the like of the waveform to obtain a one-dimensional gesture change Mainly according to the Doppler effect, in the process of the gesture approaching and far away, the waveform has different trends, and the calculation is based on the direction and acceleration of the waveform change. For example, as follows, an ultrasonic wave of 20,000 Hz is transmitted, and the sampling frequency is 44100 Hz, and 4096 points can be sampled each time for calculation.
  • Another fine-grained gesture recognition includes:
  • the reaction is more sensitive. N ultrasonic waves in the middle of 17500HZ-23000HZ, the distance between adjacent two ultrasonic waves is the same, the sampling frequency is 48000Hz, and 512 points can be sampled to calculate the distance change in real time.
  • the reaction time is 10.7ms, that is, in this example, the gesture
  • the accuracy of the recognition can reach the ms level.
  • the distance calculated for the phase changes of the N frequencies is eliminated by an algorithm to improve the calculation accuracy of the distance.
  • the background interference will be subtracted, and N distances will be calculated by N frequencies. According to the least squares method, Calculate the deviation, remove the frequency distance with large deviation, and keep the distance with small deviation.
  • the influence of the two microphones on the waveform is different, so that the change of the two directions can be obtained.
  • the two-dimensional gesture recognition processing mainly solves the recognition of the two-dimensional coordinates of the gesture action, thereby realizing the recognition of the two-dimensional graphics (drawing circles, drawing squares, drawing triangles, etc.) or writing Chinese characters. With two speakers, two channels of data can be sampled.
  • Steps 503 and 504 are performed for each microphone channel.
  • the phase based on the one-dimensional gesture recognition can calculate the relative distance of the gesture movement.
  • the coarse-grained initial position of the gesture can be calculated.
  • the initial position is the initial position of the gesture detected when the ultrasonic wave is received.
  • the two-dimensional gesture recognition can be combined with the coordinates in the two directions of x and y, and the precise coordinates (x, y) of the gesture recognition can be obtained through the initial The position and relative position changes can get the real-time gesture position.
  • the management of gesture recognition is exemplified next.
  • the action management of gesture recognition avoid misuse and unnecessary interruption. mainly includes:
  • the motion of the gesture recognition may be a circular shape
  • the motion recognition of the gesture recognition may be a square.
  • the user can set the gesture recognition action to meet the basic operational needs.
  • the basic operation of the gesture recognition may include basic actions such as up, down, left, and right.
  • the playback and recording of the ultrasonic wave may be affected, thereby affecting the accuracy of the gesture recognition.
  • an efficient programming language may be used, such as calculation logic.
  • Language implementation such as the use of pointer operations, optimize the processor's high-cost logic operations, minimize memory copy, input and output operations, and reduce computation time. Therefore, the time consumption of one-dimensional gesture recognition calculation in the example of the present application can be reduced to the ms level.
  • the accuracy of the gesture recognition and the fastest response time are the most important.
  • the user's misoperation can be prevented, and the opening and ending of the gesture recognition mode can be recognized by the two-dimensional gesture.
  • the present application example proposes a one-dimensional gesture recognition manner, wherein the ultrasonic phase-based approach can achieve millisecond (mm) level accuracy.
  • the example of the present application can improve the calculation performance as much as possible and save the calculation time.
  • a terminal 900 is provided.
  • the terminal is configured with a speaker and a microphone.
  • the terminal is installed with an application program, and the terminal may include: a mode determining module 901, and an ultrasound.
  • the mode determining module 901 is configured to determine, according to an input control indication of the application, whether the terminal turns on the gesture recognition mode;
  • the ultrasonic sending module 902 is configured to: when the terminal determines to enable the gesture recognition mode, play the first ultrasonic wave through the speaker;
  • the ultrasonic acquisition module 903 is configured to receive a second ultrasonic wave by using the microphone, where the second ultrasonic wave is waveform data obtained after collecting the first ultrasonic wave;
  • a gesture recognition module 904 configured to perform gesture recognition according to the received second ultrasonic wave, to obtain a gesture recognition result
  • the instruction execution module 905 is configured to execute a corresponding gesture instruction in the application according to the gesture recognition result.
  • the ultrasonic transmitting module 902 is specifically configured to play a first ultrasonic wave of N frequencies through the speaker, where N is a positive integer greater than or equal to 1;
  • the ultrasonic acquisition module is configured to collect, by the microphone, a second ultrasonic wave of N frequencies from a surrounding environment of the terminal.
  • the gesture recognition module 904 is specifically configured to perform one-dimensional gesture recognition according to the waveform energy of the second ultrasonic wave when the value of the N is 1, to obtain a first A gesture recognition result, the first gesture recognition result includes: the gesture is close to the terminal, or is away from the terminal.
  • the mode determining module 901 includes:
  • the control indication parsing module 9011 is configured to determine, according to the configuration file of the application, whether the input control indication of the application is a default open gesture recognition mode;
  • the program detection module 9012 is configured to detect whether the application runs successfully on the terminal
  • the mode-opening module 9013 is configured to determine, when the application runs successfully, that the terminal turns on the gesture recognition mode by default, and determines that the terminal has enabled the gesture recognition mode.
  • the mode determining module 901 is specifically configured to determine, according to the configuration file of the application, that the input control indication of the application is a first two-dimensional gesture open gesture recognition mode, or a second two-dimensional gesture Turning off the gesture recognition mode; when detecting the first two-dimensional gesture, determining that the terminal has turned on the gesture recognition mode; and when detecting the second two-dimensional gesture, determining that the terminal turns off the gesture recognition mode.
  • the terminal 900 further includes: a template library establishing module 906 and a gesture action matching module 907, where
  • the ultrasonic acquisition module 903 is configured to record waveform data of the specified gesture action through the microphone when the speaker plays the training ultrasonic wave;
  • a template library establishing module 906, configured to establish a gesture waveform template library according to the waveform data of the specified gesture action
  • the gesture action matching module 907 is configured to determine a gesture recognition result corresponding to the specified gesture action according to the gesture waveform template library.
  • the terminal 900 further includes:
  • the display module 908 is configured to perform gesture recognition according to the received second ultrasonic wave to obtain gesture recognition result, and then display prompt information for successful gesture recognition through the display interface of the application.
  • the gesture recognition module 904 includes:
  • a distance calculating module configured to calculate a moving distance of the gesture for each of the N second ultrasonic waves when the value of the N is greater than or equal to 2;
  • a one-dimensional identification module configured to exclude an abnormal moving distance from the moving distances of the N gestures, perform one-dimensional gesture recognition on the retained moving distance, and obtain a first gesture recognition result, where the first gesture recognition result is
  • the method includes: the gesture is close to the terminal, or is away from the terminal, or moves to the left relative to the terminal, or moves to the right relative to the terminal, or moves upward relative to the terminal, or is opposite to the terminal Move down.
  • the ultrasonic acquisition module 903 is specifically configured to separately collect a second ultrasonic wave through two microphones of the terminal;
  • the gesture recognition module 904 includes:
  • a position calculation module configured to calculate a relative position and an initial position of the gesture according to the second ultrasonic waves respectively received by the two microphones;
  • the two-dimensional recognition module is configured to perform two-dimensional gesture recognition according to the calculated relative position and the initial position to obtain a second gesture recognition result, where the second gesture recognition result includes: two-dimensional coordinates of the gesture change.
  • the ultrasonic acquisition module 903 includes:
  • a sound wave recording module configured to control the microphone to record an acoustic signal in a surrounding environment of the terminal according to preset recording parameters
  • a signal storage module configured to store the recorded acoustic wave signal into a recording buffer of the terminal, wherein the sound wave signal stored in the recording buffer is the second ultrasonic wave.
  • a speaker and a microphone are disposed in the terminal, and an application is installed on the terminal.
  • determining whether the terminal turns on the gesture recognition mode when the terminal determines to turn on the gesture recognition mode, playing the first ultrasonic wave through the speaker, and then receiving the second ultrasonic wave through the microphone, and the second ultrasonic wave is for the first ultrasonic wave
  • the waveform data obtained after the acquisition is then subjected to gesture recognition according to the received second ultrasonic wave to obtain a gesture recognition result, and finally a corresponding gesture instruction is executed in the application according to the gesture recognition result.
  • the gesture recognition can be realized by using the built-in speaker and microphone of the terminal, that is, the motion of the user can be recognized through the transmission and detection of the ultrasonic wave, the complexity of the gesture recognition is reduced, and the calculation performance requirement of the terminal is reduced. And implements gesture control of the user on the application. Further, by calculating the second ultrasonic wave, the moving distance of the gesture or the two-dimensional change of the gesture is obtained, and the accuracy of the gesture recognition is improved.
  • the present application also provides a terminal, as shown in FIG. 10, for the convenience of description, only the parts related to the examples of the present application are shown. For the specific technical details not disclosed, please refer to the example method part of the present application.
  • the terminal may be any terminal device including a mobile phone, a tablet computer, a PDA (Personal Digital Assistant), a POS (Point of Sales), an in-vehicle computer, and the terminal is a mobile phone as an example:
  • FIG. 10 is a block diagram showing a partial structure of a mobile phone associated with a terminal provided by an example of the present application.
  • the mobile phone includes: a radio frequency (RF) circuit 1010, a memory 1020, an input unit 1030, a display unit 1040, a sensor 1050, an audio circuit 1060, a wireless fidelity (WiFi) module 1070, and a processor 1080. And power supply 1090 and other components.
  • RF radio frequency
  • the RF circuit 1010 can be used for receiving and transmitting signals during the transmission or reception of information or during a call. In particular, after receiving the downlink information of the base station, it is processed by the processor 1080. In addition, the uplink data is designed to be sent to the base station. Generally, RF circuit 1010 includes, but is not limited to, an antenna, at least one amplifier, a transceiver, a coupler, a Low Noise Amplifier (LNA), a duplexer, and the like. In addition, RF circuit 1010 can also communicate with the network and other devices via wireless communication. The above wireless communication may use any communication standard or protocol, including but not limited to Global System of Mobile communication (GSM), General Packet Radio Service (GPRS), Code Division Multiple Access (Code Division). Multiple Access (CDMA), Wideband Code Division Multiple Access (WCDMA), Long Term Evolution (LTE), E-mail, Short Messaging Service (SMS), and the like.
  • GSM Global System of Mobile communication
  • GPRS General Packet Radio Service
  • the memory 1020 can be used to store software programs and modules, and the processor 1080 executes various functional applications and data processing of the mobile phone by running software programs and modules stored in the memory 1020.
  • the memory 1020 may mainly include a storage program area and a storage data area, wherein the storage program area may store an operating system, an application required for at least one function (such as a sound playing function, an image playing function, etc.), and the like; the storage data area may be stored according to Data created by the use of the mobile phone (such as audio data, phone book, etc.).
  • memory 1020 can include high speed random access memory, and can also include non-volatile memory, such as at least one magnetic disk storage device, flash memory device, or other volatile solid state storage device.
  • Input unit 1030 can be used to receive input numeric or character information and to generate key signal inputs related to user settings and function control of the handset.
  • the input unit 1030 may include a touch panel 1031 and other input devices 1032.
  • the touch panel 1031 also referred to as a touch screen, can collect touch operations on or near the user (such as the user using a finger, a stylus, or the like on the touch panel 1031 or near the touch panel 1031. Operation), and drive the corresponding connecting device according to a preset program.
  • the touch panel 1031 can include two portions of a touch detection device and a touch controller.
  • the touch detection device detects the touch orientation of the user, and detects a signal brought by the touch operation, and transmits the signal to the touch controller; the touch controller receives the touch information from the touch detection device, converts the touch information into contact coordinates, and sends the touch information.
  • the processor 1080 is provided and can receive commands from the processor 1080 and execute them.
  • the touch panel 1031 can be implemented in various types such as resistive, capacitive, infrared, and surface acoustic waves.
  • the input unit 1030 may also include other input devices 1032.
  • other input devices 1032 may include, but are not limited to, one or more of a physical keyboard, function keys (such as volume control buttons, switch buttons, etc.), trackballs, mice, joysticks, and the like.
  • the display unit 1040 can be used to display information input by the user or information provided to the user as well as various menus of the mobile phone.
  • the display unit 1040 may include a display panel 1041.
  • the display panel 1041 may be configured in the form of a liquid crystal display (LCD), an organic light-emitting diode (OLED), or the like.
  • the touch panel 1031 may cover the display panel 1041, and when the touch panel 1031 detects a touch operation thereon or nearby, the touch panel 1031 transmits to the processor 1080 to determine the type of the touch event, and then the processor 1080 according to the touch event.
  • the type provides a corresponding visual output on display panel 1041.
  • the touch panel 1031 and the display panel 1041 are two independent components to implement the input and input functions of the mobile phone, in some examples, the touch panel 1031 and the display panel 1041 may be integrated. The input and output functions of the phone.
  • the handset can also include at least one type of sensor 1050, such as a light sensor, motion sensor, and other sensors.
  • the light sensor may include an ambient light sensor and a proximity sensor, wherein the ambient light sensor may adjust the brightness of the display panel 1041 according to the brightness of the ambient light, and the proximity sensor may close the display panel 1041 and/or when the mobile phone moves to the ear. Or backlight.
  • the accelerometer sensor can detect the magnitude of acceleration in all directions (usually three axes). When it is stationary, it can detect the magnitude and direction of gravity.
  • the mobile phone can be used to identify the gesture of the mobile phone (such as horizontal and vertical screen switching, related Game, magnetometer attitude calibration), vibration recognition related functions (such as pedometer, tapping), etc.; as for the mobile phone can also be configured with gyroscopes, barometers, hygrometers, thermometers, infrared sensors and other sensors, no longer Narration.
  • the gesture of the mobile phone such as horizontal and vertical screen switching, related Game, magnetometer attitude calibration
  • vibration recognition related functions such as pedometer, tapping
  • the mobile phone can also be configured with gyroscopes, barometers, hygrometers, thermometers, infrared sensors and other sensors, no longer Narration.
  • Audio circuit 1060, speaker 1061, microphone 1062, microphone 1063 can provide an audio interface between the user and the handset.
  • the audio circuit 1060 can transmit the converted electrical data of the received audio data to the speaker 1061, and convert it into a sound signal output by the speaker 1061; on the other hand, the microphone 1062 converts the collected sound signal into an electrical signal, by the audio circuit 1060. After receiving, it is converted into audio data, and then processed by the audio data output processor 1080, sent to the other mobile phone via the RF circuit 1010, or outputted to the memory 1020 for further processing.
  • the speaker 1061 can generate an ultrasonic signal, and then the microphone 1063 can acquire an ultrasonic signal from around the mobile phone.
  • WiFi is a short-range wireless transmission technology.
  • the mobile phone through the WiFi module 1070 can help users to send and receive e-mail, browse the web and access streaming media, etc. It provides users with wireless broadband Internet access.
  • FIG. 10 shows the WiFi module 1070, it can be understood that it does not belong to the essential configuration of the mobile phone, and can be omitted as needed within the scope of not changing the essence of the application.
  • the processor 1080 is the control center of the handset, which connects various portions of the entire handset using various interfaces and lines, by executing or executing software programs and/or modules stored in the memory 1020, and invoking data stored in the memory 1020, The phone's various functions and processing data, so that the overall monitoring of the phone.
  • processor 1080 can include one or more processing units; preferably, processor 1080 can integrate an application processor and a modem processor, wherein the application processor primarily processes an operating system, a user interface, and an application Etc.
  • the modem processor primarily handles wireless communications. It will be appreciated that the above described modem processor may also not be integrated into the processor 1080.
  • the mobile phone also includes a power source 1090 (such as a battery) that supplies power to various components.
  • a power source 1090 such as a battery
  • the power source can be logically coupled to the processor 1080 through a power management system to manage functions such as charging, discharging, and power management through the power management system.
  • the mobile phone may further include a camera, a Bluetooth module, and the like, and details are not described herein again.
  • the processor 1080 included in the terminal further has a flow of controlling a gesture recognition method performed by the terminal.
  • the process of identifying the ultrasonic waves by the processor 1080 can be referred to the description in the foregoing examples.
  • the terminal 1100 includes:
  • the speaker 1101, the microphone 1102, the processor 1103, and the memory 1104 (wherein the number of the processors 1103 in the terminal 1100 may be one or more, and one processor in FIG. 11 is taken as an example).
  • the speaker 1101, the microphone 1102, the processor 1103, and the memory 1104 may be connected by a bus or other means, wherein FIG. 11 is exemplified by a bus connection.
  • Memory 1104 can include read only memory and random access memory and provides instructions and data to processor 1103. A portion of the memory 1104 may also include a non-volatile random access memory (English name: Non-Volatile Random Access Memory, English abbreviation: NVRAM).
  • the memory 1104 stores operating systems and operational instructions, executable modules or data structures, or a subset thereof, or an extended set thereof, wherein the operational instructions can include various operational instructions for implementing various operations.
  • the operating system can include a variety of system programs for implementing various basic services and handling hardware-based tasks.
  • the processor 1103 controls the operation of the terminal, and the processor 1103 can also be referred to as a central processing unit (English full name: Central Processing Unit, English abbreviation: CPU).
  • the components of the terminal are coupled together by a bus system.
  • the bus system may include a power bus, a control bus, and a status signal bus in addition to the data bus.
  • the various buses are referred to as bus systems in the figures.
  • the method disclosed in the above examples of the present application may be applied to the processor 1103 or implemented by the processor 1103.
  • the processor 1103 can be an integrated circuit chip with signal processing capabilities. In the implementation process, each step of the above method may be completed by an integrated logic circuit of hardware in the processor 1103 or an instruction in a form of software.
  • the processor 1103 may be a general-purpose processor, a digital signal processor (English full name: digital signal processing, English abbreviation: DSP), an application specific integrated circuit (English name: Application Specific Integrated Circuit, English abbreviation: ASIC), field programmable Gate array (English name: Field-Programmable Gate Array, English abbreviation: FPGA) or other programmable logic devices, discrete gate or transistor logic devices, discrete hardware components.
  • the general purpose processor may be a microprocessor or the processor or any conventional processor or the like.
  • the steps of the method disclosed in connection with the examples of the present application may be directly embodied by the execution of the hardware decoding processor or by a combination of hardware and software modules in the decoding processor.
  • the software module can be located in a conventional storage medium such as random access memory, flash memory, read only memory, programmable read only memory or electrically erasable programmable memory, registers, and the like.
  • the storage medium is located in the memory 1104, and the processor 1103 reads the information in the memory 1104 and performs the steps of the above method in combination with its hardware.
  • the speaker 1101 is configured to play a first ultrasonic wave under the control of the processor
  • the microphone 1102 is configured to receive a second ultrasonic wave under the control of the processor
  • the processor 1103 is configured to execute the instructions in the memory, and perform the method in the foregoing example.
  • the device examples described above are merely illustrative, wherein the units described as separate components may or may not be physically separated, and the components displayed as units may or may not be physical. Units can be located in one place or distributed to multiple network units. Some or all of the modules may be selected according to actual needs to achieve the purpose of the solution of the present example.
  • the connection relationship between the modules indicates that there is a communication connection between them, and specifically may be implemented as one or more communication buses or signal lines.
  • U disk mobile hard disk, read-only memory (ROM, Read-Only Memory), random access memory (RAM, Random Access Memory), disk or optical disk, etc., including a number of instructions to make a computer device (may be A personal computer, server, or network device, etc.) performs the methods described in various examples of the present application.
  • a computer device may be A personal computer, server, or network device, etc.

Abstract

The embodiments of the present application provide a gesture recognition method. The method is applied to a terminal, which terminal is provided with a speaker and a microphone and has an application program installed thereon. The method comprises: determining, according to an input control instruction of the application program, whether the terminal starts a gesture recognition mode; when it is determined that the terminal starts the gesture recognition mode, playing a first ultrasonic wave by means of the speaker; receiving a second ultrasonic wave by means of the microphone, the second ultrasonic wave being waveform data obtained upon acquisition of the first ultrasonic wave; performing gesture recognition according to the received second ultrasonic wave, to obtain a gesture recognition result; and executing a corresponding gesture instruction on the application program according to the gesture recognition result.

Description

手势识别方法、终端及存储介质Gesture recognition method, terminal and storage medium
本申请要求于2017年11月30日提交中国专利局、申请号为201711237225.9、发明名称为“手势识别方法和终端”的中国专利申请的优先权,其全部内容通过引用结合在本申请中。The present application claims priority to Chinese Patent Application No. PCT Application No. No. No. No. No. No. No. No. No. No. No. No. No. No. No. No.
技术领域Technical field
本申请涉及计算机技术领域,尤其涉及一种手势识别方法、终端及存储介质。The present application relates to the field of computer technologies, and in particular, to a gesture recognition method, a terminal, and a storage medium.
背景background
用户可以使用手机浏览器来获取网络资源,例如访问网页。用户在手机浏览器上的输入方式主要是接触式输入,即用户需要在手机终端的触摸屏或者物理按键上进行手动输入。Users can use a mobile browser to access network resources, such as accessing web pages. The input mode of the user on the mobile phone browser is mainly contact input, that is, the user needs to manually input on the touch screen or physical button of the mobile phone terminal.
用户还可以使用非接触的输入方式来进行输入,例如手势识别,可以方便用户的使用。其中,手势是指在人的意识支配下,人手作出的各类动作,如手指弯曲、伸展和手在空间的运动等,可以是执行某项任务,也可以是与人的交流,以表达某种含义或意图。基于手势识别的三维交互输入技术,常用的有基于数据手套的和基于视觉(如摄象机)的手势识别。Users can also use non-contact input methods for input, such as gesture recognition, which is convenient for users. Among them, the gesture refers to various actions made by the human hand under the control of the human consciousness, such as finger bending, stretching and hand movement in space, etc., which may be performing a certain task or communicating with people to express a certain Meaning or intent. Based on three-dimensional interactive input technology of gesture recognition, there are commonly used data glove-based and vision-based (such as camera) gesture recognition.
技术内容Technical content
本申请实例提供一种手势识别方法,所述方法应用于终端,所述终端中配置有扬声器和麦克风,所述终端上安装有应用程序,所述方法包括:An example of the present application provides a gesture recognition method, where the method is applied to a terminal, where a speaker and a microphone are disposed, and an application is installed on the terminal, and the method includes:
根据所述应用程序的输入控制指示确定所述终端是否开启手势识别模式;Determining whether the terminal turns on the gesture recognition mode according to an input control indication of the application;
当所述终端确定开启手势识别模式时,通过所述扬声器播放第一超声波;When the terminal determines to turn on the gesture recognition mode, playing the first ultrasonic wave through the speaker;
通过所述麦克风接收第二超声波,所述第二超声波为对所述第一超声波进行采集后得到的波形数据;Receiving, by the microphone, a second ultrasonic wave, wherein the second ultrasonic wave is waveform data obtained after collecting the first ultrasonic wave;
根据接收到的所述第二超声波进行手势识别,得到手势识别结果;Performing gesture recognition according to the received second ultrasonic wave to obtain a gesture recognition result;
根据所述手势识别结果在所述应用程序执行相应的手势指令。Corresponding gesture instructions are executed in the application according to the gesture recognition result.
本申请实例还提供一种终端,所述终端中配置有扬声器和麦克风,所述终端上安装有应用程序,所述终端包括:The application example further provides a terminal, where the terminal is configured with a speaker and a microphone, and the terminal is installed with an application program, and the terminal includes:
模式确定模块,用于根据所述应用程序的输入控制指示确定所述终端是否开启手势识别模式;a mode determining module, configured to determine, according to an input control indication of the application, whether the terminal turns on a gesture recognition mode;
超声波发送模块,用于当所述终端确定开启手势识别模式时,通过所述扬声器播放第一超声波;An ultrasonic transmitting module, configured to: when the terminal determines to enable the gesture recognition mode, play the first ultrasonic wave through the speaker;
超声波采集模块,用于通过所述麦克风接收第二超声波,所述第二超声波为对所述第一超声波进行采集后得到的波形数据;An ultrasonic acquisition module, configured to receive a second ultrasonic wave by using the microphone, where the second ultrasonic wave is waveform data obtained after collecting the first ultrasonic wave;
手势识别模块,用于根据接收到的所述第二超声波进行手势识别,得到手势识别结果;a gesture recognition module, configured to perform gesture recognition according to the received second ultrasonic wave, to obtain a gesture recognition result;
指令执行模块,用于根据所述手势识别结果在所述应用程序执行相应的手势指令。And an instruction execution module, configured to execute a corresponding gesture instruction in the application according to the gesture recognition result.
本申请实例还提供一种终端,所述终端包括:扬声器、麦克风、处理器和存储器;The application example further provides a terminal, where the terminal includes: a speaker, a microphone, a processor, and a memory;
所述处理器和所述扬声器、所述麦克风、所述存储器进行相互的通信;The processor and the speaker, the microphone, and the memory communicate with each other;
所述存储器用于存储指令;The memory is for storing instructions;
所述扬声器,用于在所述处理器的控制下播放第一超声波;The speaker for playing a first ultrasonic wave under the control of the processor;
所述麦克风,用于在所述处理器的控制下接收第二超声波;The microphone for receiving a second ultrasonic wave under the control of the processor;
所述处理器用于执行所述存储器中的所述指令,执行如前述第一方 面中任一项所述的方法。The processor is operative to execute the instructions in the memory, performing the method of any of the preceding aspects.
本申请实例提供了一种计算机可读存储介质,所述计算机可读存储介质中存储有指令,当其在计算机上运行时,使得计算机执行上述各方面所述的方法。The present application examples provide a computer readable storage medium having instructions stored therein that, when executed on a computer, cause the computer to perform the methods described in the various aspects above.
附图简要说明BRIEF DESCRIPTION OF THE DRAWINGS
为了更清楚地说明本申请实例中的技术方案,下面将对实例描述中所需要使用的附图作简单地介绍,显而易见地,下面描述中的附图仅仅是本申请的一些实例,对于本领域的技术人员来讲,还可以根据这些附图获得其他的附图。In order to more clearly illustrate the technical solutions in the examples of the present application, the drawings used in the description of the examples will be briefly described below. Obviously, the drawings in the following description are only some examples of the present application, For the skilled person, other figures can also be obtained from these figures.
图1a为本申请实例涉及的一种系统构架示意图;1a is a schematic structural diagram of a system involved in an example of the present application;
图1b为本申请实例提供的一种手势识别方法的流程方框示意图;1b is a schematic block diagram of a gesture recognition method provided by an example of the present application;
图1c为本申请实例涉及的一种系统构架示意图;1c is a schematic structural diagram of a system involved in an example of the present application;
图2为本申请实例提供的超声波发送的流程示意图;2 is a schematic flow chart of ultrasonic transmission provided by an example of the present application;
图3为本申请实例提供的超声波接收的流程示意图;3 is a schematic flow chart of ultrasonic receiving provided by an example of the present application;
图4为本申请实例提供的一维手势识别处理的流程示意图;4 is a schematic flowchart of a one-dimensional gesture recognition process provided by an example of the present application;
图5为本申请实例提供的二维手势识别处理的流程示意图;FIG. 5 is a schematic flowchart of a two-dimensional gesture recognition process provided by an example of the present application;
图6为本申请实例提供的手势识别动作管理的流程示意图;6 is a schematic flowchart of gesture recognition action management provided by an example of the present application;
图7-a为本申请实例提供的手势识别的动作开启的示意图;7-a is a schematic diagram of the action of gesture recognition provided by the example of the present application;
图7-b为本申请实例提供的手势识别的动作结束的示意图;7-b is a schematic diagram of the end of the action of gesture recognition provided by the example of the present application;
图8为本申请实例提供的手势识别的基本操作的示意图;FIG. 8 is a schematic diagram of a basic operation of gesture recognition provided by an example of the present application; FIG.
图9-a为本申请实例提供的一种终端的组成结构示意图;9-a is a schematic structural diagram of a terminal provided by an example of the present application;
图9-b为本申请实例提供的一种模式确定模块的组成结构示意图;9-b is a schematic structural diagram of a mode determining module provided by an example of the present application;
图9-c为本申请实例提供的另一种终端的组成结构示意图;9-c is a schematic structural diagram of another terminal provided by an example of the present application;
图9-d为本申请实例提供的另一种终端的组成结构示意图;9-d is a schematic structural diagram of another terminal provided by an example of the present application;
图10为本申请实例提供的手势识别方法应用于终端的组成结构示意图;FIG. 10 is a schematic structural diagram of a gesture recognition method provided by an example of the present application applied to a terminal;
图11为本申请实例提供另一种终端的组成结构示意图。FIG. 11 is a schematic structural diagram of another terminal according to an example of the present application.
实施方式Implementation
本申请实例提供了一种手势识别方法和终端,用于降低在终端的应用程序上的手势识别的复杂度,降低对终端的计算性能的要求。The application example provides a gesture recognition method and a terminal for reducing the complexity of gesture recognition on an application of the terminal and reducing the computing performance requirement of the terminal.
为使得本申请的目的、特征、优点能够更加的明显和易懂,下面将结合本申请实例中的附图,对本申请实例中的技术方案进行清楚、完整地描述,显然,下面所描述的实例仅仅是本申请一部分实例,而非全部实例。基于本申请中的实例,本领域的技术人员所获得的所有其他实例,都属于本申请保护的范围。In order to make the objects, features, and advantages of the present application more obvious and easy to understand, the technical solutions in the examples of the present application will be clearly and completely described in conjunction with the drawings in the examples of the present application. It is only a part of the examples of this application, not all examples. All other examples obtained by those skilled in the art based on the examples in the present application are within the scope of the present application.
本申请的说明书和权利要求书及上述附图中的术语“包括”和“具有”以及他们的任何变形,意图在于覆盖不排他的包含,以便包含一系列单元的过程、方法、系统、产品或设备不必限于那些单元,而是可包括没有清楚地列出的或对于这些过程、方法、产品或设备固有的其它单元。The terms "including" and "comprising", and any variations thereof, are intended to cover a non-exclusive inclusion in order to include a series of units of processes, methods, systems, or products. The devices are not necessarily limited to those units, but may include other units not explicitly listed or inherent to such processes, methods, products, or devices.
以下分别进行详细说明。The details are described below separately.
接触式输入方式中,在有些场合不方便或者无法进行接触式输入时,例如,在厨房做菜,需要翻食谱时不方便进行翻页切换等操作,又如在开车的时候,不方便进行手机操作,用户均无法进行输入。In the contact input mode, when it is inconvenient or impossible to make contact input in some occasions, for example, cooking in the kitchen, it is inconvenient to switch pages when switching recipes, and it is inconvenient to carry out the mobile phone when driving. Operation, the user can not input.
基于图像识别的方式中,终端可以基于用户所作出的手势的图像来识别用户的手势,然而图像识别算法复杂、计算量大、功耗高、容易受到光线的影响,对终端设备的性能要求高,因此,该方式无法实现在终端上的精确识别。In the image recognition-based manner, the terminal can recognize the gesture of the user based on the image of the gesture made by the user. However, the image recognition algorithm is complex, computationally intensive, high in power consumption, and susceptible to light, and has high performance requirements on the terminal device. Therefore, this method cannot achieve accurate recognition on the terminal.
基于上述缺陷,本申请提出一种手势识别方法,在一些实例中,该方法具体可以应用于终端对用户的手势识别场景中。请参阅图1a所示,本申请实例提供的手势识别方法应用于终端101,终端101中配置有扬声器102和麦克风103,终端101上还安装有应用程序104,例如终端 上可安装的应用程序包括:浏览器APP,或者办公软件APP,或者游戏APP等,此处不做限定。Based on the above deficiencies, the present application provides a gesture recognition method. In some examples, the method may be specifically applied to a gesture recognition scene of a terminal to a user. Referring to FIG. 1a, the gesture recognition method provided by the example of the present application is applied to the terminal 101. The terminal 101 is configured with a speaker 102 and a microphone 103. The terminal 101 also has an application 104 installed thereon. For example, the application program installed on the terminal includes : Browser APP, or office software APP, or game app, etc., is not limited here.
图1b是根据本申请一个示例性实例示出的一种手势识别方法的流程示意图。FIG. 1b is a schematic flow chart of a gesture recognition method according to an illustrative example of the present application.
如图1b所示,该手势识别方法,包括如下步骤:As shown in FIG. 1b, the gesture recognition method includes the following steps:
S101、根据应用程序的输入控制指示确定终端是否开启手势识别模式。S101. Determine, according to an input control indication of the application, whether the terminal turns on the gesture recognition mode.
在本申请实例中,终端上安装有应用程序,用户可以使用该应用程序,例如用户可以使用浏览器APP查询网页。输入控制指示具体可以设置在应用程序的设置菜单上,也可以设置在该应用程序的配置文件中,例如输入控制指示可以设置在安装配置文件中,在应用程序安装到终端上时由用户来配置该输入控制指示。又如,在应用程序的输入控制指示菜单上可以设置手势识别模式的开启页面,若用户选择启用手势识别模式,则可以确定该终端开启了手势识别模式,若用户没有开启手势识别模式,则可以确定该终端关闭了手势识别模式。In the example of the present application, an application is installed on the terminal, and the user can use the application, for example, the user can use the browser APP to query the webpage. The input control instruction may be specifically set in the application's setting menu, or may be set in the application's configuration file. For example, the input control indication may be set in the installation configuration file, and configured by the user when the application is installed on the terminal. This input control indication. For example, in the input control indication menu of the application, the open page of the gesture recognition mode can be set. If the user selects to enable the gesture recognition mode, it can be determined that the gesture recognition mode is enabled on the terminal, and if the user does not enable the gesture recognition mode, It is determined that the terminal has turned off the gesture recognition mode.
需要说明的是,在本申请实例中,当终端确定开启手势识别模式时,才触发执行后续步骤,否则结束本申请实例中的手势识别方法。本申请实例中通过应用程序的输入控制指示可以控制终端是否启用手势识别模式。It should be noted that, in the example of the present application, when the terminal determines to enable the gesture recognition mode, the subsequent steps are triggered, otherwise the gesture recognition method in the example of the present application is ended. In the example of the present application, the input control indication of the application can control whether the terminal enables the gesture recognition mode.
在本申请的一些实例中,步骤S101根据应用程序的输入控制指示确定终端是否开启手势识别模式,包括:In some examples of the present application, step S101 determines whether the terminal turns on the gesture recognition mode according to the input control indication of the application, including:
根据应用程序的配置文件确定应用程序的输入控制指示是否为默认开启手势识别模式;Determining whether the input control indication of the application is the default open gesture recognition mode according to the configuration file of the application;
检测应用程序在终端上是否运行成功;Detect whether the application runs successfully on the terminal;
在应用程序运行成功时,若终端默认开启手势识别模式,确定终端开启了手势识别模式。When the application runs successfully, if the terminal turns on the gesture recognition mode by default, it is determined that the gesture recognition mode is enabled in the terminal.
其中,本申请实例中可以配置应用程序的输入控制指示为默认开启手势识别模式,则可以在应用程序在终端上运行起来后就自动打开手势识别模式,从而可以省去用户重新设置是否开启的麻烦,简化用户的操作。使得用户在运行某个应用程序时,就可以自动启用手势识别模式。In the example of the present application, the input control indication of the application can be configured as the default open gesture recognition mode, and the gesture recognition mode can be automatically turned on after the application runs on the terminal, thereby eliminating the trouble of whether the user resets or not. To simplify the user's operation. Enables the user to automatically enable gesture recognition mode when running an application.
在本申请的一些实例中,步骤S101根据应用程序的输入控制指示确定终端是否开启手势识别模式,包括:In some examples of the present application, step S101 determines whether the terminal turns on the gesture recognition mode according to the input control indication of the application, including:
根据应用程序的配置文件确定应用程序的输入控制指示为第一二维手势开启手势识别模式,或者第二二维手势关闭手势识别模式;Determining, according to a profile of the application, an input control indication of the application to turn on a gesture recognition mode for the first two-dimensional gesture, or a second two-dimensional gesture to turn off the gesture recognition mode;
当检测到第一二维手势时,确定终端开启了手势识别模式;When the first two-dimensional gesture is detected, determining that the terminal has turned on the gesture recognition mode;
当检测到第二二维手势时,确定终端关闭了手势识别模式。When the second two-dimensional gesture is detected, it is determined that the terminal has turned off the gesture recognition mode.
其中,本申请实例中手势识别模式的开启和关闭还可以由预设的一些手势动作来决定,例如可以预设一些二维手势,通过这些预设的二维手势由用户来控制终端是否开启手势识别模式,从而满足用户对终端的实时控制需求。举例说明,根据应用程序的配置文件确定应用程序的输入控制指示可以为第一二维手势开启手势识别模式,则终端可以通过扬声器发射超声波,通过麦克风采集超声波,从而识别出用户是否做了预设的某一个手势动作(例如第一二维手势),只有该预设的手势动作被检测出时可以开启手势识别模式。又如,根据应用程序的配置文件确定应用程序的输入控制指示可以为第二二维手势关闭手势识别模式,则终端可以通过扬声器发射超声波,通过麦克风采集超声波,从而识别出用户是否做了预设的某一个手势动作(例如第二二维手势),只有该预设的手势动作被检测出时就可以关闭手势识别模式。The opening and closing of the gesture recognition mode in the example of the present application may also be determined by some preset gesture actions. For example, some two-dimensional gestures may be preset, and the preset two-dimensional gesture is used by the user to control whether the terminal turns on the gesture. Identify the mode to meet the user's real-time control needs for the terminal. For example, determining that the input control indication of the application may open the gesture recognition mode for the first two-dimensional gesture according to the configuration file of the application, the terminal may transmit the ultrasonic wave through the speaker, and acquire the ultrasonic wave through the microphone, thereby identifying whether the user has made a preset. One of the gesture actions (for example, the first two-dimensional gesture) can be turned on only when the preset gesture motion is detected. For another example, according to the configuration file of the application, determining that the input control indication of the application can turn off the gesture recognition mode for the second two-dimensional gesture, the terminal can transmit the ultrasonic wave through the speaker, and acquire the ultrasonic wave through the microphone, thereby identifying whether the user has made a preset. One of the gesture actions (eg, the second two-dimensional gesture) can be turned off only when the preset gesture action is detected.
在本申请的一些实例中,除了执行前述的方法步骤之外,本申请实例提供的手势识别方法还可以包括如下步骤:In some examples of the present application, in addition to performing the foregoing method steps, the gesture recognition method provided by the example of the present application may further include the following steps:
当扬声器播放训练超声波时,通过麦克风录制指定手势动作的波形数据;When the speaker plays the training ultrasonic wave, the waveform data of the specified gesture action is recorded through the microphone;
根据指定手势动作的波形数据建立手势波形模板库;Establishing a gesture waveform template library according to waveform data of the specified gesture action;
根据手势波形模板库确定指定手势动作对应的手势识别结果。The gesture recognition result corresponding to the specified gesture action is determined according to the gesture waveform template library.
本申请实例中,用户操作不同类型的终端进行手势识别时,由于不同的终端机型对用户手势动作接收到的波形数据存在差异,为了提升手势动作的匹配准确度,本申请实例中建立手势波形模板库。例如为了解决终端的计算能力不足问题,还可以建立基于云端服务器的手势波形模板库。也即,在一些实例中,本申请实例提供的手势识别方法还可以应用于如图1c所示的系统构架中。In the example of the present application, when the user operates different types of terminals for gesture recognition, since different terminal models have different waveform data received by the user gestures, in order to improve the matching accuracy of the gesture actions, the gesture waveform is established in the example of the present application. Template library. For example, in order to solve the problem of insufficient computing power of the terminal, a cloud server-based gesture waveform template library can also be established. That is, in some examples, the gesture recognition method provided by the examples of the present application can also be applied to the system architecture as shown in FIG. 1c.
如图1c所示,该系统构架包括终端101和云端服务器105,两者通过一个或多个互联网106进行交互。As shown in FIG. 1c, the system architecture includes a terminal 101 and a cloud server 105 that interact via one or more Internet 106.
用户通过终端101录制了指定手势动作的波形数据,云端服务器105基于上述交互就可以建立手势波形模板库,采用机器学习的方法进行手势识别,提升手势匹配的准确度。在一些实例中,云端服务器105可以包括两个组成部分:一部分是离线手势波形模板库训练,一部分是在线手势识别。The user records the waveform data of the specified gesture action through the terminal 101, and the cloud server 105 can establish a gesture waveform template library based on the above interaction, and adopts a machine learning method to perform gesture recognition, thereby improving the accuracy of gesture matching. In some examples, cloud server 105 can include two components: one is offline gesture waveform template library training and the other is online gesture recognition.
终端101可以将接收到的第二超声波提取到超声波数据,然后将该超声波数据发送给云端服务器105,该云端服务器105可以采用在线云端识别的方式通过手势波形模板库获取到手势识别结果,然后云端服务器105将该手势识别结果发送给终端101,使得不同终端类型都可以获取到适用于该终端类型的手势识别结果,例如,通过终端的配置文件可以获取到终端的类型信息和终端的配置信息,例如终端的系统类型、操作系统版本、终端采用的处理器个数、内容空间大小,从而根据终端的类型信息和终端的配置信息进入为这种终端所设置的手势波形模板库,从而提高手势匹配的准确度。The terminal 101 may extract the received second ultrasonic wave to the ultrasonic data, and then send the ultrasonic data to the cloud server 105, and the cloud server 105 may acquire the gesture recognition result through the gesture waveform template library by using online cloud recognition, and then the cloud The server 105 sends the gesture recognition result to the terminal 101, so that different terminal types can obtain the gesture recognition result applicable to the terminal type. For example, the type information of the terminal and the configuration information of the terminal can be obtained through the configuration file of the terminal. For example, the system type of the terminal, the operating system version, the number of processors used by the terminal, and the size of the content space, thereby entering the gesture waveform template library set for the terminal according to the type information of the terminal and the configuration information of the terminal, thereby improving gesture matching. Accuracy.
S102、当终端确定开启手势识别模式时,通过扬声器播放第一超声波。S102. When the terminal determines to enable the gesture recognition mode, the first ultrasonic wave is played through the speaker.
在本申请实例中,终端内置有扬声器,终端可以首先生成超声波信号,然后通过扬声器播放出超声波,将扬声器播放出的超声波定义为“第一超声波”。该第一超声波从终端所在的位置发射出去之后,该第一超声波碰到目标障碍物会向终端所在的位置进行反射。其中,目标障碍物主要是用户的人手,例如用户的单个或多个手指,或者用户的一个或两个手掌等。In the example of the present application, the terminal has a built-in speaker, and the terminal can first generate an ultrasonic signal, and then play the ultrasonic wave through the speaker, and define the ultrasonic wave played by the speaker as the “first ultrasonic wave”. After the first ultrasonic wave is emitted from the position where the terminal is located, the first ultrasonic wave hits the target obstacle and reflects to the position where the terminal is located. The target obstacle is mainly a user's hand, such as a single or multiple fingers of the user, or one or two palms of the user.
在本申请实例中,终端可以使用内置的超声波发生器来产生超声波,然后通过终端内置的扬声器来发送超声波。超声波发生器通过机械振动的方式产生超声波,通常以纵波的方式在弹性介质内传播,是一种能量的传播形式。超声波的波长短,方向性好。超声波发生器产生的超声波频率高于20000赫兹,方向性好,穿透能力强,易于获得较集中的声能。In the example of the present application, the terminal can use the built-in ultrasonic generator to generate ultrasonic waves, and then transmit the ultrasonic waves through the built-in speakers of the terminal. The ultrasonic generator generates ultrasonic waves by mechanical vibration, and generally propagates in an elastic medium in a longitudinal wave manner, which is a form of energy propagation. The ultrasonic wave has a short wavelength and good directivity. The ultrasonic generator generates ultrasonic waves with a frequency higher than 20,000 Hz, good directionality, strong penetrating power, and easy to obtain concentrated sound energy.
在本申请的一些实例中,通过扬声器播放第一超声波之后,本申请实例提供的手势识别方法还包括:In some examples of the present application, after the first ultrasonic wave is played through the speaker, the gesture recognition method provided by the example of the present application further includes:
检测扬声器播放的第一超声波的信号波形是否存在异常;Detecting whether there is an abnormality in a signal waveform of the first ultrasonic wave played by the speaker;
若第一超声波的信号波形存在异常,重新执行前述步骤S102:通过扬声器播放第一超声波;If there is an abnormality in the signal waveform of the first ultrasonic wave, re-execute the foregoing step S102: playing the first ultrasonic wave through the speaker;
若第一超声波的信号波形不存在异常,触发执行如下步骤S103:通过麦克风接收第二超声波。If there is no abnormality in the signal waveform of the first ultrasonic wave, the trigger performs the following step S103: receiving the second ultrasonic wave through the microphone.
其中,在扬声器播放第一超声波之后,可以对播放数据进行异常检测,例如从时域、频域等多维度对第一超声波的信号波形进行检测。举例说明如下,终端可以检测播放的数据有没有丢帧,例如判断采样到的频率点数是否与扬声器在1秒钟内播放的频率点数相同。又如,可以检测终端内置的扬声器是否存在播放异常。又如,终端可以在频域查看信号的能量是不是正常,比如播放的超声波信号为20khz,判断接收到的超声波能量是不是20khz。又如,终端在播放超声波时,还可以用快节 奏的音乐播放检测有没有丢帧。通过前述对第一超声波的信号波形的异常判断,可以提高接收超声波的成功率。After the first ultrasonic wave is played by the speaker, abnormality detection may be performed on the playing data, for example, the signal waveform of the first ultrasonic wave is detected from multiple dimensions such as time domain and frequency domain. For example, the terminal can detect whether the played data has dropped frames, for example, whether the sampled frequency points are the same as the frequency points played by the speaker in one second. For another example, it is possible to detect whether there is a playback abnormality in the speaker built in the terminal. For another example, the terminal can check whether the energy of the signal is normal in the frequency domain, for example, the ultrasonic signal played is 20khz, and it is determined whether the received ultrasonic energy is 20khz. For example, when the terminal plays an ultrasonic wave, it can also detect whether there is a frame loss by playing music with fast rhythm. By the above-described abnormality determination of the signal waveform of the first ultrasonic wave, the success rate of the received ultrasonic wave can be improved.
S103、通过麦克风接收第二超声波,第二超声波为对第一超声波进行采集后得到的波形数据。S103. Receive a second ultrasonic wave through a microphone, and the second ultrasonic wave is waveform data obtained by collecting the first ultrasonic wave.
在本申请实例中,扬声器播出第一超声波之后,可以通过终端内置的麦克风来接收声波信号,将麦克风接收到的声波信号定义为“第二超声波”。其中终端内置的麦克风可以是一个或者多个,此处不做限定。通常从终端的周围环境中接收到的第二超声波是混合信号,混合有通过目标障碍物反射后的超声波,以及扬声器发射的超声波,以及从终端的附近发射出的超声波。In the example of the present application, after the speaker broadcasts the first ultrasonic wave, the sound wave signal can be received through the microphone built in the terminal, and the sound wave signal received by the microphone is defined as the “second ultrasonic wave”. The microphone built in the terminal may be one or more, which is not limited herein. The second ultrasonic wave normally received from the surroundings of the terminal is a mixed signal mixed with ultrasonic waves reflected by the target obstacle, and ultrasonic waves emitted from the speaker, and ultrasonic waves emitted from the vicinity of the terminal.
在本申请的一些实例中,步骤S103通过麦克风接收第二超声波之后,本申请实例提供的手势识别方法还包括:In some examples of the present application, after the step S103 receives the second ultrasonic wave through the microphone, the gesture recognition method provided by the example of the present application further includes:
检测接收到的第二超声波的信号波形是否存在异常;Detecting whether there is an abnormality in the signal waveform of the received second ultrasonic wave;
若第二超声波的信号波形存在异常,丢弃接收到的第二超声波;If the signal waveform of the second ultrasonic wave is abnormal, discarding the received second ultrasonic wave;
若第二超声波的信号波形不存在异常,触发执行如下步骤:根据接收到的第二超声波进行手势识别。If there is no abnormality in the signal waveform of the second ultrasonic wave, the trigger performs the following steps: performing gesture recognition according to the received second ultrasonic wave.
其中,终端采集到第二超声波之后,还可以判断录制的信号波形是否存在异常,例如接收到的信号波形是否完整,即是否存在丢帧的情况。例如可以从频域、时域等多维度进行检测,否则影响后面的手势识别结果。检测的方法有多种,例如判断接收到的信号波形能量是否等于播放的信号波形能量。又如每秒录制的频率点数是不是和采样频率相同。又如信号的时频图能量有没有杂质。通过前述对第二超声波的异常判断,只有在接收到的第二超声波不存在异常时才执行手势识别,从而提高手势识别的准确率。After the terminal acquires the second ultrasonic wave, it can also determine whether there is an abnormality in the recorded signal waveform, for example, whether the received signal waveform is complete, that is, whether there is a frame loss. For example, the detection can be performed from multiple dimensions such as frequency domain and time domain, otherwise the subsequent gesture recognition result is affected. There are various methods of detecting, for example, determining whether the received signal waveform energy is equal to the played signal waveform energy. Another example is whether the frequency points recorded per second are the same as the sampling frequency. Another example is the energy of the time-frequency diagram of the signal. Through the aforementioned abnormal determination of the second ultrasonic wave, the gesture recognition is performed only when there is no abnormality in the received second ultrasonic wave, thereby improving the accuracy of the gesture recognition.
在本申请的一些实例中,步骤S103通过麦克风接收第二超声波,包括:In some examples of the present application, step S103 receives the second ultrasonic wave through the microphone, including:
按照预设的录制参数控制麦克风录制终端的周围环境中的声波信号;Controlling sound wave signals in the surrounding environment of the microphone recording terminal according to preset recording parameters;
将录制到的声波信号存储到终端的录制缓冲区中,其中,录制缓冲区中存储的声波信号为第二超声波。The recorded sound wave signal is stored in the recording buffer of the terminal, wherein the sound wave signal stored in the recording buffer is the second ultrasonic wave.
其中,终端预设的录制参数可以有多种,例如单声道或者多声道。终端可以根据需求选择单声道、多声道等方式录制。又如录制参数还可以包括采样频率,该采样频率和超声波的发送频率相同。在采集到声波信号之后,终端可以将该声波信号存储到录制缓冲区中。录制缓冲区的大小可以按照如下方式设置,若录制缓冲区设置的太小,有可能会导致丢帧,一般设置2倍以上的最小缓冲区的大小。Among them, the terminal preset recording parameters can be various, such as mono or multi-channel. The terminal can select mono, multi-channel, etc. according to the needs. For example, the recording parameter may further include a sampling frequency, which is the same as the transmission frequency of the ultrasonic wave. After the acoustic signal is acquired, the terminal can store the acoustic signal in the recording buffer. The size of the recording buffer can be set as follows. If the recording buffer is set too small, it may cause frame loss. Generally, the minimum buffer size is more than 2 times.
S104、根据接收到的第二超声波进行手势识别,得到手势识别结果。S104. Perform gesture recognition according to the received second ultrasonic wave to obtain a gesture recognition result.
在本申请实例中,通过前述步骤S103采集到第二超声波之后,可以对接收到的第二超声波进行手势识别,由于扬声器发射出的第一超声波会被用户的人手所阻挡,麦克风会采集到人手阻挡后的超声波,因此通过对麦克风接收到的第二超声波的接收位置、波形能量等分析,就可以识别出用户所做的手势,得到手势识别结果。In the example of the present application, after the second ultrasonic wave is collected by the foregoing step S103, the received second ultrasonic wave may be gesture-recognized. Since the first ultrasonic wave emitted by the speaker is blocked by the user's hand, the microphone collects the human hand. The blocked ultrasonic wave, so by analyzing the receiving position, waveform energy, and the like of the second ultrasonic wave received by the microphone, the gesture made by the user can be recognized, and the gesture recognition result is obtained.
在本申请的一些实例中,除了执行前述的方法步骤之外,步骤S104根据接收到的第二超声波进行手势识别,得到手势识别结果之后,本申请实例提供的手势识别方法还可以包括如下步骤:In some examples of the present application, in addition to performing the foregoing method steps, the step S104 performs gesture recognition according to the received second ultrasonic wave, and after the gesture recognition result is obtained, the gesture recognition method provided by the example of the present application may further include the following steps:
通过所述应用程序的显示界面显示手势识别成功的提示信息。The prompt information for successful gesture recognition is displayed through the display interface of the application.
其中,本申请实例中终端的应用程序的显示界面还可以实现与用户的实时互动,通过步骤S104据接收到的第二超声波进行手势识别,得到手势识别结果,说明当前的手势识别成功,为了避免用户重复做多个相同的动作,终端的应用程序的显示界面还可以显示提示信息,例如通过显示界面上显示的一个动画告诉用户当前手势识别程序,或者通过文字或者声音来提示用户,此处不做限定。The display interface of the application of the terminal in the example of the present application can also implement real-time interaction with the user, and the second ultrasonic wave received by the step S104 performs gesture recognition to obtain a gesture recognition result, indicating that the current gesture recognition is successful, in order to avoid The user repeatedly performs multiple identical actions, and the display interface of the application of the terminal can also display prompt information, for example, by using an animation displayed on the display interface to inform the user of the current gesture recognition program, or by prompting the user by text or sound, here is not Make a limit.
S105、根据手势识别结果在应用程序执行相应的手势指令。S105. Perform a corresponding gesture instruction in the application according to the gesture recognition result.
在本申请实例中,得到手势识别结果之后,终端可以响应用户所在的手势,可以根据手势识别结果执行相应的手势指令,例如响应用户的手势指令来操作终端的应用程序。例如手势识别结果为画圆圈,终端在应用程序上执行手势识别结果对应的手势指令:打开应用程序或者对应用程序的某个菜单进行操作等。其中,手势识别结果与手势指令的对应关系可以根据用户在终端上的预配置列表来确定。举例说明,终端对用户的手势进行识别得到手势识别结果,终端可以在浏览器应用程序上执行用户的手势指令,例如在浏览器的搜索框中输入文字。In the example of the present application, after the gesture recognition result is obtained, the terminal may respond to the gesture of the user, and may execute a corresponding gesture instruction according to the gesture recognition result, for example, operating the application of the terminal in response to the gesture instruction of the user. For example, the gesture recognition result is a circle, and the terminal executes a gesture instruction corresponding to the gesture recognition result on the application: opening the application or operating a certain menu of the application. The correspondence between the gesture recognition result and the gesture instruction may be determined according to a pre-configured list of the user on the terminal. For example, the terminal recognizes the gesture of the user to obtain a gesture recognition result, and the terminal may execute a gesture instruction of the user on the browser application, for example, inputting a text in a search box of the browser.
在本申请的一些实例中,步骤S102通过扬声器播放第一超声波,包括:In some examples of the present application, step S102 plays the first ultrasonic wave through the speaker, including:
通过扬声器播放N个频率的第一超声波,N为大于或等于1的正整数;Playing a first ultrasonic wave of N frequencies through a speaker, N being a positive integer greater than or equal to 1;
在这种实现场景下,步骤S103通过麦克风接收第二超声波,包括:In this implementation scenario, step S103 receives the second ultrasonic wave through the microphone, including:
通过麦克风从终端的周围环境中采集到N个频率的第二超声波。A second ultrasonic wave of N frequencies is collected from the surroundings of the terminal through a microphone.
其中,终端可以发射一个或多个频率的超声波,例如发送多个不同频率的超声波,这些超声波之间可以间隔相同的距离。终端通过麦克风可以分别采集扬声器播放的一个或多个频率的超声波。Wherein, the terminal can transmit ultrasonic waves of one or more frequencies, for example, transmitting a plurality of ultrasonic waves of different frequencies, and the ultrasonic waves can be separated by the same distance. The terminal can separately acquire ultrasonic waves of one or more frequencies played by the speaker through the microphone.
在本申请的一些实例中,步骤S104根据接收到的第二超声波进行手势识别,得到手势识别结果,包括:In some examples of the present application, step S104 performs gesture recognition according to the received second ultrasonic wave to obtain a gesture recognition result, including:
当N的取值为1时,根据第二超声波的波形能量进行一维度的手势识别,得到第一手势识别结果,第一手势识别结果包括:手势靠近终端,或者远离终端。When the value of N is 1, the one-dimensional gesture recognition is performed according to the waveform energy of the second ultrasonic wave, and the first gesture recognition result is obtained. The first gesture recognition result includes: the gesture is close to the terminal, or is away from the terminal.
其中,终端可以通过扬声器发射一个超声波,则终端可以通过麦克风采集到该超声波,终端对接收到的一个超声波可以进行粗粒度的一维手势识别,例如通过多普勒的波形能量变化得到第一手势识别结果,该 第一手势识别结果包括:手势靠近终端,或者远离终端。Wherein, the terminal can transmit an ultrasonic wave through the speaker, and the terminal can collect the ultrasonic wave through the microphone, and the terminal can perform coarse-grained one-dimensional gesture recognition on the received one ultrasonic wave, for example, the first gesture is obtained by Doppler waveform energy change. The result of the recognition, the first gesture recognition result includes: the gesture is close to the terminal, or is away from the terminal.
在本申请的另一些实例中,步骤S104根据接收到的第二超声波进行手势识别,得到手势识别结果,包括:In other examples of the present application, step S104 performs gesture recognition according to the received second ultrasonic wave to obtain a gesture recognition result, including:
当N的取值大于或等于2时,对N个第二超声波分别计算出手势的移动距离;When the value of N is greater than or equal to 2, the moving distance of the gesture is calculated for each of the N second ultrasonic waves;
从N个手势的移动距离中排除出异常的移动距离,对于保留的移动距离进行一维度的手势识别,得到第一手势识别结果,第一手势识别结果包括:手势靠近终端,或者远离终端,或者相对于终端向左移动,或者相对于终端向右移动,或者相对于终端向上移动,或者相对于终端向下移动。Excluding an abnormal moving distance from the moving distance of the N gestures, performing one-dimensional gesture recognition on the retained moving distance, and obtaining a first gesture recognition result, where the first gesture recognition result includes: the gesture is close to the terminal, or is away from the terminal, or Moving to the left relative to the terminal, or moving to the right relative to the terminal, or moving up relative to the terminal, or moving downward relative to the terminal.
其中,终端可以通过扬声器发射多个超声波,则终端可以通过麦克风分别采集到各个超声波,终端对接收到的多个超声波可以进行细粒度的一维手势识别。例如,采用N个频率的超声波相位变化计算距离,反应比较灵敏。发射17500HZ-23000HZ中间的N个超声波,相邻两个超声波之间间隔相同的距离,采样频率48000Hz,可以采样512个点实时计算距离变化,反应时间在10.7ms。在声音传输的过程中会有些多径效应,还有些背景静态物体的干扰,为了减少多径效应和背景的干扰,可以用动态信号减去背景的干扰,用N个频率计算N个距离,根据最小二乘法,计算偏差,把偏差大的频率距离去掉,保留偏差小的距离。通过排除异常的距离,可以提升距离的计算精度。The terminal can transmit a plurality of ultrasonic waves through the speaker, and the terminal can separately collect each ultrasonic wave through the microphone, and the terminal can perform fine-grained one-dimensional gesture recognition on the received plurality of ultrasonic waves. For example, using N-frequency ultrasonic phase changes to calculate the distance, the response is more sensitive. N ultrasonic waves in the middle of 17500HZ-23000HZ are emitted. The distance between adjacent ultrasonic waves is the same. The sampling frequency is 48000Hz. 512 points can be sampled to calculate the distance change in real time. The reaction time is 10.7ms. In the process of sound transmission, there will be some multipath effects, and some background static objects. In order to reduce the multipath effect and background interference, the dynamic signal can be subtracted from the background interference, and N distances can be calculated by N frequencies. The least squares method is used to calculate the deviation, and the frequency distance with large deviation is removed, and the deviation is small. By excluding the abnormal distance, the calculation accuracy of the distance can be improved.
在本申请的一些实例中,步骤S103通过麦克风接收第二超声波,包括:In some examples of the present application, step S103 receives the second ultrasonic wave through the microphone, including:
通过终端的两个麦克风分别采集到第二超声波。The second ultrasonic wave is separately collected by the two microphones of the terminal.
在这种实现场景下,步骤S104根据接收到的第二超声波进行手势识别,得到手势识别结果,包括:In this implementation scenario, step S104 performs gesture recognition according to the received second ultrasonic wave to obtain a gesture recognition result, including:
根据两个麦克风分别接收到的第二超声波计算手势的相对位置和 初始位置;Calculating a relative position and an initial position of the gesture according to the second ultrasonic waves respectively received by the two microphones;
根据计算出的相对位置和初始位置进行二维度的手势识别,得到第二手势识别结果,第二手势识别结果包括:手势变化的二维坐标。The two-dimensional gesture recognition is performed according to the calculated relative position and the initial position, and the second gesture recognition result is obtained, and the second gesture recognition result includes: two-dimensional coordinates of the gesture change.
其中,为了提高手势识别的精度,终端内可以设置至少两个麦克风来分别采集第二超声波。则对于每个麦克风接收到的第二超声波,都可以计算出手势的相对位置和初始位置。其中,手势的相对位置是指基于第二超声波的相位测量得到手势相对于终端的位置,手势的初始位置是指用户手势在初次识别时的位置。通过前述计算出的手势的相对位置和初始位置,可以进行二维的手势识别,得到手势的精确坐标,提高手势识别的准确度,防止误操作。In order to improve the accuracy of the gesture recognition, at least two microphones may be disposed in the terminal to separately acquire the second ultrasonic waves. Then, for the second ultrasonic wave received by each microphone, the relative position and initial position of the gesture can be calculated. Wherein, the relative position of the gesture refers to the position of the gesture relative to the terminal based on the phase measurement of the second ultrasonic wave, and the initial position of the gesture refers to the position of the user gesture at the time of initial recognition. Through the foregoing calculated relative position and initial position of the gesture, two-dimensional gesture recognition can be performed, the precise coordinates of the gesture can be obtained, the accuracy of gesture recognition can be improved, and misoperation can be prevented.
通过前述实例对本申请的举例说明可知,终端内设置有扬声器和麦克风,且该终端上安装有应用程序。首先根据应用程序的输入控制指示确定终端是否开启手势识别模式,在该终端确定开启手势识别模式时,通过扬声器播放第一超声波,然后通过麦克风接收第二超声波,第二超声波为对第一超声波进行采集后得到的波形数据,接下来根据接收到的第二超声波进行手势识别,得到手势识别结果,最后根据该手势识别结果在应用程序执行相应的手势指令。本申请实例中,使用终端内置的扬声器和麦克风就可以实现手势识别,即通过超声波的发送与检测就可以识别出用户的动作,降低了手势识别的复杂度,降低了对终端的计算性能的要求,并且实现了对用户在应用程序上的手势指令控制。进一步的,通过对第二超声波进行计算,得出手势的移动距离或手势的二维变化,提高了手势识别的精确率。Through the foregoing examples, the description of the present application shows that a speaker and a microphone are disposed in the terminal, and an application program is installed on the terminal. Firstly, according to the input control indication of the application, determining whether the terminal turns on the gesture recognition mode, when the terminal determines to turn on the gesture recognition mode, playing the first ultrasonic wave through the speaker, and then receiving the second ultrasonic wave through the microphone, and the second ultrasonic wave is for the first ultrasonic wave The waveform data obtained after the acquisition is then subjected to gesture recognition according to the received second ultrasonic wave to obtain a gesture recognition result, and finally a corresponding gesture instruction is executed in the application according to the gesture recognition result. In the example of the present application, the gesture recognition can be realized by using the built-in speaker and microphone of the terminal, that is, the motion of the user can be recognized through the transmission and detection of the ultrasonic wave, the complexity of the gesture recognition is reduced, and the calculation performance requirement of the terminal is reduced. And implements gesture control of the user on the application. Further, by calculating the second ultrasonic wave, the moving distance of the gesture or the two-dimensional change of the gesture is obtained, and the accuracy of the gesture recognition is improved.
为便于更好的理解和实施本申请实例的上述方案,下面举例相应的应用场景,例如,以图1a所示的应用场景,来进行具体说明。To facilitate a better understanding and implementation of the above solution of the example of the present application, the following application scenarios are exemplified, for example, the application scenario shown in FIG. 1a is used for specific description.
本申请实例可以应用到手机浏览器、游戏应用程序(Application,APP)中,本申请实例可以实现高精度、低延迟、无需任何外设、非接 触式的一维和二维的手势识别。本申请实例基于终端的扬声器和麦克风进行超声波手势检测,不需要任何外部设备。举例说明如下,本申请实例中用户不需要手动的操作终端的触摸屏,也不需要从键盘输入任何指令,用户只需要作出手势动作,本申请实例中的终端就可以通过超声波的发送与接收来检测出用户所作出的手势动作,从而在终端的应用程序上执行该手势动作对应的手势指令,例如用户通过做手势来操作游戏应用程序,可以控制游戏应用程序中的角色移动,游戏应用程序的开启与关闭等。The application example can be applied to a mobile phone browser and a game application (Application, APP). The application example can realize high-precision, low-delay, no-peripheral, non-contact one-dimensional and two-dimensional gesture recognition. The example of the present application performs ultrasonic gesture detection based on the speaker and microphone of the terminal, and does not require any external equipment. For example, in the example of the present application, the user does not need to manually operate the touch screen of the terminal, and does not need to input any command from the keyboard, and the user only needs to make a gesture action. The terminal in the example of the present application can detect by sending and receiving ultrasonic waves. The gesture action made by the user is performed to execute the gesture instruction corresponding to the gesture action on the application of the terminal, for example, the user operates the game application by making a gesture, and the character movement in the game application can be controlled, and the game application is opened. With off and so on.
接下来以在浏览器APP的浏览场景中的手势识别为例,用户可以操作基本的上、下、左、右等常见的浏览场景。Taking the gesture recognition in the browsing scenario of the browser APP as an example, the user can operate basic browsing scenarios such as upper, lower, left, and right.
请参阅图2所示,对超声波发送过程进行举例说明。Please refer to Figure 2 for an example of the ultrasonic transmission process.
201、设置超声波的发送频率和采样频率。例如发送N(N>=1)个频率的超声波,采样频率大于等于两倍的发射频率。201. Set the transmission frequency and sampling frequency of the ultrasonic wave. For example, an ultrasonic wave of N (N>=1) frequencies is transmitted, and the sampling frequency is twice or more of the transmission frequency.
202、设置超声波播放方式。202. Set the ultrasonic playback mode.
其中,终端可以采用静态和流式等多种播放方式,可以根据用户的需求设置。The terminal can adopt multiple playback modes such as static and streaming, and can be set according to user requirements.
超声波的播放要求比较严格,如果出现播放异常或者丢帧的情况,会导致计算异常或者错误,所以超声波的播放流程数据获取流程需要提前准备好,播放流程逻辑需要保证播放的波形数据没有丢帧,即需要播放的波形数据是完整的。Ultrasonic playback requirements are strict. If there is abnormal playback or frame loss, it will lead to calculation errors or errors. Therefore, the data acquisition process of the ultrasonic playback process needs to be prepared in advance. The playback process logic needs to ensure that the waveform data played is not dropped. That is, the waveform data that needs to be played is complete.
203、检测扬声器播放的声波信号。203. Detect a sound wave signal played by the speaker.
其中,终端可以检测播放的数据是否异常,可以从时域、频域等多维度检测。检测有多种方式,举例说明,播放的数据检测有没有丢帧,是不是1秒钟播放采样频率个点数。又如检测播放模块有没有报异常。又如在频域上查看信号的能量是不是正常,比如播放的频率是20khz,接收的信号频率是不是20khz。另外,播放过程还可以用快节奏的音乐 播放检测有没有丢帧。The terminal can detect whether the played data is abnormal, and can detect from multiple dimensions such as time domain and frequency domain. There are many ways to detect. For example, if the data being played is detected, there is no frame loss, and it is not the number of points for playing the sampling frequency in 1 second. Another example is to detect whether the playback module has reported an exception. Another example is to check whether the energy of the signal is normal in the frequency domain. For example, the frequency of the broadcast is 20khz, and the frequency of the received signal is 20khz. In addition, the playback process can also detect the loss of frames with fast-paced music playback.
如图3所示,接下来对超声波接收进行举例说明。As shown in Fig. 3, the ultrasonic reception is exemplified next.
301、设置麦克风的录制参数。301. Set the recording parameters of the microphone.
其中,可以根据需求选择单声道、多声道等方式录制,采样频率为超声波的发送频率的两倍。根据奈奎斯特采样定律,采样频率和播放频率比较,采样频率可以是播放频率的2倍。如果录制缓冲区设置的太小,有可能会导致丢帧,一般设置2倍以上的最小缓冲区的大小。Among them, you can select mono, multi-channel, etc. according to your needs, and the sampling frequency is twice the transmission frequency of the ultrasonic wave. According to Nyquist's sampling law, the sampling frequency is compared with the playback frequency, and the sampling frequency can be twice the playback frequency. If the recording buffer is set too small, it may cause frame loss. Generally, the size of the minimum buffer is more than 2 times.
302、启动录制线程。302. Start the recording thread.
其中,从麦克风读取数据,要防止播放的数据录制线程没有录制完全,从而导致出现丢帧的情况。录制线程的逻辑尽量轻量,保证一秒钟接收采样频率个点数。Among them, reading data from the microphone, to prevent the recorded data recording thread from being completely recorded, resulting in frame loss. The logic of the recording thread is as light as possible, ensuring that the sampling frequency is received in one second.
303、检测麦克风的录制数据。303. Detect the recorded data of the microphone.
其中,录制的信号波形是否完整,有无丢帧,需要从频域、时域等多维度进行检测,否则影响后面的手势识别结果。检测的方法有多种,比如接收到的信号能量是不是播放的频率的能量。又如,每秒录制的点数是不是和采样频率相同。又如,可以通过信号的时频图能量有没有杂质。Among them, whether the recorded signal waveform is complete, whether there is no frame loss, need to detect from multiple dimensions such as frequency domain and time domain, otherwise it will affect the subsequent gesture recognition result. There are many ways to detect, such as whether the received signal energy is the energy of the frequency of playback. As another example, is the number of points recorded per second the same as the sampling frequency. As another example, there is no impurity in the energy of the time-frequency diagram of the signal.
本申请实例中,超声波处理过程可以具体是一维手势识别处理,或者二维手势识别处理,接下来分别进行举例说明。In the example of the present application, the ultrasonic processing may be specifically a one-dimensional gesture recognition process or a two-dimensional gesture recognition process, which is respectively illustrated by way of example.
首先对一维手势识别处理进行说明,请参阅图4所示,一维手势识别处理可实现两种方案。First, the one-dimensional gesture recognition processing will be described. Referring to FIG. 4, the one-dimensional gesture recognition processing can implement two schemes.
其中一种粗粒度的手势识别,主要包括:One of the coarse-grained gesture recognitions mainly includes:
401、获取超声波的波形能量。采用多普勒原理,手势靠近远离的过程中,在频谱上可以看到波形能量的明显变化。401. Acquire waveform energy of ultrasonic waves. Using the Doppler principle, in the process of moving away from the gesture, a significant change in the waveform energy can be seen in the spectrum.
402、对于波形的变化的速度、加速度等多维特征进行建模,得到一维的手势变化。主要是根据多普勒效应,手势靠近远离的过程中,波 形的变化趋势不同,根据波形变化的方向和加速度进行计算。举例说明如下,发射20000Hz的超声波,采样频率44100Hz,每次可以采样4096个点进行计算。402. Modeling a multi-dimensional feature such as a velocity, an acceleration, and the like of the waveform to obtain a one-dimensional gesture change. Mainly according to the Doppler effect, in the process of the gesture approaching and far away, the waveform has different trends, and the calculation is based on the direction and acceleration of the waveform change. For example, as follows, an ultrasonic wave of 20,000 Hz is transmitted, and the sampling frequency is 44100 Hz, and 4096 points can be sampled each time for calculation.
另一种细粒度的手势识别,主要包括:Another fine-grained gesture recognition includes:
403、采用N个频率的超声波相位变化计算距离,反应比较灵敏。发射17500HZ-23000HZ中间的N个超声波,相邻两个超声波之间间隔相同的距离,采样频率48000Hz,可以采样512个点实时计算距离变化,反应时间在10.7ms,也即,该实例中,手势识别的精确度可以达到ms级别。403. Calculating the distance by using the ultrasonic phase changes of N frequencies, the reaction is more sensitive. N ultrasonic waves in the middle of 17500HZ-23000HZ, the distance between adjacent two ultrasonic waves is the same, the sampling frequency is 48000Hz, and 512 points can be sampled to calculate the distance change in real time. The reaction time is 10.7ms, that is, in this example, the gesture The accuracy of the recognition can reach the ms level.
404、为了计算更准确,针对N个频率相位变化计算的距离,通过算法排除异常的距离,提升距离的计算精度。声音传输的过程中会有些多径效应,还有些背景静态物体的干扰,为了减少多径效应和背景的干扰,会减去背景的干扰,用N个频率计算N个距离,根据最小二乘法,计算偏差,把偏差大的频率距离去掉,保留偏差小的距离。404. In order to calculate more accurately, the distance calculated for the phase changes of the N frequencies is eliminated by an algorithm to improve the calculation accuracy of the distance. In the process of sound transmission, there will be some multipath effects, and some background static objects will interfere. In order to reduce the multipath effect and background interference, the background interference will be subtracted, and N distances will be calculated by N frequencies. According to the least squares method, Calculate the deviation, remove the frequency distance with large deviation, and keep the distance with small deviation.
接下来对二维手势识别处理进行说明,请参阅图5所示,主要包括如下过程:Next, the two-dimensional gesture recognition processing will be described. Referring to FIG. 5, the following processes are mainly included:
501、利用两个麦克风分别采集超声波。501. Acquire ultrasonic waves using two microphones respectively.
502、判断两个麦克风的超声波计算是否完成。502. Determine whether the ultrasonic calculation of the two microphones is completed.
其中,在手势移动的过程中,两个麦克风对波形产生的影响不同,因此可以得到两个方向的变化。Among them, in the process of gesture movement, the influence of the two microphones on the waveform is different, so that the change of the two directions can be obtained.
二维手势识别处理主要解决手势动作的二维坐标的识别,从而实现二维图形(画圈、画正方形、画三角形等)或者写汉字的识别。利用扬声器和2个麦克风,可以采样两个声道的数据。The two-dimensional gesture recognition processing mainly solves the recognition of the two-dimensional coordinates of the gesture action, thereby realizing the recognition of the two-dimensional graphics (drawing circles, drawing squares, drawing triangles, etc.) or writing Chinese characters. With two speakers, two channels of data can be sampled.
对于每个麦克风声道都执行步骤503和步骤504。 Steps 503 and 504 are performed for each microphone channel.
503、基于一维手势识别,计算出手势移动的相对距离。503. Calculate a relative distance of the gesture movement based on the one-dimensional gesture recognition.
其中,基于一维手势识别的相位可以计算出手势移动的相对距离。The phase based on the one-dimensional gesture recognition can calculate the relative distance of the gesture movement.
504、基于信号计算得到手势的初始位置。504. Calculate an initial position of the gesture based on the signal.
其中,基于麦克风接收到的信号可以计算出手势的粗粒度的初始位置。初始位置是指接收到超声波时检测出的手势的初始位置。Wherein, based on the signal received by the microphone, the coarse-grained initial position of the gesture can be calculated. The initial position is the initial position of the gesture detected when the ultrasonic wave is received.
505、结合两个方向上的坐标,得到二维的手势识别。505. Combine the coordinates in two directions to obtain two-dimensional gesture recognition.
在两个麦克风声道的初始位置和相对位置计算完成之后,可以结合x和y两个方向上的坐标的计算二维的手势识别,可以得到手势识别的精确坐标(x,y),通过初始位置以及相对位置的变化,可以得到实时的手势位置。After the initial position and relative position calculation of the two microphone channels are completed, the two-dimensional gesture recognition can be combined with the coordinates in the two directions of x and y, and the precise coordinates (x, y) of the gesture recognition can be obtained through the initial The position and relative position changes can get the real-time gesture position.
如图6所示,接下来对手势识别的管理进行举例说明。通过对手势识别行动作管理,避免误操作和不必要的打扰。主要包括:As shown in FIG. 6, the management of gesture recognition is exemplified next. Through the action management of gesture recognition, avoid misuse and unnecessary interruption. mainly includes:
601、根据用户的二维手势判断手势识别的功能的开启和关闭。601. Determine, according to the two-dimensional gesture of the user, turning on and off the function of the gesture recognition.
用户可以根据需求设置自己喜欢的手势,建议采用复杂的二维手势,比如画圈、画正方形、画三角形之类的。如图7-a所示,手势识别的动作开启可以是比划一个圆形,如图7-b所示,手势识别的动作关闭可以是比划一个正方形。Users can set their favorite gestures according to their needs. It is recommended to use complex two-dimensional gestures, such as drawing circles, drawing squares, drawing triangles and the like. As shown in FIG. 7-a, the motion of the gesture recognition may be a circular shape, as shown in FIG. 7-b, the motion recognition of the gesture recognition may be a square.
602、根据用户的一维手势执行手势识别的操作功能。602. Perform an operation function of gesture recognition according to a one-dimensional gesture of the user.
其中,用户可以设置手势识别的动作,可以满足基本的操作需求。如图8所示,手势识别基本操作可以包括上下左右等基本动作。Among them, the user can set the gesture recognition action to meet the basic operational needs. As shown in FIG. 8, the basic operation of the gesture recognition may include basic actions such as up, down, left, and right.
需要说明的是,在本申请实例中,为了解决计算负担太重,会影响超声波的播放和录制,进而影响手势识别的准确度,本申请实例中可以采用高效的编程语言,比如计算逻辑用c语言来实现,例如采用指针运算,优化处理器的消耗高的逻辑操作,尽量减少内存拷贝、输入输出操作,减少计算时间。因此,本申请实例中一维的手势识别计算的耗时可以减少到ms级别。It should be noted that, in the example of the present application, in order to solve the computational burden too much, the playback and recording of the ultrasonic wave may be affected, thereby affecting the accuracy of the gesture recognition. In the example of the present application, an efficient programming language may be used, such as calculation logic. Language implementation, such as the use of pointer operations, optimize the processor's high-cost logic operations, minimize memory copy, input and output operations, and reduce computation time. Therefore, the time consumption of one-dimensional gesture recognition calculation in the example of the present application can be reduced to the ms level.
在手势识别场景中,手势识别的准确性和最快速的响应时间是最重要的,本申请实例中可以防止用户的误操作,手势识别模式的开启和结 束可以采用二维手势动作的识别方式。为了更快更精确的识别一个动作,比如靠近、远离、向左、向右,本申请实例提出一维手势的识别方式,其中,基于超声波相位的方式可以达到毫秒(mm)级别的精度。在手机等终端的硬件受限的情况下,本申请实例可以尽可能的提高计算性能,节省计算的时间。In the gesture recognition scene, the accuracy of the gesture recognition and the fastest response time are the most important. In the example of the present application, the user's misoperation can be prevented, and the opening and ending of the gesture recognition mode can be recognized by the two-dimensional gesture. In order to recognize an action more quickly and accurately, such as approaching, moving away, leftward, and rightward, the present application example proposes a one-dimensional gesture recognition manner, wherein the ultrasonic phase-based approach can achieve millisecond (mm) level accuracy. In the case where the hardware of the terminal such as the mobile phone is limited, the example of the present application can improve the calculation performance as much as possible and save the calculation time.
需要说明的是,对于前述的各方法实例,为了简单描述,故将其都表述为一系列的动作组合,但是本领域技术人员应该知悉,本申请并不受所描述的动作顺序的限制,因为依据本申请,某些步骤可以采用其他顺序或者同时进行。其次,本领域技术人员也应该知悉,说明书中所描述的实例均属于优选实例,所涉及的动作和模块并不一定是本申请所必须的。It should be noted that, for the foregoing method examples, for the sake of simple description, they are all expressed as a series of action combinations, but those skilled in the art should understand that the present application is not limited by the described action sequence, because In accordance with the present application, certain steps may be performed in other sequences or concurrently. Secondly, those skilled in the art should also understand that the examples described in the specification are all preferred examples, and the actions and modules involved are not necessarily required by the present application.
为便于更好的实施本申请实例的上述方案,下面还提供用于实施上述方案的相关装置。In order to facilitate the better implementation of the above described solution of the examples of the present application, related devices for implementing the above solutions are also provided below.
请参阅图9-a所示,本申请实例提供的一种终端900,所述终端中配置有扬声器和麦克风,所述终端上安装有应用程序,所述终端可以包括:模式确定模块901、超声波发送模块902、超声波采集模块903、手势识别模块904和指令执行模块905,其中,Referring to FIG. 9-a, a terminal 900 is provided. The terminal is configured with a speaker and a microphone. The terminal is installed with an application program, and the terminal may include: a mode determining module 901, and an ultrasound. a sending module 902, an ultrasonic acquiring module 903, a gesture recognition module 904, and an instruction execution module 905, wherein
模式确定模块901,用于根据所述应用程序的输入控制指示确定所述终端是否开启手势识别模式;The mode determining module 901 is configured to determine, according to an input control indication of the application, whether the terminal turns on the gesture recognition mode;
超声波发送模块902,用于当所述终端确定开启手势识别模式时,通过所述扬声器播放第一超声波;The ultrasonic sending module 902 is configured to: when the terminal determines to enable the gesture recognition mode, play the first ultrasonic wave through the speaker;
超声波采集模块903,用于通过所述麦克风接收第二超声波,所述第二超声波为对所述第一超声波进行采集后得到的波形数据;The ultrasonic acquisition module 903 is configured to receive a second ultrasonic wave by using the microphone, where the second ultrasonic wave is waveform data obtained after collecting the first ultrasonic wave;
手势识别模块904,用于根据接收到的所述第二超声波进行手势识别,得到手势识别结果;a gesture recognition module 904, configured to perform gesture recognition according to the received second ultrasonic wave, to obtain a gesture recognition result;
指令执行模块905,用于根据所述手势识别结果在所述应用程序执 行相应的手势指令。The instruction execution module 905 is configured to execute a corresponding gesture instruction in the application according to the gesture recognition result.
在本申请的一些实例中,所述超声波发送模块902,具体用于通过所述扬声器播放N个频率的第一超声波,所述N为大于或等于1的正整数;In some examples of the present application, the ultrasonic transmitting module 902 is specifically configured to play a first ultrasonic wave of N frequencies through the speaker, where N is a positive integer greater than or equal to 1;
所述超声波采集模块,用于通过所述麦克风从所述终端的周围环境中采集到N个频率的第二超声波。The ultrasonic acquisition module is configured to collect, by the microphone, a second ultrasonic wave of N frequencies from a surrounding environment of the terminal.
进一步的,在本申请的一些实例中,所述手势识别模块904,具体用于当所述N的取值为1时,根据所述第二超声波的波形能量进行一维度的手势识别,得到第一手势识别结果,所述第一手势识别结果包括:手势靠近所述终端,或者远离所述终端。Further, in some examples of the present application, the gesture recognition module 904 is specifically configured to perform one-dimensional gesture recognition according to the waveform energy of the second ultrasonic wave when the value of the N is 1, to obtain a first A gesture recognition result, the first gesture recognition result includes: the gesture is close to the terminal, or is away from the terminal.
在本申请的一些实例中,请参阅图9-b所示,模式确定模块901,包括:In some examples of the present application, referring to FIG. 9-b, the mode determining module 901 includes:
控制指示解析模块9011,用于根据所述应用程序的配置文件确定所述应用程序的输入控制指示是否为默认开启手势识别模式;The control indication parsing module 9011 is configured to determine, according to the configuration file of the application, whether the input control indication of the application is a default open gesture recognition mode;
程序检测模块9012,用于检测所述应用程序在所述终端上是否运行成功;The program detection module 9012 is configured to detect whether the application runs successfully on the terminal;
模式开启模块9013,用于在所述应用程序运行成功时,若所述终端默认开启手势识别模式,确定所述终端开启了手势识别模式。The mode-opening module 9013 is configured to determine, when the application runs successfully, that the terminal turns on the gesture recognition mode by default, and determines that the terminal has enabled the gesture recognition mode.
在本申请的一些实例中,模式确定模块901,具体用于根据所述应用程序的配置文件确定所述应用程序的输入控制指示为第一二维手势开启手势识别模式,或者第二二维手势关闭手势识别模式;当检测到第一二维手势时,确定所述终端开启了手势识别模式;当检测到第二二维手势时,确定所述终端关闭了手势识别模式。In some examples of the present application, the mode determining module 901 is specifically configured to determine, according to the configuration file of the application, that the input control indication of the application is a first two-dimensional gesture open gesture recognition mode, or a second two-dimensional gesture Turning off the gesture recognition mode; when detecting the first two-dimensional gesture, determining that the terminal has turned on the gesture recognition mode; and when detecting the second two-dimensional gesture, determining that the terminal turns off the gesture recognition mode.
在本申请的一些实例中,请参阅图9-c所示,所述终端900,还包括:模板库建立模块906和手势动作匹配模块907,其中,In some examples of the present application, as shown in FIG. 9-c, the terminal 900 further includes: a template library establishing module 906 and a gesture action matching module 907, where
超声波采集模块903,用于当所述扬声器播放训练超声波时,通过 所述麦克风录制指定手势动作的波形数据;The ultrasonic acquisition module 903 is configured to record waveform data of the specified gesture action through the microphone when the speaker plays the training ultrasonic wave;
模板库建立模块906,用于根据所述指定手势动作的波形数据建立手势波形模板库;a template library establishing module 906, configured to establish a gesture waveform template library according to the waveform data of the specified gesture action;
手势动作匹配模块907,用于根据所述手势波形模板库确定所述指定手势动作对应的手势识别结果。The gesture action matching module 907 is configured to determine a gesture recognition result corresponding to the specified gesture action according to the gesture waveform template library.
在本申请的一些实例中,请参阅图9-d所示,所述终端900,还包括:In some examples of the present application, as shown in FIG. 9-d, the terminal 900 further includes:
显示模块908,用于所述手势识别模块904根据接收到的所述第二超声波进行手势识别,得到手势识别结果之后,通过所述应用程序的显示界面显示手势识别成功的提示信息。The display module 908 is configured to perform gesture recognition according to the received second ultrasonic wave to obtain gesture recognition result, and then display prompt information for successful gesture recognition through the display interface of the application.
在本申请的一些实例中,所述手势识别模块904,包括:In some examples of the present application, the gesture recognition module 904 includes:
距离计算模块,用于当所述N的取值大于或等于2时,对N个所述第二超声波分别计算出手势的移动距离;a distance calculating module, configured to calculate a moving distance of the gesture for each of the N second ultrasonic waves when the value of the N is greater than or equal to 2;
一维识别模块,用于从N个所述手势的移动距离中排除出异常的移动距离,对于保留的移动距离进行一维度的手势识别,得到第一手势识别结果,所述第一手势识别结果包括:手势靠近所述终端,或者远离所述终端,或者相对于所述终端向左移动,或者相对于所述终端向右移动,或者相对于所述终端向上移动,或者相对于所述终端向下移动。a one-dimensional identification module, configured to exclude an abnormal moving distance from the moving distances of the N gestures, perform one-dimensional gesture recognition on the retained moving distance, and obtain a first gesture recognition result, where the first gesture recognition result is The method includes: the gesture is close to the terminal, or is away from the terminal, or moves to the left relative to the terminal, or moves to the right relative to the terminal, or moves upward relative to the terminal, or is opposite to the terminal Move down.
在本申请的一些实例中,所述超声波采集模块903,具体用于通过所述终端的两个麦克风分别采集到第二超声波;In some examples of the present application, the ultrasonic acquisition module 903 is specifically configured to separately collect a second ultrasonic wave through two microphones of the terminal;
所述手势识别模块904,包括:The gesture recognition module 904 includes:
位置计算模块,用于根据所述两个麦克风分别接收到的第二超声波计算手势的相对位置和初始位置;a position calculation module, configured to calculate a relative position and an initial position of the gesture according to the second ultrasonic waves respectively received by the two microphones;
二维识别模块,用于根据计算出的相对位置和初始位置进行二维度的手势识别,得到第二手势识别结果,所述第二手势识别结果包括:手势变化的二维坐标。The two-dimensional recognition module is configured to perform two-dimensional gesture recognition according to the calculated relative position and the initial position to obtain a second gesture recognition result, where the second gesture recognition result includes: two-dimensional coordinates of the gesture change.
在本申请的一些实例中,所述超声波采集模块903,包括:In some examples of the present application, the ultrasonic acquisition module 903 includes:
声波录制模块,用于按照预设的录制参数控制所述麦克风录制所述终端的周围环境中的声波信号;a sound wave recording module, configured to control the microphone to record an acoustic signal in a surrounding environment of the terminal according to preset recording parameters;
信号存储模块,用于将录制到的所述声波信号存储到所述终端的录制缓冲区中,其中,所述录制缓冲区中存储的声波信号为所述第二超声波。And a signal storage module, configured to store the recorded acoustic wave signal into a recording buffer of the terminal, wherein the sound wave signal stored in the recording buffer is the second ultrasonic wave.
通过以上对本申请实例的描述可知,终端内设置有扬声器和麦克风,且该终端上安装有应用程序。首先根据应用程序的输入控制指示确定终端是否开启手势识别模式,在该终端确定开启手势识别模式时,通过扬声器播放第一超声波,然后通过麦克风接收第二超声波,第二超声波为对第一超声波进行采集后得到的波形数据,接下来根据接收到的第二超声波进行手势识别,得到手势识别结果,最后根据该手势识别结果在应用程序执行相应的手势指令。本申请实例中,使用终端内置的扬声器和麦克风就可以实现手势识别,即通过超声波的发送与检测就可以识别出用户的动作,降低了手势识别的复杂度,降低了对终端的计算性能的要求,并且实现了对用户在应用程序上的手势指令控制。进一步的,通过对第二超声波进行计算,得出手势的移动距离或手势的二维变化,提高了手势识别的精确率。As can be seen from the above description of the examples of the present application, a speaker and a microphone are disposed in the terminal, and an application is installed on the terminal. Firstly, according to the input control indication of the application, determining whether the terminal turns on the gesture recognition mode, when the terminal determines to turn on the gesture recognition mode, playing the first ultrasonic wave through the speaker, and then receiving the second ultrasonic wave through the microphone, and the second ultrasonic wave is for the first ultrasonic wave The waveform data obtained after the acquisition is then subjected to gesture recognition according to the received second ultrasonic wave to obtain a gesture recognition result, and finally a corresponding gesture instruction is executed in the application according to the gesture recognition result. In the example of the present application, the gesture recognition can be realized by using the built-in speaker and microphone of the terminal, that is, the motion of the user can be recognized through the transmission and detection of the ultrasonic wave, the complexity of the gesture recognition is reduced, and the calculation performance requirement of the terminal is reduced. And implements gesture control of the user on the application. Further, by calculating the second ultrasonic wave, the moving distance of the gesture or the two-dimensional change of the gesture is obtained, and the accuracy of the gesture recognition is improved.
本申请实例还提供了一种终端,如图10所示,为了便于说明,仅示出了与本申请实例相关的部分,具体技术细节未揭示的,请参照本申请实例方法部分。该终端可以为包括手机、平板电脑、PDA(Personal Digital Assistant,个人数字助理)、POS(Point of Sales,销售终端)、车载电脑等任意终端设备,以终端为手机为例:The present application also provides a terminal, as shown in FIG. 10, for the convenience of description, only the parts related to the examples of the present application are shown. For the specific technical details not disclosed, please refer to the example method part of the present application. The terminal may be any terminal device including a mobile phone, a tablet computer, a PDA (Personal Digital Assistant), a POS (Point of Sales), an in-vehicle computer, and the terminal is a mobile phone as an example:
图10示出的是与本申请实例提供的终端相关的手机的部分结构的框图。参考图10,手机包括:射频(Radio Frequency,RF)电路1010、存储器1020、输入单元1030、显示单元1040、传感器1050、音频电路 1060、无线保真(wireless fidelity,WiFi)模块1070、处理器1080、以及电源1090等部件。本领域技术人员可以理解,图10中示出的手机结构并不构成对手机的限定,可以包括比图示更多或更少的部件,或者组合某些部件,或者不同的部件布置。FIG. 10 is a block diagram showing a partial structure of a mobile phone associated with a terminal provided by an example of the present application. Referring to FIG. 10, the mobile phone includes: a radio frequency (RF) circuit 1010, a memory 1020, an input unit 1030, a display unit 1040, a sensor 1050, an audio circuit 1060, a wireless fidelity (WiFi) module 1070, and a processor 1080. And power supply 1090 and other components. It will be understood by those skilled in the art that the structure of the handset shown in FIG. 10 does not constitute a limitation to the handset, and may include more or less components than those illustrated, or some components may be combined, or different component arrangements.
下面结合图10对手机的各个构成部件进行具体的介绍:The following describes the components of the mobile phone in detail with reference to FIG. 10:
RF电路1010可用于收发信息或通话过程中,信号的接收和发送,特别地,将基站的下行信息接收后,给处理器1080处理;另外,将设计上行的数据发送给基站。通常,RF电路1010包括但不限于天线、至少一个放大器、收发信机、耦合器、低噪声放大器(Low Noise Amplifier,LNA)、双工器等。此外,RF电路1010还可以通过无线通信与网络和其他设备通信。上述无线通信可以使用任一通信标准或协议,包括但不限于全球移动通讯系统(Global System of Mobile communication,GSM)、通用分组无线服务(General Packet Radio Service,GPRS)、码分多址(Code Division Multiple Access,CDMA)、宽带码分多址(Wideband Code Division Multiple Access,WCDMA)、长期演进(Long Term Evolution,LTE)、电子邮件、短消息服务(Short Messaging Service,SMS)等。The RF circuit 1010 can be used for receiving and transmitting signals during the transmission or reception of information or during a call. In particular, after receiving the downlink information of the base station, it is processed by the processor 1080. In addition, the uplink data is designed to be sent to the base station. Generally, RF circuit 1010 includes, but is not limited to, an antenna, at least one amplifier, a transceiver, a coupler, a Low Noise Amplifier (LNA), a duplexer, and the like. In addition, RF circuit 1010 can also communicate with the network and other devices via wireless communication. The above wireless communication may use any communication standard or protocol, including but not limited to Global System of Mobile communication (GSM), General Packet Radio Service (GPRS), Code Division Multiple Access (Code Division). Multiple Access (CDMA), Wideband Code Division Multiple Access (WCDMA), Long Term Evolution (LTE), E-mail, Short Messaging Service (SMS), and the like.
存储器1020可用于存储软件程序以及模块,处理器1080通过运行存储在存储器1020的软件程序以及模块,从而执行手机的各种功能应用以及数据处理。存储器1020可主要包括存储程序区和存储数据区,其中,存储程序区可存储操作系统、至少一个功能所需的应用程序(比如声音播放功能、图像播放功能等)等;存储数据区可存储根据手机的使用所创建的数据(比如音频数据、电话本等)等。此外,存储器1020可以包括高速随机存取存储器,还可以包括非易失性存储器,例如至少一个磁盘存储器件、闪存器件、或其他易失性固态存储器件。The memory 1020 can be used to store software programs and modules, and the processor 1080 executes various functional applications and data processing of the mobile phone by running software programs and modules stored in the memory 1020. The memory 1020 may mainly include a storage program area and a storage data area, wherein the storage program area may store an operating system, an application required for at least one function (such as a sound playing function, an image playing function, etc.), and the like; the storage data area may be stored according to Data created by the use of the mobile phone (such as audio data, phone book, etc.). Moreover, memory 1020 can include high speed random access memory, and can also include non-volatile memory, such as at least one magnetic disk storage device, flash memory device, or other volatile solid state storage device.
输入单元1030可用于接收输入的数字或字符信息,以及产生与手 机的用户设置以及功能控制有关的键信号输入。具体地,输入单元1030可包括触控面板1031以及其他输入设备1032。触控面板1031,也称为触摸屏,可收集用户在其上或附近的触摸操作(比如用户使用手指、触笔等任何适合的物体或附件在触控面板1031上或在触控面板1031附近的操作),并根据预先设定的程式驱动相应的连接装置。在一些实例中,触控面板1031可包括触摸检测装置和触摸控制器两个部分。其中,触摸检测装置检测用户的触摸方位,并检测触摸操作带来的信号,将信号传送给触摸控制器;触摸控制器从触摸检测装置上接收触摸信息,并将它转换成触点坐标,再送给处理器1080,并能接收处理器1080发来的命令并加以执行。此外,可以采用电阻式、电容式、红外线以及表面声波等多种类型实现触控面板1031。除了触控面板1031,输入单元1030还可以包括其他输入设备1032。具体地,其他输入设备1032可以包括但不限于物理键盘、功能键(比如音量控制按键、开关按键等)、轨迹球、鼠标、操作杆等中的一种或多种。 Input unit 1030 can be used to receive input numeric or character information and to generate key signal inputs related to user settings and function control of the handset. Specifically, the input unit 1030 may include a touch panel 1031 and other input devices 1032. The touch panel 1031, also referred to as a touch screen, can collect touch operations on or near the user (such as the user using a finger, a stylus, or the like on the touch panel 1031 or near the touch panel 1031. Operation), and drive the corresponding connecting device according to a preset program. In some examples, the touch panel 1031 can include two portions of a touch detection device and a touch controller. Wherein, the touch detection device detects the touch orientation of the user, and detects a signal brought by the touch operation, and transmits the signal to the touch controller; the touch controller receives the touch information from the touch detection device, converts the touch information into contact coordinates, and sends the touch information. The processor 1080 is provided and can receive commands from the processor 1080 and execute them. In addition, the touch panel 1031 can be implemented in various types such as resistive, capacitive, infrared, and surface acoustic waves. In addition to the touch panel 1031, the input unit 1030 may also include other input devices 1032. Specifically, other input devices 1032 may include, but are not limited to, one or more of a physical keyboard, function keys (such as volume control buttons, switch buttons, etc.), trackballs, mice, joysticks, and the like.
显示单元1040可用于显示由用户输入的信息或提供给用户的信息以及手机的各种菜单。显示单元1040可包括显示面板1041,在一些实例中,可以采用液晶显示器(Liquid Crystal Display,LCD)、有机发光二极管(Organic Light-Emitting Diode,OLED)等形式来配置显示面板1041。进一步的,触控面板1031可覆盖显示面板1041,当触控面板1031检测到在其上或附近的触摸操作后,传送给处理器1080以确定触摸事件的类型,随后处理器1080根据触摸事件的类型在显示面板1041上提供相应的视觉输出。虽然在图10中,触控面板1031与显示面板1041是作为两个独立的部件来实现手机的输入和输入功能,但是在某些实例中,可以将触控面板1031与显示面板1041集成而实现手机的输入和输出功能。The display unit 1040 can be used to display information input by the user or information provided to the user as well as various menus of the mobile phone. The display unit 1040 may include a display panel 1041. In some examples, the display panel 1041 may be configured in the form of a liquid crystal display (LCD), an organic light-emitting diode (OLED), or the like. Further, the touch panel 1031 may cover the display panel 1041, and when the touch panel 1031 detects a touch operation thereon or nearby, the touch panel 1031 transmits to the processor 1080 to determine the type of the touch event, and then the processor 1080 according to the touch event. The type provides a corresponding visual output on display panel 1041. Although in FIG. 10, the touch panel 1031 and the display panel 1041 are two independent components to implement the input and input functions of the mobile phone, in some examples, the touch panel 1031 and the display panel 1041 may be integrated. The input and output functions of the phone.
手机还可包括至少一种传感器1050,比如光传感器、运动传感器以 及其他传感器。具体地,光传感器可包括环境光传感器及接近传感器,其中,环境光传感器可根据环境光线的明暗来调节显示面板1041的亮度,接近传感器可在手机移动到耳边时,关闭显示面板1041和/或背光。作为运动传感器的一种,加速计传感器可检测各个方向上(一般为三轴)加速度的大小,静止时可检测出重力的大小及方向,可用于识别手机姿态的应用(比如横竖屏切换、相关游戏、磁力计姿态校准)、振动识别相关功能(比如计步器、敲击)等;至于手机还可配置的陀螺仪、气压计、湿度计、温度计、红外线传感器等其他传感器,在此不再赘述。The handset can also include at least one type of sensor 1050, such as a light sensor, motion sensor, and other sensors. Specifically, the light sensor may include an ambient light sensor and a proximity sensor, wherein the ambient light sensor may adjust the brightness of the display panel 1041 according to the brightness of the ambient light, and the proximity sensor may close the display panel 1041 and/or when the mobile phone moves to the ear. Or backlight. As a kind of motion sensor, the accelerometer sensor can detect the magnitude of acceleration in all directions (usually three axes). When it is stationary, it can detect the magnitude and direction of gravity. It can be used to identify the gesture of the mobile phone (such as horizontal and vertical screen switching, related Game, magnetometer attitude calibration), vibration recognition related functions (such as pedometer, tapping), etc.; as for the mobile phone can also be configured with gyroscopes, barometers, hygrometers, thermometers, infrared sensors and other sensors, no longer Narration.
音频电路1060、扬声器1061,传声器1062、麦克风1063可提供用户与手机之间的音频接口。音频电路1060可将接收到的音频数据转换后的电信号,传输到扬声器1061,由扬声器1061转换为声音信号输出;另一方面,传声器1062将收集的声音信号转换为电信号,由音频电路1060接收后转换为音频数据,再将音频数据输出处理器1080处理后,经RF电路1010以发送给比如另一手机,或者将音频数据输出至存储器1020以便进一步处理。扬声器1061可以生成超声波信号,然后麦克风1063可以从手机周围采集超声波信号。 Audio circuit 1060, speaker 1061, microphone 1062, microphone 1063 can provide an audio interface between the user and the handset. The audio circuit 1060 can transmit the converted electrical data of the received audio data to the speaker 1061, and convert it into a sound signal output by the speaker 1061; on the other hand, the microphone 1062 converts the collected sound signal into an electrical signal, by the audio circuit 1060. After receiving, it is converted into audio data, and then processed by the audio data output processor 1080, sent to the other mobile phone via the RF circuit 1010, or outputted to the memory 1020 for further processing. The speaker 1061 can generate an ultrasonic signal, and then the microphone 1063 can acquire an ultrasonic signal from around the mobile phone.
WiFi属于短距离无线传输技术,手机通过WiFi模块1070可以帮助用户收发电子邮件、浏览网页和访问流式媒体等,它为用户提供了无线的宽带互联网访问。虽然图10示出了WiFi模块1070,但是可以理解的是,其并不属于手机的必须构成,完全可以根据需要在不改变申请的本质的范围内而省略。WiFi is a short-range wireless transmission technology. The mobile phone through the WiFi module 1070 can help users to send and receive e-mail, browse the web and access streaming media, etc. It provides users with wireless broadband Internet access. Although FIG. 10 shows the WiFi module 1070, it can be understood that it does not belong to the essential configuration of the mobile phone, and can be omitted as needed within the scope of not changing the essence of the application.
处理器1080是手机的控制中心,利用各种接口和线路连接整个手机的各个部分,通过运行或执行存储在存储器1020内的软件程序和/或模块,以及调用存储在存储器1020内的数据,执行手机的各种功能和处理数据,从而对手机进行整体监控。在一些实例中,处理器1080可包括一个或多个处理单元;优选的,处理器1080可集成应用处理器和 调制解调处理器,其中,应用处理器主要处理操作系统、用户界面和应用程序等,调制解调处理器主要处理无线通信。可以理解的是,上述调制解调处理器也可以不集成到处理器1080中。The processor 1080 is the control center of the handset, which connects various portions of the entire handset using various interfaces and lines, by executing or executing software programs and/or modules stored in the memory 1020, and invoking data stored in the memory 1020, The phone's various functions and processing data, so that the overall monitoring of the phone. In some examples, processor 1080 can include one or more processing units; preferably, processor 1080 can integrate an application processor and a modem processor, wherein the application processor primarily processes an operating system, a user interface, and an application Etc. The modem processor primarily handles wireless communications. It will be appreciated that the above described modem processor may also not be integrated into the processor 1080.
手机还包括给各个部件供电的电源1090(比如电池),优选的,电源可以通过电源管理系统与处理器1080逻辑相连,从而通过电源管理系统实现管理充电、放电、以及功耗管理等功能。The mobile phone also includes a power source 1090 (such as a battery) that supplies power to various components. Preferably, the power source can be logically coupled to the processor 1080 through a power management system to manage functions such as charging, discharging, and power management through the power management system.
尽管未示出,手机还可以包括摄像头、蓝牙模块等,在此不再赘述。Although not shown, the mobile phone may further include a camera, a Bluetooth module, and the like, and details are not described herein again.
在本申请实例中,该终端所包括的处理器1080还具有控制执行以上由终端执行的手势识别方法流程。例如,处理器1080对超声波的识别过程可以参阅前述实例中的描述。In the example of the present application, the processor 1080 included in the terminal further has a flow of controlling a gesture recognition method performed by the terminal. For example, the process of identifying the ultrasonic waves by the processor 1080 can be referred to the description in the foregoing examples.
接下来介绍本申请实例提供的另一种终端,请参阅图11所示,终端1100包括:Next, another terminal provided by the example of the present application is introduced. Referring to FIG. 11, the terminal 1100 includes:
扬声器1101、麦克风1102、处理器1103和存储器1104(其中终端1100中的处理器1103的数量可以一个或多个,图11中以一个处理器为例)。在本申请的一些实例中,扬声器1101、麦克风1102、处理器1103和存储器1104可通过总线或其它方式连接,其中,图11中以通过总线连接为例。The speaker 1101, the microphone 1102, the processor 1103, and the memory 1104 (wherein the number of the processors 1103 in the terminal 1100 may be one or more, and one processor in FIG. 11 is taken as an example). In some examples of the present application, the speaker 1101, the microphone 1102, the processor 1103, and the memory 1104 may be connected by a bus or other means, wherein FIG. 11 is exemplified by a bus connection.
存储器1104可以包括只读存储器和随机存取存储器,并向处理器1103提供指令和数据。存储器1104的一部分还可以包括非易失性随机存取存储器(英文全称:Non-Volatile Random Access Memory,英文缩写:NVRAM)。存储器1104存储有操作系统和操作指令、可执行模块或者数据结构,或者它们的子集,或者它们的扩展集,其中,操作指令可包括各种操作指令,用于实现各种操作。操作系统可包括各种系统程序,用于实现各种基础业务以及处理基于硬件的任务。 Memory 1104 can include read only memory and random access memory and provides instructions and data to processor 1103. A portion of the memory 1104 may also include a non-volatile random access memory (English name: Non-Volatile Random Access Memory, English abbreviation: NVRAM). The memory 1104 stores operating systems and operational instructions, executable modules or data structures, or a subset thereof, or an extended set thereof, wherein the operational instructions can include various operational instructions for implementing various operations. The operating system can include a variety of system programs for implementing various basic services and handling hardware-based tasks.
处理器1103控制终端的操作,处理器1103还可以称为中央处理单元(英文全称:Central Processing Unit,英文简称:CPU)。具体的应用 中,终端的各个组件通过总线系统耦合在一起,其中总线系统除包括数据总线之外,还可以包括电源总线、控制总线和状态信号总线等。但是为了清楚说明起见,在图中将各种总线都称为总线系统。The processor 1103 controls the operation of the terminal, and the processor 1103 can also be referred to as a central processing unit (English full name: Central Processing Unit, English abbreviation: CPU). In a specific application, the components of the terminal are coupled together by a bus system. The bus system may include a power bus, a control bus, and a status signal bus in addition to the data bus. However, for the sake of clarity, the various buses are referred to as bus systems in the figures.
上述本申请实例揭示的方法可以应用于处理器1103中,或者由处理器1103实现。处理器1103可以是一种集成电路芯片,具有信号的处理能力。在实现过程中,上述方法的各步骤可以通过处理器1103中的硬件的集成逻辑电路或者软件形式的指令完成。上述的处理器1103可以是通用处理器、数字信号处理器(英文全称:digital signal processing,英文缩写:DSP)、专用集成电路(英文全称:Application Specific Integrated Circuit,英文缩写:ASIC)、现场可编程门阵列(英文全称:Field-Programmable Gate Array,英文缩写:FPGA)或者其他可编程逻辑器件、分立门或者晶体管逻辑器件、分立硬件组件。可以实现或者执行本申请实例中的公开的各方法、步骤及逻辑框图。通用处理器可以是微处理器或者该处理器也可以是任何常规的处理器等。结合本申请实例所公开的方法的步骤可以直接体现为硬件译码处理器执行完成,或者用译码处理器中的硬件及软件模块组合执行完成。软件模块可以位于随机存储器,闪存、只读存储器,可编程只读存储器或者电可擦写可编程存储器、寄存器等本领域成熟的存储介质中。该存储介质位于存储器1104,处理器1103读取存储器1104中的信息,结合其硬件完成上述方法的步骤。The method disclosed in the above examples of the present application may be applied to the processor 1103 or implemented by the processor 1103. The processor 1103 can be an integrated circuit chip with signal processing capabilities. In the implementation process, each step of the above method may be completed by an integrated logic circuit of hardware in the processor 1103 or an instruction in a form of software. The processor 1103 may be a general-purpose processor, a digital signal processor (English full name: digital signal processing, English abbreviation: DSP), an application specific integrated circuit (English name: Application Specific Integrated Circuit, English abbreviation: ASIC), field programmable Gate array (English name: Field-Programmable Gate Array, English abbreviation: FPGA) or other programmable logic devices, discrete gate or transistor logic devices, discrete hardware components. The methods, steps, and logical block diagrams disclosed in the examples of the present application can be implemented or executed. The general purpose processor may be a microprocessor or the processor or any conventional processor or the like. The steps of the method disclosed in connection with the examples of the present application may be directly embodied by the execution of the hardware decoding processor or by a combination of hardware and software modules in the decoding processor. The software module can be located in a conventional storage medium such as random access memory, flash memory, read only memory, programmable read only memory or electrically erasable programmable memory, registers, and the like. The storage medium is located in the memory 1104, and the processor 1103 reads the information in the memory 1104 and performs the steps of the above method in combination with its hardware.
所述扬声器1101,用于在所述处理器的控制下播放第一超声波;The speaker 1101 is configured to play a first ultrasonic wave under the control of the processor;
所述麦克风1102,用于在所述处理器的控制下接收第二超声波;The microphone 1102 is configured to receive a second ultrasonic wave under the control of the processor;
本申请实例中,处理器1103,用于执行所述存储器中的所述指令,执行如前述实例中的方法。In the example of the present application, the processor 1103 is configured to execute the instructions in the memory, and perform the method in the foregoing example.
另外需说明的是,以上所描述的装置实例仅仅是示意性的,其中所述作为分离部件说明的单元可以是或者也可以不是物理上分开的,作为 单元显示的部件可以是或者也可以不是物理单元,即可以位于一个地方,或者也可以分布到多个网络单元上。可以根据实际的需要选择其中的部分或者全部模块来实现本实例方案的目的。另外,本申请提供的装置实例附图中,模块之间的连接关系表示它们之间具有通信连接,具体可以实现为一条或多条通信总线或信号线。本领域普通技术人员在不付出创造性劳动的情况下,即可以理解并实施。It should be further noted that the device examples described above are merely illustrative, wherein the units described as separate components may or may not be physically separated, and the components displayed as units may or may not be physical. Units can be located in one place or distributed to multiple network units. Some or all of the modules may be selected according to actual needs to achieve the purpose of the solution of the present example. In addition, in the example of the device provided by the present application, the connection relationship between the modules indicates that there is a communication connection between them, and specifically may be implemented as one or more communication buses or signal lines. Those of ordinary skill in the art can understand and implement without any creative effort.
通过以上的实施方式的描述,所属领域的技术人员可以清楚地了解到本申请可借助软件加必需的通用硬件的方式来实现,当然也可以通过专用硬件包括专用集成电路、专用CPU、专用存储器、专用元器件等来实现。一般情况下,凡由计算机程序完成的功能都可以很容易地用相应的硬件来实现,而且,用来实现同一功能的具体硬件结构也可以是多种多样的,例如模拟电路、数字电路或专用电路等。但是,对本申请而言更多情况下软件程序实现是更佳的实施方式。基于这样的理解,本申请的技术方案本质上或者说对现有技术做出贡献的部分可以以软件产品的形式体现出来,该计算机软件产品存储在可读取的存储介质中,如计算机的软盘、U盘、移动硬盘、只读存储器(ROM,Read-Only Memory)、随机存取存储器(RAM,Random Access Memory)、磁碟或者光盘等,包括若干指令用以使得一台计算机设备(可以是个人计算机,服务器,或者网络设备等)执行本申请各个实例所述的方法。Through the description of the above embodiments, those skilled in the art can clearly understand that the present application can be implemented by means of software plus necessary general hardware, and of course, dedicated hardware, dedicated CPU, dedicated memory, dedicated memory, Special components and so on. In general, functions performed by computer programs can be easily implemented with the corresponding hardware, and the specific hardware structure used to implement the same function can be various, such as analog circuits, digital circuits, or dedicated circuits. Circuits, etc. However, software program implementation is a better implementation for more applications in this application. Based on such understanding, the technical solution of the present application, which is essential or contributes to the prior art, may be embodied in the form of a software product stored in a readable storage medium, such as a floppy disk of a computer. , U disk, mobile hard disk, read-only memory (ROM, Read-Only Memory), random access memory (RAM, Random Access Memory), disk or optical disk, etc., including a number of instructions to make a computer device (may be A personal computer, server, or network device, etc.) performs the methods described in various examples of the present application.
综上所述,以上实例仅用以说明本申请的技术方案,而非对其限制;尽管参照上述实例对本申请进行了详细的说明,本领域的普通技术人员应当理解:其依然可以对上述各实例所记载的技术方案进行修改,或者对其中部分技术特征进行等同替换;而这些修改或者替换,并不使相应技术方案的本质脱离本申请各实例技术方案的精神和范围。In summary, the above examples are only used to illustrate the technical solutions of the present application, and are not limited thereto; although the present application is described in detail with reference to the above examples, those skilled in the art should understand that they can still The technical solutions described in the examples are modified, or the equivalents of some of the technical features are replaced. The modifications and substitutions do not detract from the spirit and scope of the technical solutions of the examples of the present application.

Claims (10)

  1. 一种手势识别方法,所述方法应用于终端,所述终端中配置有扬声器和麦克风,所述终端上安装有应用程序,所述方法包括:A gesture recognition method is applied to a terminal, wherein the terminal is configured with a speaker and a microphone, and an application is installed on the terminal, and the method includes:
    根据所述应用程序的输入控制指示确定所述终端是否开启手势识别模式;Determining whether the terminal turns on the gesture recognition mode according to an input control indication of the application;
    当所述终端确定开启手势识别模式时,通过所述扬声器播放第一超声波;When the terminal determines to turn on the gesture recognition mode, playing the first ultrasonic wave through the speaker;
    通过所述麦克风接收第二超声波,所述第二超声波为对所述第一超声波进行采集后得到的波形数据;Receiving, by the microphone, a second ultrasonic wave, wherein the second ultrasonic wave is waveform data obtained after collecting the first ultrasonic wave;
    根据接收到的所述第二超声波进行手势识别,得到手势识别结果;Performing gesture recognition according to the received second ultrasonic wave to obtain a gesture recognition result;
    根据所述手势识别结果在所述应用程序执行相应的手势指令。Corresponding gesture instructions are executed in the application according to the gesture recognition result.
  2. 根据权利要求1所述的方法,其中,所述根据所述应用程序的输入控制指示确定所述终端是否开启手势识别模式,包括:The method according to claim 1, wherein the determining, according to an input control indication of the application, whether the terminal turns on a gesture recognition mode comprises:
    根据所述应用程序的配置文件确定所述应用程序的输入控制指示是否为默认开启手势识别模式;Determining, according to a configuration file of the application, whether an input control indication of the application is a default open gesture recognition mode;
    检测所述应用程序在所述终端上是否运行成功;Detecting whether the application runs successfully on the terminal;
    在所述应用程序运行成功时,若所述终端默认开启手势识别模式,确定所述终端开启了手势识别模式。When the application runs successfully, if the terminal turns on the gesture recognition mode by default, it is determined that the terminal has turned on the gesture recognition mode.
  3. 根据权利要求1所述的方法,其中,所述根据所述应用程序的输入控制指示确定所述终端是否开启手势识别模式,包括:The method according to claim 1, wherein the determining, according to an input control indication of the application, whether the terminal turns on a gesture recognition mode comprises:
    根据所述应用程序的配置文件确定所述应用程序的输入控制指示为第一二维手势开启手势识别模式,或者第二二维手势关闭手势识别模式;Determining, according to a configuration file of the application, an input control indication of the application to turn on a gesture recognition mode for a first two-dimensional gesture, or a second two-dimensional gesture to turn off a gesture recognition mode;
    当检测到第一二维手势时,确定所述终端开启了手势识别模式;When the first two-dimensional gesture is detected, determining that the terminal turns on the gesture recognition mode;
    当检测到第二二维手势时,确定所述终端关闭了手势识别模式。When the second two-dimensional gesture is detected, it is determined that the terminal turns off the gesture recognition mode.
  4. 根据权利要求1所述的方法,其中,所述方法还包括:The method of claim 1 wherein the method further comprises:
    当所述扬声器播放训练超声波时,通过所述麦克风录制指定手势动作的波形数据;Recording waveform data of the specified gesture action through the microphone when the speaker plays the training ultrasonic wave;
    根据所述指定手势动作的波形数据建立手势波形模板库;Establishing a gesture waveform template library according to the waveform data of the specified gesture action;
    根据所述手势波形模板库确定所述指定手势动作对应的手势识别结果。Determining, according to the gesture waveform template library, a gesture recognition result corresponding to the specified gesture action.
  5. 根据权利要求1所述的方法,其中,所述根据接收到的所述第二超声波进行手势识别,得到手势识别结果之后,所述方法还包括:The method according to claim 1, wherein after the gesture recognition is performed according to the received second ultrasonic wave to obtain a gesture recognition result, the method further includes:
    通过所述应用程序的显示界面显示手势识别成功的提示信息。The prompt information for successful gesture recognition is displayed through the display interface of the application.
  6. 根据权利要求1所述的方法,其中,所述通过所述扬声器播放第一超声波,包括:The method of claim 1 wherein said playing the first ultrasonic wave through said speaker comprises:
    通过所述扬声器播放N个频率的第一超声波,所述N为大于或等于1的正整数;Playing a first ultrasonic wave of N frequencies through the speaker, the N being a positive integer greater than or equal to 1;
    所述根据接收到的所述第二超声波进行手势识别,得到手势识别结果,包括:Performing gesture recognition according to the received second ultrasonic wave to obtain a gesture recognition result, including:
    当所述N的取值大于或等于2时,对N个所述第二超声波分别计算出手势的移动距离;When the value of N is greater than or equal to 2, the moving distance of the gesture is calculated for each of the N second ultrasonic waves;
    从N个所述手势的移动距离中排除出异常的移动距离,对于保留的移动距离进行一维度的手势识别,得到第一手势识别结果,所述第一手势识别结果包括:手势靠近所述终端,或者远离所述终端,或者相对于所述终端向左移动,或者相对于所述终端向右移动,或者相对于所述终端向上移动,或者相对于所述终端向下移动。Excluding an abnormal moving distance from the moving distances of the N gestures, and performing one-dimensional gesture recognition on the retained moving distance to obtain a first gesture recognition result, where the first gesture recognition result includes: the gesture is close to the terminal Or moving away from the terminal, or moving to the left relative to the terminal, or moving to the right relative to the terminal, or moving upward relative to the terminal, or moving downward relative to the terminal.
  7. 根据权利要求1所述的方法,其中,所述通过所述麦克风接收第二超声波,包括:The method of claim 1, wherein the receiving the second ultrasonic wave through the microphone comprises:
    通过所述终端的两个麦克风分别采集到第二超声波;Collecting a second ultrasonic wave through two microphones of the terminal;
    所述根据接收到的所述第二超声波进行手势识别,得到手势识别结果,包括:Performing gesture recognition according to the received second ultrasonic wave to obtain a gesture recognition result, including:
    根据所述两个麦克风分别接收到的第二超声波计算手势的相对位置和初始位置;Calculating a relative position and an initial position of the gesture according to the second ultrasonic waves respectively received by the two microphones;
    根据计算出的相对位置和初始位置进行二维度的手势识别,得到第二手势识别结果,所述第二手势识别结果包括:手势变化的二维坐标。Performing a two-dimensional gesture recognition according to the calculated relative position and the initial position, and obtaining a second gesture recognition result, where the second gesture recognition result includes: two-dimensional coordinates of the gesture change.
  8. 一种终端,所述终端中配置有扬声器和麦克风,所述终端上安装有应用程序,所述终端包括:A terminal is configured with a speaker and a microphone, and an application is installed on the terminal, and the terminal includes:
    模式确定模块,用于根据所述应用程序的输入控制指示确定所述终端是否开启手势识别模式;a mode determining module, configured to determine, according to an input control indication of the application, whether the terminal turns on a gesture recognition mode;
    超声波发送模块,用于当所述终端确定开启手势识别模式时,通过所述扬声器播放第一超声波;An ultrasonic transmitting module, configured to: when the terminal determines to enable the gesture recognition mode, play the first ultrasonic wave through the speaker;
    超声波采集模块,用于通过所述麦克风接收第二超声波,所述第二超声波为对所述第一超声波进行采集后得到的波形数据;An ultrasonic acquisition module, configured to receive a second ultrasonic wave by using the microphone, where the second ultrasonic wave is waveform data obtained after collecting the first ultrasonic wave;
    手势识别模块,用于根据接收到的所述第二超声波进行手势识别,得到手势识别结果;a gesture recognition module, configured to perform gesture recognition according to the received second ultrasonic wave, to obtain a gesture recognition result;
    指令执行模块,用于根据所述手势识别结果在所述应用程序执行相应的手势指令。And an instruction execution module, configured to execute a corresponding gesture instruction in the application according to the gesture recognition result.
  9. 一种终端,所述终端包括:扬声器、麦克风、处理器和存储器;A terminal comprising: a speaker, a microphone, a processor, and a memory;
    所述处理器和所述扬声器、所述麦克风、所述存储器进行相互的通信;The processor and the speaker, the microphone, and the memory communicate with each other;
    所述存储器用于存储指令;The memory is for storing instructions;
    所述扬声器,用于在所述处理器的控制下播放第一超声波;The speaker for playing a first ultrasonic wave under the control of the processor;
    所述麦克风,用于在所述处理器的控制下接收第二超声波;The microphone for receiving a second ultrasonic wave under the control of the processor;
    所述处理器,用于执行所述存储器中的所述指令,执行如权利要求1至7中任一项所述的方法。The processor, for executing the instructions in the memory, performing the method of any one of claims 1 to 7.
  10. 一种计算机可读存储介质,包括指令,当其在计算机上运行时,使得计算机执行如权利要求1-7任意一项所述的方法。A computer readable storage medium comprising instructions which, when executed on a computer, cause the computer to perform the method of any of claims 1-7.
PCT/CN2018/117864 2017-11-30 2018-11-28 Gesture recognition method, terminal and storage medium WO2019105376A1 (en)

Applications Claiming Priority (2)

Application Number Priority Date Filing Date Title
CN201711237225.9A CN109857245B (en) 2017-11-30 2017-11-30 Gesture recognition method and terminal
CN201711237225.9 2017-11-30

Publications (1)

Publication Number Publication Date
WO2019105376A1 true WO2019105376A1 (en) 2019-06-06

Family

ID=66664706

Family Applications (1)

Application Number Title Priority Date Filing Date
PCT/CN2018/117864 WO2019105376A1 (en) 2017-11-30 2018-11-28 Gesture recognition method, terminal and storage medium

Country Status (2)

Country Link
CN (1) CN109857245B (en)
WO (1) WO2019105376A1 (en)

Families Citing this family (7)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN111722715A (en) * 2020-06-17 2020-09-29 上海思立微电子科技有限公司 Gesture control system and electronic equipment
CN111929689B (en) * 2020-07-22 2023-04-07 杭州电子科技大学 Object imaging method based on sensor of mobile phone
CN112860070A (en) * 2021-03-03 2021-05-28 北京小米移动软件有限公司 Device interaction method, device interaction apparatus, storage medium and terminal
CN112987925B (en) * 2021-03-04 2022-11-25 歌尔科技有限公司 Earphone and control method and device thereof
CN112965639A (en) * 2021-03-17 2021-06-15 北京小米移动软件有限公司 Gesture recognition method and device, electronic equipment and storage medium
CN115981454A (en) * 2021-10-13 2023-04-18 华为技术有限公司 Non-contact gesture control method and electronic equipment
CN115002278B (en) * 2022-05-12 2023-10-10 中国电信股份有限公司 Gesture control method and device for wireless device, storage medium and electronic device

Citations (3)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN103577107A (en) * 2013-10-29 2014-02-12 广东欧珀移动通信有限公司 Method for rapidly starting application by multi-point touch and smart terminal
CN105718064A (en) * 2016-01-22 2016-06-29 南京大学 Gesture recognition system and method based on ultrasonic waves
CN105807923A (en) * 2016-03-07 2016-07-27 中国科学院计算技术研究所 Ultrasonic wave based volley gesture identification method and system

Family Cites Families (4)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
US11106273B2 (en) * 2015-10-30 2021-08-31 Ostendo Technologies, Inc. System and methods for on-body gestural interfaces and projection displays
US10632500B2 (en) * 2016-05-10 2020-04-28 Invensense, Inc. Ultrasonic transducer with a non-uniform membrane
CN107066086A (en) * 2017-01-17 2017-08-18 上海与德信息技术有限公司 A kind of gesture identification method and device based on ultrasonic wave
CN107291308A (en) * 2017-07-26 2017-10-24 上海科世达-华阳汽车电器有限公司 A kind of gesture identifying device and its recognition methods

Patent Citations (3)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN103577107A (en) * 2013-10-29 2014-02-12 广东欧珀移动通信有限公司 Method for rapidly starting application by multi-point touch and smart terminal
CN105718064A (en) * 2016-01-22 2016-06-29 南京大学 Gesture recognition system and method based on ultrasonic waves
CN105807923A (en) * 2016-03-07 2016-07-27 中国科学院计算技术研究所 Ultrasonic wave based volley gesture identification method and system

Also Published As

Publication number Publication date
CN109857245B (en) 2021-06-15
CN109857245A (en) 2019-06-07

Similar Documents

Publication Publication Date Title
WO2019105376A1 (en) Gesture recognition method, terminal and storage medium
US10466961B2 (en) Method for processing audio signal and related products
US11237724B2 (en) Mobile terminal and method for split screen control thereof, and computer readable storage medium
US7650445B2 (en) System and method for enabling a mobile device as a portable character input peripheral device
WO2016119580A1 (en) Method, device and terminal for starting voice input function of terminal
CN108334272B (en) Control method and mobile terminal
TW201839594A (en) Mobile Terminal, Fingerprint Identification Control Method And Device
WO2019000287A1 (en) Icon display method and device
CN107870674B (en) Program starting method and mobile terminal
WO2018076380A1 (en) Electronic device, and method for generating video thumbnail in electronic device
CN111050370A (en) Network switching method and device, storage medium and electronic equipment
WO2015000429A1 (en) Intelligent word selection method and device
CN108536509B (en) Application body-splitting method and mobile terminal
WO2017193496A1 (en) Application data processing method and apparatus, and terminal device
TW201516844A (en) Apparatus and method for selecting object
JP2018504798A (en) Gesture control method, device, and system
WO2017215635A1 (en) Sound effect processing method and mobile terminal
WO2018166204A1 (en) Method for controlling fingerprint recognition module, and mobile terminal and storage medium
WO2019154360A1 (en) Interface switching method and mobile terminal
CN108073405B (en) Application program unloading method and mobile terminal
WO2019052551A1 (en) Terminal device interaction method, storage medium and terminal device
WO2019061512A1 (en) Task switching method and terminal
WO2019011108A1 (en) Iris recognition method and related product
WO2015014135A1 (en) Mouse pointer control method and apparatus, and terminal device
WO2017031647A1 (en) Method and apparatus for detecting touch mode

Legal Events

Date Code Title Description
121 Ep: the epo has been informed by wipo that ep was designated in this application

Ref document number: 18882926

Country of ref document: EP

Kind code of ref document: A1

NENP Non-entry into the national phase

Ref country code: DE

122 Ep: pct application non-entry in european phase

Ref document number: 18882926

Country of ref document: EP

Kind code of ref document: A1