CN106959746A - The processing method and processing device of speech data - Google Patents
The processing method and processing device of speech data Download PDFInfo
- Publication number
- CN106959746A CN106959746A CN201610019033.XA CN201610019033A CN106959746A CN 106959746 A CN106959746 A CN 106959746A CN 201610019033 A CN201610019033 A CN 201610019033A CN 106959746 A CN106959746 A CN 106959746A
- Authority
- CN
- China
- Prior art keywords
- user
- terminal
- gesture
- speech data
- specified
- Prior art date
- Legal status (The legal status is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the status listed.)
- Pending
Links
Classifications
-
- G—PHYSICS
- G06—COMPUTING; CALCULATING OR COUNTING
- G06F—ELECTRIC DIGITAL DATA PROCESSING
- G06F3/00—Input arrangements for transferring data to be processed into a form capable of being handled by the computer; Output arrangements for transferring data from processing unit to output unit, e.g. interface arrangements
- G06F3/01—Input arrangements or combined input and output arrangements for interaction between user and computer
- G06F3/017—Gesture based interaction, e.g. based on a set of recognized hand gestures
-
- G—PHYSICS
- G06—COMPUTING; CALCULATING OR COUNTING
- G06F—ELECTRIC DIGITAL DATA PROCESSING
- G06F3/00—Input arrangements for transferring data to be processed into a form capable of being handled by the computer; Output arrangements for transferring data from processing unit to output unit, e.g. interface arrangements
- G06F3/16—Sound input; Sound output
Landscapes
- Engineering & Computer Science (AREA)
- Theoretical Computer Science (AREA)
- General Engineering & Computer Science (AREA)
- Human Computer Interaction (AREA)
- Physics & Mathematics (AREA)
- General Physics & Mathematics (AREA)
- Health & Medical Sciences (AREA)
- Audiology, Speech & Language Pathology (AREA)
- General Health & Medical Sciences (AREA)
- User Interface Of Digital Computer (AREA)
Abstract
The present invention provides a kind of processing method and processing device of speech data.The embodiment of the present invention is by obtaining beginning operating gesture of the user to terminal, if the operating gesture that starts meets the specified beginning gesture pre-set, make it possible to open speech voice input function, to gather the speech data of the user, due to performing voice service using the beginning operating gesture triggering specified, so that without in the functionality controls for specifying the specified location at interface to be provided for inputting speech data, it can avoid in the prior art due to being arranged on caused by the specified location at specified interface and inflexible technical problem cumbersome when user needs input speech data for inputting the functionality controls of speech data, so as to improve efficiency and the flexibility of language data process.
Description
【Technical field】
The present invention relates to the communication technology, more particularly to a kind of processing method and processing device of speech data.
【Background technology】
With the development of the communication technology, terminal is integrated with increasing function, so that terminal is
More and more corresponding applications (Application, APP) are contained in system feature list.In some applications
Voice service can be related to, for example, the speech voice input function in wechat application, the language in Baidu search application
Sound assistant, etc..In voice service, it can be used to input specifying the specified location at interface to provide one
The functionality controls of speech data.When user operates this functionality controls using input equipment, then it can open
Begin collection speech data.
However, the specified location due to being arranged on specified interface for inputting the functionality controls of speech data,
Therefore, when user needs input speech data, it is necessary to represent specified interface according to the operation of user, and
Functionality controls and the operation of specified location are found on interface is specified by user, user could be gathered and carried
The speech data of confession, it is cumbersome and dumb, so as to result in efficiency and the spirit of language data process
The reduction of activity.
【The content of the invention】
The many aspects of the present invention provide a kind of processing method and processing device of speech data, to improve voice
The efficiency of data processing and flexibility.
An aspect of of the present present invention there is provided a kind of processing method of speech data, including:
Obtain beginning operating gesture of the user to terminal;
If the operating gesture that starts meets the specified beginning gesture pre-set, speech voice input function is opened,
To gather the speech data of the user.
Aspect as described above and any possible implementation, it is further provided a kind of implementation, institute
The beginning operating gesture for obtaining user to terminal is stated, including:
Based on the specified interface pre-set, beginning operating gesture of the detection user to terminal.
Aspect as described above and any possible implementation, it is further provided a kind of implementation, institute
At least one of state beginning operating gesture of the user to terminal, including in following operating gesture:
Operation of the user to the button of the terminal;
Hanging slip of the user above the terminal;
Contact slide of the user on specific interface;And
User drives the motion of the terminal.
Aspect as described above and any possible implementation, it is further provided a kind of implementation, institute
Contact slide of the user on specific interface is stated, including:
User operates in the long-press of specific interface blank region.
Aspect as described above and any possible implementation, it is further provided a kind of implementation, institute
If stating the beginning operating gesture meets the specified beginning gesture pre-set, speech voice input function is opened,
To gather the speech data of the user, including:
If the operating gesture that starts meets the specified beginning gesture pre-set, voice number has been detected whether
According to input, untill voice stopping input instruction being received;
If having detected speech data input, the speech data is handled.
Aspect as described above and any possible implementation, it is further provided a kind of implementation, institute
State after having detected whether that speech data is inputted, also include:
Obtain end operation gesture of the user to the terminal;
If the end operation gesture meets the specified end gesture pre-set, receive the voice and stop
Input instruction.
Another aspect of the present invention there is provided a kind of processing unit of speech data, including:
Acquiring unit, for obtaining beginning operating gesture of the user to terminal;
Voice unit, if meeting the specified beginning gesture pre-set for the beginning operating gesture, is opened
Speech voice input function is opened, to gather the speech data of the user.
Aspect as described above and any possible implementation, it is further provided a kind of implementation, institute
Acquiring unit is stated, specifically for
Based on the specified interface pre-set, beginning operating gesture of the detection user to terminal.
Aspect as described above and any possible implementation, it is further provided a kind of implementation, institute
At least one of state beginning operating gesture of the user to terminal, including in following operating gesture:
Operation of the user to the button of the terminal;
Hanging slip of the user above the terminal;
Contact slide of the user on specific interface;And
User drives the motion of the terminal.
Aspect as described above and any possible implementation, it is further provided a kind of implementation, institute
Contact slide of the user on specific interface is stated, including:
User operates in the long-press of specific interface blank region.
Aspect as described above and any possible implementation, it is further provided a kind of implementation, institute
Speech units, specifically for
If the operating gesture that starts meets the specified beginning gesture pre-set, voice number has been detected whether
According to input, untill voice stopping input instruction being received;
If having detected speech data input, the speech data is handled.
Aspect as described above and any possible implementation, it is further provided a kind of implementation, institute
Speech units, are additionally operable to
Obtain end operation gesture of the user to the terminal;
If the end operation gesture meets the specified end gesture pre-set, receive the voice and stop
Input instruction.
As shown from the above technical solution, the embodiment of the present invention is by obtaining beginning manipulator of the user to terminal
Gesture, if the operating gesture that starts meets the specified beginning gesture pre-set, enabling open voice
Input function, to gather the speech data of the user, due to using the beginning operating gesture triggering specified
Perform voice service so that without in the work(for specifying the specified location at interface to be provided for inputting speech data
Can control, can avoid in the prior art due to for input the functionality controls of speech data be arranged on it is specified
Caused by the specified location at interface when user needs input speech data cumbersome and inflexible skill
Art problem, so as to improve efficiency and the flexibility of language data process.
In addition, using technical scheme provided by the present invention, due to being touched using the beginning operating gesture specified
Hair performs voice service so that operating area is no longer limited by the functionality controls for inputting speech data
Size and location, can effectively improve the reliability and efficiency of language data process.
【Brief description of the drawings】
Technical scheme in order to illustrate more clearly the embodiments of the present invention, below will be to embodiment or existing
The accompanying drawing to be used needed for technology description is briefly described, it should be apparent that, in describing below
Accompanying drawing is some embodiments of the present invention, for those of ordinary skill in the art, is not paying creation
Property it is laborious on the premise of, other accompanying drawings can also be obtained according to these accompanying drawings.
The schematic flow sheet of the processing method for the speech data that Fig. 1 provides for one embodiment of the invention;
The structural representation of the processing unit for the speech data that Fig. 2 provides for another embodiment of the present invention.
【Embodiment】
To make the purpose, technical scheme and advantage of the embodiment of the present invention clearer, below in conjunction with this hair
Accompanying drawing in bright embodiment, the technical scheme in the embodiment of the present invention is clearly and completely described,
Obviously, described embodiment is a part of embodiment of the invention, rather than whole embodiments.It is based on
Embodiment in the present invention, those of ordinary skill in the art are obtained under the premise of creative work is not made
The whole other embodiments obtained, belong to the scope of protection of the invention.
It should be noted that terminal involved in the embodiment of the present application can include but is not limited to mobile phone,
Personal digital assistant (Personal Digital Assistant, PDA), wireless handheld device, it is wireless on
Net sheet, it is PC, portable computer, panel computer, MP3 player, MP4 players, wearable
Equipment (for example, intelligent glasses, intelligent watch, Intelligent bracelet etc.) etc..
In addition, the terms "and/or", only a kind of incidence relation for describing affiliated partner, is represented
There may be three kinds of relations, for example, A and/or B, can be represented:Individualism A, while there is A
And B, individualism B these three situations.In addition, character "/" herein, typicallys represent forward-backward correlation pair
As if a kind of relation of "or".
The schematic flow sheet of the processing method for the speech data that Fig. 1 provides for one embodiment of the invention, such as Fig. 1
It is shown.
101st, beginning operating gesture of the user to terminal is obtained.
If the 102, the operating gesture that starts meets the specified beginning gesture pre-set, phonetic entry is opened
Function, to gather the speech data of the user.
It should be noted that 101~102 executive agent can be partly or entirely to be located locally terminal
Application, or can also be the plug-in unit or SDK being arranged in the application of local terminal
Functional units such as (Software Development Kit, SDK) is wrapped, or can also be positioned at grid
Processing engine in the server of side, or can also be the distributed system positioned at grid side, the present embodiment
To this without being particularly limited to.
It is understood that the application can be mounted in the local program (nativeApp) in terminal,
Or can also be a web page program (webApp) of the browser in terminal, the present embodiment to this not
It is particularly limited.
So, by obtaining beginning operating gesture of the user to terminal, if the beginning operating gesture is met
The specified beginning gesture pre-set, enabling speech voice input function is opened, to gather the user's
Speech data, due to performing voice service using the beginning operating gesture triggering specified so that without referring to
The specified location in demarcation face is provided for inputting the functionality controls of speech data, can avoid in the prior art
Due to for input the functionality controls of speech data be arranged on caused by the specified location at specified interface with
Cumbersome and inflexible technical problem when family needs input speech data, so as to improve speech data
The efficiency of processing and flexibility.
Alternatively, in a possible implementation of the present embodiment, in 101, it can specifically examine
Survey beginning operating gesture of the user to terminal.
Specifically, the user can include but is not limited to following operation to the beginning operating gesture of terminal
At least one of in gesture:
Operation of the user to the button of the terminal;
Hanging slip of the user above the terminal;
Contact slide of the user on specific interface;And
User drives the motion of the terminal.
Wherein,
Operation of the user to the button of the terminal, can refer to user operation terminal some button by
Key is identified, either the marking keys of some button of user's operation terminal and operation direction or user behaviour
Make the marking keys and operation order of multiple buttons of terminal, or user operates multiple buttons of terminal
Operation trace of marking keys, operation order and each button, etc., the present embodiment to this without
It is particularly limited to.
Hanging slip of the user above the terminal, can refer to imaging sensor of the user in terminal
Within acquisition range, the hanging sliding trace above terminal.Wherein, described image sensor can be
Charge coupled cell (Charge Coupled Device, CCD) sensor, or can also be metal
Oxide-semiconductor devices (Complementary Metal-Oxide Semiconductor, CMOS)
Sensor, the present embodiment is to this without being particularly limited to.The hanging sliding trace can include but not limit
In the straight line or the song of arbitrary shape that are made up of several corresponding dwell points of several continuously slipping events
Line.
Contact slide of the user on specific interface, can refer to shown by display device of the user in terminal
Specific interface on contact slide track.Generally, terminal be able to can be touched according to whether display device has
The characteristic of control, is divided into two types, and a type is touch terminal, and another type is non-touch
Terminal.Specifically, the specific interface shown by touch screen of the user in touch terminal can specifically be detected
On contact slide data.The contact slide track can include but is not limited to by several continuous touches
The straight line of corresponding several touch points composition of event or the curve of arbitrary shape.Specifically, specifically may be used
Think that user operates in the long-press of specific interface blank region.For example, instant messaging class APP dialogue
Interface.
User drives the motion of the terminal, can refer to user's handheld terminal, drives terminal to be transported
Dynamic movement locus, for example, rocking, overturning.
In a concrete implementation mode, sensor device can be specifically utilized, user is to terminal for detection
Beginning operating gesture.Specifically, the sensor device can include but is not limited to gravity sensor,
In acceleration transducer, pressure sensor, infrared ray sensor, range sensor and imaging sensor
At least one, the present embodiment is to this without being particularly limited to.
Wherein, the range sensor can be ultrasonic distance sensor, or can also for it is infrared away from
From sensor, it can also be either laser distance sensor or can also be microwave range sensor,
The present embodiment is to this without being particularly limited to.These range sensors are all existing mature technologies, in detail
Description may refer to related content of the prior art, and here is omitted.
Wherein, described image sensor can for charge coupled cell (Charge Coupled Device,
CCD) sensor, or can also be metal oxide semiconductor device (Complementary
Metal-Oxide Semiconductor, CMOS) sensor, the present embodiment is to this without especially limit
It is fixed.
Specifically, detection user can specifically refer to detection user to end the beginning operating gesture of terminal
Starting point, end point and the track formed by starting point to end point of the beginning operating gesture at end,
Or can also further detect the radian data corresponding to the track.
Alternatively,, specifically can be with base in 101 in a possible implementation of the present embodiment
In the specified interface pre-set, beginning operating gesture of the detection user to terminal.
During a concrete implementation, the specified interface can be the desktop of the operating system of terminal.
Wherein, the operating system can include but is not limited to the ios operating systems of apple, the operation of the Android of Google
System or the Windows operating system or other terminal operating systems of Microsoft.
The desktop of so-called operating system, refers to the desktop that the operating system that terminal is run is provided, is
The main entrance that user interacts with terminal, is also the graphic user interface of man-machine interaction.Operating system
Desktop could be arranged to including but not limited to any operation object.For example, the icon of application program is such as,
A figure in phone, information, memorandum, photo, microblogging, wechat, mobile phone house keeper and various game
Mark or its any icon combination etc., or, for another example the icon that the icon of systemic-function such as system is set
Or System menu etc..
During another concrete implementation, the specified interface can be the specified any page applied.
Wherein, the specified application can include but is not limited in terminal any APP or pre-set at least
One application.For example, instant messaging class APP, searching class APP etc..
During another concrete implementation, the specified interface can be the specified specified interface applied.
Wherein, the specified application can include but is not limited in terminal any APP or pre-set at least
One application.For example, instant messaging class APP, searching class APP etc..The specified interface can include
But it is not limited to specify at least one page pre-set of application.For example, instant messaging class APP pair
Talk about interface etc..
In the present embodiment, in order to shorten the time using input speech data, user, which can use, starts behaviour
Make a sign with the hand, triggered, without, when user needs input speech data, being needed as prior art
To represent specified interface according to the operation of user, and specified location is found on interface is specified by user
Functionality controls are simultaneously operated, and could gather the speech data that user is provided.In such manner, it is possible to so that terminal not
Be laid out again by the page, and the other application being currently currently running limitation, language can be effectively improved
The efficiency of sound data processing and flexibility.
In order to realize above-mentioned functions, alternatively, in a possible implementation of the present embodiment,
Before 102, it can also further pre-set several and specify beginning gesture.Only as acquired user
During the specified beginning gesture that the beginning operating gesture satisfaction to terminal is pre-set, follow-up operation is just performed.
Wherein, the specified data for starting gesture can be stored in the storage device of terminal.
During a concrete implementation, the storage device of the terminal can have with slow storage device
Body can be the hard disk of computer system, or can also be physical memory for the inoperative internal memory of mobile phone,
For example, read-only storage (Read-Only Memory, ROM) and RAM card etc., the present embodiment is to this
Without being particularly limited to.
During another concrete implementation, the storage device of the terminal can also set for quick storage
It is standby, it is specifically as follows the internal memory of computer system, or can also be in system for the running memory of mobile phone
Deposit, for example, random access memory (Random Access Memory, RAM) etc., the present embodiment pair
This is without being particularly limited to.
If for example, acquired beginning operating gesture is operation of the user to the button of the terminal, in advance
The specified beginning gesture first set can be then the predetermined registration operation data of one group of button.
Or, if for another example acquired beginning operating gesture is that user is hanging above the terminal
Slide, then the specified beginning gesture pre-set can be then the track data of a desired guiding trajectory, for example,
Track data of the track data of the straight-line pattern of all directions, " Z " pattern or " L " pattern etc..
Or, if for another example acquired beginning operating gesture is contact cunning of the user on specific interface
Dynamic, then the specified beginning gesture pre-set can be then the track data of a desired guiding trajectory, for example,
The track data of long-press, the track data etc. for sliding to assigned direction certain distance.
Or, if for another example acquired beginning operating gesture is the motion that user drives the terminal,
The specified beginning gesture then pre-set can be then the event data of a predeterminable event, for example, rocking
Event.
Alternatively, in a possible implementation of the present embodiment, in 102, if described start
Operating gesture meets the specified beginning gesture pre-set, and explanation can open speech voice input function.Opening
Open after speech voice input function, microphone prompting icon can be exported in current interface, to point out user
Talk, and output content of text, to point out the operating gesture for cancelling current audio data input.
Now, then speech data input can have been detected whether, until reception voice stopping input instruction is
Only.If having detected speech data input, the speech data is handled.
In such manner, it is possible to during whole session is voice service, detect whether that speech data is defeated all the time
Enter, client effectively reduces instruction and handed over without obtaining beginning operating gesture of the user to terminal repeatedly
Mutually processing, so as to further increase the efficiency of phonetic entry.
In the implementation, any voice processing technology of the prior art can be used, to speech data
Handled, detailed description may refer to related content of the prior art, and here is omitted.
, can be with while speech data input has been detected whether during a concrete implementation
End operation gesture of the user to the terminal is further obtained, if the end operation gesture meets advance
The specified end gesture set, explanation can terminate phonetic entry, then can receive the voice stopping defeated
Enter instruction.
The end operation gesture, can be and the corresponding gesture corresponding to the beginning operating gesture, tool
Body, the user can also include but is not limited to following operating gesture to the end operation gesture of terminal
At least one of in:
Operation of the user to the button of the terminal;
Hanging slip of the user above the terminal;
Contact slide of the user on specific interface;And
User drives the motion of the terminal.
It is specifically described can also be referring to the specific descriptions for starting operating gesture, and here is omitted.
Difference of the present invention from prior art is essentially consisted in, by carrying out the setting of function hot-zone to terminal,
To terminal increase hot-zone operation, for example, clicking on or specific interface blank region increase hot-zone to terminal
Increase the focus incidents such as shake event operation, etc., in input process, simplify the operation of phonetic entry
Step, the convenient and swift property of increase application phonetic entry.Phonetic search function, society to searching class application
The service efficiency of the voice-enabled chat function of class application etc. is handed over, can be substantially improved.
In the present embodiment, by obtaining beginning operating gesture of the user to terminal, if the beginning manipulator
Gesture meets the specified beginning gesture pre-set, enabling open speech voice input function, described to gather
The speech data of user, due to performing voice service using the beginning operating gesture triggering specified so that nothing
It need to can be avoided existing in the functionality controls for specifying the specified location at interface to be provided for inputting speech data
Due to being arranged on the specified location at specified interface for inputting the functionality controls of speech data and cause in technology
The cumbersome and inflexible technical problem when user needs input speech data, so as to improve language
The efficiency of sound data processing and flexibility.
In addition, using technical scheme provided by the present invention, due to being touched using the beginning operating gesture specified
Hair performs voice service so that operating area is no longer limited by the functionality controls for inputting speech data
Size and location, can effectively improve the reliability and efficiency of language data process.
It should be noted that for foregoing each method embodiment, in order to be briefly described, therefore by its all table
State as a series of combination of actions, but those skilled in the art should know, the present invention is not by being retouched
The limitation for the sequence of movement stated, because according to the present invention, some steps can be using other orders or same
Shi Jinhang.Secondly, those skilled in the art should also know, embodiment described in this description belongs to
In preferred embodiment, involved action and the module not necessarily present invention are necessary.
In the above-described embodiments, the description to each embodiment all emphasizes particularly on different fields, and does not have in some embodiment
The part of detailed description, may refer to the associated description of other embodiment.
The structural representation of the processing unit for the speech data that Fig. 2 provides for another embodiment of the present invention, such as
Shown in Fig. 2.The processing unit of the speech data of the present embodiment can include acquiring unit 21 and voice unit
22.Wherein, acquiring unit 21, for obtaining beginning operating gesture of the user to terminal;Voice unit 22,
If meeting the specified beginning gesture pre-set for the beginning operating gesture, speech voice input function is opened,
To gather the speech data of the user.
It should be noted that the processing unit of the speech data of the present embodiment can be partly or entirely position
Application in local terminal, or can also be the plug-in unit being arranged in the application of local terminal or soft
The functional units such as part development kit (Software Development Kit, SDK), or can be with
For the processing engine in the server of grid side, or can also be the distributed system positioned at grid side,
The present embodiment is to this without being particularly limited to.
It is understood that the application can be mounted in the local program (nativeApp) in terminal,
Or can also be a web page program (webApp) of the browser in terminal, the present embodiment to this not
It is particularly limited.
Alternatively, in a possible implementation of the present embodiment, the acquiring unit 21, specifically
It can be used for based on the specified interface pre-set, beginning operating gesture of the detection user to terminal.
Alternatively, in a possible implementation of the present embodiment, beginning of the user to terminal
Operating gesture, can include but is not limited at least one in following operating gesture:
Operation of the user to the button of the terminal;
Hanging slip of the user above the terminal;
Contact slide of the user on specific interface;And
User drives the motion of the terminal.
Wherein, contact slide of the user on specific interface can be user in specific interface overhead
The long-press operation of white region.
Alternatively, in a possible implementation of the present embodiment, institute's speech units 22, specifically
If can be used for the beginning operating gesture meets the specified beginning gesture pre-set, language has been detected whether
Sound data input, untill voice stopping input instruction being received;If having detected speech data input,
The speech data can then be handled.
In the implementation, institute's speech units 22 can also be further used for obtaining user to described
The end operation gesture of terminal;If the end operation gesture meets the specified end gesture pre-set,
The voice can then be received and stop input instruction.
It should be noted that method in the corresponding embodiments of Fig. 1, the voice that can be provided by the present embodiment
The processing unit of data is realized.The related content that may refer in the corresponding embodiments of Fig. 1 is described in detail,
Here is omitted.
In the present embodiment, user is obtained to the beginning operating gesture of terminal, voice unit by acquiring unit
If the operating gesture that starts meets the specified beginning gesture pre-set, enabling open phonetic entry
Function, to gather the speech data of the user, is performed due to being triggered using the beginning operating gesture specified
Voice service so that without in the function control for specifying the specified location at interface to be provided for inputting speech data
Part, can be avoided in the prior art due to being arranged on specified interface for inputting the functionality controls of speech data
Specified location caused by when user needs input speech data cumbersome and inflexible technology ask
Topic, so as to improve efficiency and the flexibility of language data process.
In addition, using technical scheme provided by the present invention, due to being touched using the beginning operating gesture specified
Hair performs voice service so that operating area is no longer limited by the functionality controls for inputting speech data
Size and location, can effectively improve the reliability and efficiency of language data process.
In several embodiments provided by the present invention, it should be understood that disclosed system, device and
Method, can be realized by another way.For example, device embodiment described above is only to show
Meaning property, for example, the division of the unit, only a kind of division of logic function can when actually realizing
To there is other dividing mode, such as multiple units or component can combine or be desirably integrated into another
System, or some features can be ignored, or not perform.It is another, it is shown or discussed each other
Coupling or direct-coupling or communication connection can be the INDIRECT COUPLING of device or unit by some interfaces
Or communication connection, can be electrical, machinery or other forms.
The unit illustrated as separating component can be or may not be it is physically separate, make
It can be for the part that unit is shown or may not be physical location, you can with positioned at a place,
Or can also be distributed on multiple NEs.Can select according to the actual needs part therein or
Person's whole units realize the purpose of this embodiment scheme.
In addition, each functional unit in each embodiment of the invention can be integrated in a processing unit,
Can also be that unit is individually physically present, can also two or more units be integrated in a list
In member.Above-mentioned integrated unit can both be realized in the form of hardware, it would however also be possible to employ hardware adds software
The form of functional unit is realized.
The above-mentioned integrated unit realized in the form of SFU software functional unit, can be stored in a computer
In read/write memory medium.Above-mentioned SFU software functional unit is stored in a storage medium, including some fingers
Order is make it that a computer installation (can be personal computer, audio frequency process engine, or network
Device etc.) or processor (processor) perform the part steps of each of the invention embodiment methods described.
And foregoing storage medium includes:USB flash disk, mobile hard disk, read-only storage (Read-Only Memory,
ROM), random access memory (Random Access Memory, RAM), magnetic disc or light
Disk etc. is various can be with the medium of store program codes.
Finally it should be noted that:The above embodiments are merely illustrative of the technical solutions of the present invention, rather than to it
Limitation;Although the present invention is described in detail with reference to the foregoing embodiments, the ordinary skill of this area
Personnel should be understood:It can still modify to the technical scheme described in foregoing embodiments, or
Person carries out equivalent to which part technical characteristic;And these modifications or replacement, do not make corresponding skill
The essence of art scheme departs from the spirit and scope of various embodiments of the present invention technical scheme.
Claims (12)
1. a kind of processing method of speech data, it is characterised in that including:
Obtain beginning operating gesture of the user to terminal;
If the operating gesture that starts meets the specified beginning gesture pre-set, speech voice input function is opened,
To gather the speech data of the user.
2. according to the method described in claim 1, it is characterised in that the acquisition user is opened terminal
Beginning operating gesture, including:
Based on the specified interface pre-set, beginning operating gesture of the detection user to terminal.
3. according to the method described in claim 1, it is characterised in that the user starts behaviour to terminal
At least one of make a sign with the hand, including in following operating gesture:
Operation of the user to the button of the terminal;
Hanging slip of the user above the terminal;
Contact slide of the user on specific interface;And
User drives the motion of the terminal.
4. method according to claim 3, it is characterised in that the user is on specific interface
Contact slide, including:
User operates in the long-press of specific interface blank region.
5. the method according to Claims 1 to 4 any claim, it is characterised in that if the institute
State and start the specified beginning gesture that operating gesture satisfaction is pre-set, open speech voice input function, to gather
The speech data of the user, including:
If the operating gesture that starts meets the specified beginning gesture pre-set, voice number has been detected whether
According to input, untill voice stopping input instruction being received;
If having detected speech data input, the speech data is handled.
6. method according to claim 5, it is characterised in that described to have detected whether speech data
After input, also include:
Obtain end operation gesture of the user to the terminal;
If the end operation gesture meets the specified end gesture pre-set, receive the voice and stop
Input instruction.
7. a kind of processing unit of speech data, it is characterised in that including:
Acquiring unit, for obtaining beginning operating gesture of the user to terminal;
Voice unit, if meeting the specified beginning gesture pre-set for the beginning operating gesture, is opened
Speech voice input function is opened, to gather the speech data of the user.
8. device according to claim 7, it is characterised in that the acquiring unit, specifically for
Based on the specified interface pre-set, beginning operating gesture of the detection user to terminal.
9. device according to claim 7, it is characterised in that the user starts behaviour to terminal
At least one of make a sign with the hand, including in following operating gesture:
Operation of the user to the button of the terminal;
Hanging slip of the user above the terminal;
Contact slide of the user on specific interface;And
User drives the motion of the terminal.
10. device according to claim 9, it is characterised in that the user is on specific interface
Contact slide, including:
User operates in the long-press of specific interface blank region.
11. the device according to claim 7~10 any claim, it is characterised in that institute's predicate
Sound unit, specifically for
If the operating gesture that starts meets the specified beginning gesture pre-set, voice number has been detected whether
According to input, untill voice stopping input instruction being received;
If having detected speech data input, the speech data is handled.
12. device according to claim 11, it is characterised in that institute's speech units, is additionally operable to
Obtain end operation gesture of the user to the terminal;
If the end operation gesture meets the specified end gesture pre-set, receive the voice and stop
Input instruction.
Priority Applications (1)
Application Number | Priority Date | Filing Date | Title |
---|---|---|---|
CN201610019033.XA CN106959746A (en) | 2016-01-12 | 2016-01-12 | The processing method and processing device of speech data |
Applications Claiming Priority (1)
Application Number | Priority Date | Filing Date | Title |
---|---|---|---|
CN201610019033.XA CN106959746A (en) | 2016-01-12 | 2016-01-12 | The processing method and processing device of speech data |
Publications (1)
Publication Number | Publication Date |
---|---|
CN106959746A true CN106959746A (en) | 2017-07-18 |
Family
ID=59480855
Family Applications (1)
Application Number | Title | Priority Date | Filing Date |
---|---|---|---|
CN201610019033.XA Pending CN106959746A (en) | 2016-01-12 | 2016-01-12 | The processing method and processing device of speech data |
Country Status (1)
Country | Link |
---|---|
CN (1) | CN106959746A (en) |
Cited By (6)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
CN107390881A (en) * | 2017-09-14 | 2017-11-24 | 西安领讯卓越信息技术有限公司 | A kind of gestural control method |
CN107592416A (en) * | 2017-08-31 | 2018-01-16 | 努比亚技术有限公司 | Method for sending voice message, terminal and computer-readable recording medium |
CN107864289A (en) * | 2017-11-17 | 2018-03-30 | 珠海市魅族科技有限公司 | A kind of pronunciation inputting method and device, terminal, readable storage medium storing program for executing |
CN108965584A (en) * | 2018-06-21 | 2018-12-07 | 北京百度网讯科技有限公司 | A kind of processing method of voice messaging, device, terminal and storage medium |
CN109120793A (en) * | 2018-09-07 | 2019-01-01 | 无线生活(杭州)信息科技有限公司 | Method of speech processing and device |
CN109979442A (en) * | 2017-12-27 | 2019-07-05 | 珠海市君天电子科技有限公司 | A kind of sound control method, device and electronic equipment |
Citations (6)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
CN103197911A (en) * | 2013-04-12 | 2013-07-10 | 广东国笔科技股份有限公司 | Method, system and device for providing speech input |
CN103488401A (en) * | 2013-09-30 | 2014-01-01 | 乐视致新电子科技(天津)有限公司 | Voice assistant activating method and device |
CN103488384A (en) * | 2013-09-30 | 2014-01-01 | 乐视致新电子科技(天津)有限公司 | Voice assistant application interface display method and device |
CN104111728A (en) * | 2014-06-26 | 2014-10-22 | 联想(北京)有限公司 | Electronic device and voice command input method based on operation gestures |
CN204679955U (en) * | 2015-05-22 | 2015-09-30 | 广东好帮手电子科技股份有限公司 | A kind of device by the voice activated control module of gesture identification |
CN104978014A (en) * | 2014-04-11 | 2015-10-14 | 维沃移动通信有限公司 | Method for quickly calling application program or system function, and mobile terminal thereof |
-
2016
- 2016-01-12 CN CN201610019033.XA patent/CN106959746A/en active Pending
Patent Citations (6)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
CN103197911A (en) * | 2013-04-12 | 2013-07-10 | 广东国笔科技股份有限公司 | Method, system and device for providing speech input |
CN103488401A (en) * | 2013-09-30 | 2014-01-01 | 乐视致新电子科技(天津)有限公司 | Voice assistant activating method and device |
CN103488384A (en) * | 2013-09-30 | 2014-01-01 | 乐视致新电子科技(天津)有限公司 | Voice assistant application interface display method and device |
CN104978014A (en) * | 2014-04-11 | 2015-10-14 | 维沃移动通信有限公司 | Method for quickly calling application program or system function, and mobile terminal thereof |
CN104111728A (en) * | 2014-06-26 | 2014-10-22 | 联想(北京)有限公司 | Electronic device and voice command input method based on operation gestures |
CN204679955U (en) * | 2015-05-22 | 2015-09-30 | 广东好帮手电子科技股份有限公司 | A kind of device by the voice activated control module of gesture identification |
Cited By (7)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
CN107592416A (en) * | 2017-08-31 | 2018-01-16 | 努比亚技术有限公司 | Method for sending voice message, terminal and computer-readable recording medium |
CN107592416B (en) * | 2017-08-31 | 2020-11-17 | 努比亚技术有限公司 | Voice message transmitting method, terminal and computer readable storage medium |
CN107390881A (en) * | 2017-09-14 | 2017-11-24 | 西安领讯卓越信息技术有限公司 | A kind of gestural control method |
CN107864289A (en) * | 2017-11-17 | 2018-03-30 | 珠海市魅族科技有限公司 | A kind of pronunciation inputting method and device, terminal, readable storage medium storing program for executing |
CN109979442A (en) * | 2017-12-27 | 2019-07-05 | 珠海市君天电子科技有限公司 | A kind of sound control method, device and electronic equipment |
CN108965584A (en) * | 2018-06-21 | 2018-12-07 | 北京百度网讯科技有限公司 | A kind of processing method of voice messaging, device, terminal and storage medium |
CN109120793A (en) * | 2018-09-07 | 2019-01-01 | 无线生活(杭州)信息科技有限公司 | Method of speech processing and device |
Similar Documents
Publication | Publication Date | Title |
---|---|---|
EP3951576B1 (en) | Content sharing method and electronic device | |
JP6130926B2 (en) | Gesture conversation processing method, apparatus, terminal device, program, and recording medium | |
JP6997734B2 (en) | Handwritten keyboard for screen | |
CN110046238B (en) | Dialogue interaction method, graphic user interface, terminal equipment and network equipment | |
CN106959746A (en) | The processing method and processing device of speech data | |
CN104571852B (en) | The moving method and device of icon | |
EP2680257B1 (en) | Mobile terminal and method for recognizing voice thereof | |
CN108701001A (en) | Show the method and electronic equipment of graphic user interface | |
CN107077295A (en) | A kind of method, device, electronic equipment, display interface and the storage medium of quick split screen | |
CN107896279A (en) | Screenshotss processing method, device and the mobile terminal of a kind of mobile terminal | |
EP2731028A2 (en) | Mobile terminal and control method thereof | |
US20150277748A1 (en) | Edit providing method according to multi-touch-based text block setting | |
CN107967055A (en) | A kind of man-machine interaction method, terminal and computer-readable medium | |
CN106796789A (en) | Interacted with the speech that cooperates with of speech reference point | |
CN104076916A (en) | Information processing method and electronic device | |
CN103870133A (en) | Method and apparatus for scrolling screen of display device | |
CN107870705B (en) | Method and device for changing icon position of application menu | |
CN104765525A (en) | Operation interface switching method and device | |
JP6612351B2 (en) | Device, method and graphic user interface used to move application interface elements | |
KR20160016526A (en) | Method for Providing Information and Device thereof | |
CN105373318B (en) | Information display method and device | |
CN104750375A (en) | Interface display method and device | |
CN106197394A (en) | Air navigation aid and device | |
KR101880310B1 (en) | Terminal having chatting information display function in the chatting thread and control method thereof | |
CN108028869A (en) | The method of terminal device and processing incoming call |
Legal Events
Date | Code | Title | Description |
---|---|---|---|
PB01 | Publication | ||
PB01 | Publication | ||
SE01 | Entry into force of request for substantive examination | ||
SE01 | Entry into force of request for substantive examination | ||
RJ01 | Rejection of invention patent application after publication | ||
RJ01 | Rejection of invention patent application after publication |
Application publication date: 20170718 |