CN106959746A - The processing method and processing device of speech data - Google Patents

The processing method and processing device of speech data Download PDF

Info

Publication number
CN106959746A
CN106959746A CN201610019033.XA CN201610019033A CN106959746A CN 106959746 A CN106959746 A CN 106959746A CN 201610019033 A CN201610019033 A CN 201610019033A CN 106959746 A CN106959746 A CN 106959746A
Authority
CN
China
Prior art keywords
user
terminal
gesture
speech data
specified
Prior art date
Legal status (The legal status is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the status listed.)
Pending
Application number
CN201610019033.XA
Other languages
Chinese (zh)
Inventor
韩璐
Current Assignee (The listed assignees may be inaccurate. Google has not performed a legal analysis and makes no representation or warranty as to the accuracy of the list.)
Beijing Baidu Netcom Science and Technology Co Ltd
Original Assignee
Beijing Baidu Netcom Science and Technology Co Ltd
Priority date (The priority date is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the date listed.)
Filing date
Publication date
Application filed by Beijing Baidu Netcom Science and Technology Co Ltd filed Critical Beijing Baidu Netcom Science and Technology Co Ltd
Priority to CN201610019033.XA priority Critical patent/CN106959746A/en
Publication of CN106959746A publication Critical patent/CN106959746A/en
Pending legal-status Critical Current

Links

Classifications

    • GPHYSICS
    • G06COMPUTING; CALCULATING OR COUNTING
    • G06FELECTRIC DIGITAL DATA PROCESSING
    • G06F3/00Input arrangements for transferring data to be processed into a form capable of being handled by the computer; Output arrangements for transferring data from processing unit to output unit, e.g. interface arrangements
    • G06F3/01Input arrangements or combined input and output arrangements for interaction between user and computer
    • G06F3/017Gesture based interaction, e.g. based on a set of recognized hand gestures
    • GPHYSICS
    • G06COMPUTING; CALCULATING OR COUNTING
    • G06FELECTRIC DIGITAL DATA PROCESSING
    • G06F3/00Input arrangements for transferring data to be processed into a form capable of being handled by the computer; Output arrangements for transferring data from processing unit to output unit, e.g. interface arrangements
    • G06F3/16Sound input; Sound output

Landscapes

  • Engineering & Computer Science (AREA)
  • Theoretical Computer Science (AREA)
  • General Engineering & Computer Science (AREA)
  • Human Computer Interaction (AREA)
  • Physics & Mathematics (AREA)
  • General Physics & Mathematics (AREA)
  • Health & Medical Sciences (AREA)
  • Audiology, Speech & Language Pathology (AREA)
  • General Health & Medical Sciences (AREA)
  • User Interface Of Digital Computer (AREA)

Abstract

The present invention provides a kind of processing method and processing device of speech data.The embodiment of the present invention is by obtaining beginning operating gesture of the user to terminal, if the operating gesture that starts meets the specified beginning gesture pre-set, make it possible to open speech voice input function, to gather the speech data of the user, due to performing voice service using the beginning operating gesture triggering specified, so that without in the functionality controls for specifying the specified location at interface to be provided for inputting speech data, it can avoid in the prior art due to being arranged on caused by the specified location at specified interface and inflexible technical problem cumbersome when user needs input speech data for inputting the functionality controls of speech data, so as to improve efficiency and the flexibility of language data process.

Description

The processing method and processing device of speech data
【Technical field】
The present invention relates to the communication technology, more particularly to a kind of processing method and processing device of speech data.
【Background technology】
With the development of the communication technology, terminal is integrated with increasing function, so that terminal is More and more corresponding applications (Application, APP) are contained in system feature list.In some applications Voice service can be related to, for example, the speech voice input function in wechat application, the language in Baidu search application Sound assistant, etc..In voice service, it can be used to input specifying the specified location at interface to provide one The functionality controls of speech data.When user operates this functionality controls using input equipment, then it can open Begin collection speech data.
However, the specified location due to being arranged on specified interface for inputting the functionality controls of speech data, Therefore, when user needs input speech data, it is necessary to represent specified interface according to the operation of user, and Functionality controls and the operation of specified location are found on interface is specified by user, user could be gathered and carried The speech data of confession, it is cumbersome and dumb, so as to result in efficiency and the spirit of language data process The reduction of activity.
【The content of the invention】
The many aspects of the present invention provide a kind of processing method and processing device of speech data, to improve voice The efficiency of data processing and flexibility.
An aspect of of the present present invention there is provided a kind of processing method of speech data, including:
Obtain beginning operating gesture of the user to terminal;
If the operating gesture that starts meets the specified beginning gesture pre-set, speech voice input function is opened, To gather the speech data of the user.
Aspect as described above and any possible implementation, it is further provided a kind of implementation, institute The beginning operating gesture for obtaining user to terminal is stated, including:
Based on the specified interface pre-set, beginning operating gesture of the detection user to terminal.
Aspect as described above and any possible implementation, it is further provided a kind of implementation, institute At least one of state beginning operating gesture of the user to terminal, including in following operating gesture:
Operation of the user to the button of the terminal;
Hanging slip of the user above the terminal;
Contact slide of the user on specific interface;And
User drives the motion of the terminal.
Aspect as described above and any possible implementation, it is further provided a kind of implementation, institute Contact slide of the user on specific interface is stated, including:
User operates in the long-press of specific interface blank region.
Aspect as described above and any possible implementation, it is further provided a kind of implementation, institute If stating the beginning operating gesture meets the specified beginning gesture pre-set, speech voice input function is opened, To gather the speech data of the user, including:
If the operating gesture that starts meets the specified beginning gesture pre-set, voice number has been detected whether According to input, untill voice stopping input instruction being received;
If having detected speech data input, the speech data is handled.
Aspect as described above and any possible implementation, it is further provided a kind of implementation, institute State after having detected whether that speech data is inputted, also include:
Obtain end operation gesture of the user to the terminal;
If the end operation gesture meets the specified end gesture pre-set, receive the voice and stop Input instruction.
Another aspect of the present invention there is provided a kind of processing unit of speech data, including:
Acquiring unit, for obtaining beginning operating gesture of the user to terminal;
Voice unit, if meeting the specified beginning gesture pre-set for the beginning operating gesture, is opened Speech voice input function is opened, to gather the speech data of the user.
Aspect as described above and any possible implementation, it is further provided a kind of implementation, institute Acquiring unit is stated, specifically for
Based on the specified interface pre-set, beginning operating gesture of the detection user to terminal.
Aspect as described above and any possible implementation, it is further provided a kind of implementation, institute At least one of state beginning operating gesture of the user to terminal, including in following operating gesture:
Operation of the user to the button of the terminal;
Hanging slip of the user above the terminal;
Contact slide of the user on specific interface;And
User drives the motion of the terminal.
Aspect as described above and any possible implementation, it is further provided a kind of implementation, institute Contact slide of the user on specific interface is stated, including:
User operates in the long-press of specific interface blank region.
Aspect as described above and any possible implementation, it is further provided a kind of implementation, institute Speech units, specifically for
If the operating gesture that starts meets the specified beginning gesture pre-set, voice number has been detected whether According to input, untill voice stopping input instruction being received;
If having detected speech data input, the speech data is handled.
Aspect as described above and any possible implementation, it is further provided a kind of implementation, institute Speech units, are additionally operable to
Obtain end operation gesture of the user to the terminal;
If the end operation gesture meets the specified end gesture pre-set, receive the voice and stop Input instruction.
As shown from the above technical solution, the embodiment of the present invention is by obtaining beginning manipulator of the user to terminal Gesture, if the operating gesture that starts meets the specified beginning gesture pre-set, enabling open voice Input function, to gather the speech data of the user, due to using the beginning operating gesture triggering specified Perform voice service so that without in the work(for specifying the specified location at interface to be provided for inputting speech data Can control, can avoid in the prior art due to for input the functionality controls of speech data be arranged on it is specified Caused by the specified location at interface when user needs input speech data cumbersome and inflexible skill Art problem, so as to improve efficiency and the flexibility of language data process.
In addition, using technical scheme provided by the present invention, due to being touched using the beginning operating gesture specified Hair performs voice service so that operating area is no longer limited by the functionality controls for inputting speech data Size and location, can effectively improve the reliability and efficiency of language data process.
【Brief description of the drawings】
Technical scheme in order to illustrate more clearly the embodiments of the present invention, below will be to embodiment or existing The accompanying drawing to be used needed for technology description is briefly described, it should be apparent that, in describing below Accompanying drawing is some embodiments of the present invention, for those of ordinary skill in the art, is not paying creation Property it is laborious on the premise of, other accompanying drawings can also be obtained according to these accompanying drawings.
The schematic flow sheet of the processing method for the speech data that Fig. 1 provides for one embodiment of the invention;
The structural representation of the processing unit for the speech data that Fig. 2 provides for another embodiment of the present invention.
【Embodiment】
To make the purpose, technical scheme and advantage of the embodiment of the present invention clearer, below in conjunction with this hair Accompanying drawing in bright embodiment, the technical scheme in the embodiment of the present invention is clearly and completely described, Obviously, described embodiment is a part of embodiment of the invention, rather than whole embodiments.It is based on Embodiment in the present invention, those of ordinary skill in the art are obtained under the premise of creative work is not made The whole other embodiments obtained, belong to the scope of protection of the invention.
It should be noted that terminal involved in the embodiment of the present application can include but is not limited to mobile phone, Personal digital assistant (Personal Digital Assistant, PDA), wireless handheld device, it is wireless on Net sheet, it is PC, portable computer, panel computer, MP3 player, MP4 players, wearable Equipment (for example, intelligent glasses, intelligent watch, Intelligent bracelet etc.) etc..
In addition, the terms "and/or", only a kind of incidence relation for describing affiliated partner, is represented There may be three kinds of relations, for example, A and/or B, can be represented:Individualism A, while there is A And B, individualism B these three situations.In addition, character "/" herein, typicallys represent forward-backward correlation pair As if a kind of relation of "or".
The schematic flow sheet of the processing method for the speech data that Fig. 1 provides for one embodiment of the invention, such as Fig. 1 It is shown.
101st, beginning operating gesture of the user to terminal is obtained.
If the 102, the operating gesture that starts meets the specified beginning gesture pre-set, phonetic entry is opened Function, to gather the speech data of the user.
It should be noted that 101~102 executive agent can be partly or entirely to be located locally terminal Application, or can also be the plug-in unit or SDK being arranged in the application of local terminal Functional units such as (Software Development Kit, SDK) is wrapped, or can also be positioned at grid Processing engine in the server of side, or can also be the distributed system positioned at grid side, the present embodiment To this without being particularly limited to.
It is understood that the application can be mounted in the local program (nativeApp) in terminal, Or can also be a web page program (webApp) of the browser in terminal, the present embodiment to this not It is particularly limited.
So, by obtaining beginning operating gesture of the user to terminal, if the beginning operating gesture is met The specified beginning gesture pre-set, enabling speech voice input function is opened, to gather the user's Speech data, due to performing voice service using the beginning operating gesture triggering specified so that without referring to The specified location in demarcation face is provided for inputting the functionality controls of speech data, can avoid in the prior art Due to for input the functionality controls of speech data be arranged on caused by the specified location at specified interface with Cumbersome and inflexible technical problem when family needs input speech data, so as to improve speech data The efficiency of processing and flexibility.
Alternatively, in a possible implementation of the present embodiment, in 101, it can specifically examine Survey beginning operating gesture of the user to terminal.
Specifically, the user can include but is not limited to following operation to the beginning operating gesture of terminal At least one of in gesture:
Operation of the user to the button of the terminal;
Hanging slip of the user above the terminal;
Contact slide of the user on specific interface;And
User drives the motion of the terminal.
Wherein,
Operation of the user to the button of the terminal, can refer to user operation terminal some button by Key is identified, either the marking keys of some button of user's operation terminal and operation direction or user behaviour Make the marking keys and operation order of multiple buttons of terminal, or user operates multiple buttons of terminal Operation trace of marking keys, operation order and each button, etc., the present embodiment to this without It is particularly limited to.
Hanging slip of the user above the terminal, can refer to imaging sensor of the user in terminal Within acquisition range, the hanging sliding trace above terminal.Wherein, described image sensor can be Charge coupled cell (Charge Coupled Device, CCD) sensor, or can also be metal Oxide-semiconductor devices (Complementary Metal-Oxide Semiconductor, CMOS) Sensor, the present embodiment is to this without being particularly limited to.The hanging sliding trace can include but not limit In the straight line or the song of arbitrary shape that are made up of several corresponding dwell points of several continuously slipping events Line.
Contact slide of the user on specific interface, can refer to shown by display device of the user in terminal Specific interface on contact slide track.Generally, terminal be able to can be touched according to whether display device has The characteristic of control, is divided into two types, and a type is touch terminal, and another type is non-touch Terminal.Specifically, the specific interface shown by touch screen of the user in touch terminal can specifically be detected On contact slide data.The contact slide track can include but is not limited to by several continuous touches The straight line of corresponding several touch points composition of event or the curve of arbitrary shape.Specifically, specifically may be used Think that user operates in the long-press of specific interface blank region.For example, instant messaging class APP dialogue Interface.
User drives the motion of the terminal, can refer to user's handheld terminal, drives terminal to be transported Dynamic movement locus, for example, rocking, overturning.
In a concrete implementation mode, sensor device can be specifically utilized, user is to terminal for detection Beginning operating gesture.Specifically, the sensor device can include but is not limited to gravity sensor, In acceleration transducer, pressure sensor, infrared ray sensor, range sensor and imaging sensor At least one, the present embodiment is to this without being particularly limited to.
Wherein, the range sensor can be ultrasonic distance sensor, or can also for it is infrared away from From sensor, it can also be either laser distance sensor or can also be microwave range sensor, The present embodiment is to this without being particularly limited to.These range sensors are all existing mature technologies, in detail Description may refer to related content of the prior art, and here is omitted.
Wherein, described image sensor can for charge coupled cell (Charge Coupled Device, CCD) sensor, or can also be metal oxide semiconductor device (Complementary Metal-Oxide Semiconductor, CMOS) sensor, the present embodiment is to this without especially limit It is fixed.
Specifically, detection user can specifically refer to detection user to end the beginning operating gesture of terminal Starting point, end point and the track formed by starting point to end point of the beginning operating gesture at end, Or can also further detect the radian data corresponding to the track.
Alternatively,, specifically can be with base in 101 in a possible implementation of the present embodiment In the specified interface pre-set, beginning operating gesture of the detection user to terminal.
During a concrete implementation, the specified interface can be the desktop of the operating system of terminal. Wherein, the operating system can include but is not limited to the ios operating systems of apple, the operation of the Android of Google System or the Windows operating system or other terminal operating systems of Microsoft.
The desktop of so-called operating system, refers to the desktop that the operating system that terminal is run is provided, is The main entrance that user interacts with terminal, is also the graphic user interface of man-machine interaction.Operating system Desktop could be arranged to including but not limited to any operation object.For example, the icon of application program is such as, A figure in phone, information, memorandum, photo, microblogging, wechat, mobile phone house keeper and various game Mark or its any icon combination etc., or, for another example the icon that the icon of systemic-function such as system is set Or System menu etc..
During another concrete implementation, the specified interface can be the specified any page applied. Wherein, the specified application can include but is not limited in terminal any APP or pre-set at least One application.For example, instant messaging class APP, searching class APP etc..
During another concrete implementation, the specified interface can be the specified specified interface applied. Wherein, the specified application can include but is not limited in terminal any APP or pre-set at least One application.For example, instant messaging class APP, searching class APP etc..The specified interface can include But it is not limited to specify at least one page pre-set of application.For example, instant messaging class APP pair Talk about interface etc..
In the present embodiment, in order to shorten the time using input speech data, user, which can use, starts behaviour Make a sign with the hand, triggered, without, when user needs input speech data, being needed as prior art To represent specified interface according to the operation of user, and specified location is found on interface is specified by user Functionality controls are simultaneously operated, and could gather the speech data that user is provided.In such manner, it is possible to so that terminal not Be laid out again by the page, and the other application being currently currently running limitation, language can be effectively improved The efficiency of sound data processing and flexibility.
In order to realize above-mentioned functions, alternatively, in a possible implementation of the present embodiment, Before 102, it can also further pre-set several and specify beginning gesture.Only as acquired user During the specified beginning gesture that the beginning operating gesture satisfaction to terminal is pre-set, follow-up operation is just performed.
Wherein, the specified data for starting gesture can be stored in the storage device of terminal.
During a concrete implementation, the storage device of the terminal can have with slow storage device Body can be the hard disk of computer system, or can also be physical memory for the inoperative internal memory of mobile phone, For example, read-only storage (Read-Only Memory, ROM) and RAM card etc., the present embodiment is to this Without being particularly limited to.
During another concrete implementation, the storage device of the terminal can also set for quick storage It is standby, it is specifically as follows the internal memory of computer system, or can also be in system for the running memory of mobile phone Deposit, for example, random access memory (Random Access Memory, RAM) etc., the present embodiment pair This is without being particularly limited to.
If for example, acquired beginning operating gesture is operation of the user to the button of the terminal, in advance The specified beginning gesture first set can be then the predetermined registration operation data of one group of button.
Or, if for another example acquired beginning operating gesture is that user is hanging above the terminal Slide, then the specified beginning gesture pre-set can be then the track data of a desired guiding trajectory, for example, Track data of the track data of the straight-line pattern of all directions, " Z " pattern or " L " pattern etc..
Or, if for another example acquired beginning operating gesture is contact cunning of the user on specific interface Dynamic, then the specified beginning gesture pre-set can be then the track data of a desired guiding trajectory, for example, The track data of long-press, the track data etc. for sliding to assigned direction certain distance.
Or, if for another example acquired beginning operating gesture is the motion that user drives the terminal, The specified beginning gesture then pre-set can be then the event data of a predeterminable event, for example, rocking Event.
Alternatively, in a possible implementation of the present embodiment, in 102, if described start Operating gesture meets the specified beginning gesture pre-set, and explanation can open speech voice input function.Opening Open after speech voice input function, microphone prompting icon can be exported in current interface, to point out user Talk, and output content of text, to point out the operating gesture for cancelling current audio data input.
Now, then speech data input can have been detected whether, until reception voice stopping input instruction is Only.If having detected speech data input, the speech data is handled.
In such manner, it is possible to during whole session is voice service, detect whether that speech data is defeated all the time Enter, client effectively reduces instruction and handed over without obtaining beginning operating gesture of the user to terminal repeatedly Mutually processing, so as to further increase the efficiency of phonetic entry.
In the implementation, any voice processing technology of the prior art can be used, to speech data Handled, detailed description may refer to related content of the prior art, and here is omitted.
, can be with while speech data input has been detected whether during a concrete implementation End operation gesture of the user to the terminal is further obtained, if the end operation gesture meets advance The specified end gesture set, explanation can terminate phonetic entry, then can receive the voice stopping defeated Enter instruction.
The end operation gesture, can be and the corresponding gesture corresponding to the beginning operating gesture, tool Body, the user can also include but is not limited to following operating gesture to the end operation gesture of terminal At least one of in:
Operation of the user to the button of the terminal;
Hanging slip of the user above the terminal;
Contact slide of the user on specific interface;And
User drives the motion of the terminal.
It is specifically described can also be referring to the specific descriptions for starting operating gesture, and here is omitted.
Difference of the present invention from prior art is essentially consisted in, by carrying out the setting of function hot-zone to terminal, To terminal increase hot-zone operation, for example, clicking on or specific interface blank region increase hot-zone to terminal Increase the focus incidents such as shake event operation, etc., in input process, simplify the operation of phonetic entry Step, the convenient and swift property of increase application phonetic entry.Phonetic search function, society to searching class application The service efficiency of the voice-enabled chat function of class application etc. is handed over, can be substantially improved.
In the present embodiment, by obtaining beginning operating gesture of the user to terminal, if the beginning manipulator Gesture meets the specified beginning gesture pre-set, enabling open speech voice input function, described to gather The speech data of user, due to performing voice service using the beginning operating gesture triggering specified so that nothing It need to can be avoided existing in the functionality controls for specifying the specified location at interface to be provided for inputting speech data Due to being arranged on the specified location at specified interface for inputting the functionality controls of speech data and cause in technology The cumbersome and inflexible technical problem when user needs input speech data, so as to improve language The efficiency of sound data processing and flexibility.
In addition, using technical scheme provided by the present invention, due to being touched using the beginning operating gesture specified Hair performs voice service so that operating area is no longer limited by the functionality controls for inputting speech data Size and location, can effectively improve the reliability and efficiency of language data process.
It should be noted that for foregoing each method embodiment, in order to be briefly described, therefore by its all table State as a series of combination of actions, but those skilled in the art should know, the present invention is not by being retouched The limitation for the sequence of movement stated, because according to the present invention, some steps can be using other orders or same Shi Jinhang.Secondly, those skilled in the art should also know, embodiment described in this description belongs to In preferred embodiment, involved action and the module not necessarily present invention are necessary.
In the above-described embodiments, the description to each embodiment all emphasizes particularly on different fields, and does not have in some embodiment The part of detailed description, may refer to the associated description of other embodiment.
The structural representation of the processing unit for the speech data that Fig. 2 provides for another embodiment of the present invention, such as Shown in Fig. 2.The processing unit of the speech data of the present embodiment can include acquiring unit 21 and voice unit 22.Wherein, acquiring unit 21, for obtaining beginning operating gesture of the user to terminal;Voice unit 22, If meeting the specified beginning gesture pre-set for the beginning operating gesture, speech voice input function is opened, To gather the speech data of the user.
It should be noted that the processing unit of the speech data of the present embodiment can be partly or entirely position Application in local terminal, or can also be the plug-in unit being arranged in the application of local terminal or soft The functional units such as part development kit (Software Development Kit, SDK), or can be with For the processing engine in the server of grid side, or can also be the distributed system positioned at grid side, The present embodiment is to this without being particularly limited to.
It is understood that the application can be mounted in the local program (nativeApp) in terminal, Or can also be a web page program (webApp) of the browser in terminal, the present embodiment to this not It is particularly limited.
Alternatively, in a possible implementation of the present embodiment, the acquiring unit 21, specifically It can be used for based on the specified interface pre-set, beginning operating gesture of the detection user to terminal.
Alternatively, in a possible implementation of the present embodiment, beginning of the user to terminal Operating gesture, can include but is not limited at least one in following operating gesture:
Operation of the user to the button of the terminal;
Hanging slip of the user above the terminal;
Contact slide of the user on specific interface;And
User drives the motion of the terminal.
Wherein, contact slide of the user on specific interface can be user in specific interface overhead The long-press operation of white region.
Alternatively, in a possible implementation of the present embodiment, institute's speech units 22, specifically If can be used for the beginning operating gesture meets the specified beginning gesture pre-set, language has been detected whether Sound data input, untill voice stopping input instruction being received;If having detected speech data input, The speech data can then be handled.
In the implementation, institute's speech units 22 can also be further used for obtaining user to described The end operation gesture of terminal;If the end operation gesture meets the specified end gesture pre-set, The voice can then be received and stop input instruction.
It should be noted that method in the corresponding embodiments of Fig. 1, the voice that can be provided by the present embodiment The processing unit of data is realized.The related content that may refer in the corresponding embodiments of Fig. 1 is described in detail, Here is omitted.
In the present embodiment, user is obtained to the beginning operating gesture of terminal, voice unit by acquiring unit If the operating gesture that starts meets the specified beginning gesture pre-set, enabling open phonetic entry Function, to gather the speech data of the user, is performed due to being triggered using the beginning operating gesture specified Voice service so that without in the function control for specifying the specified location at interface to be provided for inputting speech data Part, can be avoided in the prior art due to being arranged on specified interface for inputting the functionality controls of speech data Specified location caused by when user needs input speech data cumbersome and inflexible technology ask Topic, so as to improve efficiency and the flexibility of language data process.
In addition, using technical scheme provided by the present invention, due to being touched using the beginning operating gesture specified Hair performs voice service so that operating area is no longer limited by the functionality controls for inputting speech data Size and location, can effectively improve the reliability and efficiency of language data process.
In several embodiments provided by the present invention, it should be understood that disclosed system, device and Method, can be realized by another way.For example, device embodiment described above is only to show Meaning property, for example, the division of the unit, only a kind of division of logic function can when actually realizing To there is other dividing mode, such as multiple units or component can combine or be desirably integrated into another System, or some features can be ignored, or not perform.It is another, it is shown or discussed each other Coupling or direct-coupling or communication connection can be the INDIRECT COUPLING of device or unit by some interfaces Or communication connection, can be electrical, machinery or other forms.
The unit illustrated as separating component can be or may not be it is physically separate, make It can be for the part that unit is shown or may not be physical location, you can with positioned at a place, Or can also be distributed on multiple NEs.Can select according to the actual needs part therein or Person's whole units realize the purpose of this embodiment scheme.
In addition, each functional unit in each embodiment of the invention can be integrated in a processing unit, Can also be that unit is individually physically present, can also two or more units be integrated in a list In member.Above-mentioned integrated unit can both be realized in the form of hardware, it would however also be possible to employ hardware adds software The form of functional unit is realized.
The above-mentioned integrated unit realized in the form of SFU software functional unit, can be stored in a computer In read/write memory medium.Above-mentioned SFU software functional unit is stored in a storage medium, including some fingers Order is make it that a computer installation (can be personal computer, audio frequency process engine, or network Device etc.) or processor (processor) perform the part steps of each of the invention embodiment methods described. And foregoing storage medium includes:USB flash disk, mobile hard disk, read-only storage (Read-Only Memory, ROM), random access memory (Random Access Memory, RAM), magnetic disc or light Disk etc. is various can be with the medium of store program codes.
Finally it should be noted that:The above embodiments are merely illustrative of the technical solutions of the present invention, rather than to it Limitation;Although the present invention is described in detail with reference to the foregoing embodiments, the ordinary skill of this area Personnel should be understood:It can still modify to the technical scheme described in foregoing embodiments, or Person carries out equivalent to which part technical characteristic;And these modifications or replacement, do not make corresponding skill The essence of art scheme departs from the spirit and scope of various embodiments of the present invention technical scheme.

Claims (12)

1. a kind of processing method of speech data, it is characterised in that including:
Obtain beginning operating gesture of the user to terminal;
If the operating gesture that starts meets the specified beginning gesture pre-set, speech voice input function is opened, To gather the speech data of the user.
2. according to the method described in claim 1, it is characterised in that the acquisition user is opened terminal Beginning operating gesture, including:
Based on the specified interface pre-set, beginning operating gesture of the detection user to terminal.
3. according to the method described in claim 1, it is characterised in that the user starts behaviour to terminal At least one of make a sign with the hand, including in following operating gesture:
Operation of the user to the button of the terminal;
Hanging slip of the user above the terminal;
Contact slide of the user on specific interface;And
User drives the motion of the terminal.
4. method according to claim 3, it is characterised in that the user is on specific interface Contact slide, including:
User operates in the long-press of specific interface blank region.
5. the method according to Claims 1 to 4 any claim, it is characterised in that if the institute State and start the specified beginning gesture that operating gesture satisfaction is pre-set, open speech voice input function, to gather The speech data of the user, including:
If the operating gesture that starts meets the specified beginning gesture pre-set, voice number has been detected whether According to input, untill voice stopping input instruction being received;
If having detected speech data input, the speech data is handled.
6. method according to claim 5, it is characterised in that described to have detected whether speech data After input, also include:
Obtain end operation gesture of the user to the terminal;
If the end operation gesture meets the specified end gesture pre-set, receive the voice and stop Input instruction.
7. a kind of processing unit of speech data, it is characterised in that including:
Acquiring unit, for obtaining beginning operating gesture of the user to terminal;
Voice unit, if meeting the specified beginning gesture pre-set for the beginning operating gesture, is opened Speech voice input function is opened, to gather the speech data of the user.
8. device according to claim 7, it is characterised in that the acquiring unit, specifically for
Based on the specified interface pre-set, beginning operating gesture of the detection user to terminal.
9. device according to claim 7, it is characterised in that the user starts behaviour to terminal At least one of make a sign with the hand, including in following operating gesture:
Operation of the user to the button of the terminal;
Hanging slip of the user above the terminal;
Contact slide of the user on specific interface;And
User drives the motion of the terminal.
10. device according to claim 9, it is characterised in that the user is on specific interface Contact slide, including:
User operates in the long-press of specific interface blank region.
11. the device according to claim 7~10 any claim, it is characterised in that institute's predicate Sound unit, specifically for
If the operating gesture that starts meets the specified beginning gesture pre-set, voice number has been detected whether According to input, untill voice stopping input instruction being received;
If having detected speech data input, the speech data is handled.
12. device according to claim 11, it is characterised in that institute's speech units, is additionally operable to
Obtain end operation gesture of the user to the terminal;
If the end operation gesture meets the specified end gesture pre-set, receive the voice and stop Input instruction.
CN201610019033.XA 2016-01-12 2016-01-12 The processing method and processing device of speech data Pending CN106959746A (en)

Priority Applications (1)

Application Number Priority Date Filing Date Title
CN201610019033.XA CN106959746A (en) 2016-01-12 2016-01-12 The processing method and processing device of speech data

Applications Claiming Priority (1)

Application Number Priority Date Filing Date Title
CN201610019033.XA CN106959746A (en) 2016-01-12 2016-01-12 The processing method and processing device of speech data

Publications (1)

Publication Number Publication Date
CN106959746A true CN106959746A (en) 2017-07-18

Family

ID=59480855

Family Applications (1)

Application Number Title Priority Date Filing Date
CN201610019033.XA Pending CN106959746A (en) 2016-01-12 2016-01-12 The processing method and processing device of speech data

Country Status (1)

Country Link
CN (1) CN106959746A (en)

Cited By (6)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN107390881A (en) * 2017-09-14 2017-11-24 西安领讯卓越信息技术有限公司 A kind of gestural control method
CN107592416A (en) * 2017-08-31 2018-01-16 努比亚技术有限公司 Method for sending voice message, terminal and computer-readable recording medium
CN107864289A (en) * 2017-11-17 2018-03-30 珠海市魅族科技有限公司 A kind of pronunciation inputting method and device, terminal, readable storage medium storing program for executing
CN108965584A (en) * 2018-06-21 2018-12-07 北京百度网讯科技有限公司 A kind of processing method of voice messaging, device, terminal and storage medium
CN109120793A (en) * 2018-09-07 2019-01-01 无线生活(杭州)信息科技有限公司 Method of speech processing and device
CN109979442A (en) * 2017-12-27 2019-07-05 珠海市君天电子科技有限公司 A kind of sound control method, device and electronic equipment

Citations (6)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN103197911A (en) * 2013-04-12 2013-07-10 广东国笔科技股份有限公司 Method, system and device for providing speech input
CN103488401A (en) * 2013-09-30 2014-01-01 乐视致新电子科技(天津)有限公司 Voice assistant activating method and device
CN103488384A (en) * 2013-09-30 2014-01-01 乐视致新电子科技(天津)有限公司 Voice assistant application interface display method and device
CN104111728A (en) * 2014-06-26 2014-10-22 联想(北京)有限公司 Electronic device and voice command input method based on operation gestures
CN204679955U (en) * 2015-05-22 2015-09-30 广东好帮手电子科技股份有限公司 A kind of device by the voice activated control module of gesture identification
CN104978014A (en) * 2014-04-11 2015-10-14 维沃移动通信有限公司 Method for quickly calling application program or system function, and mobile terminal thereof

Patent Citations (6)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN103197911A (en) * 2013-04-12 2013-07-10 广东国笔科技股份有限公司 Method, system and device for providing speech input
CN103488401A (en) * 2013-09-30 2014-01-01 乐视致新电子科技(天津)有限公司 Voice assistant activating method and device
CN103488384A (en) * 2013-09-30 2014-01-01 乐视致新电子科技(天津)有限公司 Voice assistant application interface display method and device
CN104978014A (en) * 2014-04-11 2015-10-14 维沃移动通信有限公司 Method for quickly calling application program or system function, and mobile terminal thereof
CN104111728A (en) * 2014-06-26 2014-10-22 联想(北京)有限公司 Electronic device and voice command input method based on operation gestures
CN204679955U (en) * 2015-05-22 2015-09-30 广东好帮手电子科技股份有限公司 A kind of device by the voice activated control module of gesture identification

Cited By (7)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN107592416A (en) * 2017-08-31 2018-01-16 努比亚技术有限公司 Method for sending voice message, terminal and computer-readable recording medium
CN107592416B (en) * 2017-08-31 2020-11-17 努比亚技术有限公司 Voice message transmitting method, terminal and computer readable storage medium
CN107390881A (en) * 2017-09-14 2017-11-24 西安领讯卓越信息技术有限公司 A kind of gestural control method
CN107864289A (en) * 2017-11-17 2018-03-30 珠海市魅族科技有限公司 A kind of pronunciation inputting method and device, terminal, readable storage medium storing program for executing
CN109979442A (en) * 2017-12-27 2019-07-05 珠海市君天电子科技有限公司 A kind of sound control method, device and electronic equipment
CN108965584A (en) * 2018-06-21 2018-12-07 北京百度网讯科技有限公司 A kind of processing method of voice messaging, device, terminal and storage medium
CN109120793A (en) * 2018-09-07 2019-01-01 无线生活(杭州)信息科技有限公司 Method of speech processing and device

Similar Documents

Publication Publication Date Title
EP3951576B1 (en) Content sharing method and electronic device
JP6130926B2 (en) Gesture conversation processing method, apparatus, terminal device, program, and recording medium
JP6997734B2 (en) Handwritten keyboard for screen
CN110046238B (en) Dialogue interaction method, graphic user interface, terminal equipment and network equipment
CN106959746A (en) The processing method and processing device of speech data
CN104571852B (en) The moving method and device of icon
EP2680257B1 (en) Mobile terminal and method for recognizing voice thereof
CN108701001A (en) Show the method and electronic equipment of graphic user interface
CN107077295A (en) A kind of method, device, electronic equipment, display interface and the storage medium of quick split screen
CN107896279A (en) Screenshotss processing method, device and the mobile terminal of a kind of mobile terminal
EP2731028A2 (en) Mobile terminal and control method thereof
US20150277748A1 (en) Edit providing method according to multi-touch-based text block setting
CN107967055A (en) A kind of man-machine interaction method, terminal and computer-readable medium
CN106796789A (en) Interacted with the speech that cooperates with of speech reference point
CN104076916A (en) Information processing method and electronic device
CN103870133A (en) Method and apparatus for scrolling screen of display device
CN107870705B (en) Method and device for changing icon position of application menu
CN104765525A (en) Operation interface switching method and device
JP6612351B2 (en) Device, method and graphic user interface used to move application interface elements
KR20160016526A (en) Method for Providing Information and Device thereof
CN105373318B (en) Information display method and device
CN104750375A (en) Interface display method and device
CN106197394A (en) Air navigation aid and device
KR101880310B1 (en) Terminal having chatting information display function in the chatting thread and control method thereof
CN108028869A (en) The method of terminal device and processing incoming call

Legal Events

Date Code Title Description
PB01 Publication
PB01 Publication
SE01 Entry into force of request for substantive examination
SE01 Entry into force of request for substantive examination
RJ01 Rejection of invention patent application after publication
RJ01 Rejection of invention patent application after publication

Application publication date: 20170718