CN108520750A - Voice input control method, device, and computer-readable storage medium
- Publication number
- CN108520750A CN201810202888.5A
- Authority
- CN
- China
- Prior art keywords
- input
- voice
- demand
- word
- information
- Legal status (The legal status is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the status listed.)
- Pending
Classifications
-
- G—PHYSICS
- G10—MUSICAL INSTRUMENTS; ACOUSTICS
- G10L—SPEECH ANALYSIS TECHNIQUES OR SPEECH SYNTHESIS; SPEECH RECOGNITION; SPEECH OR VOICE PROCESSING TECHNIQUES; SPEECH OR AUDIO CODING OR DECODING
- G10L15/00—Speech recognition
- G10L15/26—Speech to text systems
-
- G—PHYSICS
- G06—COMPUTING; CALCULATING OR COUNTING
- G06F—ELECTRIC DIGITAL DATA PROCESSING
- G06F3/00—Input arrangements for transferring data to be processed into a form capable of being handled by the computer; Output arrangements for transferring data from processing unit to output unit, e.g. interface arrangements
- G06F3/16—Sound input; Sound output
-
- G—PHYSICS
- G10—MUSICAL INSTRUMENTS; ACOUSTICS
- G10L—SPEECH ANALYSIS TECHNIQUES OR SPEECH SYNTHESIS; SPEECH RECOGNITION; SPEECH OR VOICE PROCESSING TECHNIQUES; SPEECH OR AUDIO CODING OR DECODING
- G10L15/00—Speech recognition
- G10L15/22—Procedures used during a speech recognition process, e.g. man-machine dialogue
Landscapes
- Engineering & Computer Science (AREA)
- Physics & Mathematics (AREA)
- Audiology, Speech & Language Pathology (AREA)
- Health & Medical Sciences (AREA)
- Human Computer Interaction (AREA)
- Computational Linguistics (AREA)
- Theoretical Computer Science (AREA)
- Acoustics & Sound (AREA)
- Multimedia (AREA)
- General Engineering & Computer Science (AREA)
- General Physics & Mathematics (AREA)
- General Health & Medical Sciences (AREA)
- Telephone Function (AREA)
Abstract
The invention discloses a voice input control method, a device, and a computer-readable storage medium. The method includes: triggering a voice input operation in the current interactive interface according to a voice input instruction; then obtaining voice information through the voice input operation; next, identifying the input demand of the interactive interface, where the input demand includes a voice input demand and a text input demand; and finally, sending the voice information according to the voice input demand, or converting the voice information into text information according to the text input demand and sending it. This provides a user-friendly voice input control scheme: the user can start voice input quickly, the captured input is adaptively sent as voice information or as text information, and the user no longer has to switch manually between text input and voice input, which considerably improves the integrity and adaptability of voice input.
Description
Technical field
The present invention relates to the field of mobile communication, and in particular to a voice input control method, a device, and a computer-readable storage medium.
Background art
In the prior art, as terminal devices become more intelligent, their functions grow richer and users frequently use them to process information. In particular, users generally enter data by text input or by voice input. In some scenarios, however, frequently switching between the text input mode and the voice input mode inconveniences the user: first, it makes the operating steps cumbersome; second, it reduces the efficiency of data input. The resulting user experience is poor.
Summary of the invention
In the prior art, when a user enters data by text input or voice input, frequently switching between the text input mode and the voice input mode inconveniences the user: it makes the operating steps cumbersome, reduces the efficiency of data input, and degrades the user experience. To remedy this technical deficiency, the present invention proposes a voice input control method, which includes:
triggering a voice input operation in the current interactive interface by a voice input instruction;
obtaining voice information through the voice input operation;
identifying the input demand of the interactive interface, where the input demand includes a voice input demand and a text input demand;
sending the voice information according to the voice input demand, or converting the voice information into text information according to the text input demand and sending it.
Optionally, triggering the voice input operation in the current interactive interface by the voice input instruction includes:
displaying a dialog area and an input area in the interactive interface;
activating the input area by the voice input instruction, and displaying status information of the voice input in the input area.
Optionally, obtaining voice information through the voice input operation includes:
obtaining the voice information and updating the status information in real time;
caching the obtained voice information.
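The optional acquisition step above (obtain, update the status in real time, cache) can be illustrated with a minimal Python sketch. The class name `VoiceCapture`, the 16 kHz default sampling rate, and the wording of the status string are assumptions made for illustration; the patent does not specify them.

```python
from dataclasses import dataclass, field


@dataclass
class VoiceCapture:
    """Hypothetical holder for one voice-input session (not from the patent)."""
    sample_rate: int = 16000                    # assumed samples per second
    cache: list = field(default_factory=list)   # cached audio chunks
    status: str = "idle"                        # text shown in the input area

    def on_chunk(self, samples: list) -> None:
        """Cache a newly acquired chunk and refresh the displayed status."""
        self.cache.append(list(samples))
        total = sum(len(chunk) for chunk in self.cache)
        self.status = f"recording {total / self.sample_rate:.1f}s"

    def cached_audio(self) -> list:
        """Return all cached samples as one flat list."""
        return [s for chunk in self.cache for s in chunk]
```

Each call to `on_chunk` both extends the cache and recomputes the status, so the displayed state never lags behind the cached audio.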
Optionally, identifying the input demand of the interactive interface, where the input demand includes a voice input demand and a text input demand, includes:
detecting the dialog area and the input area;
judging the input demand from the dialog area and the input area, where the input demand includes a voice input demand and a text input demand;
if the last message in the dialog area is voice information, or the input area is in the voice input state, determining that the input demand is a voice input demand; if the last message in the dialog area is text information, or the input area is in the text input state, determining that the input demand is a text input demand.
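The rule above can be sketched in Python as follows. The text does not say what happens when the two hints disagree (for example, the last message is voice while the input area is in the text input state), so this sketch assumes that the input-area state takes precedence; that tie-break, and the names `Demand` and `identify_input_demand`, are illustrative assumptions.

```python
from enum import Enum
from typing import Optional


class Demand(Enum):
    VOICE = "voice"
    TEXT = "text"


def identify_input_demand(last_message_kind: Optional[str],
                          input_area_state: Optional[str]) -> Optional[Demand]:
    """Return the input demand, or None if neither area gives a hint.

    Both arguments are "voice", "text", or None (unknown).
    Assumption (not in the patent): on conflict, the input-area state wins.
    """
    for hint in (input_area_state, last_message_kind):
        if hint == "voice":
            return Demand.VOICE
        if hint == "text":
            return Demand.TEXT
    return None
```

Checking the input area first is only one possible ordering; an implementation could equally prefer the last message in the dialog area.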
Optionally, sending the voice information according to the voice input demand, or converting the voice information into text information according to the text input demand and sending it, includes:
recording input demand update information for the input area;
determining the next input demand according to the input demand update information, and sending the voice information or the text information according to that input demand.
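One way to read the recording step above is as a small history of demand updates that predicts the next demand. The sketch below assumes the simplest possible policy, namely that the most recent recorded update determines the next demand, and falls back to a default when nothing has been recorded. The class name `DemandHistory`, the default value, and the policy itself are assumptions, not details given in the patent.

```python
class DemandHistory:
    """Hypothetical record of input-demand updates for the input area."""

    def __init__(self, default: str = "voice"):
        self.default = default   # assumed fallback when nothing is recorded
        self.updates = []        # chronological list of "voice" / "text"

    def record(self, demand: str) -> None:
        """Store one input-demand update."""
        if demand not in ("voice", "text"):
            raise ValueError(f"unknown demand: {demand!r}")
        self.updates.append(demand)

    def next_demand(self) -> str:
        """Assumption: the most recent update predicts the next demand."""
        return self.updates[-1] if self.updates else self.default
```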
The invention also provides a voice input control device. The device includes a memory, a processor, and a computer program that is stored in the memory and can run on the processor. When executed by the processor, the computer program implements:
triggering a voice input operation in the current interactive interface by a voice input instruction;
obtaining voice information through the voice input operation;
identifying the input demand of the interactive interface, where the input demand includes a voice input demand and a text input demand;
sending the voice information according to the voice input demand, or converting the voice information into text information according to the text input demand and sending it.
Optionally, when executed by the processor, the computer program also implements:
displaying a dialog area and an input area in the interactive interface;
activating the input area by the voice input instruction, and displaying status information of the voice input in the input area.
Optionally, when executed by the processor, the computer program also implements:
obtaining the voice information and updating the status information in real time;
caching the obtained voice information.
Optionally, when executed by the processor, the computer program also implements:
detecting the dialog area and the input area;
judging the input demand from the dialog area and the input area, where the input demand includes a voice input demand and a text input demand;
if the last message in the dialog area is voice information, or the input area is in the voice input state, determining that the input demand is a voice input demand; if the last message in the dialog area is text information, or the input area is in the text input state, determining that the input demand is a text input demand;
recording input demand update information for the input area;
determining the next input demand according to the input demand update information, and sending the voice information or the text information according to that input demand.
The invention also provides a computer-readable storage medium. A voice input control program is stored on the computer-readable storage medium, and when executed by a processor, the voice input control program implements the steps of the voice input control method described in any one of the above.
With the voice input control method, device, and computer-readable storage medium of the present invention, a voice input operation in the current interactive interface is triggered by a voice input instruction; then voice information is obtained through the voice input operation; next, the input demand of the interactive interface is identified, where the input demand includes a voice input demand and a text input demand; and finally, the voice information is sent according to the voice input demand, or the voice information is converted into text information according to the text input demand and sent. This provides a user-friendly voice input control scheme: the user can start voice input quickly, the captured input is adaptively sent as voice information or as text information, and the user no longer has to switch manually between text input and voice input, which considerably improves the integrity and adaptability of voice input.
Brief description of the drawings
The present invention is further described below with reference to the accompanying drawings and embodiments. In the drawings:
Fig. 1 is a schematic diagram of the hardware structure of a mobile terminal according to the present invention;
Fig. 2 is an architecture diagram of a communication network system provided by an embodiment of the present invention;
Fig. 3 is a flowchart of the first embodiment of the voice input control method of the present invention;
Fig. 4 is a flowchart of the second embodiment of the voice input control method of the present invention;
Fig. 5 is a flowchart of the third embodiment of the voice input control method of the present invention;
Fig. 6 is a flowchart of the fourth embodiment of the voice input control method of the present invention;
Fig. 7 is a flowchart of the fifth embodiment of the voice input control method of the present invention.
Detailed description of the embodiments
It should be understood that the specific embodiments described herein merely illustrate the present invention and are not intended to limit it.
In the following description, suffixes such as "module", "component", or "unit" used to denote elements serve only to facilitate the description of the present invention and have no specific meaning in themselves. Therefore, "module", "component", and "unit" may be used interchangeably.
A terminal can be implemented in various forms. For example, the terminals described in the present invention may include mobile terminals such as mobile phones, tablet computers, laptop computers, palmtop computers, personal digital assistants (Personal Digital Assistant, PDA), portable media players (Portable Media Player, PMP), navigation devices, wearable devices, smart bracelets, and pedometers, as well as fixed terminals such as digital TVs and desktop computers.
The following description takes a mobile terminal as an example. Those skilled in the art will understand that, apart from elements used specifically for mobile purposes, the construction according to the embodiments of the present invention can also be applied to fixed terminals.
Referring to Fig. 1, which is a schematic diagram of the hardware structure of a mobile terminal for implementing the embodiments of the present invention, the mobile terminal 100 may include components such as an RF (Radio Frequency) unit 101, a WiFi module 102, an audio output unit 103, an A/V (audio/video) input unit 104, a sensor 105, a display unit 106, a user input unit 107, an interface unit 108, a memory 109, a processor 110, and a power supply 111. Those skilled in the art will understand that the mobile terminal structure shown in Fig. 1 does not limit the mobile terminal; the mobile terminal may include more or fewer components than shown, combine certain components, or use a different arrangement of components.
The components of the mobile terminal are described in detail below with reference to Fig. 1:
The radio frequency unit 101 can be used to receive and send signals while sending and receiving information or during a call. Specifically, it receives downlink information from a base station and passes it to the processor 110 for processing, and it sends uplink data to the base station. In general, the radio frequency unit 101 includes, but is not limited to, an antenna, at least one amplifier, a transceiver, a coupler, a low-noise amplifier, a duplexer, and so on. In addition, the radio frequency unit 101 can communicate with networks and other devices by wireless communication. The wireless communication can use any communication standard or protocol, including but not limited to GSM (Global System for Mobile Communications), GPRS (General Packet Radio Service), CDMA2000 (Code Division Multiple Access 2000), WCDMA (Wideband Code Division Multiple Access), TD-SCDMA (Time Division-Synchronous Code Division Multiple Access), FDD-LTE (Frequency Division Duplexing-Long Term Evolution), and TDD-LTE (Time Division Duplexing-Long Term Evolution).
WiFi is a short-range wireless transmission technology. Through the WiFi module 102, the mobile terminal can help the user send and receive e-mail, browse web pages, access streaming video, and so on; it provides the user with wireless broadband Internet access. Although Fig. 1 shows the WiFi module 102, it should be understood that the module is not an essential part of the mobile terminal and can be omitted as needed without changing the essence of the invention.
When the mobile terminal 100 is in a mode such as call signal reception mode, call mode, recording mode, speech recognition mode, or broadcast reception mode, the audio output unit 103 can convert audio data received by the radio frequency unit 101 or the WiFi module 102, or stored in the memory 109, into an audio signal and output it as sound. The audio output unit 103 can also provide audio output related to a specific function performed by the mobile terminal 100 (for example, a call signal reception sound or a message reception sound). The audio output unit 103 may include a loudspeaker, a buzzer, and so on.
The A/V input unit 104 is used to receive audio or video signals. The A/V input unit 104 may include a graphics processing unit (Graphics Processing Unit, GPU) 1041 and a microphone 1042. The graphics processor 1041 processes the image data of still pictures or video obtained by an image capture device (such as a camera) in video capture mode or image capture mode. The processed image frames can be displayed on the display unit 106. Image frames processed by the graphics processor 1041 can be stored in the memory 109 (or another storage medium) or sent via the radio frequency unit 101 or the WiFi module 102. The microphone 1042 can receive sound (audio data) in operating modes such as telephone call mode, recording mode, and speech recognition mode, and can process such sound into audio data. In telephone call mode, the processed audio (voice) data can be converted into a format that can be sent to a mobile communication base station via the radio frequency unit 101 and then output. The microphone 1042 can implement various types of noise cancellation (or suppression) algorithms to cancel (or suppress) noise or interference generated while receiving and sending audio signals.
The mobile terminal 100 also includes at least one sensor 105, such as an optical sensor, a motion sensor, and other sensors. Specifically, the optical sensor includes an ambient light sensor and a proximity sensor. The ambient light sensor can adjust the brightness of the display panel 1061 according to the brightness of the ambient light, and the proximity sensor can turn off the display panel 1061 and/or the backlight when the mobile terminal 100 is moved to the ear. As one kind of motion sensor, an accelerometer can detect the magnitude of acceleration in all directions (generally three axes) and, when stationary, the magnitude and direction of gravity. It can be used in applications that recognize the posture of the phone (such as switching between landscape and portrait, related games, and magnetometer posture calibration) and in vibration-recognition functions (such as a pedometer or tap detection). The phone can also be equipped with other sensors such as a fingerprint sensor, pressure sensor, iris sensor, molecular sensor, gyroscope, barometer, hygrometer, thermometer, and infrared sensor, which are not described further here.
The display unit 106 is used to display information entered by the user or information provided to the user. The display unit 106 may include a display panel 1061, which can be configured in the form of a liquid crystal display (Liquid Crystal Display, LCD), an organic light-emitting diode (Organic Light-Emitting Diode, OLED) display, or the like.
The user input unit 107 can be used to receive entered numeric or character information and to generate key signal input related to user settings and function control of the mobile terminal. Specifically, the user input unit 107 may include a touch panel 1071 and other input devices 1072. The touch panel 1071, also called a touch screen, collects touch operations by the user on or near it (for example, operations performed by the user on or near the touch panel 1071 with a finger, a stylus, or any other suitable object or accessory) and drives the corresponding connected device according to a preset program. The touch panel 1071 may include two parts: a touch detection device and a touch controller. The touch detection device detects the position of the user's touch, detects the signal produced by the touch operation, and passes the signal to the touch controller. The touch controller receives the touch information from the touch detection device, converts it into contact coordinates, sends them to the processor 110, and can receive and execute commands sent by the processor 110. The touch panel 1071 can be implemented in several types, such as resistive, capacitive, infrared, and surface acoustic wave. In addition to the touch panel 1071, the user input unit 107 may include other input devices 1072. Specifically, the other input devices 1072 may include, but are not limited to, one or more of a physical keyboard, function keys (such as volume control keys and a power key), a trackball, a mouse, and a joystick, which are not specifically limited here.
Further, the touch panel 1071 can cover the display panel 1061. When the touch panel 1071 detects a touch operation on or near it, it passes the operation to the processor 110 to determine the type of touch event, and the processor 110 then provides the corresponding visual output on the display panel 1061 according to the type of touch event. Although in Fig. 1 the touch panel 1071 and the display panel 1061 are two independent components that implement the input and output functions of the mobile terminal, in some embodiments the touch panel 1071 and the display panel 1061 can be integrated to implement the input and output functions of the mobile terminal, which is not specifically limited here.
The interface unit 108 serves as an interface through which at least one external device can be connected to the mobile terminal 100. For example, the external device may include a wired or wireless headset port, an external power supply (or battery charger) port, a wired or wireless data port, a memory card port, a port for connecting a device that has an identification module, an audio input/output (I/O) port, a video I/O port, an earphone port, and so on. The interface unit 108 can be used to receive input (for example, data information or power) from an external device and pass the received input to one or more elements in the mobile terminal 100, or to transfer data between the mobile terminal 100 and an external device.
The memory 109 can be used to store software programs and various data. The memory 109 may mainly include a program storage area and a data storage area. The program storage area can store the operating system and the application programs required by at least one function (such as a sound playback function or an image playback function); the data storage area can store data created through use of the phone (such as audio data and a phone book). In addition, the memory 109 may include high-speed random access memory and may also include non-volatile memory, for example at least one disk storage device, a flash memory device, or another volatile solid-state storage device.
The processor 110 is the control center of the mobile terminal. It connects all parts of the mobile terminal through various interfaces and lines, and it performs the functions of the mobile terminal and processes data by running or executing the software programs and/or modules stored in the memory 109 and calling the data stored in the memory 109, thereby monitoring the mobile terminal as a whole. The processor 110 may include one or more processing units. Preferably, the processor 110 can integrate an application processor and a modem processor, where the application processor mainly handles the operating system, user interface, and application programs, and the modem processor mainly handles wireless communication. It should be understood that the modem processor need not be integrated into the processor 110.
The mobile terminal 100 can also include a power supply 111 (such as a battery) that powers the components. Preferably, the power supply 111 can be logically connected to the processor 110 through a power management system, so that functions such as charging, discharging, and power consumption management are handled by the power management system.
Although not shown in Fig. 1, the mobile terminal 100 can also include a Bluetooth module and the like, which are not described further here.
To facilitate understanding of the embodiments of the present invention, the communication network system on which the mobile terminal of the present invention is based is described below.
Referring to Fig. 2, which is an architecture diagram of a communication network system provided by an embodiment of the present invention, the communication network system is an LTE system of universal mobile communication technology. The LTE system includes, connected in sequence for communication, a UE (User Equipment) 201, an E-UTRAN (Evolved UMTS Terrestrial Radio Access Network) 202, an EPC (Evolved Packet Core) 203, and the operator's IP services 204.
Specifically, UE201 can be the terminal 100 described above, which is not described again here.
E-UTRAN202 includes eNodeB2021 and other eNodeB2022, among others. eNodeB2021 can be connected to the other eNodeB2022 through a backhaul (such as an X2 interface). eNodeB2021 is connected to EPC203 and can provide UE201 with access to EPC203.
EPC203 may include an MME (Mobility Management Entity) 2031, an HSS (Home Subscriber Server) 2032, other MME2033, an SGW (Serving Gateway) 2034, a PGW (PDN Gateway, packet data network gateway) 2035, a PCRF (Policy and Charging Rules Function) 2036, and so on. MME2031 is the control node that handles signaling between UE201 and EPC203 and provides bearer and connection management. HSS2032 provides registers for managing functions such as the home location register (not shown) and stores user-specific information about service characteristics, data rates, and the like. All user data can be sent through SGW2034. PGW2035 can provide IP address allocation for UE201 and other functions. PCRF2036 is the policy and charging control decision point for service data flows and IP bearer resources; it selects and provides the available policy and charging control decisions for the policy and charging enforcement function unit (not shown).
The IP services 204 may include the Internet, an intranet, an IMS (IP Multimedia Subsystem), or other IP services.
Although the above description takes an LTE system as an example, those skilled in the art should understand that the present invention applies not only to LTE systems but also to other wireless communication systems, such as GSM, CDMA2000, WCDMA, TD-SCDMA, and future network systems, which are not limited here.
The method embodiments of the present invention are proposed on the basis of the mobile terminal hardware structure and communication network system described above.
Embodiment one
Fig. 3 is a flowchart of the first embodiment of the voice input control method of the present invention. A voice input control method includes:
S1, triggering a voice input operation in the current interactive interface by a voice input instruction;
S2, obtaining voice information through the voice input operation;
S3, identifying the input demand of the interactive interface, where the input demand includes a voice input demand and a text input demand;
S4, sending the voice information according to the voice input demand, or converting the voice information into text information according to the text input demand and sending it.
In this embodiment, first, a voice input operation in the current interactive interface is triggered by a voice input instruction; then voice information is obtained through the voice input operation; next, the input demand of the interactive interface is identified, where the input demand includes a voice input demand and a text input demand; finally, the voice information is sent according to the voice input demand, or the voice information is converted into text information according to the text input demand and sent.
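The final step (S4) amounts to a dispatch on the identified demand. The Python sketch below illustrates that dispatch only; the function name `send_by_demand` and the `transcribe` and `send` callables are hypothetical placeholders supplied by the caller, since the patent names neither a speech-to-text engine nor a messaging interface.

```python
from typing import Callable


def send_by_demand(voice: bytes, demand: str,
                   transcribe: Callable[[bytes], str],
                   send: Callable[[str, object], None]) -> None:
    """Send the captured voice as-is, or as text, depending on the demand.

    `transcribe` and `send` are hypothetical callables provided by the caller.
    """
    if demand == "voice":
        send("voice", voice)
    elif demand == "text":
        send("text", transcribe(voice))
    else:
        raise ValueError(f"unknown demand: {demand!r}")
```

Passing the two callables in keeps the dispatch independent of any particular recognizer or transport, which also makes it easy to exercise with stand-in functions.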
Specifically, the voice control scheme of the present invention is suitable for smart devices such as smartphones and tablet computers. This embodiment takes a mobile phone as an example. First, voice information is recorded and parsed: the phone has recording components such as a microphone, obtains external audio information through them, and caches the audio information in its cache component; the processor then parses the cached voice information with a preset algorithm. It should be understood that the external audio information obtained through the phone's recording components includes both the user's voice information and other environmental noise. Before the voice information is parsed, if the environmental noise exceeds a certain threshold, noise reduction is applied first, and the parsing operation is carried out afterwards.
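The conditional noise reduction described above can be sketched as follows. The patent does not say how the noise level is measured or which noise reduction algorithm is used; this sketch assumes the level is the root-mean-square (RMS) amplitude of a noise-only reference segment, and it takes the noise reduction routine as a caller-supplied `denoise` function. All names and the threshold value are illustrative assumptions.

```python
import math
from typing import Callable, List, Sequence


def rms(samples: Sequence[float]) -> float:
    """Root-mean-square level of a block of samples (0.0 for an empty block)."""
    if not samples:
        return 0.0
    return math.sqrt(sum(s * s for s in samples) / len(samples))


def prepare_for_parsing(speech: Sequence[float],
                        noise_reference: Sequence[float],
                        threshold: float,
                        denoise: Callable[[Sequence[float]], List[float]]) -> List[float]:
    """Denoise only when the estimated ambient noise exceeds the threshold.

    Assumption: ambient noise is estimated from a noise-only reference segment.
    """
    if rms(noise_reference) > threshold:
        return denoise(speech)
    return list(speech)
```

Skipping the noise reduction step in quiet conditions avoids both its processing cost and any distortion it might add to clean speech.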
Specifically, in this embodiment, the voice input operation in the current interactive interface is triggered by the voice input instruction. The current interactive interface may be a message sending and receiving interface or a message conversation interface of the system, or a message sending and receiving interface or a message conversation interface of an application program. For example, a physical button for turning on voice control is provided among the side keys of the terminal device, and voice control is turned on through this physical button.
Further, a physical button for turning on voice control is provided among the side keys of the terminal device, and voice control is turned on through this physical button: when the user presses and holds the physical button, the voice input operation starts, and when the user releases the physical button, the voice input operation ends.
Further, a physical button for turning on voice control is provided in the top region, bottom region, rear region or front region of the terminal device, and voice control is turned on through this physical button: when the user presses and holds the physical button, the voice input operation starts, and when the user releases the physical button, the voice input operation ends.
It can be understood that this scheme is not limited to message conversation interfaces; it also covers other application scenarios that have both text and voice input demands, for example, voice assistants and voice navigation.
In this embodiment, after the voice input operation in the current interactive interface is triggered by the voice input instruction, voice information is obtained through the voice input operation. As in the example above, the voice input operation starts the acquisition of voice information. It can be understood that the voice input operation may be a touch instruction for a touch scheme, or a physical button instruction for a press scheme. Specifically, voice information is acquired continuously while a preset region of the interactive interface is touched continuously, or while a preset physical button is pressed continuously. Further, the voice information within a time period is obtained by touching the preset region of the interactive interface twice, once at the start and once at the end of the period, or by pressing the preset physical button twice in the same way.
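The two capture styles described above (hold to record, or one press to start and one press to stop) can be sketched as a small state machine. This is an illustration only; `TriggerMode`, `Recorder`, `on_press` and `on_release` are hypothetical names, and the same logic is assumed to apply to both the touch region and the physical button.

```python
from enum import Enum


class TriggerMode(Enum):
    HOLD = "hold"      # record while the region or button stays pressed
    TOGGLE = "toggle"  # first press starts recording, second press stops it


class Recorder:
    def __init__(self, mode: TriggerMode) -> None:
        self.mode = mode
        self.recording = False

    def on_press(self) -> None:
        if self.mode is TriggerMode.HOLD:
            self.recording = True
        else:
            # Each press flips the state, so two presses bound one time period.
            self.recording = not self.recording

    def on_release(self) -> None:
        # Releasing only matters in hold mode; toggle mode ignores it.
        if self.mode is TriggerMode.HOLD:
            self.recording = False
```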
Specifically, in this embodiment, after the voice information is obtained through the voice input operation, the input demand of the interactive interface is identified, wherein the input demand includes a voice input demand and a text input demand. The voice input demand described in this scheme means that, in the current state, the information format corresponding to this voice input operation is voice information; likewise, the text input demand described in this scheme means that, in the current state, the information format corresponding to this voice input operation is text information. For example, the dialogue interface with a contact displays dialogue entries in chronological order. It can be understood that when the input voice information is sent directly, the corresponding dialogue entry is voice information, and when the input text information is sent directly, the corresponding dialogue entry is text information.
In this embodiment, the current input demand is determined by the information type of the last dialogue entry; that is, if the information type of the last dialogue entry is text information, the input demand of the interactive interface is correspondingly identified as a text input demand;
Further, the current input demand is determined by the information type of the last dialogue entry; that is, if the information type of the last dialogue entry is voice information, the input demand of the interactive interface is correspondingly identified as a voice input demand;
Further, the current input demand is determined by the information type of the last dialogue entry sent by the user; that is, if the information type of that entry is voice information, the input demand of the interactive interface is correspondingly identified as a voice input demand, and if the information type of that entry is text information, the input demand of the interactive interface is correspondingly identified as a text input demand.
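The last-entry rules above can be sketched as one function with a switch for the variant that looks only at entries sent by the user. `DialogueEntry` and `demand_from_last_entry` are hypothetical names, and the `default` returned for an empty conversation is an assumption, since the source does not cover that case.

```python
from dataclasses import dataclass
from typing import Sequence


@dataclass
class DialogueEntry:
    kind: str           # "voice" or "text"
    sent_by_user: bool  # False for entries received from the contact


def demand_from_last_entry(
    entries: Sequence[DialogueEntry],
    own_only: bool = False,
    default: str = "text",  # assumed fallback for an empty history
) -> str:
    """Return the input demand implied by the most recent qualifying entry."""
    for entry in reversed(entries):
        if own_only and not entry.sent_by_user:
            continue  # skip entries from the other party
        return entry.kind
    return default
```

For a history ending in a received text message that follows a voice message sent by the user, the plain rule yields a text input demand, while the user-only variant yields a voice input demand.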
In this embodiment, the input demand of the interactive interface is identified, wherein the input demand includes a voice input demand and a text input demand; afterwards, the voice information is sent according to the voice input demand, or the voice information is converted into text information according to the text input demand and then sent. As in the example above, the current input demand is determined by the information type of the last dialogue entry; that is, if the information type of the last dialogue entry is text information, the input demand of the interactive interface is identified as a text input demand, and the voice information is then converted into text information according to the text input demand and sent. Alternatively, the current input demand is determined by the information type of the last dialogue entry sent by the user: if that information type is text information, the input demand of the interactive interface is identified as a text input demand, and the voice information is then converted into text information according to the text input demand and sent.
The beneficial effect of this embodiment is as follows. A voice input operation in the current interactive interface is triggered by a voice input instruction; then, voice information is obtained through the voice input operation; next, the input demand of the interactive interface is identified, wherein the input demand includes a voice input demand and a text input demand; finally, the voice information is sent according to the voice input demand, or the voice information is converted into text information according to the text input demand and then sent. This realizes a user-friendly voice input control scheme: the user can quickly perform voice input, and the information produced by voice input is adaptively switched between voice information and text information, which spares the user from switching between text input and voice input and considerably improves the integrity and adaptability of voice input.
Embodiment two
Fig. 4 is a flowchart of the second embodiment of the voice input control method of the present invention. Based on the above embodiment, triggering the voice input operation in the current interactive interface by the voice input instruction includes:
S11, displaying a dialog region and an input area in the interactive interface;
S12, activating the input area by the voice input instruction, and displaying status information of the voice input in the input area.
In this embodiment, first, a dialog region and an input area are displayed in the interactive interface; then, the input area is activated by the voice input instruction, and status information of the voice input is displayed in the input area.
Specifically, in this embodiment, the voice input operation in the current interactive interface is triggered by the voice input instruction. The current interactive interface may be a message sending and receiving interface or a message conversation interface of the system, or a message sending and receiving interface or a message conversation interface of an application program. For example, a physical button for turning on voice control is provided among the side keys of the terminal device, and voice control is turned on through this physical button.
Further, a physical button for turning on voice control is provided among the side keys of the terminal device, and voice control is turned on through this physical button: when the user presses and holds the physical button, the voice input operation starts, and when the user releases the physical button, the voice input operation ends.
Further, a physical button for turning on voice control is provided in the top region, bottom region, rear region or front region of the terminal device, and voice control is turned on through this physical button: when the user presses and holds the physical button, the voice input operation starts, and when the user releases the physical button, the voice input operation ends.
It can be understood that this scheme is not limited to message conversation interfaces; it also covers other application scenarios that have both text and voice input demands, for example, voice assistants and voice navigation.
Further, the status information shows that continuously touching the preset region of the interactive interface continuously acquires voice information;
Further, the status information shows that continuously pressing the preset physical button continuously acquires voice information;
Further, the status information shows that touching the preset region of the interactive interface twice, once at the start and once at the end, acquires the voice information within that time period;
Further, the status information shows that pressing the preset physical button twice, once at the start and once at the end, acquires the voice information within that time period.
The beneficial effect of this embodiment is as follows. A dialog region and an input area are displayed in the interactive interface; then, the input area is activated by the voice input instruction, and status information of the voice input is displayed in the input area. This provides the environmental and conditional basis for a user-friendly voice input control scheme: the user can quickly perform voice input, and the information produced by voice input is adaptively switched between voice information and text information, which spares the user from switching between text input and voice input and considerably improves the integrity and adaptability of voice input.
Embodiment three
Fig. 5 is a flowchart of the third embodiment of the voice input control method of the present invention. Based on the above embodiment, obtaining voice information through the voice input operation includes:
S21, acquiring the voice information, and updating the status information in real time;
S22, caching the acquired voice information.
In this embodiment, first, the voice information is acquired and the status information is updated in real time; then, the acquired voice information is cached.
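Steps S21 and S22 can be sketched as a loop that caches each audio chunk and refreshes the displayed status as it goes. This is a minimal sketch under assumptions: the chunked audio source, the fixed chunk duration, the status wording and the `capture` function itself are all hypothetical, since the source does not specify what the status information contains.

```python
from typing import Callable, Iterable, List


def capture(
    chunks: Iterable[bytes],
    chunk_ms: int,
    on_status: Callable[[str], None],
) -> bytes:
    """Cache audio chunks (S22) while updating the status display (S21)."""
    cache: List[bytes] = []
    elapsed_ms = 0
    for chunk in chunks:
        cache.append(chunk)
        elapsed_ms += chunk_ms
        # Assumed status text: elapsed recording time in seconds.
        on_status(f"Recording {elapsed_ms / 1000:.1f} s")
    return b"".join(cache)
```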
In this embodiment, after the voice input operation in the current interactive interface is triggered by the voice input instruction, voice information is obtained through the voice input operation. As in the example above, the voice input operation starts the acquisition of voice information. It can be understood that the voice input operation may be a touch instruction for a touch scheme, or a physical button instruction for a press scheme. Specifically, voice information is acquired continuously while a preset region of the interactive interface is touched continuously, or while a preset physical button is pressed continuously. Further, the voice information within a time period is obtained by touching the preset region of the interactive interface twice, once at the start and once at the end of the period, or by pressing the preset physical button twice in the same way.
Further, the updated status information shows that continuously touching the preset region of the interactive interface continuously acquires voice information;
Further, the updated status information shows that continuously pressing the preset physical button continuously acquires voice information;
Further, the updated status information shows that touching the preset region of the interactive interface twice, once at the start and once at the end, acquires the voice information within that time period;
Further, the updated status information shows that pressing the preset physical button twice, once at the start and once at the end, acquires the voice information within that time period.
The beneficial effect of this embodiment is as follows. The voice information is acquired and the status information is updated in real time; then, the acquired voice information is cached. This provides the environmental and conditional basis for a user-friendly voice input control scheme: the user can quickly perform voice input, and the information produced by voice input is adaptively switched between voice information and text information, which spares the user from switching between text input and voice input and considerably improves the integrity and adaptability of voice input.
Embodiment four
Fig. 6 is a flowchart of the fourth embodiment of the voice input control method of the present invention. Based on the above embodiment, identifying the input demand of the interactive interface, wherein the input demand includes a voice input demand and a text input demand, includes:
S31, detecting the dialog region and the input area;
S32, judging the input demand of the dialog region and the input area, wherein the input demand includes a voice input demand and a text input demand;
S33, if the last piece of information in the dialog region is voice information, or the input area is in a voice input state, determining a voice input demand; if the last piece of information in the dialog region is text information, or the input area is in a text input state, determining a text input demand.
In this embodiment, first, the dialog region and the input area are detected; then, the input demand of the dialog region and the input area is judged, wherein the input demand includes a voice input demand and a text input demand; finally, if the last piece of information in the dialog region is voice information, or the input area is in a voice input state, a voice input demand is determined, and if the last piece of information in the dialog region is text information, or the input area is in a text input state, a text input demand is determined.
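Step S33 reads two signals, the last entry in the dialog region and the state of the input area, but the source does not say which wins when they disagree. The sketch below assumes the input area state takes precedence when it is known; that ordering, and the name `identify_demand`, are assumptions made for illustration only.

```python
from typing import Optional


def identify_demand(
    last_entry_kind: Optional[str],
    input_area_state: Optional[str],
) -> Optional[str]:
    """Return "voice" or "text", or None when neither signal is available."""
    # Assumed precedence: an explicit input-area state overrides the history.
    for signal in (input_area_state, last_entry_kind):
        if signal in ("voice", "text"):
            return signal
    return None
```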
Specifically, in this embodiment, after the voice information is obtained through the voice input operation, the input demand of the interactive interface is identified, wherein the input demand includes a voice input demand and a text input demand. The voice input demand described in this scheme means that, in the current state, the information format corresponding to this voice input operation is voice information; likewise, the text input demand described in this scheme means that, in the current state, the information format corresponding to this voice input operation is text information. For example, the dialogue interface with a contact displays dialogue entries in chronological order. It can be understood that when the input voice information is sent directly, the corresponding dialogue entry is voice information, and when the input text information is sent directly, the corresponding dialogue entry is text information.
In this embodiment, the current input demand is determined by the information type of the last dialogue entry; that is, if the information type of the last dialogue entry is text information, the input demand of the interactive interface is correspondingly identified as a text input demand;
Further, the current input demand is determined by the information type of the last dialogue entry; that is, if the information type of the last dialogue entry is voice information, the input demand of the interactive interface is correspondingly identified as a voice input demand;
Further, the current input demand is determined by the information type of the last dialogue entry sent by the user; that is, if the information type of that entry is voice information, the input demand of the interactive interface is correspondingly identified as a voice input demand, and if the information type of that entry is text information, the input demand of the interactive interface is correspondingly identified as a text input demand;
Further, the current input demand is determined by the proportion of voice dialogue entries or text dialogue entries among all dialogue entries within a certain preset time (for example, within ten minutes); that is, the type with the larger share is taken as the current input demand;
Further, the current input demand is determined by the proportion of voice dialogue entries or text dialogue entries among all dialogue entries within a certain number of historical entries (for example, the most recent ten dialogue entries); that is, the type with the larger share is taken as the current input demand.
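The two proportion-based rules above can be sketched as a shared majority count applied either to a time window or to the most recent N entries. The names `Entry`, `majority_kind`, `demand_in_window` and `demand_in_last_n` are hypothetical, and the tie-breaking `default` is an assumption because the source does not say what happens when the shares are equal.

```python
from collections import Counter
from dataclasses import dataclass
from typing import Sequence


@dataclass
class Entry:
    kind: str         # "voice" or "text"
    timestamp: float  # seconds


def majority_kind(entries: Sequence[Entry], default: str = "text") -> str:
    """Return the entry type with the larger share; fall back on ties or no data."""
    counts = Counter(e.kind for e in entries)
    if counts["voice"] > counts["text"]:
        return "voice"
    if counts["text"] > counts["voice"]:
        return "text"
    return default  # assumed tie-break


def demand_in_window(entries: Sequence[Entry], now: float, window_s: float = 600.0) -> str:
    """Majority type among entries from the last window_s seconds (ten minutes by default)."""
    return majority_kind([e for e in entries if now - e.timestamp <= window_s])


def demand_in_last_n(entries: Sequence[Entry], n: int = 10) -> str:
    """Majority type among the most recent n entries."""
    recent = entries[-n:] if n > 0 else []
    return majority_kind(recent)
```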
The beneficial effect of this embodiment is as follows. The dialog region and the input area are detected; then, the input demand of the dialog region and the input area is judged, wherein the input demand includes a voice input demand and a text input demand; finally, if the last piece of information in the dialog region is voice information, or the input area is in a voice input state, a voice input demand is determined, and if the last piece of information in the dialog region is text information, or the input area is in a text input state, a text input demand is determined. This provides the environmental and conditional basis for a user-friendly voice input control scheme: the user can quickly perform voice input, and the information produced by voice input is adaptively switched between voice information and text information, which spares the user from switching between text input and voice input and considerably improves the integrity and adaptability of voice input.
Embodiment five
Fig. 7 is a flowchart of the fifth embodiment of the voice input control method of the present invention. Based on the above embodiment, sending the voice information according to the voice input demand, or converting the voice information into text information according to the text input demand and sending it, includes:
S41, recording the input demand update information of the input area;
S42, determining the next input demand according to the input demand update information, and sending the voice information or the text information according to that input demand.
In this embodiment, first, the input demand update information of the input area is recorded; then, the next input demand is determined according to the input demand update information, and the voice information or the text information is sent according to that input demand.
It can be understood that the input demand update information of the input area in this embodiment is the information input under the current input demand. For example, if the current input demand is text information, the next input demand is updated to text information.
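Steps S41 and S42 amount to remembering what was just sent and reusing it as the next demand. The sketch below is one possible reading; `InputArea`, `record_sent` and `next_demand` are hypothetical names, and the initial value is an assumption.

```python
class InputArea:
    """Tracks the input demand to apply to the next voice input."""

    VALID = ("voice", "text")

    def __init__(self, initial: str = "text") -> None:  # assumed starting demand
        self.next_demand = initial

    def record_sent(self, kind: str) -> None:
        """S41: record the type just sent; S42 then reads next_demand."""
        if kind not in self.VALID:
            raise ValueError(f"unknown information type: {kind!r}")
        self.next_demand = kind
```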
In this embodiment, the input demand of the interactive interface is identified, wherein the input demand includes a voice input demand and a text input demand; afterwards, the voice information is sent according to the voice input demand, or the voice information is converted into text information according to the text input demand and then sent. As in the example above, the current input demand is determined by the information type of the last dialogue entry; that is, if the information type of the last dialogue entry is text information, the input demand of the interactive interface is identified as a text input demand, and the voice information is then converted into text information according to the text input demand and sent. Alternatively, the current input demand is determined by the information type of the last dialogue entry sent by the user: if that information type is text information, the input demand of the interactive interface is identified as a text input demand, and the voice information is then converted into text information according to the text input demand and sent.
The beneficial effect of this embodiment is as follows. The input demand update information of the input area is recorded; then, the next input demand is determined according to the input demand update information, and the voice information or the text information is sent according to that input demand. This realizes a user-friendly voice input control scheme: the user can quickly perform voice input, and the information produced by voice input is adaptively switched between voice information and text information, which spares the user from switching between text input and voice input and considerably improves the integrity and adaptability of voice input.
Embodiment six
Based on the above embodiments, the present invention also discloses a voice input control device. The device includes a memory, a processor, and a computer program stored in the memory and executable on the processor, and the computer program, when executed by the processor, implements:
triggering a voice input operation in the current interactive interface by a voice input instruction;
obtaining voice information through the voice input operation;
identifying the input demand of the interactive interface, wherein the input demand includes a voice input demand and a text input demand;
sending the voice information according to the voice input demand, or converting the voice information into text information according to the text input demand and sending it.
In this embodiment, first, a voice input operation in the current interactive interface is triggered by a voice input instruction; then, voice information is obtained through the voice input operation; next, the input demand of the interactive interface is identified, wherein the input demand includes a voice input demand and a text input demand; finally, the voice information is sent according to the voice input demand, or the voice information is converted into text information according to the text input demand and then sent.
Specifically, the voice control scheme of the present invention is applicable to smart devices such as smartphones and tablet computers. This embodiment takes a mobile phone as an example. First, the voice information is recorded and parsed: the mobile phone has a recording component such as a microphone, external audio information is obtained through that recording component, the audio information is cached by a cache component of the mobile phone, and the cached voice information is then parsed by the processor using a preset algorithm. It can be understood that the external audio information obtained through the recording component of the mobile phone includes both the user's voice information and other environmental noise. Before the voice information is parsed, if the environmental noise exceeds a certain threshold, noise reduction is performed on the audio first, and the parsing operation is carried out afterwards.
Specifically, in this embodiment, the voice input operation in the current interactive interface is triggered by the voice input instruction. The current interactive interface may be a message sending and receiving interface or a message conversation interface of the system, or a message sending and receiving interface or a message conversation interface of an application program. For example, a physical button for turning on voice control is provided among the side keys of the terminal device, and voice control is turned on through this physical button.
Further, a physical button for turning on voice control is provided among the side keys of the terminal device, and voice control is turned on through this physical button: when the user presses and holds the physical button, the voice input operation starts, and when the user releases the physical button, the voice input operation ends.
Further, a physical button for turning on voice control is provided in the top region, bottom region, rear region or front region of the terminal device, and voice control is turned on through this physical button: when the user presses and holds the physical button, the voice input operation starts, and when the user releases the physical button, the voice input operation ends.
It can be understood that this scheme is not limited to message conversation interfaces; it also covers other application scenarios that have both text and voice input demands, for example, voice assistants and voice navigation.
In this embodiment, after the voice input operation in the current interactive interface is triggered by the voice input instruction, voice information is obtained through the voice input operation. As in the example above, the voice input operation starts the acquisition of voice information. It can be understood that the voice input operation may be a touch instruction for a touch scheme, or a physical button instruction for a press scheme. Specifically, voice information is acquired continuously while a preset region of the interactive interface is touched continuously, or while a preset physical button is pressed continuously. Further, the voice information within a time period is obtained by touching the preset region of the interactive interface twice, once at the start and once at the end of the period, or by pressing the preset physical button twice in the same way.
Specifically, in this embodiment, after the voice information is obtained through the voice input operation, the input demand of the interactive interface is identified, wherein the input demand includes a voice input demand and a text input demand. The voice input demand described in this scheme means that, in the current state, the information format corresponding to this voice input operation is voice information; likewise, the text input demand described in this scheme means that, in the current state, the information format corresponding to this voice input operation is text information. For example, the dialogue interface with a contact displays dialogue entries in chronological order. It can be understood that when the input voice information is sent directly, the corresponding dialogue entry is voice information, and when the input text information is sent directly, the corresponding dialogue entry is text information.
In this embodiment, the current input demand is determined by the information type of the last dialogue entry; that is, if the information type of the last dialogue entry is text information, the input demand of the interactive interface is correspondingly identified as a text input demand;
Further, the current input demand is determined by the information type of the last dialogue entry; that is, if the information type of the last dialogue entry is voice information, the input demand of the interactive interface is correspondingly identified as a voice input demand;
Further, the current input demand is determined by the information type of the last dialogue entry sent by the user; that is, if the information type of that entry is voice information, the input demand of the interactive interface is correspondingly identified as a voice input demand, and if the information type of that entry is text information, the input demand of the interactive interface is correspondingly identified as a text input demand.
In this embodiment, the input demand of the interactive interface is identified, wherein the input demand includes a voice input demand and a text input demand; afterwards, the voice information is sent according to the voice input demand, or the voice information is converted into text information according to the text input demand and then sent. As in the example above, the current input demand is determined by the information type of the last dialogue entry; that is, if the information type of the last dialogue entry is text information, the input demand of the interactive interface is identified as a text input demand, and the voice information is then converted into text information according to the text input demand and sent. Alternatively, the current input demand is determined by the information type of the last dialogue entry sent by the user: if that information type is text information, the input demand of the interactive interface is identified as a text input demand, and the voice information is then converted into text information according to the text input demand and sent.
The beneficial effect of this embodiment is as follows. A voice input operation in the current interactive interface is triggered by a voice input instruction; then, voice information is obtained through the voice input operation; next, the input demand of the interactive interface is identified, wherein the input demand includes a voice input demand and a text input demand; finally, the voice information is sent according to the voice input demand, or the voice information is converted into text information according to the text input demand and then sent. This realizes a user-friendly voice input control scheme: the user can quickly perform voice input, and the information produced by voice input is adaptively switched between voice information and text information, which spares the user from switching between text input and voice input and considerably improves the integrity and adaptability of voice input.
Embodiment seven
Based on the above embodiment, optionally, the computer program, when executed by the processor, further implements:
displaying a dialog region and an input area in the interactive interface;
activating the input area by the voice input instruction, and displaying status information of the voice input in the input area.
In this embodiment, first, a dialog region and an input area are displayed in the interactive interface; then, the input area is activated by the voice input instruction, and status information of the voice input is displayed in the input area.
Specifically, in this embodiment, the voice input operation in the current interactive interface is triggered by a voice input instruction. The current interactive interface may be a messaging interface or message session interface of the system, or a messaging interface or message session interface of an application. For example, a physical key for enabling voice control is provided among the side keys of the terminal device, and voice control is enabled through this physical key.
Further, a physical key for enabling voice control is provided among the side keys of the terminal device, and voice control is enabled through this key: when the user presses and holds the key, the voice input operation starts, and when the user releases the key, the voice input operation ends.
Further, a physical key for enabling voice control is provided in the top area, bottom area, rear area, or front area of the terminal device, and voice control is enabled through this key: when the user presses and holds the key, the voice input operation starts, and when the user releases the key, the voice input operation ends.
It should be understood that this scheme is not limited to message session interfaces; it also covers other application scenarios that have both text and voice input demands, for example a voice assistant or voice navigation.
Further, the status information shows that a preset area of the interactive interface is being touched continuously, so that voice information is acquired continuously;
Further, the status information shows that a preset physical key is being pressed continuously, so that voice information is acquired continuously;
Further, the status information shows the two touches, one at the start and one at the end, on a preset area of the interactive interface, so that the voice information within the intervening time period is acquired;
Further, the status information shows the two presses, one at the start and one at the end, of a preset physical key, so that the voice information within the intervening time period is acquired.
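The two capture styles described above (hold to record, and one press to start plus one press to end) can be sketched as a small controller that also maintains the displayed status. This is only an illustration: `CaptureMode`, `CaptureController`, and the status strings `"listening"` and `"idle"` are hypothetical names, and the same logic is assumed to apply whether the trigger is a physical key or a preset touch area.

```python
from enum import Enum


class CaptureMode(Enum):
    HOLD = "hold"      # capture lasts while the key or area is held
    TOGGLE = "toggle"  # first press starts capture, second press ends it


class CaptureController:
    def __init__(self, mode: CaptureMode) -> None:
        self.mode = mode
        self.capturing = False
        self.status = "idle"  # text shown in the input area (assumed wording)

    def _set(self, capturing: bool) -> None:
        self.capturing = capturing
        self.status = "listening" if capturing else "idle"

    def press(self) -> None:
        if self.mode is CaptureMode.HOLD:
            self._set(True)
        else:
            self._set(not self.capturing)

    def release(self) -> None:
        # Releasing only matters in HOLD mode; TOGGLE ignores it.
        if self.mode is CaptureMode.HOLD:
            self._set(False)


hold = CaptureController(CaptureMode.HOLD)
hold.press()
hold_status_while_pressed = hold.status
hold.release()

toggle = CaptureController(CaptureMode.TOGGLE)
toggle.press()
toggle.release()
toggle_status_after_first_press = toggle.status
toggle.press()
toggle.release()
```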
The advantageous effect of this embodiment is that a dialog area and an input area are displayed in the interactive interface, the input area is then activated through the voice input instruction, and status information of the voice input is displayed in the input area. This provides the environment and preconditions for a user-friendly voice input control scheme: the user can start voice input quickly, and the captured speech is adaptively sent as either voice information or text information, so the user no longer has to switch manually between text input and voice input, which considerably improves the coherence and adaptability of voice input.
Embodiment eight
Based on the above embodiments, optionally, the computer program, when executed by the processor, further implements:
Obtaining the voice information, and updating the status information in real time;
Caching the obtained voice information.
In this embodiment, the voice information is first obtained and the status information is updated in real time; the obtained voice information is then cached.
In this embodiment, after the voice input operation in the current interactive interface is triggered by the voice input instruction, voice information is obtained through the voice input operation. As in the example above, the voice input operation starts the acquisition of voice information. It should be understood that the voice input operation may be a touch instruction under a touch scheme or a physical key instruction under a key-press scheme. Specifically, voice information is acquired continuously while a preset area of the interactive interface is touched continuously, or while a preset physical key is pressed continuously. Further, the voice information within a time period is acquired by touching a preset area of the interactive interface twice, once at the start and once at the end of the period, or by pressing a preset physical key twice in the same way.
Further, the updated status information shows that a preset area of the interactive interface is being touched continuously, so that voice information is acquired continuously;
Further, the updated status information shows that a preset physical key is being pressed continuously, so that voice information is acquired continuously;
Further, the updated status information shows the two touches, one at the start and one at the end, on a preset area of the interactive interface, so that the voice information within the intervening time period is acquired;
Further, the updated status information shows the two presses, one at the start and one at the end, of a preset physical key, so that the voice information within the intervening time period is acquired.
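Obtaining the voice information, updating the status in real time, and caching what has been obtained can be sketched as follows. `VoiceBuffer`, its `on_status` callback, and the status wording are assumptions made for illustration; a real terminal would receive audio chunks from its microphone driver rather than from literal byte strings.

```python
from typing import Callable, List


class VoiceBuffer:
    """Caches incoming audio chunks and reports status after each one."""

    def __init__(self, on_status: Callable[[str], None]) -> None:
        self._chunks: List[bytes] = []
        self._on_status = on_status  # e.g. updates the input area text

    def append(self, chunk: bytes) -> None:
        self._chunks.append(chunk)
        total = sum(len(c) for c in self._chunks)
        self._on_status(f"recording: {total} bytes cached")

    def finish(self) -> bytes:
        # Hand back the cached audio and reset for the next capture.
        audio = b"".join(self._chunks)
        self._chunks.clear()
        self._on_status("idle")
        return audio


statuses: List[str] = []
buffer = VoiceBuffer(statuses.append)
buffer.append(b"\x00\x01")
buffer.append(b"\x02")
audio = buffer.finish()
```

Caching the audio until capture ends is what allows the later step to decide whether to send it as voice or convert it to text.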
The advantageous effect of this embodiment is that the voice information is obtained while the status information is updated in real time, and the obtained voice information is then cached. This provides the environment and preconditions for a user-friendly voice input control scheme: the user can start voice input quickly, and the captured speech is adaptively sent as either voice information or text information, so the user no longer has to switch manually between text input and voice input, which considerably improves the coherence and adaptability of voice input.
Embodiment nine
Based on the above embodiments, optionally, the computer program, when executed by the processor, further implements:
Detecting the dialog area and the input area;
Judging the input demand of the dialog area and the input area, wherein the input demand includes a voice input demand and a text input demand;
If the last piece of information in the dialog area is voice information, or the input area is in a voice input state, determining that the input demand is a voice input demand; if the last piece of information in the dialog area is text information, or the input area is in a text input state, determining that the input demand is a text input demand;
Recording input demand update information of the input area;
Determining the next input demand according to the input demand update information, and executing the sending operation of the voice information or the sending operation of the text information according to that input demand.
In this embodiment, the dialog area and the input area are first detected; the input demand of the dialog area and the input area is then judged, where the input demand includes a voice input demand and a text input demand; finally, if the last piece of information in the dialog area is voice information, or the input area is in a voice input state, the input demand is determined to be a voice input demand, and if the last piece of information in the dialog area is text information, or the input area is in a text input state, the input demand is determined to be a text input demand.
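The determination rule above can be sketched as a single function. The source does not say which signal wins when the dialog area and the input area disagree, nor what to do when the dialog is empty; the sketch assumes that an explicit input-area state takes precedence and that an empty dialog falls back to a caller-supplied default. `Kind` and `identify_demand` are hypothetical names.

```python
from enum import Enum
from typing import Optional, Sequence


class Kind(Enum):
    VOICE = "voice"
    TEXT = "text"


def identify_demand(
    dialog_kinds: Sequence[Kind],
    input_area_state: Optional[Kind] = None,
    default: Kind = Kind.VOICE,
) -> Kind:
    """Return the input demand for the current interactive interface."""
    # Assumption: an explicit input-area state overrides the dialog history.
    if input_area_state is not None:
        return input_area_state
    # Otherwise use the type of the last piece of information in the dialog.
    if dialog_kinds:
        return dialog_kinds[-1]
    # Assumption: nothing to go on, so use the default.
    return default


from_history = identify_demand([Kind.TEXT, Kind.VOICE])
from_input_area = identify_demand([Kind.VOICE], input_area_state=Kind.TEXT)
from_default = identify_demand([])
```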
Specifically, in this embodiment, after voice information is obtained through the voice input operation, the input demand of the interactive interface is identified, where the input demand includes a voice input demand and a text input demand. Here, a voice input demand means that, in the current state, the information format corresponding to this voice input operation is voice information; likewise, a text input demand means that, in the current state, the information format corresponding to this voice input operation is text information. For example, a dialog interface with a contact shows dialog entries in chronological order. It should be understood that when the input voice information is sent directly, the corresponding dialog entry is voice information, and when the input text information is sent directly, the corresponding dialog entry is text information.
In this embodiment, the current input demand is determined by the information type of the last dialog entry: if the information type of the last dialog entry is text information, the input demand of the interactive interface is identified as a text input demand;
Further, the current input demand is determined by the information type of the last dialog entry: if the information type of the last dialog entry is voice information, the input demand of the interactive interface is identified as a voice input demand;
Further, the current input demand is determined by the information type of the last dialog entry sent by the user: if that entry is voice information, the input demand of the interactive interface is identified as a voice input demand, and if that entry is text information, the input demand of the interactive interface is identified as a text input demand;
Further, the current input demand is determined by the proportion of voice dialog entries to text dialog entries among all dialog entries within a preset time (for example, within the last ten minutes), the type with the larger proportion being taken as the current input demand;
Further, the current input demand is determined by the proportion of voice dialog entries to text dialog entries among a given number of historical dialog entries (for example, the ten most recent entries), the type with the larger proportion being taken as the current input demand.
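The alternative strategies listed above (last entry sent by the user, majority within a time window, majority among the most recent entries) can be sketched as below. `Entry` and the function names are hypothetical, timestamps are assumed to be in seconds, and ties are resolved in favor of voice only because the source does not specify a tie-breaking rule.

```python
from collections import Counter
from dataclasses import dataclass
from typing import List, Optional


@dataclass
class Entry:
    kind: str         # "voice" or "text"
    from_user: bool   # True if the local user sent this entry
    timestamp: float  # seconds


def last_user_kind(entries: List[Entry]) -> Optional[str]:
    """Type of the most recent entry sent by the user, if any."""
    for entry in reversed(entries):
        if entry.from_user:
            return entry.kind
    return None


def majority_kind(entries: List[Entry]) -> Optional[str]:
    if not entries:
        return None
    counts = Counter(entry.kind for entry in entries)
    # Assumption: a tie counts as "voice"; the source does not say.
    return "voice" if counts["voice"] >= counts["text"] else "text"


def majority_in_window(
    entries: List[Entry], now: float, window_s: float = 600.0
) -> Optional[str]:
    """Majority type among entries from the last `window_s` seconds."""
    return majority_kind([e for e in entries if now - e.timestamp <= window_s])


def majority_in_last_n(entries: List[Entry], n: int = 10) -> Optional[str]:
    """Majority type among the `n` most recent entries."""
    return majority_kind(entries[-n:] if n > 0 else [])


history = [
    Entry("voice", False, 0.0),
    Entry("text", True, 700.0),
    Entry("text", False, 900.0),
    Entry("voice", True, 1000.0),
    Entry("text", False, 1100.0),
]
by_last_user = last_user_kind(history)
by_window = majority_in_window(history, now=1200.0)
by_last_three = majority_in_last_n(history, n=3)
```

On this sample history the strategies disagree: the last entry sent by the user is voice, while both majority rules pick text, which shows why the choice of strategy matters.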
In this embodiment, the input demand update information of the input area is first recorded; the next input demand is then determined according to the input demand update information, and the sending operation of the voice information or the sending operation of the text information is executed according to that input demand.
It should be understood that the input demand update information of the input area in this embodiment is the input information under the current input demand. For example, if the current input demand is text information, the next input demand is updated to text information.
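Recording the input demand update information and deriving the next input demand from it can be sketched as a small tracker. `InputDemandTracker` is a hypothetical name, and the initial demand of `"voice"` is an assumption; the source only states that the demand used now becomes the demand for the next input.

```python
from typing import List


class InputDemandTracker:
    VALID = ("voice", "text")

    def __init__(self, initial: str = "voice") -> None:
        self._check(initial)
        self.next_demand = initial       # demand to apply to the next input
        self.history: List[str] = []     # recorded update information

    def _check(self, demand: str) -> None:
        if demand not in self.VALID:
            raise ValueError(f"unknown input demand: {demand!r}")

    def record(self, demand_used: str) -> None:
        """Record the demand just used; it becomes the next demand."""
        self._check(demand_used)
        self.history.append(demand_used)
        self.next_demand = demand_used


tracker = InputDemandTracker()
demand_before = tracker.next_demand
tracker.record("text")
```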
In this embodiment, the input demand of the interactive interface is identified, where the input demand includes a voice input demand and a text input demand; then, under the voice input demand the sending operation of the voice information is executed, or under the text input demand the voice information is converted into text information and the sending operation is executed. As in the example above, the current input demand is determined by the information type of the last dialog entry: if the information type of the last dialog entry is text information, the input demand of the interactive interface is identified as a text input demand, and the voice information is then converted into text information according to the text input demand and sent. Alternatively, the current input demand is determined by the information type of the last dialog entry sent by the user: if the information type of that entry is text information, the input demand of the interactive interface is identified as a text input demand, and the voice information is then converted into text information according to the text input demand and sent.
The advantageous effect of this embodiment is that the input demand update information of the input area is recorded, the next input demand is then determined according to the input demand update information, and the sending operation of the voice information or the sending operation of the text information is executed according to that input demand. This provides a user-friendly voice input control scheme: the user can start voice input quickly, and the captured speech is adaptively sent as either voice information or text information, so the user no longer has to switch manually between text input and voice input, which considerably improves the coherence and adaptability of voice input.
Embodiment ten
Based on the above embodiments, the present invention further provides a computer-readable storage medium on which a voice input control program is stored; when the voice input control program is executed by a processor, the steps of the voice input control method described in any one of the above embodiments are implemented.
With the voice input control method, device, and computer-readable storage medium of the present invention, a voice input operation in the current interactive interface is triggered by a voice input instruction; voice information is then obtained through the voice input operation; next, the input demand of the interactive interface is identified, where the input demand includes a voice input demand and a text input demand; finally, under the voice input demand the sending operation of the voice information is executed, or under the text input demand the voice information is converted into text information and the sending operation is executed. This provides a user-friendly voice input control scheme: the user can start voice input quickly, and the captured speech is adaptively sent as either voice information or text information, so the user no longer has to switch manually between text input and voice input, which considerably improves the coherence and adaptability of voice input.
It should be noted that, herein, the terms "include", "comprise", or any other variant thereof are intended to cover non-exclusive inclusion, so that a process, method, article, or device that includes a series of elements includes not only those elements but also other elements not explicitly listed, or elements inherent to such a process, method, article, or device. Without further limitation, an element defined by the phrase "including a ..." does not exclude the presence of other identical elements in the process, method, article, or device that includes the element.
The serial numbers of the above embodiments of the present invention are for description only and do not represent the relative merits of the embodiments.
Through the above description of the embodiments, those skilled in the art can clearly understand that the methods of the above embodiments can be implemented by software plus a necessary general-purpose hardware platform, and of course also by hardware, although in many cases the former is the better implementation. Based on this understanding, the technical solution of the present invention, in essence or in the part that contributes over the prior art, can be embodied in the form of a software product. The computer software product is stored in a storage medium (such as ROM/RAM, a magnetic disk, or an optical disc) and includes several instructions that cause a terminal (which may be a mobile phone, a computer, a server, an air conditioner, a network device, or the like) to execute the methods described in the embodiments of the present invention.
The embodiments of the present invention have been described above with reference to the accompanying drawings, but the present invention is not limited to the specific embodiments described above. The above embodiments are merely illustrative rather than restrictive. Under the inspiration of the present invention, those of ordinary skill in the art can devise many other forms without departing from the purpose of the present invention and the scope protected by the claims, all of which fall within the protection of the present invention.
Claims (10)
1. A voice input control method, characterized in that the method comprises:
Triggering a voice input operation in the current interactive interface through a voice input instruction;
Obtaining voice information through the voice input operation;
Identifying the input demand of the interactive interface, wherein the input demand includes a voice input demand and a text input demand;
Executing a sending operation of the voice information according to the voice input demand, or converting the voice information into text information according to the text input demand and executing a sending operation.
2. The voice input control method according to claim 1, characterized in that triggering the voice input operation in the current interactive interface through the voice input instruction comprises:
Displaying a dialog area and an input area in the interactive interface;
Activating the input area through the voice input instruction, and displaying status information of the voice input in the input area.
3. The voice input control method according to claim 2, characterized in that obtaining voice information through the voice input operation comprises:
Obtaining the voice information, and updating the status information in real time;
Caching the obtained voice information.
4. The voice input control method according to claim 3, characterized in that identifying the input demand of the interactive interface, wherein the input demand includes a voice input demand and a text input demand, comprises:
Detecting the dialog area and the input area;
Judging the input demand of the dialog area and the input area, wherein the input demand includes a voice input demand and a text input demand;
If the last piece of information in the dialog area is voice information, or the input area is in a voice input state, determining that the input demand is a voice input demand; if the last piece of information in the dialog area is text information, or the input area is in a text input state, determining that the input demand is a text input demand.
5. The voice input control method according to claim 4, characterized in that executing the sending operation of the voice information according to the voice input demand, or converting the voice information into text information according to the text input demand and executing the sending operation, comprises:
Recording input demand update information of the input area;
Determining the next input demand according to the input demand update information, and executing the sending operation of the voice information or the sending operation of the text information according to that input demand.
6. A voice input control device, characterized in that the device comprises a memory, a processor, and a computer program stored in the memory and executable on the processor, wherein the computer program, when executed by the processor, implements:
Triggering a voice input operation in the current interactive interface through a voice input instruction;
Obtaining voice information through the voice input operation;
Identifying the input demand of the interactive interface, wherein the input demand includes a voice input demand and a text input demand;
Executing a sending operation of the voice information according to the voice input demand, or converting the voice information into text information according to the text input demand and executing a sending operation.
7. The voice input control device according to claim 6, characterized in that the computer program, when executed by the processor, further implements:
Displaying a dialog area and an input area in the interactive interface;
Activating the input area through the voice input instruction, and displaying status information of the voice input in the input area.
8. The voice input control device according to claim 7, characterized in that the computer program, when executed by the processor, further implements:
Obtaining the voice information, and updating the status information in real time;
Caching the obtained voice information.
9. The voice input control device according to claim 8, characterized in that the computer program, when executed by the processor, further implements:
Detecting the dialog area and the input area;
Judging the input demand of the dialog area and the input area, wherein the input demand includes a voice input demand and a text input demand;
If the last piece of information in the dialog area is voice information, or the input area is in a voice input state, determining that the input demand is a voice input demand; if the last piece of information in the dialog area is text information, or the input area is in a text input state, determining that the input demand is a text input demand;
Recording input demand update information of the input area;
Determining the next input demand according to the input demand update information, and executing the sending operation of the voice information or the sending operation of the text information according to that input demand.
10. A computer-readable storage medium, characterized in that a voice input control program is stored on the computer-readable storage medium, and when the voice input control program is executed by a processor, the steps of the voice input control method according to any one of claims 1 to 5 are implemented.
Priority Applications (1)
Application Number | Priority Date | Filing Date | Title |
---|---|---|---|
CN201810202888.5A CN108520750A (en) | 2018-03-13 | 2018-03-13 | A kind of voice input control method, equipment and computer readable storage medium |
Publications (1)
Publication Number | Publication Date |
---|---|
CN108520750A (en) | 2018-09-11 |
Family ID: 63433037
Family Applications (1)
Application Number | Title | Priority Date | Filing Date |
---|---|---|---|
CN201810202888.5A (Pending) CN108520750A (en) | A kind of voice input control method, equipment and computer readable storage medium | 2018-03-13 | 2018-03-13 |
Country Status (1)
Country | Link |
---|---|
CN (1) | CN108520750A (en) |
Citations (11)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
US6021178A (en) * | 1996-03-29 | 2000-02-01 | Siemens Information And Communication Networks, Inc. | System and method for detecting types of signals in messaging systems |
US7035805B1 (en) * | 2000-07-14 | 2006-04-25 | Miller Stephen S | Switching the modes of operation for voice-recognition applications |
CN101308654A (en) * | 2007-05-14 | 2008-11-19 | 华为技术有限公司 | Speech analysis and recognition method, system and apparatus |
CN104869225A (en) * | 2014-02-21 | 2015-08-26 | 宏达国际电子股份有限公司 | Smart conversation method and electronic device using the same |
CN106550146A (en) * | 2016-10-28 | 2017-03-29 | 努比亚技术有限公司 | A kind of chat message dispensing device and method |
CN106710586A (en) * | 2016-12-27 | 2017-05-24 | 北京智能管家科技有限公司 | Speech recognition engine automatic switching method and device |
CN107124352A (en) * | 2017-05-26 | 2017-09-01 | 维沃移动通信有限公司 | The processing method and mobile terminal of a kind of voice messaging |
CN107342088A (en) * | 2017-06-19 | 2017-11-10 | 联想(北京)有限公司 | A kind of conversion method of acoustic information, device and equipment |
CN107395878A (en) * | 2017-07-04 | 2017-11-24 | 合肥市乐腾科技咨询有限公司 | Automatic voice and text conversion communication system |
CN107483736A (en) * | 2017-08-23 | 2017-12-15 | 广东小天才科技有限公司 | Message processing method and device for instant messaging application program |
CN107608957A (en) * | 2017-09-06 | 2018-01-19 | 百度在线网络技术(北京)有限公司 | Text modification method, apparatus and its equipment based on voice messaging |
Cited By (4)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
CN109584879A (en) * | 2018-11-23 | 2019-04-05 | 华为技术有限公司 | A kind of sound control method and electronic equipment |
CN109584879B (en) * | 2018-11-23 | 2021-07-06 | 华为技术有限公司 | Voice control method and electronic equipment |
US11450322B2 (en) | 2018-11-23 | 2022-09-20 | Huawei Technologies Co., Ltd. | Speech control method and electronic device |
CN114697717A (en) * | 2020-12-28 | 2022-07-01 | 深圳Tcl新技术有限公司 | Text input method and terminal equipment |
Similar Documents
Publication | Publication Date | Title |
---|---|---|
CN108572764A (en) | A kind of word input control method, equipment and computer readable storage medium | |
CN107229402A (en) | Dynamic screenshotss method, device and the readable storage medium storing program for executing of terminal | |
CN107864357A (en) | Video calling special effect controlling method, terminal and computer-readable recording medium | |
CN106961706A (en) | Method, mobile terminal and the computer-readable recording medium of communication pattern switching | |
CN108810437A (en) | Record screen method, terminal and computer readable storage medium | |
CN107436779A (en) | A kind of application management method, equipment and computer-readable recording medium | |
CN107682547A (en) | A kind of voice messaging regulation and control method, equipment and computer-readable recording medium | |
CN108551520A (en) | A kind of phonetic search response method, equipment and computer readable storage medium | |
CN110177177A (en) | Message back method, mobile terminal and computer readable storage medium | |
CN108200275A (en) | A kind of record screen control method, equipment and computer readable storage medium | |
CN107463243A (en) | A kind of screen control method, mobile terminal and computer-readable recording medium | |
CN108600513A (en) | A kind of record screen control method, equipment and computer readable storage medium | |
CN108196777A (en) | A kind of flexible screen application process, equipment and computer readable storage medium | |
CN108521500A (en) | A kind of voice scenery control method, equipment and computer readable storage medium | |
CN107181865A (en) | Processing method, terminal and the computer-readable recording medium of unread short messages | |
CN107844230A (en) | A kind of advertisement page method of adjustment, mobile terminal and computer-readable recording medium | |
CN108322609A (en) | A kind of notification information regulation and control method, equipment and computer readable storage medium | |
CN108924352A (en) | Sound quality method for improving, terminal and computer readable storage medium | |
CN107992455A (en) | A kind of text handling method, terminal and computer-readable recording medium | |
CN107390856A (en) | A kind of method, mobile terminal and storage medium for reducing mobile terminal power consumption | |
CN108536383A (en) | A kind of game control method, equipment and computer readable storage medium | |
CN109117105A (en) | A kind of collaboration desktop interaction regulation method, equipment and computer readable storage medium | |
CN108062241A (en) | A kind of switching method of display interface, terminal and storage medium | |
CN107844759A (en) | A kind of gesture identification method, terminal and storage medium | |
CN107368241A (en) | A kind of information processing method, equipment and computer-readable recording medium |
Legal Events
Date | Code | Title | Description |
---|---|---|---|
PB01 | Publication | |
SE01 | Entry into force of request for substantive examination | |
RJ01 | Rejection of invention patent application after publication | Application publication date: 20180911 |