CN108520750A

CN108520750A - A kind of voice input control method, equipment and computer readable storage medium

Info

Publication number: CN108520750A
Application number: CN201810202888.5A
Authority: CN
Inventors: 王彦文
Original assignee: Nubia Technology Co Ltd
Current assignee: Nubia Technology Co Ltd
Priority date: 2018-03-13
Filing date: 2018-03-13
Publication date: 2018-09-11

Abstract

The invention discloses a kind of voice input control method, equipment and computer readable storage mediums, wherein this method includes：The input operation of the voice in current interactive interface is triggered according to speech-input instructions；Then, operation is inputted by the voice and obtains voice messaging；Subsequently, the input demand of the interactive interface is identified, wherein the input demand includes voice input demand and word input demand；Finally, it executes the voice messaging by voice input demand and sends operation, alternatively, the voice messaging is converted to text information by word input demand and executes transmission operation.Realize a kind of voice input control scheme of hommization, user is allow quickly to carry out voice input operation, simultaneously, it is adaptively adjusted, switches the voice messaging after being inputted by voice or text information, user is eliminated in the handover operation of word input and voice input, larger improves the globality and adaptability of voice input.

Description

A kind of voice input control method, equipment and computer readable storage medium

Technical field

The present invention relates to mobile communication field more particularly to a kind of voice input control method, equipment and computer-readable Storage medium.

Background technology

In the prior art, with the intelligent development of terminal device, the function having is more and more abundant, and user can be frequent Using terminal equipment carry out information processing, particularly, user generally pass through word input, voice input carry out data input behaviour Make, still, under some scenes, frequently switching character input modes and voice input mode can undoubtedly be brought larger to user Influence, when make it is complex for operation step, two are reduction of the efficiency of data input, and user experience is bad.

Invention content

In order to solve in the prior art, user by word when being inputted, voice input carrying out data input operation, frequently Switching character input modes and voice input mode can undoubtedly be brought greater impact to user, when make operating procedure it is numerous Trivial, two are reduction of the efficiency of data input, the bad technological deficiency of user experience, and the present invention proposes a kind of voice input control Method processed, this method include：

The input operation of the voice in current interactive interface is triggered by speech-input instructions；

Operation, which is inputted, by the voice obtains voice messaging；

Identify the input demand of the interactive interface, wherein the input demand includes that voice input demand and word are defeated Enter demand；

The voice messaging is executed by voice input demand and sends operation, alternatively, will by word input demand The voice messaging is converted to text information and executes transmission operation.

Optionally, it is described by speech-input instructions trigger the voice in current interactive interface input operation include：

Dialog region and input area are shown in the interactive interface；

The input area is activated by the speech-input instructions, and shows that the voice is defeated in the input area The status information entered.

Optionally, described to include by voice input operation acquisition voice messaging：

Obtain the voice messaging, and status information described in real-time update；

Cache the voice messaging of the acquisition.

Optionally, the input demand of the identification interactive interface, wherein the input demand, which includes voice input, to be needed Summation word inputs demand：

Detect the dialog region and the input area；

Judge the dialog region and the input demand of the input area, wherein the input demand includes that voice is defeated Enter demand and word input demand；

If the last item information in the dialog region is voice messaging or the input area is voice input shape State, it is determined that demand is inputted for voice, if the last item information in the dialog region is text information or the input Region is word input state, it is determined that inputs demand for word.

Optionally, described to execute the voice messaging transmission operation by voice input demand, alternatively, pressing the word The voice messaging is converted to text information and executes transmission operation by input demand：

Record the input demand fresh information of the input area；

Input demand next time is determined according to the input demand fresh information, and by described in input demand execution Voice messaging sends operation or the text information sends operation.

The invention also discloses a kind of voice input control apparatus, which includes memory, processor and is stored in institute The computer program that can be run on memory and on the processor is stated, when the computer program is executed by the processor It realizes：

Operation, which is inputted, by the voice obtains voice messaging；

Optionally, it is realized when the computer program is also executed by the processor：

Dialog region and input area are shown in the interactive interface；

Cache the voice messaging of the acquisition.

Detect the dialog region and the input area；

If the last item information in the dialog region is voice messaging or the input area is voice input shape State, it is determined that demand is inputted for voice, if the last item information in the dialog region is text information or the input Region is word input state, it is determined that inputs demand for word；

Record the input demand fresh information of the input area；

The invention also provides a kind of computer readable storage medium, voice is stored on the computer readable storage medium Input control program, voice input control program realize voice input control as described in any one of the above embodiments when being executed by processor The step of method.

Voice input control method, equipment and the computer readable storage medium for implementing the present invention are referred to by voice input Enable the voice input operation triggered in current interactive interface；Then, operation is inputted by the voice and obtains voice messaging；Again so Afterwards, the input demand of the interactive interface is identified, wherein the input demand, which includes voice input demand and word input, to be needed It asks；Finally, it executes the voice messaging by voice input demand and sends operation, alternatively, will by word input demand The voice messaging is converted to text information and executes transmission operation.A kind of voice input control scheme of hommization is realized, User is allow quickly to carry out voice input operation, meanwhile, it is adaptively adjusted, switches the letter of the voice after being inputted by voice Breath or text information eliminate user in the handover operation of word input and voice input, larger improve voice input Globality and adaptability.

Description of the drawings

Present invention will be further explained below with reference to the attached drawings and examples, in attached drawing：

Fig. 1 is a kind of hardware architecture diagram of mobile terminal of the present invention；

Fig. 2 is a kind of communications network system Organization Chart provided in an embodiment of the present invention；

Fig. 3 is the flow chart of voice input control method first embodiment of the present invention；

Fig. 4 is the flow chart of voice input control method second embodiment of the present invention；

Fig. 5 is the flow chart of voice input control method 3rd embodiment of the present invention；

Fig. 6 is the flow chart of voice input control method fourth embodiment of the present invention；

Fig. 7 is the flow chart of the 5th embodiment of voice input control method of the present invention.

Specific implementation mode

It should be appreciated that the specific embodiments described herein are merely illustrative of the present invention, it is not intended to limit the present invention.

In subsequent description, using for indicating that the suffix of such as " module ", " component " or " unit " of element is only The explanation for being conducive to the present invention, itself does not have a specific meaning.Therefore, " module ", " component " or " unit " can mix Ground uses.

Terminal can be implemented in a variety of manners.For example, terminal described in the present invention may include such as mobile phone, tablet Computer, laptop, palm PC, personal digital assistant (Personal Digital Assistant, PDA), portable The shiftings such as media player (Portable Media Player, PMP), navigation device, wearable device, Intelligent bracelet, pedometer The fixed terminals such as dynamic terminal, and number TV, desktop computer.

It will be illustrated by taking mobile terminal as an example in subsequent descriptions, it will be appreciated by those skilled in the art that in addition to special Except element for moving purpose, construction according to the embodiment of the present invention can also apply to the terminal of fixed type.

Referring to Fig. 1, a kind of hardware architecture diagram of its mobile terminal of each embodiment to realize the present invention, the shifting Moving terminal 100 may include：RF (Radio Frequency, radio frequency) unit 101, WiFi module 102, audio output unit 103, A/V (audio/video) input unit 104, sensor 105, display unit 106, user input unit 107, interface unit 108, the components such as memory 109, processor 110 and power supply 111.It will be understood by those skilled in the art that shown in Fig. 1 Mobile terminal structure does not constitute the restriction to mobile terminal, and mobile terminal may include components more more or fewer than diagram, Either combine certain components or different components arrangement.

The all parts of mobile terminal are specifically introduced with reference to Fig. 1：

Radio frequency unit 101 can be used for receiving and sending messages or communication process in, signal sends and receivees, specifically, by base station Downlink information receive after, to processor 110 handle；In addition, the data of uplink are sent to base station.In general, radio frequency unit 101 Including but not limited to antenna, at least one amplifier, transceiver, coupler, low-noise amplifier, duplexer etc..In addition, penetrating Frequency unit 101 can also be communicated with network and other equipment by radio communication.Above-mentioned wireless communication can use any communication Standard or agreement, including but not limited to GSM (Global System of Mobile communication, global system for mobile telecommunications System), GPRS (General Packet Radio Service, general packet radio service), CDMA2000 (Code Division Multiple Access 2000, CDMA 2000), WCDMA (Wideband Code Division Multiple Access, wideband code division multiple access), TD-SCDMA (Time Division-Synchronous Code Division Multiple Access, TD SDMA), FDD-LTE (Frequency Division Duplexing-Long Term Evolution, frequency division duplex long term evolution) and TDD-LTE (Time Division Duplexing-Long Term Evolution, time division duplex long term evolution) etc..

WiFi belongs to short range wireless transmission technology, and mobile terminal can help user to receive and dispatch electricity by WiFi module 102 Sub- mail, browsing webpage and access streaming video etc., it has provided wireless broadband internet to the user and has accessed.Although Fig. 1 shows Go out WiFi module 102, but it is understood that, and it is not belonging to must be configured into for mobile terminal, it completely can be according to need It to be omitted in the range for the essence for not changing invention.

Audio output unit 103 can be in call signal reception pattern, call mode, record mould in mobile terminal 100 When under the isotypes such as formula, speech recognition mode, broadcast reception mode, it is that radio frequency unit 101 or WiFi module 102 are received or The audio data stored in memory 109 is converted into audio signal and exports to be sound.Moreover, audio output unit 103 The relevant audio output of specific function executed with mobile terminal 100 can also be provided (for example, call signal receives sound, disappears Breath receives sound etc.).Audio output unit 103 may include loud speaker, buzzer etc..

A/V input units 104 are for receiving audio or video signal.A/V input units 104 may include graphics processor (Graphics Processing Unit, GPU) 1041 and microphone 1042, graphics processor 1041 is in video acquisition mode Or the image data of the static images or video obtained by image capture apparatus (such as camera) in image capture mode carries out Reason.Treated, and picture frame may be displayed on display unit 106.Through graphics processor 1041, treated that picture frame can be deposited Storage is sent in memory 109 (or other storage mediums) or via radio frequency unit 101 or WiFi module 102.Mike Wind 1042 can connect in telephone calling model, logging mode, speech recognition mode etc. operational mode via microphone 1042 Quiet down sound (audio data), and can be audio data by such acoustic processing.Audio that treated (voice) data can To be converted to the format output that can be sent to mobile communication base station via radio frequency unit 101 in the case of telephone calling model. Microphone 1042 can implement various types of noises elimination (or inhibition) algorithms and send and receive sound to eliminate (or inhibition) The noise generated during frequency signal or interference.

Mobile terminal 100 further includes at least one sensor 105, such as optical sensor, motion sensor and other biographies Sensor.Specifically, optical sensor includes ambient light sensor and proximity sensor, wherein ambient light sensor can be according to environment The light and shade of light adjusts the brightness of display panel 1061, and proximity sensor can close when mobile terminal 100 is moved in one's ear Display panel 1061 and/or backlight.As a kind of motion sensor, accelerometer sensor can detect in all directions (general For three axis) size of acceleration, size and the direction of gravity are can detect that when static, can be used to identify the application of mobile phone posture (such as horizontal/vertical screen switching, dependent game, magnetometer pose calibrating), Vibration identification correlation function (such as pedometer, percussion) etc.； The fingerprint sensor that can also configure as mobile phone, pressure sensor, iris sensor, molecule sensor, gyroscope, barometer, The other sensors such as hygrometer, thermometer, infrared sensor, details are not described herein.

Display unit 106 is for showing information input by user or being supplied to the information of user.Display unit 106 can wrap Display panel 1061 is included, liquid crystal display (Liquid Crystal Display, LCD), Organic Light Emitting Diode may be used Forms such as (Organic Light-Emitting Diode, OLED) configure display panel 1061.

User input unit 107 can be used for receiving the number or character information of input, and generate the use with mobile terminal Family is arranged and the related key signals input of function control.Specifically, user input unit 107 may include touch panel 1071 with And other input equipments 1072.Touch panel 1071, also referred to as touch screen collect user on it or neighbouring touch operation (for example user uses any suitable objects or attachment such as finger, stylus on touch panel 1071 or in touch panel 1071 Neighbouring operation), and corresponding attachment device is driven according to preset formula.Touch panel 1071 may include touch detection Two parts of device and touch controller.Wherein, the touch orientation of touch detecting apparatus detection user, and detect touch operation band The signal come, transmits a signal to touch controller；Touch controller receives touch information from touch detecting apparatus, and by it It is converted into contact coordinate, then gives processor 110, and order that processor 110 is sent can be received and executed.In addition, can To realize touch panel 1071 using multiple types such as resistance-type, condenser type, infrared ray and surface acoustic waves.In addition to touch panel 1071, user input unit 107 can also include other input equipments 1072.Specifically, other input equipments 1072 can wrap It includes but is not limited in physical keyboard, function key (such as volume control button, switch key etc.), trace ball, mouse, operating lever etc. It is one or more, do not limit herein specifically.

Further, touch panel 1071 can cover display panel 1061, when touch panel 1071 detect on it or After neighbouring touch operation, processor 110 is sent to determine the type of touch event, is followed by subsequent processing device 110 according to touch thing The type of part provides corresponding visual output on display panel 1061.Although in Fig. 1, touch panel 1071 and display panel 1061 be to realize the function that outputs and inputs of mobile terminal as two independent components, but in certain embodiments, can The function that outputs and inputs of mobile terminal is realized so that touch panel 1071 and display panel 1061 is integrated, is not done herein specifically It limits.

Interface unit 108 be used as at least one external device (ED) connect with mobile terminal 100 can by interface.For example, External device (ED) may include wired or wireless headphone port, external power supply (or battery charger) port, wired or nothing Line data port, memory card port, the port for connecting the device with identification module, audio input/output (I/O) end Mouth, video i/o port, ear port etc..Interface unit 108 can be used for receiving the input from external device (ED) (for example, number It is believed that breath, electric power etc.) and the input received is transferred to one or more elements in mobile terminal 100 or can be with For the transmission data between mobile terminal 100 and external device (ED).

Memory 109 can be used for storing software program and various data.Memory 109 can include mainly storing program area And storage data field, wherein storing program area can storage program area, application program (such as the sound needed at least one function Sound playing function, image player function etc.) etc.；Storage data field can store according to mobile phone use created data (such as Audio data, phone directory etc.) etc..In addition, memory 109 may include high-speed random access memory, can also include non-easy The property lost memory, a for example, at least disk memory, flush memory device or other volatile solid-state parts.

Processor 110 is the control centre of mobile terminal, utilizes each of various interfaces and the entire mobile terminal of connection A part by running or execute the software program and/or module that are stored in memory 109, and calls and is stored in storage Data in device 109 execute the various functions and processing data of mobile terminal, to carry out integral monitoring to mobile terminal.Place Reason device 110 may include one or more processing units；Preferably, processor 110 can integrate application processor and modulatedemodulate is mediated Manage device, wherein the main processing operation system of application processor, user interface and application program etc., modem processor is main Processing wireless communication.It is understood that above-mentioned modem processor can not also be integrated into processor 110.

Mobile terminal 100 can also include the power supply 111 (such as battery) powered to all parts, it is preferred that power supply 111 Can be logically contiguous by power-supply management system and processor 110, to realize management charging by power-supply management system, put The functions such as electricity and power managed.

Although Fig. 1 is not shown, mobile terminal 100 can also be including bluetooth module etc., and details are not described herein.

Embodiment to facilitate the understanding of the present invention, below to the communications network system that is based on of mobile terminal of the present invention into Row description.

Referring to Fig. 2, Fig. 2 is a kind of communications network system Organization Chart provided in an embodiment of the present invention, the communication network system System is the LTE system of universal mobile communications technology, which includes communicating UE (User Equipment, the use of connection successively Family equipment) (the lands Evolved UMTS Terrestrial Radio Access Network, evolved UMTS 201, E-UTRAN Ground wireless access network) 202, EPC (Evolved Packet Core, evolved packet-based core networks) 203 and operator IP operation 204。

Specifically, UE201 can be above-mentioned terminal 100, and details are not described herein again.

E-UTRAN202 includes eNodeB2021 and other eNodeB2022 etc..Wherein, eNodeB2021 can be by returning Journey (backhaul) (such as X2 interface) is connect with other eNodeB2022, and eNodeB2021 is connected to EPC203, ENodeB2021 can provide the access of UE201 to EPC203.

EPC203 may include MME (Mobility Management Entity, mobility management entity) 2031, HSS (Home Subscriber Server, home subscriber server) 2032, other MME2033, SGW (Serving Gate Way, Gateway) 2034, PGW (PDN Gate Way, grouped data network gateway) 2035 and PCRF (Policy and Charging Rules Function, policy and rate functional entity) 2036 etc..Wherein, MME2031 be processing UE201 and The control node of signaling, provides carrying and connection management between EPC203.HSS2032 is all to manage for providing some registers Such as the function of home location register (not shown) etc, and some are preserved in relation to use such as service features, data rates The dedicated information in family.All customer data can be sent by SGW2034, and PGW2035 can provide the IP of UE 201 Address is distributed and other functions, and PCRF2036 is strategy and the charging control strategic decision-making of business data flow and IP bearing resources Point, it selects and provides available strategy and charging control decision with charge execution function unit (not shown) for strategy.

IP operation 204 may include internet, Intranet, IMS (IP Multimedia Subsystem, IP multimedia System) or other IP operations etc..

Although above-mentioned be described by taking LTE system as an example, those skilled in the art it is to be understood that the present invention not only Suitable for LTE system, be readily applicable to other wireless communication systems, such as GSM, CDMA2000, WCDMA, TD-SCDMA with And the following new network system etc., it does not limit herein.

Based on above-mentioned mobile terminal hardware configuration and communications network system, each embodiment of the method for the present invention is proposed.

Embodiment one

Fig. 3 is the flow chart of voice input control method first embodiment of the present invention.A kind of voice input control method, should Method includes：

S1, the input operation of the voice in current interactive interface is triggered by speech-input instructions；

S2, operation acquisition voice messaging is inputted by the voice；

The input demand of S3, the identification interactive interface, wherein the input demand includes voice input demand and word Input demand；

S4, the voice messaging transmission operation is executed by voice input demand, alternatively, inputting demand by the word The voice messaging is converted into text information and executes transmission operation.

In the present embodiment, first, the input operation of the voice in current interactive interface is triggered by speech-input instructions；So Afterwards, operation is inputted by the voice and obtains voice messaging；Subsequently, the input demand of the interactive interface is identified, wherein institute It includes voice input demand and word input demand to state input demand；Finally, the voice is executed by voice input demand Information sends operation, alternatively, the voice messaging is converted to text information by word input demand and executes transmission behaviour Make.

Specifically, the voice control scheme that the present invention is implemented is suitable for the smart machines such as smart mobile phone, tablet computer, In the present embodiment, by taking cell phone apparatus as an example, first, voice messaging is enrolled and parses, cell phone apparatus has the recording groups such as microphone Part obtains extraneous audio-frequency information by the component of recording such as microphone, by the caching component of cell phone apparatus to audio-frequency information into Row caching, then, parses the voice messaging of caching by preset algorithm via processor.It is understood that passing through The external audio frequency information that the recording component such as microphone of cell phone apparatus obtains includes the voice messaging of user and other environmental noises, Before being parsed to voice messaging, if environmental noise is more than certain threshold value, noise reduction process is carried out to it first, then again Parsing operation is carried out to it.

Specifically, in the present embodiment, the input operation of the voice in current interactive interface is triggered by speech-input instructions.When Preceding interactive interface can be the information transmit-receive interface of system, message session interface, can also be information transmit-receive circle of application program Face, message session interface.For example, the physical button for opening voice control is arranged in the side key in terminal device, by this Physical button opens voice control.

Further, the physical button for opening voice control is set in the side key of terminal device, passes through the physics Press switch to open voice control, when user pins the physical button, voice input operation is opened, when user unclamps the physical button When, terminate voice input operation.

Further, in the apex zone of terminal device or bottom zone or rear surface regions or front surface region Physical button for opening voice control is set, by the physical button open voice control, when user pin the physics by When key, voice input operation is opened, when user unclamps the physical button, terminates voice input operation.

Further include that there is word, voice input demand to answer it is understood that this programme is not limited to message session interface With scene, for example, voice assistant, Voice Navigation etc..

In the present embodiment, after triggering the input operation of the voice in current interactive interface by speech-input instructions, pass through The voice input operation obtains voice messaging.Likewise, as above inputting and operating by voice described in example, start voice messaging Obtain operation, it is to be understood that voice input operation can be directed to the touch command of touch scheme, or be directed to In the physical button instruction of pressing scheme, specifically, by the predeterminable area of lasting touch-control interactive interface, with lasting acquisition voice Information, alternatively, by the preset physical button of Continued depression, with lasting acquisition voice messaging.Further, by from beginning to end twice The predeterminable area of touch-control interactive interface, to obtain the voice messaging in the period, alternatively, preset by pressing twice from beginning to end Physical button, to obtain the voice messaging in the period.

Specifically, in the present embodiment, after inputting operation acquisition voice messaging by the voice, identifying the interaction The input demand at interface, wherein the input demand includes voice input demand and word input demand.Wherein, this programme institute The voice input demand stated refers under current state, and it is voice messaging that this time voice input, which operates corresponding information format, equally , the word input demand described in this programme refers under current state, and it is text that this time voice input, which operates corresponding information format, Word information.For example, temporally sequential arrangement shows dialogue entries with the dialog interface of contact person, it is to be understood that directly The voice messaging of input is sent, corresponding dialogue entries are voice messagings, directly transmit the text information of input, corresponding dialogue Entry is text information.

In the present embodiment, this input demand is determined by the information type of the last item dialogue entries, that is, finally The information type of one dialogue entries is text information, then corresponding, and the input demand for recognizing interactive interface at this time is word Input demand；

Further, this input demand is determined by the information type of the last item dialogue entries, that is, the last item The information type of dialogue entries is voice messaging, then corresponding, recognizes the input demand of interactive interface at this time and is inputted for voice Demand；

Further, the information type of the last item dialogue entries sent by user determines that this input needs It asks, it is that is, the information type of the last item dialogue entries is voice messaging, then corresponding, recognize the defeated of interactive interface at this time It is that voice inputs demand to enter demand, corresponding if the information type of the last item dialogue entries is text information, recognizes this When interactive interface input demand be word input demand.

In the present embodiment, the input demand of the interactive interface is identified, wherein the input demand includes voice input Demand and word input demand；Later, it executes the voice messaging by voice input demand and sends operation, alternatively, pressing institute Word input demand is stated the voice messaging is converted to text information and executes transmission operation.Likewise, as above being pressed described in example The information type of the last item dialogue entries determines this input demand, that is, the information type of the last item dialogue entries It is text information, then corresponding, the input demand for recognizing interactive interface at this time is that word inputs demand, then, by the text Word inputs demand and the voice messaging is converted to text information and executes transmission operation.Alternatively, user's transmission is last The information type of one dialogue entries determines this input demand, if the information type of the last item dialogue entries is word letter Breath, then corresponding, the input demand for recognizing interactive interface at this time is that word inputs demand, and then, being inputted by the word needs It asks and the voice messaging is converted into text information and executes transmission operation.

The advantageous effect of the present embodiment is, the input behaviour of the voice in current interactive interface is triggered by speech-input instructions Make；Then, operation is inputted by the voice and obtains voice messaging；Subsequently, the input demand of the interactive interface is identified, In, the input demand includes voice input demand and word input demand；Finally, by described in voice input demand execution Voice messaging sends operation, alternatively, the voice messaging is converted to text information by word input demand and executes hair Send operation.Realizing a kind of voice input control scheme of hommization so that user can quickly carry out voice input operation, Meanwhile being adaptively adjusted, switching the voice messaging after being inputted by voice or text information, it eliminates user and is inputted in word With the handover operation of voice input, the globality and adaptability of voice input are larger improved.

Embodiment two

Fig. 4 is the flow chart of voice input control method second embodiment of the present invention, is based on above-described embodiment, described by language Voice input in the sound input current interactive interface of instruction triggers, which operates, includes：

S11, dialog region and input area are shown in the interactive interface；

S12, the input area is activated by the speech-input instructions, and shows institute's predicate in the input area The status information of sound input.

In the present embodiment, first, dialog region and input area are shown in the interactive interface；Then, pass through institute It states speech-input instructions and activates the input area, and show the status information of the voice input in the input area.

Further, the predeterminable area for continuing touch-control interactive interface is shown by status information, is believed with lasting acquisition voice Breath；

Further, the preset physical button of Continued depression is shown by status information, with lasting acquisition voice messaging；

Further, the predeterminable area that touch-control interactive interface twice is shown from beginning to end by status information, to obtain the time Voice messaging in section；

Further, preset physical button is pressed twice by status information display head and the tail, to obtain in the period Voice messaging.

The advantageous effect of the present embodiment is, by showing dialog region and input area in the interactive interface；So Afterwards, the input area is activated by the speech-input instructions, and shows the voice input in the input area Status information, to realize that a kind of voice input control scheme of hommization provides environmental basis and conditioned basic so that user Voice input operation can be quickly carried out, meanwhile, it is adaptively adjusted, switches the voice messaging after being inputted by voice or text Word information eliminates user in the handover operation of word input and voice input, larger improves the globality of voice input And adaptability.

Embodiment three

Fig. 5 is the flow chart of voice input control method 3rd embodiment of the present invention, is based on above-described embodiment, described to pass through The voice input operation obtains voice messaging and includes：

S21, the voice messaging, and status information described in real-time update are obtained；

S22, the voice messaging for caching the acquisition.

In the present embodiment, first, the voice messaging, and status information described in real-time update are obtained；Then, institute is cached State the voice messaging of acquisition.

Further, the predeterminable area for continuing touch-control interactive interface is shown by newer status information, with lasting acquisition Voice messaging；

Further, the preset physical button of Continued depression is shown by newer status information, with lasting acquisition voice Information；

Further, the predeterminable area that touch-control interactive interface twice is shown from beginning to end by newer status information, to obtain Voice messaging in the period；

Further, preset physical button is pressed twice by newer status information display head and the tail, when obtaining this Between voice messaging in section.

The advantageous effect of the present embodiment is, by obtaining the voice messaging, and status information described in real-time update；So Afterwards, the voice messaging for caching the acquisition, for realize a kind of voice input control scheme of hommization provide environmental basis and Conditioned basic so that user can quickly carry out voice input operation, meanwhile, it is adaptively adjusted, switches and inputted by voice Voice messaging afterwards or text information eliminate user in the handover operation of word input and voice input, larger improve The globality and adaptability of voice input.

Example IV

Fig. 6 is the flow chart of voice input control method fourth embodiment of the present invention, is based on above-described embodiment, the identification The input demand of the interactive interface, wherein the input demand includes voice input demand and word input demand includes：

S31, the dialog region and the input area are detected；

S32, judge the dialog region and the input demand of the input area, wherein the input demand includes language Sound inputs demand and word inputs demand；

If the last item information in S33, the dialog region is voice messaging or the input area is that voice is defeated Enter state, it is determined that demand is inputted for voice, if the last item information in the dialog region is text information or described Input area is word input state, it is determined that inputs demand for word.

In the present embodiment, first, the dialog region and the input area are detected；Then, judge the dialog region The input demand in domain and the input area, wherein the input demand includes voice input demand and word input demand；Most Afterwards, if the last item information in the dialog region is voice messaging or the input area is voice input state, It is determined as voice input demand, if the last item information in the dialog region is text information or the input area For word input state, it is determined that input demand for word.

Further, the information type of the last item dialogue entries sent by user determines that this input needs It asks, it is that is, the information type of the last item dialogue entries is voice messaging, then corresponding, recognize the defeated of interactive interface at this time It is that voice inputs demand to enter demand, corresponding if the information type of the last item dialogue entries is text information, recognizes this When interactive interface input demand be word input demand；

Further, by (for example, in ten minutes) in the certain predetermined time, in all dialogue entries, voice messaging dialogue The accounting of number of entries or text information dialogue entries quantity determines current input demand, that is, larger by quantity accounting As current input demand；

Further, by (for example, in nearest ten dialogue entries) in certain historical bar mesh number, in all dialogue entries, The accounting of voice messaging dialogue entries quantity or text information dialogue entries quantity determines current input demand, that is, pressing Quantity accounting it is larger as current input demand.

The advantageous effect of the present embodiment is, by detecting the dialog region and the input area；Then, judge institute State dialog region and the input demand of the input area, wherein the input demand includes that voice input demand and word are defeated Enter demand；Finally, if the last item information in the dialog region is voice messaging or the input area is that voice is defeated Enter state, it is determined that demand is inputted for voice, if the last item information in the dialog region is text information or described Input area is word input state, it is determined that demand is inputted for word, to realize a kind of voice input control side of hommization Case provides environmental basis and conditioned basic so that and user can quickly carry out voice input operation, meanwhile, adaptively adjust Whole, switching inputted by voice after voice messaging or text information, eliminate user and inputted in word and cut with what voice inputted Operation is changed, the globality and adaptability of voice input are larger improved.

Embodiment five

Fig. 7 is the flow chart of the 5th embodiment of voice input control method of the present invention, is based on above-described embodiment, described to press institute Predicate sound inputs demand and executes the voice messaging transmission operation, alternatively, pressing word input demand by the voice messaging It is converted to text information and executes transmission operation and include：

S41, the input demand fresh information for recording the input area；

S42, input demand next time is determined according to the input demand fresh information, and is executed by the input demand The voice messaging sends operation or the text information sends operation.

In the present embodiment, first, the input demand fresh information of the input area is recorded；Then, according to described defeated Enter demand fresh information and determine input demand next time, and by the input demand execute the voice messaging send operation or Text information described in person sends operation.

It is understood that the input demand fresh information of the input area of the present embodiment is under current input demand Input demand next time is then updated to text information by input information for example, current input demand is text information.

The advantageous effect of the present embodiment is, by the input demand fresh information for recording the input area；Then, root Input demand next time is determined according to the input demand fresh information, and executes the voice messaging hair by the input demand It send operation or the text information to send operation, realizes a kind of voice input control scheme of hommization so that Yong Huke Quickly to carry out voice input operation, meanwhile, it is adaptively adjusted, switches the voice messaging after being inputted by voice or word Information, eliminate user word input with voice input handover operation, larger improve voice input globality and Adaptability.

Embodiment six

Based on above-described embodiment, the invention also discloses a kind of voice input control apparatus, which includes memory, place It manages device and is stored in the computer program that can be run on the memory and on the processor, the computer program is by institute It states when processor executes and realizes：

Operation, which is inputted, by the voice obtains voice messaging；

Embodiment seven

Based on above-described embodiment, optionally, the computer program is realized when also being executed by the processor：

Dialog region and input area are shown in the interactive interface；

Embodiment eight

Cache the voice messaging of the acquisition.

Embodiment nine

Detect the dialog region and the input area；

Record the input demand fresh information of the input area；

Embodiment ten

Based on above-described embodiment, the invention also provides a kind of computer readable storage medium, the computer-readable storages It is stored with voice input control program on medium, is realized such as any of the above-described institute when voice input control program is executed by processor The step of voice input control method stated.

It should be noted that herein, the terms "include", "comprise" or its any other variant are intended to non-row His property includes, so that process, method, article or device including a series of elements include not only those elements, and And further include other elements that are not explicitly listed, or further include for this process, method, article or device institute it is intrinsic Element.In the absence of more restrictions, the element limited by sentence "including a ...", it is not excluded that including this There is also other identical elements in the process of element, method, article or device.

The embodiments of the present invention are for illustration only, can not represent the quality of embodiment.

Through the above description of the embodiments, those skilled in the art can be understood that above-described embodiment side Method can add the mode of required general hardware platform to realize by software, naturally it is also possible to by hardware, but in many cases The former is more preferably embodiment.Based on this understanding, technical scheme of the present invention substantially in other words does the prior art Going out the part of contribution can be expressed in the form of software products, which is stored in a storage medium In (such as ROM/RAM, magnetic disc, CD), including some instructions are used so that a station terminal (can be mobile phone, computer, service Device, air conditioner or network equipment etc.) execute method described in each embodiment of the present invention.

The embodiment of the present invention is described with above attached drawing, but the invention is not limited in above-mentioned specific Embodiment, the above mentioned embodiment is only schematical, rather than restrictive, those skilled in the art Under the inspiration of the present invention, without breaking away from the scope protected by the purposes and claims of the present invention, it can also make very much Form, all of these belong to the protection of the present invention.

Claims

1. a kind of voice input control method, which is characterized in that the method includes：

Operation, which is inputted, by the voice obtains voice messaging；

Identify the input demand of the interactive interface, wherein the input demand, which includes voice input demand and word input, to be needed It asks；

The voice messaging is executed by voice input demand and sends operation, alternatively, will be described by word input demand Voice messaging is converted to text information and executes transmission operation.

2. voice input control method according to claim 1, which is characterized in that described to be worked as by speech-input instructions triggering Voice input in preceding interactive interface, which operates, includes：

Dialog region and input area are shown in the interactive interface；

The input area is activated by the speech-input instructions, and shows the voice input in the input area Status information.

3. voice input control method according to claim 2, which is characterized in that described inputted by the voice operates Obtaining voice messaging includes：

Cache the voice messaging of the acquisition.

4. voice input control method according to claim 3, which is characterized in that described to identify the defeated of the interactive interface Enter demand, wherein the input demand includes voice input demand and word input demand includes：

Detect the dialog region and the input area；

Judge the dialog region and the input demand of the input area, wherein the input demand, which includes voice input, to be needed Word of summing inputs demand；

If the last item information in the dialog region is voice messaging or the input area is voice input state, Then it is determined as voice input demand, if the last item information in the dialog region is text information or the input area Domain is word input state, it is determined that inputs demand for word.

5. voice input control method according to claim 4, which is characterized in that described to be held by voice input demand The row voice messaging sends operation, alternatively, the voice messaging is converted to text information simultaneously by word input demand It executes to send to operate and includes：

Record the input demand fresh information of the input area；

Input demand next time is determined according to the input demand fresh information, and executes the voice by the input demand Information sends operation or the text information sends operation.

6. a kind of voice input control apparatus, which is characterized in that the equipment includes memory, processor and is stored in described deposit It is real when the computer program is executed by the processor on reservoir and the computer program that can run on the processor It is existing：

Operation, which is inputted, by the voice obtains voice messaging；

7. voice input control apparatus according to claim 6, which is characterized in that the computer program is also by the place Reason device is realized when executing：

Dialog region and input area are shown in the interactive interface；

8. voice input control apparatus according to claim 7, which is characterized in that the computer program is also by the place Reason device is realized when executing：

Cache the voice messaging of the acquisition.

9. voice input control apparatus according to claim 8, which is characterized in that the computer program is also by the place Reason device is realized when executing：

Detect the dialog region and the input area；

If the last item information in the dialog region is voice messaging or the input area is voice input state, Then it is determined as voice input demand, if the last item information in the dialog region is text information or the input area Domain is word input state, it is determined that inputs demand for word；

Record the input demand fresh information of the input area；

10. a kind of computer readable storage medium, which is characterized in that it is defeated to be stored with voice on the computer readable storage medium Enter and control program, is realized as described in any one of claim 1 to 5 when the voice input control program is executed by processor The step of voice input control method.