CN108520750A - Voice input control method, device and computer-readable storage medium - Google Patents

Voice input control method, device and computer-readable storage medium

Info

Publication number
CN108520750A
Authority
CN
China
Prior art keywords
input
voice
demand
word
information
Prior art date
Legal status (The legal status is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the status listed.)
Pending
Application number
CN201810202888.5A
Other languages
Chinese (zh)
Inventor
王彦文
Current Assignee (The listed assignees may be inaccurate. Google has not performed a legal analysis and makes no representation or warranty as to the accuracy of the list.)
Nubia Technology Co Ltd
Original Assignee
Nubia Technology Co Ltd
Priority date (The priority date is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the date listed.)
Filing date
Publication date
Application filed by Nubia Technology Co Ltd
Priority to CN201810202888.5A
Publication of CN108520750A

Classifications

    • G: PHYSICS
    • G10: MUSICAL INSTRUMENTS; ACOUSTICS
    • G10L: SPEECH ANALYSIS TECHNIQUES OR SPEECH SYNTHESIS; SPEECH RECOGNITION; SPEECH OR VOICE PROCESSING TECHNIQUES; SPEECH OR AUDIO CODING OR DECODING
    • G10L15/00: Speech recognition
    • G10L15/26: Speech to text systems
    • G: PHYSICS
    • G06: COMPUTING; CALCULATING OR COUNTING
    • G06F: ELECTRIC DIGITAL DATA PROCESSING
    • G06F3/00: Input arrangements for transferring data to be processed into a form capable of being handled by the computer; Output arrangements for transferring data from processing unit to output unit, e.g. interface arrangements
    • G06F3/16: Sound input; Sound output
    • G: PHYSICS
    • G10: MUSICAL INSTRUMENTS; ACOUSTICS
    • G10L: SPEECH ANALYSIS TECHNIQUES OR SPEECH SYNTHESIS; SPEECH RECOGNITION; SPEECH OR VOICE PROCESSING TECHNIQUES; SPEECH OR AUDIO CODING OR DECODING
    • G10L15/00: Speech recognition
    • G10L15/22: Procedures used during a speech recognition process, e.g. man-machine dialogue

Landscapes

  • Engineering & Computer Science (AREA)
  • Physics & Mathematics (AREA)
  • Audiology, Speech & Language Pathology (AREA)
  • Health & Medical Sciences (AREA)
  • Human Computer Interaction (AREA)
  • Computational Linguistics (AREA)
  • Theoretical Computer Science (AREA)
  • Acoustics & Sound (AREA)
  • Multimedia (AREA)
  • General Engineering & Computer Science (AREA)
  • General Physics & Mathematics (AREA)
  • General Health & Medical Sciences (AREA)
  • Telephone Function (AREA)

Abstract

The invention discloses a voice input control method, a device, and a computer-readable storage medium. The method includes: triggering a voice input operation in the current interactive interface according to a voice input instruction; then obtaining voice information through the voice input operation; next, identifying the input demand of the interactive interface, wherein the input demand includes a voice input demand and a text input demand; and finally, under the voice input demand, performing a sending operation on the voice information, or, under the text input demand, converting the voice information into text information and performing the sending operation. This provides a user-friendly voice input control scheme: the user can start a voice input operation quickly, and the result of the voice input is adaptively sent as voice information or as text information, which spares the user from switching between text input and voice input and considerably improves the integration and adaptability of voice input.

Description

Voice input control method, device, and computer-readable storage medium
Technical field
The present invention relates to the field of mobile communications, and in particular to a voice input control method, a device, and a computer-readable storage medium.
Background art
In the prior art, as terminal devices become more intelligent, their functions become richer and users frequently use them to process information. In particular, users generally enter data through text input and voice input. In some scenarios, however, frequently switching between the text input mode and the voice input mode has a considerable impact on the user: first, it makes the operating steps cumbersome; second, it reduces the efficiency of data input. The user experience is therefore poor.
Summary of the invention
In the prior art, when a user enters data through text input and voice input, frequently switching between the text input mode and the voice input mode has a considerable impact on the user: it makes the operating steps cumbersome, reduces the efficiency of data input, and degrades the user experience. To address this technical defect, the present invention proposes a voice input control method, which includes:
Triggering a voice input operation in the current interactive interface by means of a voice input instruction;
Obtaining voice information through the voice input operation;
Identifying the input demand of the interactive interface, wherein the input demand includes a voice input demand and a text input demand;
Under the voice input demand, performing a sending operation on the voice information, or, under the text input demand, converting the voice information into text information and performing the sending operation.
Optionally, triggering the voice input operation in the current interactive interface by means of the voice input instruction includes:
Displaying a dialog region and an input region in the interactive interface;
Activating the input region by means of the voice input instruction, and displaying status information of the voice input in the input region.
Optionally, obtaining the voice information through the voice input operation includes:
Obtaining the voice information, and updating the status information in real time;
Caching the obtained voice information.
Optionally, identifying the input demand of the interactive interface, wherein the input demand includes a voice input demand and a text input demand, includes:
Detecting the dialog region and the input region;
Judging the input demand of the dialog region and the input region, wherein the input demand includes a voice input demand and a text input demand;
If the last message in the dialog region is voice information, or the input region is in the voice input state, determining that the input demand is the voice input demand; if the last message in the dialog region is text information, or the input region is in the text input state, determining that the input demand is the text input demand.
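The decision rule above can be sketched as a small function. This is only an illustration: the names `Kind` and `identify_input_demand` are hypothetical, and the text does not say which signal wins when the dialog region and the input region disagree, so the sketch assumes the voice condition is checked first.

```python
from enum import Enum
from typing import Optional


class Kind(Enum):
    VOICE = "voice"
    TEXT = "text"


def identify_input_demand(last_message: Optional[Kind],
                          input_region_state: Optional[Kind]) -> Optional[Kind]:
    """Return the input demand, or None when neither region gives a signal."""
    # Voice demand: the last dialog message is voice OR the input region
    # is in the voice input state. Checking voice first is an assumption.
    if last_message is Kind.VOICE or input_region_state is Kind.VOICE:
        return Kind.VOICE
    # Text demand: the last dialog message is text OR the input region
    # is in the text input state.
    if last_message is Kind.TEXT or input_region_state is Kind.TEXT:
        return Kind.TEXT
    return None
```

With this ordering, a text message followed by an input region in the voice state resolves to the voice input demand; a different precedence would be an equally valid reading of the rule.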
Optionally, performing the sending operation on the voice information under the voice input demand, or converting the voice information into text information and performing the sending operation under the text input demand, includes:
Recording input demand update information of the input region;
Determining the next input demand according to the input demand update information, and, according to that input demand, performing the sending operation on the voice information or the sending operation on the text information.
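One way to read the two steps above is as a small state holder that logs each change of input demand in the input region and answers with the demand to use next time. The class `DemandTracker`, its method names, and the "most recent update wins" policy are assumptions made for this sketch; the text does not specify how the update information is stored or evaluated.

```python
from typing import List


class DemandTracker:
    """Keeps a log of input demand updates observed in the input region."""

    def __init__(self, default: str = "voice") -> None:
        # Assumption: fall back to a default demand before any update is seen.
        self._default = default
        self._updates: List[str] = []

    def record_update(self, demand: str) -> None:
        """Record one input demand update ("voice" or "text")."""
        if demand not in ("voice", "text"):
            raise ValueError(f"unknown demand: {demand!r}")
        self._updates.append(demand)

    def next_demand(self) -> str:
        """Return the demand for the next voice input operation."""
        # Assumption: the most recent recorded update decides.
        return self._updates[-1] if self._updates else self._default
```

For example, after `record_update("text")` the tracker returns `"text"` from `next_demand()` until another update is recorded.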
The invention also discloses a voice input control device, which includes a memory, a processor, and a computer program stored in the memory and executable on the processor, wherein the computer program, when executed by the processor, implements:
Triggering a voice input operation in the current interactive interface by means of a voice input instruction;
Obtaining voice information through the voice input operation;
Identifying the input demand of the interactive interface, wherein the input demand includes a voice input demand and a text input demand;
Under the voice input demand, performing a sending operation on the voice information, or, under the text input demand, converting the voice information into text information and performing the sending operation.
Optionally, the computer program, when executed by the processor, further implements:
Displaying a dialog region and an input region in the interactive interface;
Activating the input region by means of the voice input instruction, and displaying status information of the voice input in the input region.
Optionally, the computer program, when executed by the processor, further implements:
Obtaining the voice information, and updating the status information in real time;
Caching the obtained voice information.
Optionally, the computer program, when executed by the processor, further implements:
Detecting the dialog region and the input region;
Judging the input demand of the dialog region and the input region, wherein the input demand includes a voice input demand and a text input demand;
If the last message in the dialog region is voice information, or the input region is in the voice input state, determining that the input demand is the voice input demand; if the last message in the dialog region is text information, or the input region is in the text input state, determining that the input demand is the text input demand;
Recording input demand update information of the input region;
Determining the next input demand according to the input demand update information, and, according to that input demand, performing the sending operation on the voice information or the sending operation on the text information.
The invention also provides a computer-readable storage medium on which a voice input control program is stored; when executed by a processor, the voice input control program implements the steps of the voice input control method described in any of the above.
By implementing the voice input control method, device, and computer-readable storage medium of the present invention, a voice input operation in the current interactive interface is triggered by means of a voice input instruction; then voice information is obtained through the voice input operation; next, the input demand of the interactive interface is identified, wherein the input demand includes a voice input demand and a text input demand; and finally, under the voice input demand, a sending operation is performed on the voice information, or, under the text input demand, the voice information is converted into text information and the sending operation is performed. This provides a user-friendly voice input control scheme: the user can start a voice input operation quickly, and the result of the voice input is adaptively sent as voice information or as text information, which spares the user from switching between text input and voice input and considerably improves the integration and adaptability of voice input.
Description of the drawings
The present invention is further described below with reference to the accompanying drawings and embodiments. In the drawings:
Fig. 1 is a schematic diagram of the hardware structure of a mobile terminal of the present invention;
Fig. 2 is an architecture diagram of a communication network system provided by an embodiment of the present invention;
Fig. 3 is a flowchart of a first embodiment of the voice input control method of the present invention;
Fig. 4 is a flowchart of a second embodiment of the voice input control method of the present invention;
Fig. 5 is a flowchart of a third embodiment of the voice input control method of the present invention;
Fig. 6 is a flowchart of a fourth embodiment of the voice input control method of the present invention;
Fig. 7 is a flowchart of a fifth embodiment of the voice input control method of the present invention.
Detailed description
It should be understood that the specific embodiments described herein are intended only to explain the present invention and not to limit it.
In the following description, suffixes such as "module", "component", or "unit" used to denote elements serve only to facilitate the explanation of the present invention and have no specific meaning in themselves. Therefore, "module", "component", and "unit" may be used interchangeably.
A terminal can be implemented in various forms. For example, the terminals described in the present invention may include mobile terminals such as mobile phones, tablet computers, laptops, palmtop computers, personal digital assistants (PDA), portable media players (PMP), navigation devices, wearable devices, smart bracelets, and pedometers, as well as fixed terminals such as digital TVs and desktop computers.
The following description takes a mobile terminal as an example. Those skilled in the art will understand that, apart from elements used specifically for mobile purposes, the construction according to the embodiments of the present invention can also be applied to fixed terminals.
Referring to Fig. 1, which is a schematic diagram of the hardware structure of a mobile terminal for implementing the embodiments of the present invention, the mobile terminal 100 may include components such as an RF (Radio Frequency) unit 101, a WiFi module 102, an audio output unit 103, an A/V (audio/video) input unit 104, a sensor 105, a display unit 106, a user input unit 107, an interface unit 108, a memory 109, a processor 110, and a power supply 111. Those skilled in the art will understand that the mobile terminal structure shown in Fig. 1 does not limit the mobile terminal; the mobile terminal may include more or fewer components than shown, combine certain components, or arrange the components differently.
The components of the mobile terminal are described in detail below with reference to Fig. 1:
The radio frequency unit 101 can be used to receive and send signals while information is sent and received or during a call. Specifically, it receives downlink information from a base station and passes it to the processor 110 for processing, and it sends uplink data to the base station. In general, the radio frequency unit 101 includes, but is not limited to, an antenna, at least one amplifier, a transceiver, a coupler, a low-noise amplifier, a duplexer, and the like. In addition, the radio frequency unit 101 can communicate with the network and other devices through wireless communication. The wireless communication can use any communication standard or protocol, including but not limited to GSM (Global System for Mobile Communications), GPRS (General Packet Radio Service), CDMA2000 (Code Division Multiple Access 2000), WCDMA (Wideband Code Division Multiple Access), TD-SCDMA (Time Division-Synchronous Code Division Multiple Access), FDD-LTE (Frequency Division Duplexing-Long Term Evolution), and TDD-LTE (Time Division Duplexing-Long Term Evolution).
WiFi is a short-range wireless transmission technology. Through the WiFi module 102, the mobile terminal can help the user send and receive e-mail, browse web pages, and access streaming video; it provides the user with wireless broadband Internet access. Although Fig. 1 shows the WiFi module 102, it is understood that it is not an essential part of the mobile terminal and can be omitted as needed without changing the essence of the invention.
When the mobile terminal 100 is in a mode such as a call signal reception mode, a call mode, a recording mode, a speech recognition mode, or a broadcast reception mode, the audio output unit 103 can convert audio data received by the radio frequency unit 101 or the WiFi module 102, or stored in the memory 109, into an audio signal and output it as sound. The audio output unit 103 can also provide audio output related to a specific function performed by the mobile terminal 100 (for example, a call signal reception sound or a message reception sound). The audio output unit 103 may include a speaker, a buzzer, and the like.
The A/V input unit 104 is used to receive audio or video signals. The A/V input unit 104 may include a graphics processing unit (GPU) 1041 and a microphone 1042. The graphics processor 1041 processes image data of still pictures or video obtained by an image capture device (such as a camera) in a video capture mode or an image capture mode. The processed image frames can be displayed on the display unit 106. The image frames processed by the graphics processor 1041 can be stored in the memory 109 (or another storage medium) or sent via the radio frequency unit 101 or the WiFi module 102. The microphone 1042 can receive sound (audio data) in operating modes such as a telephone call mode, a recording mode, and a speech recognition mode, and can process such sound into audio data. In the telephone call mode, the processed audio (voice) data can be converted into a format that can be sent to a mobile communication base station via the radio frequency unit 101 and then output. The microphone 1042 can implement various types of noise cancellation (or suppression) algorithms to cancel (or suppress) noise or interference generated while receiving and sending audio signals.
The mobile terminal 100 further includes at least one sensor 105, such as an optical sensor, a motion sensor, and other sensors. Specifically, the optical sensor includes an ambient light sensor and a proximity sensor: the ambient light sensor can adjust the brightness of the display panel 1061 according to the ambient light, and the proximity sensor can turn off the display panel 1061 and/or the backlight when the mobile terminal 100 is moved to the ear. As one kind of motion sensor, the accelerometer can detect the magnitude of acceleration in all directions (generally three axes) and can detect the magnitude and direction of gravity when stationary; it can be used in applications that recognize the posture of the phone (such as switching between landscape and portrait, related games, and magnetometer attitude calibration) and in vibration-recognition functions (such as a pedometer or tap detection). The mobile phone may also be equipped with other sensors such as a fingerprint sensor, a pressure sensor, an iris sensor, a molecular sensor, a gyroscope, a barometer, a hygrometer, a thermometer, and an infrared sensor, which are not described in detail here.
The display unit 106 is used to display information entered by the user or information provided to the user. The display unit 106 may include a display panel 1061, which may be configured in the form of a liquid crystal display (LCD), an organic light-emitting diode (OLED) display, or the like.
The user input unit 107 can be used to receive entered numeric or character information and to generate key signal input related to the user settings and function control of the mobile terminal. Specifically, the user input unit 107 may include a touch panel 1071 and other input devices 1072. The touch panel 1071, also called a touch screen, collects the user's touch operations on or near it (for example, operations performed by the user on or near the touch panel 1071 with a finger, a stylus, or any other suitable object or accessory) and drives the corresponding connection device according to a preset program. The touch panel 1071 may include two parts: a touch detection device and a touch controller. The touch detection device detects the position of the user's touch, detects the signal produced by the touch operation, and passes the signal to the touch controller; the touch controller receives the touch information from the touch detection device, converts it into contact coordinates, sends them to the processor 110, and can receive and execute commands sent by the processor 110. The touch panel 1071 can be implemented in several types, such as resistive, capacitive, infrared, and surface acoustic wave. Besides the touch panel 1071, the user input unit 107 may also include other input devices 1072. Specifically, the other input devices 1072 may include, but are not limited to, one or more of a physical keyboard, function keys (such as volume control keys and a power key), a trackball, a mouse, and a joystick, which are not specifically limited here.
Further, the touch panel 1071 can cover the display panel 1061. When the touch panel 1071 detects a touch operation on or near it, it passes the operation to the processor 110 to determine the type of touch event, and the processor 110 then provides the corresponding visual output on the display panel 1061 according to the type of touch event. Although in Fig. 1 the touch panel 1071 and the display panel 1061 are two independent components that implement the input and output functions of the mobile terminal, in some embodiments the touch panel 1071 and the display panel 1061 can be integrated to implement the input and output functions of the mobile terminal, which is not specifically limited here.
The interface unit 108 serves as an interface through which at least one external device can be connected to the mobile terminal 100. For example, the external device may include a wired or wireless headset port, an external power supply (or battery charger) port, a wired or wireless data port, a memory card port, a port for connecting a device having an identification module, an audio input/output (I/O) port, a video I/O port, an earphone port, and the like. The interface unit 108 can be used to receive input (for example, data information or power) from an external device and pass the received input to one or more elements in the mobile terminal 100, or to transfer data between the mobile terminal 100 and the external device.
The memory 109 can be used to store software programs and various data. The memory 109 may mainly include a program storage area and a data storage area. The program storage area can store the operating system and the application programs required by at least one function (such as a sound playback function or an image playback function); the data storage area can store data created through use of the mobile phone (such as audio data and a phone book). In addition, the memory 109 may include high-speed random access memory and may also include non-volatile memory, for example at least one magnetic disk storage device, a flash memory device, or another volatile solid-state storage device.
The processor 110 is the control center of the mobile terminal. It connects the parts of the entire mobile terminal through various interfaces and lines, and it performs the various functions of the mobile terminal and processes data by running or executing the software programs and/or modules stored in the memory 109 and calling the data stored in the memory 109, thereby monitoring the mobile terminal as a whole. The processor 110 may include one or more processing units. Preferably, the processor 110 can integrate an application processor and a modem processor, where the application processor mainly handles the operating system, the user interface, and application programs, and the modem processor mainly handles wireless communication. It is understood that the modem processor need not be integrated into the processor 110.
The mobile terminal 100 may also include a power supply 111 (such as a battery) that powers the components. Preferably, the power supply 111 can be logically connected to the processor 110 through a power management system, so that functions such as charging management, discharging management, and power consumption management are implemented through the power management system.
Although not shown in Fig. 1, the mobile terminal 100 may also include a Bluetooth module and the like, which are not described in detail here.
To facilitate understanding of the embodiments of the present invention, the communication network system on which the mobile terminal of the present invention is based is described below.
Referring to Fig. 2, Fig. 2 is an architecture diagram of a communication network system provided by an embodiment of the present invention. The communication network system is an LTE system of the universal mobile telecommunications technology. The LTE system includes, communicatively connected in sequence, a UE (User Equipment) 201, an E-UTRAN (Evolved UMTS Terrestrial Radio Access Network) 202, an EPC (Evolved Packet Core) 203, and the operator's IP services 204.
Specifically, the UE 201 can be the terminal 100 described above, which is not described again here.
The E-UTRAN 202 includes an eNodeB 2021, other eNodeBs 2022, and so on. The eNodeB 2021 can be connected to the other eNodeBs 2022 through a backhaul (for example, an X2 interface); the eNodeB 2021 is connected to the EPC 203 and can provide the UE 201 with access to the EPC 203.
The EPC 203 may include an MME (Mobility Management Entity) 2031, an HSS (Home Subscriber Server) 2032, other MMEs 2033, an SGW (Serving Gateway) 2034, a PGW (PDN Gateway, packet data network gateway) 2035, a PCRF (Policy and Charging Rules Function) 2036, and so on. The MME 2031 is the control node that processes signaling between the UE 201 and the EPC 203 and provides bearer and connection management. The HSS 2032 provides registers to manage functions such as a home location register (not shown) and stores user-specific information such as service features and data rates. All user data can be sent through the SGW 2034. The PGW 2035 can provide IP address allocation for the UE 201 and other functions. The PCRF 2036 is the policy and charging control decision point for service data flows and IP bearer resources; it selects and provides available policy and charging control decisions for the policy and charging enforcement function unit (not shown).
The IP services 204 may include the Internet, an intranet, an IMS (IP Multimedia Subsystem), or other IP services.
Although the above description takes the LTE system as an example, those skilled in the art should understand that the present invention is applicable not only to the LTE system but also to other wireless communication systems, such as GSM, CDMA2000, WCDMA, TD-SCDMA, and future new network systems, which are not limited here.
The method embodiments of the present invention are proposed on the basis of the mobile terminal hardware structure and the communication network system described above.
Embodiment one
Fig. 3 is a flowchart of the first embodiment of the voice input control method of the present invention. The voice input control method includes:
S1, triggering a voice input operation in the current interactive interface by means of a voice input instruction;
S2, obtaining voice information through the voice input operation;
S3, identifying the input demand of the interactive interface, wherein the input demand includes a voice input demand and a text input demand;
S4, under the voice input demand, performing a sending operation on the voice information, or, under the text input demand, converting the voice information into text information and performing the sending operation.
In this embodiment, first, a voice input operation in the current interactive interface is triggered by means of a voice input instruction; then voice information is obtained through the voice input operation; next, the input demand of the interactive interface is identified, wherein the input demand includes a voice input demand and a text input demand; finally, under the voice input demand, a sending operation is performed on the voice information, or, under the text input demand, the voice information is converted into text information and the sending operation is performed.
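The final branching step (S4) can be illustrated with a short sketch. The names `Outgoing` and `handle_voice_input` are hypothetical, and the speech-to-text engine is passed in as a plain callable because the text does not name one; a real implementation would plug in whatever recognizer the device provides.

```python
from dataclasses import dataclass
from typing import Callable


@dataclass
class Outgoing:
    kind: str        # "voice" or "text"
    payload: object  # raw audio bytes, or the transcribed text


def handle_voice_input(audio: bytes,
                       demand: str,
                       transcribe: Callable[[bytes], str]) -> Outgoing:
    """S4: send the audio as-is, or convert it to text before sending."""
    if demand == "voice":
        # Voice input demand: the captured audio is sent unchanged.
        return Outgoing("voice", audio)
    if demand == "text":
        # Text input demand: convert to text first, then send the text.
        return Outgoing("text", transcribe(audio))
    raise ValueError(f"unknown input demand: {demand!r}")
```

Calling `handle_voice_input(audio, "text", some_recognizer)` returns a text message, while the same audio with `"voice"` is forwarded untouched, so the user never switches input modes by hand.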
Specifically, the voice control scheme implemented by the present invention is suitable for smart devices such as smartphones and tablet computers. This embodiment takes a mobile phone as an example. First, voice information is recorded and parsed: the mobile phone has a recording component such as a microphone, obtains external audio information through that component, and caches the audio information in a cache component of the mobile phone; the processor then parses the cached voice information using a preset algorithm. It is understood that the external audio information obtained through the recording component of the mobile phone contains both the user's voice information and other environmental noise. Before the voice information is parsed, if the environmental noise exceeds a certain threshold, noise reduction is applied first, and the parsing operation is performed afterwards.
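The threshold-gated noise reduction described above might look like the following sketch. How the environmental noise level is measured is not specified in the text; the sketch assumes it is estimated as the RMS amplitude of a separate noise-only segment, and the functions `rms` and `prepare_for_parsing` as well as the pluggable `denoise` callable are hypothetical.

```python
import math
from typing import Callable, List, Sequence


def rms(samples: Sequence[float]) -> float:
    """Root-mean-square amplitude; 0.0 for an empty segment."""
    if not samples:
        return 0.0
    return math.sqrt(sum(s * s for s in samples) / len(samples))


def prepare_for_parsing(samples: Sequence[float],
                        noise_samples: Sequence[float],
                        threshold: float,
                        denoise: Callable[[Sequence[float]], List[float]]) -> List[float]:
    """Apply noise reduction only when the noise estimate exceeds the threshold."""
    # Assumption: noise_samples is a noise-only segment (for example, audio
    # captured just before the user starts speaking).
    if rms(noise_samples) > threshold:
        return denoise(samples)
    # Quiet environment: pass the cached audio to the parser unchanged.
    return list(samples)
```

The result is what would then be handed to the parsing step; the denoising algorithm itself is left open here, as it is in the text.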
Specifically, in this embodiment, a voice input operation in the current interactive interface is triggered by means of a voice input instruction. The current interactive interface can be an information sending and receiving interface or a message session interface of the system, or an information sending and receiving interface or a message session interface of an application program. For example, a physical button for enabling voice control is provided among the side keys of the terminal device, and voice control is enabled through this physical button.
Further, a physical button for enabling voice control is provided among the side keys of the terminal device, and voice control is enabled through this physical button: when the user presses and holds the physical button, the voice input operation starts, and when the user releases the physical button, the voice input operation ends.
Further, a physical button for enabling voice control is provided in the top region, bottom region, back region, or front region of the terminal device, and voice control is enabled through this physical button: when the user presses and holds the physical button, the voice input operation starts, and when the user releases the physical button, the voice input operation ends.
It is understood that this scheme is not limited to message session interfaces; it also covers other application scenarios that have text and voice input demands, for example a voice assistant or voice navigation.
In this embodiment, after the voice input operation in the current interactive interface is triggered by means of the voice input instruction, voice information is obtained through the voice input operation. As in the example above, the voice input operation starts the acquisition of voice information. It is understood that the voice input operation may be a touch instruction in a touch-based scheme or a physical button instruction in a press-based scheme. Specifically, voice information is acquired continuously while a preset region of the interactive interface is touched continuously, or while a preset physical button is held down. Alternatively, the preset region of the interactive interface is touched twice, once to start and once to end, to obtain the voice information within that period, or the preset physical button is pressed twice, once to start and once to end, to obtain the voice information within that period.
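The two capture styles described above, holding the control versus pressing it once to start and once to stop, can be modeled as a small state machine. The class `VoiceCapture` and the mode names `"hold"` and `"toggle"` are hypothetical labels chosen for this sketch; actual audio recording is omitted and only the recording state is tracked.

```python
class VoiceCapture:
    """Tracks whether voice information is currently being acquired."""

    def __init__(self, mode: str = "hold") -> None:
        if mode not in ("hold", "toggle"):
            raise ValueError(f"unknown mode: {mode!r}")
        self.mode = mode
        self.recording = False

    def on_press(self) -> None:
        if self.mode == "hold":
            # Hold mode: recording lasts as long as the control is held.
            self.recording = True
        else:
            # Toggle mode: the first press starts, the second press stops.
            self.recording = not self.recording

    def on_release(self) -> None:
        if self.mode == "hold":
            self.recording = False
        # In toggle mode, releasing the control changes nothing.
```

The same two handlers serve both a touch region and a physical button, since the text treats them as interchangeable triggers.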
Specifically, in this embodiment, after voice information is obtained through the voice input operation, the input demand of the interactive interface is identified, where the input demand is either a voice input demand or a word input demand. A voice input demand, as used in this solution, means that in the current state the information format corresponding to this voice input operation is voice information; likewise, a word input demand means that in the current state the information format corresponding to this voice input operation is text information. For example, a dialog interface with a contact displays dialogue entries in chronological order. It can be understood that when input voice information is sent directly, the corresponding dialogue entry is voice information, and when input text information is sent directly, the corresponding dialogue entry is text information.
In this embodiment, the current input demand is determined by the information type of the last dialogue entry: if the last dialogue entry is text information, the input demand of the interactive interface is identified as a word input demand;
Further, the current input demand is determined by the information type of the last dialogue entry: if the last dialogue entry is voice information, the input demand of the interactive interface is identified as a voice input demand;
Further, the current input demand is determined by the information type of the last dialogue entry sent by the user: if that entry is voice information, the input demand of the interactive interface is identified as a voice input demand; if that entry is text information, the input demand of the interactive interface is identified as a word input demand.
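The rules above can be sketched as a single lookup over the dialogue entries. The names `InfoType`, `DialogueEntry`, and `infer_input_demand` are hypothetical, and the `default` returned for an empty conversation is an assumption, since the text does not say what happens when there is no previous entry.

```python
from dataclasses import dataclass
from enum import Enum
from typing import Sequence


class InfoType(Enum):
    VOICE = "voice"
    TEXT = "text"


@dataclass(frozen=True)
class DialogueEntry:
    info_type: InfoType
    sent_by_user: bool


def infer_input_demand(
    entries: Sequence[DialogueEntry],
    own_only: bool = False,
    default: InfoType = InfoType.VOICE,  # assumption: not specified in the text
) -> InfoType:
    """Return the demand implied by the last (optionally user-sent) entry."""
    for entry in reversed(entries):
        if own_only and not entry.sent_by_user:
            continue  # skip entries received from the contact
        return entry.info_type
    return default
```

With `own_only=False` the last entry of either party decides; with `own_only=True` only the last entry sent by the user is considered, which matches the third variant above.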
In this embodiment, the input demand of the interactive interface is identified, where the input demand is either a voice input demand or a word input demand. Then, under a voice input demand, the voice information is sent as it is; under a word input demand, the voice information is converted to text information and the text information is sent. As in the example above, when the current input demand is determined by the information type of the last dialogue entry and that entry is text information, the input demand of the interactive interface is identified as a word input demand, so the voice information is converted to text information and sent. Alternatively, when the current input demand is determined by the information type of the last dialogue entry sent by the user and that entry is text information, the input demand is likewise identified as a word input demand, and the voice information is converted to text information and sent.
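The send step can be sketched as a dispatch on the identified demand. This is a minimal sketch under stated assumptions: `build_outgoing` is a hypothetical name, the demand is modeled as the string "voice" or "text", and `transcribe` stands for whatever speech-to-text engine the device uses, which the text does not name.

```python
from typing import Callable, Tuple, Union

Payload = Tuple[str, Union[bytes, str]]


def build_outgoing(
    voice: bytes,
    demand: str,
    transcribe: Callable[[bytes], str],  # placeholder for a speech-to-text engine
) -> Payload:
    """Return the message to send for the identified input demand."""
    if demand == "voice":
        # Voice input demand: send the recorded voice information unchanged
        return ("voice", voice)
    if demand == "text":
        # Word input demand: convert to text information, then send the text
        return ("text", transcribe(voice))
    raise ValueError(f"unknown input demand: {demand!r}")
```

Passing the transcriber as a parameter keeps the dispatch independent of any particular recognition engine.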
The advantageous effect of this embodiment is as follows. The voice input operation in the current interactive interface is triggered by a speech-input instruction; voice information is then obtained through the voice input operation; next, the input demand of the interactive interface is identified, where the input demand is either a voice input demand or a word input demand; finally, under a voice input demand the voice information is sent, or under a word input demand the voice information is converted to text information and sent. This realizes a user-friendly voice input control scheme: the user can perform voice input quickly, the result of a voice input is adaptively sent as voice information or as text information, the user no longer has to switch manually between word input and voice input, and the integrity and adaptability of voice input are considerably improved.
Embodiment two
Fig. 4 is a flowchart of the second embodiment of the voice input control method of the present invention. Based on the above embodiment, triggering the voice input operation in the current interactive interface by the speech-input instruction includes:
S11: displaying a dialog region and an input area in the interactive interface;
S12: activating the input area by the speech-input instruction, and displaying status information of the voice input in the input area.
In this embodiment, a dialog region and an input area are first displayed in the interactive interface; then the input area is activated by the speech-input instruction, and the status information of the voice input is displayed in the input area.
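Steps S11 and S12 can be pictured with a small data model of the interface. The class names, the method `on_speech_input_instruction`, and the status string are all hypothetical; the patent does not prescribe what the status information looks like.

```python
from dataclasses import dataclass, field
from typing import List


@dataclass
class InputArea:
    active: bool = False
    status: str = ""  # status information of the voice input


@dataclass
class InteractiveInterface:
    # S11: the interface holds a dialog region and an input area
    dialog_region: List[str] = field(default_factory=list)
    input_area: InputArea = field(default_factory=InputArea)

    def on_speech_input_instruction(self) -> None:
        # S12: activate the input area and display the voice input status
        self.input_area.active = True
        self.input_area.status = "Voice input: listening"  # illustrative text
```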
Specifically, in this embodiment, the voice input operation in the current interactive interface is triggered by a speech-input instruction. The current interactive interface may be a messaging interface or message session interface of the system, or a messaging interface or message session interface of an application. For example, a physical button for starting voice control is provided on a side key of the terminal device, and voice control is started through this physical button.
Further, a physical button for starting voice control is provided on a side key of the terminal device, and voice control is started through this button: when the user holds the button down, the voice input operation starts; when the user releases the button, the voice input operation ends.
Further, a physical button for starting voice control is provided in the top region, bottom region, back region, or front region of the terminal device, and voice control is started through this button: when the user holds the button down, the voice input operation starts; when the user releases the button, the voice input operation ends.
It can be understood that this solution is not limited to message session interfaces; it also covers other application scenarios that have word and voice input demands, for example voice assistants and voice navigation.
Further, the status information indicates continuous touching of the preset region of the interactive interface, by which voice information is acquired continuously;
Further, the status information indicates that the preset physical button is being held down, by which voice information is acquired continuously;
Further, the status information indicates the two touches of the preset region of the interactive interface, at the start and at the end, by which the voice information within that time period is obtained;
Further, the status information indicates the two presses of the preset physical button, at the start and at the end, by which the voice information within that time period is obtained.
The advantageous effect of this embodiment is as follows. A dialog region and an input area are displayed in the interactive interface; the input area is then activated by the speech-input instruction, and the status information of the voice input is displayed in the input area. This provides the environment and the preconditions for a user-friendly voice input control scheme: the user can perform voice input quickly, the result of a voice input is adaptively sent as voice information or as text information, the user no longer has to switch manually between word input and voice input, and the integrity and adaptability of voice input are considerably improved.
Embodiment three
Fig. 5 is a flowchart of the third embodiment of the voice input control method of the present invention. Based on the above embodiment, obtaining voice information through the voice input operation includes:
S21: obtaining the voice information and updating the status information in real time;
S22: caching the obtained voice information.
In this embodiment, the voice information is first obtained while the status information is updated in real time; then the obtained voice information is cached.
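Steps S21 and S22 can be sketched as a loop that appends each audio frame to a cache and reports progress after every frame. The function name `capture_and_cache`, the `on_status` callback, the 20 ms frame length, and the status wording are assumptions made for illustration.

```python
from typing import Callable, Iterable


def capture_and_cache(
    frames: Iterable[bytes],
    on_status: Callable[[str], None],
    frame_ms: int = 20,  # assumed frame duration in milliseconds
) -> bytes:
    """Cache incoming audio frames and update the status after each one."""
    cache = bytearray()
    elapsed_ms = 0
    for frame in frames:
        cache.extend(frame)                               # S22: cache the voice information
        elapsed_ms += frame_ms
        on_status(f"Recording {elapsed_ms / 1000:.2f} s")  # S21: real-time status update
    return bytes(cache)
```

The callback lets the same loop drive any display, for example the status line of the input area.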
In this embodiment, after the voice input operation in the current interactive interface is triggered by the speech-input instruction, voice information is obtained through the voice input operation. As in the example above, the voice input operation starts the acquisition of voice information. It can be understood that the voice input operation may be a touch instruction (touch scheme) or a physical button instruction (press scheme). Specifically, voice information is acquired continuously while a preset region of the interactive interface is touched continuously, or while a preset physical button is held down. Further, the voice information within a time period is obtained by touching the preset region of the interactive interface twice, once at the start and once at the end of the period, or by pressing the preset physical button twice in the same way.
Further, the updated status information indicates continuous touching of the preset region of the interactive interface, by which voice information is acquired continuously;
Further, the updated status information indicates that the preset physical button is being held down, by which voice information is acquired continuously;
Further, the updated status information indicates the two touches of the preset region of the interactive interface, at the start and at the end, by which the voice information within that time period is obtained;
Further, the updated status information indicates the two presses of the preset physical button, at the start and at the end, by which the voice information within that time period is obtained.
The advantageous effect of this embodiment is as follows. The voice information is obtained while the status information is updated in real time, and the obtained voice information is then cached. This provides the environment and the preconditions for a user-friendly voice input control scheme: the user can perform voice input quickly, the result of a voice input is adaptively sent as voice information or as text information, the user no longer has to switch manually between word input and voice input, and the integrity and adaptability of voice input are considerably improved.
Embodiment four
Fig. 6 is a flowchart of the fourth embodiment of the voice input control method of the present invention. Based on the above embodiment, identifying the input demand of the interactive interface, where the input demand is either a voice input demand or a word input demand, includes:
S31: detecting the dialog region and the input area;
S32: judging the input demand of the dialog region and the input area, where the input demand is either a voice input demand or a word input demand;
S33: if the last piece of information in the dialog region is voice information, or the input area is in the voice input state, determining a voice input demand; if the last piece of information in the dialog region is text information, or the input area is in the word input state, determining a word input demand.
In this embodiment, the dialog region and the input area are first detected; then the input demand of the dialog region and the input area is judged, where the input demand is either a voice input demand or a word input demand; finally, if the last piece of information in the dialog region is voice information, or the input area is in the voice input state, a voice input demand is determined, and if the last piece of information in the dialog region is text information, or the input area is in the word input state, a word input demand is determined.
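The rule of S33 can be written as two conditions evaluated in the order given. The function name `decide_input_demand` and the string values are hypothetical. Note one assumption: the text does not say what happens when the dialog region and the input area disagree, so this sketch simply checks the voice condition first, which makes voice win in that case.

```python
from typing import Optional


def decide_input_demand(
    last_info: Optional[str],    # "voice", "text", or None if the dialog region is empty
    input_state: Optional[str],  # "voice", "text", or None if the state is unknown
) -> Optional[str]:
    """Apply the S33 rule; returns "voice", "text", or None if undecidable."""
    # Assumption: the voice condition is evaluated first, as written in S33
    if last_info == "voice" or input_state == "voice":
        return "voice"
    if last_info == "text" or input_state == "text":
        return "text"
    return None
```

A real implementation would need an explicit priority between the two sources; that choice is outside what the text specifies.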
Specifically, in this embodiment, after voice information is obtained through the voice input operation, the input demand of the interactive interface is identified, where the input demand is either a voice input demand or a word input demand. A voice input demand, as used in this solution, means that in the current state the information format corresponding to this voice input operation is voice information; likewise, a word input demand means that in the current state the information format corresponding to this voice input operation is text information. For example, a dialog interface with a contact displays dialogue entries in chronological order. It can be understood that when input voice information is sent directly, the corresponding dialogue entry is voice information, and when input text information is sent directly, the corresponding dialogue entry is text information.
In this embodiment, the current input demand is determined by the information type of the last dialogue entry: if the last dialogue entry is text information, the input demand of the interactive interface is identified as a word input demand;
Further, the current input demand is determined by the information type of the last dialogue entry: if the last dialogue entry is voice information, the input demand of the interactive interface is identified as a voice input demand;
Further, the current input demand is determined by the information type of the last dialogue entry sent by the user: if that entry is voice information, the input demand of the interactive interface is identified as a voice input demand; if that entry is text information, the input demand of the interactive interface is identified as a word input demand;
Further, the current input demand is determined by the share of voice dialogue entries versus text dialogue entries among all dialogue entries within a preset time (for example, within the last ten minutes); that is, the type with the larger share becomes the current input demand;
Further, the current input demand is determined by the share of voice dialogue entries versus text dialogue entries among a certain number of historical entries (for example, the most recent ten dialogue entries); that is, the type with the larger share becomes the current input demand.
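The two share-based variants (a time window and a fixed number of recent entries) can be sketched with one counting function. The name `majority_demand`, the `(timestamp, kind)` tuple layout, and the `tie_break` value are assumptions; the text does not say which demand applies when both types are equally frequent.

```python
from collections import Counter
from typing import Optional, Sequence, Tuple

Entry = Tuple[float, str]  # (timestamp in seconds, "voice" or "text"), oldest first


def majority_demand(
    entries: Sequence[Entry],
    now: float,
    window_s: Optional[float] = None,  # e.g. 600 for the last ten minutes
    last_n: Optional[int] = None,      # e.g. 10 for the last ten entries
    tie_break: str = "voice",          # assumption: ties are not covered by the text
) -> str:
    """Return the entry type with the larger share in the selected history."""
    selected = list(entries)
    if window_s is not None:
        selected = [e for e in selected if now - e[0] <= window_s]
    if last_n is not None:
        selected = selected[-last_n:] if last_n > 0 else []
    counts = Counter(kind for _, kind in selected)
    if counts["voice"] == counts["text"]:
        return tie_break
    return "voice" if counts["voice"] > counts["text"] else "text"
```

For example, `majority_demand(entries, now, window_s=600)` applies the ten-minute rule and `majority_demand(entries, now, last_n=10)` the ten-entry rule.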
The advantageous effect of this embodiment is as follows. The dialog region and the input area are detected; the input demand of the dialog region and the input area is then judged, where the input demand is either a voice input demand or a word input demand; finally, if the last piece of information in the dialog region is voice information, or the input area is in the voice input state, a voice input demand is determined, and if the last piece of information in the dialog region is text information, or the input area is in the word input state, a word input demand is determined. This provides the environment and the preconditions for a user-friendly voice input control scheme: the user can perform voice input quickly, the result of a voice input is adaptively sent as voice information or as text information, the user no longer has to switch manually between word input and voice input, and the integrity and adaptability of voice input are considerably improved.
Embodiment five
Fig. 7 is a flowchart of the fifth embodiment of the voice input control method of the present invention. Based on the above embodiment, sending the voice information under the voice input demand, or converting the voice information to text information and sending it under the word input demand, includes:
S41: recording the input demand update information of the input area;
S42: determining the next input demand according to the input demand update information, and sending the voice information or the text information according to that input demand.
In this embodiment, the input demand update information of the input area is first recorded; then the next input demand is determined according to the input demand update information, and the voice information or the text information is sent according to that input demand.
It can be understood that the input demand update information of the input area in this embodiment is the information entered under the current input demand. For example, if the current input is text information, the next input demand is updated to text information, that is, to the word input demand.
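Steps S41 and S42 amount to remembering the type of the most recent input and using it as the demand for the next one. The class name `InputDemandTracker`, the method `record`, and the initial value are hypothetical; the initial demand before any input has been recorded is an assumption.

```python
class InputDemandTracker:
    """Hypothetical holder of the input demand update information."""

    VALID = ("voice", "text")

    def __init__(self, initial: str = "voice") -> None:  # initial value is an assumption
        if initial not in self.VALID:
            raise ValueError(f"unknown input demand: {initial!r}")
        self.next_demand = initial

    def record(self, sent_type: str) -> None:
        # S41: record what was entered under the current input demand
        if sent_type not in self.VALID:
            raise ValueError(f"unknown input type: {sent_type!r}")
        # S42: that type becomes the demand for the next input
        self.next_demand = sent_type
```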
In this embodiment, the input demand of the interactive interface is identified, where the input demand is either a voice input demand or a word input demand. Then, under a voice input demand, the voice information is sent as it is; under a word input demand, the voice information is converted to text information and the text information is sent. As in the example above, when the current input demand is determined by the information type of the last dialogue entry and that entry is text information, the input demand of the interactive interface is identified as a word input demand, so the voice information is converted to text information and sent. Alternatively, when the current input demand is determined by the information type of the last dialogue entry sent by the user and that entry is text information, the input demand is likewise identified as a word input demand, and the voice information is converted to text information and sent.
The advantageous effect of this embodiment is as follows. The input demand update information of the input area is recorded; the next input demand is then determined according to the input demand update information, and the voice information or the text information is sent according to that input demand. This realizes a user-friendly voice input control scheme: the user can perform voice input quickly, the result of a voice input is adaptively sent as voice information or as text information, the user no longer has to switch manually between word input and voice input, and the integrity and adaptability of voice input are considerably improved.
Embodiment six
Based on the above embodiments, the present invention also discloses a voice input control device. The device includes a memory, a processor, and a computer program stored in the memory and executable on the processor; when executed by the processor, the computer program implements:
triggering the voice input operation in the current interactive interface by a speech-input instruction;
obtaining voice information through the voice input operation;
identifying the input demand of the interactive interface, where the input demand is either a voice input demand or a word input demand;
sending the voice information under the voice input demand, or converting the voice information to text information and sending it under the word input demand.
In this embodiment, the voice input operation in the current interactive interface is first triggered by a speech-input instruction; voice information is then obtained through the voice input operation; next, the input demand of the interactive interface is identified, where the input demand is either a voice input demand or a word input demand; finally, under a voice input demand the voice information is sent, or under a word input demand the voice information is converted to text information and sent.
Specifically, the voice control scheme of the present invention is applicable to smart devices such as smartphones and tablet computers. In this embodiment, taking a mobile phone as an example, voice information is first recorded and parsed: the phone has a recording component such as a microphone, through which external audio information is obtained; the audio information is cached by a caching component of the phone; and the cached voice information is then parsed by the processor using a preset algorithm. It can be understood that the external audio information obtained through the recording component includes the user's voice information and other environmental noise. Before the voice information is parsed, if the environmental noise exceeds a certain threshold, noise reduction is applied first, and the parsing operation is performed afterwards.
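The noise check before parsing can be sketched as a threshold test. Everything specific here is an assumption: the text does not say how environmental noise is measured, so this sketch estimates it as the RMS level of a short leading segment, and `denoise` is a placeholder for whatever noise reduction the device applies. The names `rms` and `prepare_for_parsing` are hypothetical.

```python
import math
from typing import Callable, List, Sequence


def rms(samples: Sequence[float]) -> float:
    """Root-mean-square level of a block of samples (0.0 for an empty block)."""
    if not samples:
        return 0.0
    return math.sqrt(sum(s * s for s in samples) / len(samples))


def prepare_for_parsing(
    samples: Sequence[float],
    threshold: float,
    denoise: Callable[[Sequence[float]], List[float]],  # placeholder noise reducer
    lead: int = 160,  # assumed number of leading samples used to estimate noise
) -> List[float]:
    """Apply noise reduction only when the estimated noise exceeds the threshold."""
    noise_level = rms(samples[:lead])
    if noise_level > threshold:
        return denoise(samples)
    return list(samples)
```

The returned samples would then be handed to the parsing step.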
Specifically, in this embodiment, the voice input operation in the current interactive interface is triggered by a speech-input instruction. The current interactive interface may be a messaging interface or message session interface of the system, or a messaging interface or message session interface of an application. For example, a physical button for starting voice control is provided on a side key of the terminal device, and voice control is started through this physical button.
Further, a physical button for starting voice control is provided on a side key of the terminal device, and voice control is started through this button: when the user holds the button down, the voice input operation starts; when the user releases the button, the voice input operation ends.
Further, a physical button for starting voice control is provided in the top region, bottom region, back region, or front region of the terminal device, and voice control is started through this button: when the user holds the button down, the voice input operation starts; when the user releases the button, the voice input operation ends.
It can be understood that this solution is not limited to message session interfaces; it also covers other application scenarios that have word and voice input demands, for example voice assistants and voice navigation.
In this embodiment, after the voice input operation in the current interactive interface is triggered by the speech-input instruction, voice information is obtained through the voice input operation. As in the example above, the voice input operation starts the acquisition of voice information. It can be understood that the voice input operation may be a touch instruction (touch scheme) or a physical button instruction (press scheme). Specifically, voice information is acquired continuously while a preset region of the interactive interface is touched continuously, or while a preset physical button is held down. Further, the voice information within a time period is obtained by touching the preset region of the interactive interface twice, once at the start and once at the end of the period, or by pressing the preset physical button twice in the same way.
Specifically, in this embodiment, after voice information is obtained through the voice input operation, the input demand of the interactive interface is identified, where the input demand is either a voice input demand or a word input demand. A voice input demand, as used in this solution, means that in the current state the information format corresponding to this voice input operation is voice information; likewise, a word input demand means that in the current state the information format corresponding to this voice input operation is text information. For example, a dialog interface with a contact displays dialogue entries in chronological order. It can be understood that when input voice information is sent directly, the corresponding dialogue entry is voice information, and when input text information is sent directly, the corresponding dialogue entry is text information.
In this embodiment, the current input demand is determined by the information type of the last dialogue entry: if the last dialogue entry is text information, the input demand of the interactive interface is identified as a word input demand;
Further, the current input demand is determined by the information type of the last dialogue entry: if the last dialogue entry is voice information, the input demand of the interactive interface is identified as a voice input demand;
Further, the current input demand is determined by the information type of the last dialogue entry sent by the user: if that entry is voice information, the input demand of the interactive interface is identified as a voice input demand; if that entry is text information, the input demand of the interactive interface is identified as a word input demand.
In this embodiment, the input demand of the interactive interface is identified, where the input demand is either a voice input demand or a word input demand. Then, under a voice input demand, the voice information is sent as it is; under a word input demand, the voice information is converted to text information and the text information is sent. As in the example above, when the current input demand is determined by the information type of the last dialogue entry and that entry is text information, the input demand of the interactive interface is identified as a word input demand, so the voice information is converted to text information and sent. Alternatively, when the current input demand is determined by the information type of the last dialogue entry sent by the user and that entry is text information, the input demand is likewise identified as a word input demand, and the voice information is converted to text information and sent.
The advantageous effect of this embodiment is as follows. The voice input operation in the current interactive interface is triggered by a speech-input instruction; voice information is then obtained through the voice input operation; next, the input demand of the interactive interface is identified, where the input demand is either a voice input demand or a word input demand; finally, under a voice input demand the voice information is sent, or under a word input demand the voice information is converted to text information and sent. This realizes a user-friendly voice input control scheme: the user can perform voice input quickly, the result of a voice input is adaptively sent as voice information or as text information, the user no longer has to switch manually between word input and voice input, and the integrity and adaptability of voice input are considerably improved.
Embodiment seven
Based on the above embodiments, optionally, when executed by the processor, the computer program further implements:
displaying a dialog region and an input area in the interactive interface;
activating the input area by the speech-input instruction, and displaying status information of the voice input in the input area.
In this embodiment, a dialog region and an input area are first displayed in the interactive interface; then the input area is activated by the speech-input instruction, and the status information of the voice input is displayed in the input area.
Specifically, in this embodiment, the voice input operation in the current interactive interface is triggered by a speech-input instruction. The current interactive interface may be a messaging interface or message session interface of the system, or a messaging interface or message session interface of an application. For example, a physical button for starting voice control is provided on a side key of the terminal device, and voice control is started through this physical button.
Further, a physical button for starting voice control is provided on a side key of the terminal device, and voice control is started through this button: when the user holds the button down, the voice input operation starts; when the user releases the button, the voice input operation ends.
Further, a physical button for starting voice control is provided in the top region, bottom region, back region, or front region of the terminal device, and voice control is started through this button: when the user holds the button down, the voice input operation starts; when the user releases the button, the voice input operation ends.
It can be understood that this solution is not limited to message session interfaces; it also covers other application scenarios that have word and voice input demands, for example voice assistants and voice navigation.
Further, the status information indicates continuous touching of the preset region of the interactive interface, by which voice information is acquired continuously;
Further, the status information indicates that the preset physical button is being held down, by which voice information is acquired continuously;
Further, the status information indicates the two touches of the preset region of the interactive interface, at the start and at the end, by which the voice information within that time period is obtained;
Further, the status information indicates the two presses of the preset physical button, at the start and at the end, by which the voice information within that time period is obtained.
The advantageous effect of this embodiment is as follows. A dialog region and an input area are displayed in the interactive interface; the input area is then activated by the speech-input instruction, and the status information of the voice input is displayed in the input area. This provides the environment and the preconditions for a user-friendly voice input control scheme: the user can perform voice input quickly, the result of a voice input is adaptively sent as voice information or as text information, the user no longer has to switch manually between word input and voice input, and the integrity and adaptability of voice input are considerably improved.
Embodiment eight
Based on the above embodiments, optionally, when executed by the processor, the computer program further implements:
acquiring the voice information and updating the status information in real time;
caching the acquired voice information.
In this embodiment, the voice information is first acquired and the status information is updated in real time; the acquired voice information is then cached.
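The acquire, update, and cache steps of this embodiment can be illustrated with a minimal Python sketch. It is not the implementation of the invention: the class name `VoiceCapture`, the `on_status` callback, the status string, and the 16 kHz, 16-bit mono PCM format are all assumptions made for illustration.

```python
from dataclasses import dataclass, field
from typing import Callable, List


@dataclass
class VoiceCapture:
    """Caches audio chunks and reports status after every chunk (hypothetical helper)."""

    on_status: Callable[[str], None]   # assumed callback that redraws the input region
    sample_rate: int = 16000           # assumed format: 16 kHz, 16-bit mono PCM
    _chunks: List[bytes] = field(default_factory=list)

    def feed(self, chunk: bytes) -> None:
        # Cache the newly acquired audio, then refresh the status information.
        self._chunks.append(chunk)
        self.on_status(f"Recording {self.duration_seconds():.1f} s")

    def duration_seconds(self) -> float:
        total_bytes = sum(len(c) for c in self._chunks)
        return total_bytes / (2 * self.sample_rate)  # 2 bytes per 16-bit sample

    def cached_audio(self) -> bytes:
        # The cached voice information, ready to be sent or converted later.
        return b"".join(self._chunks)
```

In this sketch the status callback fires once per audio chunk, so the displayed duration follows the recording as closely as the chunk size allows.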
In this embodiment, after the voice input operation in the current interactive interface is triggered by the voice input instruction, voice information is acquired through the voice input operation. As in the example above, the voice input operation starts the acquisition of voice information. It can be understood that the voice input operation may be a touch instruction under a touch scheme or a physical-button instruction under a press scheme. Specifically, voice information is acquired continuously while a preset region of the interactive interface is touched continuously, or while a preset physical button is pressed continuously. Further, the voice information within a time period is acquired by touching the preset region of the interactive interface twice, once at the start and once at the end of the period, or by pressing the preset physical button twice, once at the start and once at the end of the period.
Further, the updated status information displays the continuous touch on the preset region of the interactive interface, by which voice information is acquired continuously;
Further, the updated status information displays the continuous press of the preset physical button, by which voice information is acquired continuously;
Further, the updated status information displays the two touches on the preset region of the interactive interface, one at the start and one at the end, by which the voice information within that time period is acquired;
Further, the updated status information displays the two presses of the preset physical button, one at the start and one at the end, by which the voice information within that time period is acquired.
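The two ways of delimiting a recording described above (holding continuously, or acting once at the start and once at the end) can be sketched as a small state tracker. This is only an illustrative sketch; `TriggerMode`, `CaptureTrigger`, and the `press`/`release` method names are hypothetical and do not come from the source.

```python
from enum import Enum


class TriggerMode(Enum):
    HOLD = "hold"      # capture while the region or button stays pressed
    TOGGLE = "toggle"  # first press starts capture, second press ends it


class CaptureTrigger:
    """Tracks whether voice capture is active for either trigger mode."""

    def __init__(self, mode: TriggerMode) -> None:
        self.mode = mode
        self.recording = False

    def press(self) -> None:
        if self.mode is TriggerMode.HOLD:
            self.recording = True
        else:
            # TOGGLE: each press flips the state (start, then stop).
            self.recording = not self.recording

    def release(self) -> None:
        # Releasing only matters in HOLD mode; TOGGLE ignores it.
        if self.mode is TriggerMode.HOLD:
            self.recording = False
```

The same tracker serves both a touch region and a physical button, since each produces the same press and release events.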
The beneficial effect of this embodiment is as follows. The voice information is acquired and the status information is updated in real time; the acquired voice information is then cached. This provides the environmental and conditional basis for a user-friendly voice input control scheme, so that the user can perform voice input operations quickly. At the same time, the information sent after voice input is adaptively adjusted and switched between voice information and text information, which spares the user from manually switching between text input and voice input and considerably improves the integrity and adaptability of voice input.
Embodiment nine
Based on the above embodiments, optionally, when executed by the processor, the computer program further implements:
detecting the dialog region and the input region;
determining the input demand of the dialog region and the input region, wherein the input demand includes a voice input demand and a text input demand;
if the last piece of information in the dialog region is voice information, or the input region is in a voice input state, determining that the input demand is the voice input demand; if the last piece of information in the dialog region is text information, or the input region is in a text input state, determining that the input demand is the text input demand;
recording input demand update information of the input region;
determining the next input demand according to the input demand update information, and performing the voice information sending operation or the text information sending operation according to that input demand.
In this embodiment, the dialog region and the input region are first detected; the input demand of the dialog region and the input region is then determined, wherein the input demand includes a voice input demand and a text input demand. Finally, if the last piece of information in the dialog region is voice information, or the input region is in a voice input state, the input demand is determined to be the voice input demand; if the last piece of information in the dialog region is text information, or the input region is in a text input state, the input demand is determined to be the text input demand.
Specifically, in this embodiment, after voice information is acquired through the voice input operation, the input demand of the interactive interface is identified, wherein the input demand includes a voice input demand and a text input demand. The voice input demand described in this solution means that, in the current state, the information format corresponding to this voice input operation is voice information; likewise, the text input demand means that, in the current state, the information format corresponding to this voice input operation is text information. For example, a dialog interface with a contact displays dialog entries in chronological order. It can be understood that when input voice information is sent directly, the corresponding dialog entry is voice information, and when input text information is sent directly, the corresponding dialog entry is text information.
In this embodiment, the current input demand is determined by the information type of the last dialog entry. That is, if the information type of the last dialog entry is text information, the input demand of the interactive interface is identified as the text input demand;
Further, the current input demand is determined by the information type of the last dialog entry. That is, if the information type of the last dialog entry is voice information, the input demand of the interactive interface is identified as the voice input demand;
Further, the current input demand is determined by the information type of the last dialog entry sent by the user. That is, if the information type of that entry is voice information, the input demand of the interactive interface is identified as the voice input demand; if the information type of that entry is text information, the input demand of the interactive interface is identified as the text input demand;
Further, the current input demand is determined by the proportion of voice dialog entries or text dialog entries among all dialog entries within a preset time (for example, within ten minutes); that is, the type with the larger share is taken as the current input demand;
Further, the current input demand is determined by the proportion of voice dialog entries or text dialog entries among a certain number of historical dialog entries (for example, the most recent ten dialog entries); that is, the type with the larger share is taken as the current input demand.
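The alternative rules above for deriving the current input demand from the dialog history can be compared in a short Python sketch. Everything named here (`Entry`, `by_last_entry`, `by_majority`, `by_time_window`, `by_recent_count`) is hypothetical. The source does not say what happens when voice and text entries are equally frequent, so falling back to the last entry on a tie is an assumption of this sketch.

```python
from dataclasses import dataclass
from typing import List, Optional

VOICE, TEXT = "voice", "text"


@dataclass
class Entry:
    kind: str         # VOICE or TEXT
    from_user: bool   # True if the local user sent this entry
    timestamp: float  # seconds on any monotonic scale


def by_last_entry(entries: List[Entry], own_only: bool = False) -> Optional[str]:
    """Type of the last entry, optionally restricted to entries the user sent."""
    for e in reversed(entries):
        if e.from_user or not own_only:
            return e.kind
    return None


def by_majority(entries: List[Entry]) -> Optional[str]:
    """Type with the larger share; a tie falls back to the last entry (assumption)."""
    if not entries:
        return None
    voice = sum(1 for e in entries if e.kind == VOICE)
    text = len(entries) - voice
    if voice == text:
        return entries[-1].kind
    return VOICE if voice > text else TEXT


def by_time_window(entries: List[Entry], now: float,
                   window_s: float = 600.0) -> Optional[str]:
    """Majority type among entries from the last window_s seconds (ten minutes here)."""
    return by_majority([e for e in entries if now - e.timestamp <= window_s])


def by_recent_count(entries: List[Entry], count: int = 10) -> Optional[str]:
    """Majority type among the most recent count entries."""
    return by_majority(entries[-count:])
```

Each function returns `None` when there is no usable history, leaving the choice of a default demand to the caller.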
In this embodiment, the input demand update information of the input region is first recorded; the next input demand is then determined according to the input demand update information, and the voice information sending operation or the text information sending operation is performed according to that input demand.
It can be understood that the input demand update information of the input region in this embodiment is the information entered under the current input demand. For example, if the current input demand is text information, the next input demand is updated to text information.
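Recording the update information and deriving the next input demand from it can be sketched as follows. The class `InputDemandTracker` and its method names are hypothetical, and the default of "voice" before anything has been recorded is an assumption, since the source does not state an initial demand.

```python
from typing import List

VALID_KINDS = ("voice", "text")


class InputDemandTracker:
    """Keeps the update record of the input region and derives the next demand."""

    def __init__(self, initial: str = "voice") -> None:
        self.history: List[str] = []  # input demand update information, oldest first
        self._initial = initial       # assumed default before anything is recorded

    def record(self, sent_kind: str) -> None:
        # Store the kind of information that was entered under the current demand.
        if sent_kind not in VALID_KINDS:
            raise ValueError(f"unknown input kind: {sent_kind!r}")
        self.history.append(sent_kind)

    def next_demand(self) -> str:
        # The next demand follows the most recently recorded input.
        return self.history[-1] if self.history else self._initial
```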
In this embodiment, the input demand of the interactive interface is identified, wherein the input demand includes a voice input demand and a text input demand; afterwards, the voice information sending operation is performed according to the voice input demand, or the voice information is converted into text information according to the text input demand and the sending operation is performed. As in the example above, the current input demand is determined by the information type of the last dialog entry: if the information type of the last dialog entry is text information, the input demand of the interactive interface is identified as the text input demand, and the voice information is then converted into text information according to the text input demand and sent. Alternatively, the current input demand is determined by the information type of the last dialog entry sent by the user: if the information type of that entry is text information, the input demand of the interactive interface is identified as the text input demand, and the voice information is then converted into text information according to the text input demand and sent.
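The final branch, sending the cached voice information as it is or converting it to text first, can be sketched as a single dispatch function. The names `dispatch`, `transcribe`, and `send` and the message dictionary layout are assumptions for illustration; the speech-to-text engine is passed in as a callable because the source does not name one.

```python
from typing import Callable, Dict


def dispatch(audio: bytes,
             demand: str,
             transcribe: Callable[[bytes], str],
             send: Callable[[Dict[str, object]], None]) -> None:
    """Send the cached audio as voice, or convert it to text first, per the demand."""
    if demand == "voice":
        # Voice input demand: send the voice information unchanged.
        send({"type": "voice", "payload": audio})
    elif demand == "text":
        # Text input demand: convert the voice information to text, then send it.
        send({"type": "text", "payload": transcribe(audio)})
    else:
        raise ValueError(f"unknown input demand: {demand!r}")
```

Because the recognizer is injected, the conversion step only runs when the text input demand applies.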
The beneficial effect of this embodiment is as follows. The input demand update information of the input region is recorded; the next input demand is then determined according to the input demand update information, and the voice information sending operation or the text information sending operation is performed according to that input demand. This realizes a user-friendly voice input control scheme, so that the user can perform voice input operations quickly. At the same time, the information sent after voice input is adaptively adjusted and switched between voice information and text information, which spares the user from manually switching between text input and voice input and considerably improves the integrity and adaptability of voice input.
Embodiment ten
Based on the above embodiments, the present invention further provides a computer-readable storage medium on which a voice input control program is stored; when the voice input control program is executed by a processor, the steps of the voice input control method described in any of the above are implemented.
Voice input control method, equipment and the computer readable storage medium for implementing the present invention are referred to by voice input Enable the voice input operation triggered in current interactive interface;Then, operation is inputted by the voice and obtains voice messaging;Again so Afterwards, the input demand of the interactive interface is identified, wherein the input demand, which includes voice input demand and word input, to be needed It asks;Finally, it executes the voice messaging by voice input demand and sends operation, alternatively, will by word input demand The voice messaging is converted to text information and executes transmission operation.A kind of voice input control scheme of hommization is realized, User is allow quickly to carry out voice input operation, meanwhile, it is adaptively adjusted, switches the letter of the voice after being inputted by voice Breath or text information eliminate user in the handover operation of word input and voice input, larger improve voice input Globality and adaptability.
It should be noted that, herein, the terms "include", "comprise", or any other variant thereof are intended to cover non-exclusive inclusion, so that a process, method, article, or device that includes a series of elements includes not only those elements but also other elements not explicitly listed, or elements inherent to such a process, method, article, or device. Without further limitation, an element defined by the phrase "including a ..." does not exclude the presence of additional identical elements in the process, method, article, or device that includes the element.
The serial numbers of the above embodiments of the present invention are for description only and do not represent the relative merits of the embodiments.
From the description of the above embodiments, those skilled in the art can clearly understand that the methods of the above embodiments may be implemented by software plus a necessary general-purpose hardware platform, or, of course, by hardware, although in many cases the former is the better implementation. Based on this understanding, the technical solution of the present invention, in essence or in the part that contributes over the prior art, may be embodied in the form of a software product. The computer software product is stored in a storage medium (such as ROM/RAM, a magnetic disk, or an optical disc) and includes several instructions that cause a terminal (which may be a mobile phone, a computer, a server, an air conditioner, a network device, or the like) to perform the methods described in the embodiments of the present invention.
The embodiments of the present invention have been described above with reference to the accompanying drawings, but the present invention is not limited to the specific embodiments described above, which are merely illustrative rather than restrictive. Inspired by the present invention, those of ordinary skill in the art may devise many other forms without departing from the purpose of the present invention and the scope protected by the claims, all of which fall within the protection of the present invention.

Claims (10)

1. A voice input control method, characterized in that the method comprises:
triggering a voice input operation in a current interactive interface by a voice input instruction;
acquiring voice information through the voice input operation;
identifying an input demand of the interactive interface, wherein the input demand includes a voice input demand and a text input demand;
performing a voice information sending operation according to the voice input demand, or converting the voice information into text information according to the text input demand and performing a sending operation.
2. The voice input control method according to claim 1, characterized in that triggering the voice input operation in the current interactive interface by the voice input instruction comprises:
displaying a dialog region and an input region in the interactive interface;
activating the input region by the voice input instruction, and displaying status information of the voice input in the input region.
3. The voice input control method according to claim 2, characterized in that acquiring voice information through the voice input operation comprises:
acquiring the voice information and updating the status information in real time;
caching the acquired voice information.
4. The voice input control method according to claim 3, characterized in that identifying the input demand of the interactive interface, wherein the input demand includes a voice input demand and a text input demand, comprises:
detecting the dialog region and the input region;
determining the input demand of the dialog region and the input region, wherein the input demand includes a voice input demand and a text input demand;
if the last piece of information in the dialog region is voice information, or the input region is in a voice input state, determining that the input demand is the voice input demand; if the last piece of information in the dialog region is text information, or the input region is in a text input state, determining that the input demand is the text input demand.
5. The voice input control method according to claim 4, characterized in that performing the voice information sending operation according to the voice input demand, or converting the voice information into text information according to the text input demand and performing the sending operation, comprises:
recording input demand update information of the input region;
determining the next input demand according to the input demand update information, and performing the voice information sending operation or the text information sending operation according to that input demand.
6. A voice input control device, characterized in that the device comprises a memory, a processor, and a computer program stored in the memory and executable on the processor, wherein the computer program, when executed by the processor, implements:
triggering a voice input operation in a current interactive interface by a voice input instruction;
acquiring voice information through the voice input operation;
identifying an input demand of the interactive interface, wherein the input demand includes a voice input demand and a text input demand;
performing a voice information sending operation according to the voice input demand, or converting the voice information into text information according to the text input demand and performing a sending operation.
7. The voice input control device according to claim 6, characterized in that the computer program, when executed by the processor, further implements:
displaying a dialog region and an input region in the interactive interface;
activating the input region by the voice input instruction, and displaying status information of the voice input in the input region.
8. The voice input control device according to claim 7, characterized in that the computer program, when executed by the processor, further implements:
acquiring the voice information and updating the status information in real time;
caching the acquired voice information.
9. The voice input control device according to claim 8, characterized in that the computer program, when executed by the processor, further implements:
detecting the dialog region and the input region;
determining the input demand of the dialog region and the input region, wherein the input demand includes a voice input demand and a text input demand;
if the last piece of information in the dialog region is voice information, or the input region is in a voice input state, determining that the input demand is the voice input demand; if the last piece of information in the dialog region is text information, or the input region is in a text input state, determining that the input demand is the text input demand;
recording input demand update information of the input region;
determining the next input demand according to the input demand update information, and performing the voice information sending operation or the text information sending operation according to that input demand.
10. A computer-readable storage medium, characterized in that a voice input control program is stored on the computer-readable storage medium, and when the voice input control program is executed by a processor, the steps of the voice input control method according to any one of claims 1 to 5 are implemented.
CN201810202888.5A 2018-03-13 2018-03-13 A kind of voice input control method, equipment and computer readable storage medium Pending CN108520750A (en)

Priority Applications (1)

Application Number Priority Date Filing Date Title
CN201810202888.5A CN108520750A (en) 2018-03-13 2018-03-13 A kind of voice input control method, equipment and computer readable storage medium

Publications (1)

Publication Number Publication Date
CN108520750A true CN108520750A (en) 2018-09-11

Family

ID=63433037

Family Applications (1)

Application Number Title Priority Date Filing Date
CN201810202888.5A Pending CN108520750A (en) 2018-03-13 2018-03-13 A kind of voice input control method, equipment and computer readable storage medium

Country Status (1)

Country Link
CN (1) CN108520750A (en)

Citations (11)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
US6021178A (en) * 1996-03-29 2000-02-01 Siemens Information And Communication Networks, Inc. System and method for detecting types of signals in messaging systems
US7035805B1 (en) * 2000-07-14 2006-04-25 Miller Stephen S Switching the modes of operation for voice-recognition applications
CN101308654A (en) * 2007-05-14 2008-11-19 华为技术有限公司 Speech analysis and recognition method, system and apparatus
CN104869225A (en) * 2014-02-21 2015-08-26 宏达国际电子股份有限公司 Smart conversation method and electronic device using the same
CN106550146A (en) * 2016-10-28 2017-03-29 努比亚技术有限公司 A kind of chat message dispensing device and method
CN106710586A (en) * 2016-12-27 2017-05-24 北京智能管家科技有限公司 Speech recognition engine automatic switching method and device
CN107124352A (en) * 2017-05-26 2017-09-01 维沃移动通信有限公司 The processing method and mobile terminal of a kind of voice messaging
CN107342088A (en) * 2017-06-19 2017-11-10 联想(北京)有限公司 A kind of conversion method of acoustic information, device and equipment
CN107395878A (en) * 2017-07-04 2017-11-24 合肥市乐腾科技咨询有限公司 Automatic voice and text conversion communication system
CN107483736A (en) * 2017-08-23 2017-12-15 广东小天才科技有限公司 Message processing method and device for instant messaging application program
CN107608957A (en) * 2017-09-06 2018-01-19 百度在线网络技术(北京)有限公司 Text modification method, apparatus and its equipment based on voice messaging

Cited By (4)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN109584879A (en) * 2018-11-23 2019-04-05 华为技术有限公司 A kind of sound control method and electronic equipment
CN109584879B (en) * 2018-11-23 2021-07-06 华为技术有限公司 Voice control method and electronic equipment
US11450322B2 (en) 2018-11-23 2022-09-20 Huawei Technologies Co., Ltd. Speech control method and electronic device
CN114697717A (en) * 2020-12-28 2022-07-01 深圳Tcl新技术有限公司 Text input method and terminal equipment

Legal Events

Date Code Title Description
PB01 Publication
SE01 Entry into force of request for substantive examination
RJ01 Rejection of invention patent application after publication (application publication date: 20180911)
RJ01 Rejection of invention patent application after publication