CN108196814A - Pronunciation inputting method and Related product - Google Patents
Pronunciation inputting method and Related product Download PDFInfo
- Publication number
- CN108196814A CN108196814A CN201711461498.1A CN201711461498A CN108196814A CN 108196814 A CN108196814 A CN 108196814A CN 201711461498 A CN201711461498 A CN 201711461498A CN 108196814 A CN108196814 A CN 108196814A
- Authority
- CN
- China
- Prior art keywords
- operating system
- sound bank
- word
- application
- voice
- Prior art date
- Legal status (The legal status is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the status listed.)
- Pending
Links
Classifications
-
- G—PHYSICS
- G06—COMPUTING; CALCULATING OR COUNTING
- G06F—ELECTRIC DIGITAL DATA PROCESSING
- G06F3/00—Input arrangements for transferring data to be processed into a form capable of being handled by the computer; Output arrangements for transferring data from processing unit to output unit, e.g. interface arrangements
- G06F3/16—Sound input; Sound output
- G06F3/167—Audio in a user interface, e.g. using voice commands for navigating, audio feedback
-
- G—PHYSICS
- G06—COMPUTING; CALCULATING OR COUNTING
- G06F—ELECTRIC DIGITAL DATA PROCESSING
- G06F3/00—Input arrangements for transferring data to be processed into a form capable of being handled by the computer; Output arrangements for transferring data from processing unit to output unit, e.g. interface arrangements
- G06F3/01—Input arrangements or combined input and output arrangements for interaction between user and computer
- G06F3/02—Input arrangements using manually operated switches, e.g. using keyboards or dials
- G06F3/023—Arrangements for converting discrete items of information into a coded form, e.g. arrangements for interpreting keyboard generated codes as alphanumeric codes, operand codes or instruction codes
- G06F3/0233—Character input methods
-
- G—PHYSICS
- G10—MUSICAL INSTRUMENTS; ACOUSTICS
- G10L—SPEECH ANALYSIS OR SYNTHESIS; SPEECH RECOGNITION; SPEECH OR VOICE PROCESSING; SPEECH OR AUDIO CODING OR DECODING
- G10L15/00—Speech recognition
- G10L15/26—Speech to text systems
Abstract
The embodiment of the present application discloses a kind of pronunciation inputting method and Related product.Method includes:The operating system handles collected voice as voice feature data, and the front stage operation of the mobile terminal has the runnable interface of destination application;The operating system obtains the corresponding exclusive sound bank of the destination application;The operating system determines the corresponding word of the voice according to the voice feature data and the exclusive sound bank;The operating system determines the phonetic entry result of the runnable interface according to the word.The real-time of phonetic entry and accuracy when the embodiment of the present application is conducive to improve running of mobile terminal destination application.
Description
Technical field
This application involves technical field of mobile terminals, and in particular to pronunciation inputting method and Related product.
Background technology
With the fast development of the relevant technologies of the mobile terminals such as smart mobile phone, more and more applications are installed in user
In mobile phone, such as read class application, the application of payment class, game class application, the application of music class, the clothing, food, lodging and transportion -- basic necessities of life of people with hand
Secret is inseparable.People can enable input method function during application is used, and the input method for enabling system default carries out
Keyword search and words input.
Invention content
The embodiment of the present application provides pronunciation inputting method and Related product, can improve running of mobile terminal intended application
The real-time of phonetic entry and accuracy during program.
In a first aspect, the embodiment of the present application provides a kind of pronunciation inputting method, applied to mobile terminal, above-mentioned mobile terminal
Upper operation has operating system and one or more application program, the above method to include:
Second aspect, the embodiment of the present application provide a kind of speech input device, applied to mobile terminal, above-mentioned mobile terminal
Upper operation has operating system and one or more application program, and the speech input device includes processing unit, acquiring unit
And determination unit, wherein,
The processing unit, for handling collected voice as voice feature data, the foreground of the mobile terminal is transported
Row has the runnable interface of destination application;
The acquiring unit, for obtaining the corresponding exclusive sound bank of the destination application;
The determination unit, for according to the voice feature data and the exclusive sound bank, determining the voice pair
The word answered;
The determination unit, also with the phonetic entry result that the runnable interface is determined according to the word.
The third aspect, the embodiment of the present application provide a kind of mobile terminal, including processor, memory, communication interface and
One or more programs, wherein, said one or multiple programs are stored in above-mentioned memory, and be configured by above-mentioned
It manages device to perform, above procedure includes the instruction for performing the step in the embodiment of the present application first aspect either method.
Fourth aspect, the embodiment of the present application provide a kind of computer readable storage medium, wherein, above computer is readable
Storage medium storage is used for the computer program of electronic data interchange, wherein, above computer program causes computer to perform such as
Part or all of step described in the embodiment of the present application first aspect either method, above computer include mobile terminal.
5th aspect, the embodiment of the present application provide a kind of computer program product, wherein, above computer program product
Non-transient computer readable storage medium including storing computer program, above computer program are operable to make calculating
Machine is performed such as the part or all of step described in the embodiment of the present application first aspect either method.The computer program product
Can be a software installation packet, above computer includes mobile terminal.
As can be seen that in the embodiment of the present application, mobile terminal can carry out speech voice input function in application program rank
Precise control, the exclusive sound bank being adapted to especially by determining destination application determine to use according to the exclusive sound bank
The corresponding word of voice of family input, and the phonetic entry of runnable interface is determined as a result, avoiding all applications equal according to the word
It loads same Default sound library and the individual demand of different application, and the exclusive language of application-specific can not be met
Sound library can be better than the sound bank of system default in terms of accuracy and response real-time, be conducive to improve running of mobile terminal target
The real-time of phonetic entry and accuracy during application program.
Description of the drawings
The attached drawing involved by the embodiment of the present application will be briefly described below.
Figure 1A is that the embodiment of the present application provides a kind of structure diagram of smart mobile phone;
Figure 1B is a kind of schematic diagram of the program running space of smart mobile phone;
Fig. 1 C are a kind of system architecture diagrams of Android system;
Fig. 2 is a kind of flow diagram of pronunciation inputting method provided by the embodiments of the present application;
Fig. 3 is a kind of flow diagram of pronunciation inputting method disclosed in the embodiment of the present application;
Fig. 4 is a kind of flow diagram of pronunciation inputting method disclosed in the embodiment of the present application;
Fig. 5 is a kind of structure diagram of mobile terminal disclosed in the embodiment of the present application;
Fig. 6 is a kind of functional unit composition block diagram of mobile terminal disclosed in the embodiment of the present application.
Specific embodiment
In order to which those skilled in the art is made to more fully understand application scheme, below in conjunction in the embodiment of the present application
The technical solution in the embodiment of the present application is clearly and completely described in attached drawing, it is clear that described embodiment is only
Some embodiments of the present application, instead of all the embodiments.Based on the embodiment in the application, those of ordinary skill in the art
All other embodiments obtained without creative efforts shall fall in the protection scope of this application.
Term " first ", " second " in the description and claims of this application and above-mentioned attached drawing etc. are for distinguishing
Different objects rather than for describing particular order.In addition, term " comprising " and " having " and their any deformations, it is intended that
It is to cover non-exclusive include.Such as process, method, system, product or the equipment for containing series of steps or unit do not have
The step of having listed or unit are defined in, but optionally further includes the step of not listing or unit or optionally also wraps
It includes for other intrinsic steps of these processes, method, product or equipment or unit.
Referenced herein " embodiment " is it is meant that a particular feature, structure, or characteristic described can wrap in conjunction with the embodiments
It is contained at least one embodiment of the application.Each position in the description occur the phrase might not each mean it is identical
Embodiment, nor the independent or alternative embodiment with other embodiments mutual exclusion.Those skilled in the art explicitly and
Implicitly understand, embodiment described herein can be combined with other embodiments.
Mobile terminal involved by the embodiment of the present application can include the various handheld devices with wireless communication function
(such as smart mobile phone), mobile unit, wearable device, computing device are connected to other processing of radio modem and set
Standby and various forms of user equipmenies (User Equipment, UE), mobile station (Mobile Station, MS), terminal is set
Standby (terminal device) etc..For convenience of description, apparatus mentioned above is referred to as mobile terminal.The embodiment of the present invention
Involved operating system is that hardware resource is managed collectively, and provides a user the software systems of business interface.Under
Face is introduced the exemplary construction of mobile terminal by taking smart mobile phone as an example.
Figure 1A is that the embodiment of the present application provides a kind of structure diagram of smart mobile phone 100, and above-mentioned smart mobile phone 100 wraps
It includes:Housing 110, touching display screen 120, mainboard 130, battery 140 and subplate 150 are provided with front camera on mainboard 130
131st, systems-on-a-chip (System on Chip, SoC) 132 (including application processors and baseband processor), memory 133,
Power management chip 134, radio frequency system 135 etc. are provided with oscillator 151, integrated sound chamber 152, VOOC and dodge and fill interface on subplate
153.Wherein, the touching display screen 120 can be comprehensive screen or abnormity screen, not do unique restriction herein.
The SoC132 is the control centre of smart mobile phone, utilizes each of various interfaces and the entire smart mobile phone of connection
A part is stored in storage by running or performing the software program being stored in memory 133 and/or module and call
Data in device 133 perform the various functions of smart mobile phone and processing data, so as to carry out integral monitoring to smart mobile phone.It should
SoC132 may include one or more processing units, can such as integrate application processor AP and baseband processor (also known as base band core
Piece, base band) etc., wherein, the main processing operation system of application processor, user interface and application program etc., baseband processor master
Handle wireless communication.It is understood that above-mentioned baseband processor can not also be integrated into SoC132.The SoC132 is for example
Can be central processing unit (Central Processing Unit, CPU), general processor, digital signal processor
(Digital Signal Processor, DSP), application-specific integrated circuit (Application-Specific Integrated
Circuit, ASIC), field programmable gate array (Field Programmable Gate Array, FPGA) or other can
Programmed logic device, transistor logic, hardware component or its arbitrary combination.It can realize or perform with reference to the application
The described various illustrative logic blocks of disclosure, module and circuit.Above-mentioned processor can also realize to calculate work(
The combination of energy, such as include one or more microprocessors and combine, combination of DSP and microprocessor etc..
The memory 133 can be used for storage software program and module, and SoC132 is stored in memory 133 by operation
Software program and module, so as to perform the various function application of smart mobile phone and data processing.Memory 133 can be main
Including storing program area and storage data field, wherein, storing program area can storage program area, needed at least one function should
With program etc.;Storage data field can be stored uses created data etc. according to smart mobile phone.In addition, memory 133 can be with
Including high-speed random access memory, nonvolatile memory, for example, at least disk memory, a flash memory can also be included
Device or other volatile solid-state parts.The memory 133 for example can be random access memory (Random
Access Memory, RAM), flash memory, read-only memory (Read Only Memory, ROM), the read-only storage of erasable programmable
Device (Erasable Programmable ROM, EPROM), Electrically Erasable Programmable Read-Only Memory (Electrically
EPROM, EEPROM), register, hard disk, mobile hard disk, CD-ROM (CD-ROM) or any other shape well known in the art
The storage medium of formula.
Figure 1B is the schematic diagram of the program running space of smart mobile phone provided by the embodiments of the present application, at present smart mobile phone etc.
Mobile terminal is typically provided with program running space, which includes user's space and operating system space, wherein,
There are one user's space operations or multiple application programs, which should for the third party of mobile terminal installation
With program, operating system space motion has the operating system of mobile terminal.The mobile terminal can specifically run Android Android
Mobile operating system iOS that system, Apple Inc. develop etc., does not do unique restriction herein.As shown in Figure 1 C, with above-mentioned mobile whole
For end operation has android system, corresponding user's space includes the application layer in the android system
(Applications), operating system space can include the application framework layer in the android system
(Application Framework), system operation library layer including system operation library layer Libraries and Android (when running
Android Runtime), Linux inner core (Linux Kernel).Wherein, it is directly handed over user including all kinds of in application layer
Mutual application program or the service routine for running on backstage write by Java language.For example, that is realized on smart mobile phone is common
The program of basic function, such as short message service (Short Messaging Service, SMS) short message, dialing, picture
The programs such as browser, calendar, game, map, WWW (World Wide Web, Web) browser and developer's exploitation
Other applications.Application framework layer provides a series of class libraries needed for exploitation Android application programs, can be used in
Component is reused, can also realize personalized extension by inheriting.System operation library layer is the support of application framework, is
Various components in android system provide service.System operation library layer is formed when being run by system class libraries and Android.
Core library and Dalvik virtual machine two parts are included when Android is run.Linux inner core is used to implement hardware device drivers,
The Core Features such as process and memory management, network protocol stack, power management, wireless communication.
In the game application scene of mobile terminal, player particularly requires other to be linked up in more preferable team faster
Team member's Emergency Assistance etc. information requires the response of input method higher than normal application requirement.And under normal circumstances, behaviour
Make system and do not differentiate between game and non-gaming scene, also will not more be directed to scene of game and carry out special optimization.
For the above situation, the embodiment of the present application proposes a kind of phonetic entry of destination application for mobile terminal
Method, in this method, mobile terminal can carry out Precise control in application program rank to speech voice input function, especially by
It determines the exclusive sound bank that destination application is adapted to, the corresponding text of voice input by user is determined according to the exclusive sound bank
Word, and the phonetic entry of runnable interface is determined as a result, all applications is avoided to load same Default sound library according to the word
And the individual demand of different application can not be met, and the exclusive sound bank of application-specific is real in accuracy and response
It can be better than the sound bank of system default, phonetic entry when being conducive to improve running of mobile terminal destination application in terms of when property
Real-time and accuracy.
The embodiment of the present application is introduced below in conjunction with the accompanying drawings.
A kind of flow diagram of pronunciation inputting method, above-mentioned shifting are provided referring to Fig. 2, Fig. 2 is the embodiment of the present application
Operation has operating system and one or more application program in dynamic terminal, as shown in the figure, this pronunciation inputting method includes:
S201, the operating system handle collected voice as voice feature data, and the foreground of the mobile terminal is transported
Row has the runnable interface of destination application.
Wherein, destination application refers to the third party application of the user's space mounted on mobile terminal, the third
Square application program is such as can be camera application program, the application of instant messaging class, game class application, the third party application
It can also be pre-installed before mobile terminal dispatches from the factory by developer by user installation, do not done unique restriction herein.
In the specific implementation, voice first can be divided into multiframe voice by the operating system, for multiframe voice waveform into
Row transformation, i.e., extract strategy based on preset acoustic feature, extract the voice feature data in the voice, such as extraction institute predicate
Mel-frequency cepstrum coefficient (Mel Frequency Cepstral Coefficients, MFCCs) in sound.
S202, the operating system obtain the corresponding exclusive sound bank of the destination application.
Wherein, the exclusive sound bank includes the correspondence between voice feature data and word, and the text included
Word is the common word in destination application operational process, and the data volume of the exclusive sound bank is less than the acquiescence of mobile terminal
Sound bank.
S203, the operating system determine the voice pair according to the voice feature data and the exclusive sound bank
The word answered.
Wherein, word can include word, phrase, phrase and expression etc., not do unique restriction herein.
S204, the operating system determine the phonetic entry result of the runnable interface according to the word.
As can be seen that in the embodiment of the present application, mobile terminal can carry out speech voice input function in application program rank
Precise control, the exclusive sound bank being adapted to especially by determining destination application determine to use according to the exclusive sound bank
The corresponding word of voice of family input, and the phonetic entry of runnable interface is determined as a result, avoiding all applications equal according to the word
It loads same Default sound library and the individual demand of different application, and the exclusive language of application-specific can not be met
Sound library can be better than the sound bank of system default in terms of accuracy and response real-time, be conducive to improve running of mobile terminal target
The real-time of phonetic entry and accuracy during application program.
In a possible example, the operating system obtains the corresponding exclusive sound bank of the destination application,
Including:The operating system inquires the mapping relations between the application program to prestore and exclusive sound bank, determines that the target should
With the corresponding exclusive sound bank of program.
Wherein, mobile terminal can record and analyze to obtain based on the history phonetic entry of user the application program and exclusive
Mapping relations and/or mobile terminal between sound bank can receive the application program that sends from server and exclusive
Mapping relations between sound bank, the former is more adapted to the use habit of householder user, and more accurately, the latter is based on big data point
Analysis, it is more comprehensive.
As it can be seen that in this example, the operating system of mobile terminal can be based on preset using journey for each application program
Mapping relations between sequence and exclusive sound bank are adapted to corresponding exclusive sound bank for each application program, so as to fulfill answering
With program level Precise control speech voice input function, the accuracy of mobile terminal control voice input function is improved.
In a possible example, the operating system obtains the corresponding exclusive sound bank of the destination application,
Including:The operating system determines the corresponding exclusive sound bank set of the destination application;The operating system is according to institute
State the internal operation scene that runnable interface determines the destination application;The operating system inquires the exclusive sound bank collection
Mapping relations in conjunction between internal operation scene and exclusive sound bank determine the corresponding exclusive voice of the internal operation scene
Library is the corresponding exclusive sound bank of the destination application.
Wherein, the internal operation scene of destination application can be divided according to runnable interface or according to function
It is divided and is either divided by developer or User Defined, unique restriction is not done herein, for example, with intended application journey
Sequence is for game application, the internal operation scene in game application can include store scene, group's battlefield scape, group
Team's scene etc..
Wherein, mobile terminal can record and analyze to obtain based on the history phonetic entry of user the internal operation scene and
Mapping relations and/or mobile terminal between exclusive sound bank can be received from the internal operation field that server is sent
Mapping relations between scape and exclusive sound bank, the former is more adapted to the use habit of householder user, and more accurately, the latter is based on big
Data analysis, it is more comprehensive.
In the specific implementation, the operating system determines the internal operation of the destination application according to the runnable interface
The specific implementation of scene can be:The interface of runnable interface that the operating system reception is sent from destination application
Information determines the internal operation scene of the destination application according to the interface information.
Wherein, operating system can set voice management module and sound bank policy module, pass through the voice management module
It receives the interface information of destination application and corresponding internal operation scene is determined according to the interface information, it is interior according to this
Portion's Run-time scenario determines corresponding sound bank policy module, inquires the sound bank policy module and obtains corresponding exclusive sound bank collection
Exclusive sound bank in conjunction.
Wherein, preset data channel is included between the management module of the destination application and the operating system, is led to
The interface information of the runnable interface can be transmitted by crossing the preset data channel;The preset data channel is destination application
When being currently running, the effective data transmission link established between destination application and operating system, destination application is not
After operation, which can be eliminated.Wherein, it when destination application and operating system are communicated, can adopt
With the data transmission format and data transfer mode appointed, data communication form can select at present the relatively JS objects of mainstream
Mark the numbers such as (JavaScript Object Notation, JSON), agreement buffering (Protocol Buffer, Protobuf)
According to transformat or customized outputting communication form.Game application and operating system can select relatively common
Data transfer mode, such as the schemes such as socket communication, shared drive or file, FIFO, game application and operation
System, which must be appointed, takes specified data transmission mode, and such game application and operating system can just be established feasible
Data transmission channel.
As it can be seen that in this example, the operating system of mobile terminal is directed to the internal operation scene of each application program, Neng Gouji
Mapping relations between preset internal operation scene and exclusive sound bank, it is corresponding specially for each internal operation scene adaptation
Belong to sound bank, so as to fulfill in internal Run-time scenario level Precise control speech voice input function, raising mobile terminal control language
The accuracy of sound input function.
In a possible example, the operating system according to the voice feature data and the exclusive sound bank,
Determine the corresponding word of the voice, including:The operating system inquires the exclusive voice according to the voice feature data
Mapping relations in library between voice feature data and word obtain and the matched word of the voice feature data;The behaviour
Determine that the word is the corresponding word of the voice as system.
Wherein, mobile terminal can record and analyze to obtain based on the history phonetic entry of user the voice feature data with
Mapping relations and/or mobile terminal between word can be received from the voice feature data that server is sent and text
Mapping relations between word, the former is more adapted to the use habit of householder user, and more accurately, the latter is based on big data analysis, more
Add comprehensive.
As it can be seen that in this example, the operating system of mobile terminal is by inquiring voice feature data and text in exclusive sound bank
Mapping relations between word, can be with the matched word of determining voice feature data of quickness and high efficiency.
In a possible example, the method further includes:The operating system is detecting mesh described in front stage operation
When marking application program, the exclusive sound bank is loaded.
As it can be seen that in this example, by the corresponding exclusive sound bank of pre-loaded front stage operation destination application, so as to hold
During row speech voice input function, the loading of sound bank is carried out without short time consumption, improves real-time.
In a possible example, the operating system determines the phonetic entry of the runnable interface according to the word
As a result, including:The operating system determines the reference language classification that the chat feature of the runnable interface is supported;The operation
The language category of system detectio to the word is consistent with the reference language classification, and it is the runnable interface to determine the word
Phonetic entry result;The operating system detects that the language category of the word and the reference language classification are inconsistent,
According to word described in the reference language category-translation, and determine the phonetic entry knot that the word after translation is the runnable interface
Fruit.
Wherein, the reference language classification can be Chinese, English, Korean, Japanese etc., not do unique restriction herein.
As it can be seen that in this example, the operating system of mobile terminal is after the corresponding word of voice is determined, moreover it is possible to by the word
Corresponding translation is carried out according to the language category that current runnable interface is supported, improves the flexibility and accuracy of word input.
It is consistent with above-mentioned embodiment shown in Fig. 2, referring to Fig. 3, Fig. 3 is a kind of voice provided by the embodiments of the present application
The flow diagram of input method, applied to mobile terminal, the running of mobile terminal has operating system and one or more should
Use program.As shown in the figure, this pronunciation inputting method includes:
S301, the operating system handle collected voice as voice feature data, and the foreground of the mobile terminal is transported
Row has the runnable interface of destination application.
S302, the operating system inquire the mapping relations between the application program to prestore and exclusive sound bank, determine institute
State the corresponding exclusive sound bank of destination application.
S303, the operating system determine the voice pair according to the voice feature data and the exclusive sound bank
The word answered.
S304, the operating system determine the phonetic entry result of the runnable interface according to the word.
As can be seen that in the embodiment of the present application, mobile terminal can carry out speech voice input function in application program rank
Precise control, the exclusive sound bank being adapted to especially by determining destination application determine to use according to the exclusive sound bank
The corresponding word of voice of family input, and the phonetic entry of runnable interface is determined as a result, avoiding all applications equal according to the word
It loads same Default sound library and the individual demand of different application, and the exclusive language of application-specific can not be met
Sound library can be better than the sound bank of system default in terms of accuracy and response real-time, be conducive to improve running of mobile terminal target
The real-time of phonetic entry and accuracy during application program.
In addition, the operating system of mobile terminal is directed to each application program, preset application program and exclusive can be based on
Mapping relations between sound bank are adapted to corresponding exclusive sound bank, so as to fulfill in application layer for each application program
Face Precise control speech voice input function improves the accuracy of mobile terminal control voice input function.
It is consistent with above-mentioned embodiment shown in Fig. 2, referring to Fig. 4, Fig. 4 is a kind of voice provided by the embodiments of the present application
The flow diagram of input method, being run applied to mobile terminal, on above-mentioned mobile terminal has operating system and one or more
Destination application.As shown in the figure, this pronunciation inputting method includes:
S401, the operating system load the exclusive voice when detecting destination application described in front stage operation
Library.
S402, the operating system handle collected voice as voice feature data, and the foreground of the mobile terminal is transported
Row has the runnable interface of destination application.
S403, the operating system determine the corresponding exclusive sound bank set of the destination application;
S404, the operating system determine the internal operation scene of the destination application according to the runnable interface.
S405, the operating system are inquired in the exclusive sound bank set between internal operation scene and exclusive sound bank
Mapping relations, determine the corresponding exclusive sound bank of the internal operation scene be the corresponding exclusive language of the destination application
Sound library.
S406, the operating system inquire phonetic feature number in the exclusive sound bank according to the voice feature data
According to the mapping relations between word, obtain and the matched word of the voice feature data.
S407, the operating system determine that the word is the corresponding word of the voice.
S408, the operating system determine the reference language classification that the chat feature of the runnable interface is supported,
S409, the operating system detect that the language category of the word is consistent with the reference language classification, determine
The word is the phonetic entry result of the runnable interface.
S4010, the operating system detect that the language category of the word and the reference language classification are inconsistent, root
According to word described in the reference language category-translation, and determine the phonetic entry knot that the word after translation is the runnable interface
Fruit.
As can be seen that in the embodiment of the present application, mobile terminal can carry out speech voice input function in application program rank
Precise control, the exclusive sound bank being adapted to especially by determining destination application determine to use according to the exclusive sound bank
The corresponding word of voice of family input, and the phonetic entry of runnable interface is determined as a result, avoiding all applications equal according to the word
It loads same Default sound library and the individual demand of different application, and the exclusive language of application-specific can not be met
Sound library can be better than the sound bank of system default in terms of accuracy and response real-time, be conducive to improve running of mobile terminal target
The real-time of phonetic entry and accuracy during application program.
In addition, the operating system of mobile terminal is directed to the internal operation scene of each application program, can be based on preset
Mapping relations between internal operation scene and exclusive sound bank are adapted to corresponding exclusive voice for each internal operation scene
Library, so as to fulfill in internal Run-time scenario level Precise control speech voice input function, the control voice input of raising mobile terminal
The accuracy of function.
In addition, the operating system of mobile terminal is by inquiring reflecting between voice feature data and word in exclusive sound bank
Relationship is penetrated, it can be with the matched word of determining voice feature data of quickness and high efficiency.
In addition, by the corresponding exclusive sound bank of pre-loaded front stage operation destination application, it is defeated so as to perform voice
When entering function, the loading of sound bank is carried out without short time consumption, improves real-time.
In addition, the operating system of mobile terminal is after the corresponding word of voice is determined, moreover it is possible to by the word according to current
The language category that runnable interface is supported carries out corresponding translation, improves the flexibility and accuracy of word input.
It is consistent with above-mentioned Fig. 2, Fig. 3, embodiment shown in Fig. 4, referring to Fig. 5, Fig. 5 is provided by the embodiments of the present application
A kind of structure diagram of mobile terminal, the running of mobile terminal there are one or multiple application programs and operating system, such as figure institute
Show, which includes processor, memory, communication interface and one or more programs, wherein, said one or multiple
Program is different from said one or multiple application programs, and said one or multiple programs are stored in above-mentioned memory, and
And be configured to be performed by above-mentioned processor, above procedure includes the instruction for performing following steps;
Collected voice is handled as voice feature data, the front stage operation of the mobile terminal has destination application
Runnable interface;
Obtain the corresponding exclusive sound bank of the destination application;
According to the voice feature data and the exclusive sound bank, the corresponding word of the voice is determined;
The phonetic entry result of the runnable interface is determined according to the word.
As can be seen that in the embodiment of the present application, mobile terminal can carry out speech voice input function in application program rank
Precise control, the exclusive sound bank being adapted to especially by determining destination application determine to use according to the exclusive sound bank
The corresponding word of voice of family input, and the phonetic entry of runnable interface is determined as a result, avoiding all applications equal according to the word
It loads same Default sound library and the individual demand of different application, and the exclusive language of application-specific can not be met
Sound library can be better than the sound bank of system default in terms of accuracy and response real-time, be conducive to improve running of mobile terminal target
The real-time of phonetic entry and accuracy during application program.
In a possible example, in terms of the corresponding exclusive sound bank of the acquisition destination application, institute
The instruction in program is stated to be specifically used for performing following operate:The mapping inquired between the application program to prestore and exclusive sound bank is closed
System, determines the corresponding exclusive sound bank of the destination application.
In a possible example, in terms of the corresponding exclusive sound bank of the acquisition destination application, institute
The instruction in program is stated to be specifically used for performing following operate:Determine the corresponding exclusive sound bank set of the destination application;
The internal operation scene of the destination application is determined according to the runnable interface;And the inquiry exclusive sound bank set
Mapping relations between middle internal operation scene and exclusive sound bank determine the corresponding exclusive sound bank of the internal operation scene
For the corresponding exclusive sound bank of the destination application.
In a possible example, described according to the voice feature data and the exclusive sound bank, institute is determined
In terms of the corresponding word of predicate sound, the instruction in described program is specifically used for performing following operate:According to the phonetic feature number
According to the mapping relations in the inquiry exclusive sound bank between voice feature data and word obtain and the phonetic feature number
According to matched word;
In a possible example, described program further includes instructions for performing the following operations:Detecting foreground
When running the destination application, the exclusive sound bank is loaded.
In a possible example, in the phonetic entry result side that the runnable interface is determined according to the word
Face, the instruction in described program are specifically used for performing following operate:Determine the ginseng that the chat feature of the runnable interface is supported
Examine language category;And detect that the language category of the word is consistent with the reference language classification, determine that the word is
The phonetic entry result of the runnable interface;And detect that the language category of the word and the reference language classification differ
It causes, according to word described in the reference language category-translation, and determines that the word after translation is defeated for the voice of the runnable interface
Enter result.
It is above-mentioned that mainly the scheme of the embodiment of the present application is described from the angle of method side implementation procedure.It is appreciated that
, for mobile terminal in order to realize above-mentioned function, it comprises perform the corresponding hardware configuration of each function and/or software mould
Block.Those skilled in the art should be readily appreciated that, with reference to each exemplary unit of the embodiments described herein description
And algorithm steps, the application can be realized with the combining form of hardware or hardware and computer software.Some function actually with
Hardware or computer software drive the mode of hardware to perform, depending on the specific application of technical solution and design constraint item
Part.Professional technician specifically can realize described function to each using distinct methods, but this reality
Now it is not considered that beyond scope of the present application.
The embodiment of the present application can carry out mobile terminal according to the above method example division of functional unit, for example, can
Each functional unit is divided with each function of correspondence, two or more functions can also be integrated in a processing unit
In.The form that hardware had both may be used in above-mentioned integrated unit is realized, can also be realized in the form of SFU software functional unit.It needs
It is noted that be schematical, only a kind of division of logic function to the division of unit in the embodiment of the present application, it is practical real
There can be other dividing mode now.
In the case of using integrated unit, Fig. 6 shows speech input device involved in above-described embodiment
A kind of possible functional unit composition block diagram.The speech input device 600 is applied to mobile terminal, including processing unit 601, obtains
Unit 602 and determination unit 603 are taken, wherein,
The processing unit 601, for handling collected voice as voice feature data, the foreground of the mobile terminal
Operation has the runnable interface of destination application;
The acquiring unit 602, for obtaining the corresponding exclusive sound bank of the destination application;
The determination unit 603, for according to the voice feature data and the exclusive sound bank, determining the voice
Corresponding word;
The determination unit 603, also with the phonetic entry result that the runnable interface is determined according to the word.
As can be seen that in the embodiment of the present application, mobile terminal can carry out speech voice input function in application program rank
Precise control, the exclusive sound bank being adapted to especially by determining destination application determine to use according to the exclusive sound bank
The corresponding word of voice of family input, and the phonetic entry of runnable interface is determined as a result, avoiding all applications equal according to the word
It loads same Default sound library and the individual demand of different application, and the exclusive language of application-specific can not be met
Sound library can be better than the sound bank of system default in terms of accuracy and response real-time, be conducive to improve running of mobile terminal target
The real-time of phonetic entry and accuracy during application program.
In a possible example, in terms of the corresponding exclusive sound bank of the acquisition destination application, institute
Acquiring unit 602 is stated to be specifically used for:The mapping relations between the application program to prestore and exclusive sound bank are inquired, determine the mesh
Mark the corresponding exclusive sound bank of application program.
In a possible example, in terms of the corresponding exclusive sound bank of the acquisition destination application, institute
Acquiring unit 602 is stated to be specifically used for:Determine the corresponding exclusive sound bank set of the destination application;According to operation circle
Face determines the internal operation scene of the destination application;And internal operation scene in the inquiry exclusive sound bank set
Mapping relations between exclusive sound bank, it is the intended application to determine the corresponding exclusive sound bank of the internal operation scene
The corresponding exclusive sound bank of program.
In a possible example, described according to the voice feature data and the exclusive sound bank, institute is determined
In terms of the corresponding word of predicate sound, the determination unit 603 is specifically used for:According to the voice feature data, inquire described special
Belong to the mapping relations between voice feature data and word in sound bank, obtain and the matched word of the voice feature data;
And determine that the word is the corresponding word of the voice.
In a possible example, the speech input device further includes loading unit;The loading unit, for
When detecting destination application described in front stage operation, the exclusive sound bank is loaded.
In a possible example, in the phonetic entry result side that the runnable interface is determined according to the word
Face, the determination unit 603 are specifically used for:Determine the reference language classification that the chat feature of the runnable interface is supported;With
And detect that the language category of the word is consistent with the reference language classification, determine the word for the runnable interface
Phonetic entry result;And detect that the language category of the word and the reference language classification are inconsistent, according to the ginseng
Word described in written comments on the work, etc of public of officials speech category-translation, and determine the phonetic entry result that the word after translation is the runnable interface.
Wherein, the processing unit 601 and the determination unit 603 can be application processors, the acquiring unit 602
Can be application processor and memory.
The embodiment of the present application also provides a kind of computer storage media, wherein, computer storage media storage is for electricity
The computer program that subdata exchanges, the computer program cause computer to perform any as described in above-mentioned embodiment of the method
The part or all of step of method, above computer include mobile terminal.
The embodiment of the present application also provides a kind of computer program product, and above computer program product includes storing calculating
The non-transient computer readable storage medium of machine program, above computer program are operable to that computer is made to perform such as above-mentioned side
The part or all of step of either method described in method embodiment.The computer program product can be a software installation
Packet, above computer include mobile terminal.
It should be noted that for aforementioned each method embodiment, in order to be briefly described, therefore it is all expressed as a series of
Combination of actions, but those skilled in the art should know, the application is not limited by described sequence of movement because
According to the application, certain steps may be used other sequences or be carried out at the same time.Secondly, those skilled in the art should also know
It knows, embodiment described in this description belongs to preferred embodiment, involved action and module not necessarily the application
It is necessary.
In the above-described embodiments, it all emphasizes particularly on different fields to the description of each embodiment, there is no the portion being described in detail in some embodiment
Point, it may refer to the associated description of other embodiment.
In several embodiments provided herein, it should be understood that disclosed device, it can be by another way
It realizes.For example, the apparatus embodiments described above are merely exemplary, such as the division of said units, it is only a kind of
Division of logic function, can there is an other dividing mode in actual implementation, such as multiple units or component can combine or can
To be integrated into another system or some features can be ignored or does not perform.Another point, shown or discussed is mutual
Coupling, direct-coupling or communication connection can be by some interfaces, the INDIRECT COUPLING or communication connection of device or unit,
Can be electrical or other forms.
The above-mentioned unit illustrated as separating component may or may not be physically separate, be shown as unit
The component shown may or may not be physical unit, you can be located at a place or can also be distributed to multiple
In network element.Some or all of unit therein can be selected according to the actual needs to realize the mesh of this embodiment scheme
's.
In addition, each functional unit in each embodiment of the application can be integrated in a processing unit, it can also
That each unit is individually physically present, can also two or more units integrate in a unit.Above-mentioned integrated list
The form that hardware had both may be used in member is realized, can also be realized in the form of SFU software functional unit.
If above-mentioned integrated unit is realized in the form of SFU software functional unit and is independent product sale or uses
When, it can be stored in a computer-readable access to memory.Based on such understanding, the technical solution of the application substantially or
Person say the part contribute to the prior art or the technical solution all or part can in the form of software product body
Reveal and, which is stored in a memory, is used including some instructions so that a computer equipment
(can be personal computer, server or network equipment etc.) performs all or part of each embodiment above method of the application
Step.And aforementioned memory includes:USB flash disk, read-only memory (ROM, Read-Only Memory), random access memory
The various media that can store program code such as (RAM, Random Access Memory), mobile hard disk, magnetic disc or CD.
One of ordinary skill in the art will appreciate that all or part of step in the various methods of above-described embodiment is can
It is completed with instructing relevant hardware by program, which can be stored in a computer-readable memory, memory
It can include:Flash disk, read-only memory (English:Read-Only Memory, referred to as:ROM), random access device (English:
Random Access Memory, referred to as:RAM), disk or CD etc..
The embodiment of the present application is described in detail above, specific case used herein to the principle of the application and
Embodiment is expounded, and the explanation of above example is only intended to help to understand the present processes and its core concept;
Meanwhile for those of ordinary skill in the art, according to the thought of the application, can in specific embodiments and applications
There is change part, in conclusion the content of the present specification should not be construed as the limitation to the application.
Claims (10)
1. a kind of pronunciation inputting method, which is characterized in that applied to mobile terminal, the running of mobile terminal have operating system and
One or more application program, the method includes:
The operating system handles collected voice as voice feature data, and the front stage operation of the mobile terminal has target should
With the runnable interface of program;
The operating system obtains the corresponding exclusive sound bank of the destination application;
The operating system determines the corresponding word of the voice according to the voice feature data and the exclusive sound bank;
The operating system determines the phonetic entry result of the runnable interface according to the word.
2. according to the method described in claim 1, it is characterized in that, the operating system obtains the destination application correspondence
Exclusive sound bank, including:
The operating system inquires the mapping relations between the application program to prestore and exclusive sound bank, determines the intended application
The corresponding exclusive sound bank of program.
3. according to the method described in claim 1, it is characterized in that, the operating system obtains the destination application correspondence
Exclusive sound bank, including:
The operating system determines the corresponding exclusive sound bank set of the destination application;
The operating system determines the internal operation scene of the destination application according to the runnable interface;
The mapping that the operating system inquires in the exclusive sound bank set between internal operation scene and exclusive sound bank is closed
System, it is the corresponding exclusive sound bank of the destination application to determine the corresponding exclusive sound bank of the internal operation scene.
4. according to claim 1-3 any one of them methods, which is characterized in that the operating system is according to the phonetic feature
Data and the exclusive sound bank, determine the corresponding word of the voice, including:
The operating system according to the voice feature data, inquire in the exclusive sound bank voice feature data and word it
Between mapping relations, obtain with the matched word of the voice feature data;
The operating system determines that the word is the corresponding word of the voice.
5. according to claim 1-4 any one of them methods, which is characterized in that the method further includes:
The operating system loads the exclusive sound bank when detecting destination application described in front stage operation.
6. according to claim 1-5 any one of them methods, which is characterized in that the operating system is determined according to the word
The phonetic entry of the runnable interface as a result, including:
The operating system determines the reference language classification that the chat feature of the runnable interface is supported;
The operating system detects that the language category of the word is consistent with the reference language classification, determines that the word is
The phonetic entry result of the runnable interface;
The operating system detects that the language category of the word and the reference language classification are inconsistent, according to the reference
Language category translates the word, and determines the phonetic entry result that the word after translation is the runnable interface.
7. a kind of speech input device, which is characterized in that applied to mobile terminal, the running of mobile terminal have operating system and
One or more application program, the speech input device include processing unit, acquiring unit and determination unit, wherein,
The processing unit, for handling collected voice as voice feature data, the front stage operation of the mobile terminal has
The runnable interface of destination application;
The acquiring unit, for obtaining the corresponding exclusive sound bank of the destination application;
The determination unit, for according to the voice feature data and the exclusive sound bank, determining that the voice is corresponding
Word;
The determination unit, also with the phonetic entry result that the runnable interface is determined according to the word.
8. speech input device according to claim 7, which is characterized in that obtain the destination application pair described
In terms of the exclusive sound bank answered, the acquiring unit is specifically used for:It inquires between the application program to prestore and exclusive sound bank
Mapping relations determine the corresponding exclusive sound bank of the destination application.
9. a kind of mobile terminal, which is characterized in that including processor, memory, communication interface and one or more program,
In, one or more of programs are stored in the memory, and are configured to be performed by the processor, described program
Instruction including being used for the step in any one of perform claim requirement 1-6 methods.
10. a kind of computer readable storage medium, which is characterized in that the computer-readable recording medium storage is used for electron number
According to the computer program of exchange, wherein, the computer program causes computer to perform such as claim 1-6 any one of them
Method, the computer include mobile terminal.
Priority Applications (1)
Application Number | Priority Date | Filing Date | Title |
---|---|---|---|
CN201711461498.1A CN108196814A (en) | 2017-12-28 | 2017-12-28 | Pronunciation inputting method and Related product |
Applications Claiming Priority (1)
Application Number | Priority Date | Filing Date | Title |
---|---|---|---|
CN201711461498.1A CN108196814A (en) | 2017-12-28 | 2017-12-28 | Pronunciation inputting method and Related product |
Publications (1)
Publication Number | Publication Date |
---|---|
CN108196814A true CN108196814A (en) | 2018-06-22 |
Family
ID=62585696
Family Applications (1)
Application Number | Title | Priority Date | Filing Date |
---|---|---|---|
CN201711461498.1A Pending CN108196814A (en) | 2017-12-28 | 2017-12-28 | Pronunciation inputting method and Related product |
Country Status (1)
Country | Link |
---|---|
CN (1) | CN108196814A (en) |
Cited By (7)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
CN109086276A (en) * | 2018-08-27 | 2018-12-25 | Oppo广东移动通信有限公司 | Data translating method, device, terminal and storage medium |
CN110034976A (en) * | 2019-04-08 | 2019-07-19 | Oppo广东移动通信有限公司 | A kind of method and device of data identification |
CN110910889A (en) * | 2018-08-28 | 2020-03-24 | 宏碁股份有限公司 | Multimedia processing circuit and electronic system |
CN111354360A (en) * | 2020-03-17 | 2020-06-30 | 北京百度网讯科技有限公司 | Voice interaction processing method and device and electronic equipment |
CN111841006A (en) * | 2019-04-19 | 2020-10-30 | 宏碁股份有限公司 | Multimedia processing method and electronic system |
CN113886100A (en) * | 2021-09-23 | 2022-01-04 | 阿波罗智联(北京)科技有限公司 | Voice data processing method, device, equipment and storage medium |
US11699429B2 (en) | 2018-08-28 | 2023-07-11 | Acer Incorporated | Multimedia processing method and electronic system |
Citations (7)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
CN102855873A (en) * | 2012-08-03 | 2013-01-02 | 海信集团有限公司 | Electronic equipment and method used for controlling same |
CN103593134A (en) * | 2012-08-17 | 2014-02-19 | 上海博泰悦臻电子设备制造有限公司 | Control method of vehicle device and voice function |
CN104063136A (en) * | 2013-07-02 | 2014-09-24 | 姜洪明 | Mobile operation system |
CN104965824A (en) * | 2015-06-11 | 2015-10-07 | 胡开标 | Real-time text and speech translation system |
US20150378671A1 (en) * | 2014-06-27 | 2015-12-31 | Nuance Communications, Inc. | System and method for allowing user intervention in a speech recognition process |
CN105893131A (en) * | 2016-04-01 | 2016-08-24 | 惠州Tcl移动通信有限公司 | Method and system for starting mobile phone application through voice |
CN106920546A (en) * | 2015-12-23 | 2017-07-04 | 小米科技有限责任公司 | The method and device of Intelligent Recognition voice |
-
2017
- 2017-12-28 CN CN201711461498.1A patent/CN108196814A/en active Pending
Patent Citations (7)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
CN102855873A (en) * | 2012-08-03 | 2013-01-02 | 海信集团有限公司 | Electronic equipment and method used for controlling same |
CN103593134A (en) * | 2012-08-17 | 2014-02-19 | 上海博泰悦臻电子设备制造有限公司 | Control method of vehicle device and voice function |
CN104063136A (en) * | 2013-07-02 | 2014-09-24 | 姜洪明 | Mobile operation system |
US20150378671A1 (en) * | 2014-06-27 | 2015-12-31 | Nuance Communications, Inc. | System and method for allowing user intervention in a speech recognition process |
CN104965824A (en) * | 2015-06-11 | 2015-10-07 | 胡开标 | Real-time text and speech translation system |
CN106920546A (en) * | 2015-12-23 | 2017-07-04 | 小米科技有限责任公司 | The method and device of Intelligent Recognition voice |
CN105893131A (en) * | 2016-04-01 | 2016-08-24 | 惠州Tcl移动通信有限公司 | Method and system for starting mobile phone application through voice |
Cited By (10)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
CN109086276A (en) * | 2018-08-27 | 2018-12-25 | Oppo广东移动通信有限公司 | Data translating method, device, terminal and storage medium |
CN109086276B (en) * | 2018-08-27 | 2022-12-06 | Oppo广东移动通信有限公司 | Data translation method, device, terminal and storage medium |
CN110910889A (en) * | 2018-08-28 | 2020-03-24 | 宏碁股份有限公司 | Multimedia processing circuit and electronic system |
US11482229B2 (en) | 2018-08-28 | 2022-10-25 | Acer Incorporated | Multimedia processing circuit and electronic system |
US11699429B2 (en) | 2018-08-28 | 2023-07-11 | Acer Incorporated | Multimedia processing method and electronic system |
US11948581B2 (en) | 2018-08-28 | 2024-04-02 | Acer Incorporated | Smart interpreter engine and electronic system |
CN110034976A (en) * | 2019-04-08 | 2019-07-19 | Oppo广东移动通信有限公司 | A kind of method and device of data identification |
CN111841006A (en) * | 2019-04-19 | 2020-10-30 | 宏碁股份有限公司 | Multimedia processing method and electronic system |
CN111354360A (en) * | 2020-03-17 | 2020-06-30 | 北京百度网讯科技有限公司 | Voice interaction processing method and device and electronic equipment |
CN113886100A (en) * | 2021-09-23 | 2022-01-04 | 阿波罗智联(北京)科技有限公司 | Voice data processing method, device, equipment and storage medium |
Similar Documents
Publication | Publication Date | Title |
---|---|---|
CN108196814A (en) | Pronunciation inputting method and Related product | |
CN107861814A (en) | Resource allocation method and equipment | |
CN107426432B (en) | Resource allocation method and Related product | |
CN108091333A (en) | Sound control method and Related product | |
CN108242837A (en) | The method of the charging of electronic equipment and control electronics | |
CN108037999A (en) | Resource allocation method and Related product | |
US11274932B2 (en) | Navigation method, navigation device, and storage medium | |
CN108536480A (en) | Input method configuration method and related product | |
CN107635078A (en) | Game control method and equipment | |
CN107797868A (en) | resource adjusting method and device | |
CN109379247A (en) | The method and device that the network delay of a kind of pair of application program is detected | |
CN103763112B (en) | A kind of user identity protection method and apparatus | |
CN111050370A (en) | Network switching method and device, storage medium and electronic equipment | |
CN107807852A (en) | Application program capacity control method and equipment | |
CN107786738B (en) | Network control method and equipment | |
CN107995357A (en) | Resource allocation method and device | |
CN107861603A (en) | Power consumption control method and equipment | |
CN108549568A (en) | Using entrance processing method, apparatus, storage medium and electronic equipment | |
CN109254793A (en) | Engine partition method, relevant device and computer readable storage medium | |
CN108369479A (en) | Target selection on small form factor display | |
CN107993672A (en) | Frequency expansion method and device | |
CN103959199A (en) | Power saving method and apparatus for first in first out (FIFO) memories | |
CN109753425A (en) | Pop-up processing method and processing device | |
CN109348467A (en) | Emergency call realization method, electronic device and computer readable storage medium | |
CN107832148A (en) | Performance optimization method and equipment |
Legal Events
Date | Code | Title | Description |
---|---|---|---|
PB01 | Publication | ||
PB01 | Publication | ||
SE01 | Entry into force of request for substantive examination | ||
SE01 | Entry into force of request for substantive examination | ||
RJ01 | Rejection of invention patent application after publication | ||
RJ01 | Rejection of invention patent application after publication |
Application publication date: 20180622 |