CN108196814A - Voice input method and related product - Google Patents

Voice input method and related product

Info

Publication number
CN108196814A
Authority
CN
China
Prior art keywords
operating system
sound bank
word
application
voice
Prior art date
Legal status (The legal status is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the status listed.)
Pending
Application number
CN201711461498.1A
Other languages
Chinese (zh)
Inventor
陈岩
程杰
Current Assignee (The listed assignees may be inaccurate. Google has not performed a legal analysis and makes no representation or warranty as to the accuracy of the list.)
Guangdong Oppo Mobile Telecommunications Corp Ltd
Original Assignee
Guangdong Oppo Mobile Telecommunications Corp Ltd
Priority date (The priority date is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the date listed.)
Filing date
Publication date
Application filed by Guangdong Oppo Mobile Telecommunications Corp Ltd
Priority to CN201711461498.1A
Publication of CN108196814A
Legal status: Pending

Classifications

    • G: PHYSICS
    • G06: COMPUTING; CALCULATING OR COUNTING
    • G06F: ELECTRIC DIGITAL DATA PROCESSING
    • G06F3/00: Input arrangements for transferring data to be processed into a form capable of being handled by the computer; Output arrangements for transferring data from processing unit to output unit, e.g. interface arrangements
    • G06F3/16: Sound input; Sound output
    • G06F3/167: Audio in a user interface, e.g. using voice commands for navigating, audio feedback
    • G: PHYSICS
    • G06: COMPUTING; CALCULATING OR COUNTING
    • G06F: ELECTRIC DIGITAL DATA PROCESSING
    • G06F3/00: Input arrangements for transferring data to be processed into a form capable of being handled by the computer; Output arrangements for transferring data from processing unit to output unit, e.g. interface arrangements
    • G06F3/01: Input arrangements or combined input and output arrangements for interaction between user and computer
    • G06F3/02: Input arrangements using manually operated switches, e.g. using keyboards or dials
    • G06F3/023: Arrangements for converting discrete items of information into a coded form, e.g. arrangements for interpreting keyboard generated codes as alphanumeric codes, operand codes or instruction codes
    • G06F3/0233: Character input methods
    • G: PHYSICS
    • G10: MUSICAL INSTRUMENTS; ACOUSTICS
    • G10L: SPEECH ANALYSIS OR SYNTHESIS; SPEECH RECOGNITION; SPEECH OR VOICE PROCESSING; SPEECH OR AUDIO CODING OR DECODING
    • G10L15/00: Speech recognition
    • G10L15/26: Speech to text systems

Abstract

The embodiments of the present application disclose a voice input method and a related product. The method includes: the operating system processes collected voice into voice feature data, where a running interface of a target application is running in the foreground of the mobile terminal; the operating system obtains the exclusive voice library corresponding to the target application; the operating system determines the text corresponding to the voice according to the voice feature data and the exclusive voice library; and the operating system determines the voice input result of the running interface according to the text. The embodiments of the present application help improve the real-time performance and accuracy of voice input when the mobile terminal runs the target application.

Description

Voice input method and related product
Technical field
This application relates to the technical field of mobile terminals, and in particular to a voice input method and a related product.
Background
With the rapid development of technologies related to mobile terminals such as smartphones, more and more applications are installed on users' phones, such as reading, payment, game and music applications, and people's daily needs (clothing, food, housing and transportation) have become inseparable from the mobile phone. While using an application, a user can enable the input method function and use the system default input method for keyword searches and text input.
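Summary of the invention
The embodiments of the present application provide a voice input method and a related product, which can improve the real-time performance and accuracy of voice input when a mobile terminal runs a target application.
In a first aspect, an embodiment of the present application provides a voice input method applied to a mobile terminal on which an operating system and one or more applications run. The method includes: the operating system processes collected voice into voice feature data, where a running interface of a target application is running in the foreground of the mobile terminal; the operating system obtains the exclusive voice library corresponding to the target application; the operating system determines the text corresponding to the voice according to the voice feature data and the exclusive voice library; and the operating system determines the voice input result of the running interface according to the text.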
In a second aspect, an embodiment of the present application provides a voice input device applied to a mobile terminal on which an operating system and one or more applications run. The voice input device includes a processing unit, an obtaining unit and a determining unit, wherein:
the processing unit is configured to process collected voice into voice feature data, where a running interface of a target application is running in the foreground of the mobile terminal;
the obtaining unit is configured to obtain the exclusive voice library corresponding to the target application;
the determining unit is configured to determine the text corresponding to the voice according to the voice feature data and the exclusive voice library;
the determining unit is further configured to determine the voice input result of the running interface according to the text.
In a third aspect, an embodiment of the present application provides a mobile terminal including a processor, a memory, a communication interface and one or more programs, where the one or more programs are stored in the memory and configured to be executed by the processor, and the programs include instructions for performing the steps of any method of the first aspect of the embodiments of the present application.
In a fourth aspect, an embodiment of the present application provides a computer-readable storage medium that stores a computer program for electronic data interchange, where the computer program causes a computer to perform some or all of the steps described in any method of the first aspect of the embodiments of the present application, and the computer includes a mobile terminal.
In a fifth aspect, an embodiment of the present application provides a computer program product that includes a non-transitory computer-readable storage medium storing a computer program, where the computer program is operable to cause a computer to perform some or all of the steps described in any method of the first aspect of the embodiments of the present application. The computer program product may be a software installation package, and the computer includes a mobile terminal.
It can be seen that, in the embodiments of the present application, the mobile terminal can finely control the voice input function at the application level. Specifically, it determines the exclusive voice library adapted to the target application, determines the text corresponding to the voice input by the user according to that exclusive voice library, and determines the voice input result of the running interface according to the text. This avoids the situation in which all applications load the same default voice library and the individual needs of different applications cannot be met. Because the exclusive voice library of a specific application can outperform the system default voice library in accuracy and response time, this helps improve the real-time performance and accuracy of voice input when the mobile terminal runs the target application.
Description of the drawings
The drawings involved in the embodiments of the present application are briefly described below.
Fig. 1A is a schematic structural diagram of a smartphone provided by an embodiment of the present application;
Fig. 1B is a schematic diagram of the program running space of a smartphone;
Fig. 1C is a system architecture diagram of the Android system;
Fig. 2 is a schematic flowchart of a voice input method provided by an embodiment of the present application;
Fig. 3 is a schematic flowchart of a voice input method disclosed in an embodiment of the present application;
Fig. 4 is a schematic flowchart of a voice input method disclosed in an embodiment of the present application;
Fig. 5 is a schematic structural diagram of a mobile terminal disclosed in an embodiment of the present application;
Fig. 6 is a block diagram of the functional units of a mobile terminal disclosed in an embodiment of the present application.
Detailed description
To help those skilled in the art better understand the solutions of the present application, the technical solutions in the embodiments of the present application are described clearly and completely below with reference to the drawings in the embodiments. Obviously, the described embodiments are only some, not all, of the embodiments of the present application. All other embodiments obtained by those of ordinary skill in the art based on the embodiments in the present application without creative effort shall fall within the protection scope of the present application.
The terms "first", "second" and so on in the description, the claims and the above drawings of the present application are used to distinguish different objects, not to describe a particular order. In addition, the terms "include" and "have" and any variants of them are intended to cover non-exclusive inclusion. For example, a process, method, system, product or device that contains a series of steps or units is not limited to the listed steps or units, but optionally also includes steps or units that are not listed, or optionally also includes other steps or units inherent to the process, method, product or device.
Reference herein to an "embodiment" means that a particular feature, structure or characteristic described in connection with the embodiment can be included in at least one embodiment of the present application. The appearances of the phrase in various places in the description do not necessarily all refer to the same embodiment, nor to separate or alternative embodiments that are mutually exclusive of other embodiments. Those skilled in the art understand, explicitly and implicitly, that the embodiments described herein can be combined with other embodiments.
The mobile terminal involved in the embodiments of the present application may include various handheld devices with wireless communication functions (such as smartphones), in-vehicle devices, wearable devices, computing devices or other processing devices connected to a wireless modem, as well as various forms of user equipment (User Equipment, UE), mobile stations (Mobile Station, MS), terminal devices and so on. For convenience of description, the devices mentioned above are collectively referred to as mobile terminals. The operating system involved in the embodiments of the present application is a software system that uniformly manages hardware resources and provides service interfaces to the user. An example structure of the mobile terminal is introduced below, taking a smartphone as an example.
Fig. 1A is a schematic structural diagram of a smartphone 100 provided by an embodiment of the present application. The smartphone 100 includes a housing 110, a touch display screen 120, a main board 130, a battery 140 and a sub-board 150. The main board 130 carries a front camera 131, a system on chip (System on Chip, SoC) 132 (including an application processor and a baseband processor), a memory 133, a power management chip 134, a radio frequency system 135 and so on. The sub-board 150 carries an oscillator 151, an integrated sound chamber 152 and a VOOC flash charging interface 153. The touch display screen 120 may be a full screen or a special-shaped screen, which is not uniquely limited here.
The SoC 132 is the control center of the smartphone. It connects the various parts of the entire smartphone through various interfaces, and performs the various functions of the smartphone and processes data by running or executing the software programs and/or modules stored in the memory 133 and calling the data stored in the memory 133, thereby monitoring the smartphone as a whole. The SoC 132 may include one or more processing units; for example, it may integrate an application processor (AP) and a baseband processor (also called a baseband chip or baseband), where the application processor mainly handles the operating system, the user interface, applications and so on, and the baseband processor mainly handles wireless communication. It can be understood that the baseband processor may also not be integrated into the SoC 132. The SoC 132 may be, for example, a central processing unit (Central Processing Unit, CPU), a general-purpose processor, a digital signal processor (Digital Signal Processor, DSP), an application-specific integrated circuit (Application-Specific Integrated Circuit, ASIC), a field programmable gate array (Field Programmable Gate Array, FPGA) or another programmable logic device, a transistor logic device, a hardware component, or any combination of these. It can implement or execute the various illustrative logical blocks, modules and circuits described in connection with the disclosure of this application. The processor may also be a combination that implements computing functions, for example a combination of one or more microprocessors, or a combination of a DSP and a microprocessor.
The memory 133 can be used to store software programs and modules. The SoC 132 executes the various functional applications and data processing of the smartphone by running the software programs and modules stored in the memory 133. The memory 133 may mainly include a program storage area and a data storage area, where the program storage area can store the operating system, the applications required by at least one function, and so on, and the data storage area can store data created through use of the smartphone. In addition, the memory 133 may include high-speed random access memory and may also include non-volatile memory, for example at least one magnetic disk storage device, a flash memory device, or other volatile solid-state storage devices. The memory 133 may be, for example, random access memory (Random Access Memory, RAM), flash memory, read-only memory (Read Only Memory, ROM), erasable programmable read-only memory (Erasable Programmable ROM, EPROM), electrically erasable programmable read-only memory (Electrically EPROM, EEPROM), a register, a hard disk, a removable hard disk, a compact disc read-only memory (CD-ROM) or any other form of storage medium well known in the art.
Fig. 1B is a schematic diagram of the program running space of a smartphone provided by an embodiment of the present application. At present, mobile terminals such as smartphones are generally provided with a program running space that includes a user space and an operating system space. One or more applications run in the user space; these are third-party applications installed on the mobile terminal. The operating system of the mobile terminal runs in the operating system space. The mobile terminal may specifically run the Android system, the mobile operating system iOS developed by Apple Inc., and so on, which is not uniquely limited here. As shown in Fig. 1C, taking a mobile terminal running the Android system as an example, the corresponding user space includes the application layer (Applications) of the Android system, and the operating system space may include the application framework layer (Application Framework), the system runtime library layer (comprising the system libraries, Libraries, and the Android runtime, Android Runtime) and the Linux kernel layer (Linux Kernel) of the Android system. The application layer includes all kinds of applications that interact directly with the user and service programs that run in the background, written in the Java language, for example programs that implement common basic functions on a smartphone, such as short message service (Short Messaging Service, SMS) messaging, dialing, picture browsing, calendar, games, maps and World Wide Web (Web) browsing, as well as other applications developed by developers. The application framework layer provides the set of class libraries needed to develop Android applications; it can be used to reuse components and can also be extended through inheritance. The system runtime library layer supports the application framework and provides services for the components of the Android system. It consists of the system class libraries and the Android runtime. The Android runtime comprises two parts, the core libraries and the Dalvik virtual machine. The Linux kernel layer implements core functions such as hardware device drivers, process and memory management, the network protocol stack, power management and wireless communication.
In game application scenarios on a mobile terminal, players need to communicate within the team better and faster, particularly when asking teammates for emergency assistance, so the responsiveness required of the input method is higher than for ordinary applications. In general, however, the operating system does not distinguish between game and non-game scenarios, let alone optimize specifically for game scenarios.
In view of the above, the embodiments of the present application propose a voice input method for a target application of a mobile terminal. In this method, the mobile terminal can finely control the voice input function at the application level. Specifically, it determines the exclusive voice library adapted to the target application, determines the text corresponding to the voice input by the user according to that exclusive voice library, and determines the voice input result of the running interface according to the text. This avoids the situation in which all applications load the same default voice library and the individual needs of different applications cannot be met. Because the exclusive voice library of a specific application can outperform the system default voice library in accuracy and response time, this helps improve the real-time performance and accuracy of voice input when the mobile terminal runs the target application.
The embodiments of the present application are described below with reference to the drawings.
Referring to Fig. 2, Fig. 2 is a schematic flowchart of a voice input method provided by an embodiment of the present application, applied to a mobile terminal on which an operating system and one or more applications run. As shown in the figure, the voice input method includes:
S201, the operating system processes collected voice into voice feature data, where a running interface of a target application is running in the foreground of the mobile terminal.
Here, the target application refers to a third-party application installed in the user space of the mobile terminal. The third-party application may be, for example, a camera application, an instant messaging application or a game application. It may be installed by the user or pre-installed by the developer before the mobile terminal leaves the factory, which is not uniquely limited here.
In a specific implementation, the operating system may first divide the voice into multiple frames and then transform the waveform of each frame; that is, based on a preset acoustic feature extraction strategy, it extracts the voice feature data from the voice, for example the Mel-frequency cepstral coefficients (Mel Frequency Cepstral Coefficients, MFCCs) of the voice.
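The framing step described above can be sketched as follows. This is only an illustration under stated assumptions: the frame length of 400 samples and hop of 160 samples (25 ms and 10 ms at an assumed 16 kHz sampling rate) and the function names are hypothetical and do not come from the application, and the per-frame log energy is used as a simple stand-in feature. A real system would continue from the windowed frames to a full MFCC computation.

```python
import math

def split_into_frames(samples, frame_len, hop_len):
    """Split samples into overlapping frames, dropping an incomplete tail."""
    frames = []
    start = 0
    while start + frame_len <= len(samples):
        frames.append(samples[start:start + frame_len])
        start += hop_len
    return frames

def hamming_window(frame):
    """Apply a Hamming window to one frame to reduce spectral leakage."""
    n = len(frame)
    if n == 1:
        return list(frame)
    return [s * (0.54 - 0.46 * math.cos(2 * math.pi * i / (n - 1)))
            for i, s in enumerate(frame)]

def log_energy(frame):
    """Log of the frame energy; a small constant avoids log(0) on silence."""
    return math.log(sum(s * s for s in frame) + 1e-10)

def extract_features(samples, frame_len=400, hop_len=160):
    """Return one stand-in feature (log energy) per windowed frame."""
    return [log_energy(hamming_window(f))
            for f in split_into_frames(samples, frame_len, hop_len)]
```

Each returned value corresponds to one frame, so the length of the result tells the caller how many frames the utterance was divided into.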
S202, the operating system obtains the exclusive voice library corresponding to the target application.
Here, the exclusive voice library contains the correspondence between voice feature data and text; the text it contains is text commonly used while the target application is running, and the data volume of the exclusive voice library is smaller than that of the default voice library of the mobile terminal.
S203, the operating system determines the text corresponding to the voice according to the voice feature data and the exclusive voice library.
Here, the text may include characters, words, phrases, expressions and so on, which is not uniquely limited here.
S204, the operating system determines the voice input result of the running interface according to the text.
It can be seen that, in the embodiments of the present application, the mobile terminal can finely control the voice input function at the application level. Specifically, it determines the exclusive voice library adapted to the target application, determines the text corresponding to the voice input by the user according to that exclusive voice library, and determines the voice input result of the running interface according to the text. This avoids the situation in which all applications load the same default voice library and the individual needs of different applications cannot be met. Because the exclusive voice library of a specific application can outperform the system default voice library in accuracy and response time, this helps improve the real-time performance and accuracy of voice input when the mobile terminal runs the target application.
In a possible example, the operating system obtaining the exclusive voice library corresponding to the target application includes: the operating system queries a pre-stored mapping between applications and exclusive voice libraries, and determines the exclusive voice library corresponding to the target application.
Here, the mobile terminal may record and analyze the user's historical voice input to obtain the mapping between applications and exclusive voice libraries, and/or the mobile terminal may receive the mapping between applications and exclusive voice libraries sent by a server. The former is better adapted to the usage habits of the device owner and is more accurate, while the latter is based on big data analysis and is more comprehensive.
It can be seen that, in this example, for each application the operating system of the mobile terminal can adapt the corresponding exclusive voice library based on the preset mapping between applications and exclusive voice libraries, thereby finely controlling the voice input function at the application level and improving the accuracy with which the mobile terminal controls the voice input function.
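The lookup in this example can be pictured as a simple keyed query. The sketch below is a minimal illustration, not the application's implementation: the package names, library identifiers and the fallback to a default library for unmapped applications are all assumptions introduced here.

```python
# Hypothetical identifiers; the application does not name concrete libraries.
DEFAULT_LIBRARY = "default_voice_library"

APP_TO_LIBRARY = {
    "com.example.game": "game_voice_library",
    "com.example.chat": "chat_voice_library",
}

def get_exclusive_library(package_name, mapping=APP_TO_LIBRARY):
    """Return the exclusive voice library mapped to the foreground application.

    Falling back to the default library for unmapped applications is an
    assumption of this sketch.
    """
    return mapping.get(package_name, DEFAULT_LIBRARY)
```

The mapping itself could be populated from the user's input history, from a server, or from both, as described above.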
In a possible example, the operating system obtaining the exclusive voice library corresponding to the target application includes: the operating system determines the exclusive voice library set corresponding to the target application; the operating system determines the internal running scene of the target application according to the running interface; and the operating system queries the mapping between internal running scenes and exclusive voice libraries in the exclusive voice library set, and determines that the exclusive voice library corresponding to the internal running scene is the exclusive voice library corresponding to the target application.
Here, the internal running scenes of the target application may be divided according to the running interface, divided according to function, or customized by the developer or the user, which is not uniquely limited here. For example, taking a game application as the target application, the internal running scenes of the game application may include a store scene, a team battle scene, a team formation scene and so on.
Here, the mobile terminal may record and analyze the user's historical voice input to obtain the mapping between internal running scenes and exclusive voice libraries, and/or the mobile terminal may receive the mapping between internal running scenes and exclusive voice libraries sent by a server. The former is better adapted to the usage habits of the device owner and is more accurate, while the latter is based on big data analysis and is more comprehensive.
In a specific implementation, the operating system may determine the internal running scene of the target application according to the running interface as follows: the operating system receives the interface information of the running interface sent by the target application, and determines the internal running scene of the target application according to the interface information.
Here, the operating system may be provided with a voice management module and a voice library policy module. The voice management module receives the interface information of the target application and determines the corresponding internal running scene according to the interface information; the corresponding voice library policy module is determined according to that internal running scene, and that policy module is queried to obtain the corresponding exclusive voice library in the exclusive voice library set.
Here, a preset data channel exists between the target application and the management module of the operating system, and the interface information of the running interface can be transmitted through this preset data channel. The preset data channel is a valid data transmission link established between the target application and the operating system while the target application is running; after the target application stops running, the link can be torn down. When the target application and the operating system communicate, they may use an agreed data transmission format and data transmission mode. The data format may be one of the currently mainstream formats such as JavaScript Object Notation (JSON) or Protocol Buffers (Protobuf), or a custom communication format. The game application and the operating system may choose a common data transmission mode, such as socket communication, shared memory, files or a FIFO. The game application and the operating system must agree on a specified data transmission mode so that a workable data transmission channel can be established between them.
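As an illustration of the JSON option mentioned above, the interface information could be serialized and parsed as shown below. The field names "package" and "interface" and both function names are hypothetical; the application does not define a message schema. The resulting bytes could then be carried over any of the listed transports (socket, shared memory, file or FIFO).

```python
import json

def encode_interface_info(package_name, interface_id):
    """Application side: serialize the running-interface info as UTF-8 JSON."""
    message = {"package": package_name, "interface": interface_id}
    return json.dumps(message).encode("utf-8")

def decode_interface_info(payload):
    """Operating system side: parse the payload back into its two fields."""
    message = json.loads(payload.decode("utf-8"))
    return message["package"], message["interface"]
```

Agreeing on the schema in advance matters more than the transport: both sides must encode and decode the same fields for the channel to be usable.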
It can be seen that, in this example, for the internal running scenes of each application, the operating system of the mobile terminal can adapt the corresponding exclusive voice library to each internal running scene based on the preset mapping between internal running scenes and exclusive voice libraries, thereby finely controlling the voice input function at the level of internal running scenes and improving the accuracy with which the mobile terminal controls the voice input function.
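The scene-level selection described in this example amounts to a two-step lookup: interface information to internal running scene, then scene to exclusive voice library within the application's library set. The following sketch assumes hypothetical interface identifiers, scene names and library names, and returns None when no match exists; none of these details are specified in the application.

```python
# Hypothetical library set for one game application, keyed by internal scene.
LIBRARY_SETS = {
    "com.example.game": {
        "store": "store_voice_library",
        "team_battle": "battle_voice_library",
        "team_formation": "team_voice_library",
    },
}

# Hypothetical mapping from reported interface identifiers to internal scenes.
INTERFACE_TO_SCENE = {
    "ui_shop": "store",
    "ui_arena": "team_battle",
    "ui_lobby": "team_formation",
}

def select_library(package_name, interface_id):
    """Resolve interface info to a scene, then the scene to a voice library."""
    library_set = LIBRARY_SETS.get(package_name)
    if library_set is None:
        return None  # the application has no exclusive voice library set
    scene = INTERFACE_TO_SCENE.get(interface_id)
    return library_set.get(scene)  # None if the scene is unknown or unmapped
```

Keeping the two mappings separate lets the scene definitions change (for example, when customized by the developer or user) without touching the library set.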
In a possible example, the operating system determining the text corresponding to the voice according to the voice feature data and the exclusive voice library includes: the operating system queries the mapping between voice feature data and text in the exclusive voice library according to the voice feature data, and obtains the text that matches the voice feature data; and the operating system determines that this text is the text corresponding to the voice.
Here, the mobile terminal may record and analyze the user's historical voice input to obtain the mapping between voice feature data and text, and/or the mobile terminal may receive the mapping between voice feature data and text sent by a server. The former is better adapted to the usage habits of the device owner and is more accurate, while the latter is based on big data analysis and is more comprehensive.
It can be seen that, in this example, by querying the mapping between voice feature data and text in the exclusive voice library, the operating system of the mobile terminal can quickly and efficiently determine the text that matches the voice feature data.
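The application does not say how a match between voice feature data and a library entry is decided. One simple possibility, shown here purely as an assumption, is a nearest-neighbor search over stored feature vectors with a distance threshold; the function name, the fixed-length vector representation and the threshold value are all hypothetical.

```python
import math

def match_text(features, library, max_distance=1.0):
    """Return the text whose stored feature vector is closest to `features`.

    `library` is a list of (feature_vector, text) pairs. Entries of a
    different length are skipped, and None is returned when the closest
    entry is farther away than `max_distance`.
    """
    best_text = None
    best_distance = float("inf")
    for stored, text in library:
        if len(stored) != len(features):
            continue
        distance = math.sqrt(sum((a - b) ** 2 for a, b in zip(stored, features)))
        if distance < best_distance:
            best_text, best_distance = text, distance
    return best_text if best_distance <= max_distance else None
```

Because an exclusive voice library holds only the text commonly used in one application, a linear scan like this stays short, which is consistent with the responsiveness argument made in this example.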
In a possible example, the method further includes: when detecting that the target application is running in the foreground, the operating system loads the exclusive voice library.
It can be seen that, in this example, the exclusive voice library corresponding to the target application running in the foreground is loaded in advance, so that no time needs to be spent loading the voice library when the voice input function is executed, which improves real-time performance.
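The preloading idea can be sketched as a small cache that is filled when the foreground application changes. The class name, the callback name and the loader function are hypothetical, and loading on demand when the library was not preloaded is an assumption of this sketch rather than something the application states.

```python
class VoiceLibraryCache:
    """Keeps exclusive voice libraries in memory once they have been loaded."""

    def __init__(self, loader):
        self._loader = loader  # callable: package name -> loaded library
        self._loaded = {}

    def on_foreground_changed(self, package_name):
        """Preload the library as soon as the application enters the foreground."""
        if package_name not in self._loaded:
            self._loaded[package_name] = self._loader(package_name)

    def get(self, package_name):
        """Return the library, loading it on demand if it was not preloaded."""
        self.on_foreground_changed(package_name)
        return self._loaded[package_name]
```

With this arrangement the potentially slow loader runs once per application, at foreground time, and later voice input requests only perform a dictionary lookup.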
In a possible example, the operating system determining the voice input result of the running interface according to the text includes: the operating system determines the reference language category supported by the chat function of the running interface; if the operating system detects that the language category of the text is consistent with the reference language category, it determines that the text is the voice input result of the running interface; and if the operating system detects that the language category of the text is inconsistent with the reference language category, it translates the text into the reference language category and determines that the translated text is the voice input result of the running interface.
Here, the reference language category may be Chinese, English, Korean, Japanese and so on, which is not uniquely limited here.
It can be seen that, in this example, after determining the text corresponding to the voice, the operating system of the mobile terminal can also translate the text according to the language category supported by the current running interface, improving the flexibility and accuracy of text input.
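The consistency check and translation branch can be illustrated as follows. Everything concrete here is an assumption: the language detection is a deliberately crude heuristic (any CJK ideograph marks the text as Chinese, otherwise English), the translation table is a hypothetical stand-in for a real translation service, and returning the original text when no translation is available is a choice made for this sketch.

```python
def detect_language(text):
    """Crude heuristic: any CJK unified ideograph means Chinese, else English."""
    return "zh" if any("\u4e00" <= ch <= "\u9fff" for ch in text) else "en"

# Hypothetical phrase table standing in for a real translation service.
TRANSLATIONS = {
    ("zh", "en"): {"救我": "help me"},
    ("en", "zh"): {"help me": "救我"},
}

def voice_input_result(text, reference_language, translations=TRANSLATIONS):
    """Return the text as is, or translated into the interface's language."""
    source_language = detect_language(text)
    if source_language == reference_language:
        return text
    table = translations.get((source_language, reference_language), {})
    # Falling back to the untranslated text is an assumption of this sketch.
    return table.get(text, text)
```

For example, if the chat function of the running interface supports only English, recognized Chinese text would pass through the translation branch before being entered.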
Consistent with the embodiment shown in Fig. 2 above, refer to Fig. 3. Fig. 3 is a schematic flowchart of a voice input method provided by an embodiment of the present application. The method is applied to a mobile terminal on which an operating system and one or more application programs run. As shown in the figure, the voice input method includes:
S301, the operating system processes the collected voice into voice feature data, where a runnable interface of a target application runs in the foreground of the mobile terminal.
S302, the operating system queries the prestored mapping relations between application programs and exclusive sound banks, and determines the exclusive sound bank corresponding to the target application.
S303, the operating system determines the word corresponding to the voice according to the voice feature data and the exclusive sound bank.
S304, the operating system determines the voice input result of the runnable interface according to the word.
As can be seen, in this embodiment of the present application, the mobile terminal can finely control the voice input function at the application level. Specifically, it determines the exclusive sound bank adapted to the target application, determines the word corresponding to the voice input by the user according to that exclusive sound bank, and determines the voice input result of the runnable interface according to the word. This avoids the situation in which all applications load the same default sound bank and the individual needs of different applications cannot be met. Moreover, the exclusive sound bank of a specific application can outperform the system default sound bank in accuracy and response time, which helps improve the real-time performance and accuracy of voice input when the mobile terminal runs the target application.
In addition, for each application program, the operating system of the mobile terminal can adapt a corresponding exclusive sound bank based on the preset mapping relations between application programs and exclusive sound banks, thereby finely controlling the voice input function at the application level and improving the accuracy with which the mobile terminal controls the voice input function.
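Steps S301 to S303 can be illustrated with a minimal sketch. Everything named here is an assumption made for illustration: the application identifier, the bank names, the quantizing `extract_features` function, and the fallback to a default bank for unmapped applications are not taken from the source, and a real recognizer would use an acoustic model rather than an exact dictionary lookup.

```python
from typing import Dict, Optional, Sequence, Tuple

Features = Tuple[int, ...]
SoundBank = Dict[Features, str]

# Hypothetical sound banks: the same features map to different words.
SOUND_BANKS: Dict[str, SoundBank] = {
    "default_bank": {(1, 5): "left"},
    "navigation_bank": {(1, 5): "turn left"},
}

# Prestored mapping between application programs and exclusive sound banks.
APP_TO_BANK: Dict[str, str] = {"com.example.navigation": "navigation_bank"}


def extract_features(samples: Sequence[float]) -> Features:
    """S301 stand-in: quantize raw samples into hashable feature data."""
    return tuple(round(s * 10) for s in samples)


def get_exclusive_bank(app_id: str) -> SoundBank:
    """S302: look up the bank mapped to the foreground application.
    Falling back to a default bank is an assumption of this sketch."""
    return SOUND_BANKS[APP_TO_BANK.get(app_id, "default_bank")]


def voice_to_word(samples: Sequence[float], foreground_app: str) -> Optional[str]:
    """S303: find the word matching the feature data in the selected bank."""
    features = extract_features(samples)
    return get_exclusive_bank(foreground_app).get(features)


in_navigation = voice_to_word([0.11, 0.52], "com.example.navigation")
in_other_app = voice_to_word([0.11, 0.52], "com.example.notes")
```

The two calls use identical samples; only the foreground application differs, which is the application-level control the embodiment describes.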
Consistent with the embodiment shown in Fig. 2 above, refer to Fig. 4. Fig. 4 is a schematic flowchart of a voice input method provided by an embodiment of the present application. The method is applied to a mobile terminal on which an operating system and one or more application programs run. As shown in the figure, the voice input method includes:
S401, upon detecting that the target application is running in the foreground, the operating system loads the exclusive sound bank.
S402, the operating system processes the collected voice into voice feature data, where a runnable interface of the target application runs in the foreground of the mobile terminal.
S403, the operating system determines the exclusive sound bank set corresponding to the target application.
S404, the operating system determines the internal operation scene of the target application according to the runnable interface.
S405, the operating system queries the mapping relations between internal operation scenes and exclusive sound banks in the exclusive sound bank set, and determines that the exclusive sound bank corresponding to the internal operation scene is the exclusive sound bank corresponding to the target application.
S406, the operating system queries, according to the voice feature data, the mapping relations between voice feature data and words in the exclusive sound bank, and obtains the word that matches the voice feature data.
S407, the operating system determines that the word is the word corresponding to the voice.
S408, the operating system determines the reference language category supported by the chat function of the runnable interface.
S409, upon detecting that the language category of the word is consistent with the reference language category, the operating system determines that the word is the voice input result of the runnable interface.
S4010, upon detecting that the language category of the word is inconsistent with the reference language category, the operating system translates the word according to the reference language category and determines that the translated word is the voice input result of the runnable interface.
As can be seen, in this embodiment of the present application, the mobile terminal can finely control the voice input function at the application level. Specifically, it determines the exclusive sound bank adapted to the target application, determines the word corresponding to the voice input by the user according to that exclusive sound bank, and determines the voice input result of the runnable interface according to the word. This avoids the situation in which all applications load the same default sound bank and the individual needs of different applications cannot be met. Moreover, the exclusive sound bank of a specific application can outperform the system default sound bank in accuracy and response time, which helps improve the real-time performance and accuracy of voice input when the mobile terminal runs the target application.
In addition, for the internal operation scenes of each application program, the operating system of the mobile terminal can adapt a corresponding exclusive sound bank to each internal operation scene based on the preset mapping relations between internal operation scenes and exclusive sound banks, thereby finely controlling the voice input function at the level of internal operation scenes and improving the accuracy with which the mobile terminal controls the voice input function.
In addition, by querying the mapping relations between voice feature data and words in the exclusive sound bank, the operating system of the mobile terminal can quickly and efficiently determine the word that matches the voice feature data.
In addition, because the exclusive sound bank corresponding to the target application running in the foreground is loaded in advance, no time needs to be spent loading the sound bank when the voice input function is performed, which improves real-time performance.
In addition, after determining the word corresponding to the voice, the operating system of the mobile terminal can also translate the word according to the language category supported by the current runnable interface, which improves the flexibility and accuracy of word input.
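The preloading in S401 and the scene-level selection in S403 to S405 might be combined along the following lines. This is a sketch under assumptions, not the method of the application: the class name `SceneSoundBanks`, the scene and bank names, and the choice to preload every bank in the application's set when it enters the foreground are all hypothetical, since the source does not say which banks are loaded at that point.

```python
from typing import Callable, Dict, List, Tuple

SoundBank = Dict[Tuple[int, ...], str]


class SceneSoundBanks:
    """Keeps a per-application set of scene-specific sound banks."""

    def __init__(self, bank_sets: Dict[str, Dict[str, str]],
                 loader: Callable[[str], SoundBank]) -> None:
        self._bank_sets = bank_sets      # app id -> {scene -> bank name}
        self._loader = loader            # reads a bank from storage
        self._loaded: Dict[str, SoundBank] = {}

    def on_foreground(self, app_id: str) -> None:
        """S401: preload the banks of the app that entered the foreground
        (loading the whole set is an assumption of this sketch)."""
        for bank_name in self._bank_sets.get(app_id, {}).values():
            self._ensure_loaded(bank_name)

    def bank_for_scene(self, app_id: str, scene: str) -> SoundBank:
        """S403-S405: pick the bank mapped to the internal operation scene."""
        return self._ensure_loaded(self._bank_sets[app_id][scene])

    def _ensure_loaded(self, bank_name: str) -> SoundBank:
        if bank_name not in self._loaded:
            self._loaded[bank_name] = self._loader(bank_name)
        return self._loaded[bank_name]


load_calls: List[str] = []


def fake_loader(bank_name: str) -> SoundBank:
    """Hypothetical loader that records each (slow) load for inspection."""
    load_calls.append(bank_name)
    return {(1, 5): f"word from {bank_name}"}


banks = SceneSoundBanks(
    {"com.example.game": {"lobby": "game_lobby_bank",
                          "battle": "game_battle_bank"}},
    fake_loader,
)
banks.on_foreground("com.example.game")
word = banks.bank_for_scene("com.example.game", "battle").get((1, 5))
```

Because both banks are loaded when the application enters the foreground, the later `bank_for_scene` call triggers no additional load, which is the real-time benefit the embodiment attributes to preloading.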
Consistent with the embodiments shown in Fig. 2, Fig. 3, and Fig. 4 above, refer to Fig. 5. Fig. 5 is a schematic structural diagram of a mobile terminal provided by an embodiment of the present application. One or more application programs and an operating system run on the mobile terminal. As shown in the figure, the mobile terminal includes a processor, a memory, a communication interface, and one or more programs, where the one or more programs are different from the one or more application programs, are stored in the memory, and are configured to be executed by the processor. The programs include instructions for performing the following steps:
processing the collected voice into voice feature data, where a runnable interface of a target application runs in the foreground of the mobile terminal;
obtaining the exclusive sound bank corresponding to the target application;
determining the word corresponding to the voice according to the voice feature data and the exclusive sound bank;
determining the voice input result of the runnable interface according to the word.
As can be seen, in this embodiment of the present application, the mobile terminal can finely control the voice input function at the application level. Specifically, it determines the exclusive sound bank adapted to the target application, determines the word corresponding to the voice input by the user according to that exclusive sound bank, and determines the voice input result of the runnable interface according to the word. This avoids the situation in which all applications load the same default sound bank and the individual needs of different applications cannot be met. Moreover, the exclusive sound bank of a specific application can outperform the system default sound bank in accuracy and response time, which helps improve the real-time performance and accuracy of voice input when the mobile terminal runs the target application.
In a possible example, in terms of obtaining the exclusive sound bank corresponding to the target application, the instructions in the program are specifically used to perform the following operation: querying the prestored mapping relations between application programs and exclusive sound banks, and determining the exclusive sound bank corresponding to the target application.
In a possible example, in terms of obtaining the exclusive sound bank corresponding to the target application, the instructions in the program are specifically used to perform the following operations: determining the exclusive sound bank set corresponding to the target application; determining the internal operation scene of the target application according to the runnable interface; and querying the mapping relations between internal operation scenes and exclusive sound banks in the exclusive sound bank set, and determining that the exclusive sound bank corresponding to the internal operation scene is the exclusive sound bank corresponding to the target application.
In a possible example, in terms of determining the word corresponding to the voice according to the voice feature data and the exclusive sound bank, the instructions in the program are specifically used to perform the following operations: querying, according to the voice feature data, the mapping relations between voice feature data and words in the exclusive sound bank, and obtaining the word that matches the voice feature data; and determining that the word is the word corresponding to the voice.
In a possible example, the program further includes instructions for performing the following operation: loading the exclusive sound bank upon detecting that the target application is running in the foreground.
In a possible example, in terms of determining the voice input result of the runnable interface according to the word, the instructions in the program are specifically used to perform the following operations: determining the reference language category supported by the chat function of the runnable interface; upon detecting that the language category of the word is consistent with the reference language category, determining that the word is the voice input result of the runnable interface; and upon detecting that the language category of the word is inconsistent with the reference language category, translating the word according to the reference language category and determining that the translated word is the voice input result of the runnable interface.
The foregoing mainly describes the solution of the embodiments of the present application from the perspective of the method-side execution process. It can be understood that, to implement the above functions, the mobile terminal includes hardware structures and/or software modules corresponding to each function. Those skilled in the art should readily appreciate that, in combination with the units and algorithm steps of the examples described in the embodiments provided herein, the present application can be implemented in the form of hardware or a combination of hardware and computer software. Whether a function is performed by hardware or by computer software driving hardware depends on the specific application and design constraints of the technical solution. Skilled persons may use different methods to implement the described functions for each specific application, but such implementation should not be considered beyond the scope of the present application.
The embodiments of the present application may divide the mobile terminal into functional units according to the above method examples. For example, each functional unit may be divided corresponding to each function, or two or more functions may be integrated into one processing unit. The integrated unit may be implemented in the form of hardware or in the form of a software functional unit. It should be noted that the division of units in the embodiments of the present application is schematic and is only a division by logical function; other division manners are possible in actual implementation.
In the case of using integrated units, Fig. 6 shows a block diagram of a possible functional unit composition of the voice input device involved in the above embodiments. The voice input device 600 is applied to a mobile terminal and includes a processing unit 601, an acquiring unit 602, and a determination unit 603, where:
the processing unit 601 is configured to process the collected voice into voice feature data, where a runnable interface of a target application runs in the foreground of the mobile terminal;
the acquiring unit 602 is configured to obtain the exclusive sound bank corresponding to the target application;
the determination unit 603 is configured to determine the word corresponding to the voice according to the voice feature data and the exclusive sound bank;
the determination unit 603 is further configured to determine the voice input result of the runnable interface according to the word.
As can be seen, in this embodiment of the present application, the mobile terminal can finely control the voice input function at the application level. Specifically, it determines the exclusive sound bank adapted to the target application, determines the word corresponding to the voice input by the user according to that exclusive sound bank, and determines the voice input result of the runnable interface according to the word. This avoids the situation in which all applications load the same default sound bank and the individual needs of different applications cannot be met. Moreover, the exclusive sound bank of a specific application can outperform the system default sound bank in accuracy and response time, which helps improve the real-time performance and accuracy of voice input when the mobile terminal runs the target application.
In a possible example, in terms of obtaining the exclusive sound bank corresponding to the target application, the acquiring unit 602 is specifically configured to: query the prestored mapping relations between application programs and exclusive sound banks, and determine the exclusive sound bank corresponding to the target application.
In a possible example, in terms of obtaining the exclusive sound bank corresponding to the target application, the acquiring unit 602 is specifically configured to: determine the exclusive sound bank set corresponding to the target application; determine the internal operation scene of the target application according to the runnable interface; and query the mapping relations between internal operation scenes and exclusive sound banks in the exclusive sound bank set, and determine that the exclusive sound bank corresponding to the internal operation scene is the exclusive sound bank corresponding to the target application.
In a possible example, in terms of determining the word corresponding to the voice according to the voice feature data and the exclusive sound bank, the determination unit 603 is specifically configured to: query, according to the voice feature data, the mapping relations between voice feature data and words in the exclusive sound bank, and obtain the word that matches the voice feature data; and determine that the word is the word corresponding to the voice.
In a possible example, the voice input device further includes a loading unit. The loading unit is configured to load the exclusive sound bank upon detecting that the target application is running in the foreground.
In a possible example, in terms of determining the voice input result of the runnable interface according to the word, the determination unit 603 is specifically configured to: determine the reference language category supported by the chat function of the runnable interface; upon detecting that the language category of the word is consistent with the reference language category, determine that the word is the voice input result of the runnable interface; and upon detecting that the language category of the word is inconsistent with the reference language category, translate the word according to the reference language category and determine that the translated word is the voice input result of the runnable interface.
The processing unit 601 and the determination unit 603 may be an application processor, and the acquiring unit 602 may be an application processor and a memory.
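The division into a processing unit, an acquiring unit, and a determination unit can be pictured as three pluggable functions composed by one device object. The sketch below is only one possible software reading of that logical division; the class name `VoiceInputDevice`, its field names, and the lambdas used to wire it up are hypothetical and not taken from the source.

```python
from dataclasses import dataclass
from typing import Callable, Dict, Sequence, Tuple

Features = Tuple[int, ...]
SoundBank = Dict[Features, str]


@dataclass
class VoiceInputDevice:
    # Role of processing unit 601: voice -> voice feature data.
    process: Callable[[Sequence[float]], Features]
    # Role of acquiring unit 602: target application -> exclusive sound bank.
    acquire: Callable[[str], SoundBank]
    # Roles of determination unit 603: features -> word, word -> input result.
    determine_word: Callable[[Features, SoundBank], str]
    determine_result: Callable[[str], str]

    def handle(self, samples: Sequence[float], foreground_app: str) -> str:
        """Run the four steps in the order given for the device."""
        features = self.process(samples)
        bank = self.acquire(foreground_app)
        word = self.determine_word(features, bank)
        return self.determine_result(word)


# Hypothetical wiring with trivial stand-in units.
device = VoiceInputDevice(
    process=lambda samples: tuple(round(s * 10) for s in samples),
    acquire=lambda app_id: {(1, 5): "turn left"},
    determine_word=lambda features, bank: bank.get(features, ""),
    determine_result=lambda word: word,
)
result = device.handle([0.11, 0.52], "com.example.navigation")
```

Keeping each unit behind a plain callable mirrors the statement that the unit division is logical only, so units can be merged or replaced without changing the overall flow.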
An embodiment of the present application further provides a computer storage medium, where the computer storage medium stores a computer program for electronic data interchange, and the computer program causes a computer to perform some or all of the steps of any method described in the above method embodiments. The computer includes a mobile terminal.
An embodiment of the present application further provides a computer program product. The computer program product includes a non-transitory computer-readable storage medium storing a computer program, and the computer program is operable to cause a computer to perform some or all of the steps of any method described in the above method embodiments. The computer program product may be a software installation package, and the computer includes a mobile terminal.
It should be noted that, for brevity, each of the foregoing method embodiments is described as a series of combined actions. However, those skilled in the art should know that the present application is not limited by the described order of actions, because according to the present application some steps may be performed in other orders or simultaneously. In addition, those skilled in the art should also know that the embodiments described in this specification are all preferred embodiments, and the actions and modules involved are not necessarily required by the present application.
In the above embodiments, the description of each embodiment has its own emphasis. For a part not detailed in one embodiment, refer to the related descriptions of other embodiments.
In the several embodiments provided in the present application, it should be understood that the disclosed device may be implemented in other ways. For example, the device embodiments described above are merely illustrative. The division of the units is only a division by logical function, and other division manners are possible in actual implementation; for example, multiple units or components may be combined or integrated into another system, or some features may be ignored or not performed. In addition, the mutual coupling, direct coupling, or communication connection shown or discussed may be indirect coupling or communication connection through some interfaces, devices, or units, and may be electrical or in other forms.
The units described as separate components may or may not be physically separate, and the components shown as units may or may not be physical units; that is, they may be located in one place or distributed over multiple network units. Some or all of the units may be selected according to actual needs to achieve the purpose of the solution of this embodiment.
In addition, the functional units in the embodiments of the present application may be integrated into one processing unit, each unit may exist physically on its own, or two or more units may be integrated into one unit. The integrated unit may be implemented in the form of hardware or in the form of a software functional unit.
If the integrated unit is implemented in the form of a software functional unit and sold or used as an independent product, it may be stored in a computer-readable memory. Based on this understanding, the technical solution of the present application in essence, or the part that contributes to the prior art, or all or part of the technical solution, may be embodied in the form of a software product. The computer software product is stored in a memory and includes several instructions that cause a computer device (which may be a personal computer, a server, a network device, or the like) to perform all or some of the steps of the methods in the embodiments of the present application. The aforementioned memory includes various media that can store program code, such as a USB flash drive, a read-only memory (ROM), a random access memory (RAM), a removable hard disk, a magnetic disk, or an optical disc.
Those of ordinary skill in the art can understand that all or some of the steps in the methods of the above embodiments may be completed by a program instructing related hardware. The program may be stored in a computer-readable memory, and the memory may include a flash disk, a read-only memory (ROM), a random access memory (RAM), a magnetic disk, an optical disc, or the like.
The embodiments of the present application are described in detail above. Specific examples are used herein to explain the principles and implementations of the present application, and the description of the above embodiments is only intended to help understand the method of the present application and its core idea. Meanwhile, for those of ordinary skill in the art, there will be changes in the specific implementations and scope of application according to the idea of the present application. In summary, the content of this specification should not be construed as a limitation on the present application.

Claims (10)

1. A voice input method, characterized in that the method is applied to a mobile terminal on which an operating system and one or more application programs run, the method comprising:
the operating system processing collected voice into voice feature data, wherein a runnable interface of a target application runs in the foreground of the mobile terminal;
the operating system obtaining an exclusive sound bank corresponding to the target application;
the operating system determining a word corresponding to the voice according to the voice feature data and the exclusive sound bank;
the operating system determining a voice input result of the runnable interface according to the word.
2. The method according to claim 1, characterized in that the operating system obtaining the exclusive sound bank corresponding to the target application comprises:
the operating system querying prestored mapping relations between application programs and exclusive sound banks, and determining the exclusive sound bank corresponding to the target application.
3. The method according to claim 1, characterized in that the operating system obtaining the exclusive sound bank corresponding to the target application comprises:
the operating system determining an exclusive sound bank set corresponding to the target application;
the operating system determining an internal operation scene of the target application according to the runnable interface;
the operating system querying mapping relations between internal operation scenes and exclusive sound banks in the exclusive sound bank set, and determining that the exclusive sound bank corresponding to the internal operation scene is the exclusive sound bank corresponding to the target application.
4. The method according to any one of claims 1-3, characterized in that the operating system determining the word corresponding to the voice according to the voice feature data and the exclusive sound bank comprises:
the operating system querying, according to the voice feature data, mapping relations between voice feature data and words in the exclusive sound bank, and obtaining a word that matches the voice feature data;
the operating system determining that the word is the word corresponding to the voice.
5. The method according to any one of claims 1-4, characterized in that the method further comprises:
the operating system loading the exclusive sound bank upon detecting that the target application is running in the foreground.
6. The method according to any one of claims 1-5, characterized in that the operating system determining the voice input result of the runnable interface according to the word comprises:
the operating system determining a reference language category supported by a chat function of the runnable interface;
the operating system, upon detecting that the language category of the word is consistent with the reference language category, determining that the word is the voice input result of the runnable interface;
the operating system, upon detecting that the language category of the word is inconsistent with the reference language category, translating the word according to the reference language category and determining that the translated word is the voice input result of the runnable interface.
7. A voice input device, characterized in that the device is applied to a mobile terminal on which an operating system and one or more application programs run, the voice input device comprising a processing unit, an acquiring unit, and a determination unit, wherein:
the processing unit is configured to process collected voice into voice feature data, wherein a runnable interface of a target application runs in the foreground of the mobile terminal;
the acquiring unit is configured to obtain an exclusive sound bank corresponding to the target application;
the determination unit is configured to determine a word corresponding to the voice according to the voice feature data and the exclusive sound bank;
the determination unit is further configured to determine a voice input result of the runnable interface according to the word.
8. The voice input device according to claim 7, characterized in that, in terms of obtaining the exclusive sound bank corresponding to the target application, the acquiring unit is specifically configured to: query prestored mapping relations between application programs and exclusive sound banks, and determine the exclusive sound bank corresponding to the target application.
9. A mobile terminal, characterized by comprising a processor, a memory, a communication interface, and one or more programs, wherein the one or more programs are stored in the memory and configured to be executed by the processor, and the programs comprise instructions for performing the steps in the method according to any one of claims 1-6.
10. A computer-readable storage medium, characterized in that the computer-readable storage medium stores a computer program for electronic data interchange, wherein the computer program causes a computer to perform the method according to any one of claims 1-6, and the computer includes a mobile terminal.
CN201711461498.1A 2017-12-28 2017-12-28 Pronunciation inputting method and Related product Pending CN108196814A (en)

Priority Applications (1)

Application Number Priority Date Filing Date Title
CN201711461498.1A CN108196814A (en) 2017-12-28 2017-12-28 Pronunciation inputting method and Related product

Publications (1)

Publication Number Publication Date
CN108196814A true CN108196814A (en) 2018-06-22

Family

ID=62585696

Family Applications (1)

Application Number Title Priority Date Filing Date
CN201711461498.1A Pending CN108196814A (en) 2017-12-28 2017-12-28 Pronunciation inputting method and Related product

Country Status (1)

Country Link
CN (1) CN108196814A (en)

Similar Documents

Publication Publication Date Title
CN108196814A (en) Pronunciation inputting method and Related product
CN107861814A (en) Resource allocation method and equipment
CN107426432B (en) Resource allocation method and Related product
CN108091333A (en) Sound control method and Related product
CN108242837A (en) Electronic device and method for controlling charging of the electronic device
CN108037999A (en) Resource allocation method and Related product
US11274932B2 (en) Navigation method, navigation device, and storage medium
CN108536480A (en) Input method configuration method and related product
CN107635078A (en) Game control method and equipment
CN107797868A (en) Resource adjusting method and device
CN109379247A (en) Method and device for detecting the network delay of an application program
CN103763112B (en) User identity protection method and apparatus
CN111050370A (en) Network switching method and device, storage medium and electronic equipment
CN107807852A (en) Application program capacity control method and equipment
CN107786738B (en) Network control method and equipment
CN107995357A (en) Resource allocation method and device
CN107861603A (en) Power consumption control method and equipment
CN108549568A (en) Application entry processing method, apparatus, storage medium and electronic equipment
CN109254793A (en) Engine partition method, relevant device and computer readable storage medium
CN108369479A (en) Target selection on small form factor display
CN107993672A (en) Frequency expansion method and device
CN103959199A (en) Power saving method and apparatus for first in first out (FIFO) memories
CN109753425A (en) Pop-up processing method and processing device
CN109348467A (en) Emergency call realization method, electronic device and computer readable storage medium
CN107832148A (en) Performance optimization method and equipment

Legal Events

Date Code Title Description
PB01 Publication
SE01 Entry into force of request for substantive examination
RJ01 Rejection of invention patent application after publication

Application publication date: 20180622