CN105810194A

CN105810194A - Voice control information acquisition method under standby state and intelligent terminal

Info

Publication number: CN105810194A
Application number: CN201610312120.4A
Authority: CN
Inventors: 吴伟兵
Original assignee: Beijing Qihoo Technology Co Ltd; Qiku Internet Technology Shenzhen Co Ltd
Current assignee: Beijing Qihoo Technology Co Ltd; Qiku Internet Technology Shenzhen Co Ltd
Priority date: 2016-05-11
Filing date: 2016-05-11
Publication date: 2016-07-27
Anticipated expiration: 2036-05-11
Also published as: CN105810194B

Abstract

The invention discloses a voice control information acquisition method under a standby state and an intelligent terminal. The method comprises the following steps: in accordance with sampling frequencies in different time periods specified by a voice collection scheme, collecting voice data; conducting characteristic matching on the voice data in accordance with preset voice command configuration data, and judging whether the voice data is a data command or not; in response to the voice command corresponding to the voice data, acquiring content information of a type corresponding to the voice command; and in accordance with the content information, associating with voice broadcast of the content information. Through processing of the voice control information acquisition method disclosed by the invention, a user can acquire information through personalized voice control when the intelligent terminal is under the standby state, so that energy consumption is reduced, and user-machine interactive experience in voice control and life and work efficiency of the user are improved.

Description

Speech-controlled information acquisition methods and intelligent terminal under holding state

Technical field

The present invention relates to voice control technology field, specifically, the present invention relates to a kind of holding state Lower speech-controlled information acquisition methods and intelligent terminal.

Background technology

Along with speech recognition technology and the development of intelligent control technology and popularizing of intelligent terminal, Yi Jiren To the variation of man-machine interaction experience and personal needs, and promote the need of live and work efficiency , voice control technology achieves significant progress.Speech recognition technology, typically uses statistical model Matching technique realizes, and is the basic composition of voice control technology.Voice control technology, i.e. uses language Sound controls the operation of equipment, more quick and easy for Non-follow control, can be used in all Fields as many in Industry Control, voice dialing system, intelligent appliance, acoustic control intelligent toy etc..Voice Control technology is brought, and is not only the change of man-machine interaction mode and experience, it is often more important that band next life The change of the mode of living and the lifting of the work productivity.Voice command is originally taking brain, laborious, time-consuming Conventional machines operation has become a simple interesting thing, and the maturation of correlation technique has also driven a series of Brand-new intelligent terminal occurs, the work making people is more convenient with life, its range of application and prospect The most wide.

In the prior art, the Voice command of intelligent terminal needs user manually to make intelligent terminal be in call out The state of waking up, then carry out voice-operated man-machine interaction.The Voice command of intelligent terminal needs user first to beat Open related application or user pins certain function button, just can carry out speech recognition, it is achieved voice control System.It addition, the format and content that intelligent terminal needs user to provide according to intelligent terminal carries out voice number According to input, speech data could be identified, and then realize voice-operated man-machine interaction.

But, first prior art at least there is problems in that, user needs to make at intelligent terminal In wake-up states, manual unlocking related application or pin certain function button, just can carry out voice control System.This result in when user be in busy in, have no time manual operation intelligent terminal and terminal is in standby Time under state, it is impossible to terminal is carried out voice-operated man-machine interaction；Or have to suspend other things Business come manually opened voice control function, reduce user carry out voice-operated man-machine interaction experience and Life and work efficiency.Secondly, intelligent terminal can only identify mark that user provides according to intelligent terminal with Content carries out the speech data inputted, and this reduces voice-operated motility and life and work effect Rate, it is difficult to meet the individualized experience demand that user is growing.

Summary of the invention

Present invention aims to present at least one aspect not enough, it is provided that a kind of standby shape Speech-controlled information acquisition methods and intelligent terminal under state, make user intelligent terminal in the standby state Just intelligent terminal can be carried out Voice command, and user can be with the mark of self-defined sound instruction and content Thus realize the Voice command of personalization, improve live and work efficiency.

In order to realize this purpose, the present invention adopts the following technical scheme that:

First aspect, provides speech-controlled information under a kind of holding state and obtains in the embodiment of the present invention Method, it is characterised in that comprise the steps:

According to the sample frequency under the different time sections that voice collecting scheme specifies, gather speech data；

Phonetic order configuration data according to pre-seting carry out characteristic matching to described speech data, it is judged that Whether described speech data is phonetic order；

In response to the phonetic order that described speech data is corresponding, obtain type corresponding to this phonetic order Content information；

It is associated the voice broadcast in this content information according to described content information.

In conjunction with first aspect, the present invention, in the first implementation of first aspect, also includes as follows Previous step:

User interface is shown under open state, for editor's phonetic order configuration data and the completeest Become pre-seting of these configuration data.

In conjunction with the first implementation of first aspect or first aspect, the present invention is the of first aspect In two kinds of implementations, described phonetic order configuration data include mark and the content of phonetic order, with And the information type corresponding to described content.

In conjunction with the first implementation of first aspect or first aspect, the present invention is the of first aspect In three kinds of implementations, described in pre-set by user by voice or the typing or selected and complete of word Become.

In conjunction with first aspect, the present invention is in the 4th kind of implementation of first aspect, and described voice is adopted Collection scheme defines intelligent terminal under each time period and gathers the sample frequency of voice, described voice collecting The formulation process of scheme comprises the steps:

Annual distribution based on user Yu the interbehavior of intelligent terminal calculates user at different time Mutual index under Duan；

The threshold range pre-seted met according to the mutual index of each described time period respectively, rule Being scheduled on each time period uses sample frequency corresponding with described threshold range, and completes voice accordingly and adopt The formulation of collection scheme.

In conjunction with the 4th kind of implementation of first aspect, the present invention is in the 5th kind of realization side of first aspect In formula, described interbehavior includes: user's manipulation behavior under intelligent terminal's wake-up states, intelligence The displacement of energy terminal and/or rotation, user's Voice command behavior under intelligent terminal is standby.

In conjunction with the 4th kind of implementation of first aspect, the present invention is in the 6th kind of realization side of first aspect In formula, the formulation process of described voice collecting scheme also includes, shows user interface under open state, For the sample frequency arranged corresponding to described threshold range and each threshold range, described to complete Pre-seting of threshold range.

In conjunction with the 4th kind of implementation of first aspect, the present invention is in the 7th kind of realization side of first aspect In formula, described intelligent terminal controls the collection of described speech data by dsp processor.

In conjunction with first aspect, the present invention in the 8th kind of implementation of first aspect, described feature The process joined includes:

Retrieve the mark whether contained in described speech data in phonetic order configuration data, if not containing institute State mark, then the instruction of this speech data non-voice, terminate retrieval；

If containing described mark, then retrieve and whether described speech data contains phonetic order configuration data In content, if containing described content, then this speech data is phonetic order, determines this speech data The information type corresponding to content comprised；If not containing described content, then this speech data non-voice refers to Order.

In conjunction with the 8th kind of implementation of first aspect, the present invention is in the 9th kind of realization side of first aspect In formula, during described characteristic matching, when described speech data configures data with described phonetic order Mark or content matching rate more than pre-set threshold value time, then judge speech data contain described mark Know or content；Otherwise, it is determined that speech data does not contains described mark or content.

In conjunction with first aspect, the present invention is in the tenth kind of implementation of first aspect, and described content is believed The acquisition methods of breath includes: by calling corresponding system command, by obtaining described class in intelligent terminal The content information of type.

In conjunction with first aspect, the present invention in the 11st kind of implementation of first aspect, described content The acquisition methods of information also includes: by calling corresponding interface or communication protocol, by corresponding software Or one or more in webpage or server obtain the content information of described types.

Second aspect, embodiments provides a kind of intelligent terminal, and this intelligent terminal has realization The function of one-card multi-number method for sending information in above-mentioned first aspect.Described function can be real by hardware Existing, it is also possible to perform corresponding software by hardware and realize.Described hardware or software include one or many The individual module corresponding with above-mentioned functions.

In a possible design, the structure of intelligent terminal includes:

Pickup unit: the sample frequency under the different time sections specified according to voice collecting scheme, Gather speech data；

Recognition unit: be configured to according to the phonetic order configuration data pre-seted described speech data Carry out characteristic matching, it is judged that whether described speech data is phonetic order；

Acquiring unit: for the phonetic order corresponding in response to described speech data, obtain this voice and refer to The content information of the corresponding type of order；

Report unit: be associated the voice broadcast in this content information according to described content information.

In conjunction with second aspect, the present invention, in the first implementation of second aspect, also includes presetting Unit:

It is configured under open state show user interface, for editor's phonetic order configuration data, And complete pre-seting of these configuration data accordingly.

In conjunction with the first implementation of second aspect, the present invention is in the second realization side of second aspect In formula, described default unit is configured to: phonetic order configuration data include phonetic order mark and Content, and the information type corresponding to described content.

In conjunction with the first implementation of second aspect, the present invention is in the third realization side of second aspect In formula, described default unit is configured to: described in pre-set by user by voice or the typing of word Or select and complete.

In conjunction with second aspect, the present invention in the 4th kind of implementation of second aspect, described pickup list In unit, described voice collecting scheme defines intelligent terminal under each time period and gathers the sampling frequency of voice Rate, the formulation process of described voice collecting scheme comprises the steps:

In conjunction with the third implementation of second aspect, the present invention is in the 5th kind of realization side of second aspect In formula, in described pickup unit, described interbehavior includes: user is at intelligent terminal's wake-up states Under manipulation behavior, the displacement of intelligent terminal and/or rotation, user's language under intelligent terminal is standby Sound controlling behavior.

In conjunction with the first implementation of second aspect, the present invention is in the 6th kind of realization side of second aspect In formula, described default unit is also configured under open state show user interface, for setting Sample frequency corresponding to described threshold range and each threshold range, to complete described threshold range Pre-set.

In conjunction with the third implementation of second aspect, the present invention is in the 7th kind of realization side of second aspect In formula, described pickup unit controls the collection of described speech data by dsp processor.

In conjunction with second aspect, the present invention is in the 8th kind of implementation of second aspect, and described identification is single In unit, the process of described characteristic matching includes:

In conjunction with the 6th kind of implementation of second aspect, the present invention is in the 9th kind of realization side of second aspect In formula, in described recognition unit, during described characteristic matching, when described speech data is with described When the mark of phonetic order configuration data or the matching rate of content are more than the threshold value pre-seted, then judge language Sound data contain described mark or content；Otherwise, it is determined that speech data does not contains described mark or content.

In conjunction with second aspect, the present invention is in the tenth kind of implementation of second aspect, and described acquisition is single Unit is configured to: by calling corresponding system command, by obtaining in intelligent terminal in described type Appearance information.

In conjunction with second aspect, the present invention in the 11st kind of implementation of second aspect, described acquisition Unit is configured to: by calling corresponding interface or communication protocol, by corresponding software or webpage or One or more in server obtain the content information of described type.

The third aspect, embodiments provides a kind of intelligent terminal, comprising:

Touch-sensitive display, is used for showing information editing interface, it is achieved man-machine interaction；

One or more processors；

Memorizer；

One or more application programs, wherein said one or more application programs are stored in memorizer In and be configured to be performed by the one or more processor；

The one or more program is used for driving the one or more processor to be configured to perform The unit of speech-controlled information acquisition methods under holding state in above-mentioned first aspect.

Compared with prior art, the technical scheme that the present invention provides at least has the advantage that

The present invention makes full use of intelligent terminal and the characteristic of operating system offer thereof, first adopts according to voice Sample frequency under the different time sections that collection scheme specifies, gathers speech data so that intelligent terminal exists Speech data can be gathered under holding state, carry out speech recognition, and, voice collecting side therein Case is according to the interbehavior of user Yu intelligent terminal, by specifying voice collecting frequency intelligently and effective Control energy consumption, improve voice-operated efficiency.On this basis, the present invention is further according to presetting The phonetic order configuration data put carry out characteristic matching to described speech data, it is judged that described speech data Whether is phonetic order, so, user just can use variation and personalized voice to realize voice control System.And then, intelligent terminal, in response to phonetic order corresponding to described speech data, obtains this voice and refers to The content information of the corresponding type of order, and be associated in this content information according to described content information Voice broadcast, brings user conveniently efficient speech-controlled information acquisition experience.

Generally, the enforcement of the present invention, solve user and use intelligent terminal to carry out Voice command During acquisition of information, the most voice-operated problem of implementation of intelligent terminal, energy consumption control problem, And intelligent terminal cannot be carried out the problem that personalized speech controls so that user can be intelligent terminal Carry out the Voice command of personalization when being in holding state and obtain information, improve user and carry out voice The man-machine interaction experience controlled and life and work efficiency.But, I have much more to say than I can write in this letter, the side that the present invention adds Face and advantage will part be given in the following description, and these will become apparent from the description below, Or recognized by the practice of the present invention.

Accompanying drawing explanation

For the technical scheme being illustrated more clearly that in the embodiment of the present invention, in embodiment being described below The required accompanying drawing used is briefly described, it should be apparent that, the accompanying drawing in describing below is only this Some embodiments of invention, for those skilled in the art, in the premise not paying creative work Under, it is also possible to other accompanying drawing is obtained according to these accompanying drawings.

Fig. 1 is the stream of an embodiment of speech-controlled information acquisition methods under holding state in the present invention Journey schematic diagram；

Fig. 2 is the stream of an embodiment of speech-controlled information acquisition methods under holding state in the present invention Journey schematic diagram；

Fig. 3 is the structural representation of an embodiment of intelligent terminal in the present invention；

Fig. 4 is the structural representation of an embodiment of intelligent terminal in the present invention.

Detailed description of the invention

In order to make those skilled in the art be more fully understood that the present invention program, real below in conjunction with the present invention Execute the accompanying drawing in example, the technical scheme in the embodiment of the present invention is clearly and completely described.

In some flow processs of description in description and claims of this specification and above-mentioned accompanying drawing, bag The multiple operations occurred according to particular order are contained, but it should be clearly understood that these operations can not be pressed Perform or executed in parallel according to its order occurred in this article, the sequence number of operation such as S10, S11 etc., only Being only for distinguishing each different operation, sequence number itself does not represent any execution sequence.It addition, These flow processs can include more or less of operation, and these operations can perform in order or parallel Perform.It should be noted that " first ", " second " herein etc. describe, it is for distinguishing difference Message, equipment, module etc., do not represent sequencing, it is different for not limiting " first " and " second " Type.

It will appreciated by the skilled person that unless expressly stated, singulative used herein " one ", " one ", " described " and " being somebody's turn to do " may also comprise plural form.It is to be further understood that The wording used in the description of the present invention " includes " referring to there is described feature, integer, step, behaviour Make, element and/or assembly, but it is not excluded that existence or add other features one or more, whole Number, step, operation, element, assembly and/or their group.It should be understood that when we claim element Being " connected " or during " coupled " to another element, it can be directly connected or coupled to other elements, or Intermediary element can also be there is in person.Additionally, " connection " used herein or " coupling " can include wireless Connect or wireless couple.Wording "and/or" used herein includes that what one or more was associated lists Whole or any cell of item and all combinations.

It will appreciated by the skilled person that unless otherwise defined, all terms used herein (including technical term and scientific terminology), have and the those of ordinary skill in art of the present invention Be commonly understood by identical meaning.Should also be understood that those arts defined in such as general dictionary Language, it should be understood that there is the meaning consistent with the meaning in the context of prior art, and remove Non-as here by specific definitions, otherwise will not with idealization or the most formal implication explain.

Both it will appreciated by the skilled person that " terminal " used herein above, " intelligent terminal " Including the equipment of wireless signal receiver, it only possesses the setting of wireless signal receiver of non-emissive ability Standby, include again the equipment receiving and launching hardware, it has and can carry out on bidirectional communication link The reception of two-way communication and the equipment of transmitting hardware.This equipment may include that honeycomb or other communication Equipment, its have single line display or multi-line display or the honeycomb not having multi-line display or Other communication equipments；PCS (Personal Communications Service, PCS Personal Communications System), It can process with combine voice, data, fax and/or its communication ability；PDA(Personal Digital Assistant, personal digital assistant), it can include radio frequency receiver, pager, mutually The access of networking/Intranet, web browser, notepad, calendar and/or GPS (Global Positioning System, global positioning system) receptor；Conventional laptop and/or palmtop computer or other set Standby, its have and/or include the conventional laptop of radio frequency receiver and/or palmtop computer or other Equipment." terminal " used herein above, " intelligent terminal " can be portable, can transport, be arranged on In the vehicles (aviation, sea-freight and/or land), or it is suitable for and/or is configured in this locality Run, and/or with distribution form, any other position operating in the earth and/or space is run.This In " terminal ", " intelligent terminal " that used can also is that intelligent terminal, access terminals, music/video Playback terminal, such as, can be PDA, POS (Point of Sales, point-of-sale terminal), MID (Mobile Internet Device, mobile internet device) and/or there is the mobile electricity of music/video playing function Words, it is also possible to be the equipment such as intelligent television, Set Top Box.

It will appreciated by the skilled person that the implication of noun involved in the present invention at least includes:

Speech recognition: also referred to as automatic speech knows (Automatic Speech Recognition, letter Claim ASR), its target is to be computer-readable input by the vocabulary Content Transformation in the voice of the mankind, Such as button, binary coding or character string.Including particular person speech recognition system, nonspecific The identification system of people's speech recognition system and many people, its method is mainly pattern matching method, involved Field includes: signal processing, pattern recognition, theory of probability and theory of information, sound generating mechanism and hearing mechanism, Artificial intelligence etc..

Voice command: support the speech data input of natural language, by speech recognition, and control to set Standby operation, in intelligent terminal, Voice command can call the information such as the time in terminal, weather, For Non-follow control more quick and easy.Voice command be applied to such as Industry Control, The fields such as voice dialing system, intelligent appliance, acoustic control intelligent toy.

User interface (User Interface, be called for short UI): be interact between system and user and The medium of information exchange, it realizes the inner form of information and the mankind can be to accept turning between form Change.User interface is to design link up related software the most alternately between user and hardware, purpose Allowing users to go to rate to operate hardware easily and effectively two-way mutual to reach, completing desired The work completed by hardware, user interface definition is extensive, contains man-machine interaction and connects with graphical user Mouthful, all mankind of participation also exist user interface, such as with the field of the communication for information of machinery: preset Put data edition interface.

Holding state: refer to that the start of intelligent terminal's (such as electronic equipment such as mobile phone or computer) is at start shape Under state, but do not produce any interbehavior with user, or do not carry out any substantive work (i.e. Not to file and the various operations of program) state, or refer to that the screen state of putting out that intelligent terminal is in (is opened Machine but screen extinguish), in the present invention, the holding state of intelligent terminal includes the dormancy/sleep shape of mobile phone State.In a kind of embodiment of android system, adjustable Use #echo mem >/sys/power/state make system enter resting state.In the standby state, Intelligent terminal will have longer cruising time.

Wake-up states: refer to that the start of intelligent terminal's (such as electronic equipment such as mobile phone or computer) is at start shape Under state, create interbehavior, or the state that screen lights with user.In android system one In kind of embodiment, #echo on can be called >/sys/power/state order makes intelligent terminal from standby State wakes up up, enters wake-up states, additionally, also include waking up mechanism as follows up: Wake_Lock calls out Awake lock mechanism；The pre-suspending mechanism of Early_Suspend；Late_Resume wakes up mechanism late up.

Interbehavior: refer to the information bidirectional transmission between user and intelligent terminal and feedback behavior, user By voice, displacement and/or rotation, word inputs, in modes such as interactive interface operations, to intelligence Terminal input message, operate；Intelligent terminal passes through the modes such as voice, image, video, word Provide a user with information.Such as: user's manipulation behavior under intelligent terminal's wake-up states, intelligence is eventually The displacement of end and/or rotation, user's Voice command behavior under intelligent terminal is standby.

The method of the invention is primarily adapted for use in intelligent mobile phone terminal or Intelligent flat terminal etc. and has logical The terminal of telecommunication function, is not restricted to the type of its operating system, can be Android, IOS, WP, The operating systems such as Saipan.

Below in conjunction with the accompanying drawing in the embodiment of the present invention, the technical scheme in the embodiment of the present invention is entered Row clearly and completely describes, and the most same or similar label represents same or similar Element or there is the element of same or like function.Obviously, described embodiment is only the present invention A part of embodiment rather than whole embodiments.Based on the embodiment in the present invention, this area skill The every other embodiment that art personnel are obtained under not making creative work premise, broadly falls into this The scope of invention protection.

As it is shown in figure 1, speech-controlled information acquisition methods under a kind of holding state of the present invention, including Following steps S11-S14:

Sample frequency under S11, the different time sections specified according to voice collecting scheme, gathers voice Data.To efficiently control energy consumption, extend intelligent terminal's stand-by time when gathering voice.

The phonetic order configuration data that S12, basis pre-set carry out characteristic matching to described speech data, Judge whether described speech data is phonetic order.So that intelligent terminal quickly knows from speech data The phonetic order of other user, to realize Voice command.

S13, in response to phonetic order corresponding to described speech data, obtain corresponding to this phonetic order The content information of type.Intelligent terminal is made to perform phonetic order and obtain content information.

S14, it is associated the voice broadcast in this content information according to described content information.Make user Obtain information needed, complete Voice command.

Wherein:

Sample frequency under S11, the different time sections specified according to voice collecting scheme, gathers voice Data.

In a kind of possible embodiment of the present invention, described voice collecting scheme defines each time The lower intelligent terminal of section gathers the sample frequency of voice, the formulation process of described voice collecting scheme include as Lower step:

First, user is calculated different based on user from the Annual distribution of the interbehavior of intelligent terminal Mutual index under time period；

Secondly, the threshold value model pre-seted met according to the mutual index of each described time period respectively Enclose, it is stipulated that use sample frequency corresponding with described threshold range in each time period, and complete accordingly The formulation of voice collecting scheme.

In a kind of embodiment, described interbehavior includes: user is under intelligent terminal's wake-up states Manipulation behavior, the displacement of intelligent terminal and/or rotation, user's voice control under intelligent terminal is standby Behavior processed.

In a kind of possible design, intelligent terminal gathers once (such as 5 minutes) at set intervals The data of the various interbehaviors of user, to show that user divided with the time of the interbehavior of intelligent terminal Cloth, and then calculate the user's mutual index under different time sections.

Described calculating comprises one or more the independent or model of association or algorithms, such as association rule mining Algorithm etc..

In a kind of possible algorithm wherein, if interbehavior is entered as 1, do not occur, compose Value is 0；The weight of user's manipulation behavior (being called for short " manipulation ") under intelligent terminal's wake-up states Being 0.3, the weight of the displacement of intelligent terminal and/or rotation (being called for short " displacement ") is 0.3, and user is in intelligence The weight of the Voice command behavior (being called for short " sound control ") under energy terminal standby is 1.Mutual index Computing formula is:

Mutual index=manipulation * 0.3+ displacement * 0.3+ sound control * 1；

As user does not manipulates in the current collection moment under intelligent terminal's wake-up states, and move Having moved intelligent terminal, and carried out Voice command under intelligent terminal is standby, the most current collection moment is used The mutual index at family is 0*0.3+1*0.3+1*1=1.3.The calculating of the mutual index in other moment is with this type of Push away.The time span of each time period is arranged voluntarily by user or is pre-seted by intelligent terminal, when one Between the mutual index of section be the meansigma methods of each mutual index gathering the moment in this time period.And then can Draw the user's mutual index under different time sections.

In a kind of possible embodiment, under the time period of 8:00-8:30, each gathers the moment to user Mutual index as shown in the table:

Then in the time period of 8:00-8:30, mutual index is 0.73.

Certainly, other interbehavior can also be used in other embodiments of the invention as variable Calculate mutual index.

In a kind of possible embodiment, voice collecting scheme is by intelligent terminal's different time under holding state The sample frequency of the voice collecting of section is defined as:

When the mutual index of a time period is less than 0.5, then use 22.05KHz in this time period Sample frequency；

When the mutual index of a time period is more than or equal to 0.5 and is less than 1.2, then in this time period Use the sample frequency of 44.1KHz；

When the mutual index of a time period is more than or equal to 1.2, then use in this time period The sample frequency of 48KHz.

Thus, in a kind of possible embodiment, voice collecting scheme is as shown in the table:

Time period	Mutual index	Sample frequency
			08:00-08:30	0.73	44.1KHz
08:31-08:35	0.35	22.05KHz
			08:36-10:15	1.47	48KHz
10:16-10:45	0.94	44.1KHz
			10:46-13:00	1.53	48KHz
13:01-13:50	0.25	22.05KHz
			13:51-18:27	1.19	44.1KHz
18:28-22:00	1.34	48KHz
			22:00-23:50	0.61	44.1KHz
23:51-07:59	0.08	22.05KHz

Certainly, in some possible embodiments of the present invention, described threshold range or its adopt accordingly Sample frequency is also dependent on the dump energy of intelligent terminal, or combines user according to the positional information of user In the service condition of the voice control function of this position, it is adjusted correspondingly, to control intelligent terminal Energy consumption when realizing voice control function, extends the stand-by time of intelligent terminal.

In a possible embodiment, the formulation process of described voice collecting scheme also includes, start User interface is shown, for arranging corresponding to described threshold range and each threshold range under state Sample frequency, to complete pre-seting of described threshold range.

Sample frequency corresponding to each threshold range of described threshold range arranged voluntarily by user or by Intelligent terminal pre-sets.Intelligent terminal is met according to the mutual index of each described time period respectively The threshold range pre-seted, it is stipulated that use sample frequency corresponding with this threshold range in this time period.

In a possible embodiment of the present invention, described intelligent terminal passes through dsp processor control Make the collection of described speech data.

Digital Signal Processing (Digital Signal Processing is called for short DSP) processor, is one Plant the hardware between fixing functional hardware and high flexibility ratio CPU, include sound for efficiently processing Frequency and voice application, image procossing, the task such as video input, belong to the portion of CPU before having born The division of labor is made, and can effectively reduce intelligent terminal's energy consumption.The most valiant dragon (Snapdragon) 820 processor The Hexagon DSP (low-power consumption island) of middle employing.Described voice number is controlled by dsp processor According to collection, improve intelligent terminal under holding state and carry out voice-operated efficiency, effectively reduce energy consumption.

Voice collecting scheme foundation user's interbehavior in different time sections and between intelligent terminal, In the voice collecting of different time sections, specify corresponding sample frequency, to efficiently control energy consumption, carry Under high holding state, the voice collecting usefulness of intelligent terminal, extends stand-by time.

The phonetic order configuration data that S12, basis pre-set carry out characteristic matching to described speech data, Judge whether described speech data is phonetic order.

In the possible embodiment of one of the present invention, the process of described characteristic matching includes:

The mark of phonetic order is first identified by intelligent terminal at holding state, after identifying mark Carry out next step identification again, while improving recognition efficiency, also reduce energy consumption.

In one embodiment, during described characteristic matching, when described speech data and institute's predicate When the mark of sound instruction configuration data or the matching rate of content are more than the threshold value pre-seted, then judge voice Data contain described mark or content；Otherwise, it is determined that speech data does not contains described mark or content.

Speech data and the described phonetic order configuration mark of data or mating of content, can use base In phonology and the method for acoustics, the method for pattern match or the method for neutral net.Such as pattern match Method in dynamic time warping (DTW), hidden Markov (HMM) is theoretical, vector quantization (VQ) technology etc..

Markov model (Markov Model) is a kind of statistical model, is widely used in voice and knows Not, automatic part-of-speech tagging, Syllable text conversion, the applications such as each natural language processing such as probabilistic grammar. Through long-run development, the application being particluarly suitable in speech recognition.

By in above example phonetic order data pre-set the method with characteristic matching, intelligence can be made Energy terminal quickly identifies the phonetic order of user from speech data, to realize Voice command.

S13, in response to phonetic order corresponding to described speech data, obtain corresponding to this phonetic order The content information of type.

In a possible embodiment, it is preferably as follows two kinds of possible methods, for obtaining described content letter Breath:

One, by calling corresponding system command, by the content obtaining described type in intelligent terminal Information.

Its two, by calling corresponding interface or communication protocol, by corresponding software or webpage or service One or more in device obtain the content information of described type.

In a kind of embodiment of android system, intelligent terminal refers to by calling corresponding system Order, can obtain described content information, as called SimpleDateFormat sDateFormat=new SimpleDateFormat (" yyyy-MM-dd hh:mm:ss ") instructs, and obtains the date in current system And the time.

In a kind of embodiment of android system, according to information type, corresponding by calling HTTP in api interface or WebService specification, the agreement such as POST, JSON, XML, can Obtain type described in corresponding software or webpage or server content information.

Such as by address http://wthrcdn.etouch.cn/weather_mini？City=Beijing, according to City name obtains weather data (JSON)；

Or by address http://wthrcdn.etouch.cn/weather_mini？Citykey=101010100 Weather data (JSON) is obtained by city id.

Certainly, in certain embodiments, it is possible to the location of combined with intelligent terminal, relevant information is obtained.

S14, it is associated the voice broadcast in this content information according to described content information.

After obtaining content information, according to the content of phonetic order, feed back corresponding information.Such as one In embodiment, after obtaining Weather information, according to the content " tomorrow can rain " in phonetic order, The weather condition of tomorrow in voice broadcast Weather information, includes whether the weather letter rained and other are concrete Breath.

Certainly, in certain embodiments, it is possible to use video, image, the carrier such as word, by with Interface, family or other modes are associated the notice in content information.

In a kind of possible embodiment of the present invention, for meeting the need of User Defined phonetic order Ask, as in figure 2 it is shown, also include following previous step:

User interface is shown under S10, open state, for editor's phonetic order configuration data, and Complete pre-seting of these configuration data accordingly.

In one embodiment, described phonetic order configuration data include the mark of phonetic order and interior Hold, and the information type corresponding to described content.

Described mark, the content information type corresponding with content can be arranged the most flexibly by user, So that speech recognition and control obtain information, such as, in a possible embodiment:

Described mark includes: " hello ", and " Hello " etc. is used for making phonetic order energy in speech recognition By Rapid Detection；

Described content includes: " now ", " what day is today ", " current temperature ", " bright It can rain ", " posteriori weather ", " present Shanghai and Shenzhen stock index " etc.；

Described information type includes " time ", " weather ", " stock index " etc.；

The corresponding relation of described content and information type is: " now " and " today is week Several " corresponding information type is " time "；" current temperature ", " tomorrow can rain and " posteriori Weather " corresponding to information type be " weather "；Info class corresponding to " present Shanghai and Shenzhen stock index " Type is " stock index " etc..

There is provided user interface to make user set mark and the content of phonetic order, and described content institute is right The information type answered, so that user realizes the personalization of phonetic order and diversified setting, improves people Experience that machine is mutual and the efficiency of life and work.

In one embodiment, pre-set described in by user by voice or the typing of word or selected and Complete.Make user that diversified phonetic order can be set personalizedly, improve Voice command merit The interest of energy and practicality.

Additionally, embodiments provide intelligent terminal, as it is shown on figure 3, for convenience of description, only show Having gone out the part relevant to the embodiment of the present invention, concrete ins and outs do not disclose, and refer to the present invention real Execute example method part.This terminal can be to include mobile phone, panel computer, PDA (Personal Digital Assistant, personal digital assistant), POS (Point of Sales, point-of-sale terminal), vehicle-mounted computer etc. appoint Meaning intelligent terminal, as a example by intelligent terminal is as mobile phone:

Fig. 3 is illustrated that the part-structure of the mobile phone relevant to the intelligent terminal of embodiment of the present invention offer Block diagram.With reference to Fig. 3, intelligent terminal includes: pickup unit 11, recognition unit 12, acquiring unit 13 With report unit 14.Wherein:

Pickup unit 11, the sampling frequency under the different time sections specified according to voice collecting scheme Rate, gathers speech data.

In a kind of possible embodiment of the present invention, the described voice collecting scheme of pickup unit 11 Define intelligent terminal under each time period and gather the sample frequency of voice, described voice collecting scheme Formulation process comprises the steps:

Secondly, the threshold value model pre-seted met according to the mutual index of each described time period respectively Enclose, it is stipulated that pickup unit 11 uses sample frequency corresponding with described threshold range in each time period, And complete the formulation of voice collecting scheme accordingly.

Mutual index=manipulation * 0.3+ displacement * 0.3+ sound control * 1；

Time	Manipulation	Displacement	Sound control	Mutual index
					08:00	1	1	0	0.6
08:05	1	0	0	0.3
					08:10	0	0	0	0
08:15	1	0	1	1.3
					08:20	0	1	0	0.3
08:25	0	1	0	1
					08:30	1	1	1	1.6

Then in the time period of 08:00-08:30, mutual index is 0.73.

Certainly, in some possible embodiments of the present invention, described threshold range or its adopt accordingly Sample frequency is also dependent on the dump energy of intelligent terminal, or combines user according to the positional information of user In the service condition of the voice control function of this position, carry out corresponding dynamically adjustment, to control intelligence The terminal energy consumption when realizing voice control function, extends the stand-by time of intelligent terminal.

In a possible embodiment, the described voice collecting scheme of pickup unit 11 also includes, Show user interface under open state, for arrange described threshold range and each threshold range institute right The sample frequency answered, to complete pre-seting of described threshold range.

In a possible embodiment of the present invention, pickup unit 11 is controlled by dsp processor The collection of described speech data.

Voice collecting scheme foundation user's interbehavior in different time sections and between intelligent terminal, The corresponding sample frequency of pickup unit 11 is specified, with effectively in the voice collecting of different time sections Control energy consumption, improve the voice collecting usefulness of intelligent terminal under holding state.

Recognition unit 12, is configured to according to the phonetic order configuration data pre-seted described voice Data carry out characteristic matching, it is judged that whether described speech data is phonetic order.

In the possible embodiment of one of the present invention, the process bag levying coupling of described recognition unit 12 Include:

Retrieve the mark whether contained in described speech data in phonetic order configuration data, if not containing institute State mark, then the instruction of this speech data non-voice, terminate retrieval；Intelligent terminal is the most right at holding state The mark of phonetic order is identified, and carries out next step identification after identifying mark again, is improving Energy consumption is also reduced while recognition efficiency.

By above example is preset pre-seting and recognition unit of the phonetic order data of unit 10 The method levying coupling of 12, intelligent terminal can be made quickly to identify from speech data, and the voice of user refers to Order, to realize Voice command.

Acquiring unit 13, for the phonetic order corresponding in response to described speech data, obtains this language The content information of the corresponding type of sound instruction.

In a possible embodiment, acquiring unit 13 is preferably as follows two kinds of possible methods, is used for obtaining institute The content information stated:

Certainly, in certain embodiments, acquiring unit 13 also can the location of combined with intelligent terminal, obtain Take relevant information.

Report unit 14, be associated the voice broadcast in this content information according to described content information.

After obtaining content information, report the unit 14 content according to phonetic order, the corresponding letter of feedback Breath.Such as in one embodiment, after obtaining Weather information, " bright according to the content in phonetic order It can rain ", the weather condition of tomorrow in voice broadcast Weather information, include whether to rain and it The Weather information that he is concrete.

Certainly, in certain embodiments, report unit 14 and may be used without video, image, word etc. Carrier, is associated the notice in content information by user interface or other modes.

In a kind of possible embodiment of the present invention, for meeting the need of User Defined phonetic order Ask, as shown in Figure 4, also include presetting unit 10 as follows:

Preset unit 10, be configured under open state show user interface, for editor's voice Instruction configuration data, and complete pre-seting of these configuration data accordingly.

In one embodiment, institute's phonetic order configuration data include mark and the content of phonetic order, And the information type corresponding to described content.

Presetting unit 10 provides user interface to make user set mark and the content of phonetic order, and Information type corresponding to described content, so that user realizes the personalization of phonetic order and diversified Set, improve experience and the efficiency of life and work of man-machine interaction.

In one embodiment, described default unit 10 pre-seted voice or the typing of word or Select and complete.Make user that diversified phonetic order can be set personalizedly, improve voice Control practicality and the interest of function.

Inventive embodiments additionally provides a kind of intelligent terminal, including:

Touch-sensitive display, is used for showing user interface, it is achieved man-machine interaction；

One or more processors；

Memorizer；

One or more application programs, wherein said one or more application programs are stored in described In memorizer and be configured to be performed by the one or more processor；

The one or more program is used for driving the one or more processor to be configured to perform The unit of speech-controlled information acquisition methods under above-mentioned holding state.Described unit includes: preset unit 10, pickup unit 11, recognition unit 12, acquiring unit 13 and report unit 14.

As a example by intelligent terminal is mobile phone:

Described intelligent terminal can be communicated with network and other equipment by radio communication.Above-mentioned wireless Communication can use arbitrary communication standard or agreement, includes but not limited to global system for mobile communications (Global System of Mobile communication, GSM), general packet radio service (General Packet Radio Service, GPRS), CDMA (Code Division Multiple Access, CDMA), WCDMA (Wideband Code Division Multiple Access, WCDMA), Long Term Evolution (Long Term Evolution, LTE), Email, short disappear Breath service (Short Messaging Service, SMS) etc..

Memorizer is used for storing software program and module, and processor is stored in memorizer by operation Software program and module, thus perform mobile phone various functions application and data process.Storage Device can mainly include storing program area and storage data field, and wherein, storage program area can store operation system Application program (such as sound-playing function, image player function etc.) needed for system, at least one function Deng；Storage data field can store data (the such as voice data, electricity that the use according to mobile phone is created Script for story-telling etc.) etc..Additionally, memorizer can include high-speed random access memory, it is also possible to include non- Volatile memory, for example, at least one disk memory, flush memory device or other volatibility are solid State memory device.

Touch-sensitive display can include touch detecting apparatus and two parts of touch controller.Wherein, touch The touch orientation of detection device detection user, and detect the signal that touch operation brings, by signal transmission To touch controller；Touch controller receives touch information from touch detecting apparatus, and it is changed Become contact coordinate, then give processor, and order that processor sends can be received and performed.This Outward, the polytypes such as resistance-type, condenser type, infrared ray and surface acoustic wave can be used to realize touch-sensitive Display.

Touch-sensitive display can be used for the information that inputted by user of display or the information being supplied to user and The various menus of mobile phone, such as information editing interface etc..Touch-sensitive display can include display floater, optional , liquid crystal display (Liquid Crystal Display, LCD), organic light-emitting diodes can be used The forms such as pipe (Organic Light-Emitting Diode, OLED) configure touch-sensitive display.Enter One step, when touch-sensitive display detects thereon or after neighbouring touch operation, sends processor to To determine the type of touch event, with preprocessor according to the type of touch event on the touch sensitive display Corresponding visual output is provided.

Mobile phone includes audio frequency input and output system or equipment, and including mike, bluetooth, earphone is (even Jointing holes), microphone etc..

Mobile phone may also include at least one sensor, and such as gravity sensor, optical sensor, motion pass Sensor and other sensors.Specifically, optical sensor can include ambient light sensor and close to sensing Device, wherein, ambient light sensor can regulate the brightness of touch-sensitive display according to the light and shade of ambient light, Proximity transducer can cut out touch-sensitive display and/or backlight when mobile phone moves in one's ear.As fortune The one of dynamic sensor, accelerometer sensor can detect (generally three axles) acceleration in all directions Size, can detect that size and the direction of gravity time static, can be used for identify mobile phone attitude application (such as horizontal/vertical screen switching, dependent game, magnetometer pose calibrating), Vibration identification correlation function (ratio Such as pedometer, percussion) etc.；The gyroscope that can also configure as mobile phone, barometer, drimeter, temperature Other sensors such as degree meter, infrared ray sensor, do not repeat them here.

Processor is the control centre of mobile phone, utilizes various interface and the whole mobile phone of connection each Part, by running or perform software program and/or the module being stored in memorizer, and calls The data being stored in memorizer, perform the various functions of mobile phone and process data, thus entering mobile phone Row integral monitoring.Optionally, processor can include one or more processing unit；Preferably, process Device can integrated application processor and modem processor, wherein, application processor mainly processes operation System, user interface and application program etc., modem processor mainly processes radio communication.Permissible Being understood by, above-mentioned modem processor can not also be integrated in processor.

Mobile phone also includes the power supply (such as battery) powered to all parts, it is preferred that power supply is permissible Logically contiguous with processor by power-supply management system, thus realize management by power-supply management system and fill The functions such as electricity, electric discharge and power managed.

Mobile phone can also include photographic head, bluetooth module etc., does not repeats them here.

In embodiments of the present invention, the processor included by this intelligent terminal also has a following functions:

According to the sample frequency under the different time sections that voice collecting scheme specifies, gather speech data.

Secondly, the threshold value model pre-seted met according to the mutual index of each described time period respectively Enclose, it is stipulated that use sample frequency corresponding with described threshold range in each time period.

In a kind of possible design, gather each of a user at set intervals (such as 5 minutes) Plant the data of interbehavior, to draw the Annual distribution of user and the interbehavior of intelligent terminal, and then Calculate the user's mutual index under different time sections.

Mutual index=manipulation * 0.3+ displacement * 0.3+ sound control * 1；

In a possible embodiment of the present invention, control described voice number by dsp processor According to collection.

Voice collecting scheme foundation user's interbehavior in different time sections and between intelligent terminal, In the voice collecting of different time sections, specify corresponding sample frequency, to efficiently control energy consumption, carry The voice collecting usefulness of intelligent terminal under high holding state.

Phonetic order configuration data according to pre-seting carry out characteristic matching to described speech data, it is judged that Whether described speech data is phonetic order.

In a possible embodiment, under open state, show user interface, for editor's voice Instruction configuration data, and complete pre-seting of these configuration data accordingly.

In one embodiment, pre-set described in by user by voice or the typing of word or selected and Complete.Make user that diversified phonetic order can be set personalizedly, improve Voice command merit The practicality of energy and interest.

In response to the phonetic order that described speech data is corresponding, obtain type corresponding to this phonetic order Content information.

Those skilled in the art is it can be understood that arrive, for convenience and simplicity of description, above-mentioned The system described, the specific works process of device and unit, it is referred in preceding method embodiment Corresponding process, does not repeats them here.

In several embodiments provided herein, it should be understood that disclosed system, device And method, can realize by another way.Such as, device embodiment described above is only It is schematic, such as, the division of described unit, it is only a kind of logic function and divides, actual real Can have now other dividing mode, the most multiple unit or assembly can in conjunction with or can be integrated To another system, or some features can be ignored, or does not performs.Another point, shown or discussed Coupling each other direct-coupling or communication connection can be by some interfaces, device or list The INDIRECT COUPLING of unit or communication connection, can be electrical, machinery or other form.

The described unit illustrated as separating component can be or may not be physically separate, The parts shown as unit can be or may not be physical location, i.e. may be located at a ground Side, or can also be distributed on multiple NE.Can select therein according to the actual needs Some or all of unit realizes the purpose of the present embodiment scheme.

It addition, each functional unit in each embodiment of the present invention can be integrated in a processing unit In, it is also possible to it is that unit is individually physically present, it is also possible to two or more unit are integrated in In one unit.Above-mentioned integrated unit both can realize to use the form of hardware, it would however also be possible to employ soft The form of part functional unit realizes.

One of ordinary skill in the art will appreciate that, the whole or portion in the various methods of above-described embodiment The program that can be by step by step completes to instruct relevant hardware, and this program can be stored in a meter In calculation machine readable storage medium storing program for executing, storage medium can include but not limited to: any kind of dish (includes Floppy disk, hard disk, CD, CD-ROM and magneto-optic disk), ROM (Read-Only Memory, Read only memory), RAM (Random Access Memory, memorizer immediately), EPROM (Erasable Programmable Read-Only Memory, Erarable Programmable Read only Memory), EEPROM (Electrically Erasable Programmable Read-Only Memory, electrically erasable Programmable read only memory), flash memory, magnetic card or light card.

One of ordinary skill in the art will appreciate that, realize in above-described embodiment method is all or part of Step can be by program and completes to instruct relevant hardware, and described program can be stored in one In computer-readable recording medium, storage medium mentioned above can be read only memory, disk or CD etc..

Above to speech-controlled information acquisition methods and intelligence under a kind of holding state provided by the present invention Can terminal be described in detail, for one of ordinary skill in the art, former without departing from the present invention On the premise of reason, the most all will change, in sum, This specification content should not be construed as limitation of the present invention.

Claims

1. speech-controlled information acquisition methods under a holding state, it is characterised in that include walking as follows Rapid:

Method the most according to claim 1, it is characterised in that also include following previous step:

Method the most according to claim 1 and 2, it is characterised in that described phonetic order configures Data include mark and the content of phonetic order, and the information type corresponding to described content.

Method the most according to claim 1 and 2, it is characterised in that described in pre-set by with Family is by voice or the typing of word or selected and complete.

Method the most according to claim 1, it is characterised in that described voice collecting scheme specifies Under each time period, intelligent terminal gathers the sample frequency of voice, the formulation of described voice collecting scheme Process comprises the steps:

Method the most according to claim 1, it is characterised in that the process of described characteristic matching Including:

Method the most according to claim 1, it is characterised in that during described characteristic matching, When described speech data is more than pre-with the mark of described phonetic order configuration data or the matching rate of content During the threshold value arranged, then judge that speech data contains described mark or content；Otherwise, it is determined that voice number According to not containing described mark or content.

Method the most according to claim 1, it is characterised in that the acquisition side of described content information Method also includes: by calling corresponding interface or communication protocol, by corresponding software or webpage or service One or more in device obtain the content information of described type.

9. an intelligent terminal, it is characterised in that including:

10. an intelligent terminal, it is characterised in that including:

One or more processors；

Memorizer；

The one or more program is used for driving the one or more processor to be configured to perform The unit of method described in any one in claim 1 to 8.