CN103714815A - Voice control method and device thereof - Google Patents

Voice control method and device thereof Download PDF

Info

Publication number
CN103714815A
CN103714815A CN201310657278.1A CN201310657278A CN103714815A CN 103714815 A CN103714815 A CN 103714815A CN 201310657278 A CN201310657278 A CN 201310657278A CN 103714815 A CN103714815 A CN 103714815A
Authority
CN
China
Prior art keywords
information
voice
order
execution
waking
Prior art date
Legal status (The legal status is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the status listed.)
Pending
Application number
CN201310657278.1A
Other languages
Chinese (zh)
Inventor
何永
李传丰
Current Assignee (The listed assignees may be inaccurate. Google has not performed a legal analysis and makes no representation or warranty as to the accuracy of the list.)
Individual
Original Assignee
Individual
Priority date (The priority date is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the date listed.)
Filing date
Publication date
Application filed by Individual filed Critical Individual
Priority to CN201310657278.1A priority Critical patent/CN103714815A/en
Publication of CN103714815A publication Critical patent/CN103714815A/en
Pending legal-status Critical Current

Links

Images

Abstract

The invention discloses a voice control method and a device thereof. The control method comprises the steps of: (1) receiving audio data in real time; (b) performing head part judgment on the received audio data through voice breakpoint detection to obtain valid audio information; (c) judging whether the valid audio information includes awakening information, and if the awakening information being included, further executing step (d), otherwise executing step (a); (d) performing head part and tail part judgment on the valid audio information through voice breakpoint detection to obtain executive content information; (e) performing semantic parsing to convert the executive content information to standard executive command information; and (f) executing a related command according to the standard executive command information and displaying an executive result to a user. The voice control method and the device thereof provide a novel intelligent voice interaction environment, and enable the user to use a voice interaction function efficiently and conveniently.

Description

Sound control method and equipment thereof
Technical field
The present invention relates to voice/semantic recognition technology, natural language processing technique and intelligent terminal applicating developing technology field, specifically, is a kind of sound control method and equipment thereof.
Background technology
Along with interactive voice technology and intelligent control technology ground development, there is speech identifying function and can also get more and more according to the equipment of inputted voice content execution associative operation.At present, known voice opertaing device is mainly adopted and is carried out in two ways alternately, and a kind of mode is by manually booting speech recognition switch, and after starting this switch, content is carried out in phonetic entry.Another kind of mode is by specifically waking information up to start speech identifying function, and after having waken up, then content is carried out in phonetic entry.But the opertaing device of the interactive voice of above-mentioned two classes has following weak point: (1) first kind of way, need manual operation, and can not realize interactive voice full automatic working completely.(2) second way, each voice operating, first need first phonetic entry one specifically to wake information up, then waiting for a setting-up time (some seconds) afterwards, equipment just can remove to intercept the voice content described in user automatically, so can greatly reduce the mutual agility of intelligent sound and convenience like this.
Therefore, need a kind of novel sound control method and equipment thereof.
Summary of the invention
The object of the invention is to, a kind of sound control method and equipment thereof are provided, it can overcome the deficiencies in the prior art part, and provides a kind of novel intelligent sound mutual environment, makes user can more efficiently use easily voice interactive function.
For achieving the above object, a kind of sound control method of the present invention, comprises step: (a) audio reception data in real time; (b) by voice breaking point detection, the described voice data receiving is carried out to stem judgement, to obtain an effective audio-frequency information; (c) judge whether described effective audio-frequency information comprises the information of waking up; If wake information described in comprising up, further perform step (d); Otherwise execution step (a); (d) by voice breaking point detection, described effective audio-frequency information is carried out to head and the tail judgement, to obtain execution content information; (e) carry out semanteme and resolve, so that described execution content information is converted to standard fill order information; (f) according to described standard fill order information, carry out relevant order, and the result of execution is shown to user.
Further, further comprising the steps in step (c):
Described effective audio-frequency information is sent to a this locality and wakes information database up;
The content of waking described effective audio-frequency information up information database with described this locality is mated; When matching while waking information up, execution step (d); Otherwise, perform step (a).
Further, further comprising the steps in described step (d) and step (e):
The obtained information of waking up and execution content information are sent to high in the clouds database simultaneously;
By high in the clouds speech recognition, the described information of waking up is mated with the content of high in the clouds database; If while matching, perform step (e); Otherwise execution step (a).
Further, further comprising the steps in described step (e):
Transfer obtained execution content information to text formatting information;
By described text formatting information analysis, be standard fill order information.
Further, the information of waking up described in be in a word, word or a sentence any one.
To achieve these goals, the present invention also provides a kind of voice opertaing device, and it comprises audio frequency receiver module, breaking point detection module, wakes signal judgement module up, carries out content information acquisition module, modular converter and execution module; Wherein said audio frequency receiver module, in order to audio reception data in real time; Described breaking point detection module, is connected with described audio frequency receiver module, in order to by voice breaking point detection, the described voice data receiving is carried out to stem judgement, to obtain an effective audio-frequency information; The described signal judgement module that wakes up, is connected with described breaking point detection module, in order to judge whether described effective audio-frequency information comprises the information of waking up, if call described execution content information acquisition module, otherwise calls described audio frequency receiver module; Described execution content information acquisition module, is connected with the described signal judgement module that wakes up, in order to by voice breaking point detection, described effective audio-frequency information is carried out to head and the tail judgement, to obtain execution content information; Described modular converter, is connected with described execution content information acquisition module, resolves, so that described execution content information is converted to standard fill order information in order to carry out semanteme; Described execution module, is connected with described modular converter, and described execution module is in order to carry out relevant order according to described standard fill order information, and the result of execution is shown to user.
Further, described in, wake signal judgement module up and further comprise delivery unit and matching unit; Described delivery unit wakes information database up in order to described effective audio-frequency information is sent to a this locality; Described matching unit is connected with described delivery unit, in order to the content of waking described effective audio-frequency information up information database with described this locality, mates; When matching while waking information up, call and carry out content information acquisition module; Otherwise, call described audio frequency receiver module.
Further, described delivery unit is further in order to be sent to high in the clouds database by the obtained information of waking up and execution content information simultaneously; Described matching unit is further in order to mate the described information of waking up by high in the clouds speech recognition with the content of high in the clouds database; If while matching, call described modular converter; Otherwise call described audio frequency receiver module.
Further, described modular converter further comprises converting unit and resolution unit, and described converting unit, in order to transfer obtained execution content information to text formatting information; Described resolution unit is connected with described converting unit, in order to being standard fill order information by described text formatting information analysis.
Further, the information of waking up described in be in a word, word or a sentence any one.
The invention has the advantages that, utilize voice breaking point detection technology, wake information detection technology and speech recognition technology up, to provide a kind of novel intelligent sound mutual environment, make user can use more efficiently and easily voice interactive function, thus make relevant equipment can complete more quickly the voice content carried out of wish.
Accompanying drawing explanation
Fig. 1 is the flow chart of steps of sound control method of the present invention.
Fig. 2 is the Organization Chart of voice opertaing device of the present invention.
Embodiment
Below in conjunction with accompanying drawing, the embodiment of a kind of sound control method provided by the invention and equipment is elaborated.
First by reference to the accompanying drawings provide the embodiment of sound control method of the present invention.
Fig. 1 is the flow chart of steps of sound control method of the present invention.Shown in Figure 1, sound control method of the present invention comprises: step S110, audio reception data in real time; Step S120, by voice breaking point detection, the described voice data receiving is carried out to stem judgement, to obtain an effective audio-frequency information; Step S130, judge whether described effective audio-frequency information comprises the information of waking up; If wake information described in comprising up, further perform step S140; Otherwise execution step S110; Step S140, by voice breaking point detection, described effective audio-frequency information is carried out to head and the tail judgements, to obtain execution content information; Step S150, carry out semanteme and resolve, so that described execution content information is converted to standard fill order information; Step S160 carries out relevant order according to described standard fill order information, and the result of execution is shown to user.
Below with reference to accompanying drawing 1, illustrate each step.
Step S110: audio reception data in real time.
Enter init state, 24 hours audio reception data (with voice mode input) in real time.
Step S120: by voice breaking point detection, the described voice data receiving is carried out to stem judgement, to obtain an effective audio-frequency information.
In this step, utilize voice breaking point detection mode to carry out stem judgement to received described voice data, thereby obtain an effective audio-frequency information.So-called stem judgement, utilize just voice breaking point detection mode, can obtain effective audio-frequency information, and get rid of information that noise produces or the information of improper phonetic entry, thereby reduce the probability that destination object performs an action because of wrong audio-frequency information.
Step S130: judge whether described effective audio-frequency information comprises the information of waking up; If wake information described in comprising up, further perform step S140; Otherwise execution step S110.
In one embodiment of the present invention, described in to wake information (or claim wake up word) up be one to preset, its can be when dispatching from the factory default setting, or can select before use setting.Described wake up information be in a word, word or a sentence any one.For example, waking information up can be " newly ", " Xiao Ming ", " my baby " etc.Wake information up except comprising Chinese word, can also comprise other foreign language words, at this, do not limit.In addition, the address to destination object when the information of waking up described in literary composition is phonetic entry, this destination object can be carried out relevant action according to received voice content.Describedly wake the information explanation that also can be further explained hereinafter up.
In this step, utilization wakes information detection technology up and judges whether described effective audio-frequency information comprises the information of waking up setting.Described in comprising if judge, wake information up, continue subsequent step, otherwise again wait for the voice data that reception is new.
When judging effective audio-frequency information, comprise after the information of waking up, further confirm to wake the starting position whether information is positioned at effective audio-frequency information up, be positioned at the stem of effective audio-frequency information.If satisfy condition, carry out subsequent step, otherwise the information of for example waking up appears at the middle somewhere of effective audio-frequency information, or occur in the end, in the case, can again wait for and receive new voice data.
In another embodiment of the present invention, further comprising the steps in step S130:
Described effective audio-frequency information is sent to a this locality and wakes information database up;
The content of waking described effective audio-frequency information up information database with described this locality is mated; When matching while waking information up, perform step S140; Otherwise, perform step S110.
The content of wherein waking described effective audio-frequency information up information database with described this locality is mated, can be understood as, first by the data that preset in a large number, set up with data model, then by described effective audio-frequency information, mate with this data model, to determine similarity, when if similarity reaches a threshold value, think that described effective audio-frequency information includes the information of waking up.
At other embodiments of the present invention, be not limited to aforesaid way, can adopt above-mentioned by the information of waking up presetting, to judge whether described effective audio-frequency information comprises the information of waking up setting.
Step S140: by voice breaking point detection, described effective audio-frequency information is carried out to head and the tail judgement, to obtain execution content information.
In this step, by voice breaking point detection, again described effective audio-frequency information is carried out to head and the tail judgement, to obtain execution content.So-called head and the tail judgement, is by voice breaking point detection and not only can judges the end position of the information of waking up, carries out the starting position of content, and judges the end position of carrying out content, like this, just can obtain one and effectively carry out content information.
And prior art is first specifically to wake information up by phonetic entry one, then waiting for a setting-up time (fixing some seconds) afterwards, target device just can remove to intercept the voice content described in user automatically, so can cause the situation of time delay intercepting voice content, so that have deviation with actual speech input content, imperfect, thus different execution results produced.As can be seen here, adopt voice breaking point detection technology can guarantee that the execution content of obtaining is correct.
In another embodiment of the present invention, further comprising the steps in described step S140 and step S150:
The obtained information of waking up and execution content information are sent to high in the clouds database simultaneously;
By high in the clouds speech recognition, the described information of waking up is mated with the content of high in the clouds database; If match while waking information up, perform step S150; Otherwise execution step S110.
The execution of above-mentioned steps is in order to reduce false wake-up probability, by adopting high in the clouds speech recognition (engine) to verify that again whether the current information of waking up is effective.If again match identically while waking information up, carry out subsequent step.With only by this locality, wake the mode that information database judges whether effective audio-frequency information comprises the information of waking up up and compare, the mode that this step adopts is the data model that utilizes database its large amount of complex data that have in high in the clouds to set up, wake the coupling of information up, thereby can effectively lower false wake-up number of times.
In other embodiments of the present invention, be not limited to aforesaid way, also can adopt other modes to verify the correctness of the information of waking up.
Step S150: carry out semanteme and resolve, so that described execution content information is converted to standard fill order information.
Wake information up described in comprising judging described effective audio-frequency information, and after obtaining and carrying out content information,, by semantic analysis mode, described execution content information is converted to standard fill order information.
In one embodiment of the present invention, this step may further include following steps:
Transfer obtained execution content information to text formatting information;
By described text formatting information analysis, be standard fill order information.
In other words, by speech recognition technology, (for example convert voice messaging to discernible text message exactly, convert voice messaging " Xiao Ming; please open door " to text formatting " Xiao Ming; please open door "), and described this paper information analysis is gone out to relevant fill order, with standard format, export.Wherein, the step that the execution content information of described acquisition is transferred to text formatting information completes in database beyond the clouds, thereby improves conversion efficiency.And this step also can complete in other embodiments in local data base.By natural language processing technique, by described text formatting information analysis, be standard fill order information simultaneously.
Step S160: carry out relevant order according to described standard fill order information, and the result of execution is shown to user.
When target device (waking the object of information up) can be according to described standard fill order information, and call relevant module to carry out relevant fill order, and execution result is shown to user.
Below with reference to accompanying drawing, provide the embodiment of technique scheme.
Embodiment mono-, with user speech input " little intelligence please be opened bedroom air-conditioning ", be example.
Step S110, audio reception data in real time.
Destination object is within 24 hours, to detect in real time the voice data of received phonetic entry.
Step S120, by voice breaking point detection, the described voice data receiving is carried out to stem judgement, to obtain an effective audio-frequency information.
When destination object receives voice data, can utilize voice breaking point detection to carry out stem judgement to received voice data, to obtain effective audio-frequency information " little intelligence; please open bedroom air-conditioning ", and got rid of " little intelligence " effectively audio-frequency information noise information or improper speech input information before.
Step S130, judge whether described effective audio-frequency information comprises the information of waking up.
Destination object is sent to a this locality by received effective audio-frequency information and wakes information database up.
The content of waking described effective audio-frequency information up information database with described this locality is mated, detect and whether have the qualified information of waking up, after " little intelligence " having been detected this waken information up, can further judge, this wakes the stem whether information is positioned at described effective audio-frequency information up " little intelligence ".Due to " little intelligence ", this wakes the stem whether information is positioned at described effective audio-frequency information up, therefore, carries out subsequent step, otherwise destination object is waited for the voice data that reception is new again.
Step S140, by voice breaking point detection, described effective audio-frequency information is carried out to head and the tail judgements, to obtain execution content information.
By voice breaking point detection, again described effective audio-frequency information " little intelligence; please open bedroom air-conditioning " is carried out to head and the tail judgement, judge " intelligence " word in " little intelligence " and when finish, think that ensuing audio-frequency information is the starting position of carrying out content.Equally, utilize voice breaking point detection also to judge " tune " word in " please open bedroom air-conditioning " and when finish, think and carry out the end position of content.So, can obtain and carry out content information (" please open bedroom air-conditioning ").
In the present embodiment, destination object can comprise that by effective audio-frequency information the information of waking up and execution content information (are sent to high in the clouds database for " little intelligence " " please open bedroom air-conditioning " herein simultaneously through a step.
By high in the clouds speech recognition by described in wake information " little intelligence " up and mate with the content of high in the clouds database, if match, carry out next step operation, otherwise destination object is waited for new voice data again.By this locality, wake the content matching of information database and the content matching of high in the clouds database up, wake the double verification of information up, effectively to reduce false wake-up number of times.
In the present embodiment, by high in the clouds database and speech recognition technology, described execution content information transfers text formatting information to, thereby improves conversion efficiency.
Step S150, carry out semanteme and resolve, so that described execution content information is converted to standard fill order information.
In this step, by natural language processing technique, by described text formatting information analysis, be standard fill order information.That is to say, by natural-sounding treatment technology, to text formatting information analysis, identify the true intention of text formatting information, the implication that " please open bedroom air-conditioning " is " air-conditioning in this room, bedroom is opened ", and changes into standard fill order information for " CommandOpen| bedroom | air-conditioning ".The form of described standard fill order information can be by requirement definition, only need to be with set form.
Step S160 carries out relevant order according to described standard fill order information, and the result of execution is shown to user.
Destination object, according to described standard fill order information " CommandOpen| bedroom | air-conditioning ", calls relevant processing module and execution module, to have coordinated the content of described standard fill order information.Meanwhile, the result of execution is shown to user's (being destination object herein, opens the air-conditioning in bedroom).
Sound control method of the present invention, by identifying the information of waking up of user speech input and carrying out content, to start voice control flow, thereby the operational order (carrying out content) of user speech input is sent to target device with predetermined manner, realizes the control to target device.
More important point is, the present invention utilizes voice breaking point detection technology, wakes information detection technology, speech recognition technology and natural language processing technique up and provide a kind of novel intelligent sound mutual environment, user is without manual operation target device, so reduce user's operation, make user can use more efficiently and easily voice interactive function.
Except a kind of sound control method that the invention described above provides, the present invention also provides a kind of voice opertaing device.
Fig. 2 is the Organization Chart of voice opertaing device of the present invention.Shown in Figure 2, voice opertaing device of the present invention comprises audio frequency receiver module M210, breaking point detection module M220, wakes signal judgement module M230 up, carries out content information acquisition module M240, modular converter M250 and execution module M260.Wherein said audio frequency receiver module M210, in order to audio reception data in real time.
Described breaking point detection module M220, is connected with described audio frequency receiver module M210, in order to by voice breaking point detection, the described voice data receiving is carried out to stem judgement, to obtain an effective audio-frequency information.
Wherein, so-called stem judgement, utilizes voice breaking point detection mode just, can obtain effective audio-frequency information, and got rid of information that noise produces or the information of improper phonetic entry, thereby reduced the probability that destination object performs an action because of wrong audio-frequency information.
The described signal judgement module M230 that wakes up, is connected with described breaking point detection module M220, in order to judge whether described effective audio-frequency information comprises the information of waking up, if call described execution content information acquisition module, otherwise calls described audio frequency receiver module.
In an embodiment of the present invention, described in to wake information up be one to preset, its can be when dispatching from the factory default setting, or can select before use setting.Described wake up information be in a word, word or a sentence any one.For example, waking information up can be " newly ", " Xiao Ming ", " my baby " etc.Wake information up except comprising Chinese word, can also comprise other foreign language words, at this, do not limit.In addition, the address to destination object when the information of waking up described in literary composition is phonetic entry, this destination object can be carried out relevant action according to received voice content.
And as preferred embodiment, described in wake signal judgement module M230 up and further comprise delivery unit M231 and matching unit M233; Described delivery unit M231 wakes information database up in order to described effective audio-frequency information is sent to a this locality; Described matching unit M233 is connected with described delivery unit M231, in order to the content of waking described effective audio-frequency information up information database with described this locality, mates; When matching while waking information up, call described execution content information acquisition module M240; Otherwise, call described audio frequency receiver module M210.
As preferred embodiment, described delivery unit M231 is further in order to be sent to high in the clouds database by the obtained information of waking up and execution content information simultaneously; Described matching unit M233 is further in order to mate the described information of waking up by high in the clouds speech recognition with the content of high in the clouds database; If match while waking information up, call described modular converter M250; Otherwise call described audio frequency receiver module M210.With only by this locality, wake the mode that information database judges whether effective audio-frequency information comprises the information of waking up up and compare, the data model that utilizes database its large amount of complex data that have in high in the clouds to set up, wake the coupling of information up, thereby can effectively lower false wake-up number of times.
Described execution content information acquisition module M240, is connected with the described signal judgement module M230 that wakes up, in order to by voice breaking point detection, described effective audio-frequency information is carried out to head and the tail judgement, to obtain execution content information.
So-called head and the tail judgement, is by voice breaking point detection and not only can judges the end position of the information of waking up, carries out the starting position of content, and judges the end position of carrying out content, like this, just can obtain one and effectively carry out content information.And prior art is first specifically to wake information up by phonetic entry one, then waiting for a setting-up time (fixing some seconds) afterwards, target device just can remove to intercept the voice content described in user automatically, so can cause the situation of time delay intercepting voice content, so that have deviation with actual speech input content, imperfect, thus different execution results produced.As can be seen here, adopt voice breaking point detection technology can guarantee that the execution content of obtaining is correct.
Described modular converter M250, is connected with described execution content information acquisition module M240, resolves, so that described execution content information is converted to standard fill order information in order to carry out semanteme.
As preferred embodiment, described modular converter M250 further comprises converting unit M251 and resolution unit M253, and described converting unit M251, in order to transfer obtained execution content information to text formatting information.Wherein, described converting unit M251 can arrange in the database of high in the clouds, to transfer the execution content information of described acquisition to text formatting information, thereby improves conversion efficiency.And described converting unit M251 can arrange in local data base in other embodiments.Described resolution unit M253 is connected with described converting unit M251, in order to being standard fill order information by described text formatting information analysis.
Described execution module M260, is connected with described modular converter M250, and described execution module M260 is in order to carry out relevant order according to described standard fill order information, and the result of execution is shown to user.
The above is only the preferred embodiment of the present invention; it should be pointed out that for those skilled in the art, under the premise without departing from the principles of the invention; can also make some improvements and modifications, these improvements and modifications also should be considered as protection scope of the present invention.

Claims (10)

1. a sound control method, is characterized in that, comprises step:
(a) audio reception data in real time;
(b) by voice breaking point detection, the described voice data receiving is carried out to stem judgement, to obtain an effective audio-frequency information;
(c) judge whether described effective audio-frequency information comprises the information of waking up; If wake information described in comprising up, further perform step (d); Otherwise execution step (a);
(d) by voice breaking point detection, described effective audio-frequency information is carried out to head and the tail judgement, to obtain execution content information;
(e) carry out semanteme and resolve, so that described execution content information is converted to standard fill order information;
(f) according to described standard fill order information, carry out relevant order, and the result of execution is shown to user.
2. sound control method according to claim 1, is characterized in that, further comprising the steps in step (c):
Described effective audio-frequency information is sent to a this locality and wakes information database up;
The content of waking described effective audio-frequency information up information database with described this locality is mated; When matching while waking information up, execution step (d); Otherwise, perform step (a).
3. sound control method according to claim 2, is characterized in that, further comprising the steps in described step (d) and step (e):
The obtained information of waking up and execution content information are sent to high in the clouds database simultaneously;
By high in the clouds speech recognition, the described information of waking up is mated with the content of high in the clouds database; If while matching, perform step (e); Otherwise execution step (a).
4. sound control method according to claim 1, is characterized in that, further comprising the steps in described step (e):
Transfer obtained execution content information to text formatting information;
By described text formatting information analysis, be standard fill order information.
5. sound control method according to claim 1, is characterized in that, described in wake up information be in a word, word or a sentence any one.
6. a voice opertaing device, is characterized in that, comprises audio frequency receiver module, breaking point detection module, wakes signal judgement module up, carries out content information acquisition module, modular converter and execution module; Wherein
Described audio frequency receiver module, in order to audio reception data in real time;
Described breaking point detection module, is connected with described audio frequency receiver module, in order to by voice breaking point detection, the described voice data receiving is carried out to stem judgement, to obtain an effective audio-frequency information;
The described signal judgement module that wakes up, is connected with described breaking point detection module, in order to judge whether described effective audio-frequency information comprises the information of waking up, if call described execution content information acquisition module, otherwise calls described audio frequency receiver module;
Described execution content information acquisition module, is connected with the described signal judgement module that wakes up, in order to by voice breaking point detection, described effective audio-frequency information is carried out to head and the tail judgement, to obtain execution content information;
Described modular converter, is connected with described execution content information acquisition module, resolves, so that described execution content information is converted to standard fill order information in order to carry out semanteme;
Described execution module, is connected with described modular converter, and described execution module is in order to carry out relevant order according to described standard fill order information, and the result of execution is shown to user.
7. voice opertaing device according to claim 6, is characterized in that, described in wake signal judgement module up and further comprise delivery unit and matching unit; Described delivery unit wakes information database up in order to described effective audio-frequency information is sent to a this locality; Described matching unit is connected with described delivery unit, in order to the content of waking described effective audio-frequency information up information database with described this locality, mates; When matching while waking information up, call described execution content information acquisition module; Otherwise, call described audio frequency receiver module.
8. voice opertaing device according to claim 7, is characterized in that, described delivery unit is further in order to be sent to high in the clouds database by the obtained information of waking up and execution content information simultaneously; Described matching unit is further in order to mate the described information of waking up by high in the clouds speech recognition with the content of high in the clouds database; If while matching, call described modular converter; Otherwise call described audio frequency receiver module.
9. voice opertaing device according to claim 6, is characterized in that, described modular converter further comprises converting unit and resolution unit, and described converting unit, in order to transfer obtained execution content information to text formatting information; Described resolution unit is connected with described converting unit, in order to being standard fill order information by described text formatting information analysis.
10. voice opertaing device according to claim 6, is characterized in that, described in wake up information be in a word, word or a sentence any one.
CN201310657278.1A 2013-12-09 2013-12-09 Voice control method and device thereof Pending CN103714815A (en)

Priority Applications (1)

Application Number Priority Date Filing Date Title
CN201310657278.1A CN103714815A (en) 2013-12-09 2013-12-09 Voice control method and device thereof

Applications Claiming Priority (1)

Application Number Priority Date Filing Date Title
CN201310657278.1A CN103714815A (en) 2013-12-09 2013-12-09 Voice control method and device thereof

Publications (1)

Publication Number Publication Date
CN103714815A true CN103714815A (en) 2014-04-09

Family

ID=50407722

Family Applications (1)

Application Number Title Priority Date Filing Date
CN201310657278.1A Pending CN103714815A (en) 2013-12-09 2013-12-09 Voice control method and device thereof

Country Status (1)

Country Link
CN (1) CN103714815A (en)

Cited By (26)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN104464723A (en) * 2014-12-16 2015-03-25 科大讯飞股份有限公司 Voice interaction method and system
CN105632486A (en) * 2015-12-23 2016-06-01 北京奇虎科技有限公司 Voice wake-up method and device of intelligent hardware
CN105719645A (en) * 2014-12-17 2016-06-29 现代自动车株式会社 Speech recognition apparatus, vehicle including the same, and method of controlling the same
WO2016112634A1 (en) * 2015-01-12 2016-07-21 芋头科技(杭州)有限公司 Voice recognition system and method of robot system
CN105976814A (en) * 2015-12-10 2016-09-28 乐视致新电子科技(天津)有限公司 Headset control method and device
CN106126080A (en) * 2016-06-22 2016-11-16 北京云知声信息技术有限公司 Voice management method and device
CN106297777A (en) * 2016-08-11 2017-01-04 广州视源电子科技股份有限公司 A kind of method and apparatus waking up voice service up
CN106448664A (en) * 2016-10-28 2017-02-22 魏朝正 System and method for controlling intelligent home equipment by voice
CN106558305A (en) * 2016-11-16 2017-04-05 北京云知声信息技术有限公司 voice data processing method and device
CN106653031A (en) * 2016-10-17 2017-05-10 海信集团有限公司 Voice wake-up method and voice interaction device
CN106782554A (en) * 2016-12-19 2017-05-31 百度在线网络技术(北京)有限公司 Voice awakening method and device based on artificial intelligence
CN107369445A (en) * 2016-05-11 2017-11-21 上海禹昌信息科技有限公司 The method for supporting voice wake-up and Voice command intelligent terminal simultaneously
CN107731226A (en) * 2017-09-29 2018-02-23 杭州聪普智能科技有限公司 Control method, device and electronic equipment based on speech recognition
CN107886947A (en) * 2017-10-19 2018-04-06 珠海格力电器股份有限公司 The method and device of a kind of image procossing
CN108665900A (en) * 2018-04-23 2018-10-16 百度在线网络技术(北京)有限公司 High in the clouds awakening method and system, terminal and computer readable storage medium
CN108806672A (en) * 2017-04-28 2018-11-13 辛雪峰 A kind of control method for fan of voice double mode
CN108806669A (en) * 2017-04-28 2018-11-13 三星电子株式会社 Electronic device for providing speech-recognition services and its method
CN109102806A (en) * 2018-09-29 2018-12-28 百度在线网络技术(北京)有限公司 Method, apparatus, equipment and computer readable storage medium for interactive voice
CN110021294A (en) * 2018-01-09 2019-07-16 深圳市优必选科技有限公司 Control method, device and the storage device of robot
CN110097878A (en) * 2018-01-30 2019-08-06 阿拉的(深圳)人工智能有限公司 Polygonal color phonetic prompt method, cloud device, prompt system and storage medium
CN110265012A (en) * 2019-06-19 2019-09-20 泉州师范学院 It can interactive intelligence voice home control device and control method based on open source hardware
CN110853632A (en) * 2018-08-21 2020-02-28 蔚来汽车有限公司 Voice recognition method based on voiceprint information and intelligent interaction equipment
CN111986682A (en) * 2020-08-31 2020-11-24 百度在线网络技术(北京)有限公司 Voice interaction method, device, equipment and storage medium
CN112037786A (en) * 2020-08-31 2020-12-04 百度在线网络技术(北京)有限公司 Voice interaction method, device, equipment and storage medium
US10964317B2 (en) 2017-07-05 2021-03-30 Baidu Online Network Technology (Beijing) Co., Ltd. Voice wakeup method, apparatus and system, cloud server and readable medium
CN113643691A (en) * 2021-08-16 2021-11-12 思必驰科技股份有限公司 Far-field voice message interaction method and system

Citations (7)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
US6965863B1 (en) * 1998-11-12 2005-11-15 Microsoft Corporation Speech recognition user interface
CN102629246A (en) * 2012-02-10 2012-08-08 北京百纳信息技术有限公司 Server used for recognizing browser voice commands and browser voice command recognition system
CN102999161A (en) * 2012-11-13 2013-03-27 安徽科大讯飞信息科技股份有限公司 Implementation method and application of voice awakening module
CN103021409A (en) * 2012-11-13 2013-04-03 安徽科大讯飞信息科技股份有限公司 Voice activating photographing system
CN103095911A (en) * 2012-12-18 2013-05-08 苏州思必驰信息科技有限公司 Method and system for finding mobile phone through voice awakening
CN103366740A (en) * 2012-03-27 2013-10-23 联想(北京)有限公司 Voice command recognition method and voice command recognition device
EP2669889A2 (en) * 2012-05-29 2013-12-04 Samsung Electronics Co., Ltd Method and apparatus for executing voice command in electronic device

Patent Citations (7)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
US6965863B1 (en) * 1998-11-12 2005-11-15 Microsoft Corporation Speech recognition user interface
CN102629246A (en) * 2012-02-10 2012-08-08 北京百纳信息技术有限公司 Server used for recognizing browser voice commands and browser voice command recognition system
CN103366740A (en) * 2012-03-27 2013-10-23 联想(北京)有限公司 Voice command recognition method and voice command recognition device
EP2669889A2 (en) * 2012-05-29 2013-12-04 Samsung Electronics Co., Ltd Method and apparatus for executing voice command in electronic device
CN102999161A (en) * 2012-11-13 2013-03-27 安徽科大讯飞信息科技股份有限公司 Implementation method and application of voice awakening module
CN103021409A (en) * 2012-11-13 2013-04-03 安徽科大讯飞信息科技股份有限公司 Voice activating photographing system
CN103095911A (en) * 2012-12-18 2013-05-08 苏州思必驰信息科技有限公司 Method and system for finding mobile phone through voice awakening

Cited By (36)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN104464723A (en) * 2014-12-16 2015-03-25 科大讯飞股份有限公司 Voice interaction method and system
CN105719645B (en) * 2014-12-17 2020-09-18 现代自动车株式会社 Voice recognition apparatus, vehicle including the same, and method of controlling voice recognition apparatus
CN105719645A (en) * 2014-12-17 2016-06-29 现代自动车株式会社 Speech recognition apparatus, vehicle including the same, and method of controlling the same
WO2016112634A1 (en) * 2015-01-12 2016-07-21 芋头科技(杭州)有限公司 Voice recognition system and method of robot system
JP2018507434A (en) * 2015-01-12 2018-03-15 ユウトウ・テクノロジー(ハンジョウ)・カンパニー・リミテッド Voice identification system and method for robot system
CN105976814A (en) * 2015-12-10 2016-09-28 乐视致新电子科技(天津)有限公司 Headset control method and device
CN105976814B (en) * 2015-12-10 2020-04-10 乐融致新电子科技(天津)有限公司 Control method and device of head-mounted equipment
CN105632486A (en) * 2015-12-23 2016-06-01 北京奇虎科技有限公司 Voice wake-up method and device of intelligent hardware
CN105632486B (en) * 2015-12-23 2019-12-17 北京奇虎科技有限公司 Voice awakening method and device of intelligent hardware
CN107369445A (en) * 2016-05-11 2017-11-21 上海禹昌信息科技有限公司 The method for supporting voice wake-up and Voice command intelligent terminal simultaneously
CN106126080A (en) * 2016-06-22 2016-11-16 北京云知声信息技术有限公司 Voice management method and device
CN106126080B (en) * 2016-06-22 2019-08-16 北京云知声信息技术有限公司 Voice management method and device
CN106297777A (en) * 2016-08-11 2017-01-04 广州视源电子科技股份有限公司 A kind of method and apparatus waking up voice service up
CN106297777B (en) * 2016-08-11 2019-11-22 广州视源电子科技股份有限公司 A kind of method and apparatus waking up voice service
CN106653031A (en) * 2016-10-17 2017-05-10 海信集团有限公司 Voice wake-up method and voice interaction device
CN106448664A (en) * 2016-10-28 2017-02-22 魏朝正 System and method for controlling intelligent home equipment by voice
CN106558305A (en) * 2016-11-16 2017-04-05 北京云知声信息技术有限公司 voice data processing method and device
CN106558305B (en) * 2016-11-16 2020-06-02 北京云知声信息技术有限公司 Voice data processing method and device
CN106782554B (en) * 2016-12-19 2020-09-25 百度在线网络技术(北京)有限公司 Voice awakening method and device based on artificial intelligence
CN106782554A (en) * 2016-12-19 2017-05-31 百度在线网络技术(北京)有限公司 Voice awakening method and device based on artificial intelligence
CN108806672A (en) * 2017-04-28 2018-11-13 辛雪峰 A kind of control method for fan of voice double mode
CN108806669A (en) * 2017-04-28 2018-11-13 三星电子株式会社 Electronic device for providing speech-recognition services and its method
US10964317B2 (en) 2017-07-05 2021-03-30 Baidu Online Network Technology (Beijing) Co., Ltd. Voice wakeup method, apparatus and system, cloud server and readable medium
CN107731226A (en) * 2017-09-29 2018-02-23 杭州聪普智能科技有限公司 Control method, device and electronic equipment based on speech recognition
CN107886947A (en) * 2017-10-19 2018-04-06 珠海格力电器股份有限公司 The method and device of a kind of image procossing
CN110021294A (en) * 2018-01-09 2019-07-16 深圳市优必选科技有限公司 Control method, device and the storage device of robot
CN110097878A (en) * 2018-01-30 2019-08-06 阿拉的(深圳)人工智能有限公司 Polygonal color phonetic prompt method, cloud device, prompt system and storage medium
CN108665900B (en) * 2018-04-23 2020-03-03 百度在线网络技术(北京)有限公司 Cloud wake-up method and system, terminal and computer readable storage medium
CN108665900A (en) * 2018-04-23 2018-10-16 百度在线网络技术(北京)有限公司 High in the clouds awakening method and system, terminal and computer readable storage medium
US11574632B2 (en) 2018-04-23 2023-02-07 Baidu Online Network Technology (Beijing) Co., Ltd. In-cloud wake-up method and system, terminal and computer-readable storage medium
CN110853632A (en) * 2018-08-21 2020-02-28 蔚来汽车有限公司 Voice recognition method based on voiceprint information and intelligent interaction equipment
CN109102806A (en) * 2018-09-29 2018-12-28 百度在线网络技术(北京)有限公司 Method, apparatus, equipment and computer readable storage medium for interactive voice
CN110265012A (en) * 2019-06-19 2019-09-20 泉州师范学院 It can interactive intelligence voice home control device and control method based on open source hardware
CN111986682A (en) * 2020-08-31 2020-11-24 百度在线网络技术(北京)有限公司 Voice interaction method, device, equipment and storage medium
CN112037786A (en) * 2020-08-31 2020-12-04 百度在线网络技术(北京)有限公司 Voice interaction method, device, equipment and storage medium
CN113643691A (en) * 2021-08-16 2021-11-12 思必驰科技股份有限公司 Far-field voice message interaction method and system

Similar Documents

Publication Publication Date Title
CN103714815A (en) Voice control method and device thereof
CN108520743B (en) Voice control method of intelligent device, intelligent device and computer readable medium
CN103021409B (en) A kind of vice activation camera system
CN103093755B (en) Based on terminal and mutual network household electric appliance control method and the system of internet voice
CN102855872B (en) Based on terminal and the mutual household electric appliance control method of internet voice and system
CN102855874A (en) Method and system for controlling household appliance on basis of voice interaction of internet
CN104575504A (en) Method for personalized television voice wake-up by voiceprint and voice identification
CN108766441A (en) A kind of sound control method and device based on offline Application on Voiceprint Recognition and speech recognition
CN103188538A (en) Household appliance control method and system based on smart television equipment and Internet
KR20160015218A (en) On-line voice translation method and device
CN104658533A (en) Terminal unlocking method and device as well as terminal
CN104123939A (en) Substation inspection robot based voice interaction control method
CN107544272A (en) terminal control method, device and storage medium
CN102847325B (en) Toy control method and system based on voice interaction of mobile communication terminal
CN112735418B (en) Voice interaction processing method, device, terminal and storage medium
CN110782896A (en) Measuring instrument testing system and method based on voice control
CN102855875A (en) Network speech conversing control system and method based on external open control of speech input
CN110992955A (en) Voice operation method, device, equipment and storage medium of intelligent equipment
CN109992239A (en) Voice traveling method, device, terminal and storage medium
CN108304155A (en) A kind of man-machine interaction control method
CN111091819A (en) Voice recognition device and method, voice interaction system and method
CN111933149A (en) Voice interaction method, wearable device, terminal and voice interaction system
CN110364147A (en) A kind of wake-up training word acquisition system and method
CN102868740A (en) Method and system for controlling toy based on mobile communication terminal and internet voice interaction
CN109859752A (en) A kind of sound control method, device, storage medium and voice joint control system

Legal Events

Date Code Title Description
C06 Publication
PB01 Publication
C10 Entry into substantive examination
SE01 Entry into force of request for substantive examination
RJ01 Rejection of invention patent application after publication
RJ01 Rejection of invention patent application after publication

Application publication date: 20140409