CN105702253A - Voice awakening method and device - Google Patents

Voice awakening method and device Download PDF

Info

Publication number
CN105702253A
CN105702253A CN201610009102.9A CN201610009102A CN105702253A CN 105702253 A CN105702253 A CN 105702253A CN 201610009102 A CN201610009102 A CN 201610009102A CN 105702253 A CN105702253 A CN 105702253A
Authority
CN
China
Prior art keywords
speech data
terminal unit
confidence level
voice
confidence
Prior art date
Legal status (The legal status is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the status listed.)
Pending
Application number
CN201610009102.9A
Other languages
Chinese (zh)
Inventor
朱辉
田伟
李鹏
Current Assignee (The listed assignees may be inaccurate. Google has not performed a legal analysis and makes no representation or warranty as to the accuracy of the list.)
Beijing Yunzhisheng Information Technology Co Ltd
Original Assignee
Beijing Yunzhisheng Information Technology Co Ltd
Priority date (The priority date is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the date listed.)
Filing date
Publication date
Application filed by Beijing Yunzhisheng Information Technology Co Ltd filed Critical Beijing Yunzhisheng Information Technology Co Ltd
Priority to CN201610009102.9A priority Critical patent/CN105702253A/en
Publication of CN105702253A publication Critical patent/CN105702253A/en
Pending legal-status Critical Current

Links

Classifications

    • GPHYSICS
    • G10MUSICAL INSTRUMENTS; ACOUSTICS
    • G10LSPEECH ANALYSIS TECHNIQUES OR SPEECH SYNTHESIS; SPEECH RECOGNITION; SPEECH OR VOICE PROCESSING TECHNIQUES; SPEECH OR AUDIO CODING OR DECODING
    • G10L15/00Speech recognition
    • G10L15/22Procedures used during a speech recognition process, e.g. man-machine dialogue

Landscapes

  • Engineering & Computer Science (AREA)
  • Computational Linguistics (AREA)
  • Health & Medical Sciences (AREA)
  • Audiology, Speech & Language Pathology (AREA)
  • Human Computer Interaction (AREA)
  • Physics & Mathematics (AREA)
  • Acoustics & Sound (AREA)
  • Multimedia (AREA)
  • User Interface Of Digital Computer (AREA)

Abstract

The present invention discloses a voice awakening method and device used for improving the accuracy of utilizing the voice to awaken a terminal device. The method comprises the steps of when the terminal device receives the first voice datacontaining a preset awakening word inputted by a user, matching the first voice data and a preset language model to obtain the confidence of the first voice data; determining whether the confidence is less than a preset confidence threshold value;when the confidence is less than the preset confidence threshold value, executing a preset operation; when the confidence is greater than or equal to the preset confidence threshold value, awakening a voice control function of the terminal device. According to the technical scheme of the present invention, when the user utilizes the voice to awaken the terminal device unsuccessfully, the terminal device can execute the preset operation to improve the confidence of the first voice data, thereby improving the accuracy that the user utilizes the voice to awaken the terminal device and improving the user experience degree.

Description

A kind of voice awakening method and device
Technical field
The present invention relates to voice processing technology field, particularly relate to a kind of voice awakening method and device。
Background technology
Speech recognition technology achieved significant progress in recent years, and this technology has been enter into the every field such as industry, household electrical appliances, Smart Home。Namely voice wakes up is a kind of form of speech recognition technology, and it is not directly contacted with hardware device, can wake equipment up operation by voice。Generally, most equipment is all realize waking up or running of equipment by physical button。But, this is for Consumer's Experience and bad。Voice, as the most natural exchange way of people, wakes this contactless mode starting device up by voice and is undoubtedly more friendly。
Summary of the invention
The embodiment of the present invention provides a kind of voice awakening method and device, for improving the accuracy utilizing voice to wake terminal unit up。
A kind of voice awakening method, comprises the following steps:
When terminal unit receives when comprising default the first speech data waking word up of user's input, described first speech data and preset language model are mated, it is thus achieved that the confidence level of described first speech data;
Judge that whether described confidence level is less than pre-seting confidence threshold;
When described confidence level less than described pre-set confidence threshold time, perform predetermined registration operation;
When described confidence level more than or equal to described pre-set confidence threshold time, wake the voice control function of described terminal unit up。
Some beneficial effects of the embodiment of the present invention may include that
Technique scheme, it is determined by comprising the confidence level presetting the first speech data waking word up, and perform predetermined registration operation at this confidence level less than when pre-seting confidence threshold, simultaneously at this confidence level more than or equal to the voice control function waking terminal unit when pre-seting confidence threshold up, when making user utilize voice to wake terminal unit failure up, terminal unit can improve the confidence level of the first speech data by performing predetermined registration operation, utilizes voice to wake the accuracy of terminal unit and the Experience Degree of user up thus improving user。
In one embodiment, after described execution predetermined registration operation, described method also includes:
Exporting the first information, described first information is used for pointing out described user again to input described first speech data, until the confidence level of described first speech data received pre-sets confidence threshold more than or equal to described。
In this embodiment, prompting user speech data can be again inputted after performing predetermined registration operation, the confidence level making the speech data that user re-enters can reach to pre-set confidence threshold, utilizes voice to wake the accuracy of terminal unit and the Experience Degree of user up thus improving user。
In one embodiment, described execution predetermined registration operation, including:
Judge described terminal unit currently whether positive output second speech data;
When second speech data described in the current positive output of described terminal unit, turn down the volume value of described second speech data。
In this embodiment, the volume value of this speech data can be turned down when the current positive output speech data of terminal unit, so that the confidence level of the speech data of user's input can reach to pre-set confidence threshold, improve user and utilize voice to wake the accuracy of terminal unit and the Experience Degree of user up。
In one embodiment, described execution predetermined registration operation, including:
Exporting the second information, described second information is for pointing out described user the volume value improving described first speech data。
In this embodiment, by pointing out user to improve the volume value of input speech data so that the confidence level of the speech data of user's input can reach to pre-set confidence threshold, improves user and utilizes voice to wake the accuracy of terminal unit and the Experience Degree of user up。
In one embodiment, described execution predetermined registration operation, including:
Confidence threshold is pre-seted described in reduction。
In this embodiment, pre-set confidence threshold by reducing so that the confidence level of the speech data of user's input more easily reachs and pre-sets confidence threshold, improves user and utilizes voice to wake the accuracy of terminal unit and the Experience Degree of user up。
A kind of voice Rouser, including:
Matching module, is used for, when terminal unit receives when comprising default the first speech data waking word up of user's input, described first speech data and preset language model being mated, it is thus achieved that the confidence level of described first speech data;
Judge module, is used for judging that whether described confidence level is less than pre-seting confidence threshold;
Perform module, for when described confidence level less than described pre-set confidence threshold time, perform predetermined registration operation;
Wake module, for when described confidence level more than or equal to described pre-set confidence threshold time, wake the voice control function of described terminal unit up。
In one embodiment, described device also includes:
Output module, after described execution predetermined registration operation, exporting the first information, described first information is used for pointing out described user again to input described first speech data, until the confidence level of described first speech data received pre-sets confidence threshold more than or equal to described。
In one embodiment, described execution module includes:
Judge submodule, be used for judging described terminal unit currently whether positive output second speech data;
Turn down submodule, for when second speech data described in the current positive output of described terminal unit, turning down the volume value of described second speech data。
In one embodiment, described execution module includes:
Output sub-module, is used for exporting the second information, and described second information is for pointing out described user the volume value improving described first speech data。
In one embodiment, described execution module includes:
Reduce submodule, described in being used for reducing, pre-set confidence threshold。
Other features and advantages of the present invention will be set forth in the following description, and, partly become apparent from description, or understand by implementing the present invention。The purpose of the present invention and other advantages can be realized by structure specifically noted in the description write, claims and accompanying drawing and be obtained。
Below by drawings and Examples, technical scheme is described in further detail。
Accompanying drawing explanation
Accompanying drawing is for providing a further understanding of the present invention, and constitutes a part for description, is used for together with embodiments of the present invention explaining the present invention, is not intended that limitation of the present invention。In the accompanying drawings:
Fig. 1 is the flow chart of a kind of voice awakening method in the embodiment of the present invention;
Fig. 2 is the flow chart of step S13 in a kind of voice awakening method in the embodiment of the present invention;
Fig. 3 is the block diagram of a kind of voice Rouser in the embodiment of the present invention;
Fig. 4 is the block diagram of a kind of voice Rouser in the embodiment of the present invention;
Fig. 5 is the block diagram performing module in the embodiment of the present invention in a kind of voice Rouser;
Fig. 6 is the block diagram performing module in the embodiment of the present invention in a kind of voice Rouser;
Fig. 7 is the block diagram performing module in the embodiment of the present invention in a kind of voice Rouser。
Detailed description of the invention
Below in conjunction with accompanying drawing, the preferred embodiments of the present invention are illustrated, it will be appreciated that preferred embodiment described herein is merely to illustrate and explains the present invention, is not intended to limit the present invention。
Fig. 1 is the flow chart of a kind of voice awakening method in the embodiment of the present invention。This voice awakening method is applied in terminal unit, and this terminal unit can be mobile phone, computer, digital broadcast terminal, messaging devices, game console, tablet device, armarium, body-building equipment, arbitrary equipment with voice control function such as personal digital assistant。As it is shown in figure 1, the method comprises the following steps S11-S14:
Step S11, when terminal unit receives when comprising default the first speech data waking word up of user's input, mates the first speech data and preset language model, it is thus achieved that the confidence level of the first speech data。
Wherein, presetting and waking word up is the word relevant to the voice control function of terminal unit, user preset。Such as, if the voice control function of terminal unit includes controlling Smart Home, preset and wake word up and can include the words relevant with Smart Home such as air-conditioning, TV, curtain;Again such as, if the voice control function of terminal unit includes being connected to cloud server and during by the cloud server search network information, preset and wake word up and can include the words relevant to network service such as search, inquiry, weather, train ticket。
When performing this step, first the speech data of user's input can be identified by terminal unit, identify whether this speech data to comprise preset and wake word up, if this speech data comprising preset and waking word up, then continue executing with step S11-S14, if not comprising in this speech data to preset and waking word up, illustrating that user does not wake the wish of the voice control function of terminal unit up, now the speech data of user's input is not made any feedback by terminal unit。
Preset language model can be general language model。
Step S12, it is judged that whether confidence level is less than pre-seting confidence threshold。
Step S13, when confidence level is less than, when pre-seting confidence threshold, performing predetermined registration operation。
Step S14, when confidence level is more than or equal to, when pre-seting confidence threshold, waking the voice control function of terminal unit up。
Some beneficial effects of the embodiment of the present invention may include that
Technique scheme, it is determined by comprising the confidence level presetting the first speech data waking word up, and perform predetermined registration operation at this confidence level less than when pre-seting confidence threshold, simultaneously at this confidence level more than or equal to the voice control function waking terminal unit when pre-seting confidence threshold up, when making user utilize voice to wake terminal unit failure up, terminal unit can improve the confidence level of the first speech data by performing predetermined registration operation, utilizes voice to wake the accuracy of terminal unit and the Experience Degree of user up thus improving user。
In one embodiment, the confidence level of the first speech data can be determined by least one of the following characteristics of the first speech data:
(1) word speed;The i.e. duration of unit word。
(2) N-best feature。
(3) position;I.e. each word location in sentence, neutralizes end of the sentence including beginning of the sentence, sentence。
(4) word is long;Namely the character number that each word includes。
(5) duration;Namely the frame number that each word is lasting。
(6) competing words number: arc number between two neighborhood of nodes on confusion network, namely has several word in competition in a period of time。
(7) the ngram language model scores of word。
(8) difference of competing words posterior probability;The i.e. difference of the posterior probability of the competing words that two posterior probability between two neighborhood of nodes are maximum on confusion network。
(9) sentence is long。
For the features above of the first speech data, the method by the method classified based on predicted characteristics or based on posterior probability can determine and owing to these two kinds of methods are prior art, therefore repeat no more the confidence level of the first speech data。
In above-described embodiment, the value of confidence level is between the scope of 0~1, and owing to confidence level is used to the reliability of assessment voice identification result, therefore confidence level is more high, illustrates that voice identification result is more accurate。Pre-set the value of confidence threshold between the scope of 0~1。
In one embodiment, after step S13, said method is further comprising the steps of:
Exporting the first information, this first information is used for pointing out user again to input the first speech data, until the confidence level of the first speech data received is more than or equal to pre-seting confidence threshold。
Terminal unit can export the first information by the mode of voice output, for instance voice output " please inputs voice content " again。When user inputs the first speech data again, the confidence level of the first speech data, according to the result performed after predetermined registration operation, is determined, until the confidence level of the first speech data is more than or equal to pre-seting confidence threshold by terminal unit again。
In this embodiment, it is possible to after performing predetermined registration operation, prompting user inputs speech data again so that the confidence level of the speech data that user re-enters can reach to pre-set confidence threshold, utilizes voice to wake the success rate of terminal unit up thus improving user。
In above-mentioned steps S13, terminal unit can perform different predetermined registration operation according to different situations。Below by way of several embodiments, the concrete operations performed by terminal unit are described。
In one embodiment, as in figure 2 it is shown, step S13 comprises the following steps S21-S23:
Step S21, it is judged that terminal unit currently whether positive output second speech data;If the current positive output second speech data of terminal unit, then perform step S22;If terminal unit does not currently export second speech data, then perform step S23。
Step S22, turns down the volume value of second speech data。
Wherein, volume value can be characterized by decibel value。Terminal unit can determine that the decibel value of sound in the first speech data and second speech data。
The reduction amplitude of volume value can be turned down according to predetermined amplitude, such as, predetermined amplitude is 25 decibels, terminal unit is playing music, and have determined that the decibel value of this music is 60 decibels, then according to predetermined amplitude, the decibel value of music being reduced by 25 decibels, the decibel value of the music after reduction is 35 decibels。The reduction amplitude of volume value can be turned down according to the difference between the sound decibel value of second speech data and the sound decibel value of the first speech data, such as, terminal unit is playing music, and have determined that the decibel value of this music (i.e. second speech data) is 60 decibels, and the sound decibel value of the first speech data of user's input is 40 decibels, then the decibel value of music can be reduced to less than 40 decibels, so that the sound decibel value of the first speech data is higher than the decibel value of music, thus increasing the accuracy rate of the identification to the first speech data, improve the confidence level of the first speech data。
Step S23, exports information;This information is for pointing out user the volume value improving the first speech data。
Terminal unit can export this information by the mode of voice output, for instance, terminal unit voice output " your sound is too small, please speak up "。
In this embodiment, the volume value of this speech data can be turned down when the current positive output speech data of terminal unit, and point out user to reduce volume when terminal unit does not currently export second speech data, so that the confidence level of the speech data of user's input can reach to pre-set confidence threshold, improve user and utilize voice to wake the accuracy of terminal unit and the Experience Degree of user up。
In one embodiment, when performing step S13, no matter terminal unit currently whether positive output speech data, all can directly export information, to point out user to improve the volume value of the first speech data。
In one embodiment, step S13 also can be embodied as following steps: reduces and pre-sets confidence threshold。
In this embodiment, confidence threshold is pre-seted by reducing, the confidence level making the speech data that user inputs more easily reachs and pre-sets confidence threshold, when positive output second speech data current particularly in terminal unit, second speech data makes the first speech data that user inputs be interfered, it is not easy to be identified successfully, therefore reducing and pre-set confidence threshold and can make terminal unit that the success rate of the first speech data identification is increased, utilizing voice to wake the accuracy of terminal unit and the Experience Degree of user up thus improve user。
Fig. 3 is the block diagram of a kind of voice Rouser in the embodiment of the present invention。As it is shown on figure 3, this device includes:
Matching module 31, is used for, when terminal unit receives when comprising default the first speech data waking word up of user's input, the first speech data and preset language model being mated, it is thus achieved that the confidence level of the first speech data;
Judge module 32, is used for judging that whether confidence level is less than pre-seting confidence threshold;
Perform module 33, for when confidence level is less than, when pre-seting confidence threshold, performing predetermined registration operation;
Wake module 34, for when confidence level is more than or equal to, when pre-seting confidence threshold, waking the voice control function of terminal unit up。
In one embodiment, as shown in Figure 4, said apparatus also includes:
Output module 35, after being used for performing predetermined registration operation, exports the first information, and the first information is used for pointing out user again to input the first speech data, until the confidence level of the first speech data received is more than or equal to pre-seting confidence threshold。
In one embodiment, as it is shown in figure 5, perform module 33 and include:
Judge submodule 331, be used for judging terminal unit currently whether positive output second speech data;
Turn down submodule 332, for when the current positive output second speech data of terminal unit, turning down the volume value of second speech data。
In one embodiment, as shown in Figure 6, perform module 33 to include:
Output sub-module 333, is used for exporting the second information, and the second information is for pointing out user the volume value improving the first speech data。
In one embodiment, as it is shown in fig. 7, perform module 33 and include:
Reduce submodule 334, pre-set confidence threshold for reduction。
Some beneficial effects of the embodiment of the present invention may include that
Said apparatus, it is determined by comprising the confidence level presetting the first speech data waking word up, and perform predetermined registration operation at this confidence level less than when pre-seting confidence threshold, simultaneously at this confidence level more than or equal to the voice control function waking terminal unit when pre-seting confidence threshold up, when making user utilize voice to wake terminal unit failure up, terminal unit can improve the confidence level of the first speech data by performing predetermined registration operation, utilizes voice to wake the accuracy of terminal unit and the Experience Degree of user up thus improving user。
Those skilled in the art are it should be appreciated that embodiments of the invention can be provided as method, system or computer program。Therefore, the present invention can adopt the form of complete hardware embodiment, complete software implementation or the embodiment in conjunction with software and hardware aspect。And, the present invention can adopt the form at one or more upper computer programs implemented of computer-usable storage medium (including but not limited to disk memory and optical memory etc.) wherein including computer usable program code。
The present invention is that flow chart and/or block diagram with reference to method according to embodiments of the present invention, equipment (system) and computer program describe。It should be understood that can by the combination of the flow process in each flow process in computer program instructions flowchart and/or block diagram and/or square frame and flow chart and/or block diagram and/or square frame。These computer program instructions can be provided to produce a machine to the processor of general purpose computer, special-purpose computer, Embedded Processor or other programmable data processing device so that the instruction performed by the processor of computer or other programmable data processing device is produced for realizing the device of function specified in one flow process of flow chart or multiple flow process and/or one square frame of block diagram or multiple square frame。
These computer program instructions may be alternatively stored in and can guide in the computer-readable memory that computer or other programmable data processing device work in a specific way, the instruction making to be stored in this computer-readable memory produces to include the manufacture of command device, and this command device realizes the function specified in one flow process of flow chart or multiple flow process and/or one square frame of block diagram or multiple square frame。
These computer program instructions also can be loaded in computer or other programmable data processing device, make on computer or other programmable devices, to perform sequence of operations step to produce computer implemented process, thus the instruction performed on computer or other programmable devices provides for realizing the step of function specified in one flow process of flow chart or multiple flow process and/or one square frame of block diagram or multiple square frame。
Obviously, the present invention can be carried out various change and modification without deviating from the spirit and scope of the present invention by those skilled in the art。So, if these amendments of the present invention and modification belong within the scope of the claims in the present invention and equivalent technologies thereof, then the present invention is also intended to comprise these change and modification。

Claims (10)

1. a voice awakening method, it is characterised in that including:
When terminal unit receives when comprising default the first speech data waking word up of user's input, described first speech data and preset language model are mated, it is thus achieved that the confidence level of described first speech data;
Judge that whether described confidence level is less than pre-seting confidence threshold;
When described confidence level less than described pre-set confidence threshold time, perform predetermined registration operation;
When described confidence level more than or equal to described pre-set confidence threshold time, wake the voice control function of described terminal unit up。
2. method according to claim 1, it is characterised in that after described execution predetermined registration operation, described method also includes:
Exporting the first information, described first information is used for pointing out described user again to input described first speech data, until the confidence level of described first speech data received pre-sets confidence threshold more than or equal to described。
3. method according to claim 1, it is characterised in that described execution predetermined registration operation, including:
Judge described terminal unit currently whether positive output second speech data;
When second speech data described in the current positive output of described terminal unit, turn down the volume value of described second speech data。
4. the method according to claim 1 or 3, it is characterised in that described execution predetermined registration operation, including:
Exporting the second information, described second information is for pointing out described user the volume value improving described first speech data。
5. method according to claim 1, it is characterised in that described execution predetermined registration operation, including:
Confidence threshold is pre-seted described in reduction。
6. a voice Rouser, it is characterised in that including:
Matching module, is used for, when terminal unit receives when comprising default the first speech data waking word up of user's input, described first speech data and preset language model being mated, it is thus achieved that the confidence level of described first speech data;
Judge module, is used for judging that whether described confidence level is less than pre-seting confidence threshold;
Perform module, for when described confidence level less than described pre-set confidence threshold time, perform predetermined registration operation;
Wake module, for when described confidence level more than or equal to described pre-set confidence threshold time, wake the voice control function of described terminal unit up。
7. device according to claim 6, it is characterised in that described device also includes:
Output module, after described execution predetermined registration operation, exporting the first information, described first information is used for pointing out described user again to input described first speech data, until the confidence level of described first speech data received pre-sets confidence threshold more than or equal to described。
8. device according to claim 6, it is characterised in that described execution module includes:
Judge submodule, be used for judging described terminal unit currently whether positive output second speech data;
Turn down submodule, for when second speech data described in the current positive output of described terminal unit, turning down the volume value of described second speech data。
9. the device according to claim 6 or 8, it is characterised in that described execution module includes:
Output sub-module, is used for exporting the second information, and described second information is for pointing out described user the volume value improving described first speech data。
10. device according to claim 6, it is characterised in that described execution module includes:
Reduce submodule, described in being used for reducing, pre-set confidence threshold。
CN201610009102.9A 2016-01-07 2016-01-07 Voice awakening method and device Pending CN105702253A (en)

Priority Applications (1)

Application Number Priority Date Filing Date Title
CN201610009102.9A CN105702253A (en) 2016-01-07 2016-01-07 Voice awakening method and device

Applications Claiming Priority (1)

Application Number Priority Date Filing Date Title
CN201610009102.9A CN105702253A (en) 2016-01-07 2016-01-07 Voice awakening method and device

Publications (1)

Publication Number Publication Date
CN105702253A true CN105702253A (en) 2016-06-22

Family

ID=56226088

Family Applications (1)

Application Number Title Priority Date Filing Date
CN201610009102.9A Pending CN105702253A (en) 2016-01-07 2016-01-07 Voice awakening method and device

Country Status (1)

Country Link
CN (1) CN105702253A (en)

Cited By (19)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN106782536A (en) * 2016-12-26 2017-05-31 北京云知声信息技术有限公司 A kind of voice awakening method and device
CN106910496A (en) * 2017-02-28 2017-06-30 广东美的制冷设备有限公司 Intelligent electrical appliance control and device
CN107704275A (en) * 2017-09-04 2018-02-16 百度在线网络技术(北京)有限公司 Smart machine awakening method, device, server and smart machine
CN107742516A (en) * 2017-09-29 2018-02-27 上海与德通讯技术有限公司 Intelligent identification Method, robot and computer-readable recording medium
CN108064007A (en) * 2017-11-07 2018-05-22 苏宁云商集团股份有限公司 Know method for distinguishing and microcontroller and intelligent sound box for the enhancing voice of intelligent sound box
CN108320733A (en) * 2017-12-18 2018-07-24 上海科大讯飞信息科技有限公司 Voice data processing method and device, storage medium, electronic equipment
CN108377414A (en) * 2018-02-08 2018-08-07 海尔优家智能科技(北京)有限公司 A kind of method, apparatus, storage medium and electronic equipment adjusting volume
CN108615526A (en) * 2018-05-08 2018-10-02 腾讯科技(深圳)有限公司 The detection method of keyword, device, terminal and storage medium in voice signal
CN108833688A (en) * 2018-05-30 2018-11-16 Oppo广东移动通信有限公司 Position reminding method, apparatus, storage medium and electronic equipment
CN109661856A (en) * 2016-08-25 2019-04-19 昕诺飞控股有限公司 Light control
CN109672775A (en) * 2017-10-16 2019-04-23 腾讯科技(北京)有限公司 Adjust the method, apparatus and terminal of wakeup sensitivity
CN109841221A (en) * 2018-12-14 2019-06-04 深圳壹账通智能科技有限公司 Parameter adjusting method, device and body-building equipment based on speech recognition
CN110148405A (en) * 2019-04-10 2019-08-20 北京梧桐车联科技有限责任公司 Phonetic order processing method and processing device, electronic equipment and storage medium
CN111081251A (en) * 2019-11-27 2020-04-28 云知声智能科技股份有限公司 Voice wake-up method and device
CN111124512A (en) * 2019-12-10 2020-05-08 珠海格力电器股份有限公司 Awakening method, device, equipment and medium for intelligent equipment
CN111630413A (en) * 2018-06-05 2020-09-04 谷歌有限责任公司 Application-specific user interaction based on confidence
CN111816178A (en) * 2020-07-07 2020-10-23 云知声智能科技股份有限公司 Voice equipment control method, device and equipment
CN113228170A (en) * 2019-12-05 2021-08-06 海信视像科技股份有限公司 Information processing apparatus and nonvolatile storage medium
CN113539257A (en) * 2021-06-15 2021-10-22 复旦大学附属肿瘤医院 Voice awakening method and device

Citations (10)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
JPH08234787A (en) * 1995-03-01 1996-09-13 Hitachi Zosen Corp Voice recognition device provided with restarting function
WO2000070440A1 (en) * 1999-05-17 2000-11-23 Microsoft Corporation Automatic speech recognition system signalling and controlling
US20060074651A1 (en) * 2004-09-22 2006-04-06 General Motors Corporation Adaptive confidence thresholds in telematics system speech recognition
CN102915753A (en) * 2012-10-23 2013-02-06 华为终端有限公司 Method for intelligently controlling volume of electronic device and implementation device of method
CN102999161A (en) * 2012-11-13 2013-03-27 安徽科大讯飞信息科技股份有限公司 Implementation method and application of voice awakening module
CN103139351A (en) * 2011-11-24 2013-06-05 联想(北京)有限公司 Volume control method and device, and communication terminal
CN103578468A (en) * 2012-08-01 2014-02-12 联想(北京)有限公司 Method for adjusting confidence coefficient threshold of voice recognition and electronic device
CN103916511A (en) * 2013-01-08 2014-07-09 联想(北京)有限公司 Information processing method and electronic equipment
CN104424073A (en) * 2013-08-21 2015-03-18 联想(北京)有限公司 Information processing method and electronic equipment
US20150154953A1 (en) * 2013-12-02 2015-06-04 Spansion Llc Generation of wake-up words

Patent Citations (10)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
JPH08234787A (en) * 1995-03-01 1996-09-13 Hitachi Zosen Corp Voice recognition device provided with restarting function
WO2000070440A1 (en) * 1999-05-17 2000-11-23 Microsoft Corporation Automatic speech recognition system signalling and controlling
US20060074651A1 (en) * 2004-09-22 2006-04-06 General Motors Corporation Adaptive confidence thresholds in telematics system speech recognition
CN103139351A (en) * 2011-11-24 2013-06-05 联想(北京)有限公司 Volume control method and device, and communication terminal
CN103578468A (en) * 2012-08-01 2014-02-12 联想(北京)有限公司 Method for adjusting confidence coefficient threshold of voice recognition and electronic device
CN102915753A (en) * 2012-10-23 2013-02-06 华为终端有限公司 Method for intelligently controlling volume of electronic device and implementation device of method
CN102999161A (en) * 2012-11-13 2013-03-27 安徽科大讯飞信息科技股份有限公司 Implementation method and application of voice awakening module
CN103916511A (en) * 2013-01-08 2014-07-09 联想(北京)有限公司 Information processing method and electronic equipment
CN104424073A (en) * 2013-08-21 2015-03-18 联想(北京)有限公司 Information processing method and electronic equipment
US20150154953A1 (en) * 2013-12-02 2015-06-04 Spansion Llc Generation of wake-up words

Cited By (27)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN109661856A (en) * 2016-08-25 2019-04-19 昕诺飞控股有限公司 Light control
CN106782536A (en) * 2016-12-26 2017-05-31 北京云知声信息技术有限公司 A kind of voice awakening method and device
CN106910496A (en) * 2017-02-28 2017-06-30 广东美的制冷设备有限公司 Intelligent electrical appliance control and device
WO2018157542A1 (en) * 2017-02-28 2018-09-07 广东美的制冷设备有限公司 Smart home appliance control method and device
CN107704275A (en) * 2017-09-04 2018-02-16 百度在线网络技术(北京)有限公司 Smart machine awakening method, device, server and smart machine
CN107742516A (en) * 2017-09-29 2018-02-27 上海与德通讯技术有限公司 Intelligent identification Method, robot and computer-readable recording medium
CN107742516B (en) * 2017-09-29 2020-11-17 上海望潮数据科技有限公司 Intelligent recognition method, robot and computer readable storage medium
CN109672775B (en) * 2017-10-16 2021-10-29 腾讯科技(北京)有限公司 Method, device and terminal for adjusting awakening sensitivity
CN109672775A (en) * 2017-10-16 2019-04-23 腾讯科技(北京)有限公司 Adjust the method, apparatus and terminal of wakeup sensitivity
CN108064007A (en) * 2017-11-07 2018-05-22 苏宁云商集团股份有限公司 Know method for distinguishing and microcontroller and intelligent sound box for the enhancing voice of intelligent sound box
CN108320733A (en) * 2017-12-18 2018-07-24 上海科大讯飞信息科技有限公司 Voice data processing method and device, storage medium, electronic equipment
CN108377414A (en) * 2018-02-08 2018-08-07 海尔优家智能科技(北京)有限公司 A kind of method, apparatus, storage medium and electronic equipment adjusting volume
CN108615526A (en) * 2018-05-08 2018-10-02 腾讯科技(深圳)有限公司 The detection method of keyword, device, terminal and storage medium in voice signal
US11341957B2 (en) 2018-05-08 2022-05-24 Tencent Technology (Shenzhen) Company Limited Method for detecting keyword in speech signal, terminal, and storage medium
CN108833688A (en) * 2018-05-30 2018-11-16 Oppo广东移动通信有限公司 Position reminding method, apparatus, storage medium and electronic equipment
CN108833688B (en) * 2018-05-30 2020-03-10 Oppo广东移动通信有限公司 Position reminding method and device, storage medium and electronic equipment
CN111630413B (en) * 2018-06-05 2024-04-16 谷歌有限责任公司 Confidence-based application-specific user interaction
CN111630413A (en) * 2018-06-05 2020-09-04 谷歌有限责任公司 Application-specific user interaction based on confidence
CN109841221A (en) * 2018-12-14 2019-06-04 深圳壹账通智能科技有限公司 Parameter adjusting method, device and body-building equipment based on speech recognition
CN110148405B (en) * 2019-04-10 2021-07-13 北京梧桐车联科技有限责任公司 Voice instruction processing method and device, electronic equipment and storage medium
CN110148405A (en) * 2019-04-10 2019-08-20 北京梧桐车联科技有限责任公司 Phonetic order processing method and processing device, electronic equipment and storage medium
CN111081251B (en) * 2019-11-27 2022-03-04 云知声智能科技股份有限公司 Voice wake-up method and device
CN111081251A (en) * 2019-11-27 2020-04-28 云知声智能科技股份有限公司 Voice wake-up method and device
CN113228170A (en) * 2019-12-05 2021-08-06 海信视像科技股份有限公司 Information processing apparatus and nonvolatile storage medium
CN111124512A (en) * 2019-12-10 2020-05-08 珠海格力电器股份有限公司 Awakening method, device, equipment and medium for intelligent equipment
CN111816178A (en) * 2020-07-07 2020-10-23 云知声智能科技股份有限公司 Voice equipment control method, device and equipment
CN113539257A (en) * 2021-06-15 2021-10-22 复旦大学附属肿瘤医院 Voice awakening method and device

Similar Documents

Publication Publication Date Title
CN105702253A (en) Voice awakening method and device
CN105654949B (en) A kind of voice awakening method and device
CN106782536B (en) Voice awakening method and device
CN108831469B (en) Voice command customizing method, device and equipment and computer storage medium
US9583102B2 (en) Method of controlling interactive system, method of controlling server, server, and interactive device
CN112106381B (en) User experience assessment method, device and equipment
CN108694940B (en) Voice recognition method and device and electronic equipment
CN107644638B (en) Audio recognition method, device, terminal and computer readable storage medium
US9466286B1 (en) Transitioning an electronic device between device states
US20140365225A1 (en) Ultra-low-power adaptive, user independent, voice triggering schemes
CN109979474B (en) Voice equipment and user speech rate correction method and device thereof and storage medium
CN102842306A (en) Voice control method and device as well as voice response method and device
US20190333514A1 (en) Method and apparatus for dialoguing based on a mood of a user
CN110751948A (en) Voice recognition method, device, storage medium and voice equipment
CN109360551B (en) Voice recognition method and device
CN111178081B (en) Semantic recognition method, server, electronic device and computer storage medium
CN110570855A (en) system, method and device for controlling intelligent household equipment through conversation mechanism
CN108932947B (en) Voice control method and household appliance
CN105825848A (en) Method, device and terminal for voice recognition
US20220399013A1 (en) Response method, terminal, and storage medium
CN110262278B (en) Control method and device of intelligent household electrical appliance and intelligent household electrical appliance
CN112735407A (en) Conversation processing method and device
CN112767916A (en) Voice interaction method, device, equipment, medium and product of intelligent voice equipment
CN111933135A (en) Terminal control method and device, intelligent terminal and computer readable storage medium
CN103941868A (en) Voice-control accuracy rate adjusting method and system

Legal Events

Date Code Title Description
C06 Publication
PB01 Publication
C10 Entry into substantive examination
SE01 Entry into force of request for substantive examination
RJ01 Rejection of invention patent application after publication

Application publication date: 20160622

RJ01 Rejection of invention patent application after publication