WO2017092189A1 - 语音唤醒实现方法、装置及终端、计算机存储介质 - Google Patents
语音唤醒实现方法、装置及终端、计算机存储介质 Download PDFInfo
- Publication number
- WO2017092189A1 WO2017092189A1 PCT/CN2016/075627 CN2016075627W WO2017092189A1 WO 2017092189 A1 WO2017092189 A1 WO 2017092189A1 CN 2016075627 W CN2016075627 W CN 2016075627W WO 2017092189 A1 WO2017092189 A1 WO 2017092189A1
- Authority
- WO
- WIPO (PCT)
- Prior art keywords
- wake
- recording
- training
- voice
- voiceprint
- Prior art date
Links
Images
Classifications
-
- G—PHYSICS
- G06—COMPUTING; CALCULATING OR COUNTING
- G06F—ELECTRIC DIGITAL DATA PROCESSING
- G06F21/00—Security arrangements for protecting computers, components thereof, programs or data against unauthorised activity
- G06F21/30—Authentication, i.e. establishing the identity or authorisation of security principals
- G06F21/31—User authentication
- G06F21/32—User authentication using biometric data, e.g. fingerprints, iris scans or voiceprints
-
- G—PHYSICS
- G10—MUSICAL INSTRUMENTS; ACOUSTICS
- G10L—SPEECH ANALYSIS OR SYNTHESIS; SPEECH RECOGNITION; SPEECH OR VOICE PROCESSING; SPEECH OR AUDIO CODING OR DECODING
- G10L17/00—Speaker identification or verification
- G10L17/22—Interactive procedures; Man-machine interfaces
- G10L17/24—Interactive procedures; Man-machine interfaces the user being prompted to utter a password or a predefined phrase
-
- G—PHYSICS
- G06—COMPUTING; CALCULATING OR COUNTING
- G06F—ELECTRIC DIGITAL DATA PROCESSING
- G06F3/00—Input arrangements for transferring data to be processed into a form capable of being handled by the computer; Output arrangements for transferring data from processing unit to output unit, e.g. interface arrangements
- G06F3/16—Sound input; Sound output
-
- G—PHYSICS
- G10—MUSICAL INSTRUMENTS; ACOUSTICS
- G10L—SPEECH ANALYSIS OR SYNTHESIS; SPEECH RECOGNITION; SPEECH OR VOICE PROCESSING; SPEECH OR AUDIO CODING OR DECODING
- G10L15/00—Speech recognition
- G10L15/22—Procedures used during a speech recognition process, e.g. man-machine dialogue
-
- G—PHYSICS
- G10—MUSICAL INSTRUMENTS; ACOUSTICS
- G10L—SPEECH ANALYSIS OR SYNTHESIS; SPEECH RECOGNITION; SPEECH OR VOICE PROCESSING; SPEECH OR AUDIO CODING OR DECODING
- G10L17/00—Speaker identification or verification
- G10L17/04—Training, enrolment or model building
-
- G—PHYSICS
- G10—MUSICAL INSTRUMENTS; ACOUSTICS
- G10L—SPEECH ANALYSIS OR SYNTHESIS; SPEECH RECOGNITION; SPEECH OR VOICE PROCESSING; SPEECH OR AUDIO CODING OR DECODING
- G10L17/00—Speaker identification or verification
- G10L17/06—Decision making techniques; Pattern matching strategies
- G10L17/14—Use of phonemic categorisation or speech recognition prior to speaker recognition or verification
-
- G—PHYSICS
- G10—MUSICAL INSTRUMENTS; ACOUSTICS
- G10L—SPEECH ANALYSIS OR SYNTHESIS; SPEECH RECOGNITION; SPEECH OR VOICE PROCESSING; SPEECH OR AUDIO CODING OR DECODING
- G10L17/00—Speaker identification or verification
- G10L17/22—Interactive procedures; Man-machine interfaces
-
- H—ELECTRICITY
- H04—ELECTRIC COMMUNICATION TECHNIQUE
- H04M—TELEPHONIC COMMUNICATION
- H04M1/00—Substation equipment, e.g. for use by subscribers
- H04M1/72—Mobile telephones; Cordless telephones, i.e. devices for establishing wireless links to base stations without route selection
- H04M1/725—Cordless telephones
-
- G—PHYSICS
- G10—MUSICAL INSTRUMENTS; ACOUSTICS
- G10L—SPEECH ANALYSIS OR SYNTHESIS; SPEECH RECOGNITION; SPEECH OR VOICE PROCESSING; SPEECH OR AUDIO CODING OR DECODING
- G10L15/00—Speech recognition
- G10L15/08—Speech classification or search
- G10L2015/088—Word spotting
-
- G—PHYSICS
- G10—MUSICAL INSTRUMENTS; ACOUSTICS
- G10L—SPEECH ANALYSIS OR SYNTHESIS; SPEECH RECOGNITION; SPEECH OR VOICE PROCESSING; SPEECH OR AUDIO CODING OR DECODING
- G10L15/00—Speech recognition
- G10L15/22—Procedures used during a speech recognition process, e.g. man-machine dialogue
- G10L2015/223—Execution procedure of a spoken command
Definitions
- the present invention relates to the field of intelligent terminals, and in particular to a method and device for implementing voice wake-up, a terminal, and a computer storage medium.
- voice wake-up and voiceprint encryption belong to two separate different hardware modules.
- the successfully recorded wake-up words are directly set to the voice chip; in the voiceprint training process, the successfully recorded wake-up words are stored on the application processor (AP, Application Processor) side, and the two are recorded. They are carried out separately and there is no connection between them.
- AP Application Processor
- wake-up training and voiceprint training are two separate processes. If voice wake-up and voiceprint unlocking are set to different voice commands, the user needs to remember two wake-up words, which are easy to be confused or forgotten; if the voice wakes up The voice recording is unlocked and recorded in the same voice command, and the number of repeated recordings is more, which brings a bad user experience; on the other hand, since the smart terminal needs to be separately used for voice wake-up and voiceprint unlocking, The wake-up words of the voice wake-up do not have a special voiceprint verification, and there is also a certain probability of false wake-up.
- the technical problem to be solved by the embodiments of the present invention is to provide a method and device for implementing voice wake-up, a terminal, and a computer storage medium, which can simplify the process of the user waking up and manipulating the smart terminal.
- the technical solution provided by the embodiment of the present invention is as follows:
- an embodiment of the present invention provides a voice wake-up implementation method, which is applied to an intelligent terminal, where the method includes:
- the method before receiving the voice wakeup command input by the user, the method further includes:
- the method before the completion of the unified training recording of the voice wake-up words including the voiceprint information, the method further includes:
- the unified training recording of the voice wake-up words including the voiceprint information is completed.
- the completion of the unified training recording of the voice wake-up words including voiceprint information includes:
- the wake-up word training recording and the voiceprint training recording of the voice wake-up words are performed, and the recording results are judged respectively;
- the completion of the unified training recording of the voice wake-up words including voiceprint information includes:
- the wake-up word training recording and the voiceprint training recording of the voice wake-up words are performed, and the recording results are judged respectively;
- the wake-up word training recording data is saved, and the wake-up word training recording is stopped, and the voiceprint training recording is continued.
- the voiceprint training recording data is saved, and the unity training recording is completed, where m, n are integers greater than 1.
- the completion of the unified training recording of the voice wake-up words including voiceprint information includes:
- the wake-up word training recording and the voiceprint training recording of the voice wake-up words are performed, and the recording results are judged respectively;
- the completion of the unified training recording of the voice wake-up words including voiceprint information includes:
- the wake-up word training recording and the voiceprint training recording of the voice wake-up words are performed, and the recording results are judged respectively;
- the embodiment of the invention further provides a voice wake-up implementation device, including:
- a receiving module configured to receive a voice wake-up command input by a user
- the determining module is configured to perform a wake-up word recognition judgment on the voice wake-up command by using a preset voice wake-up word to obtain a first judgment result, where the voice wake-up word includes voiceprint information, and the voice wake-up word is used to wake up the voice
- the command performs voiceprint judgment to obtain a second judgment result
- the processing module is configured to: when the first determination result and the second determination result both meet the preset condition, unlocking and waking up the smart terminal.
- the device further includes:
- a recording training module configured to complete a unified training recording of the voice wake-up words including voiceprint information
- a voice chip configured to store the voice wake-up word.
- the device further includes:
- the recording processing module is configured to perform noise detection on the environment of the unified training recording before performing the unified training recording process
- the recording training module is specifically configured to complete a unified training recording of the voice wake-up words including voiceprint information when the volume of the noise is lower than a preset decibel.
- the recording training module includes:
- a concurrent recording sub-module configured to control the left channel of the smart terminal to store the wake-up word training recording data during the unified training recording, and the right channel of the smart terminal stores the voiceprint training recording data;
- the right channel of the smart terminal stores the wake-up word training recording data, and the left channel of the smart terminal stores the voiceprint training recording data.
- the receiving module, the determining module, the processing module, the recording training module, the recording processing module, and the concurrent recording sub-module may use a central processing unit (CPU) when performing processing. , digital signal processor (DSP, Digital Singnal Processor) or programmable logic array (FPGA, Field-Programmable Gate Array) implementation.
- CPU central processing unit
- DSP Digital Singnal Processor
- FPGA Field-Programmable Gate Array
- the embodiment of the invention further provides an intelligent terminal, comprising the voice wake-up implementation device as described above.
- the embodiment of the invention further provides a computer storage medium, wherein a computer program is stored, which is used to execute the above-mentioned voice wake-up implementation method of the embodiment of the invention.
- the method for implementing the voice wake-up includes: receiving a voice wake-up command input by a user, and simultaneously using the preset voice wake-up word to perform a wake-up word recognition judgment and a voiceprint judgment on the voice wake-up instruction, when the judgment result meets the preset condition
- the original two-step operation can be simplified into one step operation, and the steps of using the smart terminal after unlocking the voiceprint after waking up are omitted, simplifying user wake-up and manipulation The process of intelligent terminals.
- FIG. 1 is a schematic diagram of a prior art training recording
- FIG. 2 is a schematic diagram of performing a unified training recording according to an embodiment of the present invention
- FIG. 3 is a schematic diagram of a wake-up and voiceprint unlocking of an intelligent terminal in the prior art
- FIG. 4 is a schematic diagram of awakening and voiceprint unlocking of an intelligent terminal according to an embodiment of the present invention.
- FIG. 5 is a schematic structural diagram of a device for implementing voice wakeup according to an embodiment of the present invention.
- FIG. 6 is a schematic diagram of performing a unified training recording according to Embodiment 4 of the present invention.
- FIG. 7 is a schematic diagram of performing a unified training recording according to Embodiment 4 of the present invention.
- FIG. 8 is a schematic diagram of performing a unified training recording according to Embodiment 5 of the present invention.
- the embodiment of the present invention is directed to the problem that the voice wake-up and the voiceprint unlocking are separately performed in the prior art, resulting in a cumbersome operation of the user, and a voice wake-up implementation method, apparatus, and terminal are provided, which can simplify the process of the user waking up and manipulating the smart terminal.
- the embodiment provides a voice wake-up implementation method, which is applied to an intelligent terminal, and the method includes:
- the voice wake-up command input by the user is received, and the wake-up speech recognition command and the voiceprint judgment are simultaneously performed on the voice wake-up command by using the preset voice wake-up word, and the judgment result is consistent.
- the smart terminal is unlocked and awake, and the technical solution of the present invention can simplify the original two-step operation into one step operation, omitting the steps of using the smart terminal after the wake-up sound file is unlocked, and simplifying the user wake-up. And the process of manipulating the smart terminal.
- the method before receiving the voice wakeup command input by the user, the method further includes:
- the method before the completion of the unified training recording of the voice wake-up words including the voiceprint information, the method further includes:
- the unified training recording of the voice wake-up words including the voiceprint information is completed.
- the completion of the unified training recording of the voice wake-up words including voiceprint information includes:
- the wake-up word training recording and the voiceprint training recording of the voice wake-up words are performed, and the recording results are judged respectively;
- the completion of the unified training recording of the voice wake-up words including voiceprint information includes:
- the wake-up word training recording and the voiceprint training recording of the voice wake-up words are performed, and the recording results are judged respectively;
- the completion of the unified training recording of the voice wake-up words including voiceprint information includes:
- the wake-up word training recording and the voiceprint training recording of the voice wake-up words are performed, and the recording results are judged respectively;
- the completion of the unified training recording of the voice wake-up words including voiceprint information includes:
- the wake-up word training recording and the voiceprint training recording of the voice wake-up words are performed, and the recording results are judged respectively;
- the wake-up word training recording and the voiceprint training recording are separately performed, as shown in FIG. 3, when the smart terminal is operated, the wake-up and the voiceprint unlocking are also performed separately.
- FIG. 2 in the embodiment of the present invention, when the training recording is performed, the wake-up word training recording and the voiceprint training recording are performed simultaneously, as shown in FIG. 4, when the smart terminal is operated, the wake-up and the voiceprint unlocking are also simultaneously ongoing.
- the embodiment provides a voice wake-up implementation device. As shown in FIG. 5, the embodiment includes:
- a receiving module configured to receive a voice wake-up command input by a user
- the determining module is configured to perform a wake-up word recognition judgment on the voice wake-up command by using a preset voice wake-up word to obtain a first judgment result, where the voice wake-up word includes voiceprint information, and the voice wake-up word is used to wake up the voice
- the command performs voiceprint judgment to obtain a second judgment result
- the processing module is configured to: when the first determination result and the second determination result both meet the preset condition, unlocking and waking up the smart terminal.
- the apparatus further includes:
- a recording training module configured to complete a unified training recording of the voice wake-up words including voiceprint information
- a voice chip configured to store the voice wake-up word.
- the present invention can simplify the original two-step operation into one step operation, omitting the steps of using the smart terminal after unlocking the voiceprint after waking up, simplifying the process of the user waking up and manipulating the smart terminal.
- the apparatus further includes:
- the recording processing module is configured to perform noise detection on the environment of the unified training recording before performing the unified training recording process
- the recording training module is specifically configured to complete a unified training recording of the voice wake-up words including voiceprint information when the volume of the noise is lower than a preset decibel.
- the recording training module can control the recording result of each time and judge whether the recording is successful or not and whether to enter the next recording.
- the recording processing module performs environmental noise judgment before performing the unified training recording, and appropriately enhances the signal-to-noise ratio (SNR) judgment during the unified training recording process to improve the data quality of the recording training module, thereby improving the recognition success. rate.
- SNR signal-to-noise ratio
- the recording training module includes:
- a concurrent recording sub-module configured to control the left channel of the smart terminal to store the wake-up word training recording data during the unified training recording, and the right channel of the smart terminal stores the voiceprint training recording data;
- the right channel of the smart terminal stores the wake-up word training recording data, and the left channel of the smart terminal stores the voiceprint training recording data.
- This embodiment provides an intelligent terminal, including the voice wakeup implementation device as described above.
- the intelligent terminal of the embodiment receives the voice wake-up instruction input by the user, and uses the preset voice wake-up word to simultaneously perform the wake-up word recognition judgment and the voice pattern judgment on the voice wake-up instruction, when determining the knot If the smart terminal is unlocked and awake, the technical solution of the present invention can simplify the original two-step operation into one step operation, and omits the step of using the smart terminal after unlocking the voiceprint after waking up. Simplify the process of user wake-up and manipulation of the smart terminal.
- the voice wake-up implementation method of the smart terminal the specific steps are:
- the first step the user performs a unified training recording of the voice wake-up words with voiceprint information before using the smart terminal;
- Step 2 Set the security lock screen of the smart terminal
- the third step the smart terminal is in a state where the black screen or standby can work normally;
- the fourth part the user speaks the wake-up word, performs the wake-up word recognition judgment and the voice pattern judgment, and if both meet the conditions, directly responds to the user to perform voice control on the smart terminal; otherwise, the prompt error.
- Step 1 Before the united training recording, first check the environmental noise. If the current environment meets the recording conditions, continue to perform the unity training recording, otherwise it will prompt to record in a quiet environment. The criteria for conditional judgment are determined based on empirical values obtained from tests under different environmental conditions;
- Step 2 In the process of unity training recording, it is assumed that the number of successes of the wake-up word training recording should reach m, and the number of successes of voiceprint training recording should reach n.
- the principle of unity training recording is that the wake-up word or the voiceprint is successfully recorded first. The first one is to exit the united training recording, and the unsuccessful party independently performs the unity training recording.
- the basic process of the unified training recording is as follows:
- Step 3 Switch to the corresponding unified route every time the training recording is combined, and use the left and right channels to store the wake-up word training recording data and the voiceprint training recording data respectively.
- the training recording When the training recording is performed independently, it switches to the independent recording route, and uses the left channel to store the current recording data;
- Step 4 After the training is successfully recorded, the wake-up training recording data is stored in the voice chip, and the voiceprint training recording data is stored on the AP side, and then the smart terminal enters the standby working state;
- Step 5 The user speaks the wake-up word, performs the wake-up word recognition judgment and the voiceprint judgment, and if both meet the conditions, directly responds to the user to perform voice control on the smart terminal; otherwise, the prompt error.
- the user sets a security lock screen mode.
- the wake-up words are spoken, and the smart terminal performs the wake-up word recognition judgment and the voiceprint unlocking judgment. If both of them meet the conditions, the user directly responds to the voice control, which simplifies the user's wake-up and manipulation of the smart terminal. the way.
- Step 1 Before the united training recording, first check the environmental noise. If the current environment meets the recording conditions, continue to perform the unity training recording, otherwise it will prompt to record in a quiet environment. The criteria for conditional judgment are determined based on empirical values obtained from tests under different environmental conditions;
- Step 2 In the process of unity training recording, it is assumed that the number of successes of the wake-up word training recording should reach m, and the number of successes of voiceprint training recording should reach n.
- the principle of unity training recording is that the wake-up word or the voiceprint is successfully recorded first. The first one is to exit the united training recording, and the unsuccessful party independently performs the unity training recording.
- the basic process of the unified training recording is as follows:
- the wake-up word training recording and the voiceprint training recording of the voice wake-up words are performed, and the recording results are judged respectively;
- the wake-up words are processed by the corresponding wake-up voice chip, and the voiceprint is processed by the corresponding voiceprint engine, and there may be timing differences in the process of the required processing, and each time in the present invention
- Table 1 The training of different timing problems between the two trainings is shown in Table 1:
- Step 3 Switch to the corresponding unified route every time the training recording is combined, and use the left and right channels to store the wake-up word training recording data and the voiceprint training recording data respectively.
- the training recording When the training recording is performed independently, it switches to the independent recording route, and uses the left channel to store the current recording data;
- Step 4 After the training is successfully recorded, the wake-up training recording data is stored in the voice chip, and the voiceprint training recording data is stored on the AP side, and then the smart terminal enters the standby working state;
- Step 5 The user speaks the wake-up word, performs the wake-up word recognition judgment and the voiceprint judgment, and if both meet the conditions, directly responds to the user to perform voice control on the smart terminal; otherwise, the prompt error.
- the user sets a security lock screen mode.
- the wake-up words are spoken, and the smart terminal performs the wake-up word recognition judgment and the voiceprint unlocking judgment. If both of them meet the conditions, the user directly responds to the voice control, which simplifies the user's wake-up and manipulation of the smart terminal. the way.
- the embodiment of the invention further provides a computer storage medium, wherein a computer program is stored, which is used to execute the above-mentioned voice wake-up implementation method of the embodiment of the invention.
- the modules may be implemented in software for execution by various types of processors.
- an identified executable code module can comprise one or more physical or logical blocks of computer instructions, which can be constructed, for example, as an object, procedure, or function. Nevertheless, the executable code of the identified modules need not be physically located together, but may include different instructions stored in different physicalities. When these instructions are logically combined, they constitute a module and achieve the specified purpose of the module. .
- the executable code module can be a single instruction or a plurality of instructions, and can even be distributed across multiple different code segments, distributed among different programs, and distributed across multiple memory devices.
- operational data may be identified within the modules and may be implemented in any suitable form and organized within any suitable type of data structure. The operational data may be collected as a single data set, or may be distributed at different locations (including on different storage devices), and may at least partially exist as an electronic signal on a system or network.
- the module can be implemented by software, considering the level of the existing hardware process, the module can be implemented in software, and the technician can construct a corresponding hardware circuit to implement the corresponding function without considering the cost.
- the hardware circuitry includes conventional Very Large Scale Integration (VLSI) circuits or gate arrays as well as existing semiconductors such as logic chips, transistors, or other discrete components.
- VLSI Very Large Scale Integration
- the modules can also be implemented with programmable hardware devices such as field programmable gate arrays, programmable array logic, programmable logic devices, and the like.
- sequence numbers of the steps are not used to limit the sequence of the steps.
- the steps of the steps are changed without any creative work. It is also within the scope of the invention.
- the method for implementing the voice wake-up includes: receiving a voice wake-up command input by a user, and simultaneously using the preset voice wake-up word to perform a wake-up word recognition judgment and a voiceprint judgment on the voice wake-up instruction, when the judgment result meets the preset condition
- the original two-step operation can be simplified into one step operation, and the steps of using the smart terminal after unlocking the voiceprint after waking up are omitted, simplifying user wake-up and manipulation The process of intelligent terminals.
Abstract
Description
Claims (13)
- 一种语音唤醒实现方法,应用于智能终端,所述方法包括:接收用户输入的语音唤醒指令;利用预设的语音唤醒词对所述语音唤醒指令进行唤醒词识别判断得到第一判断结果,所述语音唤醒词包括有声纹信息;利用所述语音唤醒词对所述语音唤醒指令进行声纹判断得到第二判断结果;当所述第一判断结果与所述第二判断结果均符合预设条件时,对所述智能终端进行解锁和唤醒。
- 根据权利要求1所述的语音唤醒实现方法,其中,所述接收用户输入的语音唤醒指令之前还包括:完成包括声纹信息的所述语音唤醒词的合一培训录音,并存储所述语音唤醒词。
- 根据权利要求2所述的语音唤醒实现方法,其中,所述完成包括声纹信息的所述语音唤醒词的合一培训录音之前还包括:对合一培训录音的环境进行噪音检测;所述完成包括声纹信息的所述语音唤醒词的合一培训录音具体为:在所述噪音的音量低于预设分贝时,完成包括声纹信息的所述语音唤醒词的合一培训录音。
- 根据权利要求2所述的语音唤醒实现方法,其中,所述完成包括声纹信息的所述语音唤醒词的合一培训录音包括:同时进行语音唤醒词的唤醒词培训录音和声纹培训录音,分别进行录音结果的判断;若声纹培训录音成功次数达到n,唤醒词培训录音成功次数为0时,重新开始合一培训录音;或若唤醒词培训录音成功次数达到m,声纹培训录音成功次数为0时,重新开始合一培训录音,其中m,n为大于1的整数。
- 根据权利要求2所述的语音唤醒实现方法,其中,所述完成包括声 纹信息的所述语音唤醒词的合一培训录音包括:同时进行语音唤醒词的唤醒词培训录音和声纹培训录音,分别进行录音结果的判断;若唤醒词培训录音成功次数达到m,声纹培训录音成功次数小于n,则保存唤醒词培训录音数据,并停止唤醒词培训录音,继续声纹培训录音,当声纹培训录音成功次数达到n时,保存声纹培训录音数据,完成合一培训录音,其中m,n为大于1的整数。
- 根据权利要求2所述的语音唤醒实现方法,其中,所述完成包括声纹信息的所述语音唤醒词的合一培训录音包括:同时进行语音唤醒词的唤醒词培训录音和声纹培训录音,分别进行录音结果的判断;若声纹培训录音成功次数达到n,唤醒词培训录音成功次数小于m,则保存声纹培训录音数据,并停止声纹培训录音,继续唤醒词培训录音,当唤醒词培训录音成功次数达到m时,保存唤醒词培训录音数据,完成合一培训录音,其中m,n为大于1的整数。
- 根据权利要求2所述的语音唤醒实现方法,其中,所述完成包括声纹信息的所述语音唤醒词的合一培训录音包括:同时进行语音唤醒词的唤醒词培训录音和声纹培训录音,分别进行录音结果的判断;记录唤醒词培训录音和声纹培训录音同时成功的次数,当所述次数达到m时,保存唤醒词培训录音数据和声纹培训录音数据,完成合一培训录音,其中m为大于1的整数。
- 一种语音唤醒实现装置,包括:接收模块,配置为接收用户输入的语音唤醒指令;判断模块,配置为利用预设的语音唤醒词对所述语音唤醒指令进行唤醒词识别判断得到第一判断结果,所述语音唤醒词包括有声纹信息,利用所述语音唤醒词对所述语音唤醒指令进行声纹判断得到第二判断结果;处理模块,配置为当所述第一判断结果与所述第二判断结果均符合预 设条件时,对所述智能终端进行解锁和唤醒。
- 根据权利要求8所述的语音唤醒实现装置,其中,所述装置还包括:录音培训模块,配置为完成包括声纹信息的所述语音唤醒词的合一培训录音;语音芯片,配置为存储所述语音唤醒词。
- 根据权利要求9所述的语音唤醒实现装置,其中,所述装置还包括:录音处理模块,配置为在进行合一培训录音过程前,对合一培训录音的环境进行噪音检测;所述录音培训模块具体配置为在所述噪音的音量低于预设分贝时,完成包括声纹信息的所述语音唤醒词的合一培训录音。
- 根据权利要求9所述的语音唤醒实现装置,其中,所述录音培训模块包括:并发录音子模块,配置为在合一培训录音过程中,控制所述智能终端的左声道存储唤醒词培训录音数据,所述智能终端的右声道存储声纹培训录音数据;或者制所述智能终端的右声道存储唤醒词培训录音数据,所述智能终端的左声道存储声纹培训录音数据。
- 一种智能终端,包括如权利要求8-11中任一项所述的语音唤醒实现装置。
- 一种计算机存储介质,所述计算机存储介质中存储有计算机程序,该计算机程序用于上述权利要求1-7任一项所述的语音唤醒实现方法。
Priority Applications (3)
Application Number | Priority Date | Filing Date | Title |
---|---|---|---|
JP2018527923A JP2019502947A (ja) | 2015-11-30 | 2016-03-04 | 音声ウェイクアップ実現方法、装置及び端末、コンピュータ記憶媒体 |
US15/780,149 US20180350372A1 (en) | 2015-11-30 | 2016-03-04 | Method realizing voice wake-up, device, terminal, and computer storage medium |
EP16869503.9A EP3385947A4 (en) | 2015-11-30 | 2016-03-04 | Method realizing voice wake-up, device, terminal, and computer storage medium |
Applications Claiming Priority (2)
Application Number | Priority Date | Filing Date | Title |
---|---|---|---|
CN201510859545.2A CN106815507A (zh) | 2015-11-30 | 2015-11-30 | 语音唤醒实现方法、装置及终端 |
CN201510859545.2 | 2015-11-30 |
Publications (1)
Publication Number | Publication Date |
---|---|
WO2017092189A1 true WO2017092189A1 (zh) | 2017-06-08 |
Family
ID=58796244
Family Applications (1)
Application Number | Title | Priority Date | Filing Date |
---|---|---|---|
PCT/CN2016/075627 WO2017092189A1 (zh) | 2015-11-30 | 2016-03-04 | 语音唤醒实现方法、装置及终端、计算机存储介质 |
Country Status (5)
Country | Link |
---|---|
US (1) | US20180350372A1 (zh) |
EP (1) | EP3385947A4 (zh) |
JP (1) | JP2019502947A (zh) |
CN (1) | CN106815507A (zh) |
WO (1) | WO2017092189A1 (zh) |
Cited By (4)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
CN109003611A (zh) * | 2018-09-29 | 2018-12-14 | 百度在线网络技术(北京)有限公司 | 用于车辆语音控制的方法、装置、设备和介质 |
CN109584860A (zh) * | 2017-09-27 | 2019-04-05 | 九阳股份有限公司 | 一种语音唤醒词定义方法和系统 |
CN110400568A (zh) * | 2018-04-20 | 2019-11-01 | 比亚迪股份有限公司 | 智能语音系统的唤醒方法、智能语音系统及车辆 |
CN112201239A (zh) * | 2020-09-25 | 2021-01-08 | 海尔优家智能科技(北京)有限公司 | 目标设备的确定方法及装置、存储介质、电子装置 |
Families Citing this family (26)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
CN107705785A (zh) * | 2017-08-01 | 2018-02-16 | 百度在线网络技术(北京)有限公司 | 智能音箱的声源定位方法、智能音箱及计算机可读介质 |
CN107919124B (zh) * | 2017-12-22 | 2021-07-13 | 北京小米移动软件有限公司 | 设备唤醒方法及装置 |
US11152006B2 (en) * | 2018-05-07 | 2021-10-19 | Microsoft Technology Licensing, Llc | Voice identification enrollment |
CN108877790A (zh) * | 2018-05-21 | 2018-11-23 | 江西午诺科技有限公司 | 音箱控制方法、装置、可读存储介质及移动终端 |
CN109166571B (zh) * | 2018-08-06 | 2020-11-24 | 广东美的厨房电器制造有限公司 | 家电设备的唤醒词训练方法、装置及家电设备 |
CN109032554B (zh) * | 2018-06-29 | 2021-11-16 | 联想(北京)有限公司 | 一种音频处理方法和电子设备 |
CN110827824B (zh) * | 2018-08-08 | 2022-05-17 | Oppo广东移动通信有限公司 | 语音处理方法、装置、存储介质及电子设备 |
CN112740321A (zh) * | 2018-11-20 | 2021-04-30 | 深圳市欢太科技有限公司 | 唤醒设备的方法、装置、存储介质及电子设备 |
CN111354357A (zh) * | 2018-12-24 | 2020-06-30 | 中移(杭州)信息技术有限公司 | 一种音频资源播放的方法、装置、电子设备及存储介质 |
CN109887508A (zh) * | 2019-01-25 | 2019-06-14 | 广州富港万嘉智能科技有限公司 | 一种基于声纹的会议自动记录方法、电子设备及存储介质 |
CN110119083A (zh) * | 2019-04-17 | 2019-08-13 | 惠州市惠泽电器有限公司 | 智能手表的唤醒方法 |
CN110134233B (zh) * | 2019-04-24 | 2022-07-12 | 福建联迪商用设备有限公司 | 一种基于人脸识别的智能音箱唤醒方法及终端 |
JP6856697B2 (ja) * | 2019-04-24 | 2021-04-07 | ヤフー株式会社 | 情報処理装置、情報処理方法、情報処理プログラム、学習装置、学習方法および学習プログラム |
CN112309383A (zh) * | 2019-08-01 | 2021-02-02 | 北京声智科技有限公司 | 语音交互方法、装置及机顶盒 |
CN110473556B (zh) * | 2019-09-17 | 2022-06-21 | 深圳市万普拉斯科技有限公司 | 语音识别方法、装置和移动终端 |
CN110782891B (zh) * | 2019-10-10 | 2022-02-18 | 珠海格力电器股份有限公司 | 一种音频处理方法、装置、计算设备及存储介质 |
CN110827836B (zh) * | 2019-10-23 | 2022-05-03 | 珠海格力电器股份有限公司 | 一种重设唤醒词的方法、装置、电子设备及存储介质 |
CN110989963B (zh) * | 2019-11-22 | 2023-08-01 | 北京梧桐车联科技有限责任公司 | 唤醒词推荐方法及装置、存储介质 |
CN110827820B (zh) * | 2019-11-27 | 2022-09-27 | 北京梧桐车联科技有限责任公司 | 语音唤醒方法、装置、设备、计算机存储介质及车辆 |
CN113593541B (zh) * | 2020-04-30 | 2024-03-12 | 阿里巴巴集团控股有限公司 | 数据处理方法、装置、电子设备和计算机存储介质 |
CN111696555A (zh) * | 2020-06-11 | 2020-09-22 | 北京声智科技有限公司 | 一种唤醒词的确认方法及系统 |
CN111880988B (zh) * | 2020-07-09 | 2022-11-04 | Oppo广东移动通信有限公司 | 一种声纹唤醒日志收集方法及装置 |
CN111899722B (zh) * | 2020-08-11 | 2024-02-06 | Oppo广东移动通信有限公司 | 一种语音处理方法及装置、存储介质 |
CN112233676A (zh) * | 2020-11-20 | 2021-01-15 | 深圳市欧瑞博科技股份有限公司 | 智能设备唤醒方法、装置、电子设备及存储介质 |
CN112951229A (zh) * | 2021-02-07 | 2021-06-11 | 深圳市今视通数码科技有限公司 | 理疗机器人的语音唤醒方法、系统和存储介质 |
CN115312068B (zh) * | 2022-07-14 | 2023-05-09 | 荣耀终端有限公司 | 语音控制方法、设备及存储介质 |
Citations (8)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
US20040101112A1 (en) * | 2002-11-26 | 2004-05-27 | Lite-On Technology Corporation | Voice identification method for cellular phone and cellular phone with voiceprint password |
EP2509291A1 (en) * | 2011-04-06 | 2012-10-10 | Research In Motion Limited | System and method for locating a misplaced mobile device |
CN103051781A (zh) * | 2012-12-07 | 2013-04-17 | 百度在线网络技术(北京)有限公司 | 语音后台控制方法及移动终端 |
CN104143326A (zh) * | 2013-12-03 | 2014-11-12 | 腾讯科技(深圳)有限公司 | 一种语音命令识别方法和装置 |
CN104202486A (zh) * | 2014-09-26 | 2014-12-10 | 上海华勤通讯技术有限公司 | 移动终端及其屏幕解锁方法 |
CN104217152A (zh) * | 2014-09-23 | 2014-12-17 | 陈包容 | 一种移动终端在待机状态下进入应用程序的实现方法和装置 |
CN104575504A (zh) * | 2014-12-24 | 2015-04-29 | 上海师范大学 | 采用声纹和语音识别进行个性化电视语音唤醒的方法 |
CN104658533A (zh) * | 2013-11-20 | 2015-05-27 | 中兴通讯股份有限公司 | 一种终端解锁的方法、装置及终端 |
Family Cites Families (12)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
JPH11194795A (ja) * | 1997-12-26 | 1999-07-21 | Kyocera Corp | 音声認識作動装置 |
JP4897040B2 (ja) * | 2007-03-14 | 2012-03-14 | パイオニア株式会社 | 音響モデル登録装置、話者認識装置、音響モデル登録方法及び音響モデル登録処理プログラム |
US8775187B2 (en) * | 2008-09-05 | 2014-07-08 | Auraya Pty Ltd | Voice authentication system and methods |
JP2010152423A (ja) * | 2008-12-24 | 2010-07-08 | Brother Ind Ltd | 個人認証装置、個人認証方法、および個人認証プログラム |
US8871260B2 (en) * | 2012-09-19 | 2014-10-28 | Transdermal Biotechnology, Inc. | Methods and compositions for muscular and neuromuscular diseases |
US9691377B2 (en) * | 2013-07-23 | 2017-06-27 | Google Technology Holdings LLC | Method and device for voice recognition training |
JP2014092777A (ja) * | 2012-11-06 | 2014-05-19 | Magic Hand:Kk | モバイル通信機器の音声による起動 |
US10134392B2 (en) * | 2013-01-10 | 2018-11-20 | Nec Corporation | Terminal, unlocking method, and program |
WO2015005927A1 (en) * | 2013-07-11 | 2015-01-15 | Intel Corporation | Device wake and speaker verification using the same audio input |
CN103595869A (zh) * | 2013-11-15 | 2014-02-19 | 华为终端有限公司 | 一种终端语音控制方法、装置及终端 |
CN103594089A (zh) * | 2013-11-18 | 2014-02-19 | 联想(北京)有限公司 | 一种语音识别方法及电子设备 |
CN104282307A (zh) * | 2014-09-05 | 2015-01-14 | 中兴通讯股份有限公司 | 唤醒语音控制系统的方法、装置及终端 |
-
2015
- 2015-11-30 CN CN201510859545.2A patent/CN106815507A/zh active Pending
-
2016
- 2016-03-04 WO PCT/CN2016/075627 patent/WO2017092189A1/zh active Application Filing
- 2016-03-04 EP EP16869503.9A patent/EP3385947A4/en not_active Withdrawn
- 2016-03-04 US US15/780,149 patent/US20180350372A1/en not_active Abandoned
- 2016-03-04 JP JP2018527923A patent/JP2019502947A/ja active Pending
Patent Citations (8)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
US20040101112A1 (en) * | 2002-11-26 | 2004-05-27 | Lite-On Technology Corporation | Voice identification method for cellular phone and cellular phone with voiceprint password |
EP2509291A1 (en) * | 2011-04-06 | 2012-10-10 | Research In Motion Limited | System and method for locating a misplaced mobile device |
CN103051781A (zh) * | 2012-12-07 | 2013-04-17 | 百度在线网络技术(北京)有限公司 | 语音后台控制方法及移动终端 |
CN104658533A (zh) * | 2013-11-20 | 2015-05-27 | 中兴通讯股份有限公司 | 一种终端解锁的方法、装置及终端 |
CN104143326A (zh) * | 2013-12-03 | 2014-11-12 | 腾讯科技(深圳)有限公司 | 一种语音命令识别方法和装置 |
CN104217152A (zh) * | 2014-09-23 | 2014-12-17 | 陈包容 | 一种移动终端在待机状态下进入应用程序的实现方法和装置 |
CN104202486A (zh) * | 2014-09-26 | 2014-12-10 | 上海华勤通讯技术有限公司 | 移动终端及其屏幕解锁方法 |
CN104575504A (zh) * | 2014-12-24 | 2015-04-29 | 上海师范大学 | 采用声纹和语音识别进行个性化电视语音唤醒的方法 |
Non-Patent Citations (1)
Title |
---|
See also references of EP3385947A4 * |
Cited By (4)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
CN109584860A (zh) * | 2017-09-27 | 2019-04-05 | 九阳股份有限公司 | 一种语音唤醒词定义方法和系统 |
CN110400568A (zh) * | 2018-04-20 | 2019-11-01 | 比亚迪股份有限公司 | 智能语音系统的唤醒方法、智能语音系统及车辆 |
CN109003611A (zh) * | 2018-09-29 | 2018-12-14 | 百度在线网络技术(北京)有限公司 | 用于车辆语音控制的方法、装置、设备和介质 |
CN112201239A (zh) * | 2020-09-25 | 2021-01-08 | 海尔优家智能科技(北京)有限公司 | 目标设备的确定方法及装置、存储介质、电子装置 |
Also Published As
Publication number | Publication date |
---|---|
EP3385947A1 (en) | 2018-10-10 |
US20180350372A1 (en) | 2018-12-06 |
JP2019502947A (ja) | 2019-01-31 |
CN106815507A (zh) | 2017-06-09 |
EP3385947A4 (en) | 2018-12-05 |
Similar Documents
Publication | Publication Date | Title |
---|---|---|
WO2017092189A1 (zh) | 语音唤醒实现方法、装置及终端、计算机存储介质 | |
CN105989333B (zh) | 指纹认证方法、系统及支持指纹认证功能的终端 | |
CN106782536B (zh) | 一种语音唤醒方法及装置 | |
US20200227049A1 (en) | Method, apparatus and device for waking up voice interaction device, and storage medium | |
US20170344802A1 (en) | Method and device for fingerprint unlocking and user terminal | |
CN107112017A (zh) | 操作语音识别功能的电子设备和方法 | |
WO2015074411A1 (zh) | 一种终端解锁的方法、装置及终端 | |
JP2019128938A (ja) | 読話による音声ウェイクアップ方法、装置、設備及びコンピュータ可読媒体 | |
CN108766438A (zh) | 人机交互方法、装置、存储介质及智能终端 | |
US20190130411A1 (en) | Method and system for data processing | |
CN106297801A (zh) | 语音处理方法及装置 | |
CN110290280B (zh) | 一种终端状态的识别方法、装置及存储介质 | |
EP3407256A1 (en) | Recognizing biological feature | |
CN110546641B (zh) | 一种访问控制方法、装置、智能设备及存储介质 | |
WO2016198019A1 (zh) | 一种电子设备的操作方法、装置及电子设备 | |
CN110544468B (zh) | 应用唤醒方法、装置、存储介质及电子设备 | |
US20180032533A1 (en) | Tool for mining chat sessions | |
US20090006857A1 (en) | Method and apparatus for starting up a computing system | |
CN106531168B (zh) | 一种语音识别方法及装置 | |
WO2021169711A1 (zh) | 指令执行方法、装置、存储介质及电子设备 | |
CN106782498A (zh) | 语音信息播放方法、装置及终端 | |
US10818298B2 (en) | Audio processing | |
CN110164431B (zh) | 一种音频数据处理方法及装置、存储介质 | |
CN110046276A (zh) | 一种语音中关键词的检索方法和装置 | |
CN112740321A (zh) | 唤醒设备的方法、装置、存储介质及电子设备 |
Legal Events
Date | Code | Title | Description |
---|---|---|---|
121 | Ep: the epo has been informed by wipo that ep was designated in this application |
Ref document number: 16869503 Country of ref document: EP Kind code of ref document: A1 |
|
WWE | Wipo information: entry into national phase |
Ref document number: 2018527923 Country of ref document: JP |
|
NENP | Non-entry into the national phase |
Ref country code: DE |
|
WWE | Wipo information: entry into national phase |
Ref document number: 2016869503 Country of ref document: EP |
|
ENP | Entry into the national phase |
Ref document number: 2016869503 Country of ref document: EP Effective date: 20180702 |