WO2016078214A1 - 终端处理方法、装置及计算机存储介质 - Google Patents

终端处理方法、装置及计算机存储介质 Download PDF

Info

Publication number
WO2016078214A1
WO2016078214A1 PCT/CN2015/071481 CN2015071481W WO2016078214A1 WO 2016078214 A1 WO2016078214 A1 WO 2016078214A1 CN 2015071481 W CN2015071481 W CN 2015071481W WO 2016078214 A1 WO2016078214 A1 WO 2016078214A1
Authority
WO
WIPO (PCT)
Prior art keywords
voice
instruction
terminal
record
application
Prior art date
Application number
PCT/CN2015/071481
Other languages
English (en)
French (fr)
Inventor
张大凯
Original Assignee
中兴通讯股份有限公司
Priority date (The priority date is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the date listed.)
Filing date
Publication date
Application filed by 中兴通讯股份有限公司 filed Critical 中兴通讯股份有限公司
Publication of WO2016078214A1 publication Critical patent/WO2016078214A1/zh

Links

Images

Classifications

    • GPHYSICS
    • G06COMPUTING; CALCULATING OR COUNTING
    • G06FELECTRIC DIGITAL DATA PROCESSING
    • G06F3/00Input arrangements for transferring data to be processed into a form capable of being handled by the computer; Output arrangements for transferring data from processing unit to output unit, e.g. interface arrangements
    • G06F3/01Input arrangements or combined input and output arrangements for interaction between user and computer
    • G06F3/048Interaction techniques based on graphical user interfaces [GUI]
    • G06F3/0484Interaction techniques based on graphical user interfaces [GUI] for the control of specific functions or operations, e.g. selecting or manipulating an object, an image or a displayed text element, setting a parameter value or selecting a range

Definitions

  • the present invention relates to terminal control technologies in the field of electrical engineering, and in particular, to a terminal processing method and apparatus, and a computer storage medium.
  • the voice interaction schemes of the existing popular terminals all follow the interactive process of "speaking awakening words - saying instructions - executing".
  • the disadvantage of this kind of interaction scheme is that the process is rigid. Specifically, the wake-up words are fixed, and the common behaviors of the users cannot effectively streamline the interaction process, which greatly reduces the user experience. This often makes the terminal's voice interaction system virtually useless.
  • the embodiment of the invention provides a terminal processing method and device, and a computer storage medium, so that the user can perform a preference setting of the activation password according to the habit.
  • an embodiment of the present invention provides a terminal processing method, including:
  • the second instruction is executed to launch the application.
  • the first voice record further includes application data corresponding to the application
  • the executing the second instruction to launch the application comprises: using the application data, executing the second instruction to launch the application.
  • the application data includes: a contact to be dialed, a contact to be sent a short message, a video identifier to be played (ID, IDentity, or a username and password of the client software to be logged in).
  • a contact to be dialed a contact to be sent a short message
  • a video identifier to be played ID, IDentity, or a username and password of the client software to be logged in.
  • the method further includes:
  • the first voice file is matched with the second voice recording database to generate a second matching result
  • the third instruction is executed to wake up the terminal in the listening mode.
  • the method before the receiving the first voice file, the method further includes:
  • the method further includes: configuring the first according to a command of the user Application data corresponding to voice recording;
  • the method further includes: establishing a correspondence between the first voice record and the application data.
  • the method before the receiving the first voice file, the method further includes:
  • an embodiment of the present invention provides a terminal processing device, which can be applied to a terminal, where the device includes:
  • a receiving unit configured to receive the first voice file
  • the first matching unit is configured to match the first voice file with the first voice record library to generate a first matching result
  • a first acquiring unit configured to acquire a first voice record corresponding to the first voice file in the first voice recording database when the first matching result is that the matching is successful, where the first voice record corresponds to a first instruction for waking up the terminal and a second instruction for starting at least one application;
  • a first execution unit configured to execute the first instruction to wake up the terminal
  • a second execution unit configured to execute the second instruction to launch the application.
  • the device further comprises:
  • a second matching unit configured to: when the first matching result is that the matching is unsuccessful, matching the first voice file with the second voice recording database to generate a second matching result;
  • a second acquiring unit configured to acquire, in the second voice recording library, a second voice record corresponding to the second voice file, when the second matching result is that the matching is successful, where the second voice record is only Corresponding to a third instruction for waking up the terminal;
  • a third execution unit configured to execute the third instruction to wake up the terminal in the listening mode.
  • the device further comprises:
  • the collecting unit is configured to perform voice collection in the training mode to obtain the first voice record
  • a configuration unit configured to configure, according to a user command, a first instruction for waking up the terminal and a second instruction for starting at least one application for the first voice record;
  • Establishing a unit configured to establish a correspondence between the first voice record and the first instruction; and establish a correspondence between the first voice record and the second instruction.
  • an embodiment of the present invention further provides a terminal, where the terminal processing device is provided in the terminal.
  • an embodiment of the present invention further provides a computer storage medium, where the computer storage medium stores executable instructions, and the executable instructions are used to execute the processing method of the terminal.
  • the user can perform a preference setting of the activation password according to the habit.
  • the user In the terminal monitoring state, the user only needs a password to trigger the terminal to perform the action that the user wants the terminal, and the process of the voice interaction is simplified, and the "breaking" process is broken.
  • the general process of saying wake-up words - saying instructions - execution "has greatly improved the user experience.
  • FIG. 1 is a schematic flowchart of a method for processing a terminal in a training mode according to an embodiment of the present invention
  • FIG. 2 is a schematic flowchart of a method for processing a terminal in a listening mode according to an embodiment of the present invention
  • FIG. 3 is a flowchart of a common voice interaction of an application scenario in an embodiment of the present invention.
  • FIG. 4 is a flow chart of voice interaction of activating a password in an application scenario in an embodiment of the present invention
  • FIG. 5 is a schematic structural diagram 1 of a processing apparatus of a terminal in an embodiment of the present invention.
  • FIG. 6 is a second schematic structural diagram of a processing apparatus of a terminal in an embodiment of the present invention.
  • the processing flow in the training mode is described below, and the training mode is prepared as a follow-up listening mode.
  • FIG. 1 is a schematic flowchart diagram of a processing method of a terminal according to an embodiment of the present invention, including the following steps:
  • step 11 the voice collection is performed in the training mode to obtain the first voice record.
  • Step 12 Configure, according to a user command, a corresponding first instruction for waking up the terminal and a second instruction for starting at least one application for the first voice record.
  • application data corresponding to the first voice record may be configured according to a command of the user.
  • Step 13 Establish a correspondence between the first voice record and the first instruction, and establish a correspondence between the first voice record and the second instruction.
  • the correspondence between the first voice record and the application data may also be established in the step.
  • the first voice record is "start chat” voice
  • the application is "start QQ software”
  • the login name is: "123”
  • the password is "456”
  • the first voice record "start chat” voice respectively and " The first command of "waking up the terminal”
  • the application data of "login name: 123 and password 456" establish a correspondence.
  • Step 14 Configure a third instruction for waking up the terminal corresponding to the first voice record according to a command of the user.
  • Step 15 Establish a correspondence between the first voice record and the third instruction.
  • the first voice record may further include application data corresponding to the application, and the application data may include: a contact to be dialed, a contact to be texted, a video ID to be played, or a username and password of the client software to be logged in;
  • the application data includes: a video ID; after step 15, a second instruction can be executed to start the application, and at this time, the application instruction can be used to execute the second instruction to start the application.
  • the terminal starts the QQ software according to the application data of "login name: 123 and password 456".
  • the first voice record is a “start chat” voice
  • the first voice record “start chat” voice is established to establish a corresponding relationship with the third command of the wake-up terminal.
  • a schematic flowchart of a method for processing a terminal includes the following steps:
  • Step 21 Receive a first voice file.
  • the user issues a "start chat" command.
  • Step 22 Match the first voice file with the first voice recording database to generate a first matching result, determine whether the matching is successful according to the generated first matching result, perform step 23 if successful, and perform step 26 if the matching fails.
  • Step 23 Acquire a first voice record corresponding to the first voice file in the first voice record library, where the first voice record corresponds to a first instruction for waking up the terminal and a second instruction for starting at least one application.
  • the user's "start chat” command is successfully matched with the "start chat” voice in the first voice recording library.
  • the "Start Chat” voice corresponds to the first command of "Wake Up Terminal", the second command of "Start QQ Software", and the application data of "Login Name: 123 and Password 456".
  • Step 24 executing a first instruction to wake up the terminal.
  • the wake-up terminal here can switch the terminal from the sleep (standby) mode to the working mode to turn on an input module (such as a microphone, a keyboard) and an output module (such as a screen), etc., so that the terminal can receive the command and respond at any time.
  • an input module such as a microphone, a keyboard
  • an output module such as a screen
  • Step 25 After waking up the terminal, execute a second instruction to start the application.
  • Step 26 Match the first voice file with the second voice record library to generate a second matching result.
  • Step 27 When the second matching result is that the matching is successful, acquiring a second voice record corresponding to the second voice file in the second voice recording library, where the second voice record only corresponds to the third instruction for waking up the terminal.
  • the user's "start chat” command is successfully matched with the "start chat” voice in the second voice recording library, and the "start chat” voice corresponds to the third command of "wake up terminal".
  • step 28 a third instruction is executed to wake up the terminal in the listening mode.
  • the embodiment of the present invention describes a scenario in which a user uses a startup password.
  • the user can customize a startup password.
  • the terminal can not only wake up the terminal according to the voice input of the user, but also directly start an application, for example, directly wake up the terminal and play music.
  • the application data includes: a contact.
  • the terminal processing device operating mode is switched to the training mode.
  • the user uses the four-character phrase for four recordings, and each time the data needs to be judged for the length of the syllable, within the threshold range, and consistent with the acoustic characteristics of the previous recorded data (except the first time), the training is successful. ;E.g,
  • the instruction data file is then saved to the terminal file system.
  • the instruction data file is saved to the terminal file system.
  • the terminal when switching to the listening mode, the terminal will wake up when it listens to the saved command.
  • step 31 the terminal processing device detects that the user enters the wake-up word password in the listening mode.
  • step 32 voice command data is collected.
  • step 33 the extracted instruction acoustic feature information is compared with the user-defined wake-up word.
  • step 34 if there is no match, the monitoring continues. If it matches, the terminal performs wake-up processing.
  • step 35 the terminal completes the wake-up process and waits for the user to input an instruction.
  • the terminal switches from the sleep mode (or standby mode) to the working mode, and the function modules (which may include the earphone, the keyboard, and the screen) in the terminal are in an instruction acquisition state, so that the user's instruction can be immediately responded.
  • the function modules which may include the earphone, the keyboard, and the screen
  • step 36 the user is detected to input a voice instruction, such as playing a song, calling a contact, opening an application, and the like.
  • step 37 the text information of the instruction is recognized by using a voice recognition technology.
  • step 38 an action execution is initiated.
  • the flow of initiating a password interaction described in the present invention includes the following steps:
  • step 41 the terminal processing device inputs the wake-up password in the listening mode.
  • step 42 the voice command data is collected.
  • step 43 the extracted instruction acoustic feature information is compared with a preset one-shot password.
  • Step 44 if there is no match, return to step 42 to continue monitoring to collect voice command data; if there is a matching one start password, step 45 is performed.
  • Step 45 Wake up the terminal, report the ID of the startup password, and find the corresponding action type (equivalent to the above-mentioned open application) and the additional data according to the reported activation password ID.
  • step 46 an action execution is initiated.
  • FIG. 5 is a schematic structural diagram of a terminal processing apparatus according to the present invention, including:
  • the receiving unit 51 is configured to receive the first voice file
  • the first matching unit 52 is configured to match the first voice file with the first voice record library to generate a first matching result
  • the first obtaining unit 53 is configured to: when the first matching result is that the matching is successful, acquire a first voice record corresponding to the first voice file in the first voice recording database, where the first voice record corresponds to the An instruction and a second instruction for starting at least one application;
  • the first executing unit 54 is configured to execute the first instruction to wake up the terminal
  • the second execution unit 55 is configured to execute the second instruction to launch the corresponding application.
  • the terminal processing apparatus may further include:
  • the second matching unit 55 is configured to: when the first matching result is that the matching is unsuccessful, match the first voice file with the second voice recording database to generate a second matching result;
  • the second obtaining unit 56 is configured to acquire, in the second voice recording library, a second voice record corresponding to the second voice file when the second matching result is that the matching is successful, where the second voice record only corresponds to the terminal for waking up the terminal Third instruction;
  • the third execution unit 57 is configured to execute the third instruction to wake up the terminal in the listening mode.
  • the terminal processing apparatus may further include:
  • the acquiring unit 58 is configured to perform voice collection in the training mode to obtain the first voice record.
  • the configuration unit 59 is configured to configure, according to a command of the user, a corresponding first instruction for waking up the terminal and a second instruction for starting at least one application for the first voice record;
  • the establishing unit 510 is configured to establish a correspondence between the first voice record and the first instruction, and establish a correspondence between the first voice record and the second instruction.
  • the above terminal may be a mobile terminal.
  • the receiving unit 51 and the collecting unit may be implemented by a microphone in the terminal processing device; the first matching unit 52, the first obtaining unit 53, the first executing unit 54, and the second executing unit 55 may be micro-processed in the terminal processing device.
  • MCU logic programmable gate array
  • ASIC application specific integrated circuit
  • the terminal processing device can be set as a functional module in the terminal, interacting with an application processor (AP) in the terminal to implement the terminal wake.
  • the application scenario is a configurable voice wake-up interaction device (that is, a terminal processing device, which can be set in the terminal) supporting multiple wake-up words, and includes four modules: a voice wake-up module 61 (equivalent to the above) The first matching unit), the speech recognition module 62 (equivalent to the first acquisition unit described above), the instruction training module 63 (equivalent to the above-described establishment unit), and the action configuration module 64 (equivalent to the configuration unit described above).
  • a voice wake-up module 61 (equivalent to the above) The first matching unit)
  • the speech recognition module 62 (equivalent to the first acquisition unit described above)
  • the instruction training module 63 equivalent to the above-described establishment unit
  • the action configuration module 64 equivalent to the configuration unit described above.
  • the voice waking module 61 can be configured to switch the mode in which the terminal operates for low power consumption; monitor the voice input in real time; compare the acoustic characteristics of the voice input with the existing wake words; store the plurality of wake word files; download the plurality of wake word files. That is, the voice wake-up module 61 controls the mode of operation of the terminal processing device.
  • the voice wake-up module 61 includes a main control unit 611, an instruction storage unit 612, a download unit 613, and a listening unit 614 that are sequentially connected. The function of each unit will be described separately below.
  • the working mode of the terminal processing device includes a listening mode and a training mode.
  • the listening mode refers to an operating mode in which the terminal is in the standby state and listens to the voice input password in real time.
  • the main control unit 611 completes the comparison between the voice input password and the acoustic characteristics of the existing wake-up words. If it matches a certain wake-up word, it is determined that the wake-up is successful, and the wake-up word ID is reported.
  • the training mode refers to a working mode in which the user trains the terminal to wake up words according to his or her preference. In the training mode, the main control unit 611 completes the collected voice processing work, generates a wake-up word file, and saves the file in the terminal. File system.
  • the instruction storage unit 612 is configured to store the wake-up word file generated by the main control unit 611 for processing, and is used by the download unit 613 to download the wake-up word to the main control unit 611.
  • the downloading unit 613 is configured to traverse all the wake-up word files stored in the storage unit 612 when the terminal is boot-up or when adding or deleting wake-up words, and download the wake-up word file to the master control. In unit 611.
  • the monitoring unit 614 is connected to the main control unit 611, such as a common mobile phone main microphone (MIC), for collecting voice data when the low power listening mode or the training mode is turned on, and transmitting it to the main control unit 611 for processing.
  • the main control unit 611 such as a common mobile phone main microphone (MIC)
  • the voice recognition module 62 is configured to receive the voice command of the user after the terminal is woken up by the voice wake-up module 61, and notify the action configuration module 64 to initiate a corresponding action execution according to the recognized text information, including collecting voice commands, recognizing voice commands, The initiating instruction corresponds to the execution of the action.
  • the voice recognition module 62 includes a voice collection unit 621, a voice recognition unit 622, and an action execution unit 723 that are sequentially connected. The function of each unit will be described separately below.
  • the voice collecting unit 621 is connected to the main control unit 611, and may be a mobile phone MIC, a three-four-segment earphone, a Bluetooth earphone, etc., configured to collect voice commands of the user, and send the voice command to the voice recognition unit 622.
  • the voice recognition unit 622 is configured to receive the voice command collected by the voice collection unit 621, perform voice recognition, recognize the text information, and send the message information to the action execution unit 623.
  • the action execution unit 623 is configured to receive the text information transmitted by the voice recognition unit 622, and initiate execution of the corresponding action.
  • the command training module 63 is configured to train its own personalized wake-up word and a start-up password according to the user's preference when the main control unit 611 of the voice wake-up module 61 switches to the training mode. To reduce the false wake-up rate of the training instructions, you can use a four-character phrase for recording and four recordings.
  • the main control unit of the voice wake-up module sets a threshold for the syllable length. If the threshold is lower than the lowest threshold or higher than the highest threshold, the recording fails.
  • the main control unit 611 compares the recorded voice with the previous one. If there is no match, Recording failed. It is a logic module, including the main control unit 611, the instruction storage unit 612, and the listening unit 614 introduced in the voice wake-up module 61.
  • the action configuration module 64 is configured to configure a new one-click activation password pair for the instruction training module The action to be performed, and the action is performed after the voice recognition module recognizes the text message or wakes up the terminal with activating the password. Its role includes setting a custom wake-up word; editing a startup password; storing a configuration relationship between the startup password and the execution action.
  • the action configuration module 64 includes an instruction editing unit 641 and an action configuration storage unit 642 that are sequentially connected. The function of each unit will be described separately below.
  • the instruction editing unit 641 is configured to complete the addition and deletion of the instructions and configure the actions to be performed for the instructions.
  • the action configuration storage unit 642 is configured to store the configuration relationship between the instruction and the action, and may use a database technology, a file storage technology, or the like, and save the field: the instruction ID, the action type, and the additional data.
  • the main control unit 611 and the download unit 613 in the voice wake-up module 61 can be implemented by an MCU, an FPGA, or an ASIC; the listening unit 614 can be implemented by the MIC; and the instruction storage unit 612 can be implemented by a non-volatile storage medium such as a flash memory;
  • the voice collection unit 621 in the voice recognition module 62 can be implemented by the MIC; the voice recognition unit 622 and the action execution unit 723 can be implemented by an MCU, an FPGA, or an ASIC;
  • the instruction training module 63 can be implemented by an MCU, an FPGA, or an ASIC;
  • the instruction editing unit 641 in the action configuration module 64 can be implemented by an MCU, FPGA, or ASIC; the action configuration storage unit 642 can be implemented by a non-volatile storage medium such as a flash memory.
  • a terminal is further included, and the terminal processing device shown in FIG. 5 or FIG. 6 is used.
  • the terminal may be an electronic device such as a smart phone or a tablet computer.
  • the present invention overcomes the deficiencies of the existing voice interaction scheme of the terminal, and provides a flexible, user configurable voice interaction solution.
  • the invention supports training multiple custom wake-up words. Users can define training wake-up words according to their own preferences, and can support multiple custom wake-up words at the same time, avoiding only using factory defaults. The wake-up word brings trouble to the user.
  • the invention supports a startup password.
  • the user can perform a preference for starting the password according to the habit.
  • the user In the sleep state of the terminal, the user only needs a password to execute the action that the user needs the terminal to perform, and the process of the voice interaction is simplified, and the "wake up word" is said to be broken.
  • the general process of instruction-execution greatly improves the user experience.
  • the foregoing program may be stored in a computer readable storage medium, and the program is executed when executed.
  • the foregoing storage medium includes: a mobile storage device, a random access memory (RAM), a read-only memory (ROM), a magnetic disk, or an optical disk.
  • RAM random access memory
  • ROM read-only memory
  • magnetic disk or an optical disk.
  • optical disk A medium that can store program code.
  • the above-described integrated unit of the present invention may be stored in a computer readable storage medium if it is implemented in the form of a software function module and sold or used as a standalone product.
  • the technical solution of the embodiments of the present invention may be embodied in the form of a software product in essence or in the form of a software product, which is stored in a storage medium and includes a plurality of instructions for making
  • a computer device which may be a personal computer, server, or network device, etc.
  • the foregoing storage medium includes various media that can store program codes, such as a mobile storage device, a RAM, a ROM, a magnetic disk, or an optical disk.

Abstract

一种终端的处理方法、终端处理装置及计算机存储介质;方法包括:接收第一语音文件(21);将所述第一语音文件与第一语音记录库进行匹配,生成第一匹配结果(22);当所述第一匹配结果为匹配成功时,在所述第一语音记录库中获取与所述第一语音文件对应的第一语音记录(23),所述第一语音记录对应于用于唤醒终端的第一指令和用于启动至少一个应用程序的第二指令;执行所述第一指令以唤醒终端(24);在唤醒所述终端后,执行所述第二指令以启动所述应用程序(25)。

Description

终端处理方法、装置及计算机存储介质 技术领域
本发明涉及电学领域的终端控制技术,特别涉及一种终端处理方法、装置及计算机存储介质。
背景技术
现有流行的终端的语音交互方案,都遵循着“说唤醒词——说指令——执行”的交互流程。这种交互方案的弊端是流程死板,具体为:唤醒词固定、对于用户常用的行为不能有效精简交互过程,大大降低了用户体验。这常常使得终端的语音交互系统形同虚设。
发明内容
本发明实施例提供一种终端处理方法、装置及计算机存储介质,使得用户可根据习惯进行一声启动口令的偏好设置。
一方面,本发明实施例提供一种终端处理方法,包括:
接收第一语音文件;
将所述第一语音文件与第一语音记录库进行匹配,生成第一匹配结果;
当所述第一匹配结果为匹配成功时,在所述第一语音记录库中获取与所述第一语音文件对应的第一语音记录,所述第一语音记录对应于用于唤醒终端的第一指令和用于启动至少一个应用程序的第二指令;
执行所述第一指令以唤醒终端;
在唤醒所述终端后,执行所述第二指令以启动所述应用程序。
优选地,所述第一语音记录还包括对应于所述应用程序的应用数据;
所述执行所述第二指令以启动所述应用程序包括:使用所述应用数据,执行所述第二指令以启动所述应用程序。
优选地,所述应用数据包括:待拨打的联系人、待发短信的联系人、待播放的视频标识(ID,IDentity、或待登录客户端软件的用户名和密码。
优选地,所述方法还包括:
当所述第一匹配结果为匹配不成功时,将所述第一语音文件与第二语音记录库进行匹配,生成第二匹配结果;
当所述第二匹配结果为匹配成功时,在所述第二语音记录库中获取与所述第二语音文件对应的第二语音记录,所述第二语音记录仅对应于用于唤醒终端的第三指令;
执行所述第三指令,以唤醒处于监听模式下的终端。
优选地,所述接收第一语音文件之前,所述方法还包括:
在训练模式下进行语音采集,获取第一语音记录;
根据用户的命令,为所述第一语音记录配置对应的用于唤醒终端的第一指令和用于启动至少一个应用程序的第二指令;
建立所述第一语音记录与所述第一指令之间的对应关系;以及建立所述第一语音记录与所述第二指令之间的对应关系。
所述为所述第一语音记录配置对应的用于唤醒终端的第一指令和用于启动至少一个应用程序的第二指令时,所述方法还包括:根据用户的命令,配置所述第一语音记录对应的应用数据;
所述建立所述第一语音记录与所述第二指令之间的对应关系时,所述方法还包括:建立所述第一语音记录与所述应用数据之间的对应关系。
优选地,所述接收第一语音文件之前,所述方法还包括:
在训练模式下进行语音采集,获取第一语音记录;
根据用户的命令,配置所述第一语音记录对应的用于唤醒终端的第三指令;
建立所述第一语音记录与所述第三指令之间的对应关系。
另一方面,本发明实施例提供一种终端处理装置,可以应用于终端中,装置包括:
接收单元,配置为接收第一语音文件;
第一匹配单元,配置为将所述第一语音文件与第一语音记录库进行匹配,生成第一匹配结果;
第一获取单元,配置为当所述第一匹配结果为匹配成功时,在所述第一语音记录库中获取与所述第一语音文件对应的第一语音记录,所述第一语音记录对应于用于唤醒终端的第一指令和用于启动至少一个应用程序的第二指令;
第一执行单元,配置为执行所述第一指令以唤醒终端;
第二执行单元,配置为执行所述第二指令以启动所述应用程序。
优选地,所述装置还包括:
第二匹配单元,配置为当所述第一匹配结果为匹配不成功时,将所述第一语音文件与第二语音记录库进行匹配,生成第二匹配结果;
第二获取单元,配置为当所述第二匹配结果为匹配成功时,在所述第二语音记录库中获取与所述第二语音文件对应的第二语音记录,所述第二语音记录仅对应于用于唤醒终端的第三指令;
第三执行单元,配置为执行所述第三指令,以唤醒处于监听模式下的终端。
优选地,所述装置还包括:
采集单元,配置为在训练模式下进行语音采集,获取第一语音记录;
配置单元,配置为根据用户的命令,为所述第一语音记录配置对应的用于唤醒终端的第一指令和用于启动至少一个应用程序的第二指令;
建立单元,配置为建立所述第一语音记录与所述第一指令之间的对应关系;以及建立所述第一语音记录与所述第二指令之间的对应关系。
另一方面,本发明实施例还提供一种终端,终端中设置有上述的终端处理装置。
另一方面,本发明实施例还提供一种计算机存储介质,计算机存储介质中存储有可执行指令,可执行指令用于执行上述的终端的处理方法。
本发明实施例的上述技术方案的有益效果如下:
本发明实施例中,用户可根据习惯进行一声启动口令的偏好设置,在终端监听状态下,用户只需一句口令,就可以触发终端执行用户想要终端的动作,精简了语音交互过程,打破“说唤醒词——说指令——执行”的常规流程,极大提高了用户体验。
附图说明
图1为本发明实施例中训练模式下的终端的处理方法的的流程示意图;
图2为本发明实施例中监听模式下的终端的处理方法的流程示意图;
图3是本发明实施例中一应用场景的普通语音交互流程图;
图4是本发明实施例中一应用场景中一声启动口令的语音交互流程图;
图5是本发明实施例中终端的处理装置的结构示意图一;
图6是本发明实施例中终端的处理装置的结构示意图二。
具体实施方式
为使本发明要解决的技术问题、技术方案和优点更加清楚,下面将结合附图及具体实施例进行详细描述。
以下描述训练模式下的处理流程,训练模式作为后续监听模式下的准备工作。
如图1所示,为本发明实施例记载的一种终端的处理方法的流程示意图,包括以下步骤:
步骤11,在训练模式下进行语音采集,获取第一语音记录。
步骤12,根据用户的命令,为第一语音记录配置对应的用于唤醒终端的第一指令和用于启动至少一个应用程序的第二指令。
优选地,该步骤中还可以根据用户的命令,配置第一语音记录对应的应用数据。
步骤13,建立第一语音记录与第一指令之间的对应关系;以及建立第一语音记录与第二指令之间的对应关系。
优选地,该步骤中还可以建立第一语音记录与应用数据之间的对应关系。例如,第一语音记录为“开始聊天”语音,应用程序为“启动QQ软件”,登陆名为:“123”,密码为“456”;则,第一语音记录“开始聊天”语音分别与“唤醒终端”的第一指令、“启动QQ软件”的第二指令、“登陆名为:123以及密码为456”的应用数据建立对应关系。
步骤14,根据用户的命令,配置第一语音记录对应的用于唤醒终端的第三指令。
步骤15,建立第一语音记录与第三指令之间的对应关系。
第一语音记录还可以包括对应于应用程序的应用数据,应用数据可以包括:待拨打的联系人、待发短信的联系人、待播放的视频ID、或待登录客户端软件的用户名和密码;当应用为播放器时,应用数据包括:视频ID;步骤15之后还可以执行第二指令以启动应用程序,此时可以使用应用数据,执行第二指令以启动应用程序。例如,终端根据“登陆名为:123以及密码为456”的应用数据启动QQ软件。
例如,第一语音记录为“开始聊天”语音,则,建立第一语音记录“开始聊天”语音与唤醒终端的第三指令建立对应关系。
以下描述唤醒的处理流程。
如图2所示,本发明实施例记载的一种终端的处理方法的流程示意图,包括以下步骤:
步骤21,接收第一语音文件。
例如,用户发出“开始聊天”指令。
步骤22,将第一语音文件与第一语音记录库进行匹配,生成第一匹配结果,根据生成的第一匹配结果判断是否匹配成功,如果成功则执行步骤23;如果匹配失败执行步骤26。
步骤23,在第一语音记录库中获取与第一语音文件对应的第一语音记录,第一语音记录对应于用于唤醒终端的第一指令和用于启动至少一个应用程序的第二指令。
例如,将用户的“开始聊天”指令与第一语音记录库中的“开始聊天”语音匹配成功。“开始聊天”语音对应于“唤醒终端”的第一指令、“启动QQ软件”的第二指令、“登陆名为:123以及密码为456”的应用数据。
步骤24,执行第一指令以唤醒终端。
这里的唤醒终端可以为终端从睡眠(待机)模式切换到工作模式,以开启输入模块(如麦克风、键盘)和输出模块(如屏幕)等,从而能够处于随时接收指令并进行响应的状态。
步骤25,在唤醒终端后,执行第二指令以启动应用程序。
步骤26,将第一语音文件与第二语音记录库进行匹配,生成第二匹配结果。
步骤27,当第二匹配结果为匹配成功时,在第二语音记录库中获取与第二语音文件对应的第二语音记录,第二语音记录仅对应于用于唤醒终端的第三指令。
例如,将用户的“开始聊天”指令与第二语音记录库中的“开始聊天”语音匹配成功,“开始聊天”语音对应于“唤醒终端”的第三指令。
步骤28,执行第三指令,以唤醒处于监听模式下的终端。
以上为监听模式下的处理流程。
本发明实施例描述了用户使用一声启动口令的场景,用户可以自定义一声启动口令,终端可以根据用户的语音输入,不仅唤醒终端,还可以直接启动应用程序等,例如,直接唤醒终端并播放音乐。当应用为打电话或者发短信时,应用数据包括:联系人。
以下描述进行指令训练的处理流程的应用场景,包括以下步骤:
首先,终端处理装置工作模式切换为训练模式。
然后,开始采集用户语音数据。
用户使用四字短语进行四次录制,每次采集的数据需要对其音节长度进行判定,在阈值范围之内,并且与前次录制数据声学特征一致(除第一次以外)时,则训练成功;例如,
如果当前是在训练一声启动口令,为指令配置要执行的动作,比如播放某首歌曲、打电话给某个联系人、打开某个应用等等,这些信息在配置完成后保存,其中播放歌曲、打电话、打开应用之类信息作为动作类型字段保存,歌曲ID、联系人ID、应用ID则作为附加数据(等同于上述的应用数据)保存。然后把指令数据文件保存到终端文件系统中。
如果当前是在训练自定义唤醒词,则把指令数据文件保存到终端文件系统中。
其次,当切换到监听模式时,监听到与已保存的匹配的指令时,终端会进行唤醒处理。
如图3所示,以下为本发明实施例记载的普通语音交互流程,包括以下步骤:
步骤31,终端处理装置在监听模式下,检测到用户录入唤醒词口令。
步骤32,采集到语音指令数据。
步骤33,提取指令声学特征信息与用户自定义唤醒词比较。
步骤34,如果不匹配,则继续监听。如果匹配,则终端进行唤醒处理。
步骤35,终端完成唤醒处理,等待用户输入指令。
例如,终端从睡眠模式(或待机模式)切换到工作模式,终端中的功能模块(可以包括耳机、键盘、屏幕)处于指令采集状态,从而可以对用户的指令进行即时响应。
步骤36,检测到用户录入语音指令,比如播放歌曲、打电话给某联系人,打开某应用等。
步骤37,利用语音识别技术识别出指令的文字信息。
步骤38,发起动作执行。
如图4所示,为本发明描述的一声启动口令交互流程,包括以下步骤:
步骤41,终端处理装置在监听模式下,用户录入唤醒词口令。
步骤42,采集到语音指令数据。
步骤43,提取指令声学特征信息与预置的一声启动口令比较。
步骤44,如果不匹配,则返回步骤42继续监听以采集语音指令数据;如果存在匹配的一声启动口令,则执行步骤45。
步骤45,唤醒终端,上报一声启动口令的ID,根据上报的一声启动口令ID,查找对应的动作类型(等同于上述的打开应用程序)和附加数据。
步骤46,发起动作执行。
如图5所示,为本发明记载的一种终端处理装置的结构示意图,包括:
接收单元51,配置为接收第一语音文件;
第一匹配单元52,配置为将第一语音文件与第一语音记录库进行匹配,生成第一匹配结果;
第一获取单元53,配置为当第一匹配结果为匹配成功时,在第一语音记录库中获取与第一语音文件对应的第一语音记录,第一语音记录对应于用于唤醒终端的第一指令和用于启动至少一个应用程序的第二指令;
第一执行单元54,配置为执行第一指令以唤醒终端;
第二执行单元55,配置为执行第二指令以启动对应的应用程序。
作为一个实施方式,终端处理装置还可以包括:
第二匹配单元55,配置为当第一匹配结果为匹配不成功时,将第一语音文件与第二语音记录库进行匹配,生成第二匹配结果;
第二获取单元56,配置为当第二匹配结果为匹配成功时,在第二语音记录库中获取与第二语音文件对应的第二语音记录,第二语音记录仅对应于用于唤醒终端的第三指令;
第三执行单元57,配置为执行第三指令,以唤醒处于监听模式下的终端。
作为一个实施方式,终端处理装置还可以包括:
采集单元58,配置为在训练模式下进行语音采集,获取第一语音记录;
配置单元59,配置为根据用户的命令,为所述第一语音记录配置对应的用于唤醒终端的第一指令和用于启动至少一个应用程序的第二指令;
建立单元510,配置为建立第一语音记录与第一指令之间的对应关系;以及建立第一语音记录与第二指令之间的对应关系。
上述的终端可以为移动终端。
实际应用中,接收单元51、采集单元可由终端处理装置中的麦克风实现;第一匹配单元52、第一获取单元53、第一执行单元54、第二执行单元55可由终端处理装置中的微处理器(MCU)、逻辑可编程门阵列(FPGA)或专用集成电路(ASIC)实现;终端处理装置可以作为一个功能模块设置于终端中,与终端中的应用处理器(AP)交互实现对终端的唤醒。
下描述本发明的终端的应用场景。
首先描述实现本发明的背景技术。
终端低功耗不间断侦测(Always-on)技术的出现,允许终端在待机(也即应用处理器AP睡眠)的情况下,不间断侦测,以极低的功耗为终端提供 广泛的环境感知能力,从而提供真正自然的用户体验。低功耗语音唤醒技术应运而生,它让终端在休眠状态下,随时获取语音指令并根据指令行事变成可能。
如图6所示,本应用场景为一种支持多唤醒词的可配置语音唤醒交互装置(也即终端处理装置,可以设置于终端中),包含四个模块:语音唤醒模块61(等同于上述的第一匹配单元)、语音识别模块62(等同于上述的第一获取单元)、指令训练模块63(等同于上述的建立单元)、以及动作配置模块64(等同于上述的配置单元)。
语音唤醒模块61可以配置为切换终端为低功耗工作的模式;实时监听语音输入;比较语音输入与已有唤醒词的声学特征;多个唤醒词文件的存储;多个唤醒词文件的下载。也就是说,语音唤醒模块61控制着终端处理装置的工作模式。
所述语音唤醒模块61包括依次连接的主控单元611、指令存储单元612、下载单元613,监听单元614。以下分别说明各单元的作用。
终端处理装置的工作模式包括监听模式与训练模式。监听模式指的是:终端处于待机状态下,实时监听语音输入口令的一种工作模式,在监听模式下,主控单元611完成语音输入口令与已有唤醒词声学特征的比较。如果与某个唤醒词匹配,则判定唤醒成功,上报唤醒词ID。训练模式指的是:用户根据自己的喜好对终端训练唤醒词的一种工作模式,在训练模式下,主控单元611完成对采集到的语音处理工作,生成唤醒词文件,并保存在终端的文件系统。
指令存储单元612配置为存储主控单元611处理后生成的唤醒词文件,供下载单元613下载唤醒词到主控单元611时引用。
下载单元613配置为在终端开机初始化时或者新增、删除唤醒词时,遍历存储单元612中存储的所有唤醒词文件,并将唤醒词文件下载到主控 单元611中。
监听单元614连接在主控单元611上,比如常见的手机主麦克风(MIC),用于在低功耗监听模式或训练模式开启时,采集语音数据,并传送给主控单元611处理。
语音识别模块62配置为在终端通过语音唤醒模块61被唤醒之后,接收用户的语音指令,并依据识别出的文字信息通知动作配置模块64发起相应的动作执行,包括采集语音指令、识别语音指令、发起指令对应动作的执行。
语音识别模块62包括依次连接的语音采集单元621、语音识别单元622、动作执行单元723。以下分别说明各单元的作用。
语音采集单元621连接在主控单元611上,可以是手机MIC、三四段式耳机、蓝牙耳机等,配置为采集用户的语音指令,并发送给语音识别单元622。
语音识别单元622配置为接收语音采集单元621采集的语音指令,进行语音识别,识别出文字信息,发送给动作执行单元623。
动作执行单元623配置为接收语音识别单元622发送的文字信息,发起对应动作的执行。
指令训练模块63配置为在语音唤醒模块61的主控单元611切换到训练模式下,根据用户喜好训练自己的个性化唤醒词与一声启动口令。为降低所训练指令的误唤醒率,可以使用四字短语进行录制,并进行四次录制。语音唤醒模块的主控单元对音节长度设置阈值,低于最低阈值或高于最高阈值,则录制失败;主控单元611还会对后次录制的语音与前次相比较,如不匹配,则录制失败。它是一个逻辑模块,包括语音唤醒模块61中介绍的主控单元611、指令存储单元612、监听单元614。
动作配置模块64配置为给指令训练模块的新建的一声启动口令配置对 应要执行的动作,并在语音识别模块识别出文字信息,或者一声启动口令唤醒终端后,发起动作执行。其作用包括设置自定义唤醒词;编辑一声启动口令;存储一声启动口令与执行动作之间的配置关系。
动作配置模块64包括依次连接的指令编辑单元641、动作配置存储单元642。以下分别说明各单元的作用。
指令编辑单元641,配置为完成对指令的增删、为指令配置要执行的动作。
动作配置存储单元642,配置为用于存储指令与动作的配置关系,可以采用数据库技术、文件存储技术等,保存字段:指令ID、动作类型、附加数据。
实际应用中,语音唤醒模块61中的主控单元611、下载单元613可由MCU、FPGA或ASIC实现;监听单元614可以MIC实现;指令存储单元612可由非易失性存储介质如闪存实现;
语音识别模块62中的语音采集单元621可由MIC实现;语音识别单元622、动作执行单元723可由MCU、FPGA或ASIC实现;
指令训练模块63可由MCU、FPGA或ASIC实现;
动作配置模块64中的指令编辑单元641可由MCU、FPGA或ASIC实现;动作配置存储单元642可由非易失性存储介质如闪存实现。
本发明实施例中还记载一种终端,包括图5或图6所示的终端处理装置,实际应用中终端可以为智能手机、平板电脑等电子设备。
以下描述本发明的有益效果:
1、本发明克服终端现有语音交互方案的不足,提供一种灵活的、用户可配置的语音交互方案。
2、本发明支持训练多个自定义唤醒词。用户可以根据自己的喜好定义训练唤醒词,并可以同时支持多个自定义唤醒词,避免只能使用厂商默认 唤醒词给用户带来的困扰。
3、本发明支持一声启动口令。用户可根据习惯进行一声启动口令的偏好设置,在终端休眠状态下,用户只需一句口令,就可以让终端执行用户需要终端执行的动作,精简了语音交互过程,打破“说唤醒词——说指令——执行”的常规流程,极大提高了用户体验。
本领域普通技术人员可以理解:实现上述方法实施例的全部或部分步骤可以通过程序指令相关的硬件来完成,前述的程序可以存储于一计算机可读取存储介质中,该程序在执行时,执行包括上述方法实施例的步骤;而前述的存储介质包括:移动存储设备、随机存取存储器(RAM,Random Access Memory)、只读存储器(ROM,Read-Only Memory)、磁碟或者光盘等各种可以存储程序代码的介质。
或者,本发明上述集成的单元如果以软件功能模块的形式实现并作为独立的产品销售或使用时,也可以存储在一个计算机可读取存储介质中。基于这样的理解,本发明实施例的技术方案本质上或者说对相关技术做出贡献的部分可以以软件产品的形式体现出来,该计算机软件产品存储在一个存储介质中,包括若干指令用以使得一台计算机设备(可以是个人计算机、服务器、或者网络设备等)执行本发明各个实施例所述方法的全部或部分。而前述的存储介质包括:移动存储设备、RAM、ROM、磁碟或者光盘等各种可以存储程序代码的介质。
以上所述,仅为本发明的具体实施方式,但本发明的保护范围并不局限于此,任何熟悉本技术领域的技术人员在本发明揭露的技术范围内,可轻易想到变化或替换,都应涵盖在本发明的保护范围之内。因此,本发明的保护范围应以所述权利要求的保护范围为准。

Claims (11)

  1. 一种终端的处理方法,包括:
    接收第一语音文件;
    将所述第一语音文件与第一语音记录库进行匹配,生成第一匹配结果;
    当所述第一匹配结果为匹配成功时,在所述第一语音记录库中获取与所述第一语音文件对应的第一语音记录,所述第一语音记录对应于用于唤醒终端的第一指令和用于启动至少一个应用程序的第二指令;
    执行所述第一指令以唤醒终端;
    在唤醒所述终端后,执行所述第二指令以启动所述应用程序。
  2. 根据权利要求1所述的方法,其中,所述第一语音记录还包括对应于所述应用程序的应用数据;
    所述执行所述第二指令以启动所述应用程序包括:使用所述应用数据,执行所述第二指令以启动所述应用程序。
  3. 根据权利要求2所述的方法,其中,
    所述应用数据包括:待拨打的联系人、待发短信的联系人、待播放的视频ID、或待登录客户端软件的用户名和密码。
  4. 根据权利要求1所述的方法,其中,还包括:
    当所述第一匹配结果为匹配不成功时,将所述第一语音文件与第二语音记录库进行匹配,生成第二匹配结果;
    当所述第二匹配结果为匹配成功时,在所述第二语音记录库中获取与所述第二语音文件对应的第二语音记录,所述第二语音记录仅对应于用于唤醒终端的第三指令;
    执行所述第三指令,以唤醒处于监听模式下的终端。
  5. 根据权利要求1所述的方法,其中,所述接收第一语音文件之前,所述方法还包括:
    在训练模式下进行语音采集,获取第一语音记录;
    根据用户的命令,为所述第一语音记录配置对应的用于唤醒终端的第一指令和用于启动至少一个应用程序的第二指令;
    建立所述第一语音记录与所述第一指令之间的对应关系;以及建立所述第一语音记录与所述第二指令之间的对应关系。
  6. 根据权利要求5所述的方法,其中,所述为所述第一语音记录配置对应的用于唤醒终端的第一指令和用于启动至少一个应用程序的第二指令时,所述方法还包括:根据用户的命令,配置所述第一语音记录对应的应用数据;
    所述建立所述第一语音记录与所述第二指令之间的对应关系时,所述方法还包括:建立所述第一语音记录与所述应用数据之间的对应关系。
  7. 根据权利要求1所述的方法,其中,所述接收第一语音文件之前,所述方法还包括:
    在训练模式下进行语音采集,获取第一语音记录;
    根据用户的命令,配置所述第一语音记录对应的用于唤醒终端的第三指令;
    建立所述第一语音记录与所述第三指令之间的对应关系。
  8. 一种终端处理装置,包括:
    接收单元,配置为接收第一语音文件;
    第一匹配单元,配置为将所述第一语音文件与第一语音记录库进行匹配,生成第一匹配结果;
    第一获取单元,配置为当所述第一匹配结果为匹配成功时,在所述第一语音记录库中获取与所述第一语音文件对应的第一语音记录,所述第一语音记录对应于用于唤醒终端的第一指令和用于启动至少一个应用程序的第二指令;
    第一执行单元,配置为执行所述第一指令以唤醒终端;
    第二执行单元,配置为在唤醒所述终端后,执行所述第二指令以启动所述应用程序。
  9. 根据权利要求8所述的终端,其中,还包括:
    第二匹配单元,配置为当所述第一匹配结果为匹配不成功时,将所述第一语音文件与第二语音记录库进行匹配,生成第二匹配结果;
    第二获取单元,配置为当所述第二匹配结果为匹配成功时,在所述第二语音记录库中获取与所述第二语音文件对应的第二语音记录,所述第二语音记录仅对应于用于唤醒终端的所述第三指令;
    第三执行单元,配置为执行所述第三指令,以唤醒处于监听模式下的终端。
  10. 根据权利要求8所述的终端,其中,还包括:
    采集单元,配置为在训练模式下进行语音采集,获取第一语音记录;
    配置单元,配置为根据用户的命令,为所述第一语音记录配置对应的用于唤醒终端的第一指令和用于启动至少一个应用程序的第二指令;
    建立单元,配置为建立所述第一语音记录与所述第一指令之间的对应关系;以及建立所述第一语音记录与所述第二指令之间的对应关系。
  11. 一种计算机存储介质,所述计算机存储介质中存储有可执行指令,所述可执行指令用于执行权利要求1至权利要求7任一项所述的终端处理方法。
PCT/CN2015/071481 2014-11-18 2015-01-23 终端处理方法、装置及计算机存储介质 WO2016078214A1 (zh)

Applications Claiming Priority (2)

Application Number Priority Date Filing Date Title
CN201410657861.7A CN105677004A (zh) 2014-11-18 2014-11-18 一种终端的处理方法和终端
CN201410657861.7 2014-11-18

Publications (1)

Publication Number Publication Date
WO2016078214A1 true WO2016078214A1 (zh) 2016-05-26

Family

ID=56013137

Family Applications (1)

Application Number Title Priority Date Filing Date
PCT/CN2015/071481 WO2016078214A1 (zh) 2014-11-18 2015-01-23 终端处理方法、装置及计算机存储介质

Country Status (2)

Country Link
CN (1) CN105677004A (zh)
WO (1) WO2016078214A1 (zh)

Cited By (4)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN108335695A (zh) * 2017-06-27 2018-07-27 腾讯科技(深圳)有限公司 语音控制方法、装置、计算机设备和存储介质
CN111081241A (zh) * 2019-11-20 2020-04-28 Oppo广东移动通信有限公司 设备误唤醒的数据检测方法、装置、移动终端和存储介质
CN111143773A (zh) * 2019-12-16 2020-05-12 中国平安财产保险股份有限公司 建立概率模型的方法、装置、计算机设备和存储介质
CN111190806A (zh) * 2019-12-30 2020-05-22 苏州思必驰信息科技有限公司 一种语音交互设备的日志处理方法和装置

Families Citing this family (6)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN106971719A (zh) * 2017-05-16 2017-07-21 上海智觅智能科技有限公司 一种离线可切换唤醒词的非特定音语音识别唤醒方法
US10504511B2 (en) * 2017-07-24 2019-12-10 Midea Group Co., Ltd. Customizable wake-up voice commands
CN107526512B (zh) * 2017-08-31 2020-11-20 联想(北京)有限公司 用于电子设备的切换方法和系统
CN107729102B (zh) * 2017-09-28 2020-04-10 维沃移动通信有限公司 一种信息处理方法及移动终端
CN108062464A (zh) * 2017-11-27 2018-05-22 北京传嘉科技有限公司 基于声纹识别的终端控制方法及系统
CN109448734A (zh) * 2018-09-20 2019-03-08 李庆湧 基于声纹的终端设备解锁及应用启动方法以及装置

Citations (5)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
US20100269040A1 (en) * 2009-04-16 2010-10-21 Lg Electronics Inc. Mobile terminal and control method thereof
CN103176714A (zh) * 2012-04-24 2013-06-26 微软公司 从锁定屏幕直接访问应用
CN103269395A (zh) * 2013-04-22 2013-08-28 聚熵信息技术(上海)有限公司 基于锁屏状态下的语音控制方法及其装置
CN103680504A (zh) * 2012-09-18 2014-03-26 英业达科技有限公司 语音解锁系统及其方法
CN104049722A (zh) * 2013-03-11 2014-09-17 联想(北京)有限公司 一种信息处理方法以及电子设备

Family Cites Families (3)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN103049192A (zh) * 2012-12-17 2013-04-17 广东欧珀移动通信有限公司 一种应用程序开启方法及装置
US10395651B2 (en) * 2013-02-28 2019-08-27 Sony Corporation Device and method for activating with voice input
CN104133631B (zh) * 2014-07-28 2017-09-05 步步高教育电子有限公司 一种从锁屏界面快速开启应用的方法和装置

Patent Citations (5)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
US20100269040A1 (en) * 2009-04-16 2010-10-21 Lg Electronics Inc. Mobile terminal and control method thereof
CN103176714A (zh) * 2012-04-24 2013-06-26 微软公司 从锁定屏幕直接访问应用
CN103680504A (zh) * 2012-09-18 2014-03-26 英业达科技有限公司 语音解锁系统及其方法
CN104049722A (zh) * 2013-03-11 2014-09-17 联想(北京)有限公司 一种信息处理方法以及电子设备
CN103269395A (zh) * 2013-04-22 2013-08-28 聚熵信息技术(上海)有限公司 基于锁屏状态下的语音控制方法及其装置

Cited By (7)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN108335695A (zh) * 2017-06-27 2018-07-27 腾讯科技(深圳)有限公司 语音控制方法、装置、计算机设备和存储介质
CN111081241A (zh) * 2019-11-20 2020-04-28 Oppo广东移动通信有限公司 设备误唤醒的数据检测方法、装置、移动终端和存储介质
CN111081241B (zh) * 2019-11-20 2023-04-07 Oppo广东移动通信有限公司 设备误唤醒的数据检测方法、装置、移动终端和存储介质
CN111143773A (zh) * 2019-12-16 2020-05-12 中国平安财产保险股份有限公司 建立概率模型的方法、装置、计算机设备和存储介质
CN111143773B (zh) * 2019-12-16 2023-02-07 中国平安财产保险股份有限公司 建立概率模型的方法、装置、计算机设备和存储介质
CN111190806A (zh) * 2019-12-30 2020-05-22 苏州思必驰信息科技有限公司 一种语音交互设备的日志处理方法和装置
CN111190806B (zh) * 2019-12-30 2022-07-29 思必驰科技股份有限公司 一种语音交互设备的日志处理方法和装置

Also Published As

Publication number Publication date
CN105677004A (zh) 2016-06-15

Similar Documents

Publication Publication Date Title
WO2016078214A1 (zh) 终端处理方法、装置及计算机存储介质
US20210287671A1 (en) Speech recognition method, speech wakeup apparatus, speech recognition apparatus, and terminal
JP6811758B2 (ja) 音声対話方法、装置、デバイス及び記憶媒体
CN107112017B (zh) 操作语音识别功能的电子设备和方法
US10629013B2 (en) Unlocking control methods and related products
WO2017012511A1 (zh) 语音控制方法、装置及投影仪设备
US9549273B2 (en) Selective enabling of a component by a microphone circuit
TWI525532B (zh) Set the name of the person to wake up the name for voice manipulation
CN105575395A (zh) 语音唤醒方法及装置、终端及其处理方法
US10854199B2 (en) Communications with trigger phrases
CN104866274B (zh) 信息处理方法及电子设备
KR102580408B1 (ko) 음성 기능을 갖는 휴대용 오디오 디바이스
CN106131292B (zh) 设置终端唤醒的方法、唤醒方法及对应的系统
WO2010139169A1 (zh) 终端电量节省方法及装置
TW202025138A (zh) 語音互動方法、裝置及系統
WO2015188459A1 (zh) 一种终端控制方法、装置、语音控制装置及终端
JP7051799B2 (ja) 音声認識制御方法、装置、電子デバイス及び読み取り可能な記憶媒体
WO2016184095A1 (zh) 操作事件的执行方法及装置、终端
WO2019227370A1 (zh) 一种多语音助手控制方法、装置、系统及计算机可读存储介质
WO2020043217A1 (zh) 快速语音记录方法、装置、移动终端和计算机存储介质
CN108093350B (zh) 麦克风的控制方法和麦克风
WO2016172846A1 (zh) 基于吹气动作操作移动终端的方法和移动终端
WO2021058004A1 (zh) 设备控制方法、装置、电子设备及可读存储介质
CN114391165A (zh) 语音信息处理方法、装置、设备及存储介质
WO2023246036A1 (zh) 语音识别设备的控制方法、装置、电子设备及存储介质

Legal Events

Date Code Title Description
121 Ep: the epo has been informed by wipo that ep was designated in this application

Ref document number: 15860062

Country of ref document: EP

Kind code of ref document: A1

NENP Non-entry into the national phase

Ref country code: DE

122 Ep: pct application non-entry in european phase

Ref document number: 15860062

Country of ref document: EP

Kind code of ref document: A1