WO2016082344A1 - Voice control method and apparatus, and storage medium - Google Patents

Voice control method and apparatus, and storage medium

Info

Publication number
WO2016082344A1
Authority
WO
WIPO (PCT)
Prior art keywords
voice
input
user
preset
preset information
Prior art date
Application number
PCT/CN2015/072705
Other languages
French (fr)
Chinese (zh)
Inventor
魏占婷
Original Assignee
中兴通讯股份有限公司
Priority date (The priority date is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the date listed.)
Filing date
Publication date
Application filed by 中兴通讯股份有限公司
Publication of WO2016082344A1

Classifications

    • G PHYSICS
    • G10 MUSICAL INSTRUMENTS; ACOUSTICS
    • G10L SPEECH ANALYSIS OR SYNTHESIS; SPEECH RECOGNITION; SPEECH OR VOICE PROCESSING; SPEECH OR AUDIO CODING OR DECODING
    • G10L15/00 Speech recognition
    • G10L15/26 Speech to text systems
    • H ELECTRICITY
    • H04 ELECTRIC COMMUNICATION TECHNIQUE
    • H04M TELEPHONIC COMMUNICATION
    • H04M1/00 Substation equipment, e.g. for use by subscribers
    • H04M1/72 Mobile telephones; Cordless telephones, i.e. devices for establishing wireless links to base stations without route selection
    • H04M1/725 Cordless telephones

Definitions

  • the present invention relates to the field of communications technologies, and in particular, to a voice control method, apparatus, and storage medium.
  • in the prior art, when a dangerous situation occurs, the user must say "dial 110" before the terminal can automatically dial 110; however, once the user has said "dial 110", other people know the user's intention and can immediately intervene and cut off the call, which prevents the user from getting help.
  • a method in which the terminal acts on the literal meaning of the user's voice therefore lacks security: other people can easily learn the user's intention and interfere with the user's operation.
  • embodiments of the present invention mainly provide a method, an apparatus, and a storage medium for voice control.
  • the embodiment of the invention provides a method for voice control, which is applied to a terminal side, and the method includes:
  • if the preset function is enabled on the terminal side, determining whether an identification voice consistent with the input voice is pre-stored on the terminal side;
  • if the identification voice exists, acquiring, according to the identification voice, preset information that is set in advance and unrelated to the meaning of the identification voice;
  • the embodiment of the present invention further provides a device for voice control, which is applied to a terminal side, where the device includes: a voice acquiring module, a determining module, a preset information acquiring module, and a first executing module;
  • a voice acquisition module configured to acquire an input voice of the user;
  • a determining module configured to determine, if the preset function is enabled on the terminal side, whether an identification voice consistent with the input voice is pre-stored on the terminal side;
  • a preset information acquiring module configured to acquire, if the identification voice exists and according to the identification voice, preset information that is set in advance and unrelated to the meaning of the identification voice;
  • the first execution module is configured to perform an operation corresponding to the preset information.
  • the embodiment of the present invention further provides a terminal, where the terminal includes a processor configured to: acquire an input voice of the user; if the preset function is enabled on the terminal side, determine whether an identification voice consistent with the input voice is pre-stored on the terminal side; if the identification voice exists, acquire, according to the identification voice, preset information that is set in advance and unrelated to the meaning of the identification voice; and perform the operation corresponding to the preset information.
  • the embodiment of the invention further provides a computer storage medium, wherein the computer storage medium stores computer executable instructions, and the computer executable instructions are used to execute the voice control method.
  • in the voice control method, device, and storage medium of the embodiment of the present invention, the input voice of the user is acquired and matched against the identification voice pre-stored on the terminal side; when the two match, preset information unrelated to the meaning of the identification voice is obtained, and the terminal performs the operation corresponding to that preset information.
  • because the preset information is set in advance and is unrelated to the meaning of the identification voice, other users cannot directly learn the user's real intention; this realizes personalized voice control settings, improves the security and serviceability of terminal voice input, and improves user satisfaction.
  • FIG. 1 is a flow chart showing the basic steps of a method for voice control according to an embodiment of the present invention
  • FIG. 2 is a flow chart showing the basic steps of a method for setting preset information in a method for voice control according to an embodiment of the present invention
  • FIG. 3 is a schematic structural diagram of an apparatus for voice control according to an embodiment of the present invention.
  • FIG. 4 is a schematic diagram showing a connection relationship of a specific structure of a device for voice control according to an embodiment of the present invention
  • FIG. 5 is a flowchart showing the execution of specific embodiment 1 of the present invention.
  • FIG. 6 is a flowchart showing the execution of specific embodiment 2 of the present invention.
  • FIG. 7 is a flowchart showing the execution of specific embodiment 3 of the present invention.
  • FIG. 8 is a flowchart showing the execution of specific embodiment 4 of the present invention.
  • FIG. 9 is a flowchart showing the execution of specific embodiment 5 of the present invention.
  • the present invention addresses the problem that the voice control mode of terminals in the prior art is not secure, and provides a voice control method and device: the input voice of the user is acquired and matched against the identification voice pre-stored on the terminal side; when the two match, preset information unrelated to the meaning of the identification voice is obtained, and the terminal performs the operation corresponding to that preset information.
  • because the preset information is set in advance and is unrelated to the meaning of the identification voice, other users cannot directly learn the user's real intention; this realizes personalized voice control settings, improves the security and serviceability of terminal voice input, and improves user satisfaction.
  • an embodiment of the present invention provides a voice control method, which is applied to a terminal side, and includes:
  • Step 11 Acquire an input voice of the user;
  • Step 12 If the preset function is enabled on the terminal side, determine whether an identification voice consistent with the input voice is pre-stored on the terminal side;
  • Step 13 If the identification voice exists, acquire, according to the identification voice, preset information that is set in advance and unrelated to the meaning of the identification voice;
  • Step 14 Perform the operation corresponding to the preset information.
  • the input voice of the user is the sound uttered by the user.
  • the terminal is provided with a human-machine interface module, which is the interface that detects and collects the user's sound and passes the collected sound to the central processing unit of the terminal;
  • the central processing unit on the terminal side performs step 12 and step 13, that is, it parses the input voice of the user and retrieves the preset information corresponding to the identification voice that matches the input voice; to keep voice input secure, the preset information is unrelated to the meaning of the identification voice.
  • the specific steps for setting the preset information that is unrelated to the meaning of the identification voice include:
  • Step 21 Acquire preset information input by the user through a preset interface, where the preset information is used to instruct the terminal to perform a corresponding operation;
  • Step 22 In response to the user inputting a voice through the voice interface, set the input voice as the identification voice of the preset information, where the preset information is unrelated to the content of the identification voice.
  • the preset information is the operation that the user actually wants the terminal to perform; the user defines it through a preset interface, which mainly includes an interface for inputting text, an interface for inputting voice, and an interface for calling instructions.
  • in the setting method provided by the embodiment of the present invention, the user sets an identification voice for the preset information through the voice interface, and identification voices and preset information are in one-to-one correspondence; that is, the terminal detects the user's voice, and if that voice is one of the identification voices, the terminal obtains the preset information corresponding to it and performs the operation corresponding to that preset information.
  • with this setting method the terminal does not act directly on the actual meaning of the user's voice, which improves the security of the terminal's voice control.
  • when the preset interface is an input text interface, step 21 is specifically:
  • Step 211 Acquire text input by the user through the input text interface.
  • when the preset interface is an input voice interface, step 21 is specifically:
  • Step 212 Acquire a voice input by the user through the input voice interface.
  • when the preset interface is an interface for calling instructions, step 21 is specifically:
  • Step 213 Acquire the instructions preset by the user;
  • Step 214 Acquire the instruction that the user selects from the preset instructions through the instruction-calling interface.
  • the interface for inputting text on the terminal side is the text input mode on the user interface (UI);
  • the interface for inputting voice is the voice input mode on the UI;
  • the interface for calling instructions is the command input mode on the UI.
  • specifically, the terminal can customize the usage scenarios of text, command, and voice input. For example, text input can be enabled in all editing interfaces of the terminal; text and voice input can be enabled in the dialog interface of a chat application; and command input such as "page turning" or "exiting" can be enabled when browsing web pages.
  • when the mobile phone detects a voice consistent with the defined voice "yes", it automatically enters the text "I am at home" in the edit box.
  • when the mobile phone detects a voice consistent with the defined voice "yes", it automatically sends the voice message "test success".
  • the user selects the page turning command and defines a "page turning" voice for it; the recorded voice can be the user's own voice or any other sound the user chooses;
  • when the mobile phone detects a voice consistent with the defined voice "page turning", the web page or document automatically turns the page.
  • the voice control method of the embodiment of the present invention also provides a configuration switch on the terminal: the method takes effect only when the configuration switch is turned on; if the configuration switch is turned off, the terminal recognizes the user's voice normally and performs the operation corresponding to its actual meaning, so the original function of the terminal is not affected.
  • the method thereby implements customized voice input, which greatly improves the security of the terminal.
  • the method further includes:
  • Step 31 Parse the input voice, and determine a meaning of the input voice
  • Step 32 Perform a corresponding operation according to the meaning of the input voice.
  • the user inputs, through the preset interface, the content that actually needs to be executed, and sets an identification voice for that content through the voice interface.
  • after the terminal detects the identification voice, it executes the content corresponding to the preset information; this implements personalized voice control settings, greatly improves the security and serviceability of terminal voice input, and improves user satisfaction.
  • the embodiment of the present invention further provides a device for voice control, which is applied to the terminal side, and includes:
  • the voice acquiring module 301 is configured to acquire an input voice of the user.
  • the determining module 302 is configured to: if the terminal side starts the preset function, determine whether the terminal side stores the identification voice consistent with the input voice in advance;
  • the preset information obtaining module 303 is configured to: if the identification voice exists, obtain preset information that is not related to the meaning of the identifier voice according to the identifier voice;
  • the first execution module 304 is configured to perform an operation corresponding to the preset information.
  • the device further includes:
  • a parsing module configured to parse the input voice to determine a meaning of the input voice
  • the second execution module is configured to perform a corresponding operation according to the meaning of the input voice.
  • the device further includes:
  • An acquiring module configured to acquire preset information that is input by the user through a preset interface, where the preset information is used to instruct the terminal to perform a corresponding operation;
  • a setting module configured to, in response to the user inputting a voice through a voice interface, set the input voice as the identification voice of the preset information, where the preset information is unrelated to the content of the identification voice.
  • the acquiring module includes:
  • the first obtaining submodule is configured to acquire text input by the user through an input text interface.
  • the acquiring module includes:
  • the second obtaining submodule is configured to acquire the voice input by the user through the input voice interface.
  • the acquiring module includes:
  • a third obtaining submodule configured to obtain an instruction preset by the user
  • a fourth acquiring submodule configured to acquire an instruction that the user selects from the preset instructions by calling an instruction interface.
  • the function of the voice acquiring module 301 is actually implemented by a human-machine interface module on the terminal, and the functions of the determining module 302, the preset information acquiring module 303, and the first execution module 304 are implemented by a central processing unit on the terminal.
  • the terminal further includes a UI interface and a setting module; the specific connection relationship is as shown in FIG. 4.
  • the setting module lets the user customize the actual input content, and provides the corresponding functions for customizing and storing sounds;
  • the human-machine interface module is the interface that detects and collects the user's voice; it is connected to the setting module through the central processing unit, collects the sound, and passes the information to the central processing unit;
  • the central processing unit coordinates the human-machine interface module, the UI module, the setting module, and other functional modules; it processes the user's voice, retrieves the corresponding input customized in the setting module, and displays that input in the UI interface;
  • the UI interface displays the user-defined actual input content according to the processing and calls of the central processing unit.
  • the user utters the voice “yes”, and the terminal determines whether this is a sound it has stored as a custom sound; if the sound is not stored, the terminal does not respond; if the sound is stored, the terminal reads the preset information corresponding to the sound, for example the instruction to enter "I am at home" in the text box, and then automatically enters "I am at home" in the message edit box.
  • the user utters a sound, such as “something to make a call”, and the terminal determines whether this is a sound it has stored as a custom sound; if the sound is not stored, the terminal does not respond; if the sound is stored, the terminal reads the preset information corresponding to the sound, for example the instruction to send the voice content “test success”, and then automatically sends the voice message “test success” to the recipient.
  • the terminal determines whether the sound is one it has stored as a custom sound; if the sound is not stored, the terminal does not respond; if the sound is stored, the terminal reads the preset information corresponding to the sound, for example the instruction to send the voice content "I was caught by the police", and then automatically sends the voice message "I was caught by the police".
  • Embodiment 5:
  • the terminal determines whether the sound is one it has stored as a custom sound; if the sound is not stored, the terminal does not respond; if the sound is stored, the terminal reads the preset information corresponding to the sound, for example the instruction to scroll the web page down one page, and the web page on the terminal automatically turns to the next page.
  • the embodiment of the present invention further provides a terminal, where the terminal includes a processor configured to: acquire an input voice of the user; determine whether an identification voice consistent with the input voice is pre-stored on the terminal side; if the identification voice exists, acquire, according to the identification voice, preset information that is unrelated to the meaning of the identification voice; and perform the operation corresponding to the preset information.
  • the voice control method according to the embodiment of the present invention, if implemented in the form of a software function module and sold or used as a stand-alone product, may also be stored in a computer-readable storage medium.
  • the technical solution of the embodiments of the present invention may, in essence, be embodied in the form of a software product that is stored in a storage medium and includes a number of instructions that cause a computer device (which may be a personal computer, a server, a network device, etc.) to perform all or part of the methods described in the embodiments of the present invention.
  • the foregoing storage medium includes any medium that can store program code, such as a U disk, a removable hard disk, a read-only memory (ROM), a magnetic disk, or an optical disk.
  • the embodiment of the present invention further provides a computer storage medium, wherein a computer program for executing the voice control method of the embodiment of the present invention is stored.
  • the apparatus for voice control provided by the embodiment of the present invention is a device that utilizes the method of voice control described above, and all embodiments of the foregoing methods are applicable to the device, and all of the same or similar beneficial effects can be achieved.
  • because the preset information is set in advance and is unrelated to the meaning of the identification voice, other users cannot directly learn the true intention of the user; this realizes personalized voice control settings, greatly improves the security and serviceability of terminal voice input, and improves user satisfaction.
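The configuration switch described in the definitions above (custom mapping when the switch is on, literal interpretation per steps 31 and 32 when it is off) might be sketched as follows. This is only an illustration under stated assumptions: the function names `handle_voice` and `literal_meaning` are hypothetical, voices are represented as already-recognized strings rather than audio, and the string results stand in for real terminal operations.

```python
from typing import Callable, Dict


def literal_meaning(voice: str) -> str:
    # Hypothetical placeholder for steps 31-32: parse the voice and act on
    # what it literally says (the terminal's original behaviour).
    return f"execute literal command: {voice}"


def handle_voice(
    input_voice: str,
    switch_on: bool,
    presets: Dict[str, str],
    interpret_literally: Callable[[str], str],
) -> str:
    """Route a voice according to the configuration switch (assumed model)."""
    if switch_on:
        # Switch on: only stored identification voices trigger anything.
        preset = presets.get(input_voice)
        if preset is not None:
            return preset
        # The embodiments state the terminal does not respond to unknown sounds.
        return "no response"
    # Switch off: the terminal behaves as it did before the feature existed.
    return interpret_literally(input_voice)


presets = {"yes": "enter text: I am at home"}
on_result = handle_voice("yes", True, presets, literal_meaning)
off_result = handle_voice("dial 110", False, presets, literal_meaning)
```

Keeping the literal interpreter as a parameter mirrors the point made in the text that turning the switch off leaves the terminal's original function unaffected.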

Abstract

Provided are a voice control method and apparatus, and a storage medium. The method applied to a terminal side comprises: obtaining an input voice of a user; if the terminal side opens a pre-set function, determining whether an identification voice consistent with the input voice is pre-stored at the terminal side; if the identification voice exists, according to the identification voice, obtaining pre-set information irrelevant to the meaning of the identification voice; and executing an operation corresponding to the pre-set information.

Description

Method, device and storage medium for voice control
Technical field
The present invention relates to the field of communications technologies, and in particular to a voice control method, device, and storage medium.
Background
Mobile phones have become an inseparable part of daily life, so the security of mobile phone use matters more and more, and voice input is used more and more often. With the voice input currently on the market, the terminal recognizes the user's voice and then repeats or displays the literal meaning of what the user said. For example, with Siri (a voice control function introduced by Apple), users can have the phone read text messages, recommend restaurants, report the weather, set alarms by voice, and so on; Siri supports natural language input, can call the system's built-in applications such as weather forecast, calendar, and search, and can keep learning new voices and intonations to give conversational responses. However, because voice input in the prior art makes the terminal carry out the literal meaning of the voice input, other people can easily learn its purpose. For example, in a dangerous situation the user must say "dial 110" before the terminal will automatically dial 110; but once the user has said "dial 110", other people know the user's intention and can immediately intervene and cut off the call, which prevents the user from getting help. In summary, a method in which the terminal acts on the literal meaning of the user's voice lacks security: other people can easily learn the user's intention and interfere with the user's operation.
Summary of the invention
To solve the existing technical problems, embodiments of the present invention are mainly intended to provide a method, a device, and a storage medium for voice control.
An embodiment of the present invention provides a method for voice control, applied to a terminal side, the method including:
acquiring an input voice of a user;
if a preset function is enabled on the terminal side, determining whether an identification voice consistent with the input voice is pre-stored on the terminal side;
if the identification voice exists, acquiring, according to the identification voice, preset information that is set in advance and unrelated to the meaning of the identification voice;
performing the operation corresponding to the preset information.
An embodiment of the present invention further provides a device for voice control, applied to a terminal side, the device including a voice acquisition module, a determining module, a preset information acquiring module, and a first execution module, where:
the voice acquisition module is configured to acquire an input voice of a user;
the determining module is configured to determine, if a preset function is enabled on the terminal side, whether an identification voice consistent with the input voice is pre-stored on the terminal side;
the preset information acquiring module is configured to acquire, if the identification voice exists and according to the identification voice, preset information that is set in advance and unrelated to the meaning of the identification voice;
the first execution module is configured to perform the operation corresponding to the preset information.
An embodiment of the present invention further provides a terminal including a processor, the processor being configured to: acquire an input voice of a user; if a preset function is enabled on the terminal side, determine whether an identification voice consistent with the input voice is pre-stored on the terminal side; if the identification voice exists, acquire, according to the identification voice, preset information that is set in advance and unrelated to the meaning of the identification voice; and perform the operation corresponding to the preset information.
An embodiment of the present invention further provides a computer storage medium storing computer-executable instructions, the computer-executable instructions being used to execute the voice control method described above.
The above technical solution of the present invention has at least the following beneficial effects:
In the voice control method, device, and storage medium of the embodiments of the present invention, the input voice of the user is acquired and matched against the identification voice pre-stored on the terminal side; when the two match, preset information unrelated to the meaning of the identification voice is obtained, and the terminal performs the operation corresponding to that preset information. Because the preset information is set in advance and is unrelated to the meaning of the identification voice, other users cannot directly learn the user's real intention; this realizes personalized voice control settings, greatly improves the security and serviceability of terminal voice input, and improves user satisfaction.
Brief description of the drawings
FIG. 1 is a flowchart of the basic steps of a method for voice control according to an embodiment of the present invention;
FIG. 2 is a flowchart of the basic steps of a method for setting preset information in the method for voice control according to an embodiment of the present invention;
FIG. 3 is a schematic structural diagram of a device for voice control according to an embodiment of the present invention;
FIG. 4 is a schematic diagram of the connection relationships in a specific structure of the device for voice control according to an embodiment of the present invention;
FIG. 5 is a flowchart of the execution of specific embodiment 1 of the present invention;
FIG. 6 is a flowchart of the execution of specific embodiment 2 of the present invention;
FIG. 7 is a flowchart of the execution of specific embodiment 3 of the present invention;
FIG. 8 is a flowchart of the execution of specific embodiment 4 of the present invention;
FIG. 9 is a flowchart of the execution of specific embodiment 5 of the present invention.
Detailed description
To make the technical problems to be solved, the technical solutions, and the advantages of the present invention clearer, a detailed description is given below with reference to the drawings and specific embodiments.
The present invention addresses the problem that the voice control mode of terminals in the prior art is not secure, and provides a voice control method and device: the input voice of the user is acquired and matched against the identification voice pre-stored on the terminal side; when the two match, preset information unrelated to the meaning of the identification voice is obtained, and the terminal performs the operation corresponding to that preset information. Because the preset information is set in advance and is unrelated to the meaning of the identification voice, other users cannot directly learn the user's real intention; this realizes personalized voice control settings, greatly improves the security and serviceability of terminal voice input, and improves user satisfaction.
As shown in FIG. 1, an embodiment of the present invention provides a method for voice control, applied to a terminal side, including:
Step 11: acquire an input voice of the user;
Step 12: if the preset function is enabled on the terminal side, determine whether an identification voice consistent with the input voice is pre-stored on the terminal side;
Step 13: if the identification voice exists, acquire, according to the identification voice, preset information that is set in advance and unrelated to the meaning of the identification voice;
Step 14: perform the operation corresponding to the preset information.
In the above embodiment of the present invention, the input voice of the user is the sound uttered by the user. Specifically, the terminal is provided with a human-machine interface module, which is the interface that detects and collects the user's sound and passes the collected sound to the central processing unit of the terminal. The central processing unit on the terminal side performs step 12 and step 13, that is, it parses the input voice of the user and retrieves the preset information corresponding to the identification voice that matches the input voice; to keep voice input secure, the preset information is unrelated to the meaning of the identification voice.
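Steps 11 to 14 can be illustrated with a minimal sketch. Everything here is an assumption made for illustration: the class name `VoiceController`, the use of a normalized string as a stand-in for acoustic matching of the identification voice, and the callable that represents "the operation corresponding to the preset information" are not specified by the patent.

```python
from dataclasses import dataclass, field
from typing import Callable, Dict, Optional


@dataclass
class VoiceController:
    # Hypothetical model of the terminal side; not an API from the patent.
    preset_enabled: bool = True  # the "preset function" checked in step 12
    presets: Dict[str, Callable[[], str]] = field(default_factory=dict)

    def handle(self, input_voice: str) -> Optional[str]:
        """Steps 11-14: take the input voice, match it, run the preset."""
        if not self.preset_enabled:
            return None  # step 12 applies only when the function is enabled
        # Stand-in for matching against pre-stored identification voices.
        key = input_voice.strip().lower()
        operation = self.presets.get(key)  # steps 12-13: match, then fetch
        if operation is None:
            return None  # no identification voice is stored for this input
        return operation()  # step 14: perform the corresponding operation


controller = VoiceController()
# The spoken word "yes" says nothing about the operation it triggers.
controller.presets["yes"] = lambda: "entered text: I am at home"
result = controller.handle("Yes")
```

A real terminal would compare audio rather than strings; the dictionary lookup only shows that the operation is chosen by which identification voice matched, not by what the voice means.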
Preferably, as shown in FIG. 2, the preset information that is unrelated to the meaning of the identification voice is set up through the following steps:
Step 21: acquire the preset information input by the user through a preset interface, where the preset information instructs the terminal to perform a corresponding operation;
Step 22: in response to the user inputting a voice through a voice interface, set the input voice as the identification voice for the preset information, where the preset information and the content of the identification voice are unrelated.
In the above embodiment, the preset information is the operation the user actually wants the terminal to perform, and the user defines it through a preset interface. The preset interfaces mainly include an interface for inputting text, an interface for inputting voice, and an interface for invoking commands. In this setting method the user also sets an identification voice for the preset information through the voice interface, and identification voices and preset information are in one-to-one correspondence. That is, the terminal monitors the user's voice; if the voice matches one of the identification voices, the terminal obtains the preset information corresponding to that identification voice and performs the corresponding operation. Because the terminal does not act directly on the literal meaning of the user's speech, the security of voice control on the terminal is improved.
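One way to picture steps 21 and 22 together with the one-to-one correspondence is a small registry. The sketch below is an assumption-laden illustration: `PresetRegistry`, `register`, and `lookup` are hypothetical names, voices are stood in for by normalized strings, and the requirement that the two contents be unrelated is not checked because the source gives no test for it.

```python
from typing import Dict, Optional


class PresetRegistry:
    """One-to-one table between identification voices and preset information."""

    def __init__(self) -> None:
        self._preset_by_voice: Dict[str, str] = {}
        self._voice_by_preset: Dict[str, str] = {}

    @staticmethod
    def _key(voice: str) -> str:
        # Hypothetical stand-in for real voice matching.
        return " ".join(voice.lower().split())

    def register(self, preset_info: str, identification_voice: str) -> None:
        # Step 21 supplies preset_info; step 22 binds the recorded voice to it.
        key = self._key(identification_voice)
        if key in self._preset_by_voice:
            raise ValueError("identification voice already in use")
        if preset_info in self._voice_by_preset:
            raise ValueError("preset information already has a voice")
        self._preset_by_voice[key] = preset_info
        self._voice_by_preset[preset_info] = key

    def lookup(self, input_voice: str) -> Optional[str]:
        return self._preset_by_voice.get(self._key(input_voice))


registry = PresetRegistry()
registry.register("I'm at home", "yes")
```

Rejecting a second binding for the same voice or the same preset keeps the table one-to-one, so a detected voice never resolves to more than one operation.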
Specifically, in a specific embodiment of the present invention, when the preset interface is a text input interface, step 21 is specifically:
Step 211: acquire the text input by the user through the text input interface.
Alternatively, when the preset interface is a voice input interface, step 21 is specifically:
Step 212: acquire the voice input by the user through the voice input interface.
Alternatively, when the preset interface is a command invocation interface, step 21 is specifically:
Step 213: acquire the commands preset by the user;
Step 214: acquire the command selected by the user from the preset commands through the command invocation interface.
In a specific application of this embodiment, the terminal-side interface for inputting text is the text input mode of the user interface (UI), the interface for inputting voice is the voice input mode of the UI, and the interface for invoking commands is the command input mode of the UI. Specifically, the terminal can define the usage scenarios for text, command, and voice input. For example, text input can be enabled in every editing interface of the terminal; text and voice input can be enabled in the conversation interface of a chat application; and command input such as "turn page" or "exit" can be enabled while browsing web pages.
For example, if the user chooses to customize "text input":
1. An interface is provided for the user to input text; for example, the user enters "I'm at home".
2. An interface is provided for the user to define a voice, and a voice such as "yes" is defined for "I'm at home". The user's own voice or another sound chosen by the user can be recorded.
3. In any editing interface of the terminal, when the phone detects a voice consistent with the defined voice "yes", it automatically enters the text "I'm at home" in the edit box.
If the user chooses to customize "voice input":
1. An interface is provided for the user to input a voice; for example, the user records the voice message "The test succeeded".
2. An interface is provided for the user to define a voice, and a voice such as "yes" is defined for "The test succeeded". The user's own voice or another sound chosen by the user can be recorded.
3. In an interactive chat interface, when the phone detects a voice consistent with the defined voice "yes", it automatically sends the voice message "The test succeeded".
If the user chooses to customize "command input":
1. Some commands are defined first, and an interface is provided for the user to invoke them; for example, a "web page turn" command is defined.
2. The user selects the web page turn command and defines a voice such as "turn page" for it. The user's own voice or another sound chosen by the user can be recorded.
3. In a browser or document reading interface, when the phone detects a voice consistent with the defined voice "turn page", the web page or document turns the page automatically.
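The three kinds of preset information and the usage scenarios described above can be modeled as a small lookup, sketched below. All names (`PresetKind`, `Scene`, `ALLOWED_KINDS`, `resolve`) are hypothetical, the scene table is one reading of the examples rather than a rule stated in the source, and the voice preset is bound to "ok" instead of "yes" only so that this single table stays one-to-one.

```python
from dataclasses import dataclass
from enum import Enum
from typing import Dict, Optional


class PresetKind(Enum):
    TEXT = "text"
    VOICE = "voice"
    COMMAND = "command"


class Scene(Enum):
    EDITOR = "editor"
    CHAT = "chat"
    BROWSER = "browser"


# Which kinds of preset may fire in which scene (assumed from the examples).
ALLOWED_KINDS = {
    Scene.EDITOR: {PresetKind.TEXT},
    Scene.CHAT: {PresetKind.TEXT, PresetKind.VOICE},
    Scene.BROWSER: {PresetKind.COMMAND},
}


@dataclass(frozen=True)
class Preset:
    kind: PresetKind
    payload: str


def resolve(scene: Scene, input_voice: str,
            presets: Dict[str, Preset]) -> Optional[Preset]:
    """Return the preset for this voice if its kind is enabled in the scene."""
    preset = presets.get(input_voice.strip().lower())
    if preset is None or preset.kind not in ALLOWED_KINDS[scene]:
        return None
    return preset


PRESETS = {
    "yes": Preset(PresetKind.TEXT, "I'm at home"),
    "ok": Preset(PresetKind.VOICE, "test_succeeded.wav"),  # hypothetical file name
    "turn page": Preset(PresetKind.COMMAND, "page_down"),  # hypothetical command id
}
```

With this table, saying "yes" in an editor yields the text preset, while the same voice in a browser yields nothing because text input is not enabled there.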
It should be noted that when the voice control method of this embodiment is deployed on a terminal, a configuration switch is also provided. The method takes effect only when the switch is on. If the switch is off, the terminal recognizes the user's voice normally and performs the operation corresponding to its actual meaning, so the terminal's original functions are unaffected. This configuration provides user-defined voice input and improves the security of the terminal.
Specifically, if the preset function is not enabled on the terminal side, the method further includes:
Step 31: parse the input voice and determine its meaning;
Step 32: perform the corresponding operation according to the meaning of the input voice.
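The effect of the configuration switch can be sketched as a two-branch dispatcher. This is an illustration under stated assumptions: `handle_voice`, `normalize`, and the two dictionaries are hypothetical, and "parsing the meaning" in step 31 is reduced to normalizing the utterance, whereas a real terminal would run speech recognition.

```python
from typing import Dict, Optional


def normalize(voice: str) -> str:
    # Hypothetical stand-in for both voice matching and speech recognition.
    return " ".join(voice.lower().split())


def handle_voice(input_voice: str,
                 switch_on: bool,
                 presets: Dict[str, str],
                 literal_operations: Dict[str, str]) -> Optional[str]:
    key = normalize(input_voice)
    if switch_on:
        # Preset function enabled: only stored identification voices trigger
        # anything; an unknown voice gets no response (None).
        return presets.get(key)
    # Step 31: determine the literal meaning of the input voice.
    meaning = key
    # Step 32: perform the operation that matches that literal meaning.
    return literal_operations.get(meaning)


PRESETS = {"ah ah ah": "call 110"}
LITERAL = {"dial 110": "call 110"}
```

With the switch on, "ah ah ah" places the call and "dial 110" does nothing; with the switch off the behavior is reversed, which matches the statement that the terminal's original functions are unaffected.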
In the method for setting preset information in this embodiment, the user inputs the content that actually needs to be executed (the preset information) through a preset interface and sets an identification voice for it through the voice interface. When the terminal detects the identification voice, it executes the content corresponding to the preset information. This enables personalized voice control settings, improves the security and serviceability of voice input on the terminal, and improves user satisfaction.
To better implement the above method, as shown in FIG. 3, an embodiment of the present invention further provides a voice control apparatus applied on the terminal side, including:
a voice acquisition module 301, configured to acquire the user's input voice;
a determining module 302, configured to determine, if the preset function is enabled on the terminal side, whether the terminal side has pre-stored an identification voice consistent with the input voice;
a preset information acquisition module 303, configured to obtain, if the identification voice exists and according to the identification voice, the preset information that was set in advance and is unrelated to the meaning of the identification voice;
a first execution module 304, configured to perform the operation corresponding to the preset information.
Specifically, in the above embodiment, if the preset function is not enabled on the terminal side, the apparatus further includes:
a parsing module, configured to parse the input voice and determine its meaning;
a second execution module, configured to perform the corresponding operation according to the meaning of the input voice.
Specifically, in the above embodiment, the apparatus further includes:
an acquisition module, configured to acquire the preset information input by the user through a preset interface, where the preset information instructs the terminal to perform a corresponding operation;
a setting module, configured to set, in response to the user inputting a voice through a voice interface, the input voice as the identification voice for the preset information, where the preset information and the content of the identification voice are unrelated.
Specifically, in the above embodiment, the acquisition module includes:
a first acquisition submodule, configured to acquire the text input by the user through the text input interface.
Specifically, in the above embodiment, the acquisition module includes:
a second acquisition submodule, configured to acquire the voice input by the user through the voice input interface.
Specifically, in the above embodiment, the acquisition module includes:
a third acquisition submodule, configured to acquire the commands preset by the user;
a fourth acquisition submodule, configured to acquire the command selected by the user from the preset commands through the command invocation interface.
In a specific embodiment, the function of the voice acquisition module 301 is implemented on the terminal by a human-machine interface module, and the functions of the determining module 302, the preset information acquisition module 303, and the first execution module 304 are implemented by a central processing unit. The terminal further includes a UI interface and a setting module. The connections are shown in FIG. 4. The setting module lets the user define the actual input content and provides the corresponding custom sound and storage functions. The human-machine interface module is the interface that detects and collects the user's voice; it is connected to the setting module through the central processing unit, and it collects sound and passes the information to the central processing unit. The central processing unit coordinates the human-machine interface module, the UI module, the setting module, and other functional modules; it processes the user's voice, retrieves the corresponding input defined in the custom voice input module, and displays that input on the UI interface. The UI interface displays the user-defined actual input content according to the processing and retrieval performed by the central processing unit.
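The module relationships of FIG. 4 can be approximated with four small classes. This is only a sketch: the class and method names are hypothetical, voices are represented as strings, and the central processing unit is assumed to look up custom input in the setting module, since the source does not describe a separate store.

```python
from typing import Dict, List, Optional


class SettingModule:
    """Stores user-defined content keyed by its custom sound."""

    def __init__(self) -> None:
        self._content_by_voice: Dict[str, str] = {}

    def define(self, identification_voice: str, content: str) -> None:
        self._content_by_voice[identification_voice.strip().lower()] = content

    def lookup(self, voice: str) -> Optional[str]:
        return self._content_by_voice.get(voice.strip().lower())


class UIInterface:
    """Records what would be shown on screen."""

    def __init__(self) -> None:
        self.displayed: List[str] = []

    def show(self, content: str) -> None:
        self.displayed.append(content)


class CentralProcessor:
    """Processes the voice, retrieves the custom input, and drives the UI."""

    def __init__(self, setting: SettingModule, ui: UIInterface) -> None:
        self.setting = setting
        self.ui = ui

    def process(self, voice: str) -> Optional[str]:
        content = self.setting.lookup(voice)
        if content is not None:
            self.ui.show(content)
        return content


class HumanMachineInterface:
    """Collects sound and forwards it to the central processor."""

    def __init__(self, cpu: CentralProcessor) -> None:
        self.cpu = cpu

    def on_sound(self, voice: str) -> Optional[str]:
        return self.cpu.process(voice)


setting = SettingModule()
ui = UIInterface()
hmi = HumanMachineInterface(CentralProcessor(setting, ui))
setting.define("yes", "I'm at home")
hmi.on_sound("Yes")
```

The human-machine interface never touches the setting module directly; every lookup goes through the central processor, mirroring the connection described for FIG. 4.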
The details are as follows.
Embodiment 1:
As shown in FIG. 5, in the message editing interface the user utters the sound "yes", and the terminal determines whether this sound has been stored as a custom sound. If it has not been stored, the terminal does not respond. If it has been stored, the terminal reads the preset information corresponding to the sound, for example "enter 'I'm at home' in the text box", and then automatically enters "I'm at home" in the message edit box.
Embodiment 2:
As shown in FIG. 6, the user utters a sound such as "ah ah ah", and the terminal determines whether this sound has been stored as a custom sound. If it has not been stored, the terminal does not respond. If it has been stored, the terminal reads the preset information corresponding to the sound, for example "automatically call 110", and then automatically calls 110.
Embodiment 3:
As shown in FIG. 7, in the message editing interface the user utters a sound such as "call me if anything comes up", and the terminal determines whether this sound has been stored as a custom sound. If it has not been stored, the terminal does not respond. If it has been stored, the terminal reads the preset information corresponding to the sound, for example "automatically send the voice message 'The test succeeded'", and then automatically sends the voice message "The test succeeded" to the message recipient.
Embodiment 4:
As shown in FIG. 8, during a call the user utters a sound such as "I'm fine now", and the terminal determines whether this sound has been stored as a custom sound. If it has not been stored, the terminal does not respond. If it has been stored, the terminal reads the preset information corresponding to the sound, for example "automatically send the voice message 'I've been caught by the police'", and then automatically sends the voice message "I've been caught by the police".
Embodiment 5:
As shown in FIG. 9, while browsing a web page the user utters a sound such as "turn page", and the terminal determines whether this sound has been stored as a custom sound. If it has not been stored, the terminal does not respond. If it has been stored, the terminal reads the preset information corresponding to the sound, for example "scroll the web page down one page", and the web page on the terminal automatically turns to the next page.
To better implement the method of this embodiment, an embodiment of the present invention further provides a terminal that includes a processor. The processor is configured to: acquire the user's input voice; if the preset function is enabled on the terminal side, determine whether the terminal side has pre-stored an identification voice consistent with the input voice; if the identification voice exists, obtain, according to the identification voice, the preset information that was set in advance and is unrelated to the meaning of the identification voice; and perform the operation corresponding to the preset information.
If the voice control method of this embodiment is implemented as a software functional module and sold or used as a standalone product, it may also be stored in a computer-readable storage medium. On this understanding, the technical solution of the embodiments, or the part of it that contributes over the prior art, may be embodied as a software product. The software product is stored in a storage medium and includes instructions that cause a computer device (which may be a personal computer, a server, a network device, or the like) to perform all or part of the methods described in the embodiments. The storage medium includes any medium that can store program code, such as a USB flash drive, a removable hard disk, a read-only memory (ROM), a magnetic disk, or an optical disc. Embodiments of the present invention are therefore not limited to any specific combination of hardware and software.
Accordingly, an embodiment of the present invention further provides a computer storage medium storing a computer program that performs the voice control method of the embodiments.
It should be noted that the voice control apparatus provided by the embodiments is an apparatus that applies the voice control method described above. All embodiments of the method therefore apply to the apparatus and achieve the same or similar beneficial effects.
The above are preferred embodiments of the present invention. A person of ordinary skill in the art may make improvements and refinements without departing from the principles of the present invention, and such improvements and refinements also fall within the scope of protection of the present invention.
Industrial applicability
In the embodiments of the present invention, preset information that is unrelated to the meaning of the identification voice is set in advance, so other people cannot directly learn the user's true intention. This enables personalized voice control settings, improves the security and serviceability of voice input on the terminal, and improves user satisfaction.

Claims (14)

  1. A voice control method, applied on a terminal side, the method comprising:
    acquiring an input voice of a user;
    if a preset function is enabled on the terminal side, determining whether the terminal side has pre-stored an identification voice consistent with the input voice;
    if the identification voice exists, obtaining, according to the identification voice, preset information that is set in advance and is unrelated to the meaning of the identification voice;
    performing an operation corresponding to the preset information.
  2. The voice control method according to claim 1, wherein, if the preset function is not enabled on the terminal side, the method further comprises:
    parsing the input voice to determine the meaning of the input voice;
    performing a corresponding operation according to the meaning of the input voice.
  3. The voice control method according to claim 1, wherein the steps of setting the preset information that is unrelated to the meaning of the identification voice comprise:
    acquiring preset information input by the user through a preset interface, the preset information being used to instruct the terminal to perform a corresponding operation;
    in response to the user inputting a voice through a voice interface, setting the input voice as the identification voice for the preset information, wherein the preset information and the content of the identification voice are unrelated.
  4. The voice control method according to claim 3, wherein acquiring the preset information input by the user through the preset interface comprises:
    acquiring text input by the user through a text input interface.
  5. The voice control method according to claim 3, wherein acquiring the preset information input by the user through the preset interface comprises:
    acquiring a voice input by the user through a voice input interface.
  6. The voice control method according to claim 3, wherein acquiring the preset information input by the user through the preset interface comprises:
    acquiring commands preset by the user;
    acquiring a command selected by the user from the preset commands through a command invocation interface.
  7. A voice control apparatus, applied on a terminal side, the apparatus comprising a voice acquisition module, a determining module, a preset information acquisition module, and a first execution module, wherein:
    the voice acquisition module is configured to acquire an input voice of a user;
    the determining module is configured to determine, if a preset function is enabled on the terminal side, whether the terminal side has pre-stored an identification voice consistent with the input voice;
    the preset information acquisition module is configured to obtain, if the identification voice exists and according to the identification voice, preset information that is set in advance and is unrelated to the meaning of the identification voice;
    the first execution module is configured to perform an operation corresponding to the preset information.
  8. The voice control apparatus according to claim 7, wherein, if the preset function is not enabled on the terminal side, the apparatus further comprises:
    a parsing module, configured to parse the input voice to determine the meaning of the input voice;
    a second execution module, configured to perform a corresponding operation according to the meaning of the input voice.
  9. The voice control apparatus according to claim 7, wherein the apparatus further comprises:
    an acquisition module, configured to acquire preset information input by the user through a preset interface, the preset information being configured to instruct the terminal to perform a corresponding operation;
    a setting module, configured to set, in response to the user inputting a voice through a voice interface, the input voice as the identification voice for the preset information, wherein the preset information and the content of the identification voice are unrelated.
  10. The voice control apparatus according to claim 9, wherein the acquisition module comprises:
    a first acquisition submodule, configured to acquire text input by the user through a text input interface.
  11. The voice control apparatus according to claim 9, wherein the acquisition module comprises:
    a second acquisition submodule, configured to acquire a voice input by the user through a voice input interface.
  12. The voice control apparatus according to claim 9, wherein the acquisition module comprises:
    a third acquisition submodule, configured to acquire commands preset by the user;
    a fourth acquisition submodule, configured to acquire a command selected by the user from the preset commands through a command invocation interface.
  13. A terminal comprising a processor, the processor being configured to: acquire an input voice of a user; if a preset function is enabled on the terminal side, determine whether the terminal side has pre-stored an identification voice consistent with the input voice; if the identification voice exists, obtain, according to the identification voice, preset information that is set in advance and is unrelated to the meaning of the identification voice; and perform an operation corresponding to the preset information.
  14. A computer storage medium storing computer-executable instructions, the computer-executable instructions being used to perform the method according to any one of claims 1 to 6.
PCT/CN2015/072705 2014-11-25 2015-02-10 Voice control method and apparatus, and storage medium WO2016082344A1 (en)

Applications Claiming Priority (2)

Application Number Priority Date Filing Date Title
CN201410689720.3A CN105611033A (en) 2014-11-25 2014-11-25 Method and device for voice control
CN201410689720.3 2014-11-25

Publications (1)

Publication Number Publication Date
WO2016082344A1 true WO2016082344A1 (en) 2016-06-02

Family

ID=55990566

Family Applications (1)

Application Number Title Priority Date Filing Date
PCT/CN2015/072705 WO2016082344A1 (en) 2014-11-25 2015-02-10 Voice control method and apparatus, and storage medium

Country Status (2)

Country Link
CN (1) CN105611033A (en)
WO (1) WO2016082344A1 (en)

Families Citing this family (6)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN105869643A (en) * 2016-06-06 2016-08-17 青岛海信移动通信技术股份有限公司 Terminal control method based on voice and voice control device
CN107547726A (en) * 2016-06-24 2018-01-05 中兴通讯股份有限公司 A kind of mobile terminal sound command processing method and device
CN107545892B (en) * 2016-06-24 2021-07-30 中兴通讯股份有限公司 Equipment control method, device and system
CN109597657B (en) * 2017-09-29 2022-04-29 阿里巴巴(中国)有限公司 Operation method and device for target application and computing equipment
CN108632463A (en) * 2018-04-24 2018-10-09 维沃移动通信有限公司 A kind of sound control method and mobile terminal
CN109087640A (en) * 2018-08-22 2018-12-25 蔚来汽车有限公司 Information interacting method, system and vehicle device and server for information exchange

Citations (5)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN103458090A (en) * 2012-05-28 2013-12-18 百度在线网络技术(北京)有限公司 Mobile terminal control method and mobile terminal control device
CN103448632A (en) * 2012-05-28 2013-12-18 百度在线网络技术(北京)有限公司 Automobile control method and device
US20140049697A1 (en) * 2012-08-14 2014-02-20 Kentec Inc. Television device and method for displaying virtual on-screen interactive moderator
CN103674012A (en) * 2012-09-21 2014-03-26 高德软件有限公司 Voice customizing method and device and voice identification method and device
CN103793641A (en) * 2014-02-27 2014-05-14 联想(北京)有限公司 Information processing method and device, and electronic device

Family Cites Families (1)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN103646646B (en) * 2013-11-27 2018-08-31 联想(北京)有限公司 A kind of sound control method and electronic equipment

Also Published As

Publication number Publication date
CN105611033A (en) 2016-05-25

Similar Documents

Publication Publication Date Title
WO2016082344A1 (en) Voice control method and apparatus, and storage medium
US10930277B2 (en) Configuration of voice controlled assistant
KR101726945B1 (en) Reducing the need for manual start/end-pointing and trigger phrases
US9811870B2 (en) Information processing method, apparatus and payment system
US9263029B2 (en) Instant communication voice recognition method and terminal
US10547720B2 (en) Method and system for automatically saving unknown number in mobile phone
CN105072178B (en) Cell-phone number binding information acquisition methods and device
RU2017124103A (en) MAKING A TASK WITHOUT A MONITOR IN A DIGITAL PERSONAL ASSISTANT
WO2015027789A1 (en) Language control method, device and terminal
KR20140141916A (en) Apparatus and Method for operating a receiving notification function of a user device
CN109243443B (en) Voice control method and device and electronic equipment
WO2016201767A1 (en) Voice control method and device, and computer storage medium
US20170064084A1 (en) Method and Apparatus for Implementing Voice Mailbox
CN105245729A (en) Message reading method and device for mobile terminal
CN107483736B (en) Message processing method and device for instant messaging application program
WO2015188459A1 (en) Terminal control method and device, voice control device and terminal
CN104735238A (en) Communication recording method and device
WO2015103842A1 (en) Message responding method and device
CN109087643A (en) Sound control method, device and electronic equipment
CN107170450A (en) Audio recognition method and device
WO2020063451A1 (en) Call voicemail messaging method, terminal, and device having storage function
CN103064828A (en) Method and device for text operating
US20170118586A1 (en) Voice data transmission processing method, terminal and computer storage medium
CN104572007A (en) Method for adjusting sound volume of terminal
KR101643808B1 (en) Method and system of providing voice service using interoperation between application and server

Legal Events

Date Code Title Description
121 Ep: the epo has been informed by wipo that ep was designated in this application

Ref document number: 15864005

Country of ref document: EP

Kind code of ref document: A1

NENP Non-entry into the national phase

Ref country code: DE

122 Ep: pct application non-entry in european phase

Ref document number: 15864005

Country of ref document: EP

Kind code of ref document: A1