CN110597480A

CN110597480A - User-defined voice instruction implementation method and terminal

Info

Publication number: CN110597480A
Application number: CN201910883587.8A
Authority: CN
Inventors: 赵云; 李凯; 王德中; 王明宇
Original assignee: Hisense Mobile Communications Technology Co Ltd
Current assignee: Hisense Mobile Communications Technology Co Ltd
Priority date: 2019-09-18
Filing date: 2019-09-18
Publication date: 2019-12-20
Anticipated expiration: 2039-09-18
Also published as: CN110597480B

Abstract

The application discloses a method and a terminal for implementing user-defined voice instructions, relates to the field of voice assistants, and aims to solve the prior-art problem that using a voice assistant involves complicated operation. The method comprises: collecting voice input by a user through a microphone and recognizing the voice; if a corresponding voice instruction is selected from preset voice instructions according to the recognition result, determining the instruction set corresponding to the selected voice instruction according to the correspondence between voice instructions and instruction sets, wherein the instruction set comprises at least two instructions; and executing the instructions in the instruction set. That is, once the voice input by the user is recognized as a preset voice instruction, the instructions in the instruction set corresponding to that voice instruction are executed. The user can therefore execute a plurality of instructions by inputting a voice instruction only once to accomplish what the user wants, which simplifies the user's operation and improves the user experience.

Description

User-defined voice instruction implementation method and terminal

Technical Field

The application relates to the field of voice assistants, in particular to a method and a terminal for realizing a user-defined voice instruction.

Background

The voice assistant is a function of the intelligent terminal that receives voice input by a user and helps the user execute the instruction expressed by that voice. If the user has downloaded a large amount of application software to the terminal, then to use a particular application the user has to search for it among all the downloaded application software, which is very time-consuming. When the voice assistant is used to find the application software, the user only needs to input the voice corresponding to the application software and the terminal finds it automatically, which simplifies the user's operation.

However, in many scenarios, a user needs to input instructions to the voice assistant many times to implement what the user wants to do, so in the prior art, the use of the voice assistant has a problem of complicated operation.

Disclosure of Invention

The application provides a user-defined voice instruction implementation method and a user-defined voice instruction implementation terminal, which are used for solving the problem that in the prior art, the operation is complex due to the use of a voice assistant.

In a first aspect, an embodiment of the present application provides a method for implementing a custom voice instruction, where the method includes:

collecting voice input by a user through a microphone, and identifying the voice input by the user;

if a corresponding voice instruction is selected from preset voice instructions according to the recognition result, determining an instruction set corresponding to the selected voice instruction according to the corresponding relation between the voice instruction and the instruction set; wherein the instruction set comprises at least two instructions;

executing instructions of the set of instructions.

According to the method, after the voice input by the user is recognized to be the preset voice command, the command in the command set corresponding to the voice command is executed. Therefore, the user can execute a plurality of instructions only by inputting the voice instruction once so as to realize the things that the user wants to complete, thereby simplifying the operation of the user and improving the use experience of the user.

In one possible implementation, the correspondence between the voice command and the command set is established by:

responding to an instruction for establishing a corresponding relation between a voice instruction and an instruction set, and taking information input by a user as the voice instruction; and

taking at least two instructions selected by a user from a preset instruction set and/or a custom instruction set as an instruction set corresponding to the voice instruction.

According to the method, the voice instruction and the instruction set corresponding to the voice instruction are input by the user, so that the voice assistant can better conform to the use habit of the user. Meanwhile, the user selects the voice command and the command set corresponding to the voice command, so that the user can realize the things the user wants to complete through one voice command.
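As an illustrative sketch only, the correspondence described above could be held as a mapping from a voice instruction to an ordered list of instructions. The class name `CorrespondenceStore`, its methods, and the voice instruction "go home" are hypothetical and are not taken from the application; Python is used purely for illustration.

```python
class CorrespondenceStore:
    """Hypothetical store mapping a voice instruction to its instruction set."""

    def __init__(self):
        self._table = {}

    def establish(self, voice_instruction, selected_instructions):
        # The method requires an instruction set of at least two instructions.
        if len(selected_instructions) < 2:
            raise ValueError("an instruction set needs at least two instructions")
        # Keep the order in which the user selected the instructions.
        self._table[voice_instruction] = list(selected_instructions)

    def instruction_set_for(self, voice_instruction):
        # Returns None when no correspondence has been established.
        return self._table.get(voice_instruction)


store = CorrespondenceStore()
store.establish("go home", ["navigate home", "call family member", "play music"])
```

A single entry thus ties one voice instruction to several instructions, which is what allows one utterance to trigger all of them.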

In a possible implementation manner, the selecting a corresponding voice instruction from preset voice instructions according to the recognition result includes:

determining the characteristic information of the voice according to the recognition result;

carrying out similarity matching on the characteristic information of the voice and a preset voice instruction;

and taking the voice instruction with the similarity larger than a preset threshold value as the voice instruction corresponding to the recognition result.

According to the method, the voice instruction is determined by performing similarity matching on the characteristic information of the voice input by the user and the preset voice instruction. Therefore, even if the voice input by the user is not completely the same as the preset voice command, the voice of the user can still be recognized, so that the requirement on the voice input by the user is not high, the voice recognition is more humanized, and the user experience is improved.
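The matching step can be sketched as follows. The application does not specify the feature information or the similarity measure, so this sketch assumes that the recognized text itself serves as the feature information, uses the character-based ratio from Python's standard `difflib` module as a stand-in similarity, and assumes a threshold of 0.8. The function name `select_voice_instruction` is hypothetical.

```python
from difflib import SequenceMatcher


def select_voice_instruction(recognized_text, preset_instructions, threshold=0.8):
    """Return the preset voice instruction most similar to the recognized text,
    or None if no similarity is larger than the (assumed) threshold."""
    best, best_score = None, threshold
    for candidate in preset_instructions:
        score = SequenceMatcher(
            None, recognized_text.lower(), candidate.lower()
        ).ratio()
        # Only a similarity strictly larger than the threshold qualifies.
        if score > best_score:
            best, best_score = candidate, score
    return best


presets = ["good morning", "good night"]
matched = select_voice_instruction("good mornin", presets)
unmatched = select_voice_instruction("play music", presets)
```

When several preset voice instructions exceed the threshold, this sketch keeps the most similar one; that tie-breaking rule is a design choice of the sketch, not something stated in the application.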

In one possible implementation, the method further includes:

responding to an editing instruction of a user, and editing a voice instruction specified by the editing instruction and/or an instruction set specified by the editing instruction; and/or

responding to a deleting instruction of a user, and deleting the voice instruction specified by the deleting instruction and an instruction set corresponding to the voice instruction.

According to the method, the user can edit or delete the voice instruction and the instruction set corresponding to the voice instruction, so that the preset voice instruction and the instruction set corresponding to the voice instruction are more in line with the requirements of the user, and the things to be completed by the user are realized.

In one possible implementation, the executing the instructions in the instruction set includes:

and executing the instructions in the instruction set in sequence according to the order of the instructions in the instruction set.

According to the method, the events to be completed by the user can be accurately completed by sequentially executing the instructions in the instruction set, and the accuracy of the completed events is ensured.

In one possible implementation, the executing the instructions in the instruction set in order according to the order of the instructions in the instruction set includes:

according to the order of the instructions in the instruction set, after the operation corresponding to an instruction is executed, checking whether the operation corresponding to the next instruction and the operation being executed influence each other;

if the operation corresponding to the next instruction and the operation being executed are mutually influenced and the priority of the operation corresponding to the next instruction is lower than that of the operation being executed, the operation corresponding to the next instruction is executed after the operation being executed is finished;

if the operation corresponding to the next instruction and the operation being executed are mutually influenced and the priority of the operation corresponding to the next instruction is higher than that of the operation being executed, interrupting the operation being executed, executing the operation corresponding to the next instruction, and recovering the interrupted operation after the operation corresponding to the next instruction is completed;

if the operation corresponding to the next instruction and the operation being executed influence each other, and the priority of the operation corresponding to the next instruction is equal to the priority of the operation being executed, stopping the operation being executed, and executing the operation corresponding to the next instruction;

and if the operation corresponding to the next instruction and the operation being executed do not influence each other, executing the operation corresponding to the next instruction.

According to the method, the corresponding operation is executed by judging whether the instructions in the instruction set are mutually influenced or not, so that the instructions in the instruction set can be completed in sequence while the operation corresponding to the instructions is accurately executed.

In one possible implementation, if the operations corresponding to the two instructions cannot be executed simultaneously, it is determined that the operations corresponding to the two instructions affect each other.

The method determines whether the operations of the two instructions affect each other by determining whether the operations corresponding to the two instructions can be executed simultaneously, so as to determine how to execute the instructions in the instruction set in sequence.
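The four cases above can be sketched as a small decision function. How the terminal determines that two operations cannot be executed simultaneously is not specified, so this sketch assumes that each operation declares the resources it needs exclusively and that two operations influence each other when they share a resource. The `Operation` type, the `resources` field, the priority values and the returned strings are all hypothetical.

```python
from dataclasses import dataclass, field


@dataclass
class Operation:
    name: str
    priority: int
    # Hypothetical: resources the operation needs exclusively (e.g. "audio").
    resources: frozenset = field(default_factory=frozenset)


def influence_each_other(a, b):
    # Assumption: operations sharing a resource cannot run simultaneously.
    return bool(a.resources & b.resources)


def decide(next_op, running_op):
    """Return how the next operation is handled relative to the running one."""
    if not influence_each_other(next_op, running_op):
        return "execute next directly"
    if next_op.priority < running_op.priority:
        return "execute next after running operation completes"
    if next_op.priority > running_op.priority:
        return "interrupt running, execute next, then resume running"
    return "stop running, execute next"


broadcast = Operation("broadcast date and time", 2, frozenset({"audio"}))
music = Operation("play music", 1, frozenset({"audio"}))
radio = Operation("play radio", 1, frozenset({"audio"}))
navigate = Operation("navigate home", 1, frozenset({"map"}))
```

Under these assumed priorities, a broadcast that arrives while music is playing interrupts the music and lets it resume afterwards, whereas navigation does not share a resource with music and is simply started.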

In a second aspect, an embodiment of the present application provides a customized voice instruction implementing terminal, including: a processor, a memory and a camera, wherein the memory stores program code that, when executed by the processor, causes the terminal to perform the following:

executing instructions of the set of instructions.

In one possible implementation, the processor is specifically configured to:

In a possible implementation manner, the terminal further includes:

In one possible implementation, the processor is specifically configured to:

In a third aspect, an embodiment of the present application further provides a terminal for implementing a customized voice instruction, where the terminal includes a recognition module, an instruction set determination module, and an execution module:

the recognition module is used for collecting the voice input by the user through the microphone and recognizing the voice input by the user;

the instruction set determining module is used for determining an instruction set corresponding to the selected voice instruction according to the corresponding relation between the voice instruction and the instruction set if the corresponding voice instruction is selected from preset voice instructions according to the recognition result; wherein the instruction set comprises at least two instructions;

and the execution module is used for executing the instructions in the instruction set.

In one possible implementation, the correspondence between the voice command and the command set is established by the following modules:

the response module is used for responding to an instruction for establishing a corresponding relation between a voice instruction and an instruction set and taking information input by a user as the voice instruction; and

the selection module is used for taking at least two instructions selected by a user from a preset instruction set and/or a user-defined instruction set as an instruction set corresponding to the voice instruction.

In one possible implementation, the instruction set determination module includes:

a feature information determining unit for determining feature information of the voice according to the recognition result;

the matching unit is used for matching the similarity of the characteristic information of the voice with a preset voice instruction;

and the voice instruction determining unit is used for taking the voice instruction with the similarity larger than a preset threshold value as the voice instruction corresponding to the recognition result.

In a possible implementation manner, the terminal further includes:

the editing module is used for responding to an editing instruction of a user and editing a voice instruction specified by the editing instruction and/or an instruction set specified by the editing instruction; and/or

the deleting module is used for responding to a deleting instruction of a user and deleting the voice instruction specified by the deleting instruction and the instruction set corresponding to the voice instruction.

In one possible implementation, the execution module includes:

and the sequential execution unit is used for sequentially executing the instructions in the instruction set according to the order of the instructions in the instruction set.

In one possible implementation, the sequential execution unit includes:

the mutual influence determining subunit is used for checking whether the operation corresponding to the next instruction is mutually influenced with the operation being executed or not after the operation corresponding to the instruction is executed according to the sequence of the instructions in the instruction set;

the first mutual influence subunit is used for executing the operation corresponding to the next instruction after the operation being executed is completed if the operation corresponding to the next instruction and the operation being executed are mutually influenced and the priority of the operation corresponding to the next instruction is lower than that of the operation being executed;

the second mutual influence subunit is used for interrupting the operation being executed and executing the operation corresponding to the next instruction if the operation corresponding to the next instruction and the operation being executed are mutually influenced and the priority of the operation corresponding to the next instruction is higher than that of the operation being executed, and recovering the interrupted operation after the operation corresponding to the next instruction is completed;

the third mutual influence subunit is used for stopping the operation being executed and executing the operation corresponding to the next instruction if the operation corresponding to the next instruction and the operation being executed are mutually influenced and the priority of the operation corresponding to the next instruction is equal to the priority of the operation being executed;

and the non-interaction subunit is used for executing the operation corresponding to the next instruction if the operation corresponding to the next instruction and the operation being executed do not influence each other.

In a fourth aspect, the present application also provides a computer storage medium having a computer program stored thereon, which when executed by a processor, performs the steps of the method of the first aspect.

In addition, for technical effects brought by any one implementation manner of the second aspect to the third aspect, reference may be made to technical effects brought by different implementation manners of the first aspect, and details are not described here.

Additional features and advantages of the application will be set forth in the description which follows, and in part will be obvious from the description, or may be learned by the practice of the application. The objectives and other advantages of the application may be realized and attained by the structure particularly pointed out in the written description and claims hereof as well as the appended drawings.

Drawings

In order to more clearly illustrate the technical solutions in the embodiments of the present application, the drawings needed in the description of the embodiments are briefly introduced below. The drawings in the following description show only some embodiments of the present application, and those skilled in the art can obtain other drawings from these drawings without inventive effort.

FIG. 1 is a flowchart for establishing a correspondence between a voice command and a command set according to an embodiment of the present application;

FIG. 2 is a diagram of a window for inputting voice commands according to an embodiment of the present application;

FIG. 3 is a schematic diagram of a preset instruction set according to an embodiment of the present application;

FIG. 4 is a schematic diagram of a custom instruction set provided by an embodiment of the present application;

FIG. 5 is a schematic diagram of a response command provided by an embodiment of the present application;

FIG. 6 is a schematic diagram of an operation instruction provided in an embodiment of the present application;

FIG. 7 is a schematic diagram illustrating establishment of a correspondence between a voice command and a command set according to an embodiment of the present application;

FIG. 8 is a schematic diagram of a correspondence between a voice command and a command set after a command is selected according to an embodiment of the present application;

FIG. 9 is a schematic diagram of an established correspondence relationship provided in the embodiment of the present application;

FIG. 10 is a flowchart of a method for implementing a customized voice command according to an embodiment of the present application;

FIG. 11 is a flowchart illustrating execution of instructions in an instruction set according to an embodiment of the present application;

FIG. 12 is a schematic diagram of a first terminal for processing an application according to an embodiment of the present application;

FIG. 13 is a schematic diagram of a second terminal for processing an application according to an embodiment of the present application;

FIG. 14 is a schematic diagram of a third terminal for processing an application according to an embodiment of the present application.

Detailed Description

In order to make the objects, technical solutions and advantages of the present application clearer, the present application will be described in further detail with reference to the accompanying drawings, and it is obvious that the described embodiments are only a part of the embodiments of the present application, and not all embodiments. All other embodiments, which can be derived by a person skilled in the art from the embodiments given herein without making any creative effort, shall fall within the protection scope of the present application.

Some of the words that appear in the text are explained below:

1. In the embodiments of the present application, the term "and/or" describes an association between associated objects and indicates that three relationships are possible; for example, "A and/or B" may mean: A exists alone, A and B exist simultaneously, or B exists alone. The character "/" generally indicates an "or" relationship between the objects before and after it.

2. The term "terminal" in the embodiments of the present application refers to a computing device that can be used while mobile, such as a mobile phone or a tablet.

The application scenario described in the embodiment of the present application is for more clearly illustrating the technical solution of the embodiment of the present application, and does not form a limitation on the technical solution provided in the embodiment of the present application, and it can be known by a person skilled in the art that with the occurrence of a new application scenario, the technical solution provided in the embodiment of the present application is also applicable to similar technical problems. In the description of the present application, the term "plurality" means two or more unless otherwise specified.

In order to better understand the technical solution provided by the embodiments of the present application, the following brief description is made on the basic principle of the solution:

As a new interaction technology, voice is attracting more and more attention, and one of its most important development directions is helping users operate mobile phones more conveniently and quickly by voice. In particular, smartphones now have more and more functions and users download more and more application software; finding an application on the desktop or a function in the settings sometimes takes ten seconds, and some functions cannot be found by the user at all because their entry points are buried too deep. The voice assistant provides a direct path to functions: it needs only one or two seconds to open application software, operate a function, and so on, which greatly simplifies the user's operation.

However, the current problem with the voice assistant is that each voice instruction input by the user causes the terminal to complete one particular thing. For example: if the voice instruction asks for the current time, the terminal tells the user the current time by voice broadcast; if the voice instruction is "play music", the terminal opens the music application software and plays music; if the voice instruction is "turn off the alarm clock", the terminal turns off the alarm clock. In many scenarios, however, the user has to input instructions to the voice assistant several times to accomplish what the user wants. For example, when going home from work, a user may want to navigate to determine the route home, tell family members the time of arrival, and play music on the way. To do all of this, the user has to input three instructions to the terminal, "navigate home", "call family member" and "play music", which makes the operation complicated.

In view of this, the present application provides a method for implementing a customized voice instruction, in which a user first sets and stores a voice instruction and the instruction set corresponding to the voice instruction according to the user's own requirements. When the user later wants to fulfill that requirement, the user inputs the stored voice instruction to the terminal, and the terminal, after recognizing that the voice input by the user is the preset voice instruction, executes the instructions in the instruction set corresponding to the voice instruction. Therefore, the user can execute a plurality of instructions by inputting the voice instruction only once to accomplish what the user wants, which simplifies the user's operation and improves the user experience.

In view of the above scenario, the following describes embodiments of the present application in further detail with reference to the drawings of the specification.

In the embodiment of the application, the method for implementing the custom voice instruction can be divided into two parts: a setting part and an execution part. The setting part is used for setting the voice instruction and the instructions in the instruction set corresponding to the voice instruction. The execution part is used for recognizing the voice of the user and executing the instructions in the instruction set corresponding to the recognized voice instruction. The setting part and the execution part are explained separately below.

I. Setting part:

In this embodiment of the present application, in order to fulfill the user's requirement, the user needs to set a voice instruction and the instruction set corresponding to the voice instruction. Fig. 1 is a schematic flowchart for establishing the correspondence between the voice instruction and the instruction set, which includes the following steps:

Step 101: Responding to an instruction for establishing the correspondence between a voice instruction and an instruction set, and taking the information input by the user as the voice instruction.

In the embodiment of the application, the user can input voice information, or can type text information, as the voice instruction. Fig. 2 is a schematic diagram of the window for inputting voice instructions, in which the voice instruction is entered in the input field "I say".

Step 102: Taking at least two instructions selected by the user from a preset instruction set and/or a custom instruction set as the instruction set corresponding to the voice instruction.

In the embodiment of the application, the available instructions come from a preset instruction set and a custom instruction set. The preset instruction set consists of instructions pre-stored in the terminal, and the custom instruction set consists of instructions set by the user. The user selects the required instructions from the preset instruction set and the custom instruction set, and they are stored in association with the input voice instruction, so that the correspondence between the voice instruction and the instruction set is established. Fig. 3 is a schematic diagram of the preset instruction set, in which the user selects a required instruction by clicking it; for example, the user chooses the instruction "broadcast date and time" by clicking it.

In the embodiment of the application, the user sets an instruction in the custom instruction set by screen recording. Fig. 4 is a schematic diagram of the custom instruction set; once the user completes a custom instruction, it is displayed in the custom instruction set, as shown by "instruction 1" in the figure. In a specific operation, the user clicks the add instruction (the "+" in fig. 4) and the terminal starts recording; the operations the user performs on the terminal are recorded, and when the user clicks the finish instruction, the recording ends and is saved. For example, if user 1 wants to send the message "I am home" to user 2 through communication application software, then after user 1 clicks the add instruction the terminal displays the desktop, user 1 opens the communication application software, searches for the name of user 2, sends the message "I am home" to user 2, and finally clicks the finish instruction. The custom instruction is then complete: the terminal has recorded the user's operations and, when the instruction is executed, repeats the operations performed during recording. Therefore, even if the operation instructions preset in the terminal do not include an instruction the user needs, the user can still accomplish the desired operation through a custom instruction. The name of a custom instruction can also be edited, so that the operation it performs can be recognized from its name.
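A minimal sketch of the record-and-repeat idea follows. The application describes recording the user's on-screen operations but not how they are represented, so this sketch assumes each operation is stored as an (action, target) pair. `OperationRecorder`, `replay`, the action names and the `perform` callback are all hypothetical.

```python
class OperationRecorder:
    """Hypothetical recorder that builds a custom instruction from user operations."""

    def __init__(self):
        self._recording = False
        self._steps = []

    def start(self):
        # Corresponds to the user clicking the add instruction ("+").
        self._recording = True
        self._steps = []

    def record(self, action, target):
        # Only operations performed while recording are kept.
        if self._recording:
            self._steps.append((action, target))

    def finish(self, name):
        # Corresponds to the user clicking the finish instruction.
        self._recording = False
        return {"name": name, "steps": list(self._steps)}


def replay(custom_instruction, perform):
    # Repeat the recorded operations in their original order.
    for action, target in custom_instruction["steps"]:
        perform(action, target)


recorder = OperationRecorder()
recorder.start()
recorder.record("open_app", "communication application software")
recorder.record("search_contact", "user 2")
recorder.record("send_message", "I am home")
instruction_1 = recorder.finish("instruction 1")
recorder.record("open_app", "not recorded")  # ignored: recording has ended

performed = []
replay(instruction_1, lambda action, target: performed.append((action, target)))
```

Here `perform` stands in for whatever mechanism the terminal uses to carry out an operation; in this sketch it only appends to a list so that the replay order can be inspected.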

In an embodiment of the present application, the instructions in the instruction set include response instructions and operation instructions. A response instruction is a voice output by the terminal in reply to the voice instruction input by the user, for example "good morning" or "good night". An operation instruction is an operation executed according to the voice instruction input by the user, for example opening application software or broadcasting the date and time. To organize the instructions, response instructions and operation instructions are presented in separate categories. Fig. 5 is a schematic diagram of the response instructions, which include response instructions preset by the terminal, for example "good morning, start of a nice day"; clicking the microphone in a response instruction plays a preview of it. A response instruction can also be added by clicking the add instruction ("+" in fig. 5), either by typing or by clicking the microphone in the add instruction and speaking. A response instruction can be deleted by clicking the delete control ("-" in fig. 5) after it. If several response instructions are selected, then when the instructions are executed either one of them is selected at random and played, or they are played in sequence in the order in which they were selected. Fig. 6 is a schematic diagram of the operation instructions, which are instructions preset by the terminal, for example broadcasting the weather or making a call; operation instructions cannot be added or deleted.
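The two playback options for several selected response instructions can be sketched as below. The function name `responses_to_play`, the `mode` parameter and its values are hypothetical; the application only states that one response may be chosen at random or that the responses may be played in the selected order.

```python
import random


def responses_to_play(selected_responses, mode="sequential", rng=random):
    """Return the response instructions to play for one execution.

    mode="random": one response chosen at random;
    mode="sequential": all responses, in the order they were selected.
    """
    if not selected_responses:
        return []
    if mode == "random":
        return [rng.choice(selected_responses)]
    if mode == "sequential":
        return list(selected_responses)
    raise ValueError(f"unknown mode: {mode}")


selected = ["good morning", "good morning, start of a nice day"]
in_order = responses_to_play(selected, mode="sequential")
one_of_them = responses_to_play(selected, mode="random")
```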

Fig. 7 is a schematic diagram of establishing a correspondence between a voice instruction and an instruction set. The upper part is the voice instruction input window, through which the voice instruction is set; the lower part holds the instruction categories to select from, namely individual reply (response instructions), official skill (preset operation instructions) and teaching command (custom operation instructions). The user clicks a category and selects the appropriate instructions. Fig. 8 is a schematic diagram of the correspondence after instructions have been selected: the upper part is the voice instruction, the middle part is the instruction set corresponding to the voice instruction, and the lower part is the instruction categories. Clicking save stores the correspondence. In fig. 8, the voice instruction is set to "good morning", and the selected instruction set consists of the response instruction "good morning, start of a nice day", the operation instruction "broadcast date and time" and the custom instruction "instruction 1". Thus, when the terminal recognizes that the user has input the voice instruction "good morning", it executes the corresponding instruction set, namely "good morning, start of a nice day", "broadcast date and time" and "instruction 1".

It should be noted that the execution order of step 101 and step 102 is not limited; that is, the voice instruction may be input first, or the instructions may be selected first. Because the user inputs both the voice instruction and the instruction set corresponding to it, the voice assistant better matches the user's habits. Moreover, because the user chooses the voice instruction and its instruction set, the user can accomplish what the user wants with a single voice instruction.

In the embodiment of the application, in addition to establishing the corresponding relationship between the voice command and the command set, the established corresponding relationship can be modified or deleted. And if the user selects one of the established corresponding relations to modify, the terminal responds to the editing instruction of the user and edits the voice instruction specified by the editing instruction and/or the instruction set specified by the editing instruction. And if the user wants to delete one of the established corresponding relations, the terminal responds to a deletion instruction of the user and deletes the voice instruction specified by the deletion instruction and the instruction set corresponding to the voice instruction.

Fig. 9 is a schematic diagram of the established correspondences. If the user clicks "add" in the first correspondence, the terminal enters the editing state shown in fig. 8, in which the user may modify the voice instruction, add or delete instructions in the instruction set, and adjust the order of the instructions in the instruction set by clicking the order-adjustment icon. If the user presses the first correspondence, the terminal enters the deleting state, and if the user then clicks delete, the first correspondence is deleted.

In this way, the user can edit or delete a voice instruction and its corresponding instruction set, so that the preset voice instructions and their instruction sets better meet the user's needs and accomplish what the user intends.
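The establishing, editing, reordering, and deleting operations described above can be illustrated with a minimal Python sketch. The class name `CommandStore` and its method names are hypothetical and do not appear in the application; the only rule taken from the text is that an instruction set comprises at least two instructions.

```python
from __future__ import annotations

from dataclasses import dataclass, field


@dataclass
class CommandStore:
    """Hypothetical store mapping a voice instruction to an ordered instruction set."""

    mappings: dict[str, list[str]] = field(default_factory=dict)

    def save(self, phrase: str, instructions: list[str]) -> None:
        # Per the text, an instruction set comprises at least two instructions.
        if len(instructions) < 2:
            raise ValueError("an instruction set needs at least two instructions")
        self.mappings[phrase] = list(instructions)

    def edit(self, phrase: str, new_phrase: str | None = None,
             new_instructions: list[str] | None = None) -> None:
        # Edit the voice instruction and/or the instruction set.
        if phrase not in self.mappings:
            raise KeyError(phrase)
        target = phrase if new_phrase is None else new_phrase
        chosen = self.mappings[phrase] if new_instructions is None else new_instructions
        self.save(target, chosen)  # validates before the old entry is dropped
        if target != phrase:
            del self.mappings[phrase]

    def move(self, phrase: str, old_index: int, new_index: int) -> None:
        # Adjust the order of the instructions inside one instruction set.
        items = self.mappings[phrase]
        items.insert(new_index, items.pop(old_index))

    def delete(self, phrase: str) -> None:
        # Remove the voice instruction together with its instruction set.
        del self.mappings[phrase]


store = CommandStore()
store.save("good morning", [
    "good morning, a beautiful day begins",  # response instruction
    "broadcast date and time",               # operation instruction
    "command 1",                             # user-defined instruction
])
store.move("good morning", 2, 0)  # run "command 1" first
```

Keeping each instruction set as an ordered list matters here because the execution part relies on the order of the instructions in the set.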

II. Execution part:

In the embodiment of the application, after the correspondence between a voice instruction and an instruction set is set, the instructions in the instruction set corresponding to the voice instruction can be executed when the user inputs that voice instruction. As shown in fig. 10, the method specifically includes the following steps:

Step 1001: collecting the voice input by the user through a microphone, and recognizing the voice input by the user.

Step 1002: if a corresponding voice instruction is selected from preset voice instructions according to the recognition result, determining an instruction set corresponding to the selected voice instruction according to the corresponding relation between the voice instruction and the instruction set; wherein the instruction set comprises at least two instructions.

Step 1003: executing instructions of the set of instructions.

In the embodiment of the present application, in order to accurately accomplish what the user intends, the instructions in the instruction set are executed in the order in which they appear in the correspondence. This may be specifically implemented as: executing the instructions in the instruction set in sequence according to the order of the instructions in the instruction set.
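As a rough illustration of steps 1002 and 1003 with in-order execution, consider the following Python sketch. It assumes that step 1001 (microphone capture and speech recognition) has already produced a text string, and it uses exact dictionary lookup as a stand-in for selecting the preset voice instruction. The function name `handle_utterance` and the `actions` table are hypothetical.

```python
from typing import Callable, Dict, List


def handle_utterance(recognized_text: str,
                     mappings: Dict[str, List[str]],
                     actions: Dict[str, Callable[[], None]]) -> List[str]:
    """Look up the instruction set for a recognized phrase and run it in order."""
    # Step 1002: determine the instruction set from the correspondence.
    instruction_set = mappings.get(recognized_text)
    if instruction_set is None:
        return []  # no preset voice instruction was selected
    executed = []
    # Step 1003: execute the instructions in the order stored in the set.
    for name in instruction_set:
        actions[name]()
        executed.append(name)
    return executed


log: List[str] = []
mappings = {"good morning": ["broadcast date and time", "command 1"]}
actions = {
    "broadcast date and time": lambda: log.append("broadcast"),
    "command 1": lambda: log.append("command 1"),
}
result = handle_utterance("good morning", mappings, actions)
```

This sketch runs the instructions strictly one after another; it does not model the conflict and priority handling of steps A1-A5.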

In the embodiment of the present application, when the instructions in the instruction set are executed, if the operations corresponding to the instructions affect each other, the instructions are executed according to the priorities of their corresponding operations. Specifically, this can be implemented as steps A1-A5:

Step A1: according to the order of the instructions in the instruction set, once the operation corresponding to an instruction is being executed, checking whether the operation corresponding to the next instruction and the operation being executed affect each other.

In the embodiment of the application, whether the operations corresponding to two instructions affect each other can be determined according to whether the operations can be executed simultaneously. Specifically: if the operations corresponding to the two instructions cannot be executed simultaneously, it is determined that the operations affect each other.

In one embodiment, if the operations corresponding to the two instructions both occupy audio or both occupy video, it is determined that the operations affect each other. For example: the operation corresponding to one instruction is playing a song and the operation corresponding to the other instruction is a voice reply; both operations need to occupy the audio channel, so they affect each other.
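One possible way to express this check is to tag each operation with the exclusive resources it occupies and to treat any overlap as mutual influence. The Python sketch below is only an assumption about how this could be modeled: the `Operation` type, the `affects` function, and the resource labels `"audio"` and `"video"` are illustrative names, not part of the application.

```python
from __future__ import annotations

from dataclasses import dataclass


@dataclass(frozen=True)
class Operation:
    name: str
    resources: frozenset[str]  # hypothetical labels such as "audio" or "video"


def affects(a: Operation, b: Operation) -> bool:
    """Two operations affect each other if they occupy a common resource,
    i.e. they cannot be executed simultaneously."""
    return bool(a.resources & b.resources)


play_song = Operation("play song", frozenset({"audio"}))
voice_reply = Operation("voice reply", frozenset({"audio"}))
open_app = Operation("open the communication application software", frozenset())

song_vs_reply = affects(play_song, voice_reply)  # both occupy audio
song_vs_app = affects(play_song, open_app)       # no shared resource
```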

Step A2: if the operation corresponding to the next instruction and the operation being executed affect each other, and the priority of the operation corresponding to the next instruction is lower than that of the operation being executed, waiting until the operation being executed is completed and then executing the operation corresponding to the next instruction.

In the embodiment of the present application, the priority of the operation corresponding to each instruction is preset, and of course, the priority of the operation may also be modified according to specific situations, which is not limited in the present application.

In one embodiment, the instruction set contains 2 instructions, instruction 1 and instruction 2. If the operation corresponding to instruction 1 and the operation corresponding to instruction 2 affect each other and instruction 1 has the higher priority, no other operation is executed while the operation corresponding to instruction 1 is executing, and the operation corresponding to instruction 2 is executed after the operation corresponding to instruction 1 is completed.

For example: the instructions in the instruction set are the response instruction "broadcast date and time" and the operation instruction "play song". When executing the instructions in the instruction set, the terminal first executes the response instruction "broadcast date and time" and judges whether it and the operation instruction "play song" affect each other. Having determined that the two affect each other, the terminal compares their priorities, determines that the priority of "broadcast date and time" is higher than that of "play song", continues executing "broadcast date and time", and executes "play song" after "broadcast date and time" has finished.

Step A3: if the operation corresponding to the next instruction and the operation being executed affect each other, and the priority of the operation corresponding to the next instruction is higher than that of the operation being executed, interrupting the operation being executed, executing the operation corresponding to the next instruction, and resuming the interrupted operation after the operation corresponding to the next instruction is completed.

In one embodiment, the instruction set contains 2 instructions, instruction 1 and instruction 2. If the operation corresponding to instruction 1 and the operation corresponding to instruction 2 affect each other and instruction 2 has the higher priority, the operation corresponding to instruction 1 is interrupted while it is executing and the operation corresponding to instruction 2 is executed; after the operation corresponding to instruction 2 is completed, the operation corresponding to instruction 1 is resumed and continues to be executed.

For example: the instructions in the instruction set are the operation instruction "play song" and the response instruction "broadcast date and time". When executing the instructions in the instruction set, the terminal first executes the operation instruction "play song" and judges whether it and the response instruction "broadcast date and time" affect each other. Having determined that the two affect each other, the terminal compares their priorities, determines that the priority of "broadcast date and time" is higher than that of "play song", interrupts "play song", executes "broadcast date and time", and continues executing "play song" after "broadcast date and time" has finished.

Step A4: if the operation corresponding to the next instruction and the operation being executed affect each other, and the priority of the operation corresponding to the next instruction is equal to that of the operation being executed, stopping the operation being executed and executing the operation corresponding to the next instruction.

In one embodiment, the instruction set contains 2 instructions, instruction 1 and instruction 2. If the operation corresponding to instruction 1 and the operation corresponding to instruction 2 affect each other and the priorities of instruction 1 and instruction 2 are the same, the operation corresponding to instruction 1 is stopped while it is executing and the operation corresponding to instruction 2 is executed.

For example: the instructions in the instruction set are the operation instruction "play song 1" and the operation instruction "play song 2". When executing the instructions in the instruction set, the terminal first executes the operation instruction "play song 1" and judges whether it and the operation instruction "play song 2" affect each other. Having determined that the two affect each other, the terminal compares their priorities, determines that the priorities are the same, stops playing song 1, and plays song 2.

Step A5: if the operation corresponding to the next instruction and the operation being executed do not affect each other, executing the operation corresponding to the next instruction at the same time as the operation being executed.

In one embodiment, the instruction set contains 2 instructions, instruction 1 and instruction 2. If the operation corresponding to instruction 1 and the operation corresponding to instruction 2 do not affect each other, the operation corresponding to instruction 2 is executed while the operation corresponding to instruction 1 is executing.

For example: the instructions in the instruction set are the operation instruction "play song" and the operation instruction "open the communication application software". When executing the instructions in the instruction set, the terminal first executes the operation instruction "play song" and judges whether it and the operation instruction "open the communication application software" affect each other. Having determined that the two do not affect each other, the terminal executes "open the communication application software" at the same time.

It should be noted that steps A2-A5 are different branches in the present application, and the order in which they are listed is not limiting.

As shown in FIG. 11, a flow diagram for executing instructions in an instruction set is shown. The instruction set has two instructions, instruction 1 and instruction 2. The execution steps are as follows:

Step 1101: executing instruction 1.

Step 1102: judging whether the instruction 2 and the instruction 1 influence each other; if yes, go to step 1103; if not, go to step 1107.

Step 1103: judging the priority of the instruction 2 and the instruction 1; if the priority of instruction 1 is higher than that of instruction 2, go to step 1104; if the priority of instruction 1 is lower than that of instruction 2, go to step 1106; if the priority of instruction 1 is the same as the priority of instruction 2, then step 1105 is performed.

Step 1104: after waiting for instruction 1 to finish execution, instruction 2 is executed.

Step 1105: execution of instruction 1 is terminated and instruction 2 is executed.

Step 1106: and interrupting the execution of the instruction 1, executing the instruction 2, and continuing the instruction 1 after the instruction 2 is executed.

Step 1107: executing instruction 2 at the same time as instruction 1.
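The branching of steps 1102-1107 can be summarized as a small decision function. The following Python sketch is illustrative only: the names `Action` and `decide` are hypothetical, and it assumes that priorities are integers where a larger value means a higher priority, which the application does not specify.

```python
from enum import Enum


class Action(Enum):
    WAIT_THEN_RUN = "step 1104"         # instruction 1 finishes, then instruction 2
    TERMINATE_AND_RUN = "step 1105"     # stop instruction 1, run instruction 2
    INTERRUPT_RUN_RESUME = "step 1106"  # pause 1, run 2, then resume 1
    RUN_CONCURRENTLY = "step 1107"      # run 2 while 1 is still running


def decide(conflict: bool, running_priority: int, next_priority: int) -> Action:
    """Choose how to handle the next instruction relative to the running one.

    Assumption: a larger integer means a higher priority.
    """
    # Step 1102: do the two instructions affect each other?
    if not conflict:
        return Action.RUN_CONCURRENTLY
    # Step 1103: compare priorities.
    if running_priority > next_priority:
        return Action.WAIT_THEN_RUN
    if running_priority < next_priority:
        return Action.INTERRUPT_RUN_RESUME
    return Action.TERMINATE_AND_RUN


# "broadcast date and time" (priority 2) is running, "play song" (priority 1) is next.
broadcast_then_song = decide(conflict=True, running_priority=2, next_priority=1)
```

For an instruction set with more than two instructions, the same decision would be applied to each next instruction against whichever operation is being executed at that point.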

In one embodiment, the instruction set contains 4 instructions: instruction 1, instruction 2, instruction 3, and instruction 4. The operation corresponding to instruction 1 and the operation corresponding to instruction 2 do not affect each other; the operations corresponding to instruction 1, instruction 3, and instruction 4 affect one another; the priority of instruction 1 is higher than the priorities of instruction 3 and instruction 4; and the priority of instruction 3 is the same as that of instruction 4. When executing the instructions in the instruction set, the terminal first executes instruction 1 and judges whether instruction 2 and instruction 1 affect each other; having determined that they do not, it executes instruction 2 at the same time. The terminal then judges whether instruction 3 affects instruction 1 or instruction 2; having determined that instruction 3 and instruction 1 affect each other, it compares their priorities and, because the priority of instruction 3 is lower than that of instruction 1, executes instruction 3 after instruction 1 has finished. Finally, the terminal judges whether instruction 4 affects instruction 2 or instruction 3; having determined that instruction 3 and instruction 4 affect each other, it compares their priorities and, because they are the same, stops executing instruction 3 and executes instruction 4.

Therefore, corresponding operations are executed by judging whether the instructions in the instruction set are mutually influenced or not, and the instructions in the instruction set can be completed in sequence while the operations corresponding to the instructions are accurately executed.

The above describes how the user-defined voice instruction is implemented. The following further describes how to determine that the voice input by the user is a preset voice instruction, which may be specifically implemented as steps B1-B3:

Step B1: determining the feature information of the voice according to the recognition result.

Step B2: and performing similarity matching on the characteristic information of the voice and a preset voice instruction.

Step B3: and taking the voice instruction with the similarity larger than a preset threshold value as the voice instruction corresponding to the recognition result.

In the embodiment of the present application, if the voice input by the user is "I will come home soon", the feature information of the voice is determined to be "come home". "Come home" is then matched against the preset voice instructions, and if its matching degree with the voice instruction "navigate home" is greater than the preset threshold, "navigate home" is taken as the voice instruction corresponding to "come home".

In the embodiment of the application, if the matching degree between the recognition result and the plurality of voice instructions is greater than the preset threshold, the voice instruction with the highest matching degree is used as the voice instruction corresponding to the recognition result.

In the embodiment of the application, if a voice instruction that corresponds to an instruction set is the same as a voice instruction that executes only a single instruction, the voice instruction that corresponds to the instruction set is selected and its instructions are executed.

In this way, the voice instruction is determined by similarity matching between the feature information of the voice input by the user and the preset voice instructions. Therefore, even if the voice input by the user is not exactly the same as a preset voice instruction, it can still be recognized, so the requirements on the user's voice input are relaxed, the voice recognition is more user-friendly, and the user experience is improved.
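Steps B2 and B3, together with the rule of taking the highest matching degree, might look like the Python sketch below. The application does not specify the similarity measure; this sketch substitutes the character-level ratio from the standard library's `difflib.SequenceMatcher` purely for illustration. The function name `match_instruction` and the default threshold of 0.6 are assumptions.

```python
from difflib import SequenceMatcher
from typing import Iterable, Optional


def match_instruction(feature: str, presets: Iterable[str],
                      threshold: float = 0.6) -> Optional[str]:
    """Return the preset voice instruction most similar to `feature`,
    provided its similarity exceeds `threshold`; otherwise return None."""
    best: Optional[str] = None
    best_score = threshold
    for phrase in presets:
        # Step B2: similarity between the feature information and a preset.
        score = SequenceMatcher(None, feature, phrase).ratio()
        # Step B3: keep only candidates above the threshold, preferring
        # the highest matching degree when several exceed it.
        if score > best_score:
            best, best_score = phrase, score
    return best


presets = ["good morning", "navigate home"]
selected = match_instruction("good mornin", presets)  # slightly truncated input
rejected = match_instruction("zzz", presets)          # nothing similar enough
```

A production system would more likely compare semantic or acoustic features than raw characters, so the scores here only demonstrate the thresholding logic.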

As shown in fig. 12, an embodiment of the present application provides a customized voice instruction implementing terminal, where the terminal includes: a processor 1210 and a memory 1220, wherein the memory 1220 stores program code, which when executed by the processor 1210, causes the terminal to perform the following processes:

executing instructions of the set of instructions.

Optionally, the processor 1210 is specifically configured to:

responding to an instruction for establishing a correspondence between a voice instruction and an instruction set, and taking information input by a user as the voice instruction; and

Optionally, the processor 1210 is specifically configured to:

Optionally, the terminal further includes:

responding to an editing instruction of a user, and editing a voice instruction specified by the editing instruction and/or an instruction set specified by the editing instruction; and/or

Optionally, the processor 1210 is specifically configured to:

Optionally, if the operations corresponding to the two instructions cannot be executed simultaneously, it is determined that the operations corresponding to the two instructions affect each other.

As shown in fig. 13, an embodiment of the present application further provides a terminal for implementing a user-defined voice instruction, where the terminal includes a recognition module 1301, an instruction set determining module 1302, and an execution module 1303:

the recognition module 1301 is configured to collect the voice input by the user through a microphone and recognize the voice input by the user;

the instruction set determining module 1302 is configured to determine, if a corresponding voice instruction is selected from preset voice instructions according to the recognition result, an instruction set corresponding to the selected voice instruction according to the correspondence between the voice instruction and the instruction set; wherein the instruction set comprises at least two instructions;

the execution module 1303 is configured to execute the instructions in the instruction set.

the response module is configured to respond to an instruction for establishing a correspondence between a voice instruction and an instruction set and take information input by a user as the voice instruction; and

In one possible implementation, the instruction set determining module 1302 includes:

In a possible implementation manner, the terminal further includes:

the editing module is configured to respond to an editing instruction of a user and edit a voice instruction specified by the editing instruction and/or an instruction set specified by the editing instruction; and/or

In one possible implementation, the execution module 1303 includes:

In one possible implementation, the sequential execution unit includes:

As shown in fig. 14, a terminal 1400 for implementing a user-defined voice instruction provided in the embodiment of the present application includes: a radio frequency (RF) circuit 1410, a power supply 1420, a processor 1430, a memory 1440, an input unit 1450, a display unit 1460, a camera 1470, a communication interface 1480, and a WiFi module 1490. Those skilled in the art will appreciate that the configuration of the terminal shown in fig. 14 is not intended to be limiting, and that the terminal provided by the embodiments of the present application may include more or fewer components than those shown, or some components may be combined, or a different arrangement of components may be provided.

The following describes the various components of the terminal 1400 in detail with reference to fig. 14:

The RF circuit 1410 may be used for receiving and transmitting data during communication or a call. Specifically, the RF circuit 1410 receives downlink data from a base station and passes it to the processor 1430 for processing, and sends uplink data to the base station. In general, the RF circuit 1410 includes, but is not limited to, an antenna, at least one amplifier, a transceiver, a coupler, a low noise amplifier (LNA), a duplexer, and the like.

In addition, the RF circuit 1410 may also communicate with a network and other terminals through wireless communication. The wireless communication may use any communication standard or protocol, including but not limited to Global System for mobile communications (GSM), General Packet Radio Service (GPRS), Code Division Multiple Access (CDMA), Wideband Code Division Multiple Access (WCDMA), Long Term Evolution (LTE), email, Short Messaging Service (SMS), and the like.

The WiFi technology belongs to a short-distance wireless transmission technology, and the terminal 1400 may connect to an Access Point (AP) through a WiFi module 1490, thereby implementing Access to a data network. The WiFi module 1490 can be used for receiving and transmitting data during communication.

The terminal 1400 can be physically connected to other terminals through the communication interface 1480. In some embodiments, the communication interface 1480 is connected to a communication interface of the other terminal through a cable, so as to realize data transmission between the terminal 1400 and the other terminal.

In the embodiment of the present application, the terminal 1400 is capable of implementing a communication service and sending information to other contacts, so that the terminal 1400 needs to have a data transmission function, that is, the terminal 1400 needs to include a communication module inside. Although fig. 14 shows communication modules such as the RF circuit 1410, the WiFi module 1490, and the communication interface 1480, it is to be understood that at least one of the above components or other communication modules (e.g., bluetooth module) for enabling communication exists in the terminal 1400 for data transmission.

For example, when the terminal 1400 is a mobile phone, the terminal 1400 may include the RF circuit 1410, and may further include the WiFi module 1490; when the terminal 1400 is a computer, the terminal 1400 may include the communication interface 1480 and may further include the WiFi module 1490; when the terminal 1400 is a tablet computer, the terminal 1400 may include the WiFi module.

The memory 1440 may be used to store software programs and modules. The processor 1430 executes software programs and modules stored in the memory 1440 to perform various functional applications and data processing of the terminal 1400, and when the processor 1430 executes the program codes in the memory 1440, part or all of the processes in fig. 10 according to the embodiments of the present disclosure can be implemented.

In some embodiments, the memory 1440 may mainly include a program storage area and a data storage area. The program storage area can store an operating system, various application programs (such as a communication application), a face recognition module, and the like; the data storage area may store data created according to the use of the terminal (such as multimedia files like pictures and video files, and face information templates).

Further, the memory 1440 may include high-speed random access memory, and may also include non-volatile memory, such as at least one magnetic disk storage device, flash memory device, or other non-volatile solid-state storage device.

The input unit 1450 may be used to receive numeric or character information input by a user and to generate key signal inputs related to user settings and function control of the terminal 1400.

In some embodiments, the input unit 1450 may include a touch panel 1451 and other input terminals 1452.

The touch panel 1451, also referred to as a touch screen, may collect touch operations of a user (for example, operations of the user on or near the touch panel 1451 using any suitable object or accessory such as a finger or a stylus pen) and drive the corresponding connected device according to a preset program. In some embodiments, the touch panel 1451 may include two parts, a touch detection device and a touch controller. The touch detection device detects the touch direction of the user, detects the signal produced by the touch operation, and transmits the signal to the touch controller; the touch controller receives touch information from the touch detection device, converts it into touch point coordinates, sends the coordinates to the processor 1430, and can receive and execute commands sent from the processor 1430. In addition, the touch panel 1451 may be implemented in various types, such as resistive, capacitive, infrared, and surface acoustic wave.

In some embodiments, the other input terminals 1452 may include, but are not limited to, one or more of a physical keyboard, function keys (e.g., volume control keys, switch keys, etc.), a trackball, a mouse, a joystick, and the like.

The display unit 1460 may be used to display information input by a user or information provided to a user and various menus of the terminal 1400. The display unit 1460 is a display system of the terminal 1400, and is used for presenting an interface and implementing human-computer interaction.

Further, the touch panel 1451 may cover the display panel 1461, and when the touch panel 1451 detects a touch operation on or near the touch panel, the touch operation is transmitted to the processor 1430 to determine the type of the touch event, and then the processor 1430 provides a corresponding visual output on the display panel 1461 according to the type of the touch event.

Although in fig. 14 the touch panel 1451 and the display panel 1461 are implemented as two separate components to realize the input and output functions of the terminal 1400, in some embodiments the touch panel 1451 and the display panel 1461 may be integrated to realize the input and output functions of the terminal 1400.

The processor 1430 is a control center of the terminal 1400, connects various components using various interfaces and lines, performs various functions of the terminal 1400 and processes data by operating or executing software programs and/or modules stored in the memory 1440 and calling data stored in the memory 1440, thereby implementing various services based on the terminal.

In some embodiments, the processor 1430 may include one or more processors. In some embodiments, the processor 1430 may integrate an application processor, which primarily handles operating systems, user interfaces, applications, etc., and a modem processor, which primarily handles wireless communications. It will be appreciated that the modem processor described above may not be integrated into the processor 1430.

The camera 1470 is configured to implement a shooting function of the terminal 1400 and shoot pictures or videos. The camera 1470 may also be used to implement a scanning function of the terminal 1400, and scan a scanned object (two-dimensional code/barcode).

The terminal 1400 also includes a power supply 1420 (e.g., a battery) for powering the various components. In some embodiments, the power supply 1420 may be logically coupled to the processor 1430 via a power management system, such that the power management system performs functions of managing charging, discharging, and power consumption.

Although not shown, the terminal 1400 may further include at least one sensor, an audio circuit, and the like, which are not described in detail herein.

The memory 1440 may store the same program code as the memory 1220, which, when executed by the processor 1430, causes the processor 1430 to perform all of the functions of the processor 1210.

The embodiment of the present application further provides a computer-readable non-volatile storage medium, which includes a program code, and when the program code runs on a computing terminal, the program code is configured to enable the computing terminal to execute the steps of the method for implementing the customized voice instruction according to the embodiment of the present application.

The present application is described above with reference to block diagrams and/or flowchart illustrations of methods, apparatus (systems) and/or computer program products according to embodiments of the application. It will be understood that one block of the block diagrams and/or flowchart illustrations, and combinations of blocks in the block diagrams and/or flowchart illustrations, can be implemented by computer program instructions. These computer program instructions may be provided to a processor of a general purpose computer, special purpose computer, and/or other programmable data processing apparatus to produce a machine, such that the instructions, which execute via the processor of the computer and/or other programmable data processing apparatus, create means for implementing the functions/acts specified in the block diagrams and/or flowchart block or blocks.

Accordingly, the subject application may also be embodied in hardware and/or in software (including firmware, resident software, micro-code, etc.). Furthermore, the present application may take the form of a computer program product on a computer-usable or computer-readable storage medium having computer-usable or computer-readable program code embodied in the medium for use by or in connection with an instruction execution system. In the context of this application, a computer-usable or computer-readable medium may be any medium that can contain, store, communicate, propagate, or transport the program for use by or in connection with the instruction execution system, apparatus, or device.

It will be apparent to those skilled in the art that various changes and modifications may be made in the present application without departing from the spirit and scope of the application. Thus, if such modifications and variations of the present application fall within the scope of the claims of the present application and their equivalents, the present application is intended to include such modifications and variations as well.

Claims

1. A method for implementing a user-defined voice instruction, applied to a terminal, the method comprising the following steps:

executing instructions of the set of instructions.

2. The method of claim 1, wherein the correspondence between the voice command and the command set is established by:

3. The method according to claim 1, wherein selecting a corresponding voice command from preset voice commands according to the recognition result comprises:

4. The method of claim 1, further comprising:

5. The method according to any one of claims 1 to 4, wherein the executing the instructions in the instruction set comprises:

6. The method of claim 5, wherein the executing the instructions of the instruction set in order according to the order of the instructions of the instruction set comprises:

7. A terminal for realizing a user-defined voice instruction is characterized by comprising: a processor and a memory, wherein the memory stores program code that, when executed by the processor, causes the terminal to perform the following:

executing instructions of the set of instructions.

8. The terminal of claim 7, wherein the processor is specifically configured to:

9. The terminal of claim 7, wherein the processor is specifically configured to:

10. The terminal of claim 7, further comprising: