CN111954868A

CN111954868A - Multi-voice assistant control method, device, system and computer readable storage medium

Info

Publication number: CN111954868A
Application number: CN201880091294.XA
Authority: CN
Inventors: 董学章
Original assignee: Jiangsu Shushi Technology Co ltd
Current assignee: Jiangsu Shushi Technology Co ltd
Priority date: 2018-05-31
Filing date: 2018-05-31
Publication date: 2020-11-17
Also published as: WO2019227370A1

Abstract

The invention discloses a multi-voice assistant control method, device, system and computer readable storage medium. The multi-voice assistant control method is applied to an electronic device integrating a plurality of voice assistants and comprises the following steps: acquiring an external instruction through the plurality of voice assistants currently on the electronic device; and each of the voice assistants determining, according to the external instruction, whether to wake up or to interrupt its response process, wherein the wake-up functions of the voice assistants are all in an enabled state. The invention allows switching by voice among the plurality of voice assistants at any time, and the voice assistants are never in the awakened working state at the same time, which avoids mutual interference and a degraded user experience.

Description

Multi-voice assistant control method, device, system and computer readable storage medium

Technical Field

The present invention relates to the field of speech technology, and in particular to a multi-voice assistant control method, apparatus, system, and computer-readable storage medium.

Background

With the development of speech recognition technology and the growing demand for voice interaction, the voice assistant has emerged as an intelligent application. In the related art, a voice assistant can hold an intelligent dialog or an instant question-and-answer session with the user to help the user solve a problem. However, existing electronic devices usually have only one built-in voice assistant. Even where multiple voice assistants are built in, one voice assistant may be woken up while another is working, so that multiple voice assistants work at the same time, which causes conflicts (mutual interference) and a poor user experience.

Disclosure of Invention

The invention aims to provide a multi-voice assistant control method, a device, a system and a computer readable storage medium, which are used for solving the problems that a plurality of voice assistants of the existing electronic equipment work simultaneously to cause conflict (mutual interference) and the user experience is not ideal.

In the present invention, a first aspect provides a multi-voice assistant control method, which is applied to an electronic device integrating multiple voice assistants, and comprises the following steps:

acquiring external instructions by a plurality of current voice assistants of the electronic equipment;

and the voice assistants judge to perform an interrupt response process or wake up according to the external instruction, wherein the wake-up functions of the voice assistants are all in an enabling state.

Preferably, the step "the voice assistants determine to perform an interrupt response process or wake up according to the external instruction" includes the steps of:

the plurality of voice assistants input the external instruction;

the voice assistants judge whether the external instruction is consistent with a first awakening marker preset by each voice assistant;

and if the external instruction is consistent with the first wake-up marker, waking up the corresponding voice assistant, and otherwise, interrupting the response process of the corresponding voice assistant.

Preferably, the step of "the voice assistants determine whether the external command is consistent with the respective preset first wake-up markers" includes the steps of:

judging whether the external instruction contains a voice trigger event or not;

if the external instruction contains the voice trigger event, the step of 'the plurality of voice assistants judging whether the external instruction is consistent with the respective preset first wake-up markers' is executed.

Preferably, the step of determining whether the external command includes a voice trigger event includes the steps of:

determining whether the external instruction includes a keyword associated with an identification of any of the plurality of voice assistants;

determining that the external instruction contains the voice-triggered event if a keyword associated with an identification of any of the plurality of voice assistants is included in the external instruction.

associating the keyword with a preset corresponding second wake-up marker when it is determined that the external instruction includes the voice trigger event;

the voice assistants judge whether the second awakening marker is consistent with a first awakening marker preset by each voice assistant;

if the second awakening marker is consistent with the first awakening marker preset by the user, executing the step of awakening the corresponding voice assistant, and otherwise, executing the step of interrupting the response process of the corresponding voice assistant.

one voice assistant is judged to wake up according to the external instruction, and meanwhile, other voice assistants are judged to perform an interrupt response process according to the external instruction.

Preferably, the step of "obtaining external instructions by a plurality of voice assistants of the electronic equipment" includes the following steps:

and acquiring an externally input instruction through a microphone array.

the voice assistant which judges to be awakened sends the external instruction to a cloud server through a network for processing;

and responding the result processed by the cloud server by the voice assistant which is judged to be awakened.

A second aspect of the invention provides a multi-voice assistant control device, which is applied to an electronic device integrating a plurality of voice assistants, and comprises:

the acquisition module is used for acquiring external instructions through a plurality of current voice assistants of the electronic equipment;

and the judging module is used for judging whether to perform interrupt response process or wake up according to the external instruction by the voice assistants, wherein the wake-up functions of the voice assistants are all in an enabling state.

Preferably, the determination module includes:

an input sub-module for the plurality of voice assistants to input the external instruction;

the awakening judgment sub-module is used for judging whether the external instruction is consistent with a first awakening marker preset by each voice assistant;

the awakening sub-module is used for awakening the corresponding voice assistant when the external instruction is consistent with the first awakening marker;

and the interrupt submodule is used for interrupting the response process of the corresponding voice assistant when the external instruction is inconsistent with the first wake-up marker.

Preferably, the determination module includes:

a trigger event judging submodule for judging whether the external instruction contains a voice trigger event,

and the awakening judgment sub-module is used for judging whether the external instruction is consistent with the respective preset first awakening markers or not by the plurality of voice assistants when the external instruction contains the voice trigger event.

Preferably, the triggering event determining sub-module includes:

a keyword judgment sub-module, configured to judge whether the external instruction includes a keyword associated with any of the plurality of voice assistant identifiers;

and the trigger event confirmation sub-module is used for determining that the external instruction contains the voice trigger event when the external instruction comprises a keyword associated with any one of the voice assistant identifiers.

Preferably, the wake-up determining sub-module includes:

the association submodule is used for associating the keyword with a preset corresponding second awakening marker when the external instruction is determined to contain the voice trigger event;

a marker judgment sub-module, configured to judge, by the multiple voice assistants, whether the second wake-up marker is consistent with a first wake-up marker preset in each of the multiple voice assistants;

the determination module includes:

the awakening sub-module is used for awakening the corresponding voice assistant when the second awakening marker is consistent with the respective preset first awakening marker;

and the interruption submodule is used for interrupting the response process of the corresponding voice assistant when the second awakening marker is inconsistent with the respective preset first awakening marker.

Preferably, the apparatus comprises:

and the microphone array acquisition module is used for acquiring an externally input instruction through the microphone array.

A third aspect of the present invention provides a multi-voice assistant control system, the system comprising:

a cloud server and the multi-voice assistant control device of the second aspect;

the multi-voice assistant control apparatus includes:

the receiving and sending module is used for sending an external instruction to the cloud server and/or receiving a result processed by the cloud server through the voice assistant judged to be awakened;

and the response module is used for responding to the result received by the transceiver module and processed by the cloud server through the judged and awakened voice assistant.

The invention also provides a multi-voice assistant control device, which is applied to electronic equipment integrating a plurality of voice assistants and comprises:

a processor;

a memory for storing the processor-executable instructions;

wherein the processor is configured to:

The present invention also provides a computer readable storage medium having stored thereon computer program instructions which, when executed by a processor, implement the steps of the method of the first aspect.

Compared with the prior art, the implementation mode of the invention has the main differences and the effects that:

According to the multi-voice assistant control method of the invention, one voice assistant wakes up according to the instruction while the other voice assistants interrupt their response processes according to the same instruction, so that the voice assistants are never in the awakened working state at the same time, which avoids mutual interference and a degraded user experience.

Furthermore, the preset awakening words can be defined by the user, so that the use method of the voice assistant is enriched, and the single role of the voice assistant is avoided.

Furthermore, the awakened assistant is switched by voice instruction, which solves the technical problem that existing electronic devices require the voice assistant to be enabled manually when switching between voice assistants, making switching inflexible.

Furthermore, the user can select one voice assistant from the plurality of voice assistants according to the preference and the acceptance of the user, so that the method can be suitable for different users, and the human-computer interaction is more interesting.

Furthermore, the microphone array collects an externally input instruction, so that the pickup distance is long, the noise suppression is good, and the collection effect is better.

It is to be understood that within the scope of the present invention, the above-described features of the present invention and those specifically described below (e.g., in the examples) may be combined with each other to form new or preferred embodiments. For reasons of space, they will not be described in detail.

Drawings

FIG. 1 is a flowchart illustrating a multi-voice assistant control method according to a first embodiment of the present invention.

FIG. 2 is a flowchart illustrating a multi-voice assistant control method according to a second embodiment of the present invention.

FIG. 3 is a flowchart illustrating a multi-voice assistant control method according to a third embodiment of the present invention.

FIG. 4 is a flowchart illustrating a multi-voice assistant control method according to a fourth embodiment of the present invention.

FIG. 5 is a flowchart illustrating a multi-voice assistant control method according to a fifth embodiment of the present invention.

FIG. 6 is a schematic structural diagram of a multi-voice assistant control apparatus according to a sixth embodiment of the present invention.

FIG. 7 is a schematic structural diagram of a multi-voice assistant control apparatus according to a seventh embodiment of the present invention.

FIG. 8 is a schematic structural diagram of a multi-voice assistant control apparatus according to an eighth embodiment of the present invention.

FIG. 9 is a schematic structural diagram of a multi-voice assistant control apparatus according to a ninth embodiment of the present invention.

FIG. 10 is a schematic structural diagram of a multi-voice assistant control system according to a tenth embodiment of the present invention.

Detailed Description

In the following description, numerous technical details are set forth in order to provide a better understanding of the present application. However, it will be understood by those skilled in the art that the technical solutions claimed in the present application may be implemented without these technical details and with various changes and modifications based on the following embodiments.

In order to make the objects, technical solutions and advantages of the present invention more apparent, embodiments of the present invention will be described in detail with reference to the accompanying drawings.

Interpretation of terms

Response process: as used herein, the "response process" refers to the audio process by which a voice assistant, after being awakened, answers (responds) according to the user's instruction. "Response process" and "audio process" are used interchangeably.

Listening state: as used herein, the "listening state" (also called the "monitoring state") refers to the state in which a voice assistant, before being awakened, waits to be activated by its wake-up word. A voice assistant in the listening state has no response process.

Working state: as used herein, the "working state" means that the audio process of the voice assistant is active.

Re-awakening: as used herein, "re-awakening" means that, while the audio process of the voice assistant is active, the voice assistant is activated again by the wake-up word and the active audio process is interrupted.

The first embodiment of the present invention relates to a multi-voice assistant control method, and fig. 1 is a flowchart of the method.

As shown in the figure, the method in the embodiment is applied to an electronic device integrating a plurality of voice assistants.

In step 100, external instructions are obtained by a plurality of voice assistants of the electronic equipment.

Thereafter, step 101 is entered: and the voice assistants judge to perform an interrupt response process or wake up according to the external instruction, wherein the wake-up functions of the voice assistants are all in an enabling state.

Preferably, step 101 comprises the steps of:

In this embodiment, the plurality of voice assistants currently on the electronic device obtain an external instruction, which may be voice information. Each voice assistant determines, according to the external instruction, whether to wake up or to interrupt its response process. For example, one voice assistant in the listening state (not awakened) wakes up according to the external instruction while another voice assistant in the working state interrupts its response process according to the same instruction. As another example, one voice assistant in the working state wakes up again (re-awakens) according to the external instruction while another voice assistant in the listening state (not awakened) performs the interrupt-response step, which for it simply means remaining in the listening state. In either case, multiple voice assistants never work at the same time, so no conflict (mutual interference) arises.
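The state transitions described in this embodiment can be illustrated with a minimal Python sketch. It is not the patented implementation: the `Assistant` class, the `dispatch` helper, the action labels and the substring test used to detect a wake-up word are all assumptions made for illustration.

```python
from dataclasses import dataclass
from enum import Enum


class State(Enum):
    LISTENING = "listening"  # waiting for a wake-up word, no response process
    WORKING = "working"      # audio (response) process is active


@dataclass
class Assistant:
    """Hypothetical stand-in for one integrated voice assistant."""
    name: str
    wake_word: str
    state: State = State.LISTENING

    def handle(self, instruction: str) -> str:
        """Decide between waking up and interrupting the response process."""
        if self.wake_word.lower() in instruction.lower():
            # Covers both a first wake-up and a re-wake-up.
            action = "re-wake" if self.state is State.WORKING else "wake"
            self.state = State.WORKING
        else:
            # A working assistant stops responding; a listening one stays put.
            action = "interrupt" if self.state is State.WORKING else "keep listening"
            self.state = State.LISTENING
        return action


def dispatch(assistants: list[Assistant], instruction: str) -> dict[str, str]:
    """Every assistant receives the same instruction (wake-up always enabled)."""
    return {a.name: a.handle(instruction) for a in assistants}
```

Because every assistant evaluates the same instruction and at most one wake-up word matches, at most one assistant ends up in the working state after each call to `dispatch`.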

A second embodiment of the present invention relates to a multi-voice assistant control method, and fig. 2 is a flowchart of the method.

In step 200, an externally input instruction is collected by a microphone array.

Thereafter, step 201 is entered: and acquiring external instructions by a plurality of current voice assistants of the electronic equipment.

Thereafter, step 202 is entered: the plurality of voice assistants input the external instruction.

Thereafter step 203 is entered: the voice assistants determine whether the external instruction is consistent with a first wakeup marker preset by each of the voice assistants, and if so, execute step 204: awakens the corresponding voice assistant, otherwise executes step 205: the response process of the corresponding voice assistant is interrupted.

Step 201 can implement the function of step 100, and step 202, step 203, step 204 and step 205 can implement the function of step 101.

In this embodiment, the microphone array may take various forms, such as linear, circular or spherical, for example a 2-microphone array, a 6+1 microphone array or an 8+1 microphone array. Such arrays offer a long pickup distance and good noise suppression, and therefore a better collection effect.

A third embodiment of the present invention relates to a multi-voice assistant control method, and fig. 3 is a flowchart of the method.

In step 300, an instruction input from the outside is collected by a microphone array.

Thereafter, step 301 is entered: and acquiring external instructions by a plurality of current voice assistants of the electronic equipment.

Thereafter, step 302 is entered: the plurality of voice assistants input the external instruction.

Thereafter, step 303 is entered: determining whether the external instruction includes a keyword associated with any of the plurality of voice assistant identifications, and if so, performing step 304: the voice assistants determine whether the external command is consistent with the respective preset first wake-up markers, otherwise, execute the above step 300.

Step 304: the voice assistants determine whether the external command is consistent with a respective preset first wake-up marker, and if so, execute step 305: awakening the corresponding voice assistant, otherwise executing step 306: the response process of the corresponding voice assistant is interrupted.

Step 300 may implement the function of step 200, step 301 may implement the function of step 201, step 302 may implement the function of step 202, step 304 may implement the function of step 203, step 305 may implement the function of step 204, and step 306 may implement the function of step 205.
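The gating behaviour of steps 303 to 306 can be sketched as follows. This is an illustrative sketch only: the `KEYWORDS` table, the function names and the use of plain text matching in place of real speech recognition are assumptions, not details given in the description.

```python
from typing import Optional

# Hypothetical keyword table: assistant identifier -> associated keyword.
KEYWORDS = {"assistant_a": "xiao a", "assistant_b": "xiao b"}


def find_trigger_keyword(instruction: str) -> Optional[str]:
    """Return the first registered keyword found in the instruction, if any."""
    text = instruction.lower()
    for keyword in KEYWORDS.values():
        if keyword in text:
            return keyword
    return None


def process(instruction: str) -> dict[str, str]:
    """Steps 303-306: gate on the trigger event, then wake or interrupt."""
    keyword = find_trigger_keyword(instruction)
    if keyword is None:
        # No voice trigger event: nothing changes, go back to collecting audio.
        return {}
    return {
        assistant: "wake" if own_keyword == keyword else "interrupt"
        for assistant, own_keyword in KEYWORDS.items()
    }
```

The point of the gate is that an instruction without any registered keyword leaves every assistant untouched, so ordinary speech does not interrupt an assistant that is currently responding.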

The fourth embodiment of the present invention relates to a multi-voice assistant control method, and fig. 4 is a flowchart of the method.

In step 400, an externally input instruction is collected by a microphone array.

Thereafter, step 401 is entered: and acquiring external instructions by a plurality of current voice assistants of the electronic equipment.

Thereafter, step 402 is entered: the plurality of voice assistants input the external instruction.

Thereafter, step 403 is entered: determining whether the external instruction includes a keyword associated with an identification of any of the plurality of voice assistants, and if so, performing step 404: and associating the keyword with a preset corresponding second awakening marker, otherwise, executing the step 400.

Step 404 is followed by step 405: the voice assistants determine whether the second wake-up markers are consistent with the respective preset first wake-up markers, and if so, execute step 406: awakening the corresponding voice assistant, otherwise executing step 407: the response process of the corresponding voice assistant is interrupted.

Step 400 may implement the function of step 300, step 401 may implement the function of step 301, step 402 may implement the function of step 302, step 403 may implement the function of step 303, steps 404 and 405 may implement the function of step 304, step 406 may implement the function of step 305, and step 407 may implement the function of step 306.

In this embodiment, each of the plurality of voice assistants has a corresponding wake-up word (keyword), and a preset correspondence exists between each voice assistant and its wake-up word (keyword). For example, when the external instruction received by the plurality of voice assistants contains the wake-up word of a certain voice assistant, that voice assistant is awakened and the other voice assistants interrupt their response processes.

The present embodiment corresponds to a scenario as follows:

The microphone array collects an externally input instruction containing "Xiao A". The voice assistants receive the instruction and determine that it contains a wake-up word (keyword), "Xiao A". The wake-up word is associated with its preset second wake-up marker (the second wake-up marker corresponding to "Xiao A" is 00, that corresponding to "Xiao B" is 01, and that corresponding to "Xiao C" is 02). Each voice assistant then compares the associated second wake-up marker (00) with its own preset first wake-up marker (00 for voice assistant A, 01 for voice assistant B, and 02 for voice assistant C). Voice assistant A finds that the second wake-up marker (00) is consistent with its preset first wake-up marker (00) and wakes up (it starts working if it was previously in the listening state, and re-awakens if it was previously in the working state). Voice assistant B and voice assistant C find that the second wake-up marker (00) is inconsistent with their preset first wake-up markers (01) and (02), and interrupt their response processes (each remains in the listening state if it was previously listening, and interrupts its response process if it was previously working).
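The marker comparison in this scenario can be written out as a short Python sketch. The marker values 00, 01 and 02 come from the scenario; the dictionary layout, the lower-case keyword strings, the assistant labels and the `resolve` function are assumptions made for illustration.

```python
# Second wake-up markers: looked up from the keyword heard in the instruction.
SECOND_MARKERS = {"xiao a": "00", "xiao b": "01", "xiao c": "02"}

# First wake-up markers: preset inside each voice assistant.
FIRST_MARKERS = {"assistant A": "00", "assistant B": "01", "assistant C": "02"}


def resolve(instruction: str) -> dict[str, str]:
    """Map the keyword to its second marker, then compare with each first marker."""
    text = instruction.lower()
    second = next((m for kw, m in SECOND_MARKERS.items() if kw in text), None)
    if second is None:
        return {}  # no keyword, so no voice trigger event
    return {
        name: "wake" if first == second else "interrupt"
        for name, first in FIRST_MARKERS.items()
    }
```

Calling `resolve("Xiao A")` yields "wake" for assistant A and "interrupt" for assistants B and C, matching the outcome described in the scenario.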

For example, the wake-up word (keyword) of the Baidu voice assistant is "Xiaodu" and the wake-up word (keyword) of the Amazon voice assistant is "Alexa", and both assistants are integrated in the electronic device. When the user first issues a "Xiaodu" instruction to the electronic device, the Baidu voice assistant is awakened and the Amazon voice assistant remains in the listening state; when the user then issues an "Alexa" instruction, the Baidu voice assistant interrupts its response process and the Amazon voice assistant is awakened. Alternatively, when the user first issues a "Xiaodu" instruction, the Baidu voice assistant is awakened and the Amazon voice assistant remains in the listening state; when the user then issues a "Xiaodu" instruction again, the Baidu voice assistant is re-awakened and the Amazon voice assistant remains in the listening state.

It should be noted that the wake-up word (keyword) of a voice assistant may be the name of that voice assistant, or a word or sentence defined by the user according to actual needs. If a user-defined wake-up word has no preset correspondence with any voice assistant, and therefore does not designate which voice assistant to wake, the system may directly wake the default voice assistant or prompt the user to choose which voice assistant to wake.
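The fallback for a user-defined wake-up word with no preset correspondence might look like the sketch below. The registry contents, the phrase "hello house", the choice of default assistant and the `prompt_user` flag are all hypothetical; the description only states that the system may wake a default assistant or ask the user.

```python
from typing import Optional

# Hypothetical registry: wake-up word -> assistant it is bound to (None = unbound).
WAKE_WORDS: dict[str, Optional[str]] = {
    "xiaodu": "baidu",
    "alexa": "amazon",
    "hello house": None,  # user-defined phrase with no preset correspondence
}
DEFAULT_ASSISTANT = "baidu"  # assumed system default


def choose_assistant(wake_word: str, prompt_user: bool = False) -> Optional[str]:
    """Return the assistant to wake, or None when the user must be asked."""
    key = wake_word.lower()
    if key not in WAKE_WORDS:
        raise KeyError(f"unknown wake-up word: {wake_word!r}")
    bound = WAKE_WORDS[key]
    if bound is not None:
        return bound
    # Unbound custom word: either defer to the user or fall back to the default.
    return None if prompt_user else DEFAULT_ASSISTANT
```

Returning `None` here stands for "prompt the user to choose"; how that prompt is presented is outside the scope of the sketch.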

With this method, the user can switch by voice among the voice assistants at any time, and the voice assistants are never in the awakened working state at the same time, which avoids mutual interference and a degraded user experience. In addition, the preset wake-up words can be customized by the user, which enriches the ways the voice assistants can be used and avoids a single role for the voice assistant. The awakened assistant is switched by voice instruction, which solves the technical problem that existing electronic devices require the voice assistant to be enabled manually when switching, making switching inflexible.

A fifth embodiment of the present invention relates to a multi-voice assistant control method, and fig. 5 is a flowchart of the method.

In step 500, external instructions are obtained by a plurality of voice assistants present in the electronic device.

Thereafter, step 501 is entered: and the voice assistants judge to perform an interrupt response process or wake up according to the external instruction.

Step 502 is then entered: and the voice assistant which judges to be awakened sends the external instruction to the cloud server through the network for processing.

Thereafter, step 503 is entered: and responding the result processed by the cloud server by the voice assistant which is judged to be awakened.

Step 500 may implement the function of step 100, and step 501 may implement the function of step 101.

The present embodiment corresponds to a scenario as follows:

The Baidu voice assistant and the Amazon voice assistant are integrated in the electronic device (the wake-up word (keyword) of the Baidu voice assistant is "Xiaodu", and the wake-up word (keyword) of the Amazon voice assistant is "Alexa").

The user first issues the instruction "Xiaodu, Shanghai weather today" to the electronic device. The Baidu voice assistant is awakened (the Amazon voice assistant keeps listening) and uploads the instruction to the cloud server through the network; the cloud server processes the instruction and sends the result (which may be a voice packet) back to the Baidu voice assistant, which responds with the processed result (announcing "the Shanghai weather is cloudy, 25 degrees"). The user then issues the instruction "Alexa, what's the weather of Shanghai today" to the electronic device. The Amazon voice assistant is awakened (the Baidu voice assistant interrupts its earlier audio/response process) and uploads the instruction to the cloud server through the network; the cloud server processes the instruction and sends the result (which may be a voice packet) back to the Amazon voice assistant, which responds with the processed result (announcing "the weather of Shanghai is cloudy").
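The upload-and-respond flow of steps 500 to 503 can be sketched as below. The cloud server is replaced by a local placeholder function, and the names `fake_cloud`, `handle` and the `WAKE_WORDS` table are assumptions; no real vendor API is used or implied.

```python
from typing import Callable, Optional

# Hypothetical wake-up words for the two assistants in the scenario.
WAKE_WORDS = {"baidu": "xiaodu", "amazon": "alexa"}


def fake_cloud(assistant: str, instruction: str) -> str:
    """Placeholder for the network round trip; a real server might return audio."""
    return f"[{assistant}] result for: {instruction}"


def handle(
    instruction: str,
    cloud: Callable[[str, str], str] = fake_cloud,
) -> Optional[tuple[str, str]]:
    """Steps 500-503: only the awakened assistant uploads and responds."""
    text = instruction.lower()
    for assistant, wake_word in WAKE_WORDS.items():
        if wake_word in text:
            # Step 502: upload the instruction; step 503: respond with the result.
            return assistant, cloud(assistant, instruction)
    return None  # no assistant was awakened, so nothing is uploaded
```

Passing the cloud call as a parameter keeps the sketch testable offline; a real device would substitute the network client of whichever assistant was awakened.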

By adopting the method, the user can select a voice assistant from a plurality of voice assistants to ask and answer (QA) according to the preference and the acceptance of the user, and the method can be suitable for different users.

A sixth embodiment of the present invention relates to a multi-voice assistant control apparatus, and fig. 6 is a schematic structural view of the apparatus.

As shown in the drawing, the apparatus in this embodiment is applied to an electronic device integrating a plurality of voice assistants.

The device comprises:

an obtaining module 600, configured to obtain an external instruction through a plurality of current voice assistants of the electronic device;

a determining module 601, configured to determine, by the voice assistants according to the external instruction, to perform an interrupt response process or wake up, where wake-up functions of the voice assistants are all in an enabled state.

The first embodiment is a method embodiment corresponding to the present embodiment, and the present embodiment can be implemented in cooperation with the first embodiment. The related technical details mentioned in the first embodiment remain valid in this embodiment and are not repeated here. Accordingly, the related technical details mentioned in the present embodiment can also be applied to the first embodiment.

The seventh embodiment of the present invention relates to a multi-voice assistant control apparatus, and fig. 7 is a schematic structural view of the apparatus.

As shown in the drawings, the apparatus in this embodiment is applied to an electronic device integrating a plurality of voice assistants, and this embodiment is improved from the sixth embodiment in that:

in this embodiment, the apparatus further comprises:

the microphone array acquisition module has the advantages of long pickup distance, good noise suppression and better acquisition effect;

the determination module includes: an input sub-module operable for the plurality of voice assistants to input the external instruction; the wake-up judging sub-module may be configured to judge, by the voice assistants, whether the external instruction is consistent with a first wake-up marker preset in each of the voice assistants; a wake-up sub-module operable to wake up a corresponding voice assistant when the external instruction is consistent with the first wake-up marker; and the interrupt submodule can be used for interrupting the response process of the corresponding voice assistant when the external instruction is inconsistent with the first wake-up marker.

Specifically, the method comprises the following steps:

the device comprises:

a microphone array collecting module 700 for collecting an externally input instruction through a microphone array;

an obtaining module 701, configured to obtain an external instruction through a plurality of current voice assistants of the electronic device;

a determining module 702, configured to determine, by the voice assistants according to the external instruction, to perform an interrupt response process or wake up, where wake-up functions of the voice assistants are all in an enabled state.

Wherein, the determining module 702 includes:

an input sub-module 703 for the plurality of voice assistants to input the external instruction;

a wake-up determining sub-module 704, configured to determine, by the voice assistants, whether the external instruction is consistent with a first wake-up marker preset in each of the voice assistants;

a wake-up sub-module 705, configured to wake up a corresponding voice assistant when the external instruction is consistent with the first wake-up marker;

and an interrupt sub-module 706, configured to interrupt a response process of the corresponding voice assistant when the external instruction is inconsistent with the first wake-up marker.
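One way to picture how sub-modules 703 to 706 cooperate inside the determining module 702 is the class sketch below. It is a loose illustration: the class and method names are invented, and the first wake-up marker is modelled simply as a lower-case wake phrase, which the description does not specify.

```python
class DeterminingModule:
    """Sketch of module 702; methods loosely mirror sub-modules 703-706."""

    def __init__(self, first_markers: dict[str, str]) -> None:
        self.first_markers = first_markers  # assistant -> preset first marker
        self.awake: set[str] = set()        # assistants in the working state

    def receive(self, instruction: str) -> str:
        """Input sub-module 703: normalise the incoming instruction."""
        return instruction.strip().lower()

    def matches(self, assistant: str, instruction: str) -> bool:
        """Wake-up determining sub-module 704: compare with the first marker."""
        return self.first_markers[assistant] in instruction

    def wake(self, assistant: str) -> None:
        """Wake-up sub-module 705."""
        self.awake.add(assistant)

    def interrupt(self, assistant: str) -> None:
        """Interrupt sub-module 706."""
        self.awake.discard(assistant)

    def run(self, raw_instruction: str) -> set[str]:
        """Route one instruction through all four sub-modules."""
        instruction = self.receive(raw_instruction)
        for assistant in self.first_markers:
            if self.matches(assistant, instruction):
                self.wake(assistant)
            else:
                self.interrupt(assistant)
        return set(self.awake)
```

For example, a module built with `{"A": "xiao a", "B": "xiao b"}` reports only A as awake after "Xiao A, hi" and only B after a following "Xiao B, hi".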

The second embodiment is a method embodiment corresponding to the present embodiment, and the present embodiment can be implemented in cooperation with the second embodiment. The related technical details mentioned in the second embodiment remain valid in this embodiment and are not repeated here. Accordingly, the related technical details mentioned in the present embodiment can also be applied to the second embodiment.

An eighth embodiment of the present invention relates to a multi-voice assistant control apparatus, and fig. 8 is a schematic structural view of the apparatus.

As shown in the figure, the apparatus in this embodiment is applied to an electronic device integrating a plurality of voice assistants. This embodiment improves on the seventh embodiment in that:

in this embodiment, the determining module further includes a trigger event judgment sub-module, configured to judge whether the external instruction contains a voice trigger event and thereby decide whether the wake-up determining sub-module is triggered to work.

Specifically, the device comprises:

a microphone array collecting module 800 for collecting an externally input instruction through a microphone array;

an obtaining module 801, configured to obtain an external instruction through a plurality of current voice assistants of the electronic device;

a determining module 802, configured to have the voice assistants determine, according to the external instruction, whether to perform an interrupt response process or to wake up, wherein the wake-up functions of the voice assistants are all in an enabled state.

Wherein, the determining module 802 includes:

an input sub-module 803 for the plurality of voice assistants to input the external instruction;

a trigger event judgment sub-module 804, configured to judge whether the external instruction includes a voice trigger event;

a wake-up determining sub-module 807, configured to, when the external instruction includes the voice trigger event, determine, by the multiple voice assistants, whether the external instruction is consistent with a first wake-up marker preset in each of the multiple voice assistants;

a wake-up sub-module 808, configured to wake up a corresponding voice assistant when the external instruction is consistent with the first wake-up marker;

and an interrupt sub-module 809, configured to interrupt the response process of the corresponding voice assistant when the external instruction is inconsistent with the first wake-up marker.

The trigger event judgment sub-module 804 includes:

a keyword determination sub-module 805, configured to determine whether the external instruction includes a keyword associated with the identifier of any of the plurality of voice assistants;

a trigger event confirmation sub-module 806, configured to determine that the external instruction contains the voice trigger event when the external instruction includes a keyword associated with the identifier of any of the plurality of voice assistants.
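As a rough illustration of how sub-modules 805 and 806 might gate the wake-up judgment, the following Python sketch treats the external instruction as already-transcribed text. The keyword table, the function names, and substring matching are hypothetical choices; the patent does not specify how keywords are detected.

```python
from typing import Dict, Optional

# Hypothetical table: keyword -> identifier of the voice assistant it is associated with.
KEYWORDS: Dict[str, str] = {
    "hello alpha": "assistant_alpha",
    "hello beta": "assistant_beta",
}


def find_keyword(external_instruction: str, keywords: Dict[str, str]) -> Optional[str]:
    """Keyword determination: return the first known keyword found in the instruction."""
    text = external_instruction.lower()
    for keyword in keywords:
        if keyword in text:
            return keyword
    return None


def contains_voice_trigger_event(external_instruction: str, keywords: Dict[str, str]) -> bool:
    """Trigger event confirmation: a trigger event exists only if a keyword is present."""
    return find_keyword(external_instruction, keywords) is not None
```

Only when `contains_voice_trigger_event` returns `True` would the wake-up determining sub-module run; ordinary speech without a keyword leaves every assistant's state untouched in this sketch.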

The third embodiment is the method embodiment corresponding to the present embodiment, and the two can be implemented in cooperation with each other. The technical details described in the third embodiment remain valid in this embodiment and are not repeated here. Likewise, the technical details described in this embodiment also apply to the third embodiment.

The ninth embodiment of the present invention relates to a multi-voice assistant control apparatus, and fig. 9 is a schematic structural view of the apparatus.

As shown in the figure, the apparatus in this embodiment is applied to an electronic device integrating a plurality of voice assistants. This embodiment improves on the eighth embodiment in that:

in this embodiment, the wake-up determining sub-module includes:

an association sub-module, configured to associate the keyword with a preset corresponding second wake-up marker when the external instruction is determined to contain the voice trigger event;

and a marker judging sub-module, configured to judge, by the voice assistants, whether the second wake-up marker is consistent with the first wake-up marker preset in each of the voice assistants.

Specifically, the device comprises:

a microphone array collecting module 900 for collecting an externally input instruction through a microphone array;

an obtaining module 901, configured to obtain an external instruction through a plurality of current voice assistants of the electronic device;

a determining module 902, configured to have the voice assistants determine, according to the external instruction, whether to perform an interrupt response process or to wake up, wherein the wake-up functions of the voice assistants are all in an enabled state.

Wherein, the determining module 902 includes:

an input sub-module 903 for the plurality of voice assistants to input the external instruction;

a trigger event determining submodule 904, configured to determine whether the external instruction includes a voice trigger event;

a wake-up determining sub-module 907, configured to, when the external instruction includes the voice trigger event, determine, by the multiple voice assistants, whether the external instruction is consistent with a first wake-up marker preset in each of the multiple voice assistants;

a wake-up sub-module 910, configured to wake up a corresponding voice assistant when the external instruction is consistent with the first wake-up marker;

and an interrupt sub-module 911, configured to interrupt the response process of the corresponding voice assistant when the external instruction is inconsistent with the first wake-up marker.

The trigger event determining sub-module 904 includes:

a keyword determination sub-module 905, configured to determine whether the external instruction includes a keyword associated with the identifier of any of the plurality of voice assistants;

a trigger event confirmation sub-module 906, configured to determine that the external instruction contains the voice trigger event when the external instruction includes a keyword associated with the identifier of any of the plurality of voice assistants.

The wake-up determination sub-module 907 includes:

an association submodule 908, configured to associate the keyword with a preset corresponding second wake-up marker when it is determined that the external instruction includes the voice trigger event;

a marker judging sub-module 909, configured to judge, by the voice assistants, whether the second wake-up marker is consistent with the first wake-up marker preset in each of the voice assistants.
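The two-step comparison performed by sub-modules 908 and 909 can be sketched as follows. The marker values, the dictionaries, and the function names are assumptions for illustration only; the patent does not define the concrete form of the first and second wake-up markers.

```python
from typing import Dict

# Hypothetical presets: detected keyword -> second wake-up marker.
SECOND_MARKERS: Dict[str, str] = {
    "hello alpha": "MARKER_ALPHA",
    "hello beta": "MARKER_BETA",
}

# Hypothetical presets: assistant name -> first wake-up marker.
FIRST_MARKERS: Dict[str, str] = {
    "alpha": "MARKER_ALPHA",
    "beta": "MARKER_BETA",
}


def associate(keyword: str) -> str:
    """Association sub-module: map the detected keyword to its second wake-up marker."""
    return SECOND_MARKERS[keyword]


def judge_markers(second_marker: str, first_markers: Dict[str, str]) -> Dict[str, str]:
    """Marker judging sub-module: each assistant compares the second marker with its own first marker."""
    return {
        name: "wake" if first_marker == second_marker else "interrupt"
        for name, first_marker in first_markers.items()
    }
```

With these presets, `judge_markers(associate("hello beta"), FIRST_MARKERS)` yields `"wake"` for `beta` and `"interrupt"` for `alpha`, so at most the addressed assistant enters the woken state.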

The fourth embodiment is the method embodiment corresponding to the present embodiment, and the two can be implemented in cooperation with each other. The technical details described in the fourth embodiment remain valid in this embodiment and are not repeated here. Likewise, the technical details described in this embodiment also apply to the fourth embodiment.

The tenth embodiment of the present invention relates to a multi-voice assistant control system, and fig. 10 is a schematic structural view of the system.

As shown in the figure, the system in this embodiment includes: cloud server 1000 and a multi-voice assistant control device.

The multi-voice assistant control device of this embodiment improves on the multi-voice assistant control device of the sixth embodiment, the main improvement being that:

the device further includes:

a transceiver module, configured to send the external instruction to the cloud server and/or receive the result processed by the cloud server, through the voice assistant determined to be woken up;

and a response module, configured to respond, through the voice assistant determined to be woken up, to the result received by the transceiver module and processed by the cloud server.

Specifically, in this embodiment, the apparatus includes:

an obtaining module 1001, configured to obtain an external instruction through a plurality of current voice assistants of the electronic device;

a determining module 1002, configured to have the multiple voice assistants determine, according to the external instruction, whether to perform an interrupt response process or to wake up, wherein the wake-up functions of the multiple voice assistants are all in an enabled state;

a transceiver module 1003, configured to send the external instruction to the cloud server 1000 and/or receive the result processed by the cloud server 1000, through the voice assistant determined to be woken up;

and a response module 1004, configured to respond, through the voice assistant determined to be woken up, to the result received by the transceiver module 1003 and processed by the cloud server 1000.
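The interaction between the transceiver module 1003, the response module 1004, and the cloud server 1000 might look like the Python sketch below. Everything here is a hypothetical stand-in: `fake_cloud_server` replaces a real network call, and the class and function names are invented for illustration rather than drawn from the patent.

```python
from typing import Callable


def fake_cloud_server(external_instruction: str) -> str:
    """Stand-in for cloud server 1000; a real system would send the request over a network."""
    return f"processed: {external_instruction}"


class TransceiverModule:
    """Sends the instruction on behalf of the woken assistant and returns the processed result."""

    def __init__(self, send: Callable[[str], str]) -> None:
        self._send = send

    def exchange(self, external_instruction: str) -> str:
        return self._send(external_instruction)


class ResponseModule:
    """Lets the woken assistant respond to the result obtained from the cloud server."""

    def respond(self, assistant_name: str, result: str) -> str:
        return f"[{assistant_name}] {result}"


def handle(assistant_name: str, external_instruction: str,
           transceiver: TransceiverModule, responder: ResponseModule) -> str:
    """Only the assistant determined to be woken up reaches this path."""
    result = transceiver.exchange(external_instruction)
    return responder.respond(assistant_name, result)
```

Passing the send function into `TransceiverModule` keeps the sketch testable without a network; an actual device would supply a function that contacts the relevant assistant's cloud service.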

The fifth embodiment is the method embodiment corresponding to the present embodiment, and the two can be implemented in cooperation with each other. The technical details described in the fifth embodiment remain valid in this embodiment and are not repeated here. Likewise, the technical details described in this embodiment also apply to the fifth embodiment.

An embodiment of the present invention further relates to a multi-voice assistant control apparatus applied to an electronic device integrating a plurality of voice assistants, including:

a processor;

a memory for storing the processor-executable instructions;

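wherein the processor is configured to:

acquire external instructions through a plurality of current voice assistants of the electronic device;

and have the voice assistants determine, according to the external instruction, whether to perform an interrupt response process or to wake up, wherein the wake-up functions of the voice assistants are all in an enabled state.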

An embodiment of the invention also relates to a computer-readable storage medium, on which computer program instructions are stored, which program instructions, when executed by a processor, implement the steps of the method described in the first embodiment.

The method embodiments of the present invention may be implemented in software, hardware, firmware, and the like. Regardless of whether the invention is implemented in software, hardware, or firmware, the instruction code may be stored in any type of computer-accessible memory (e.g., permanent or modifiable, volatile or non-volatile, solid-state or non-solid-state, fixed or removable media). The memory may be, for example, Programmable Array Logic (PAL), Random Access Memory (RAM), Programmable Read-Only Memory (PROM), Read-Only Memory (ROM), Electrically Erasable Programmable Read-Only Memory (EEPROM), a magnetic disk, an optical disk, a Digital Versatile Disc (DVD), or the like.

It should be noted that each module in the device embodiments of the present invention is a logic module. Physically, a logic module may be a single physical module, part of a physical module, or a combination of several physical modules. The physical implementation of the logic modules is not what matters most; the combination of functions they implement is the key to solving the technical problem addressed by the present invention. Furthermore, to highlight the innovative part of the invention, the device embodiments above do not introduce modules that are less closely related to solving that technical problem; this does not mean that the device embodiments contain no other modules.

It is noted that, in the specification of the present patent, relational terms such as "first" and "second" are used solely to distinguish one entity or action from another, without necessarily requiring or implying any actual relationship or order between such entities or actions. The terms "comprises," "comprising," or any other variation thereof are intended to cover a non-exclusive inclusion, such that a process, method, article, or apparatus that comprises a list of elements includes not only those elements but may also include other elements not expressly listed or inherent to such process, method, article, or apparatus. Without further limitation, an element defined by the phrase "comprising a" does not exclude the presence of additional identical elements in the process, method, article, or apparatus that comprises the element.

While the invention has been shown and described with reference to certain preferred embodiments thereof, it will be understood by those skilled in the art that various changes in form and details may be made therein without departing from the spirit and scope of the invention.

Claims

A multi-voice assistant control method applied to an electronic device integrating a plurality of voice assistants, the method comprising the steps of:

acquiring external instructions through a plurality of current voice assistants of the electronic device;

and determining, by the voice assistants according to the external instruction, whether to perform an interrupt response process or to wake up, wherein the wake-up functions of the voice assistants are all in an enabled state.
The method as claimed in claim 1, wherein the step in which the voice assistants determine, according to the external instruction, whether to perform an interrupt response process or to wake up comprises the steps of:

the plurality of voice assistants input the external instruction;

the voice assistants judge whether the external instruction is consistent with a first wake-up marker preset in each of the voice assistants;

and if the external instruction is consistent with the first wake-up marker, waking up the corresponding voice assistant; otherwise, interrupting the response process of the corresponding voice assistant.
The method according to claim 2, wherein the step in which the voice assistants judge whether the external instruction is consistent with their respective preset first wake-up markers comprises the steps of:

judging whether the external instruction contains a voice trigger event;

and if the external instruction contains the voice trigger event, executing the step in which the plurality of voice assistants judge whether the external instruction is consistent with their respective preset first wake-up markers.
The method of claim 3, wherein the step of judging whether the external instruction contains a voice trigger event comprises the steps of:

determining whether the external instruction includes a keyword associated with an identification of any of the plurality of voice assistants;

and determining that the external instruction contains the voice trigger event if a keyword associated with an identification of any of the plurality of voice assistants is included in the external instruction.
The method according to claim 4, wherein the step in which the voice assistants judge whether the external instruction is consistent with their respective preset first wake-up markers comprises the steps of:

associating the keyword with a preset corresponding second wake-up marker when it is determined that the external instruction includes the voice trigger event;

the voice assistants judge whether the second wake-up marker is consistent with the first wake-up marker preset in each of the voice assistants;

and if the second wake-up marker is consistent with the respective preset first wake-up marker, executing the step of waking up the corresponding voice assistant; otherwise, executing the step of interrupting the response process of the corresponding voice assistant.
The method as claimed in claim 1, wherein the step in which the voice assistants determine, according to the external instruction, whether to perform an interrupt response process or to wake up comprises the steps of:

one voice assistant determines to wake up according to the external instruction while, at the same time, the other voice assistants determine to perform the interrupt response process according to the external instruction.
The method according to claim 1, wherein the step of acquiring external instructions through the plurality of current voice assistants of the electronic device comprises the steps of:

acquiring an externally input instruction through a microphone array.
The method as claimed in claim 1, wherein the step in which the voice assistants determine, according to the external instruction, whether to perform an interrupt response process or to wake up comprises the steps of:

the voice assistant determined to be woken up sends the external instruction to a cloud server through a network for processing;

and the voice assistant determined to be woken up responds to the result processed by the cloud server.
A multi-voice assistant control apparatus applied to an electronic device integrating a plurality of voice assistants, the apparatus comprising:

an acquisition module, configured to acquire external instructions through a plurality of current voice assistants of the electronic device;

and a determining module, configured to have the voice assistants determine, according to the external instruction, whether to perform an interrupt response process or to wake up, wherein the wake-up functions of the voice assistants are all in an enabled state.
The apparatus of claim 9, wherein the determining module comprises:

an input sub-module for the plurality of voice assistants to input the external instruction;

a wake-up determining sub-module, configured to judge whether the external instruction is consistent with a first wake-up marker preset in each of the voice assistants;

a wake-up sub-module, configured to wake up the corresponding voice assistant when the external instruction is consistent with the first wake-up marker;

and an interrupt sub-module, configured to interrupt the response process of the corresponding voice assistant when the external instruction is inconsistent with the first wake-up marker.
The apparatus of claim 10, wherein the determining module comprises:

a trigger event judgment sub-module, configured to judge whether the external instruction contains a voice trigger event;

and the wake-up determining sub-module is configured to judge, by the plurality of voice assistants, whether the external instruction is consistent with their respective preset first wake-up markers when the external instruction contains the voice trigger event.
The apparatus of claim 11, wherein the trigger event judgment sub-module comprises:

a keyword judgment sub-module, configured to judge whether the external instruction includes a keyword associated with the identifier of any of the plurality of voice assistants;

and a trigger event confirmation sub-module, configured to determine that the external instruction contains the voice trigger event when the external instruction includes a keyword associated with the identifier of any of the plurality of voice assistants.
The apparatus of claim 12, wherein the wake-up determining sub-module comprises:

an association sub-module, configured to associate the keyword with a preset corresponding second wake-up marker when the external instruction is determined to contain the voice trigger event;

a marker judgment sub-module, configured to judge, by the multiple voice assistants, whether the second wake-up marker is consistent with a first wake-up marker preset in each of the multiple voice assistants;

and the determining module comprises:

a wake-up sub-module, configured to wake up the corresponding voice assistant when the second wake-up marker is consistent with the respective preset first wake-up marker;

and an interrupt sub-module, configured to interrupt the response process of the corresponding voice assistant when the second wake-up marker is inconsistent with the respective preset first wake-up marker.
The apparatus of claim 9, wherein the apparatus comprises:

a microphone array acquisition module, configured to acquire an externally input instruction through a microphone array.
A multi-voice assistant control system, the system comprising:

a cloud server and the multi-voice assistant control apparatus of claim 9;

the multi-voice assistant control apparatus includes:

a transceiver module, configured to send the external instruction to the cloud server and/or receive the result processed by the cloud server, through the voice assistant determined to be woken up;

and a response module, configured to respond, through the voice assistant determined to be woken up, to the result received by the transceiver module and processed by the cloud server.
A multi-voice assistant control apparatus applied to an electronic device integrating a plurality of voice assistants, comprising:

a processor;

a memory for storing the processor-executable instructions;

wherein the processor is configured to:

acquire external instructions through a plurality of current voice assistants of the electronic device;

and have the voice assistants determine, according to the external instruction, whether to perform an interrupt response process or to wake up, wherein the wake-up functions of the voice assistants are all in an enabled state.
A computer-readable storage medium, on which computer program instructions are stored, which program instructions, when executed by a processor, carry out the steps of the method as claimed in claim 1.