CN111954868A - Multi-voice assistant control method, device, system and computer readable storage medium - Google Patents

Info

Publication number
CN111954868A
Authority
CN
China
Prior art keywords
voice
external instruction
wake
assistants
awakening
Prior art date
Legal status (The legal status is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the status listed.)
Pending
Application number
CN201880091294.XA
Other languages
Chinese (zh)
Inventor
董学章
Current Assignee (The listed assignees may be inaccurate. Google has not performed a legal analysis and makes no representation or warranty as to the accuracy of the list.)
Jiangsu Shushi Technology Co ltd
Original Assignee
Jiangsu Shushi Technology Co ltd
Priority date (The priority date is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the date listed.)
Filing date
Publication date
Application filed by Jiangsu Shushi Technology Co ltd
Publication of CN111954868A

Classifications

    • G - PHYSICS
    • G06 - COMPUTING; CALCULATING OR COUNTING
    • G06F - ELECTRIC DIGITAL DATA PROCESSING
    • G06F9/00 - Arrangements for program control, e.g. control units
    • G06F9/06 - Arrangements for program control, e.g. control units using stored programs, i.e. using an internal store of processing equipment to receive or retain programs
    • G06F9/46 - Multiprogramming arrangements
    • G06F9/48 - Program initiating; Program switching, e.g. by interrupt

Landscapes

  • Engineering & Computer Science (AREA)
  • Software Systems (AREA)
  • Theoretical Computer Science (AREA)
  • Physics & Mathematics (AREA)
  • General Engineering & Computer Science (AREA)
  • General Physics & Mathematics (AREA)
  • User Interface Of Digital Computer (AREA)

Abstract

The invention discloses a multi-voice assistant control method, device, system and computer readable storage medium. The multi-voice assistant control method is applied to an electronic device integrating a plurality of voice assistants and comprises the following steps: acquiring an external instruction through the plurality of voice assistants currently on the electronic device; and each of the voice assistants determining, according to the external instruction, whether to interrupt its response process or to wake up, wherein the wake-up functions of the voice assistants are all in an enabled state. The invention allows voice switching among the plurality of voice assistants at any time, and the voice assistants are never in the awakened working state at the same time, which avoids mutual interference and its negative effect on user experience.

Description

Multi-voice assistant control method, device, system and computer readable storage medium
Technical Field
The present invention relates to the field of speech technology, and in particular, to a multi-speech assistant control method, apparatus, system, and computer-readable storage medium.
Background
With the development of speech recognition technology and the growing demand for voice interaction, intelligent voice assistant applications have emerged. In the related art, a voice assistant can hold an intelligent dialog or answer questions instantly to help the user solve a problem. However, existing electronic devices usually have only one built-in voice assistant. Even when several voice assistants are built in, one voice assistant may wake up others while it is working, so that multiple voice assistants work simultaneously, conflict (interfere) with each other, and degrade the user experience.
Disclosure of Invention
The invention aims to provide a multi-voice assistant control method, a device, a system and a computer readable storage medium, which are used for solving the problems that a plurality of voice assistants of the existing electronic equipment work simultaneously to cause conflict (mutual interference) and the user experience is not ideal.
In the present invention, a first aspect provides a multi-voice assistant control method, which is applied to an electronic device integrating multiple voice assistants, and comprises the following steps:
acquiring external instructions by a plurality of current voice assistants of the electronic equipment;
and the voice assistants judge to perform an interrupt response process or wake up according to the external instruction, wherein the wake-up functions of the voice assistants are all in an enabling state.
Preferably, the step "the voice assistants determine to perform an interrupt response process or wake up according to the external instruction" includes the steps of:
the plurality of voice assistants input the external instruction;
the voice assistants judge whether the external instruction is consistent with a first awakening marker preset by each voice assistant;
and if the external instruction is consistent with the first wake-up marker, waking up the corresponding voice assistant, and otherwise, interrupting the response process of the corresponding voice assistant.
Preferably, the step of "the voice assistants determine whether the external command is consistent with the respective preset first wake-up markers" includes the steps of:
judging whether the external instruction contains a voice trigger event or not;
if the external instruction contains the voice trigger event, the step of 'the plurality of voice assistants judging whether the external instruction is consistent with the respective preset first wake-up markers' is executed.
Preferably, the step of determining whether the external command includes a voice trigger event includes the steps of:
determining whether the external instruction includes a keyword associated with an identification of any of the plurality of voice assistants;
determining that the external instruction contains the voice-triggered event if a keyword associated with an identification of any of the plurality of voice assistants is included in the external instruction.
Preferably, the step of "the voice assistants determine whether the external command is consistent with the respective preset first wake-up markers" includes the steps of:
associating the keyword with a preset corresponding second wake-up marker when it is determined that the external instruction includes the voice trigger event;
the voice assistants judge whether the second awakening marker is consistent with a first awakening marker preset by each voice assistant;
if the second awakening marker is consistent with the first awakening marker preset by the user, executing the step of awakening the corresponding voice assistant, and otherwise, executing the step of interrupting the response process of the corresponding voice assistant.
Preferably, the step "the voice assistants determine to perform an interrupt response process or wake up according to the external instruction" includes the steps of:
one voice assistant is judged to wake up according to the external instruction, and meanwhile, other voice assistants are judged to perform an interrupt response process according to the external instruction.
Preferably, the step of "obtaining external instructions by a plurality of voice assistants of the electronic equipment" includes the following steps:
and acquiring an externally input instruction through a microphone array.
Preferably, the step "the voice assistants determine to perform an interrupt response process or wake up according to the external instruction" includes the steps of:
the voice assistant which judges to be awakened sends the external instruction to a cloud server through a network for processing;
and responding the result processed by the cloud server by the voice assistant which is judged to be awakened.
A second aspect of the present invention provides a multi-voice assistant control device, which is applied to an electronic device integrating a plurality of voice assistants and comprises:
the acquisition module is used for acquiring external instructions through a plurality of current voice assistants of the electronic equipment;
and the judging module is used for judging whether to perform interrupt response process or wake up according to the external instruction by the voice assistants, wherein the wake-up functions of the voice assistants are all in an enabling state.
Preferably, the determination module includes:
an input sub-module for the plurality of voice assistants to input the external instruction;
the awakening judgment sub-module is used for judging whether the external instruction is consistent with a first awakening marker preset by each voice assistant;
the awakening sub-module is used for awakening the corresponding voice assistant when the external instruction is consistent with the first awakening marker;
and the interrupt submodule is used for interrupting the response process of the corresponding voice assistant when the external instruction is inconsistent with the first wake-up marker.
Preferably, the determination module includes:
a trigger event judging submodule for judging whether the external instruction contains a voice trigger event,
and the awakening judgment sub-module is used for judging whether the external instruction is consistent with the respective preset first awakening markers or not by the plurality of voice assistants when the external instruction contains the voice trigger event.
Preferably, the triggering event determining sub-module includes:
a keyword judgment sub-module, configured to judge whether the external instruction includes a keyword associated with any of the plurality of voice assistant identifiers;
and the trigger event confirmation sub-module is used for determining that the external instruction contains the voice trigger event when the external instruction comprises a keyword associated with any one of the voice assistant identifiers.
Preferably, the wake-up determining sub-module includes:
the association submodule is used for associating the keyword with a preset corresponding second awakening marker when the external instruction is determined to contain the voice trigger event;
a marker judgment sub-module, configured to judge, by the multiple voice assistants, whether the second wake-up marker is consistent with a first wake-up marker preset in each of the multiple voice assistants;
the determination module includes:
the awakening sub-module is used for awakening the corresponding voice assistant when the second awakening marker is consistent with the respective preset first awakening marker;
and the interruption submodule is used for interrupting the response process of the corresponding voice assistant when the second awakening marker is inconsistent with the respective preset first awakening marker.
Preferably, the apparatus comprises:
and the microphone array acquisition module is used for acquiring an externally input instruction through the microphone array.
A third aspect of the present invention provides a multi-voice assistant control system, the system comprising:
a cloud server and the multi-voice assistant control device of the second aspect;
the multi-voice assistant control apparatus includes:
the receiving and sending module is used for sending an external instruction to the cloud server and/or receiving a result processed by the cloud server through the voice assistant judged to be awakened;
and the response module is used for responding to the result received by the transceiver module and processed by the cloud server through the judged and awakened voice assistant.
The invention also provides a multi-voice assistant control device, which is applied to electronic equipment integrating a plurality of voice assistants and comprises:
a processor;
a memory for storing the processor-executable instructions;
wherein the processor is configured to:
acquiring external instructions by a plurality of current voice assistants of the electronic equipment;
and the voice assistants judge to perform an interrupt response process or wake up according to the external instruction, wherein the wake-up functions of the voice assistants are all in an enabling state.
The present invention also provides a computer readable storage medium having stored thereon computer program instructions which, when executed by a processor, implement the steps of the method of the first aspect.
Compared with the prior art, the implementation mode of the invention has the main differences and the effects that:
according to the multi-voice assistant control method of the invention, one voice assistant wakes up according to the instruction while the other voice assistants interrupt their response processes according to the same instruction, so that the voice assistants are never in the awakened working state at the same time; mutual interference is avoided and the user experience is not affected.
Furthermore, the preset awakening words can be defined by the user, so that the use method of the voice assistant is enriched, and the single role of the voice assistant is avoided.
Furthermore, the awakened assistant is switched by voice instruction, which solves the technical problem that existing electronic devices require a voice assistant to be enabled manually when switching, an inflexible switching mode.
Furthermore, the user can select one voice assistant from the plurality of voice assistants according to the preference and the acceptance of the user, so that the method can be suitable for different users, and the human-computer interaction is more interesting.
Furthermore, the microphone array collects an externally input instruction, so that the pickup distance is long, the noise suppression is good, and the collection effect is better.
It is to be understood that within the scope of the present invention, the above-described features of the present invention and those specifically described below (e.g., in the examples) may be combined with each other to form new or preferred embodiments. For reasons of space, they will not be described in detail.
Drawings
FIG. 1 is a flowchart illustrating a multi-voice assistant control method according to a first embodiment of the present invention.
FIG. 2 is a flowchart illustrating a multi-voice assistant control method according to a second embodiment of the present invention.
FIG. 3 is a flowchart illustrating a multi-voice assistant control method according to a third embodiment of the present invention.
FIG. 4 is a flowchart illustrating a multi-voice assistant control method according to a fourth embodiment of the present invention.
Fig. 5 is a flowchart illustrating a multi-voice assistant control method according to a fifth embodiment of the present invention.
Fig. 6 is a schematic structural diagram of a multi-voice assistant control apparatus according to a sixth embodiment of the present invention.
Fig. 7 is a schematic structural diagram of a multi-voice assistant control apparatus according to a seventh embodiment of the present invention.
Fig. 8 is a schematic structural diagram of a multi-voice assistant control apparatus according to an eighth embodiment of the present invention.
Fig. 9 is a schematic structural diagram of a multi-voice assistant control apparatus according to a ninth embodiment of the present invention.
Fig. 10 is a schematic structural diagram of a multi-voice assistant control system according to a tenth embodiment of the present invention.
Detailed Description
In the following description, numerous technical details are set forth in order to provide a better understanding of the present application. However, it will be understood by those skilled in the art that the technical solutions claimed in the present application may be implemented without these technical details and with various changes and modifications based on the following embodiments.
In order to make the objects, technical solutions and advantages of the present invention more apparent, embodiments of the present invention will be described in detail with reference to the accompanying drawings.
Interpretation of terms
Response process: as used herein, the "response process" is the audio process by which a voice assistant, after being awakened, answers (responds) according to the user's instruction. "Response process" and "audio process" are used interchangeably.
Listening state: as used herein, the "listening state" (also rendered "monitoring state") is the state in which a voice assistant, before being awakened, waits to be activated by a wake-up word. A voice assistant in the listening state has no response process.
Working state: as used herein, the "working state" is the state in which the audio process of the voice assistant is active.
Re-awakening: as used herein, "re-awakening" means that, while the audio process of the voice assistant is active, the voice assistant is activated again by the wake-up word and the active audio process is interrupted.
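The four terms above can be read as a small state machine for a single voice assistant. The following is only an illustrative sketch: the names `State`, `on_wake` and `on_interrupt` are hypothetical and do not appear in the application.

```python
from enum import Enum


class State(Enum):
    LISTENING = "listening"  # waiting for a wake-up word; no response process
    WORKING = "working"      # the audio (response) process is active


def on_wake(state):
    """Return (new_state, interrupted); interrupted is True on a re-awakening."""
    # Re-awakening: an already active audio process is interrupted first.
    interrupted = state is State.WORKING
    return State.WORKING, interrupted


def on_interrupt(state):
    """Interrupting the response process always ends in the listening state."""
    # A listening assistant has no response process, so it simply stays put.
    return State.LISTENING
```

Under this reading, waking an assistant that is already working is exactly the "re-awakening" case, and interrupting an assistant that is only listening is a no-op.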
The first embodiment of the present invention relates to a multi-voice assistant control method, and fig. 1 is a flowchart of the method.
As shown in the figure, the method in the embodiment is applied to an electronic device integrating a plurality of voice assistants.
In step 100, external instructions are obtained by a plurality of voice assistants of the electronic equipment.
Thereafter, step 101 is entered: and the voice assistants judge to perform an interrupt response process or wake up according to the external instruction, wherein the wake-up functions of the voice assistants are all in an enabling state.
Preferably, step 101 comprises the steps of:
one voice assistant is judged to wake up according to the external instruction, and meanwhile, other voice assistants are judged to perform an interrupt response process according to the external instruction.
In this embodiment, the plurality of voice assistants currently on the electronic device obtain an external instruction, which may be voice information. Each voice assistant determines, according to the external instruction, whether to interrupt its response process or to wake up. For example, one voice assistant that is in the monitoring state (not awakened) wakes up according to the external instruction, while another voice assistant that is in the working state interrupts its response process according to the same instruction. As another example, one voice assistant that is in the working state is awakened again (re-awakened) according to the external instruction, while another voice assistant that is in the monitoring state (not awakened) performs the interrupt response process, that is, it keeps the monitoring state. In this way, multiple voice assistants never work at the same time, and no conflict (mutual interference) arises.
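The two examples in this embodiment can be sketched as a single dispatch step over all assistants. This is a minimal sketch under the assumption that the target assistant has already been identified; the function name `dispatch` and the string states are hypothetical.

```python
def dispatch(states, target):
    """Wake `target` and interrupt every other assistant.

    `states` maps an assistant name to "listening" or "working".
    """
    new_states = {}
    for name in states:
        if name == target:
            new_states[name] = "working"    # wake, or re-wake if already working
        else:
            new_states[name] = "listening"  # interrupt, or simply keep listening
    return new_states


states = {"A": "listening", "B": "working"}
states = dispatch(states, "A")        # A wakes up, B interrupts its response process
after_rewake = dispatch(states, "A")  # A is re-awakened, B keeps listening
```

Because every instruction passes through the same step, at most one assistant is in the working state afterwards.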
A second embodiment of the present invention relates to a multi-voice assistant control method, and fig. 2 is a flowchart of the method.
As shown in the figure, the method in the embodiment is applied to an electronic device integrating a plurality of voice assistants.
In step 200, an externally input instruction is collected by a microphone array.
Thereafter, step 201 is entered: and acquiring external instructions by a plurality of current voice assistants of the electronic equipment.
Thereafter, step 202 is entered: the plurality of voice assistants input the external instruction.
Thereafter step 203 is entered: the voice assistants determine whether the external instruction is consistent with a first wakeup marker preset by each of the voice assistants, and if so, execute step 204: awakens the corresponding voice assistant, otherwise executes step 205: the response process of the corresponding voice assistant is interrupted.
Step 201 can implement the function of step 100, and step 202, step 203, step 204 and step 205 can implement the function of step 101.
In this embodiment, the microphone array may take various forms, such as linear, circular and spherical, for example a 2-microphone array, a 6+1 microphone array or an 8+1 microphone array. A microphone array offers a long pickup distance and good noise suppression, and therefore a better collection effect.
A third embodiment of the present invention relates to a multi-voice assistant control method, and fig. 3 is a flowchart of the method.
As shown in the figure, the method in the embodiment is applied to an electronic device integrating a plurality of voice assistants.
In step 300, an instruction input from the outside is collected by a microphone array.
Thereafter, step 301 is entered: and acquiring external instructions by a plurality of current voice assistants of the electronic equipment.
Thereafter, step 302 is entered: the plurality of voice assistants input the external instruction.
Thereafter, step 303 is entered: determining whether the external instruction includes a keyword associated with any of the plurality of voice assistant identifications, and if so, performing step 304: the voice assistants determine whether the external command is consistent with the respective preset first wake-up markers, otherwise, execute the above step 300.
Step 304: the voice assistants determine whether the external command is consistent with a respective preset first wake-up marker, and if so, execute step 305: awakening the corresponding voice assistant, otherwise executing step 306: the response process of the corresponding voice assistant is interrupted.
Step 300 may implement the function of step 200, step 301 may implement the function of step 201, step 302 may implement the function of step 202, step 304 may implement the function of step 203, step 305 may implement the function of step 204, and step 306 may implement the function of step 205.
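The keyword check of step 303 can be sketched as a gate in front of the wake-up comparison. The sketch below assumes the instruction has already been transcribed to text and uses plain substring matching as a stand-in for real keyword spotting; the table `KEYWORDS`, its entries and the function name are illustrative assumptions.

```python
# Hypothetical table: wake-up keyword -> identification of a voice assistant.
KEYWORDS = {"xiaodu": "baidu", "alexa": "amazon"}


def find_trigger_keyword(instruction):
    """Return the first known keyword contained in the instruction, else None."""
    text = instruction.lower()
    for keyword in KEYWORDS:
        if keyword in text:
            return keyword  # a voice trigger event is present: go on to step 304
    return None             # no trigger event: return to collection (step 300)
```

An instruction without any known keyword never reaches the marker comparison, so no assistant changes state.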
The fourth embodiment of the present invention relates to a multi-voice assistant control method, and fig. 4 is a flowchart of the method.
As shown in the figure, the method in the embodiment is applied to an electronic device integrating a plurality of voice assistants.
In step 400, an externally input instruction is collected by a microphone array.
Thereafter, step 401 is entered: and acquiring external instructions by a plurality of current voice assistants of the electronic equipment.
Thereafter, step 402 is entered: the plurality of voice assistants input the external instruction.
Thereafter, step 403 is entered: determining whether the external instruction includes a keyword associated with an identification of any of the plurality of voice assistants, and if so, performing step 404: and associating the keyword with a preset corresponding second awakening marker, otherwise, executing the step 400.
Step 404 is followed by step 405: the voice assistants determine whether the second wake-up markers are consistent with the respective preset first wake-up markers, and if so, execute step 406: awakening the corresponding voice assistant, otherwise executing step 407: the response process of the corresponding voice assistant is interrupted.
Step 400 may implement the function of step 300, step 401 may implement the function of step 301, step 402 may implement the function of step 302, step 403 may implement the function of step 303, steps 404 and 405 may implement the function of step 304, step 406 may implement the function of step 305, and step 407 may implement the function of step 306.
In this embodiment, each of the plurality of voice assistants is provided with a corresponding wake-up word (keyword), and a preset correspondence exists between each voice assistant and its preset wake-up word (keyword). For example, when the external instruction received by the plurality of voice assistants contains the wake-up word of a certain voice assistant, that voice assistant is awakened and the other voice assistants interrupt their response processes.
The present embodiment corresponds to a scenario as follows:
The microphone array collects an externally input instruction containing "small A". The plurality of voice assistants receive the instruction and determine that it contains a wake-up word (keyword), "small A". The wake-up word is associated with its preset second wake-up marker (the second wake-up marker corresponding to "small A" is 00, that corresponding to "small B" is 01, and that corresponding to "small C" is 02). Each voice assistant then compares the associated second wake-up marker (00) with its own preset first wake-up marker (00 for voice assistant A, 01 for voice assistant B, and 02 for voice assistant C). Voice assistant A determines that the second wake-up marker (00) is consistent with its first wake-up marker (00) and wakes up: it starts working if it was previously in the monitoring state, and is re-awakened if it was previously in the working state. Voice assistant B and voice assistant C determine that the second wake-up marker (00) is inconsistent with their first wake-up markers (01 and 02) and interrupt their response processes: each keeps the monitoring state if it was previously in the monitoring state, and interrupts its response process if it was previously in the working state.
For example, the wake-up word (keyword) of the Baidu voice assistant is "Xiaodu", the wake-up word (keyword) of the Amazon voice assistant is "Alexa", and both voice assistants are integrated in the electronic device. When the user first issues a "Xiaodu" instruction to the electronic device, the Baidu voice assistant is awakened and the Amazon voice assistant keeps the monitoring state; when the user then issues an "Alexa" instruction, the Baidu voice assistant interrupts its response process and the Amazon voice assistant is awakened. Alternatively, when the user first issues a "Xiaodu" instruction, the Baidu voice assistant is awakened and the Amazon voice assistant keeps the monitoring state; when the user then issues a "Xiaodu" instruction again, the Baidu voice assistant is re-awakened and the Amazon voice assistant keeps the monitoring state.
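The marker comparison in the "small A" scenario of this embodiment can be sketched as two lookup tables and one comparison per assistant. The marker values 00, 01 and 02 come from the scenario; the table names, the function `decide` and the substring matching are assumptions made only for illustration.

```python
# Second wake-up markers associated with each wake-up word (values from the scenario).
SECOND_MARKERS = {"small A": "00", "small B": "01", "small C": "02"}

# First wake-up marker preset by each voice assistant.
FIRST_MARKERS = {"A": "00", "B": "01", "C": "02"}


def decide(instruction):
    """Map each assistant to "wake" or "interrupt" for one instruction."""
    keyword = next((k for k in SECOND_MARKERS if k in instruction), None)
    if keyword is None:
        return {}  # no voice trigger event: go back to collecting input
    second = SECOND_MARKERS[keyword]  # step 404: associate keyword with marker
    # Step 405: every assistant compares the second marker with its own first marker.
    return {
        name: "wake" if first == second else "interrupt"
        for name, first in FIRST_MARKERS.items()
    }
```

For an instruction containing "small A", only voice assistant A sees matching markers, so A wakes up while B and C interrupt their response processes.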
It should be noted that the wake-up word (keyword) of a voice assistant may be the name of that voice assistant, or a word or sentence customized by the user according to actual needs. If a user-defined wake-up word has no preset correspondence with any voice assistant, so that it does not designate which voice assistant to wake, the system may directly wake a default voice assistant or prompt the user to choose which voice assistant to wake.
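The fallback for a custom wake-up word without a preset correspondence might look like the sketch below. The names `DEFAULT_ASSISTANT`, `resolve_assistant` and `ask_user` are hypothetical, and the order in which the two fallbacks are tried is an assumption; the text only states that either may be used.

```python
from typing import Callable, Dict, Optional

DEFAULT_ASSISTANT = "A"  # hypothetical default voice assistant


def resolve_assistant(wake_word: str,
                      mapping: Dict[str, str],
                      ask_user: Optional[Callable[[], str]] = None) -> str:
    """Pick the assistant for a wake-up word, falling back when it is unmapped."""
    if wake_word in mapping:
        return mapping[wake_word]  # preset correspondence exists
    if ask_user is not None:
        return ask_user()          # prompt the user to choose an assistant
    return DEFAULT_ASSISTANT       # or wake the default assistant directly
```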
With this method, voice switching among the voice assistants is possible at any time, and the voice assistants are never in the awakened working state at the same time, which avoids mutual interference and its negative effect on user experience. In addition, the preset wake-up words can be customized by the user, which enriches the ways of using the voice assistants and avoids a single, fixed role for each voice assistant. The awakened assistant is switched by voice instruction, which solves the technical problem that existing electronic devices require a voice assistant to be enabled manually when switching, an inflexible switching mode.
A fifth embodiment of the present invention relates to a multi-voice assistant control method, and fig. 5 is a flowchart of the method.
As shown in the figure, the method in the embodiment is applied to an electronic device integrating a plurality of voice assistants.
In step 500, external instructions are obtained by a plurality of voice assistants present in the electronic device.
Thereafter, step 501 is entered: and the voice assistants judge to perform an interrupt response process or wake up according to the external instruction.
Step 502 is then entered: and the voice assistant which judges to be awakened sends the external instruction to the cloud server through the network for processing.
Thereafter, step 503 is entered: and responding the result processed by the cloud server by the voice assistant which is judged to be awakened.
Step 500 may implement the function of step 100, and step 501 may implement the function of step 101.
The present embodiment corresponds to a scenario as follows:
The Baidu voice assistant and the Amazon voice assistant are integrated in the electronic device (the wake-up word (keyword) of the Baidu voice assistant is "Xiaodu", and that of the Amazon voice assistant is "Alexa").
The user first issues the instruction "Xiaodu, Shanghai weather today" to the electronic device. The Baidu voice assistant is awakened (the Amazon voice assistant keeps monitoring) and uploads the instruction to the cloud server through the network. The cloud server processes the instruction and sends the result (which may be a voice packet) back to the Baidu voice assistant, which responds with the processed result (it says "the Shanghai weather is cloudy, 25 degrees"). The user then issues the instruction "Alexa, what's the weather of ShangHai today" to the electronic device. The Amazon voice assistant is awakened (the Baidu voice assistant interrupts its previous audio/response process) and uploads the instruction to the cloud server through the network. The cloud server processes the instruction and sends the result (which may be a voice packet) back to the Amazon voice assistant, which responds with the processed result (it says "the weather of ShangHai is cloudy").
With this method, the user can choose one of the plurality of voice assistants for question answering (QA) according to personal preference and acceptance, so the method suits different users.
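Steps 502 and 503 can be sketched as a round trip made only by the awakened assistant. The cloud server is replaced here by a local stand-in function; `respond`, `fake_cloud` and the "wake"/"interrupt" labels are hypothetical names, and no real cloud API is implied.

```python
from typing import Callable, Dict


def respond(decisions: Dict[str, str], instruction: str,
            cloud: Callable[[str, str], str]) -> Dict[str, str]:
    """Return responses keyed by assistant; only an awakened one calls the cloud."""
    responses = {}
    for name, decision in decisions.items():
        if decision == "wake":
            # Step 502: upload the instruction; step 503: answer with the result.
            responses[name] = cloud(name, instruction)
    return responses


def fake_cloud(assistant: str, instruction: str) -> str:
    """Stand-in for a real cloud service (purely illustrative)."""
    return f"{assistant}: processed '{instruction}'"


out = respond({"baidu": "wake", "amazon": "interrupt"}, "weather", fake_cloud)
```

An assistant that was told to interrupt never contacts the server, so only one response reaches the user.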
A sixth embodiment of the present invention relates to a multi-voice assistant control apparatus, and fig. 6 is a schematic structural view of the apparatus.
As shown in the drawing, the apparatus in this embodiment is applied to an electronic device integrating a plurality of voice assistants.
The device comprises:
an obtaining module 600, configured to obtain an external instruction through a plurality of current voice assistants of the electronic device;
a determining module 601, configured to determine, by the voice assistants according to the external instruction, to perform an interrupt response process or wake up, where wake-up functions of the voice assistants are all in an enabled state.
The first embodiment is the method embodiment corresponding to the present embodiment, and the present embodiment can be implemented in cooperation with the first embodiment. The related technical details mentioned in the first embodiment remain valid in this embodiment and are not repeated here. Accordingly, the related technical details mentioned in the present embodiment can also be applied to the first embodiment.
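The two modules of this embodiment could be composed as in the sketch below. The class and method names are hypothetical, and matching by a per-assistant wake-up word is an assumption borrowed from the method embodiments rather than something this embodiment prescribes.

```python
class ObtainingModule:
    """Obtaining module: passes one external instruction to every assistant."""

    def __init__(self, assistants):
        self.assistants = list(assistants)

    def obtain(self, instruction):
        return {name: instruction for name in self.assistants}


class DeterminingModule:
    """Determining module: each assistant decides to wake up or to interrupt."""

    def __init__(self, wake_words):
        self.wake_words = wake_words  # assistant name -> wake-up word (assumed)

    def determine(self, received):
        return {
            name: "wake" if self.wake_words[name] in text else "interrupt"
            for name, text in received.items()
        }


wake_words = {"A": "small A", "B": "small B"}
obtaining = ObtainingModule(wake_words)
determining = DeterminingModule(wake_words)
decisions = determining.determine(obtaining.obtain("small B, hello"))
```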
The seventh embodiment of the present invention relates to a multi-voice assistant control apparatus, and fig. 7 is a schematic structural view of the apparatus.
As shown in the drawing, the apparatus in this embodiment is applied to an electronic device integrating a plurality of voice assistants. This embodiment improves on the sixth embodiment as follows:
in this embodiment, the apparatus further comprises:
a microphone array acquisition module, which offers a long pickup distance, good noise suppression, and a better acquisition effect;
the determining module includes: an input sub-module, configured for the external instruction to be input to the plurality of voice assistants; a wake-up judging sub-module, configured so that the plurality of voice assistants judge whether the external instruction is consistent with the first wake-up marker preset in each of them; a wake-up sub-module, configured to wake up the corresponding voice assistant when the external instruction is consistent with the first wake-up marker; and an interrupt sub-module, configured to interrupt the response process of the corresponding voice assistant when the external instruction is inconsistent with the first wake-up marker.
Specifically:
the apparatus comprises:
a microphone array acquisition module 700, configured to acquire an externally input instruction through a microphone array;
an obtaining module 701, configured to obtain the external instruction through the plurality of voice assistants currently on the electronic device;
a determining module 702, configured so that the plurality of voice assistants determine, according to the external instruction, whether to interrupt a response process or to wake up, where the wake-up functions of the plurality of voice assistants are all in an enabled state.
The determining module 702 includes:
an input sub-module 703, configured for the external instruction to be input to the plurality of voice assistants;
a wake-up judging sub-module 704, configured so that the plurality of voice assistants judge whether the external instruction is consistent with the first wake-up marker preset in each of them;
a wake-up sub-module 705, configured to wake up the corresponding voice assistant when the external instruction is consistent with the first wake-up marker;
and an interrupt sub-module 706, configured to interrupt the response process of the corresponding voice assistant when the external instruction is inconsistent with the first wake-up marker.
The second embodiment is the method embodiment corresponding to this embodiment, and the two can be implemented in cooperation with each other. The related technical details mentioned in the second embodiment remain valid in this embodiment and are not repeated here. Correspondingly, the related technical details mentioned in this embodiment can also be applied to the second embodiment.
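As a rough illustration of how the four sub-modules of the determining module could cooperate, the following Python sketch models each sub-module as one function. All names (`VoiceAssistant`, `determining_module`, the wake word `nova`) are assumptions made for the example, and "consistent with the first wake-up marker" is simplified to "the instruction begins with the marker"; the patent does not prescribe a matching rule.

```python
from dataclasses import dataclass


@dataclass
class VoiceAssistant:
    """Hypothetical model of one assistant and its preset first wake-up marker."""
    name: str
    first_wake_marker: str
    awake: bool = False
    responding: bool = False


def input_submodule(assistants, instruction):
    """Input sub-module: hand the same instruction to every assistant."""
    return [(assistant, instruction) for assistant in assistants]


def wake_up_judging_submodule(assistant, instruction):
    """Wake-up judging sub-module (assumed rule: prefix match, case-insensitive)."""
    marker = assistant.first_wake_marker.lower()
    return instruction.strip().lower().startswith(marker)


def wake_up_submodule(assistant):
    """Wake-up sub-module: the addressed assistant starts a response."""
    assistant.awake = True
    assistant.responding = True


def interrupt_submodule(assistant):
    """Interrupt sub-module: a non-addressed assistant stops responding."""
    assistant.awake = False
    assistant.responding = False


def determining_module(assistants, instruction):
    """Run the sub-modules in order; return the names of awakened assistants."""
    for assistant, text in input_submodule(assistants, instruction):
        if wake_up_judging_submodule(assistant, text):
            wake_up_submodule(assistant)
        else:
            interrupt_submodule(assistant)
    return [a.name for a in assistants if a.awake]


if __name__ == "__main__":
    devices = [VoiceAssistant("assistant_a", "nova", awake=True, responding=True),
               VoiceAssistant("assistant_b", "alexa")]
    print(determining_module(devices, "Alexa, play music"))
```

Because every assistant receives the instruction and evaluates it against its own marker, no central arbiter is needed in this sketch; each assistant reaches its own wake-or-interrupt decision.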
An eighth embodiment of the present invention relates to a multi-voice assistant control apparatus, and fig. 8 is a schematic structural view of the apparatus.
As shown in the drawing, the apparatus in this embodiment is applied to an electronic device integrating a plurality of voice assistants. This embodiment improves on the seventh embodiment as follows:
in this embodiment, the determining module further includes a trigger event judging sub-module, configured to judge whether the external instruction contains a voice trigger event and thereby determine whether the wake-up judging sub-module is triggered to operate.
Specifically:
the apparatus comprises:
a microphone array acquisition module 800, configured to acquire an externally input instruction through a microphone array;
an obtaining module 801, configured to obtain the external instruction through the plurality of voice assistants currently on the electronic device;
a determining module 802, configured so that the plurality of voice assistants determine, according to the external instruction, whether to interrupt a response process or to wake up, where the wake-up functions of the plurality of voice assistants are all in an enabled state.
The determining module 802 includes:
an input sub-module 803, configured for the external instruction to be input to the plurality of voice assistants;
a trigger event judging sub-module 804, configured to judge whether the external instruction contains a voice trigger event;
a wake-up judging sub-module 807, configured so that, when the external instruction contains the voice trigger event, the plurality of voice assistants judge whether the external instruction is consistent with the first wake-up marker preset in each of them;
a wake-up sub-module 808, configured to wake up the corresponding voice assistant when the external instruction is consistent with the first wake-up marker;
and an interrupt sub-module 809, configured to interrupt the response process of the corresponding voice assistant when the external instruction is inconsistent with the first wake-up marker.
The trigger event judging sub-module 804 includes:
a keyword judging sub-module 805, configured to judge whether the external instruction includes a keyword associated with the identifier of any of the plurality of voice assistants;
a trigger event confirmation sub-module 806, configured to determine that the external instruction contains the voice trigger event when the external instruction includes a keyword associated with the identifier of any of the plurality of voice assistants.
The third embodiment is the method embodiment corresponding to this embodiment, and the two can be implemented in cooperation with each other. The related technical details mentioned in the third embodiment remain valid in this embodiment and are not repeated here. Correspondingly, the related technical details mentioned in this embodiment can also be applied to the third embodiment.
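The gating role of the trigger event judging sub-module can be illustrated with a short Python sketch. It rests on assumptions that are not in the source: the marker table `FIRST_WAKE_MARKERS`, the wake word `nova`, single-word keywords, and a word-level match. One plausible reading of the embodiment is that speech containing no keyword leaves every assistant untouched, so ordinary conversation does not interrupt an ongoing response.

```python
import re

# Assumed preset first wake-up markers, one per assistant (illustrative values).
FIRST_WAKE_MARKERS = {"assistant_a": "nova", "assistant_b": "alexa"}


def words(instruction):
    """Split an instruction into lower-case words."""
    return re.findall(r"[a-z']+", instruction.lower())


def keyword_in_instruction(instruction, markers=FIRST_WAKE_MARKERS):
    """Keyword judging plus trigger event confirmation.

    Returns the matched keyword, or None when no voice trigger event is present.
    """
    spoken = words(instruction)
    for keyword in markers.values():
        if keyword in spoken:
            return keyword
    return None


def determine(instruction, markers=FIRST_WAKE_MARKERS):
    """Run the wake-up judgment only when a voice trigger event was found."""
    keyword = keyword_in_instruction(instruction, markers)
    if keyword is None:
        return None  # wake-up judging sub-module is not triggered
    return {name: "wake" if marker == keyword else "interrupt"
            for name, marker in markers.items()}


if __name__ == "__main__":
    print(determine("turn the volume down"))
    print(determine("Alexa, what's the weather"))
```

Returning `None` rather than a dictionary of `"interrupt"` decisions is a design choice of this sketch; it keeps "no trigger event" distinguishable from "trigger event addressed to another assistant".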
The ninth embodiment of the present invention relates to a multi-voice assistant control apparatus, and fig. 9 is a schematic structural view of the apparatus.
As shown in the drawing, the apparatus in this embodiment is applied to an electronic device integrating a plurality of voice assistants. This embodiment improves on the eighth embodiment as follows:
in this embodiment, the wake-up judging sub-module includes:
an association sub-module, configured to associate the keyword with a preset corresponding second wake-up marker when it is determined that the external instruction contains the voice trigger event;
and a marker judging sub-module, configured so that the plurality of voice assistants judge whether the second wake-up marker is consistent with their respective preset first wake-up markers.
Specifically:
the apparatus comprises:
a microphone array acquisition module 900, configured to acquire an externally input instruction through a microphone array;
an obtaining module 901, configured to obtain the external instruction through the plurality of voice assistants currently on the electronic device;
a determining module 902, configured so that the plurality of voice assistants determine, according to the external instruction, whether to interrupt a response process or to wake up, where the wake-up functions of the plurality of voice assistants are all in an enabled state.
The determining module 902 includes:
an input sub-module 903, configured for the external instruction to be input to the plurality of voice assistants;
a trigger event judging sub-module 904, configured to judge whether the external instruction contains a voice trigger event;
a wake-up judging sub-module 907, configured so that, when the external instruction contains the voice trigger event, the plurality of voice assistants judge whether the external instruction is consistent with the first wake-up marker preset in each of them;
a wake-up sub-module 910, configured to wake up the corresponding voice assistant when the external instruction is consistent with the first wake-up marker;
and an interrupt sub-module 911, configured to interrupt the response process of the corresponding voice assistant when the external instruction is inconsistent with the first wake-up marker.
The trigger event judging sub-module 904 includes:
a keyword judging sub-module 905, configured to judge whether the external instruction includes a keyword associated with the identifier of any of the plurality of voice assistants;
a trigger event confirmation sub-module 906, configured to determine that the external instruction contains the voice trigger event when the external instruction includes a keyword associated with the identifier of any of the plurality of voice assistants.
The wake-up judging sub-module 907 includes:
an association sub-module 908, configured to associate the keyword with a preset corresponding second wake-up marker when it is determined that the external instruction contains the voice trigger event;
a marker judging sub-module 909, configured so that the plurality of voice assistants judge whether the second wake-up marker is consistent with their respective preset first wake-up markers.
The fourth embodiment is the method embodiment corresponding to this embodiment, and the two can be implemented in cooperation with each other. The related technical details mentioned in the fourth embodiment remain valid in this embodiment and are not repeated here. Correspondingly, the related technical details mentioned in this embodiment can also be applied to the fourth embodiment.
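The keyword-to-marker indirection of this embodiment can be sketched as two table lookups. The tables below are purely illustrative assumptions (the marker names `MARKER_A` and `MARKER_B`, the wake word `nova`, and the alias `amazon` do not come from the source); the point shown is only that several keywords may map to the same second wake-up marker, which each assistant then compares with its own first wake-up marker.

```python
# Assumed association table: spoken keyword -> second wake-up marker.
KEYWORD_TO_SECOND_MARKER = {
    "nova": "MARKER_A",
    "alexa": "MARKER_B",
    "amazon": "MARKER_B",   # hypothetical alias mapped to the same marker
}

# Assumed preset first wake-up marker of each assistant.
FIRST_WAKE_MARKERS = {"assistant_a": "MARKER_A", "assistant_b": "MARKER_B"}


def association_submodule(keyword):
    """Association sub-module: look up the second wake-up marker for a keyword.

    Raises KeyError for an unknown keyword; in the described flow this step runs
    only after a voice trigger event (a known keyword) has been confirmed.
    """
    return KEYWORD_TO_SECOND_MARKER[keyword.lower()]


def marker_judging_submodule(second_marker, first_markers=FIRST_WAKE_MARKERS):
    """Marker judging sub-module: compare the second marker with each first marker."""
    return {name: first == second_marker for name, first in first_markers.items()}


def decide(keyword):
    """Map the comparison result to a wake or interrupt decision per assistant."""
    second_marker = association_submodule(keyword)
    consistent = marker_judging_submodule(second_marker)
    return {name: "wake" if ok else "interrupt" for name, ok in consistent.items()}


if __name__ == "__main__":
    print(decide("Alexa"))
    print(decide("amazon"))
```

Comparing markers instead of raw text means a keyword table can be extended without changing how each assistant makes its decision.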
The tenth embodiment of the present invention relates to a multi-voice assistant control system, and fig. 10 is a schematic structural view of the system.
As shown in the figure, the system in this embodiment includes: a cloud server 1000 and a multi-voice assistant control apparatus.
The multi-voice assistant control apparatus of this embodiment improves on the multi-voice assistant control apparatus of the sixth embodiment, mainly as follows:
the apparatus further includes:
a transceiver module, configured to send the external instruction to the cloud server and/or receive the result processed by the cloud server, through the voice assistant that has been determined to wake up;
and a response module, configured so that the voice assistant that has been determined to wake up responds with the result that was processed by the cloud server and received by the transceiver module.
Specifically:
in this embodiment, the apparatus includes:
an obtaining module 1001, configured to obtain an external instruction through the plurality of voice assistants currently on the electronic device;
a determining module 1002, configured so that the plurality of voice assistants determine, according to the external instruction, whether to interrupt a response process or to wake up, where the wake-up functions of the plurality of voice assistants are all in an enabled state;
a transceiver module 1003, configured to send the external instruction to the cloud server 1000 and/or receive the result processed by the cloud server 1000, through the voice assistant that has been determined to wake up;
a response module 1004, configured so that the voice assistant that has been determined to wake up responds with the result that was processed by the cloud server 1000 and received by the transceiver module 1003.
The fifth embodiment is the method embodiment corresponding to this embodiment, and the two can be implemented in cooperation with each other. The related technical details mentioned in the fifth embodiment remain valid in this embodiment and are not repeated here. Correspondingly, the related technical details mentioned in this embodiment can also be applied to the fifth embodiment.
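The division of labour between the transceiver module and the response module can be sketched as follows. The cloud server is replaced by a local stub so that the example runs offline; `StubCloudServer`, `TransceiverModule`, `ResponseModule`, and `handle` are hypothetical names, and the canned result string is invented for the demonstration.

```python
class StubCloudServer:
    """Stands in for the cloud server; a real system would be reached over a network."""

    def process(self, instruction):
        # Return a canned text result instead of a real voice packet.
        return f"result for: {instruction}"


class TransceiverModule:
    """Sends the instruction on behalf of the awakened assistant and returns the result."""

    def __init__(self, server):
        self.server = server

    def send_and_receive(self, assistant_name, instruction):
        # assistant_name identifies the sender; this stub server ignores it.
        return self.server.process(instruction)


class ResponseModule:
    """Lets the awakened assistant present the processed result."""

    def respond(self, assistant_name, result):
        return f"[{assistant_name}] {result}"


def handle(awake_assistant, instruction, transceiver, responder):
    """Full round trip: upload the instruction, then respond with the result."""
    result = transceiver.send_and_receive(awake_assistant, instruction)
    return responder.respond(awake_assistant, result)


if __name__ == "__main__":
    transceiver = TransceiverModule(StubCloudServer())
    print(handle("assistant_b", "weather in Shanghai", transceiver, ResponseModule()))
```

Only the assistant that has been determined to wake up takes part in this round trip; assistants that interrupted their response process send nothing.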
An embodiment of the present invention further relates to a multi-voice assistant control apparatus applied to an electronic device integrating a plurality of voice assistants, including:
a processor;
a memory for storing instructions executable by the processor;
wherein the processor is configured to:
obtain an external instruction through the plurality of voice assistants currently on the electronic device;
and have the plurality of voice assistants determine, according to the external instruction, whether to interrupt a response process or to wake up, where the wake-up functions of the plurality of voice assistants are all in an enabled state.
An embodiment of the invention also relates to a computer-readable storage medium, on which computer program instructions are stored, which program instructions, when executed by a processor, implement the steps of the method described in the first embodiment.
The method embodiments of the present invention may be implemented in software, hardware, firmware, and so on. Whether the present invention is implemented as software, hardware, or firmware, the instruction code may be stored in any type of computer-accessible memory (e.g., permanent or modifiable, volatile or non-volatile, solid-state or non-solid-state, fixed or removable media, etc.). The memory may be, for example, Programmable Array Logic (PAL), Random Access Memory (RAM), Programmable Read-Only Memory (PROM), Read-Only Memory (ROM), Electrically Erasable Programmable Read-Only Memory (EEPROM), a magnetic disk, an optical disk, a Digital Versatile Disc (DVD), or the like.
It should be noted that each module in the apparatus embodiments of the present invention is a logic module. Physically, a logic module may be one physical module, a part of one physical module, or a combination of multiple physical modules. The physical implementation of these logic modules is not the most important point; the combination of functions they implement is the key to solving the technical problem addressed by the present invention. Furthermore, in order to highlight the innovative part of the present invention, the above apparatus embodiments do not introduce modules that are less closely related to solving that technical problem, which does not mean that no other modules exist in those embodiments.
It should be noted that, in the specification of this patent, relational terms such as first and second are used solely to distinguish one entity or action from another, without necessarily requiring or implying any actual relationship or order between such entities or actions. Also, the terms "comprises," "comprising," or any other variation thereof are intended to cover a non-exclusive inclusion, such that a process, method, article, or apparatus that comprises a list of elements includes not only those elements but may also include other elements not expressly listed or inherent to such process, method, article, or apparatus. Without further limitation, an element introduced by the phrase "comprises a" does not exclude the presence of additional identical elements in the process, method, article, or apparatus that comprises the element.
While the invention has been shown and described with reference to certain preferred embodiments thereof, it will be understood by those skilled in the art that various changes in form and details may be made therein without departing from the spirit and scope of the invention.

Claims (17)

  1. A multi-voice assistant control method applied to an electronic device integrating a plurality of voice assistants, the method comprising the steps of:
    obtaining an external instruction through the plurality of voice assistants currently on the electronic device;
    determining, by the plurality of voice assistants according to the external instruction, whether to interrupt a response process or to wake up, wherein the wake-up functions of the plurality of voice assistants are all in an enabled state.
  2. The method according to claim 1, wherein the step in which the voice assistants determine, according to the external instruction, whether to interrupt a response process or to wake up comprises the steps of:
    inputting the external instruction to the plurality of voice assistants;
    judging, by the plurality of voice assistants, whether the external instruction is consistent with the first wake-up marker preset in each voice assistant;
    if the external instruction is consistent with the first wake-up marker, waking up the corresponding voice assistant, and otherwise interrupting the response process of the corresponding voice assistant.
  3. The method according to claim 2, wherein the step in which the voice assistants judge whether the external instruction is consistent with their respective preset first wake-up markers comprises the steps of:
    judging whether the external instruction contains a voice trigger event;
    if the external instruction contains the voice trigger event, executing the step in which the plurality of voice assistants judge whether the external instruction is consistent with their respective preset first wake-up markers.
  4. The method according to claim 3, wherein the step of judging whether the external instruction contains a voice trigger event comprises the steps of:
    judging whether the external instruction includes a keyword associated with an identifier of any of the plurality of voice assistants;
    determining that the external instruction contains the voice trigger event if a keyword associated with an identifier of any of the plurality of voice assistants is included in the external instruction.
  5. The method according to claim 4, wherein the step in which the voice assistants judge whether the external instruction is consistent with their respective preset first wake-up markers comprises the steps of:
    associating the keyword with a preset corresponding second wake-up marker when it is determined that the external instruction contains the voice trigger event;
    judging, by the plurality of voice assistants, whether the second wake-up marker is consistent with the first wake-up marker preset in each voice assistant;
    if the second wake-up marker is consistent with the respective preset first wake-up marker, executing the step of waking up the corresponding voice assistant, and otherwise executing the step of interrupting the response process of the corresponding voice assistant.
  6. The method according to claim 1, wherein the step in which the voice assistants determine, according to the external instruction, whether to interrupt a response process or to wake up comprises the step of:
    one voice assistant determining to wake up according to the external instruction while the other voice assistants determine, according to the external instruction, to interrupt their response processes.
  7. The method according to claim 1, wherein the step of obtaining an external instruction through the plurality of voice assistants of the electronic device comprises the step of:
    acquiring an externally input instruction through a microphone array.
  8. The method according to claim 1, wherein the step in which the voice assistants determine, according to the external instruction, whether to interrupt a response process or to wake up comprises the steps of:
    sending, by the voice assistant that has been determined to wake up, the external instruction to a cloud server through a network for processing;
    responding, by the voice assistant that has been determined to wake up, with the result processed by the cloud server.
  9. A multi-voice assistant control apparatus applied to an electronic device integrating a plurality of voice assistants, the apparatus comprising:
    an obtaining module, configured to obtain an external instruction through the plurality of voice assistants currently on the electronic device;
    a determining module, configured so that the plurality of voice assistants determine, according to the external instruction, whether to interrupt a response process or to wake up, wherein the wake-up functions of the plurality of voice assistants are all in an enabled state.
  10. The apparatus of claim 9, wherein the determining module comprises:
    an input sub-module, configured for the external instruction to be input to the plurality of voice assistants;
    a wake-up judging sub-module, configured to judge whether the external instruction is consistent with the first wake-up marker preset in each voice assistant;
    a wake-up sub-module, configured to wake up the corresponding voice assistant when the external instruction is consistent with the first wake-up marker;
    and an interrupt sub-module, configured to interrupt the response process of the corresponding voice assistant when the external instruction is inconsistent with the first wake-up marker.
  11. The apparatus of claim 10, wherein the determining module comprises:
    a trigger event judging sub-module, configured to judge whether the external instruction contains a voice trigger event,
    and the wake-up judging sub-module is configured so that, when the external instruction contains the voice trigger event, the plurality of voice assistants judge whether the external instruction is consistent with their respective preset first wake-up markers.
  12. The apparatus of claim 11, wherein the trigger event judging sub-module comprises:
    a keyword judging sub-module, configured to judge whether the external instruction includes a keyword associated with an identifier of any of the plurality of voice assistants;
    and a trigger event confirmation sub-module, configured to determine that the external instruction contains the voice trigger event when the external instruction includes a keyword associated with an identifier of any of the plurality of voice assistants.
  13. The apparatus of claim 12, wherein the wake-up judging sub-module comprises:
    an association sub-module, configured to associate the keyword with a preset corresponding second wake-up marker when it is determined that the external instruction contains the voice trigger event;
    a marker judging sub-module, configured so that the plurality of voice assistants judge whether the second wake-up marker is consistent with the first wake-up marker preset in each of them;
    the determining module includes:
    a wake-up sub-module, configured to wake up the corresponding voice assistant when the second wake-up marker is consistent with the respective preset first wake-up marker;
    and an interrupt sub-module, configured to interrupt the response process of the corresponding voice assistant when the second wake-up marker is inconsistent with the respective preset first wake-up marker.
  14. The apparatus of claim 9, wherein the apparatus comprises:
    a microphone array acquisition module, configured to acquire an externally input instruction through a microphone array.
  15. A multi-voice assistant control system, the system comprising:
    a cloud server and the multi-voice assistant control apparatus of claim 9;
    the multi-voice assistant control apparatus includes:
    a transceiver module, configured to send the external instruction to the cloud server and/or receive the result processed by the cloud server, through the voice assistant that has been determined to wake up;
    and a response module, configured so that the voice assistant that has been determined to wake up responds with the result that was processed by the cloud server and received by the transceiver module.
  16. A multi-voice assistant control apparatus applied to an electronic device integrating a plurality of voice assistants, comprising:
    a processor;
    a memory for storing instructions executable by the processor;
    wherein the processor is configured to:
    obtain an external instruction through the plurality of voice assistants currently on the electronic device;
    and have the plurality of voice assistants determine, according to the external instruction, whether to interrupt a response process or to wake up, wherein the wake-up functions of the plurality of voice assistants are all in an enabled state.
  17. A computer-readable storage medium on which computer program instructions are stored, wherein the program instructions, when executed by a processor, carry out the steps of the method according to claim 1.
CN201880091294.XA 2018-05-31 2018-05-31 Multi-voice assistant control method, device, system and computer readable storage medium Pending CN111954868A (en)

Applications Claiming Priority (1)

Application Number Priority Date Filing Date Title
PCT/CN2018/089166 WO2019227370A1 (en) 2018-05-31 2018-05-31 Method, apparatus and system for controlling multiple voice assistants, and computer-readable storage medium

Publications (1)

Publication Number Publication Date
CN111954868A (en) 2020-11-17

Family

ID=68696754

Family Applications (1)

Application Number Title Priority Date Filing Date
CN201880091294.XA Pending CN111954868A (en) 2018-05-31 2018-05-31 Multi-voice assistant control method, device, system and computer readable storage medium

Country Status (2)

Country Link
CN (1) CN111954868A (en)
WO (1) WO2019227370A1 (en)

Cited By (1)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN112817556A (en) * 2021-02-04 2021-05-18 青岛海信传媒网络技术有限公司 Switching method of voice scheme on display equipment, display equipment and control device

Families Citing this family (3)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN111462747B (en) * 2020-04-22 2024-04-05 战炜 Hearing assistance device and setting method thereof
US11700139B2 (en) 2020-11-13 2023-07-11 Haier Us Appliance Solutions, Inc. Virtual microphone input for multiple voice assistants
CN114697713B (en) * 2020-12-29 2024-02-06 深圳Tcl新技术有限公司 Voice assistant control method and device, storage medium and intelligent television

Citations (5)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN104345877A (en) * 2013-08-08 2015-02-11 联想(北京)有限公司 Information processing method and electronic equipment
US20160104480A1 (en) * 2014-10-09 2016-04-14 Google Inc. Hotword detection on multiple devices
CN106168895A (en) * 2016-07-07 2016-11-30 北京行云时空科技有限公司 Sound control method and intelligent terminal for intelligent terminal
CN107393548A (en) * 2017-07-05 2017-11-24 青岛海信电器股份有限公司 The processing method and processing device of the voice messaging of multiple voice assistant equipment collections
CN107919123A (en) * 2017-12-07 2018-04-17 北京小米移动软件有限公司 More voice assistant control method, device and computer-readable recording medium

Family Cites Families (2)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN105975063B (en) * 2016-04-27 2019-06-14 吴波 A kind of method and apparatus controlling intelligent terminal
CN106453859B (en) * 2016-09-23 2019-11-15 维沃移动通信有限公司 A kind of sound control method and mobile terminal

Also Published As

Publication number Publication date
WO2019227370A1 (en) 2019-12-05

Legal Events

Date Code Title Description
PB01 Publication
SE01 Entry into force of request for substantive examination