WO2020224346A1 - Control device and operation method therefor, and speech interaction device and operation method therefor - Google Patents

Control device and operation method therefor, and speech interaction device and operation method therefor Download PDF

Info

Publication number
WO2020224346A1
WO2020224346A1 PCT/CN2020/081165 CN2020081165W WO2020224346A1 WO 2020224346 A1 WO2020224346 A1 WO 2020224346A1 CN 2020081165 W CN2020081165 W CN 2020081165W WO 2020224346 A1 WO2020224346 A1 WO 2020224346A1
Authority
WO
WIPO (PCT)
Prior art keywords
voice
voice interaction
information
interaction device
user
Prior art date
Application number
PCT/CN2020/081165
Other languages
French (fr)
Chinese (zh)
Inventor
汤跃忠
Original Assignee
北京京东尚科信息技术有限公司
北京京东世纪贸易有限公司
Priority date (The priority date is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the date listed.)
Filing date
Publication date
Application filed by 北京京东尚科信息技术有限公司, 北京京东世纪贸易有限公司 filed Critical 北京京东尚科信息技术有限公司
Publication of WO2020224346A1 publication Critical patent/WO2020224346A1/en

Links

Images

Classifications

    • GPHYSICS
    • G10MUSICAL INSTRUMENTS; ACOUSTICS
    • G10LSPEECH ANALYSIS TECHNIQUES OR SPEECH SYNTHESIS; SPEECH RECOGNITION; SPEECH OR VOICE PROCESSING TECHNIQUES; SPEECH OR AUDIO CODING OR DECODING
    • G10L15/00Speech recognition
    • G10L15/22Procedures used during a speech recognition process, e.g. man-machine dialogue
    • GPHYSICS
    • G10MUSICAL INSTRUMENTS; ACOUSTICS
    • G10LSPEECH ANALYSIS TECHNIQUES OR SPEECH SYNTHESIS; SPEECH RECOGNITION; SPEECH OR VOICE PROCESSING TECHNIQUES; SPEECH OR AUDIO CODING OR DECODING
    • G10L15/00Speech recognition
    • G10L15/26Speech to text systems
    • GPHYSICS
    • G10MUSICAL INSTRUMENTS; ACOUSTICS
    • G10LSPEECH ANALYSIS TECHNIQUES OR SPEECH SYNTHESIS; SPEECH RECOGNITION; SPEECH OR VOICE PROCESSING TECHNIQUES; SPEECH OR AUDIO CODING OR DECODING
    • G10L17/00Speaker identification or verification techniques
    • G10L17/22Interactive procedures; Man-machine interfaces
    • HELECTRICITY
    • H04ELECTRIC COMMUNICATION TECHNIQUE
    • H04LTRANSMISSION OF DIGITAL INFORMATION, e.g. TELEGRAPHIC COMMUNICATION
    • H04L12/00Data switching networks
    • H04L12/28Data switching networks characterised by path configuration, e.g. LAN [Local Area Networks] or WAN [Wide Area Networks]
    • GPHYSICS
    • G10MUSICAL INSTRUMENTS; ACOUSTICS
    • G10LSPEECH ANALYSIS TECHNIQUES OR SPEECH SYNTHESIS; SPEECH RECOGNITION; SPEECH OR VOICE PROCESSING TECHNIQUES; SPEECH OR AUDIO CODING OR DECODING
    • G10L15/00Speech recognition
    • G10L15/22Procedures used during a speech recognition process, e.g. man-machine dialogue
    • G10L2015/223Execution procedure of a spoken command
    • GPHYSICS
    • G10MUSICAL INSTRUMENTS; ACOUSTICS
    • G10LSPEECH ANALYSIS TECHNIQUES OR SPEECH SYNTHESIS; SPEECH RECOGNITION; SPEECH OR VOICE PROCESSING TECHNIQUES; SPEECH OR AUDIO CODING OR DECODING
    • G10L15/00Speech recognition
    • G10L15/22Procedures used during a speech recognition process, e.g. man-machine dialogue
    • G10L2015/226Procedures used during a speech recognition process, e.g. man-machine dialogue using non-speech characteristics
    • G10L2015/227Procedures used during a speech recognition process, e.g. man-machine dialogue using non-speech characteristics of the speaker; Human-factor methodology

Definitions

  • the present disclosure relates to the field of Internet technology, and more specifically, to a control device and an operation method thereof, and a voice interaction device and an operation method thereof.
  • the inventor found that there are at least the following problems in the prior art: With the popularization of intelligence and IoT technologies, modern homes are often equipped with multiple voice interaction devices that can interact with users. And in order to realize the so-called management of multiple voice interaction devices, the multiple voice interaction devices often originate from the same supplier, so the multiple voice interaction devices have the same wake-up language, then when the multiple voice interaction devices are closer At this time, the multiple voice interaction devices are often awakened in response to the wake-up words sent by the user at the same time, and when the user performs voice interaction with the voice interaction device, the multiple voice interaction devices often respond at the same time, which undoubtedly makes The interactive scene is noisy and chaotic, which reduces the user experience.
  • the present disclosure provides a control device and an operation method thereof, and a voice interaction device and an operation method thereof that can improve the interactive experience.
  • a first aspect of the present disclosure provides an operating method of a control device, the method includes: receiving multiple parameter information sent by multiple voice interaction devices, and the multiple parameter information is collected by the multiple voice interaction devices. It is sent when the user's first voice input at the same time; according to the first voice input, the user's demand information is determined; according to the multiple parameter information and the demand information, the multiple voices are determined.
  • the first voice interaction device in the interaction device is a device that interacts with the user; and sends a wake-up instruction to the first voice interaction device, and sends a wake-up instruction to the plurality of voice interaction devices except for the first voice interaction device
  • the other voice interaction device in the sends a non-wake-up instruction, wherein the first voice input includes a predetermined voice input and a voice input that characterizes the needs of the user.
  • the above-mentioned parameter information includes performance parameters of the voice interaction device
  • the determining the first voice interaction device according to the multiple parameter information and the demand information includes: The matching relationship between the performance parameters of each voice device in the device and the demand information determines the first voice interaction device.
  • the above-mentioned parameter information further includes location information of the user, and the first step is determined based on the matching relationship between the performance parameter of each voice device in the plurality of voice interaction devices and the demand information.
  • a voice interaction device includes: at least one second voice interaction device that determines that the performance parameters of the multiple voice interaction devices match the user's demand information; and according to the parameter information sent by the at least one second voice interaction device It is determined that one of the at least one second voice interaction device is the first voice interaction device, wherein the location information of the user characterizes the location of the user relative to the voice interaction device.
  • the aforementioned parameter information includes operation information of a voice interaction device
  • the non-wake-up instruction includes a sleep instruction and a hibernation instruction
  • the sending a non-wake-up instruction to the other voice interaction device includes: according to the other voice The operation information of the interaction device, determining whether the other voice interaction device performs the first operation when collecting the first voice input; and sending the sleep instruction to the other voice interaction device performing the first operation,
  • the other voice interaction device of the first operation sends the sleep instruction, wherein the voice interaction device is in a sleep state in response to the sleep instruction, and the sleep state includes performing the first operation and responding to the collected user A state in which voice input does not respond; the voice interaction device is in a dormant state in response to the dormant instruction, and the dormant state includes a state in which no operation is performed.
  • the above method further includes: receiving first voice information sent by the first voice interaction device when the user's second voice input is collected, and the first voice information is related to the Corresponding to the second voice input; determine whether the first voice information is general voice information; and if the first voice information is the general voice information, send the first voice information to the multiple voice interaction devices One voice message.
  • the above method further includes: receiving a recovery request sent by the first voice interaction device, the recovery request being collected by the first voice interaction device after the user’s third voice input or preview Suppose it is sent when the user’s voice input is not collected within a time period; and a restoration instruction is sent to the multiple voice interaction devices, so that the multiple voice interaction devices are restored to the time before collecting the first voice input status.
  • the above method further includes: monitoring the operation of the first voice interaction device, and determining whether the first voice interaction device performs a third operation In the case where the first voice interaction device performs the third operation, sending a synchronization request to the first voice interaction device; and receiving the first voice interaction device sent in response to the synchronization request The execution progress information of the third operation.
  • the above method further includes: in a case where at least one parameter information respectively sent by at least one voice interaction device among the multiple voice interaction devices is received again, re-determining the first voice interaction device; The acquisition request sent by the re-determined first voice interaction device under the condition that the user’s fourth voice input is collected; and in response to the re-determined acquisition request of the first voice interaction device, The re-determined first voice interaction device sends the execution progress information.
  • the second aspect of the present disclosure provides an operation method of a voice interaction device.
  • the method includes: when the user's first voice input is collected, sending parameter information to the control device to determine whether the voice interaction device is the first voice interaction device; when the voice interaction device is the first voice interaction device In the case of the device, the wake-up instruction sent by the control device is received, in response to the wake-up instruction being in the wake-up state; in the case that the voice interaction device is not the first voice interaction device, the non-received instruction sent by the control device is received.
  • the wake-up instruction is in a non-wake-up state in response to the non-wake-up instruction, wherein the first voice input includes a predetermined voice input and a voice input that represents the user's needs.
  • the above-mentioned parameter information includes performance parameters of the voice interaction device, and before sending the parameter information to the control device, the above-mentioned method further includes: acquiring the performance parameters.
  • the above-mentioned parameter information further includes location information of the user.
  • the method further includes: determining the user according to the collected first voice input of the user The location information of the user, wherein the location information of the user represents the location of the user relative to the voice interaction device.
  • the above-mentioned parameter information includes operation information of the voice interaction device
  • the non-wake-up instruction includes a sleep instruction and a hibernation instruction
  • the sleep state includes a state in which the first operation is performed and the collected voice input of the user is not responded
  • the sleep state includes The state of not performing any operations.
  • the above method further includes: collecting the user's voice input in real time; in the case of collecting the user's second voice input, determining the second voice input corresponding to the second voice input. 2. Whether the voice information is general voice information; and in the case of determining that the second voice information corresponding to the second voice input is general voice information, sending the second voice information to the control device, so that the control device sends the second voice information to multiple The voice interaction device enables multiple voice interaction devices to perform operations corresponding to the second voice information.
  • the above method further includes: determining the first voice information corresponding to the second voice input when the voice interaction device is in an awake state and the second voice input of the user is collected Whether it is general voice information; and if it is determined that the first voice information is general voice information, sending the first voice information to the control device. And/or the above method further includes: receiving second voice information belonging to the general voice information sent by the control device; and executing a second operation according to the second voice information, and the second operation is the same as the first 2.
  • the voice input corresponds to the voice information.
  • the above method further includes: when the voice interaction device is in an awake state and the user's third voice input is collected or the user's voice input is not collected within a preset time period, Sending a restoration request to the control device; and/or, the above method further includes: receiving a restoration instruction sent by the control device, and switching the current state to a state before the first voice input is collected.
  • the above method further includes: when the voice interaction device is in an awake state and a third operation is performed, in response to a synchronization request sent by the control device, sending the control device The execution progress information of the third operation.
  • the above method further includes: when the voice interaction device is in an awake state and a fourth voice input of the user is collected, sending an acquisition request to the control device; receiving the control The apparatus sends the execution progress information in response to the acquisition request; and executes the third operation according to the execution progress information, wherein the third operation corresponds to the fourth voice input.
  • a third aspect of the present disclosure provides a control device.
  • the device includes: a parameter information receiving module, configured to receive multiple parameter information sent by multiple voice interaction devices, and the multiple parameter information is the multiple voice
  • the interaction device sends the first voice input of the user at the same time; the demand information determination module is used to determine the demand information of the user according to the first voice input; the first device determination module is used to According to the multiple parameter information and the demand information, it is determined that the first voice interaction device among the multiple voice interaction devices is the device that interacts with the user; and the instruction sending module is configured to send the first voice
  • the interaction device sends a wake-up instruction, and sends a non-wake-up instruction to voice interaction devices other than the first voice interaction device among the multiple voice interaction devices, wherein the first voice input includes a predetermined voice input and a characterizing Voice input that describes user needs.
  • the above-mentioned parameter information includes performance parameters of a voice interaction device
  • the first device determining module is specifically configured to: according to the performance parameters of each voice device in the plurality of voice interaction devices and the demand information To determine the first voice interaction device.
  • the above-mentioned parameter information further includes location information of the user
  • the first device determining module includes: a first determining sub-module configured to determine the performance parameters of the multiple voice interaction devices and the At least one second voice interaction device that matches the user’s demand information; and a second determining submodule, configured to determine the at least one first voice interaction device according to the user’s location information in the parameter information sent by the at least one second voice interaction device
  • One of the two voice interaction devices is the first voice interaction device, wherein the location information of the user represents the location of the user relative to the voice interaction device.
  • the above-mentioned parameter information includes operation information of the voice interaction device, the non-wake-up instruction includes a sleep instruction and a sleep instruction, and the instruction sending module includes: an operation determination sub-module for interacting according to the other voice
  • the operation information of the device determines whether the other voice interaction device performs the first operation when collecting the first voice input; and the instruction sending sub-module is configured to send the other voice interaction device that performs the first operation A sleep instruction to send the sleep instruction to other voice interaction devices that have not performed the first operation, wherein the voice interaction device is in a sleep state in response to the sleep instruction, and the sleep state includes performing the first operation and
  • the collected voice input of the user is not responding; the voice interaction device is in a sleep state in response to the sleep instruction, and the sleep state includes a state where no operation is performed.
  • the above-mentioned control device further includes a first voice information receiving module, configured to receive the first voice information sent by the first voice interaction device when the second voice input of the user is collected, The first voice information corresponds to the second voice input; a first voice information determining module is used to determine whether the first voice information is general voice information; and a first voice information sending module is used to If the first voice information is the general voice information, sending the first voice information to the multiple voice interaction apparatuses.
  • a first voice information receiving module configured to receive the first voice information sent by the first voice interaction device when the second voice input of the user is collected, The first voice information corresponds to the second voice input
  • a first voice information determining module is used to determine whether the first voice information is general voice information
  • a first voice information sending module is used to If the first voice information is the general voice information, sending the first voice information to the multiple voice interaction apparatuses.
  • the above-mentioned control device further includes: a recovery request receiving module that receives a recovery request sent by the first voice interaction device, and the recovery request is collected by the first voice interaction device after collecting The third voice input or sending when the user's voice input is not collected within a preset time period; and the instruction sending module is also used for: when the recovery request receiving module receives the recovery request, Sending a recovery instruction to the multiple voice interaction devices to restore the multiple voice interaction devices to a state before collecting the first voice input.
  • the above-mentioned control device further includes: an operation monitoring module, configured to monitor the operation performed by the first voice interaction device after the instruction sending module sends the wake-up instruction, and determine the first Whether the voice interaction device performs the third operation; a synchronization request sending module, configured to send a synchronization request to the first voice interaction device when it is determined that the first voice interaction device performs the third operation; and The progress information receiving module is configured to receive the execution progress information of the third operation sent by the first voice interaction device in response to the synchronization request.
  • an operation monitoring module configured to monitor the operation performed by the first voice interaction device after the instruction sending module sends the wake-up instruction, and determine the first Whether the voice interaction device performs the third operation
  • a synchronization request sending module configured to send a synchronization request to the first voice interaction device when it is determined that the first voice interaction device performs the third operation
  • the progress information receiving module is configured to receive the execution progress information of the third operation sent by the first voice interaction device in response to the synchronization request.
  • the above-mentioned first device determining module is further configured to: when the parameter information receiving module again receives at least one parameter information respectively sent by at least one voice interaction device among the multiple voice interaction devices , Re-determine the first voice interaction device.
  • the above-mentioned control device further includes: an acquisition request receiving module configured to receive an acquisition request sent by the re-determined first voice interaction device when the user's fourth voice input is collected; and first progress information transmission A module for sending the execution progress information to the re-determined first voice interaction device in response to the acquisition request of the re-determined first voice interaction device.
  • a fourth aspect of the present disclosure provides a voice interaction device, including: a parameter information sending module, configured to send parameter information to a control device when the user's first voice input is collected to determine the voice interaction device Whether it is the first voice interaction device; the instruction receiving module is configured to: in the case that the voice interaction device is the first voice interaction device, receive the wake-up instruction sent by the control device; when the voice interaction device is not the first voice interaction device In the case of a voice interaction device, receiving a non-wake-up instruction sent by the control device; and a state switching module, configured to: in a case where the instruction receiving module receives the wake-up instruction, respond to the wake-up instruction to The current state is switched to the wake-up state; in the case that the instruction receiving module receives the non-wake-up instruction, in response to the non-wake-up instruction, the current state is switched to the non-wake-up state, wherein the first voice input includes A predetermined voice input and a voice input that characterizes the user's needs.
  • the above-mentioned parameter information includes performance parameters of the voice interaction device
  • the above-mentioned voice interaction device further includes: a performance parameter acquisition module for sending the parameter information to the control device by the parameter information sending module Before information, obtain the performance parameters.
  • the above-mentioned parameter information further includes the position information of the user
  • the above-mentioned voice interaction device further includes: a position information determining module configured to determine the user’s position information according to the collected first voice input of the user Location information, where the location information of the user represents the location of the user relative to the voice interaction device.
  • the above-mentioned parameter information includes operation information of the voice interaction device
  • the non-wake-up instruction includes a sleep instruction and a hibernation instruction
  • the state switching module switches the current state to the sleep state in response to the sleep instruction
  • the sleep state includes performing the first operation and The collected voice input of the user is not responding
  • the non-awakening instruction received by the instruction receiving module is the sleep instruction
  • the state switching module switches the current state to the sleep state in response to the sleep instruction, and the sleep state includes a state where no operation is performed.
  • the above-mentioned voice interaction device further includes: a second voice information determining module, configured to determine the second voice information in the wake-up state and the second voice input of the user is collected Whether the first voice information corresponding to the voice input is general voice information; the second voice information sending module is configured to send the first voice information to the control device when it is determined that the first voice information is general voice information; And or alternatively, the above-mentioned voice interaction device further includes: a second voice information receiving module, configured to receive second voice information belonging to the general voice information sent by the control device; and an operation execution module, configured to perform according to the second voice information Voice information, perform a second operation, and the second operation corresponds to the voice input corresponding to the second voice information.
  • a second voice information determining module configured to determine the second voice information in the wake-up state and the second voice input of the user is collected Whether the first voice information corresponding to the voice input is general voice information
  • the second voice information sending module is configured to send the first voice information to the control device when it is determined that
  • the above-mentioned voice interaction device further includes a recovery request sending module, which is used for those who are in an awake state and collect the third voice input of the user or have not collected the voice input of the user within a preset time period.
  • a restoration request is sent to the control device; and/or the instruction receiving module is also used to receive a restoration instruction sent by the control device; the state switching module is also used to respond to the restoration instruction to change the current The state is switched to the state before the first voice input is collected.
  • the above-mentioned voice interaction device further includes: a second progress information sending module, configured to respond to the synchronization request sent by the control device in the wake-up state and perform the third operation, The control device sends execution progress information of the third operation.
  • the above-mentioned voice interaction device further includes: an acquisition request sending module, configured to send the acquisition request to the control device when the user is in the awake state and the fourth voice input of the user is collected. Request; a second progress information receiving module for receiving the execution progress information sent by the control device in response to the acquisition request; and an operation execution module for performing the third operation according to the execution progress information , Wherein the third operation corresponds to the fourth voice input.
  • a fifth aspect of the present disclosure provides an electronic device, including the aforementioned control device and the aforementioned voice interaction device.
  • a sixth aspect of the present disclosure provides an electronic device, including: one or more processors; a storage device, for storing one or more programs, wherein when the one or more programs are used by the one or more When the two processors are executed, the one or more processors are caused to execute the above-mentioned operation method of the control device and/or the above-mentioned operation method of the voice interaction device.
  • Another aspect of the present disclosure provides a computer-readable storage medium storing computer-executable instructions, which are used to implement the above-mentioned control method of the smart home system when executed.
  • Another aspect of the present disclosure provides a computer program that includes computer-executable instructions that, when executed, are used to implement the operation method of the above-mentioned control device and/or the above-mentioned voice interaction device Method of operation.
  • the embodiments of the present disclosure it is possible to at least partially solve the technical problem in the prior art that multiple voice interaction devices will be awakened in response to the user's wake-up words at the same time at the same time, thereby causing noisy and chaotic interaction scenes between the user and the smart device. And therefore, by determining a technical solution of a voice interaction device as the only device to be awakened based on multiple parameter information sent by multiple voice interaction devices, a noisy interaction environment is avoided, thereby improving user experience.
  • FIG. 1 schematically shows an application scenario of a control device and an operation method thereof, and a voice interaction device and an operation method thereof according to an embodiment of the present disclosure
  • FIG. 2A schematically shows a flowchart of an operation method of a control device according to an embodiment of the present disclosure
  • Fig. 2B schematically shows an operation flowchart of determining a first voice interaction device according to an embodiment of the present disclosure
  • FIG. 2C schematically shows an operation flowchart of sending a non-wake-up instruction to other voice interaction devices according to an embodiment of the present disclosure
  • FIG. 3 schematically shows a flowchart of an operation method of a control device according to a second embodiment of the present disclosure
  • Fig. 4 schematically shows a flowchart of an operation method of a control device according to a third embodiment of the present disclosure
  • FIG. 5A schematically shows a flowchart of an operation method of a control device according to a fourth embodiment of the present disclosure
  • FIG. 5B schematically shows an application scenario diagram of the operation method shown in FIG. 5A;
  • FIG. 5C schematically shows a flowchart of the operation method of the control device according to the fifth embodiment of the present disclosure
  • Fig. 6A schematically shows a flow chart of the operation method of the voice interaction device according to the first embodiment of the present disclosure
  • FIG. 6B schematically shows a flowchart of the operation method of the voice interaction device according to the second embodiment of the present disclosure
  • Fig. 7A schematically shows a flowchart of an operation method of a voice interaction device according to a third embodiment of the present disclosure
  • FIG. 7B schematically shows a flowchart of the operation method of the voice interaction device according to the fourth embodiment of the present disclosure
  • FIG. 8 schematically shows a flowchart of an operation method of a voice interaction device according to a fifth embodiment of the present disclosure
  • Fig. 9A schematically shows a flowchart of an operation method of a voice interaction device according to a sixth embodiment of the present disclosure
  • FIG. 9B schematically shows a flowchart of the operation method of the voice interaction device according to the seventh embodiment of the present disclosure.
  • Fig. 10 schematically shows a structural block diagram of a control device according to an embodiment of the present disclosure
  • Fig. 11 schematically shows a structural block diagram of a voice interaction device according to an embodiment of the present disclosure
  • Fig. 12 schematically shows a block diagram of an electronic device suitable for performing an operating method of a control device or an operating method of a voice interaction device according to an embodiment of the present disclosure.
  • At least one of the “systems” shall include but not limited to systems having A alone, B alone, C alone, A and B, A and C, B and C, and/or systems having A, B, C, etc. ).
  • the embodiment of the present disclosure provides an operating method of a control device capable of improving interactive experience.
  • the method includes: receiving multiple parameter information sent by multiple voice interaction devices, where the multiple parameter information is It is sent when the first voice input of the user at the same time is collected; the user's demand information is determined according to the first voice input; the first voice interaction device among the multiple voice interaction devices is determined according to multiple parameter information and demand information It is a device that interacts with a user; and sends a wake-up instruction to the first voice interaction device, and sends a non-wake-up instruction to voice interaction devices other than the first voice interaction device among the multiple voice interaction devices.
  • the first voice input includes a predetermined voice input and a voice input that characterizes user needs.
  • Another embodiment of the present disclosure provides an operating method of a voice interaction device capable of improving interactive experience.
  • the method includes: sending parameter information to the control device to determine the voice when the user’s first voice input is collected. Whether the interaction device is the first voice interaction device; in the case that the voice interaction device is the first voice interaction device, receive the wake-up instruction sent by the control device to respond to the wake-up instruction in the wake-up state; when the voice interaction device is not the first voice interaction device In the case of the device, a non-wake-up instruction sent by the control device is received to respond to the non-wake-up instruction in a non-wake-up state, wherein the first voice input includes a predetermined voice input and a voice input that represents the user's needs.
  • FIG. 1 schematically shows an application scenario of a control device and an operation method thereof, and a voice interaction device and an operation method thereof according to an embodiment of the present disclosure.
  • FIG. 1 is only an example of application scenarios in which the embodiments of the present disclosure can be applied to help those skilled in the art understand the technical content of the present disclosure, but it does not mean that the embodiments of the present disclosure cannot be used for other applications.
  • the application scenario 100 includes multiple voice interaction devices, a network 120, and a user 130.
  • the network 120 may be a local area network, for example, a wireless network, etc., for example.
  • At least two of the multiple voice interaction devices may interact with each other via the network 120, for example, and the multiple voice interaction devices are smart electronic devices that can interact with the user through voice.
  • the multiple voice interaction devices may be, for example, smart home devices, which may specifically include smart lamps 111, smart TVs, smart set-top boxes, smart speakers, smart air conditioners 112, smart water heaters 113, smart refrigerators 114, smart curtains, and smart washing machines 115.
  • Smart air purifiers, smart game consoles, smart projectors, etc. may include smart bathroom equipment, such as smart showers, smart bathtubs, smart bath heaters, smart vanity mirrors, smart toilets, etc.; or, for specific examples, it may also Including smart kitchen equipment, such as smart range hoods, smart hot water boilers, smart gas hoods, smart cabinets, smart dishwashers, smart microwave ovens, and smart ovens.
  • the plurality of voice interaction devices can receive a user's voice input, and can switch in response to the voice input when the voice input includes a wake-up word (for example, "dingdongdingdong", etc.) To wake up state, to receive user instructions, and perform corresponding operations according to user instructions.
  • a wake-up word for example, "dingdongdingdong", etc.
  • the plurality of voice interaction devices for example, further have sensors, such as a sound sensor or a distance sensor, etc., which are used to realize the user's voice input according to the sound source or infrared detection when the user's voice input is received. Positioning to measure the user's location information relative to itself.
  • the multiple voice interaction apparatuses may also interact with a network device that provides the network 120, for example, to send the measured location information to the network device, so that the network device determines the interaction with the user according to the location information.
  • the first voice interaction device enables the multiple voice interaction devices to work under the control of the network device.
  • the user 130 may, for example, also control the multiple voice interaction devices through an application program installed in other electronic devices, and may set one voice interaction device among the multiple voice interaction devices through the application program. If the device is a control device, and the other devices are controlled devices, the multiple voice interaction devices send the position information measured according to the voice input to the control device, so that the control device determines the first voice interaction device to interact with the user according to the position information , So that the multiple voice interaction devices work under the control of the set control device.
  • the smart refrigerator 114 is awakened under the control of the network device or the determined control device, while the smart lamp 111, the smart air conditioner 112, the smart water heater 113, and the smart washing machine 115 are under the control of the network device or the determined control device Keeping the non-awakened state can avoid the noisy and chaotic voice interaction scene caused by multiple voice interaction devices being awakened, and improve user experience.
  • the operation method of the control device provided in the embodiments of the present disclosure may generally be executed by a certain control device, and the operation method of the voice interaction device may be executed by any one of multiple voice interaction devices.
  • the control device provided by the embodiment of the present disclosure can generally be set in a network device or any voice interaction device (for example, in a certain control device), and the voice interaction device may refer to the voice interaction devices 111 to 115 in FIG. .
  • the operation method of the control device provided by the embodiment of the present disclosure can also be executed by other electronic equipment that is different from the voice interaction device and can communicate with the voice interaction device.
  • the control device provided by the embodiment of the present disclosure can also be set in other electronic equipment that is different from the voice interaction device and can communicate with the voice interaction device.
  • voice interaction devices and networks 120 in FIG. 1 are only illustrative, and any type and number of voice interaction devices and cloud devices can be provided according to implementation needs.
  • FIG. 2A schematically shows a flowchart of an operation method of a control device according to an embodiment of the present disclosure
  • FIG. 2B schematically shows an operation flowchart of determining a first voice interaction device according to an embodiment of the present disclosure
  • FIG. 2C schematically An operation flowchart of sending a non-wake-up instruction to another voice interaction device according to an embodiment of the present disclosure is shown.
  • the operation method of the control device includes operation S201 to operation S204.
  • the multiple parameter information may be sent when multiple voice interaction devices receive the first voice input of a certain user at the same time.
  • the voice interaction device has a voice collection function, such as a voice collector, etc.
  • the multiple voice interaction devices may be voice interaction devices capable of collecting the user's first voice input within a spatial range, for example, if the user has If there are n voice interaction devices, the multiple voice interaction devices involved in operation S201 are some or all of the n voice interaction devices.
  • the multiple voice interaction devices may be part or all of the smart lamp 111, smart air conditioner 112, smart water heater 113, smart refrigerator 114, and smart washing machine 115 in FIG. 1.
  • multiple voice interaction devices 111 to 115 can receive the first voice input by collecting the voice command. Then, when the multiple voice interaction devices 111-115 receive the first voice input, they respectively send parameter information to the control device for the control device to receive.
  • the first voice input may specifically include a predetermined voice input and a voice input that characterizes user needs.
  • the predetermined voice input may be, for example, a voice input corresponding to a wake-up word (for example, "Ding Dong Ding Dong") of the voice interaction device.
  • the wake-up word may be, for example, a wake-up word preset when the voice interactive device is shipped from the factory, or a wake-up word customized by the user.
  • the voice input that characterizes the user's needs is the voice input corresponding to other voices except the wake-up word, for example, it can be "Play Daoxiang for me”.
  • the parameter information may include, for example, attribute parameters and/or performance parameters of the voice interaction device, where the attribute parameters may include, for example, the brand and model of the voice interaction device, and the performance parameters may include, for example, the voice interaction device.
  • Functions such as playing music, playing video, temperature control, lighting
  • the working parameters of the voice interaction device such as lighting brightness, sound quality, screen resolution, temperature adjustment range, etc.
  • the demand information of the user is determined according to the first voice input.
  • the certain voice interaction device may also perform voice recognition analysis on the first voice input after collecting the first voice input, thereby obtaining the user Demand information.
  • the user's demand information is specifically obtained by recognizing and analyzing the voice input that represents the user's demand in the first voice input. For example, when the voice input that characterizes the user's needs is "Play Daoxiang for me", the demand information obtained can be, for example, playing music; when the voice input that characterizes the user's needs is "Playing lossless music for me", the demand information is obtained. For example, it can be playing music and high sound quality.
  • the specific implementation method of recognizing and analyzing speech can adopt any speech recognition method in the prior art, which is not limited in the present disclosure.
  • the first voice interaction device among the multiple voice interaction devices is determined to be the device that interacts with the user according to multiple parameter information and demand information.
  • the operation S203 may specifically include, for example, first determining the matching relationship between the multiple parameter information and the demand information, and then determining the voice matching the parameter information and the demand information
  • the interaction device is the first voice interaction device. For example, when the demand information is to play music, it can be determined that the parameter information that has a music play function matches the demand information, and the voice interaction device corresponding to the parameter information is the first voice interaction device.
  • the demand information is to play music and high sound quality
  • the voice interaction device corresponding to the parameter information is the first voice interaction device.
  • the parameter information sent by the voice interaction device may, for example, further include user location information used to characterize the location of the user relative to the voice interaction device.
  • the multiple voice interaction devices should also have the function of analyzing and processing the collected voice input.
  • the voice interaction device may determine the distance of the user relative to itself according to the strength of the voice signal after collecting the voice input of the user. Then the position information included in the sent parameter information may be a distance value.
  • the multiple voice interaction devices may also be provided with a sensor capable of positioning the user, and the trigger condition of the sensor operation is that the voice interaction device collects the first voice input, and the sensor may be collected by the voice interaction device, for example.
  • the sound source of the voice input realizes the positioning of the user, or it can also realize the positioning of the user through infrared detection and other technologies to obtain the user's location information included in the user information.
  • the location information can be a distance value or a voice interaction
  • the location of the device is the coordinate value of the user's location obtained by the origin positioning. It is understandable that the location information of the user described above is only used as an example to facilitate understanding of the present disclosure, and the location information may also include, for example, the space where the user is determined.
  • operation S203 may specifically include operation S213 to operation S223.
  • operation S213 determine at least one second voice interaction device whose performance parameter matches the user's demand information among the plurality of voice interaction devices; in operation S223, according to the user's location information in the parameter information sent by the at least one second voice interaction device , It is determined that one of the at least one second voice interaction device is the first voice interaction device.
  • the second voice interaction device that can meet the needs of the user is determined according to the performance parameters of multiple voice interaction devices. Then, the second voice interaction device whose location information characterizes the closest distance to the user is selected as the first voice interaction device.
  • the first voice interaction device determined in operation S203 may be, for example, the smart refrigerator 114 described with reference to FIG. 1, which will not be repeated here.
  • a wake-up instruction is sent to the first voice interaction device, and a non-wake-up instruction is sent to other voice interaction devices among the plurality of voice interaction devices except the first voice interaction device.
  • the first voice interaction device since the wake-up instruction is sent to the first voice interaction device, the first voice interaction device can switch from the state before receiving the first voice input to the wake-up state in response to the wake-up instruction; The voice interaction device sends a non-wake-up instruction. Therefore, other voice interaction devices can switch from the state before receiving the first voice input to the non-wake state.
  • the non-awake state may be, for example, a state in which no response is made to the first voice input; or, the non-awake state may also be a state in which the voice interaction device does not perform any operation, that is, similar to shutting down. status.
  • the above-mentioned operation S204 may specifically be: sending a wake-up instruction to the smart refrigerator 114 in reference to FIG. 1, and the smart refrigerator 114 is awakened in response to the wake-up instruction, and can perform voice interaction with the user, and send a voice to the smart refrigerator 114.
  • the voice interaction device that receives the first voice input among the lamp 111, the smart air conditioner 112, the smart water heater 113, and the smart washing machine 115 sends a non-wake-up instruction, and the voice interaction devices switch to the non-wake state in response to the non-wake-up instruction, that is, The state where voice interaction with the user is not possible.
  • some voice interaction devices may perform operations before collecting the first voice input in other voice interaction devices, which are similar to lighting, cooling, etc., and do not affect the interaction process between the first voice interaction device and the user.
  • Operations such as operations that do not make a sound.
  • the part of the voice interaction device is directly switched to a state similar to shutting down, the user experience may be affected. For example, before the user sends out the voice command "Ding Dong Ding Dong, help me play Dao Xiang" in a dark space, the smart lamp 111 in the dark space performs the lighting operation.
  • the first voice interaction device is determined In the case of a smart refrigerator 114, issuing an instruction to the smart lamp 111 to switch the smart lamp 111 to a state of not performing any operation (that is, a state of not performing a lighting operation) will undoubtedly bring a poor user experience.
  • the non-wake-up instruction issued by the operation S204 may include a sleep instruction and a sleep instruction, and the parameter information sent by multiple voice interaction devices may also include operation information of the voice interaction device.
  • operation S204 may specifically include operation S214 to operation S224, for example.
  • the sleep instruction is sent to other voice interaction devices that perform the first operation, and Other voice interaction devices that have not performed the first operation send a sleep instruction.
  • the first operation may specifically be an operation that does not affect the interaction process between the first voice interaction device and the user, and the type of operation included in the first operation may be set according to actual requirements.
  • another voice interaction device receives a sleep instruction, it can switch to the sleep state in response to the sleep instruction.
  • the sleep state may specifically be able to perform the first operation and collect voice input, but does not respond to the collected voice input of the user status.
  • other voice interaction devices receive the sleep instruction, they can switch to the sleep state in response to the sleep instruction.
  • the sleep state may be a state in which no operation is performed, for example, it may be a shutdown state.
  • the operating method of the control device of the present disclosure determines the only device among the multiple voice interaction devices as the awakened device through the received parameter information, which is used to interact with the user, compared with the multiple voice interaction devices in the prior art.
  • the technical solution in which voice interaction devices are all awakened can avoid the defect of noisy and chaotic interaction scenes caused by multiple voice interaction devices interacting with the user at the same time, and thus improve user experience.
  • Fig. 3 schematically shows a flowchart of an operation method of a control device according to a second embodiment of the present disclosure.
  • the operation method of the control device according to an embodiment of the present disclosure may further include operations S305 to S307 as shown in FIG. 3.
  • the first voice information sent by the first voice interaction device when the user's second voice input is collected is received.
  • the first voice information corresponds to the second voice input.
  • the second voice input may specifically be a voice input corresponding to the voice used to send instructions to multiple voice interaction devices, and the instruction may specifically be similar to "turn off all devices" or "I'm going out", etc.
  • Multiple voice interaction devices can be universal and require multiple voice interaction devices to respond to voice instructions. Because the voice instruction requires multiple voice interaction devices to work together to achieve the effect that the user wants, if only the first voice The interaction device performs a corresponding operation in response to the second voice information corresponding to the second voice input, which cannot well meet the needs of the user.
  • the user's voice input can be collected in real time, and corresponding operations can be performed in response to the user's voice input.
  • the first voice interaction device first determines whether the corresponding first voice information is general voice information, that is, multiple voice interaction devices can be universal, And the voice information that multiple voice interaction devices respond together is required; if it is general voice information, the first voice information corresponding to the second voice input should be sent to the control device to notify the control device that the voice command requires multiple voice interaction devices to share carry out.
  • the cloud system or the first voice interaction device pre-stores a list of general voice information as a reference for determining whether it is general voice information.
  • the first voice information may specifically correspond to the second voice input, and can characterize the second voice input and the information that can be recognized by the electronic device. For example, it may be a binary code or character sequence obtained by converting the second voice input. And so on; or after the speech input is recognized and processed, then the binary code or character sequence obtained is converted.
  • operation S306 it is determined whether the first voice information is general voice information.
  • the operation S306 may specifically be to compare the first voice information with a general voice information list stored in the cloud system or pre-stored by the control device, if the first voice information is in the general voice list Information, it is determined that the first voice information is general voice information; or, operation S306 can also be specifically implemented by the following operations: use the first voice information as the input of the pre-trained deep learning model, and the output result is the binary classification As a result, it can be characterized as universal language information or not universal language information. It can be understood that the foregoing method is only an example for implementing operation S306, which is not limited in the present disclosure. It is also understandable that the control device can avoid the defect of inaccurate judgment result caused by the difference between the general voice information stored in the control device and the first voice interaction device by determining whether the first voice information is general voice information again.
  • the first voice information is sent to multiple voice interaction devices, so that the multiple voice interaction devices perform an operation corresponding to the second voice input.
  • the first voice information is sent to multiple voice interaction devices, and the multiple voice interaction devices respond to the operations of the first voice information to complete the same activities as the user needs, and meet the user needs, for example, when the second voice input
  • the multiple voice interaction devices can perform operations corresponding to the first voice information, such as shutdown operations.
  • the response operation of the voice interactive device can be made more in line with user needs.
  • the voice interaction device in order to prevent the voice interaction device itself from being turned off after receiving the first voice information, the voice interaction device may be turned on by performing a corresponding operation again.
  • the voice interaction apparatus may determine whether to perform an operation corresponding to the second voice input, for example, according to the current state. If the current state matches the first voice information, the operation corresponding to the second voice input is performed, and if the current state does not match the first voice information, the operation corresponding to the second voice input is not performed.
  • Fig. 4 schematically shows a flowchart of the operation method of the control device according to the third embodiment of the present disclosure.
  • the first voice interaction device collects the user's third voice input or does not collect the user's voice input within a preset time period, it is considered that the user may no longer need to interact with the user at this time.
  • the first voice interaction device interacts. Or, in the case that the user’s voice input is not collected within the preset time period, the user can also be issued a questioning voice similar to "Master, master, are you still listening?" or "Master, are you still there?" If the user’s voice input is still not collected after the inquiry voice is issued, it can be determined that the user no longer needs to interact with the first voice interaction device at this time.
  • the third voice input may specifically be a voice input corresponding to an instruction similar to "sleep" issued by the user.
  • the control device Send a resume request that characterizes the end of the interaction.
  • the operation method of the control device of the embodiment of the present disclosure may further include S408 to S409.
  • operation S408 a recovery request sent by the first voice interaction device is received, and the recovery request is sent by the first voice interaction device when the user's third voice input is collected or the user's voice input is not collected within a preset time period;
  • operation S409 a recovery instruction is sent to the multiple voice interaction devices, so that the multiple voice interaction devices are restored to the state before the first voice input of the user is collected.
  • the multiple voice interaction devices are restored to the state of playing music or radio before collecting the first voice input, so as to continue to perform operations such as playing radio or music to the user to meet user needs.
  • Fig. 5A schematically shows a flowchart of an operation method of a control device according to a fourth embodiment of the present disclosure.
  • the operation method of the control device of the embodiment of the present disclosure may also include operations S510 to S512 as shown in FIG. 5A.
  • operation S510 the operation of the first voice interaction device is monitored, and it is determined whether the first voice interaction device performs a third operation.
  • operation S511 when the first voice interaction device performs the third operation, a synchronization request is sent to the first voice interaction device; in operation S512, the execution progress of the third operation sent by the first voice interaction device in response to the synchronization request is received information.
  • the control device may, for example, monitor the operation of the first voice interaction device in real time, and determine in real time Whether the monitored operation is the third operation.
  • the third operation may specifically be an operation that takes longer than a preset time to execute in response to the user's voice input, for example, it may be a broadcast operation, such as playing music, reading e-books, playing radio, or playing videos; or Operations with complex processes, such as online shopping.
  • the execution progress information of the third operation sent by the first voice interaction device in response to the synchronization request can be received.
  • the execution progress information may be the length of time the third operation has been executed, or the third operation has been executed.
  • the stored progress information may also be stored and updated.
  • Fig. 5B schematically shows an application scenario diagram of the operation method of the control device shown in Fig. 5A
  • Fig. 5C schematically shows a flowchart of the operation method of the control device according to the fifth embodiment of the present disclosure.
  • the operation method of the control device of the embodiment of the present disclosure can be applied to a home scene, for example, in which smart speakers are configured in the living room 501, the bedroom 502, the bedroom 503, and the bedroom 504, then in FIG. 2A
  • the multiple voice interaction devices described may be smart speakers configured in the living room 501, bedroom 502, bedroom 503, and bedroom 504.
  • the user speaks a voice command including a wake-up word in the living room 501
  • the four smart speakers Then, the first voice input can be collected, and the smart speaker in the living room 501 can be awakened through operations S201 to S204.
  • the operation method of the control device of the embodiment of the present disclosure may also include operation S513 shown in FIG.
  • the first voice interaction device is re-determined. Specifically, operations S201 to S203 may be repeatedly performed to re-determine the first voice interaction device.
  • the re-determined first voice interaction device is the smart speaker in the bedroom 502.
  • the method can also send a message to the re-determined first voice
  • the interactive device (the smart speaker in the bedroom 502) sends a wake-up instruction to switch the re-determined first voice interaction device to the wake-up state, and the first voice input is received to other than the re-determined first voice interaction device
  • the other devices in the send a non-wake-up instruction to switch the previously determined first voice interaction device (such as the smart speaker in the living room 501) and other devices to the non-wake state.
  • the previously determined first voice interaction device may also fail to collect the voice instruction provided again because of being too far away from the user.
  • the operating method of the control device may also send a sleep instruction to the first voice interaction device currently in the awake state after re-determining the first voice interaction device, so that the previous first voice interaction device switches to the sleep state , To avoid the consumption of additional power.
  • the user when the user instructs the first voice interaction device to perform the third operation described in FIG. 5A, that is, the user listens to music or listens to content with a long duration such as broadcasting, after moving from the living room 501 to the bedroom 502 When waking up the smart speaker in the bedroom 502 and sleeping the smart speaker in the living room 501, it is more hoped that the smart speaker in the bedroom 502 can continue to perform operations such as playing following the execution progress of the smart speaker in the living room 501. Therefore, the present disclosure
  • the operation method of the control device of the embodiment may further include operations S514 to S515 described in FIG. 5C.
  • an acquisition request sent by the re-determined first voice interaction device when the user’s fourth voice input is collected is received; in operation S515, in response to the re-determined acquisition request of the first voice interaction device, Send the execution progress information to the re-determined first voice interaction device.
  • the control device receives the third operation in real time. Therefore, through the above operations S514 to S515, when the user can issue an instruction such as "continue” or "continue playing", the smart speaker in the bedroom 502 can collect the fourth voice corresponding to the instruction In the case of input, an acquisition request is sent to the control device to acquire the execution progress information of the third operation, and the third operation is continued to be executed according to the acquired execution progress information.
  • the operating method of the control device of the embodiment of the present disclosure responds to the user's voice input, and the re-determined first voice interaction device can continue to perform the third operation that was not completed by the previous first voice interaction device, so that
  • the intelligent system composed of multiple voice interaction devices can provide users with smooth services and avoid the repeated execution of some operations, thereby avoiding the defect of wasting user time and effectively improving user experience.
  • Fig. 6A schematically shows a flow chart of the operation method of the voice interaction device according to the first embodiment of the present disclosure
  • Fig. 6B schematically shows the flow chart of the operation method of the voice interaction device according to the second embodiment of the present disclosure.
  • the operation method of the voice interaction device includes operation S601.
  • parameter information is transmitted to the control device.
  • the operation S601 may be specifically executed when the voice interaction device collects the user's first voice input, so that the control device determines whether the voice interaction device is the first voice interaction device according to the parameter information .
  • the first voice input may specifically include a predetermined voice input and a voice input that characterizes user needs.
  • a predetermined voice input and a voice input that characterizes user needs.
  • the parameter information may include performance parameters of the voice interaction device, for example.
  • the operation method of the voice interaction apparatus may further include operation S604 to obtain performance parameters.
  • the performance parameters may be obtained locally by the voice interaction device, or obtained from a server or cloud that provides services to the voice interaction device.
  • the parameter information may further include location information of the user, for example.
  • the operation method may further include operation S605: determining the location information of the user according to the collected first voice input of the user. The method for determining the location information of the user is detailed in the description of the location information in FIG. 2C, which will not be described in detail here.
  • the parameter information includes not only performance parameters, but also user location information.
  • the operation method of the voice interaction device includes operation S604 and operation S605 at the same time.
  • the operation S604 may be performed before or after the operation S605, which is not limited in the present disclosure, as long as the operations S604 to S605 are all performed before the operation S601.
  • the operation method of the voice interaction device may further include operation S602, receiving a wake-up instruction sent by the control device, so as to be in the wake-up state in response to the wake-up instruction.
  • the operation S602 may specifically be: upon receiving the wake-up instruction, in response to the wake-up instruction, switch the current state to the wake-up state to interact with the user.
  • the voice interaction device when the voice interaction device is the first voice interaction device, a non-wake-up instruction sent by the control device through operation S204 is received.
  • the operation method of the voice interaction device may further include operation S603, receiving a non-wake-up instruction sent by the control device, so as to be in a non-wake-up state in response to the non-wake-up instruction.
  • the operation S603 may specifically be: when a non-wake-up command is received, in response to the non-wake-up command, first determine whether the current state is a non-wake-up state, if not, switch the current state to the wake-up state in response to the non-wake-up command to Avoid responding to voice commands used.
  • the received non-wake-up instruction may specifically include a sleep instruction or a hibernation instruction, for example.
  • the parameter information sent in operation S601 may also include operation information of the voice interaction device, for example. Therefore, when the operation information indicates that the voice interaction device performs the first operation, the received non-wake-up instruction is the sleep instruction sent by the control device through operation S224 in FIG. 2C. In response to the sleep instruction, the voice interaction device is in the sleep state, i.e. The state switches to sleep state.
  • the received non-wake-up instruction is the sleep instruction sent by the control device through operation S224 in FIG. 2C.
  • the voice interaction device is in a sleep state, i.e. The state switches to sleep state.
  • the sleep state refers to a state where the first operation can be performed but does not respond to the collected user's voice input; the sleep state refers to a state where no operation is performed.
  • the voice interaction device of the embodiment of the present disclosure is controlled by the control device to switch its working state, instead of directly switching the working state to the wake-up state after the user's wake-up words are collected, and when it is switched to the wake-up state, Other voice interaction devices are in a non-awake state, so that the defect of a noisy and chaotic working environment of the voice interaction device can be avoided, and thus the user experience is improved.
  • the voice interaction device is not the first voice interaction device, when the first operation is performed, the received instruction of the control device is a sleep instruction that can continue to perform the first operation, which can improve user experience to a certain extent.
  • Fig. 7A schematically shows a flowchart of an operation method of a voice interaction device according to a third embodiment of the present disclosure.
  • the voice interaction device when switching to the awake state, it can perform voice interaction with the user to perform an operation corresponding to the user's voice instruction.
  • the operation method of the voice interaction device in the embodiment of the present disclosure further includes operation S706 to operation S707 after operation S602.
  • operation S706 when the second voice input of the user is collected, it is determined whether the first voice information corresponding to the second voice input is general voice information; and in operation S707, it is determined whether the first voice information corresponding to the second voice input is
  • the first voice information is sent to the control device, so that the control device sends the first voice information to multiple voice interaction devices that have collected the first voice input through operations S305 to S307, so that The multiple voice interaction devices perform operations corresponding to the second voice input.
  • the above-mentioned operation S707 may specifically recognize the collected voice input first, for example, may recognize a keyword of the voice input, and if the recognized keyword is a preset keyword, determine the The voice input of is the second voice input, and then it is determined whether the first voice information corresponding to the second voice input is general voice information.
  • the first voice information corresponding to the second voice input can be the first voice information corresponding to the second voice input with the voice interaction device or cloud system
  • the list of general voice information stored in the database is compared, and if the first voice information corresponding to the second voice input is the voice information in the general voice information list, it is determined that the first voice information corresponding to the second voice input is the general voice Information; if the first voice information corresponding to the second voice input is not the voice information in the general voice information list, it is determined that the first voice information corresponding to the second voice input is not general voice information.
  • the second voice input and the first voice information may be the second voice input and the first voice information described in operation S305 and operation S306 in FIG. 3, and details are not described herein again.
  • the control device can send the first voice information to multiple voice interaction devices through the operations S305 to S307 described with reference to FIG. 3, This allows multiple voice interaction devices to perform operations corresponding to the second voice input, such as a shutdown operation, to meet the user's needs, thereby ensuring that the interaction scene is not noisy and chaotic, and making the response operation of the voice interaction device more in line with the user demand.
  • Fig. 7B schematically shows a flowchart of the operation method of the voice interaction device according to the fourth embodiment of the present disclosure.
  • the voice interaction device can receive the voice information sent by the control device through operation S307.
  • the operation method of the voice interaction device of the embodiment of the present disclosure may further include operation S708 to operation S709.
  • operation S708 the second voice information belonging to the general voice information sent by the control device is received; in operation S709, a second operation is performed according to the second voice information, and the second operation corresponds to a voice input corresponding to the second voice information.
  • the voice interaction device is the first voice interaction device
  • operations S708 to S709 are performed after operation S707 described in FIG. 7A.
  • the second voice information is the first voice information sent in operation S707.
  • the voice interaction device is not the first voice interaction device
  • operations S708 to S709 are performed after operation S603 described in FIG. 6A.
  • the second voice information is the first voice information determined by the control device to be general voice information through operation S306.
  • Fig. 8 schematically shows a flowchart of an operation method of a voice interaction device according to a fifth embodiment of the present disclosure.
  • the voice interaction device After the voice interaction device is the first voice interaction device and is switched to the awake state, it can be determined whether the user still needs to interact with the voice interaction device in response to the user's voice input or self-determination. Among them, the implementation of determining whether the user still needs to interact is described in detail above, and will not be repeated here.
  • the voice interaction device and/or other voice interaction device may perform operations such as playing broadcast or music before receiving the first voice input, and the user often hopes that the voice interaction device will end the interaction with the voice interaction device.
  • the device and/or other voice interaction devices can continue operations such as broadcasting or music. Therefore, after the voice interaction device determines that the user no longer needs to interact, it may send a resume request that characterizes the end of the interaction to the control device. Therefore, as shown in FIG. 8, after operation S602, the operation method of the embodiment of the present disclosure may further include operation S810, when the user's third voice input is collected or the user's voice input is not collected within a preset time period. To send a recovery request to the control device.
  • the control device can send recovery instructions to multiple voice interaction devices through operations S408 to S409 to restore the multiple voice interaction devices to the state before collecting the user's first voice input .
  • the multiple voice interaction devices can be restored to the state of playing music or radio before collecting the first voice input, so as to continue to play radio or music to the user to meet the user's needs.
  • operation S811 can be performed to receive a recovery instruction from the control device to switch the current state to the one before the first voice input is collected in response to the recovery instruction. status.
  • operation S811 is performed after operation S810.
  • operation S811 is performed after operation S603.
  • Fig. 9A schematically shows a flowchart of an operation method of a voice interaction device according to a sixth embodiment of the present disclosure.
  • the operation method of the voice interaction device of the embodiment of the present disclosure further includes operation S912.
  • the third operation respond In the synchronization request sent by the control device, the execution progress information of the third operation is sent to the control device in real time to update the execution progress of the third operation to the control device in real time.
  • the control device is After the first voice interaction device is re-determined, operations S514 to S515 described in FIG.
  • the intelligent system composed of multiple voice interactive devices can provide users with smooth services, avoid the repeated execution of some operations, and therefore can avoid the defect of wasting user time, and effectively improve user experience.
  • FIG. 5B to FIG. 5C please refer to the description of FIG. 5B to FIG. 5C, which will not be repeated here.
  • Fig. 9B schematically shows a flowchart of the operation method of the voice interaction device according to the seventh embodiment of the present disclosure.
  • the voice interaction device is not the first voice interaction device and is switched to the sleep state by operation S603
  • the first voice input corresponding to the first voice instruction issued by the user again can also be collected, and
  • the parameter information is re-transmitted to the control device through an operation similar to operation S601, so that the control device re-determines the first voice interaction device through operation S513.
  • the operation of the voice interaction device may further include operations S913 to S914.
  • the voice interaction device of the embodiment of the present disclosure can make the re-determined first voice interaction device continue to execute the first voice interaction device executed by the original first voice interaction device in response to the user's voice instruction through the above operations S913 to S915. Three operations, thereby avoiding repeated execution of some operations, and thus avoiding the defect of wasting user time, and effectively improving user experience.
  • Fig. 10 schematically shows a structural block diagram of a control device according to an embodiment of the present disclosure.
  • the control device 1000 includes a parameter information receiving module 1001, a demand information determining module 1002, a first device determining module 1003, and an instruction sending module 1004.
  • the parameter information receiving module 1001 is configured to receive multiple parameter information respectively sent by multiple voice interaction devices (operation S201), and the multiple parameter information is when the multiple voice interaction devices collect the first voice input of the user at the same time Sent.
  • the requirement information determining module 1002 is configured to determine the requirement information of the user according to the first voice input (operation S202).
  • the first voice input includes a predetermined voice input and a voice input that characterizes the needs of the user.
  • the first device determining module 1003 is configured to determine the first voice interaction device among the multiple voice interaction devices as the device that interacts with the user according to multiple parameter information and demand information (operation S203).
  • the instruction sending module 1004 is configured to send a wake-up instruction to the first voice interaction device, and send a non-wake-up instruction to voice interaction devices other than the first voice interaction device among the multiple voice interaction devices (operation S204).
  • the above-mentioned parameter information includes performance parameters of the voice interaction device
  • the first device determining module 1003 is specifically configured to: determine according to the matching relationship between the performance parameters of each voice device in the multiple voice interaction devices and the demand information The first voice interaction device.
  • the aforementioned parameter information further includes location information of the user.
  • the first device determination module 1003 may include a first determination sub-module 10031 and a second determination sub-module 10032.
  • the first determining submodule 10031 is configured to determine at least one second voice interaction device whose performance parameters match the user's demand information among the multiple voice interaction devices (operation S213).
  • the second determining submodule 10032 is configured to determine one of the at least one second voice interaction device as the first voice interaction device according to the user's location information in the parameter information sent by the at least one second voice interaction device (operation S223).
  • the location information of the user represents the location of the user relative to the voice interaction device.
  • the aforementioned parameter information includes operation information of the voice interaction device
  • the non-wake-up instruction includes a sleep instruction and a sleep instruction.
  • the above-mentioned instruction sending module 1004 may include an operation determining sub-module 10041 and an instruction sending sub-module 10042.
  • the operation determining sub-module 10041 is configured to determine whether the other voice interaction device performs the first operation when collecting the first voice input according to the operation information of the other voice interaction device (operation S214).
  • the instruction sending submodule 10042 is configured to send a sleep instruction to other voice interaction devices that perform the first operation, and send a sleep instruction to other voice interaction devices that do not perform the first operation (operation S224).
  • the voice interaction device is in a sleep state in response to the sleep instruction, and the sleep state includes a state in which the first operation is performed and does not respond to the collected voice input of the user; the voice interaction device responds to the sleep instruction In a sleep state, the sleep state includes a state where no operation is performed.
  • the above-mentioned control device 1000 may further include a first voice information receiving module 1005, a first voice information determining module 1006, and a first voice information sending module 1007, for example.
  • the first voice information receiving module 1005 is configured to receive first voice information sent by the first voice interaction device when the user's second voice input is collected (operation S305), where the first voice information corresponds to the second voice input .
  • the first voice information determining module 1006 is used to determine whether the first voice information is general voice information (operation S306).
  • the first voice information sending module 1007 is configured to send the first voice information to multiple voice interaction devices (operation S307) when the first voice information is general voice information (operation S307), so that the multiple voice interaction devices execute the second voice Enter the corresponding operation.
  • the aforementioned control device 1000 may, for example, further include a restoration request receiving module 1008, which is configured to receive a restoration request sent by the first voice interaction device (operation S408),
  • the restoration request is sent by the first voice interaction device when the user's third voice input is collected or the user's voice input is not collected within a preset time period; accordingly, the above-mentioned instruction sending module 1004 is also used to receive the restoration request
  • the module 1008 receives the recovery request, it sends a recovery instruction to the multiple voice interaction devices to restore the multiple voice interaction devices to the state before collecting the first voice input (operation S409).
  • the aforementioned control device 1000 may further include an operation monitoring module 1009, a synchronization request sending module 1010, and a first progress information receiving module 1011, for example.
  • the operation monitoring module 1009 is configured to monitor the operation performed by the first voice interaction device after the command sending module sends the wake-up instruction, and determine whether the first voice interaction device performs the third operation (operation S510).
  • the synchronization request sending module 1010 is configured to send a synchronization request to the first voice interaction device when it is determined that the first voice interaction device performs the third operation (operation S511).
  • the first progress information receiving module 1011 is configured to receive the execution progress information of the third operation sent by the first voice interaction device in response to the synchronization request (operation S512).
  • the above-mentioned first device determining module 1003 is further configured to re-determine when the parameter information receiving module 1001 again receives at least one parameter information respectively sent by at least one voice interaction device among the multiple voice interaction devices
  • the first voice interaction device (operation S513).
  • the above-mentioned control device 1000 may further include, for example, an acquisition request receiving module 1012 and a first progress information sending module 1013.
  • the acquisition request receiving module 1012 is configured to receive the re-determined first voice interaction device that has collected the user’s information.
  • the first progress information sending module 1013 is configured to send execution progress information to the re-determined first voice interaction device in response to the re-determined acquisition request of the first voice interaction device (operation S515).
  • any number of modules, submodules, units, and subunits, or at least part of the functions of any number of them, may be implemented in one module. Any one or more of the modules, sub-modules, units, and sub-units according to the embodiments of the present disclosure may be split into multiple modules for implementation.
  • any one or more of the modules, sub-modules, units, and sub-units according to the embodiments of the present disclosure may be at least partially implemented as a hardware circuit, such as a field programmable gate array (FPGA), a programmable logic array (PLA), System-on-chip, system-on-substrate, system-on-package, application-specific integrated circuit (ASIC), or hardware or firmware in any other reasonable way that integrates or encapsulates the circuit, or can be implemented by software, hardware, and firmware. Any one of these implementations or an appropriate combination of any of them can be implemented.
  • FPGA field programmable gate array
  • PLA programmable logic array
  • ASIC application-specific integrated circuit
  • any one of these implementations or an appropriate combination of any of them can be implemented.
  • one or more of the modules, sub-modules, units, and sub-units according to the embodiments of the present disclosure may be at least partially implemented as a computer program module, and the computer program module may perform corresponding functions when it is executed.
  • FPGA field programmable gate array
  • PLA programmable logic array
  • At least one of the module 10032, the operation determining sub-module 10041, and the instruction sending sub-module 10042 may be at least partially implemented as a computer program module, and when the computer program module is run, it may perform a corresponding function.
  • Fig. 11 schematically shows a structural block diagram of a voice interaction device according to an embodiment of the present disclosure.
  • the voice interaction device 1100 includes a parameter information sending module 1101, an instruction receiving module 1102, and a state switching module 1103.
  • the parameter information sending module 1101 is configured to send parameter information to the control device when the user's first voice input is collected (operation S601) to determine whether the voice interaction device is the first voice interaction device.
  • the first voice input includes a predetermined voice input and a voice input that characterizes user needs.
  • the instruction receiving module 1102 is configured to receive a wake-up instruction sent by the control device when the voice interaction device is the first voice interaction device.
  • the state switching module 1103 is configured to switch the current state to the wake-up state in response to the wake-up instruction when the instruction receiving module 1102 receives the wake-up instruction (operation S602).
  • the instruction receiving module 1102 is configured to receive a non-wake-up instruction sent by the control device when the voice interaction device is not the first voice interaction device.
  • the state switching module 1103 is configured to switch the current state to the non-awakening state in response to the non-awakening instruction when the instruction receiving module 1102 receives the non-awakening instruction (operation S603).
  • the above parameter information includes performance parameters of the voice interaction device.
  • the above-mentioned voice interaction device 1100 may, for example, further include a performance parameter obtaining module 1104 for obtaining performance parameters before the parameter information sending module 1101 sends parameter information to the control device (operation S604).
  • the aforementioned parameter information further includes location information of the user.
  • the above-mentioned voice interaction device 1100 may further include, for example, a location information determining module 1105, configured to determine the location information of the user according to the collected first voice input of the user (operation S605).
  • the location information of the user represents the location of the user relative to the voice interaction device.
  • the aforementioned parameter information includes operation information of the voice interaction device, and the non-wake-up instruction includes a sleep instruction and a sleep instruction.
  • the operation information indicates that the voice interaction device performs the first operation
  • the non-wake-up instruction received by the instruction receiving module 1102 is a sleep instruction
  • the state switching module 1103 switches the current state to the sleep state in response to the sleep instruction. It includes a state where the first operation is performed and the collected voice input of the user is not responded.
  • the non-wake-up instruction received by the instruction receiving module 1102 is a sleep instruction
  • the state switching module 1103 switches the current state to the sleep state in response to the sleep instruction.
  • the state includes the state where no operation is performed.
  • the aforementioned voice interaction device 1100 may further include, for example, a second voice information determining module 1106 and a second voice information sending module 1107.
  • the second voice information determining module 1106 is configured to determine whether the first voice information corresponding to the second voice input is general voice information when the user is in an awake state and the second voice input of the user is collected (operation S706).
  • the second voice information sending module 1107 is configured to send the first voice information to the control device when it is determined that the first voice information is general voice information (operation S707).
  • the voice interaction apparatus 1100 may further include a second voice information receiving module 1108 and an operation execution module 1109, for example.
  • the second voice information receiving module 1108 is configured to receive second voice information belonging to general voice information sent by the control device (operation S708).
  • the operation execution module 1109 is configured to perform a second operation according to the second voice information, and the second operation corresponds to a voice input corresponding to the second voice information (operation S709).
  • the above-mentioned voice interaction device 1100 may, for example, further include a recovery request sending module 1110, which is configured to be in an awake state and collect the user's third voice input or within a preset time period. If the user's voice input is not collected, a recovery request is sent to the control device (operation S810). And/or, the instruction receiving module 1102 is further configured to receive a recovery instruction sent by the control device; the state switching module 1103 is also configured to switch the current state to the state before the first voice input is collected in response to the recovery instruction (operation S811).
  • a recovery request sending module 1110 which is configured to be in an awake state and collect the user's third voice input or within a preset time period. If the user's voice input is not collected, a recovery request is sent to the control device (operation S810).
  • the instruction receiving module 1102 is further configured to receive a recovery instruction sent by the control device; the state switching module 1103 is also configured to switch the current state to the state before the first voice input is collected in
  • the above-mentioned voice interaction device 1100 may, for example, further include a second progress information sending module 1111, which is configured to respond to the control device when it is in an awake state and the third operation is performed.
  • the sent synchronization request sends the execution progress information of the third operation to the control device (operation S912).
  • the above-mentioned voice interaction apparatus 1100 may further include an acquisition request sending module 1112, a second progress information receiving module 1113, and an operation execution module, for example.
  • the acquisition request sending module 1112 is configured to send an acquisition request to the control device when the user is in the awake state and the fourth voice input of the user is collected (operation S913).
  • the second progress information receiving module 1113 is configured to receive the execution progress information sent by the control device in response to the acquisition request (operation S914).
  • the operation execution module is configured to execute the third operation according to the execution progress information (operation S915).
  • the third operation corresponds to the fourth voice input, and the operation execution module may specifically be the operation execution module 1109 described above.
  • any number of modules, submodules, units, and subunits, or at least part of the functions of any number of them, may be implemented in one module. Any one or more of the modules, sub-modules, units, and sub-units according to the embodiments of the present disclosure may be split into multiple modules for implementation.
  • any one or more of the modules, sub-modules, units, and sub-units according to the embodiments of the present disclosure may be at least partially implemented as a hardware circuit, such as a field programmable gate array (FPGA), a programmable logic array (PLA), System-on-chip, system-on-substrate, system-on-package, application-specific integrated circuit (ASIC), or hardware or firmware in any other reasonable way that integrates or encapsulates the circuit, or can be implemented by software, hardware, and firmware. Any one of these implementations or an appropriate combination of any of them can be implemented.
  • FPGA field programmable gate array
  • PLA programmable logic array
  • ASIC application-specific integrated circuit
  • any one of these implementations or an appropriate combination of any of them can be implemented.
  • one or more of the modules, sub-modules, units, and sub-units according to the embodiments of the present disclosure may be at least partially implemented as a computer program module, and the computer program module may perform corresponding functions when it is executed.
  • the location information determining module 1105 the second voice information determining module 1106, the second voice information sending module 1107, the second voice information
  • Any number of the receiving module 1108, the operation execution module 1109, the recovery request sending module 1110, the second progress information sending module 1111, the acquisition request sending module 1112, and the second progress information receiving module 1113 can be combined into one module for implementation, or Any one of these modules can be split into multiple modules. Or, at least part of the functions of one or more of these modules may be combined with at least part of the functions of other modules and implemented in one module.
  • At least one of the second voice information receiving module 1108, the operation execution module 1109, the restoration request sending module 1110, the second progress information sending module 1111, the acquisition request sending module 1112, and the second progress information receiving module 1113 may be at least partially Implemented as a hardware circuit, such as field programmable gate array (FPGA), programmable logic array (PLA), system on chip, system on substrate, system on package, application specific integrated circuit (ASIC), or can be integrated by circuit Or encapsulated in any other reasonable way such as hardware or firmware, or implemented in any one of the three implementation ways of software, hardware, and firmware, or an appropriate combination of any of them.
  • FPGA field programmable gate array
  • PLA programmable logic array
  • ASIC application specific integrated circuit
  • the receiving module 1108, the operation execution module 1109, the recovery request sending module 1110, the second progress information sending module 1111, the acquisition request sending module 1112, and the second progress information receiving module 1113 may be at least partially implemented as a computer program module When the computer program module is running, it can execute the corresponding function.
  • an electronic device which can be used to perform the operation method of the control device described in FIGS. 2A to 5C, and can also be used to perform the voice interaction described with reference to FIGS. 6A to 9B How to operate the device.
  • the electronic device includes both the control device described with reference to FIG. 10 and the voice interaction device described in FIG. 11.
  • the electronic device may be integrated and controlled in any one of the multiple voice interaction devices described with reference to FIG.
  • the electronic device formed by the device, or the control device and the voice interaction device may be two functional modules in the electronic device, and the two functional modules can interact, which will not be repeated here.
  • Fig. 12 schematically shows a block diagram of an electronic device suitable for performing an operating method of a control device or an operating method of a voice interaction device according to an embodiment of the present disclosure.
  • the electronic device shown in FIG. 12 is only an example, and should not bring any limitation to the function and scope of use of the embodiments of the present disclosure.
  • an electronic device 1200 includes a processor 1201, which can be loaded into a random access memory (RAM) 1203 according to a program stored in a read only memory (ROM) 1202 or from a storage part 1208 The program executes various appropriate actions and processing.
  • the processor 1201 may include, for example, a general-purpose microprocessor (for example, a CPU), an instruction set processor and/or a related chipset and/or a special purpose microprocessor (for example, an application specific integrated circuit (ASIC)), and so on.
  • the processor 1201 may also include on-board memory for caching purposes.
  • the processor 1201 may include a single processing unit or multiple processing units for performing different actions of a method flow according to an embodiment of the present disclosure.
  • the processor 1201, the ROM 1202, and the RAM 1203 are connected to each other through a bus 1204.
  • the processor 1201 executes various operations of the method flow according to the embodiments of the present disclosure by executing programs in the ROM 1202 and/or RAM 1203. It should be noted that the program may also be stored in one or more memories other than the ROM 1202 and RAM 1203.
  • the processor 1201 may also execute various operations of the method flow according to the embodiments of the present disclosure by executing programs stored in the one or more memories.
  • the electronic device 1200 may further include an input/output (I/O) interface 1205, and the input/output (I/O) interface 1205 is also connected to the bus 1204.
  • the electronic device 1200 may also include one or more of the following components connected to the I/O interface 1205: an input part 1206 including a keyboard, a mouse, etc.; including a cathode ray tube (CRT), a liquid crystal display (LCD), etc., and An output section 1207 of a speaker and the like; a storage section 1208 including a hard disk and the like; and a communication section 1209 including a network interface card such as a LAN card, a modem, and the like.
  • the communication section 1209 performs communication processing via a network such as the Internet.
  • the driver 1210 is also connected to the I/O interface 1205 as needed.
  • a removable medium 1211 such as a magnetic disk, an optical disk, a magneto-optical disk, a semiconductor memory, etc., is installed on the drive 1210 as needed, so that the computer program read therefrom is installed into the storage portion 1208 as needed.
  • the method flow according to the embodiment of the present disclosure may be implemented as a computer software program.
  • an embodiment of the present disclosure includes a computer program product, which includes a computer program carried on a computer-readable storage medium, and the computer program contains program code for executing the method shown in the flowchart.
  • the computer program may be downloaded and installed from the network through the communication part 1209, and/or installed from the removable medium 1211.
  • the above-mentioned functions defined in the system of the embodiment of the present disclosure are executed.
  • the above-described systems, devices, devices, modules, units, etc. may be implemented by computer program modules.
  • the present disclosure also provides a computer-readable storage medium.
  • the computer-readable storage medium may be included in the device/device/system described in the above embodiment; or it may exist alone without being assembled into the device/ In the device/system.
  • the aforementioned computer-readable storage medium carries one or more programs, and when the aforementioned one or more programs are executed, the method according to the embodiments of the present disclosure is implemented.
  • the computer-readable storage medium may be a non-volatile computer-readable storage medium, for example, may include but not limited to: portable computer disk, hard disk, random access memory (RAM), read-only memory (ROM) , Erasable programmable read-only memory (EPROM or flash memory), portable compact disk read-only memory (CD-ROM), optical storage device, magnetic storage device, or any suitable combination of the above.
  • a computer-readable storage medium may be any tangible medium that contains or stores a program, and the program may be used by or in combination with an instruction execution system, apparatus, or device.
  • the computer-readable storage medium may include one or more memories other than the ROM 1202 and/or RAM 1203 and/or ROM 1202 and RAM 1203 described above.
  • each block in the flowchart or block diagram may represent a module, program segment, or part of code, and the above-mentioned module, program segment, or part of code contains one or more for realizing the specified logical function Executable instructions.
  • the functions marked in the block may also occur in a different order from the order marked in the drawings. For example, two blocks shown in succession can actually be executed substantially in parallel, or they can sometimes be executed in the reverse order, depending on the functions involved.
  • each block in the block diagram or flowchart, and the combination of blocks in the block diagram or flowchart can be implemented by a dedicated hardware-based system that performs the specified functions or operations, or can be It is realized by a combination of dedicated hardware and computer instructions.

Landscapes

  • Engineering & Computer Science (AREA)
  • Health & Medical Sciences (AREA)
  • Audiology, Speech & Language Pathology (AREA)
  • Human Computer Interaction (AREA)
  • Physics & Mathematics (AREA)
  • Acoustics & Sound (AREA)
  • Multimedia (AREA)
  • Computational Linguistics (AREA)
  • Computer Networks & Wireless Communication (AREA)
  • Signal Processing (AREA)
  • Selective Calling Equipment (AREA)

Abstract

The present disclosure provides a control device and an operation method therefor, and a speech interaction device and an operation method therefor. The operation method for the control device comprises: receiving multiple pieces of parameter information sent by multiple speech interaction devices respectively, the multiple pieces of parameter information being sent by the multiple speech interaction devices when a first speech input from a user at the same moment is acquired; determining demand information of the user according to the first speech input; according to the multiple pieces of parameter information and the demand information, determining that a first speech interaction device among the multiple speech interaction devices is the device interacting with the user; and sending a wake-up instruction to the first speech interaction device, and sending a non-wake-up instruction to the speech interaction devices among the multiple speech interaction devices except the first speech interaction device. The first speech input comprises a predetermined speech input and a speech input which characterizes a demand of the user.

Description

控制装置及其操作方法,和语音交互装置及其操作方法Control device and operation method thereof, and voice interaction device and operation method thereof
本申请要求于2019年05月09日提交的中国专利申请CN201910388450.5的优先权,其内容一并在此作为参考。This application claims the priority of the Chinese patent application CN201910388450.5 filed on May 9, 2019, the content of which is incorporated herein by reference.
技术领域Technical field
本公开涉及互联网技术领域,更具体地,涉及一种控制装置及其操作方法,和语音交互装置及其操作方法。The present disclosure relates to the field of Internet technology, and more specifically, to a control device and an operation method thereof, and a voice interaction device and an operation method thereof.
背景技术Background technique
随着互联网技术和计算机技术的快速发展,语音交互技术应运而生,以解放双手,提高用户与电子设备的交互效率。With the rapid development of Internet technology and computer technology, voice interaction technology has emerged to free hands and improve the efficiency of interaction between users and electronic devices.
在实现本公开构思的过程中,发明人发现现有技术中至少存在如下问题:随着智能及物联化技术的普及,现代化家庭中往往设置有能够与用户进行交互的多个语音交互设备,且为了实现所谓的多个语音交互设备的管理,该多个语音交互设备往往源自同一供应商,因此该多个语音交互设备具有相同的唤醒语,则当该多个语音交互设备距离较近时,该多个语音交互设备往往响应于用户同一时刻发出的唤醒词而被唤醒,且在用户与语音交互设备进行语音交互时,该多个语音交互设备往往会同时做出应答,这无疑使得交互场景嘈杂混乱,从而降低用户体验。In the process of realizing the concept of the present disclosure, the inventor found that there are at least the following problems in the prior art: With the popularization of intelligence and IoT technologies, modern homes are often equipped with multiple voice interaction devices that can interact with users. And in order to realize the so-called management of multiple voice interaction devices, the multiple voice interaction devices often originate from the same supplier, so the multiple voice interaction devices have the same wake-up language, then when the multiple voice interaction devices are closer At this time, the multiple voice interaction devices are often awakened in response to the wake-up words sent by the user at the same time, and when the user performs voice interaction with the voice interaction device, the multiple voice interaction devices often respond at the same time, which undoubtedly makes The interactive scene is noisy and chaotic, which reduces the user experience.
发明内容Summary of the invention
有鉴于此,本公开提供了一种能够提高交互体验的控制装置及其操作方法,和语音交互装置及其操作方法。In view of this, the present disclosure provides a control device and an operation method thereof, and a voice interaction device and an operation method thereof that can improve the interactive experience.
本公开的第一方面提供了一种控制装置的操作方法,该方法包括:接收多个语音交互装置分别发送的多个参数信息,所述多个参数信息是所述多个语音交互装置在采集到用户同一时刻的第一语音输入的情况下发送的;根据所述第一语音输入,确定所述用户的需求信息;根据所述多个参数信息及所述需求信息,确定所述多个语音交互装置中的第一语音交互装置为与所述用户交互的装置;以及向所述第一语音交互装置发送唤醒指令,并向所述多个语音交互装置中除所述第一语音交互装置外的其他语音交互装置发送非唤醒指令,其中,所述第一语音输入包括预 定语音输入和表征所述用户需求的语音输入。A first aspect of the present disclosure provides an operating method of a control device, the method includes: receiving multiple parameter information sent by multiple voice interaction devices, and the multiple parameter information is collected by the multiple voice interaction devices. It is sent when the user's first voice input at the same time; according to the first voice input, the user's demand information is determined; according to the multiple parameter information and the demand information, the multiple voices are determined The first voice interaction device in the interaction device is a device that interacts with the user; and sends a wake-up instruction to the first voice interaction device, and sends a wake-up instruction to the plurality of voice interaction devices except for the first voice interaction device The other voice interaction device in the sends a non-wake-up instruction, wherein the first voice input includes a predetermined voice input and a voice input that characterizes the needs of the user.
根据本公开的实施例,上述参数信息包括语音交互装置的性能参数,所述根据所述多个参数信息及所述需求信息,确定所述第一语音交互装置包括:根据所述多个语音交互装置中每个语音装置的性能参数与所述需求信息的匹配关系,确定所述第一语音交互装置。According to an embodiment of the present disclosure, the above-mentioned parameter information includes performance parameters of the voice interaction device, and the determining the first voice interaction device according to the multiple parameter information and the demand information includes: The matching relationship between the performance parameters of each voice device in the device and the demand information determines the first voice interaction device.
根据本公开的实施例,上述参数信息还包括所述用户的位置信息,所述根据所述多个语音交互装置中每个语音装置的性能参数与所述需求信息的匹配关系,确定所述第一语音交互装置包括:确定所述多个语音交互装置中性能参数与所述用户的需求信息匹配的至少一个第二语音交互装置;以及根据所述至少一个第二语音交互装置发送的参数信息中的用户的位置信息,确定所述至少一个第二语音交互装置中的一个为所述第一语音交互装置,其中,所述用户的位置信息表征所述用户相对于语音交互装置的位置。According to an embodiment of the present disclosure, the above-mentioned parameter information further includes location information of the user, and the first step is determined based on the matching relationship between the performance parameter of each voice device in the plurality of voice interaction devices and the demand information. A voice interaction device includes: at least one second voice interaction device that determines that the performance parameters of the multiple voice interaction devices match the user's demand information; and according to the parameter information sent by the at least one second voice interaction device It is determined that one of the at least one second voice interaction device is the first voice interaction device, wherein the location information of the user characterizes the location of the user relative to the voice interaction device.
根据本公开的实施例,上述参数信息包括语音交互装置的操作信息,所述非唤醒指令包括睡眠指令和休眠指令,所述向所述其他语音交互装置发送非唤醒指令包括:根据所述其他语音交互装置的操作信息,确定所述其他语音交互装置在采集所述第一语音输入时是否执行第一操作;以及向执行所述第一操作的其他语音交互装置发送所述睡眠指令,向未执行所述第一操作的其他语音交互装置发送所述休眠指令,其中,语音交互装置响应于所述睡眠指令处于睡眠状态,所述睡眠状态包括执行所述第一操作且对采集的所述用户的语音输入不作响应的状态;语音交互装置响应于所述休眠指令处于休眠状态,所述休眠状态包括不执行任何操作的状态。According to an embodiment of the present disclosure, the aforementioned parameter information includes operation information of a voice interaction device, the non-wake-up instruction includes a sleep instruction and a hibernation instruction, and the sending a non-wake-up instruction to the other voice interaction device includes: according to the other voice The operation information of the interaction device, determining whether the other voice interaction device performs the first operation when collecting the first voice input; and sending the sleep instruction to the other voice interaction device performing the first operation, The other voice interaction device of the first operation sends the sleep instruction, wherein the voice interaction device is in a sleep state in response to the sleep instruction, and the sleep state includes performing the first operation and responding to the collected user A state in which voice input does not respond; the voice interaction device is in a dormant state in response to the dormant instruction, and the dormant state includes a state in which no operation is performed.
根据本公开的实施例,上述方法还包括:接收所述第一语音交互装置在采集到所述用户的第二语音输入的情况下发送的第一语音信息,所述第一语音信息与所述第二语音输入相对应;确定所述第一语音信息是否为通用语音信息;以及在所述第一语音信息为所述通用语音信息的情况下,向所述多个语音交互装置发送所述第一语音信息。According to an embodiment of the present disclosure, the above method further includes: receiving first voice information sent by the first voice interaction device when the user's second voice input is collected, and the first voice information is related to the Corresponding to the second voice input; determine whether the first voice information is general voice information; and if the first voice information is the general voice information, send the first voice information to the multiple voice interaction devices One voice message.
根据本公开的实施例,上述方法还包括:接收所述第一语音交互装置发送的恢复请求,所述恢复请求由所述第一语音交互装置在采集到所 述用户的第三语音输入或预设时段内未采集到所述用户的语音输入的情况下发送;以及向所述多个语音交互装置发送恢复指令,以使所述多个语音交互装置恢复至采集所述第一语音输入之前的状态。According to an embodiment of the present disclosure, the above method further includes: receiving a recovery request sent by the first voice interaction device, the recovery request being collected by the first voice interaction device after the user’s third voice input or preview Suppose it is sent when the user’s voice input is not collected within a time period; and a restoration instruction is sent to the multiple voice interaction devices, so that the multiple voice interaction devices are restored to the time before collecting the first voice input status.
根据本公开的实施例,在向所述第一语音交互装置发送唤醒指令之后,上述方法还包括:监听所述第一语音交互装置的操作,确定所述第一语音交互装置是否执行第三操作;在所述第一语音交互装置执行所述第三操作的情况下,向所述第一语音交互装置发送同步请求;以及接收所述第一语音交互装置响应于所述同步请求发送的所述第三操作的执行进度信息。According to an embodiment of the present disclosure, after sending a wake-up instruction to the first voice interaction device, the above method further includes: monitoring the operation of the first voice interaction device, and determining whether the first voice interaction device performs a third operation In the case where the first voice interaction device performs the third operation, sending a synchronization request to the first voice interaction device; and receiving the first voice interaction device sent in response to the synchronization request The execution progress information of the third operation.
根据本公开的实施例,上述方法还包括:在再次接收到所述多个语音交互装置中至少一个语音交互装置分别发送的至少一个参数信息的情况下,重新确定第一语音交互装置;接收所述重新确定后的第一语音交互装置在采集到所述用户的第四语音输入的情况下发送的获取请求;以及响应于所述重新确定后的第一语音交互装置的获取请求,向所述重新确定后的第一语音交互装置发送所述执行进度信息。According to an embodiment of the present disclosure, the above method further includes: in a case where at least one parameter information respectively sent by at least one voice interaction device among the multiple voice interaction devices is received again, re-determining the first voice interaction device; The acquisition request sent by the re-determined first voice interaction device under the condition that the user’s fourth voice input is collected; and in response to the re-determined acquisition request of the first voice interaction device, The re-determined first voice interaction device sends the execution progress information.
本公开的第二方面提供了一种语音交互装置的操作方法。该方法包括:在采集到用户的第一语音输入的情况下,向控制装置发送参数信息,以确定所述语音交互装置是否为第一语音交互装置;在所述语音交互装置是第一语音交互装置的情况下,接收所述控制装置发送的唤醒指令,以响应于所述唤醒指令处于唤醒状态;在所述语音交互装置不是第一语音交互装置的情况下,接收所述控制装置发送的非唤醒指令,以响应于所述非唤醒指令处于非唤醒状态,其中,所述第一语音输入包括预定语音输入和表征所述用户需求的语音输入。The second aspect of the present disclosure provides an operation method of a voice interaction device. The method includes: when the user's first voice input is collected, sending parameter information to the control device to determine whether the voice interaction device is the first voice interaction device; when the voice interaction device is the first voice interaction device In the case of the device, the wake-up instruction sent by the control device is received, in response to the wake-up instruction being in the wake-up state; in the case that the voice interaction device is not the first voice interaction device, the non-received instruction sent by the control device is received. The wake-up instruction is in a non-wake-up state in response to the non-wake-up instruction, wherein the first voice input includes a predetermined voice input and a voice input that represents the user's needs.
根据本公开的实施例,上述参数信息包括所述语音交互装置的性能参数,在向控制装置发送参数信息之前,上述方法还包括:获取所述性能参数。According to an embodiment of the present disclosure, the above-mentioned parameter information includes performance parameters of the voice interaction device, and before sending the parameter information to the control device, the above-mentioned method further includes: acquiring the performance parameters.
根据本公开的实施例,上述参数信息还包括所述用户的位置信息,在向控制装置发送参数信息之前,所述方法还包括:根据采集的所述用户的第一语音输入,确定所述用户的位置信息,其中,所述用户的位置信息表征所述用户相对于所述语音交互装置的位置。According to an embodiment of the present disclosure, the above-mentioned parameter information further includes location information of the user. Before sending the parameter information to the control device, the method further includes: determining the user according to the collected first voice input of the user The location information of the user, wherein the location information of the user represents the location of the user relative to the voice interaction device.
根据本公开的实施例,上述参数信息包括语音交互装置的操作信息,所述非唤醒指令包括睡眠指令和休眠指令,其中:在所述操作信息表征所述语音交互装置执行第一操作的情况下,接收到的非唤醒指令为所述睡眠指令,以响应于所述睡眠指令处于睡眠状态,所述睡眠状态包括执行所述第一操作且对采集的所述用户的语音输入不作响应的状态;在所述操作信息表征所述语音交互装置未执行第一操作的情况下,接收到的所述非唤醒指令为所述休眠指令,以响应于所述休眠指令处于休眠状态,所述休眠状态包括不执行任何操作的状态。According to an embodiment of the present disclosure, the above-mentioned parameter information includes operation information of the voice interaction device, and the non-wake-up instruction includes a sleep instruction and a hibernation instruction, wherein: when the operation information indicates that the voice interaction device performs the first operation , The received non-wake-up instruction is the sleep instruction, in response to the sleep instruction being in a sleep state, the sleep state includes a state in which the first operation is performed and the collected voice input of the user is not responded; In the case where the operation information indicates that the voice interaction device has not performed the first operation, the received non-wake-up instruction is the sleep instruction, in response to the sleep instruction being in the sleep state, the sleep state includes The state of not performing any operations.
根据本公开的实施例,在语音交互装置切换至唤醒状态后,上述方法还包括:实时采集用户的语音输入;在采集到用户的第二语音输入的情况下,确定第二语音输入对应的第二语音信息是否为通用语音信息;以及在确定第二语音输入对应的第二语音信息是通用语音信息的情况下,向控制装置发送第二语音信息,以使控制装置发送第二语音信息至多个语音交互装置,使多个语音交互装置执行与第二语音信息相对应的操作。According to an embodiment of the present disclosure, after the voice interaction device is switched to the awake state, the above method further includes: collecting the user's voice input in real time; in the case of collecting the user's second voice input, determining the second voice input corresponding to the second voice input. 2. Whether the voice information is general voice information; and in the case of determining that the second voice information corresponding to the second voice input is general voice information, sending the second voice information to the control device, so that the control device sends the second voice information to multiple The voice interaction device enables multiple voice interaction devices to perform operations corresponding to the second voice information.
根据本公开的实施例,上述方法还包括:在所述语音交互装置处于唤醒状态、且采集到所述用户的第二语音输入的情况下,确定所述第二语音输入对应的第一语音信息是否为通用语音信息;以及在确定所述第一语音信息为通用语音信息的情况下,向所述控制装置发送第一语音信息。并且/或者上述方法还包括:接收所述控制装置发送的属于所述通用语音信息的第二语音信息;以及根据所述第二语音信息,执行第二操作,所述第二操作与所述第二语音信息对应的语音输入相对应。According to an embodiment of the present disclosure, the above method further includes: determining the first voice information corresponding to the second voice input when the voice interaction device is in an awake state and the second voice input of the user is collected Whether it is general voice information; and if it is determined that the first voice information is general voice information, sending the first voice information to the control device. And/or the above method further includes: receiving second voice information belonging to the general voice information sent by the control device; and executing a second operation according to the second voice information, and the second operation is the same as the first 2. The voice input corresponds to the voice information.
根据本公开的实施例,上述方法还包括:在所述语音交互装置处于唤醒状态、且采集到所述用户的第三语音输入或在预设时段内未采集到用户的语音输入的情况下,向所述控制装置发送恢复请求;并且/或者,上述方法还包括:接收所述控制装置发送的恢复指令,将当前状态切换至采集所述第一语音输入之前的状态。According to an embodiment of the present disclosure, the above method further includes: when the voice interaction device is in an awake state and the user's third voice input is collected or the user's voice input is not collected within a preset time period, Sending a restoration request to the control device; and/or, the above method further includes: receiving a restoration instruction sent by the control device, and switching the current state to a state before the first voice input is collected.
根据本公开的实施例,上述方法还包括:在所述语音交互装置处于唤醒状态、且执行第三操作的情况下,响应于所述控制装置发送的同步请求,向所述控制装置发送所述第三操作的执行进度信息。According to an embodiment of the present disclosure, the above method further includes: when the voice interaction device is in an awake state and a third operation is performed, in response to a synchronization request sent by the control device, sending the control device The execution progress information of the third operation.
根据本公开的实施例,上述方法还包括:在所述语音交互装置处于 唤醒状态、且采集到所述用户的第四语音输入的情况下,向所述控制装置发送获取请求;接收所述控制装置响应于所述获取请求发送的所述执行进度信息;以及根据所述执行进度信息,执行所述第三操作,其中,所述第三操作与所述第四语音输入相对应。According to an embodiment of the present disclosure, the above method further includes: when the voice interaction device is in an awake state and a fourth voice input of the user is collected, sending an acquisition request to the control device; receiving the control The apparatus sends the execution progress information in response to the acquisition request; and executes the third operation according to the execution progress information, wherein the third operation corresponds to the fourth voice input.
本公开的第三方面提供了一种控制装置,该装置包括:参数信息接收模块,用于接收多个语音交互装置分别发送的多个参数信息,所述多个参数信息是所述多个语音交互装置在采集到用户同一时刻的第一语音输入的情况下发送的;需求信息确定模块,用于根据所述第一语音输入,确定所述用户的需求信息;第一装置确定模块,用于根据所述多个参数信息及所述需求信息,确定所述多个语音交互装置中的第一语音交互装置为与所述用户交互的装置;以及指令发送模块,用于向所述第一语音交互装置发送唤醒指令,并向所述多个语音交互装置中除所述第一语音交互装置外的其他语音交互装置发送非唤醒指令,其中,所述第一语音输入包括预定语音输入和表征所述用户需求的语音输入。A third aspect of the present disclosure provides a control device. The device includes: a parameter information receiving module, configured to receive multiple parameter information sent by multiple voice interaction devices, and the multiple parameter information is the multiple voice The interaction device sends the first voice input of the user at the same time; the demand information determination module is used to determine the demand information of the user according to the first voice input; the first device determination module is used to According to the multiple parameter information and the demand information, it is determined that the first voice interaction device among the multiple voice interaction devices is the device that interacts with the user; and the instruction sending module is configured to send the first voice The interaction device sends a wake-up instruction, and sends a non-wake-up instruction to voice interaction devices other than the first voice interaction device among the multiple voice interaction devices, wherein the first voice input includes a predetermined voice input and a characterizing Voice input that describes user needs.
根据本公开的实施例,上述参数信息包括语音交互装置的性能参数,所述第一装置确定模块具体用于:根据所述多个语音交互装置中每个语音装置的性能参数与所述需求信息的匹配关系,确定所述第一语音交互装置。According to an embodiment of the present disclosure, the above-mentioned parameter information includes performance parameters of a voice interaction device, and the first device determining module is specifically configured to: according to the performance parameters of each voice device in the plurality of voice interaction devices and the demand information To determine the first voice interaction device.
根据本公开的实施例,上述参数信息还包括所述用户的位置信息,所述第一装置确定模块包括:第一确定子模块,用于确定所述多个语音交互装置中性能参数与所述用户的需求信息匹配的至少一个第二语音交互装置;以及第二确定子模块,用于根据所述至少一个第二语音交互装置发送的参数信息中的用户的位置信息,确定所述至少一个第二语音交互装置中的一个为所述第一语音交互装置,其中,所述用户的位置信息表征所述用户相对于语音交互装置的位置。According to an embodiment of the present disclosure, the above-mentioned parameter information further includes location information of the user, and the first device determining module includes: a first determining sub-module configured to determine the performance parameters of the multiple voice interaction devices and the At least one second voice interaction device that matches the user’s demand information; and a second determining submodule, configured to determine the at least one first voice interaction device according to the user’s location information in the parameter information sent by the at least one second voice interaction device One of the two voice interaction devices is the first voice interaction device, wherein the location information of the user represents the location of the user relative to the voice interaction device.
根据本公开的实施例,上述参数信息包括语音交互装置的操作信息,所述非唤醒指令包括睡眠指令和休眠指令,所述指令发送模块包括:操作确定子模块,用于根据所述其他语音交互装置的操作信息,确定所述其他语音交互装置在采集所述第一语音输入时是否执行第一操作;以及指令发送子模块,用于向执行所述第一操作的其他语音交互装置发送所 述睡眠指令,向未执行所述第一操作的其他语音交互装置发送所述休眠指令,其中,语音交互装置响应于所述睡眠指令处于睡眠状态,所述睡眠状态包括执行所述第一操作且对采集的所述用户的语音输入不作响应的状态;语音交互装置响应于所述休眠指令处于休眠状态,所述休眠状态包括不执行任何操作的状态。According to an embodiment of the present disclosure, the above-mentioned parameter information includes operation information of the voice interaction device, the non-wake-up instruction includes a sleep instruction and a sleep instruction, and the instruction sending module includes: an operation determination sub-module for interacting according to the other voice The operation information of the device determines whether the other voice interaction device performs the first operation when collecting the first voice input; and the instruction sending sub-module is configured to send the other voice interaction device that performs the first operation A sleep instruction to send the sleep instruction to other voice interaction devices that have not performed the first operation, wherein the voice interaction device is in a sleep state in response to the sleep instruction, and the sleep state includes performing the first operation and The collected voice input of the user is not responding; the voice interaction device is in a sleep state in response to the sleep instruction, and the sleep state includes a state where no operation is performed.
根据本公开的实施例,上述控制装置还包括第一语音信息接收模块,用于接收所述第一语音交互装置在采集到所述用户的第二语音输入的情况下发送的第一语音信息,所述第一语音信息与所述第二语音输入相对应;第一语音信息确定模块,用于确定所述第一语音信息是否为通用语音信息;以及第一语音信息发送模块,用于在所述第一语音信息为所述通用语音信息的情况下,向所述多个语音交互装置发送所述第一语音信息。According to an embodiment of the present disclosure, the above-mentioned control device further includes a first voice information receiving module, configured to receive the first voice information sent by the first voice interaction device when the second voice input of the user is collected, The first voice information corresponds to the second voice input; a first voice information determining module is used to determine whether the first voice information is general voice information; and a first voice information sending module is used to If the first voice information is the general voice information, sending the first voice information to the multiple voice interaction apparatuses.
根据本公开的实施例,上述控制装置还包括:恢复请求接收模块,接收所述第一语音交互装置发送的恢复请求,所述恢复请求由所述第一语音交互装置在采集到所述用户的第三语音输入或预设时段内未采集到所述用户的语音输入的情况下发送;以及所述指令发送模块还用于:在所述恢复请求接收模块接收到所述恢复请求的情况下,向所述多个语音交互装置发送恢复指令,以使所述多个语音交互装置恢复至采集所述第一语音输入之前的状态。According to an embodiment of the present disclosure, the above-mentioned control device further includes: a recovery request receiving module that receives a recovery request sent by the first voice interaction device, and the recovery request is collected by the first voice interaction device after collecting The third voice input or sending when the user's voice input is not collected within a preset time period; and the instruction sending module is also used for: when the recovery request receiving module receives the recovery request, Sending a recovery instruction to the multiple voice interaction devices to restore the multiple voice interaction devices to a state before collecting the first voice input.
根据本公开的实施例,上述控制装置还包括:操作监听模块,用于在所述指令发送模块发送所述唤醒指令后,监听所述第一语音交互装置执行的操作,并确定所述第一语音交互装置是否执行第三操作;同步请求发送模块,用于在确定所述第一语音交互装置执行所述第三操作的情况下,向所述第一语音交互装置发送同步请求;以及第一进度信息接收模块,用于接收所述第一语音交互装置响应于所述同步请求发送的所述第三操作的执行进度信息。According to an embodiment of the present disclosure, the above-mentioned control device further includes: an operation monitoring module, configured to monitor the operation performed by the first voice interaction device after the instruction sending module sends the wake-up instruction, and determine the first Whether the voice interaction device performs the third operation; a synchronization request sending module, configured to send a synchronization request to the first voice interaction device when it is determined that the first voice interaction device performs the third operation; and The progress information receiving module is configured to receive the execution progress information of the third operation sent by the first voice interaction device in response to the synchronization request.
根据本公开的实施例,上述第一装置确定模块还用于:在所述参数信息接收模块再次接收到所述多个语音交互装置中至少一个语音交互装置分别发送的至少一个参数信息的情况下,重新确定第一语音交互装置。上述控制装置还包括:获取请求接收模块,用于接收所述重新确定后的 第一语音交互装置在采集到所述用户的第四语音输入的情况下发送的获取请求;以及第一进度信息发送模块,用于响应于所述重新确定后的第一语音交互装置的获取请求,向所述重新确定后的第一语音交互装置发送所述执行进度信息。According to an embodiment of the present disclosure, the above-mentioned first device determining module is further configured to: when the parameter information receiving module again receives at least one parameter information respectively sent by at least one voice interaction device among the multiple voice interaction devices , Re-determine the first voice interaction device. The above-mentioned control device further includes: an acquisition request receiving module configured to receive an acquisition request sent by the re-determined first voice interaction device when the user's fourth voice input is collected; and first progress information transmission A module for sending the execution progress information to the re-determined first voice interaction device in response to the acquisition request of the re-determined first voice interaction device.
本公开的第四方面提供了一种语音交互装置,包括:参数信息发送模块,用于在采集到用户的第一语音输入的情况下,向控制装置发送参数信息,以确定所述语音交互装置是否为第一语音交互装置;指令接收模块,用于:在所述语音交互装置是第一语音交互装置的情况下,接收所述控制装置发送的唤醒指令;在所述语音交互装置不是第一语音交互装置的情况下,接收所述控制装置发送的非唤醒指令;以及状态切换模块,用于:在所述指令接收模块接收到所述唤醒指令的情况下,响应于所述唤醒指令,将当前状态切换为唤醒状态;在所述指令接收模块接收到所述非唤醒指令的情况下,响应于所述非唤醒指令,将当前状态切换为非唤醒状态,其中,所述第一语音输入包括预定语音输入和表征所述用户需求的语音输入。A fourth aspect of the present disclosure provides a voice interaction device, including: a parameter information sending module, configured to send parameter information to a control device when the user's first voice input is collected to determine the voice interaction device Whether it is the first voice interaction device; the instruction receiving module is configured to: in the case that the voice interaction device is the first voice interaction device, receive the wake-up instruction sent by the control device; when the voice interaction device is not the first voice interaction device In the case of a voice interaction device, receiving a non-wake-up instruction sent by the control device; and a state switching module, configured to: in a case where the instruction receiving module receives the wake-up instruction, respond to the wake-up instruction to The current state is switched to the wake-up state; in the case that the instruction receiving module receives the non-wake-up instruction, in response to the non-wake-up instruction, the current state is switched to the non-wake-up state, wherein the first voice input includes A predetermined voice input and a voice input that characterizes the user's needs.
根据本公开的实施例,上述参数信息包括所述语音交互装置的性能参数,上述语音交互装置还包括:性能参数获取模块,用于在所述参数信息发送模块向所述控制装置发送所述参数信息之前,获取所述性能参数。According to an embodiment of the present disclosure, the above-mentioned parameter information includes performance parameters of the voice interaction device, and the above-mentioned voice interaction device further includes: a performance parameter acquisition module for sending the parameter information to the control device by the parameter information sending module Before information, obtain the performance parameters.
根据本公开的实施例,上述参数信息还包括所述用户的位置信息,上述语音交互装置还包括:位置信息确定模块,用于根据采集的所述用户的第一语音输入,确定所述用户的位置信息,其中,所述用户的位置信息表征所述用户相对于所述语音交互装置的位置。According to an embodiment of the present disclosure, the above-mentioned parameter information further includes the position information of the user, and the above-mentioned voice interaction device further includes: a position information determining module configured to determine the user’s position information according to the collected first voice input of the user Location information, where the location information of the user represents the location of the user relative to the voice interaction device.
根据本公开的实施例,上述参数信息包括语音交互装置的操作信息,所述非唤醒指令包括睡眠指令和休眠指令,其中:在所述操作信息表征所述语音交互装置执行第一操作的情况下,所述指令接收模块接收到的非唤醒指令为所述睡眠指令,所述状态切换模块响应于所述睡眠指令将当前状态切换至睡眠状态,所述睡眠状态包括执行所述第一操作且对采集的所述用户的语音输入不作响应的状态;在所述操作信息表征所述语音交互装置未执行第一操作的情况下,所述指令接收模块接收到的非唤 醒指令为所述休眠指令,所述状态切换模块响应于所述休眠指令将当前状态切换至休眠状态,所述休眠状态包括不执行任何操作的状态。According to an embodiment of the present disclosure, the above-mentioned parameter information includes operation information of the voice interaction device, and the non-wake-up instruction includes a sleep instruction and a hibernation instruction, wherein: when the operation information indicates that the voice interaction device performs the first operation The non-wake-up instruction received by the instruction receiving module is the sleep instruction, and the state switching module switches the current state to the sleep state in response to the sleep instruction, and the sleep state includes performing the first operation and The collected voice input of the user is not responding; when the operation information indicates that the voice interaction device has not performed the first operation, the non-awakening instruction received by the instruction receiving module is the sleep instruction, The state switching module switches the current state to the sleep state in response to the sleep instruction, and the sleep state includes a state where no operation is performed.
根据本公开的实施例,上述语音交互装置还包括:第二语音信息确定模块,用于在处于所述唤醒状态,且采集到所述用户的第二语音输入的情况下,确定所述第二语音输入对应的第一语音信息是否为通用语音信息;第二语音信息发送模块,用于在确定所述第一语音信息为通用语音信息的情况下,向所述控制装置发送第一语音信息;并且或者,上述语音交互装置还包括:第二语音信息接收模块,用于接收所述控制装置发送的属于所述通用语音信息的第二语音信息;以及操作执行模块,用于根据所述第二语音信息,执行第二操作,所述第二操作与所述第二语音信息对应的语音输入相对应。According to an embodiment of the present disclosure, the above-mentioned voice interaction device further includes: a second voice information determining module, configured to determine the second voice information in the wake-up state and the second voice input of the user is collected Whether the first voice information corresponding to the voice input is general voice information; the second voice information sending module is configured to send the first voice information to the control device when it is determined that the first voice information is general voice information; And or alternatively, the above-mentioned voice interaction device further includes: a second voice information receiving module, configured to receive second voice information belonging to the general voice information sent by the control device; and an operation execution module, configured to perform according to the second voice information Voice information, perform a second operation, and the second operation corresponds to the voice input corresponding to the second voice information.
根据本公开的实施例,上述语音交互装置还包括恢复请求发送模块,用于在处于唤醒状态、且采集到所述用户的第三语音输入或在预设时段内未采集到用户的语音输入的情况下,向所述控制装置发送恢复请求;并且/或者所述指令接收模块还用于接收所述控制装置发送的恢复指令;所述状态切换模块还用于响应于所述恢复指令,将当前状态切换至采集所述第一语音输入之前的状态。According to an embodiment of the present disclosure, the above-mentioned voice interaction device further includes a recovery request sending module, which is used for those who are in an awake state and collect the third voice input of the user or have not collected the voice input of the user within a preset time period. In this case, a restoration request is sent to the control device; and/or the instruction receiving module is also used to receive a restoration instruction sent by the control device; the state switching module is also used to respond to the restoration instruction to change the current The state is switched to the state before the first voice input is collected.
根据本公开的实施例,上述语音交互装置还包括:第二进度信息发送模块,用于在处于唤醒状态,且执行第三操作的情况下,响应于所述控制装置发送的同步请求,向所述控制装置发送所述第三操作的执行进度信息。According to an embodiment of the present disclosure, the above-mentioned voice interaction device further includes: a second progress information sending module, configured to respond to the synchronization request sent by the control device in the wake-up state and perform the third operation, The control device sends execution progress information of the third operation.
根据本公开的实施例,上述语音交互装置还包括:获取请求发送模块,用于在处于所述唤醒状态、且采集到所述用户的第四语音输入的情况下,向所述控制装置发送获取请求;第二进度信息接收模块,用于接收所述控制装置响应于所述获取请求发送的所述执行进度信息;以及操作执行模块,用于根据所述执行进度信息,执行所述第三操作,其中,所述第三操作与所述第四语音输入相对应。According to an embodiment of the present disclosure, the above-mentioned voice interaction device further includes: an acquisition request sending module, configured to send the acquisition request to the control device when the user is in the awake state and the fourth voice input of the user is collected. Request; a second progress information receiving module for receiving the execution progress information sent by the control device in response to the acquisition request; and an operation execution module for performing the third operation according to the execution progress information , Wherein the third operation corresponds to the fourth voice input.
本公开的第五方面提供了一种电子设备,包括上述的控制装置和上述的语音交互装置。A fifth aspect of the present disclosure provides an electronic device, including the aforementioned control device and the aforementioned voice interaction device.
本公开的第六方面提供了一种电子设备,包括:一个或多个处理器; 存储装置,用于存储一个或多个程序,其中,当所述一个或多个程序被所述一个或多个处理器执行时,使得所述一个或多个处理器执行上述的控制装置的操作方法,和/或上述的语音交互装置的操作方法。A sixth aspect of the present disclosure provides an electronic device, including: one or more processors; a storage device, for storing one or more programs, wherein when the one or more programs are used by the one or more When the two processors are executed, the one or more processors are caused to execute the above-mentioned operation method of the control device and/or the above-mentioned operation method of the voice interaction device.
本公开的另一个方面提供了一种计算机可读存储介质,存储有计算机可执行指令,所述指令在被执行时用于实现如上所述的智能家居系统的控制方法。Another aspect of the present disclosure provides a computer-readable storage medium storing computer-executable instructions, which are used to implement the above-mentioned control method of the smart home system when executed.
本公开的另一个方面提供了一种计算机程序,该计算机程序包括计算机可执行指令,该指令在被执行时用于实现如上所述的控制装置的操作方法,和/或上述的语音交互装置的操作方法。Another aspect of the present disclosure provides a computer program that includes computer-executable instructions that, when executed, are used to implement the operation method of the above-mentioned control device and/or the above-mentioned voice interaction device Method of operation.
根据本公开的实施例,可以至少部分地解决现有技术中多个语音交互设备会同时响应于用户同一时刻的唤醒词而被唤醒,从而导致用户与智能设备的交互场景嘈杂混乱的技术问题,并因此通过根据多个语音交互装置发送的多个参数信息,确定一个语音交互装置作为唯一被唤醒的装置的技术方案,避免嘈杂的交互环境,从而提高用户体验。According to the embodiments of the present disclosure, it is possible to at least partially solve the technical problem in the prior art that multiple voice interaction devices will be awakened in response to the user's wake-up words at the same time at the same time, thereby causing noisy and chaotic interaction scenes between the user and the smart device. And therefore, by determining a technical solution of a voice interaction device as the only device to be awakened based on multiple parameter information sent by multiple voice interaction devices, a noisy interaction environment is avoided, thereby improving user experience.
附图说明Description of the drawings
通过以下参照附图对本公开实施例的描述,本公开的上述以及其他目的、特征和优点将更为清楚,在附图中:Through the following description of the embodiments of the present disclosure with reference to the accompanying drawings, the above and other objectives, features, and advantages of the present disclosure will be more apparent. In the accompanying drawings:
图1示意性示出了根据本公开实施例的控制装置及其操作方法,和语音交互装置及其操作方法的应用场景;FIG. 1 schematically shows an application scenario of a control device and an operation method thereof, and a voice interaction device and an operation method thereof according to an embodiment of the present disclosure;
图2A示意性示出了根据本公开实施例的控制装置的操作方法的流程图;Fig. 2A schematically shows a flowchart of an operation method of a control device according to an embodiment of the present disclosure;
图2B示意性示出了根据本公开实施例的确定第一语音交互装置的操作流程图;Fig. 2B schematically shows an operation flowchart of determining a first voice interaction device according to an embodiment of the present disclosure;
图2C示意性示出了根据本公开实施例的向其他语音交互装置发送非唤醒指令的操作流程图;FIG. 2C schematically shows an operation flowchart of sending a non-wake-up instruction to other voice interaction devices according to an embodiment of the present disclosure;
图3示意性示出了根据本公开第二实施例的控制装置的操作方法的流程图;Fig. 3 schematically shows a flowchart of an operation method of a control device according to a second embodiment of the present disclosure;
图4示意性示出了根据本公开第三实施例的控制装置的操作方法的流程图;Fig. 4 schematically shows a flowchart of an operation method of a control device according to a third embodiment of the present disclosure;
图5A示意性示出了根据本公开第四实施例的控制装置的操作方法 的流程图;Fig. 5A schematically shows a flowchart of an operation method of a control device according to a fourth embodiment of the present disclosure;
图5B示意性示出了图5A所示的操作方法的应用场景图;FIG. 5B schematically shows an application scenario diagram of the operation method shown in FIG. 5A;
图5C示意性示出了根据本公开第五实施例的控制装置的操作方法的流程图;FIG. 5C schematically shows a flowchart of the operation method of the control device according to the fifth embodiment of the present disclosure;
图6A示意性示出了根据本公开第一实施例的语音交互装置的操作方法的流程图;Fig. 6A schematically shows a flow chart of the operation method of the voice interaction device according to the first embodiment of the present disclosure;
图6B示意性示出了根据本公开第二实施例的语音交互装置的操作方法的流程图;FIG. 6B schematically shows a flowchart of the operation method of the voice interaction device according to the second embodiment of the present disclosure;
图7A示意性示出了根据本公开第三实施例的语音交互装置的操作方法的流程图;Fig. 7A schematically shows a flowchart of an operation method of a voice interaction device according to a third embodiment of the present disclosure;
图7B示意性示出了根据本公开第四实施例的语音交互装置的操作方法的流程图;FIG. 7B schematically shows a flowchart of the operation method of the voice interaction device according to the fourth embodiment of the present disclosure;
图8示意性示出了根据本公开第五实施例的语音交互装置的操作方法的流程图;FIG. 8 schematically shows a flowchart of an operation method of a voice interaction device according to a fifth embodiment of the present disclosure;
图9A示意性示出了根据本公开第六实施例的语音交互装置的操作方法的流程图;Fig. 9A schematically shows a flowchart of an operation method of a voice interaction device according to a sixth embodiment of the present disclosure;
图9B示意性示出了根据本公开第七实施例的语音交互装置的操作方法的流程图;FIG. 9B schematically shows a flowchart of the operation method of the voice interaction device according to the seventh embodiment of the present disclosure;
图10示意性示出了根据本公开实施例的控制装置的结构框图;Fig. 10 schematically shows a structural block diagram of a control device according to an embodiment of the present disclosure;
图11示意性示出了根据本公开实施例的语音交互装置的结构框图;Fig. 11 schematically shows a structural block diagram of a voice interaction device according to an embodiment of the present disclosure;
图12示意性示出了根据本公开实施例的适于执行控制装置的操作方法,或语音交互装置的操作方法的电子设备的方框图。Fig. 12 schematically shows a block diagram of an electronic device suitable for performing an operating method of a control device or an operating method of a voice interaction device according to an embodiment of the present disclosure.
具体实施方式Detailed ways
以下,将参照附图来描述本公开的实施例。但是应该理解,这些描述只是示例性的,而并非要限制本公开的范围。在下面的详细描述中,为便于解释,阐述了许多具体的细节以提供对本公开实施例的全面理解。然而,明显地,一个或多个实施例在没有这些具体细节的情况下也可以被实施。此外,在以下说明中,省略了对公知结构和技术的描述,以避免不必要地混淆本公开的概念。Hereinafter, embodiments of the present disclosure will be described with reference to the drawings. However, it should be understood that these descriptions are only exemplary and are not intended to limit the scope of the present disclosure. In the following detailed description, for ease of explanation, many specific details are set forth to provide a comprehensive understanding of the embodiments of the present disclosure. However, obviously, one or more embodiments may also be implemented without these specific details. In addition, in the following description, descriptions of well-known structures and technologies are omitted to avoid unnecessarily obscuring the concept of the present disclosure.
在此使用的术语仅仅是为了描述具体实施例,而并非意在限制本公 开。在此使用的术语“包括”、“包含”等表明了所述特征、步骤、操作和/或部件的存在,但是并不排除存在或添加一个或多个其他特征、步骤、操作或部件。The terminology used here is only for describing specific embodiments, and is not intended to limit the disclosure. The terms "including", "including", etc. used herein indicate the existence of the described features, steps, operations and/or components, but do not exclude the existence or addition of one or more other features, steps, operations or components.
在此使用的所有术语(包括技术和科学术语)具有本领域技术人员通常所理解的含义,除非另外定义。应注意,这里使用的术语应解释为具有与本说明书的上下文相一致的含义,而不应以理想化或过于刻板的方式来解释。All terms (including technical and scientific terms) used herein have meanings commonly understood by those skilled in the art, unless otherwise defined. It should be noted that the terms used herein should be interpreted as having meanings consistent with the context of this specification, and should not be interpreted in an idealized or overly rigid manner.
在使用类似于“A、B和C等中至少一个”这样的表述的情况下,一般来说应该按照本领域技术人员通常理解该表述的含义来予以解释(例如,“具有A、B和C中至少一个的系统”应包括但不限于单独具有A、单独具有B、单独具有C、具有A和B、具有A和C、具有B和C、和/或具有A、B、C的系统等)。In the case of using an expression similar to "at least one of A, B, C, etc.", generally speaking, it should be interpreted according to the meaning of the expression commonly understood by those skilled in the art (for example, "having A, B and C" At least one of the "systems" shall include but not limited to systems having A alone, B alone, C alone, A and B, A and C, B and C, and/or systems having A, B, C, etc. ).
本公开的实施例提供了一种能够提高交互体验的控制装置的操作方法,该方法包括:接收多个语音交互装置分别发送的多个参数信息,该多个参数信息是多个语音交互装置在采集到用户同一时刻的第一语音输入的情况下发送的;根据第一语音输入,确定用户的需求信息;根据多个参数信息及需求信息,确定多个语音交互装置中的第一语音交互装置为与用户交互的装置;以及向第一语音交互装置发送唤醒指令,并向多个语音交互装置中除第一语音交互装置外的其他语音交互装置发送非唤醒指令。其中,第一语音输入包括预定语音输入和表征用户需求的语音输入。The embodiment of the present disclosure provides an operating method of a control device capable of improving interactive experience. The method includes: receiving multiple parameter information sent by multiple voice interaction devices, where the multiple parameter information is It is sent when the first voice input of the user at the same time is collected; the user's demand information is determined according to the first voice input; the first voice interaction device among the multiple voice interaction devices is determined according to multiple parameter information and demand information It is a device that interacts with a user; and sends a wake-up instruction to the first voice interaction device, and sends a non-wake-up instruction to voice interaction devices other than the first voice interaction device among the multiple voice interaction devices. Wherein, the first voice input includes a predetermined voice input and a voice input that characterizes user needs.
本公开的另一实施例提供了一种能够提高交互体验的语音交互装置的操作方法,该方法包括:在采集到用户的第一语音输入的情况下,向控制装置发送参数信息,以确定语音交互装置是否为第一语音交互装置;在语音交互装置是第一语音交互装置的情况下,接收控制装置发送的唤醒指令,以响应于唤醒指令处于唤醒状态;在语音交互装置不是第一语音交互装置的情况下,接收控制装置发送的非唤醒指令,以响应于非唤醒指令处于非唤醒状态,其中,第一语音输入包括预定语音输入和表征所述用户需求的语音输入。Another embodiment of the present disclosure provides an operating method of a voice interaction device capable of improving interactive experience. The method includes: sending parameter information to the control device to determine the voice when the user’s first voice input is collected. Whether the interaction device is the first voice interaction device; in the case that the voice interaction device is the first voice interaction device, receive the wake-up instruction sent by the control device to respond to the wake-up instruction in the wake-up state; when the voice interaction device is not the first voice interaction device In the case of the device, a non-wake-up instruction sent by the control device is received to respond to the non-wake-up instruction in a non-wake-up state, wherein the first voice input includes a predetermined voice input and a voice input that represents the user's needs.
图1示意性示出了根据本公开实施例的控制装置及其操作方法,和 语音交互装置及其操作方法的应用场景。需要说明的是,图1所示仅为可以应用本公开实施例的应用场景的示例,以帮助本领域技术人员理解本公开的技术内容,但并不意味着本公开实施例不可以用于其他设备、系统、环境或场景。Fig. 1 schematically shows an application scenario of a control device and an operation method thereof, and a voice interaction device and an operation method thereof according to an embodiment of the present disclosure. It should be noted that FIG. 1 is only an example of application scenarios in which the embodiments of the present disclosure can be applied to help those skilled in the art understand the technical content of the present disclosure, but it does not mean that the embodiments of the present disclosure cannot be used for other applications. Equipment, system, environment or scenario.
如图1所示,该应用场景100包括多个语音交互装置、网络120和用户130,其中,网络120例如可以是局域网络、具体例如可以是无线网络等。As shown in FIG. 1, the application scenario 100 includes multiple voice interaction devices, a network 120, and a user 130. The network 120 may be a local area network, for example, a wireless network, etc., for example.
其中,多个语音交互装置中的至少两个装置之间例如可以通过网络120进行交互,该多个语音交互装置是能够与用户通过语音进行交互的智能电子设备。Wherein, at least two of the multiple voice interaction devices may interact with each other via the network 120, for example, and the multiple voice interaction devices are smart electronic devices that can interact with the user through voice.
具体地,该多个语音交互装置例如可以是智能家居设备,具体可以包括智能灯具111、智能电视、智能机顶盒、智能音箱、智能空调112、智能热水器113、智能冰箱114、智能窗帘、智能洗衣机115、智能空气净化器、智能游戏机和智能投影仪等;或者,具体例如可以包括智能卫浴设备,例如智能花洒、智能浴缸、智能浴霸、智能梳妆镜、智能马桶等;或者,具体例如还可以包括智能厨具设备,例如智能抽油烟机、智能热水壶、智能燃气罩、智能橱柜、智能洗碗机、智能微波炉和智能烤箱等。Specifically, the multiple voice interaction devices may be, for example, smart home devices, which may specifically include smart lamps 111, smart TVs, smart set-top boxes, smart speakers, smart air conditioners 112, smart water heaters 113, smart refrigerators 114, smart curtains, and smart washing machines 115. , Smart air purifiers, smart game consoles, smart projectors, etc.; or, for example, may include smart bathroom equipment, such as smart showers, smart bathtubs, smart bath heaters, smart vanity mirrors, smart toilets, etc.; or, for specific examples, it may also Including smart kitchen equipment, such as smart range hoods, smart hot water boilers, smart gas hoods, smart cabinets, smart dishwashers, smart microwave ovens, and smart ovens.
根据本公开的实施例,该多个语音交互装置可以接收用户的语音输入,并且可以在语音输入包括唤醒词(例如“叮咚叮咚”等)的情况下,可以响应于该语音输入而切换至唤醒状态,以接收用户指令,并根据用户指令执行相应操作。According to an embodiment of the present disclosure, the plurality of voice interaction devices can receive a user's voice input, and can switch in response to the voice input when the voice input includes a wake-up word (for example, "dingdongdingdong", etc.) To wake up state, to receive user instructions, and perform corresponding operations according to user instructions.
根据本公开的实施例,该多个语音交互装置例如还具有传感器,例如声音传感器或距离传感器等,以用于在接收到用户的语音输入的情况下,根据声音来源或红外检测实现对用户的定位,以测得用户相对于其自身的位置信息。According to an embodiment of the present disclosure, the plurality of voice interaction devices, for example, further have sensors, such as a sound sensor or a distance sensor, etc., which are used to realize the user's voice input according to the sound source or infrared detection when the user's voice input is received. Positioning to measure the user's location information relative to itself.
根据本公开的实施例,该多个语音交互装置例如还可以与提供网络120的网络设备进行交互,用于将测得的位置信息发送给网络设备,以由网络设备根据位置信息确定与用户交互的第一语音交互装置,使得该多个语音交互装置在网络设备的控制下进行工作。According to an embodiment of the present disclosure, the multiple voice interaction apparatuses may also interact with a network device that provides the network 120, for example, to send the measured location information to the network device, so that the network device determines the interaction with the user according to the location information. The first voice interaction device enables the multiple voice interaction devices to work under the control of the network device.
根据本公开的实施例,用户130例如还可以通过其他电子设备中安装的应用程序对该多个语音交互装置进行控制,且可以通过该应用程序设定该多个语音交互装置中的一个语音交互装置为控制装置,其余装置为受控装置,则该多个语音交互装置将根据语音输入测得的位置信息发送给控制装置,以由控制装置根据位置信息确定与用户交互的第一语音交互装置,使得该多个语音交互装置在设定的控制装置的控制下进行工作。According to the embodiment of the present disclosure, the user 130 may, for example, also control the multiple voice interaction devices through an application program installed in other electronic devices, and may set one voice interaction device among the multiple voice interaction devices through the application program. If the device is a control device, and the other devices are controlled devices, the multiple voice interaction devices send the position information measured according to the voice input to the control device, so that the control device determines the first voice interaction device to interact with the user according to the position information , So that the multiple voice interaction devices work under the control of the set control device.
例如,如图1所示,当用户130说出包括唤醒词“叮咚叮咚,帮我播放稻香”的语音时,多个语音交互装置智能灯具111、智能空调112、智能热水器113、智能冰箱114和智能洗衣机115中的部分或全部均可接收到该语音输入,则接收到该语音输入的语音交互装置将参数信息(可以包括自身的功能信息和/或检测到的用户的位置信息)发送给网络设备或确定的控制装置,网络设备或确定的控制装置根据接收的参数信息,确定具有音乐播放功能且距离用户较近的智能冰箱114为第一语音交互装置,以用于与用户130进行交互,智能冰箱114在该网络设备或确定的控制装置的控制下被唤醒,而智能灯具111、智能空调112、智能热水器113和智能洗衣机115则在该网络设备或确定的控制装置的控制下保持非唤醒状态,从而可以避免多个语音交互装置均被唤醒导致的语音交互场景嘈杂混乱的缺陷,提高用户体验。For example, as shown in Figure 1, when the user 130 speaks a voice that includes the wake-up word "Ding Dong Ding Dong, help me play Dao Xiang", multiple voice interaction devices smart lamps 111, smart air conditioners 112, smart water heaters 113, smart Part or all of the refrigerator 114 and the smart washing machine 115 can receive the voice input, and the voice interaction device that receives the voice input will set parameter information (which may include its own function information and/or detected user location information) It is sent to the network device or the determined control device, and the network device or the determined control device determines that the smart refrigerator 114 with music playback function and close to the user is the first voice interaction device according to the received parameter information. During the interaction, the smart refrigerator 114 is awakened under the control of the network device or the determined control device, while the smart lamp 111, the smart air conditioner 112, the smart water heater 113, and the smart washing machine 115 are under the control of the network device or the determined control device Keeping the non-awakened state can avoid the noisy and chaotic voice interaction scene caused by multiple voice interaction devices being awakened, and improve user experience.
需要说明的是,本公开实施例所提供的控制装置的操作方法一般可以由确定的控制装置执行,语音交互装置的操作方法可以由多个语音交互装置中任意一个装置执行。相应地,本公开实施例所提供的控制装置一般可以设置于网络设备或任意一个语音交互装置中(例如确定的控制装置中),语音交互装置可以是参考图1中的语音交互装置111~115。本公开实施例所提供的控制装置的操作方法也可以由不同于语音交互装置且能够与语音交互装置通信的其他电子设备执行。相应地,本公开实施例所提供的控制装置也可以设置于不同于语音交互装置且能够与语音交互装置通信的其他电子设备中。It should be noted that the operation method of the control device provided in the embodiments of the present disclosure may generally be executed by a certain control device, and the operation method of the voice interaction device may be executed by any one of multiple voice interaction devices. Correspondingly, the control device provided by the embodiment of the present disclosure can generally be set in a network device or any voice interaction device (for example, in a certain control device), and the voice interaction device may refer to the voice interaction devices 111 to 115 in FIG. . The operation method of the control device provided by the embodiment of the present disclosure can also be executed by other electronic equipment that is different from the voice interaction device and can communicate with the voice interaction device. Correspondingly, the control device provided by the embodiment of the present disclosure can also be set in other electronic equipment that is different from the voice interaction device and can communicate with the voice interaction device.
可以理解的是,图1中的语音交互装置和网络120的类型和数目仅仅是示意性的,根据实现需要,可以具有任意类型和任意数目的语音交 互装置和云端设备。It can be understood that the types and numbers of voice interaction devices and networks 120 in FIG. 1 are only illustrative, and any type and number of voice interaction devices and cloud devices can be provided according to implementation needs.
图2A示意性示出了根据本公开实施例的控制装置的操作方法的流程图,图2B示意性示出了根据本公开实施例的确定第一语音交互装置的操作流程图,图2C示意性示出了根据本公开实施例的向其他语音交互装置发送非唤醒指令的操作流程图。FIG. 2A schematically shows a flowchart of an operation method of a control device according to an embodiment of the present disclosure, FIG. 2B schematically shows an operation flowchart of determining a first voice interaction device according to an embodiment of the present disclosure, and FIG. 2C schematically An operation flowchart of sending a non-wake-up instruction to another voice interaction device according to an embodiment of the present disclosure is shown.
如图2A所示,该控制装置的操作方法包括操作S201~操作S204。As shown in FIG. 2A, the operation method of the control device includes operation S201 to operation S204.
在操作S201,接收多个语音交互装置分别发送的多个参数信息。In operation S201, multiple parameter information respectively sent by multiple voice interaction apparatuses are received.
具体地,多个参数信息具体可以是在多个语音交互装置接收到某个用户在同一时刻的第一语音输入的情况下发送的。相应地,语音交互装置具有语音采集功能,例如设置有语音采集器等,该多个语音交互装置可以是空间范围内能够采集到用户的第一语音输入的语音交互装置,例如,若用户家中具有n个语音交互装置,则操作S201中涉及的多个语音交互装置为该n个语音交互装置中的部分或全部。例如,该多个语音交互装置可以是参考图1中的智能灯具111、智能空调112、智能热水器113、智能冰箱114和智能洗衣机115中的部分或全部。Specifically, the multiple parameter information may be sent when multiple voice interaction devices receive the first voice input of a certain user at the same time. Correspondingly, the voice interaction device has a voice collection function, such as a voice collector, etc. The multiple voice interaction devices may be voice interaction devices capable of collecting the user's first voice input within a spatial range, for example, if the user has If there are n voice interaction devices, the multiple voice interaction devices involved in operation S201 are some or all of the n voice interaction devices. For example, the multiple voice interaction devices may be part or all of the smart lamp 111, smart air conditioner 112, smart water heater 113, smart refrigerator 114, and smart washing machine 115 in FIG. 1.
其中,当用户发出语音指令“叮咚叮咚、帮我播放稻香”时,多个语音交互装置111~115均可通过采集该语音指令,接收到第一语音输入。则多个语音交互装置111~115在接收到第一语音输入时,即分别向控制装置发送参数信息,以供控制装置接收。Wherein, when the user issues a voice command "Ding Dong Ding Dong, play Dao Xiang for me", multiple voice interaction devices 111 to 115 can receive the first voice input by collecting the voice command. Then, when the multiple voice interaction devices 111-115 receive the first voice input, they respectively send parameter information to the control device for the control device to receive.
根据本公开的实施例,所述的第一语音输入具体可以包括有预定语音输入和表征用户需求的语音输入。其中,预定语音输入例如可以是语音交互装置的唤醒词(例如“叮咚叮咚”)对应的语音输入。该唤醒词例如可以是语音交互装置出厂时预设的唤醒词,也可以是用户自定义设定的唤醒词。其中的表征用户需求的语音输入则为除了唤醒词外的其他语音对应的语音输入,例如可以为“帮我播放稻香”等。According to an embodiment of the present disclosure, the first voice input may specifically include a predetermined voice input and a voice input that characterizes user needs. Wherein, the predetermined voice input may be, for example, a voice input corresponding to a wake-up word (for example, "Ding Dong Ding Dong") of the voice interaction device. The wake-up word may be, for example, a wake-up word preset when the voice interactive device is shipped from the factory, or a wake-up word customized by the user. Among them, the voice input that characterizes the user's needs is the voice input corresponding to other voices except the wake-up word, for example, it can be "Play Daoxiang for me".
根据本公开的实施例,所述参数信息例如可以包括语音交互装置的属性参数和/或性能参数等,其中属性参数例如可以包括语音交互装置的品牌、型号等,性能参数例如可以包括语音交互装置的功能(例如播放音乐、播放视频、控温、照明灯)、及语音交互装置的工作参数(例如光照亮度、音质、屏幕分辨率、温度调节范围等)。According to an embodiment of the present disclosure, the parameter information may include, for example, attribute parameters and/or performance parameters of the voice interaction device, where the attribute parameters may include, for example, the brand and model of the voice interaction device, and the performance parameters may include, for example, the voice interaction device. Functions (such as playing music, playing video, temperature control, lighting), and the working parameters of the voice interaction device (such as lighting brightness, sound quality, screen resolution, temperature adjustment range, etc.).
在操作S202,根据第一语音输入,确定所述用户的需求信息。In operation S202, the demand information of the user is determined according to the first voice input.
根据本公开的实施例,当控制装置集成于某个语音交互装置时,则该某个语音交互装置还可以在采集得到第一语音输入后,对第一语音输入进行语音识别分析,从而得到用户的需求信息。其中用户的需求信息具体是通过识别分析第一语音输入中表征用户需求的语音输入得到的。例如,当表征用户需求的语音输入为“帮我播放稻香”时,得到的需求信息例如可以为播放音乐;当表征用户需求的语音输入为“帮我播放无损音乐时”,得到的需求信息例如可以为播放音乐和高音质等。其中,识别分析语音的具体实现方法可以采用现有技术中任意的语音识别方法,本公开对此不作限定。According to the embodiment of the present disclosure, when the control device is integrated into a certain voice interaction device, the certain voice interaction device may also perform voice recognition analysis on the first voice input after collecting the first voice input, thereby obtaining the user Demand information. The user's demand information is specifically obtained by recognizing and analyzing the voice input that represents the user's demand in the first voice input. For example, when the voice input that characterizes the user's needs is "Play Daoxiang for me", the demand information obtained can be, for example, playing music; when the voice input that characterizes the user's needs is "Playing lossless music for me", the demand information is obtained. For example, it can be playing music and high sound quality. Among them, the specific implementation method of recognizing and analyzing speech can adopt any speech recognition method in the prior art, which is not limited in the present disclosure.
在操作S203,根据多个参数信息及需求信息,确定多个语音交互装置中的第一语音交互装置为与所述用户交互的装置。In operation S203, the first voice interaction device among the multiple voice interaction devices is determined to be the device that interacts with the user according to multiple parameter information and demand information.
根据本公开的实施例,在多个参数信息包括性能参数的情况下,该操作S203具体例如可以包括:先确定多个参数信息与需求信息的匹配关系,然后确定参数信息与需求信息匹配的语音交互装置为第一语音交互装置。例如,当需求信息为播放音乐时,可以确定表征具有音乐播放功能的参数信息与需求信息相匹配,则该参数信息对应的语音交互装置即为第一语音交互装置。当需求信息为播放音乐和高音质时,首先确定表征具有音乐播放功能的参数信息与需求信息相匹配,再确定表征音质高的参数信息与需求信息的匹配度更高,则可以确定匹配度高的参数信息对应的语音交互装置为第一语音交互装置。According to an embodiment of the present disclosure, when the multiple parameter information includes performance parameters, the operation S203 may specifically include, for example, first determining the matching relationship between the multiple parameter information and the demand information, and then determining the voice matching the parameter information and the demand information The interaction device is the first voice interaction device. For example, when the demand information is to play music, it can be determined that the parameter information that has a music play function matches the demand information, and the voice interaction device corresponding to the parameter information is the first voice interaction device. When the demand information is to play music and high sound quality, first determine that the parameter information that characterizes the music playing function matches the demand information, and then determine that the parameter information that characterizes high sound quality has a higher degree of matching with the demand information, and then it can be determined that the matching degree is high The voice interaction device corresponding to the parameter information is the first voice interaction device.
根据本公开的实施例,为了在性能参数与需求信息匹配的语音交互装置有多个时,能够择一的选择第一语音交互装置,从而进一步避免多个语音交互装置响应于用户的指令导致的环境嘈杂的情况。语音交互装置发送的参数信息例如还可以包括用于表征用户相对于语音交互装置的位置的用户的位置信息。According to the embodiments of the present disclosure, in order to be able to select the first voice interaction device alternatively when there are multiple voice interaction devices matching performance parameters and demand information, thereby further avoiding multiple voice interaction devices responding to user instructions. Noisy environment. The parameter information sent by the voice interaction device may, for example, further include user location information used to characterize the location of the user relative to the voice interaction device.
相应地,该多个语音交互装置例如还应具有对采集得到的语音输入进行分析处理的功能。具体地,语音交互装置可以在采集到用户的语音输入后,根据语音信号的强弱来确定用户相对于其自身的距离远近。则发送的参数信息中包括的位置信息可以是距离值。或者,该多个语音交 互装置还可以设置有能够对用户进行定位的传感器,且该传感器工作的触发条件为语音交互装置采集到第一语音输入,该传感器例如可以是通过语音交互装置采集到的语音输入的声音来源实现对用户的定位,或者还可以是通过红外检测等技术实现对用户的定位,以获取用户信息包括的用户的位置信息,该位置信息可以是距离值,或者是以语音交互装置所在位置为原点定位得到的用户所在位置的坐标值。可以理解的是,上述用户的位置信息仅作为示例以利于理解本公开,该位置信息例如还可以包括确定的用户所在的空间。Correspondingly, the multiple voice interaction devices, for example, should also have the function of analyzing and processing the collected voice input. Specifically, the voice interaction device may determine the distance of the user relative to itself according to the strength of the voice signal after collecting the voice input of the user. Then the position information included in the sent parameter information may be a distance value. Alternatively, the multiple voice interaction devices may also be provided with a sensor capable of positioning the user, and the trigger condition of the sensor operation is that the voice interaction device collects the first voice input, and the sensor may be collected by the voice interaction device, for example. The sound source of the voice input realizes the positioning of the user, or it can also realize the positioning of the user through infrared detection and other technologies to obtain the user's location information included in the user information. The location information can be a distance value or a voice interaction The location of the device is the coordinate value of the user's location obtained by the origin positioning. It is understandable that the location information of the user described above is only used as an example to facilitate understanding of the present disclosure, and the location information may also include, for example, the space where the user is determined.
则如图2B所示,操作S203具体可以包括操作S213~操作S223。在操作S213,确定多个语音交互装置中性能参数与用户的需求信息匹配的至少一个第二语音交互装置;在操作S223,根据至少一个第二语音交互装置发送的参数信息中的用户的位置信息,确定至少一个第二语音交互装置中的一个为第一语音交互装置。具体可以是,先根据多个语音交互装置的性能参数,确定能够满足用户需求的第二语音交互装置。然后选择位置信息表征距离用户最近的第二语音交互装置为第一语音交互装置。As shown in FIG. 2B, operation S203 may specifically include operation S213 to operation S223. In operation S213, determine at least one second voice interaction device whose performance parameter matches the user's demand information among the plurality of voice interaction devices; in operation S223, according to the user's location information in the parameter information sent by the at least one second voice interaction device , It is determined that one of the at least one second voice interaction device is the first voice interaction device. Specifically, the second voice interaction device that can meet the needs of the user is determined according to the performance parameters of multiple voice interaction devices. Then, the second voice interaction device whose location information characterizes the closest distance to the user is selected as the first voice interaction device.
根据本公开的实施例,该操作S203确定的第一语音交互装置例如可以是参考图1中描述的智能冰箱114,在此不再赘述。According to an embodiment of the present disclosure, the first voice interaction device determined in operation S203 may be, for example, the smart refrigerator 114 described with reference to FIG. 1, which will not be repeated here.
在操作S204,向第一语音交互装置发送唤醒指令,并向多个语音交互装置中除第一语音交互装置外的其他语音交互装置发送非唤醒指令。In operation S204, a wake-up instruction is sent to the first voice interaction device, and a non-wake-up instruction is sent to other voice interaction devices among the plurality of voice interaction devices except the first voice interaction device.
根据本公开的实施例,由于向第一语音交互装置发送的是唤醒指令,则第一语音交互装置可以响应于该唤醒指令由接收第一语音输入之前的状态切换至唤醒状态;而由于向其他语音交互装置发送的是非唤醒指令,因此,其他语音交互装置可以由接收第一语音输入之前的状态切换至非唤醒状态。According to the embodiment of the present disclosure, since the wake-up instruction is sent to the first voice interaction device, the first voice interaction device can switch from the state before receiving the first voice input to the wake-up state in response to the wake-up instruction; The voice interaction device sends a non-wake-up instruction. Therefore, other voice interaction devices can switch from the state before receiving the first voice input to the non-wake state.
根据本公开的实施例,非唤醒状态例如可以是不对第一语音输入做出任何响应的状态;或者,该非唤醒状态例如还可以是语音交互装置不执行任何操作的状态,即类似于关机的状态。According to an embodiment of the present disclosure, the non-awake state may be, for example, a state in which no response is made to the first voice input; or, the non-awake state may also be a state in which the voice interaction device does not perform any operation, that is, similar to shutting down. status.
根据本公开的实施例,上述操作S204具体可以是,向参考图1中的智能冰箱114发送唤醒指令,则该智能冰箱114响应于该唤醒指令被唤醒,可以与用户进行语音交互,而向智能灯具111、智能空调112、智 能热水器113和智能洗衣机115中接收到第一语音输入的语音交互装置发送的是非唤醒指令,则该些语音交互装置响应于该非唤醒指令切换至非唤醒状态,即无法与用户进行语音交互的状态。According to an embodiment of the present disclosure, the above-mentioned operation S204 may specifically be: sending a wake-up instruction to the smart refrigerator 114 in reference to FIG. 1, and the smart refrigerator 114 is awakened in response to the wake-up instruction, and can perform voice interaction with the user, and send a voice to the smart refrigerator 114. The voice interaction device that receives the first voice input among the lamp 111, the smart air conditioner 112, the smart water heater 113, and the smart washing machine 115 sends a non-wake-up instruction, and the voice interaction devices switch to the non-wake state in response to the non-wake-up instruction, that is, The state where voice interaction with the user is not possible.
根据本公开的实施例,考虑到其他语音交互装置中可能有部分语音交互装置在采集到第一语音输入之前执行的操作是类似于照明、供冷等不影响第一语音交互装置与用户交互过程的操作,例如不会发出声音的操作。此种情况下,若直接将该部分语音交互装置切换至类似于关机的状态,可能会影响用户体验。例如,在用户位于黑暗的空间中发出“叮咚叮咚,帮我播放稻香”的语音指令之前,位于该黑暗空间中的智能灯具111执行照明操作,此时若在确定第一语音交互装置为智能冰箱114时,向智能灯具111发出指令使该智能灯具111切换至不执行任何操作的状态(即不执行照明操作的状态),无疑会给用户带来较差的体验。According to the embodiment of the present disclosure, considering that some voice interaction devices may perform operations before collecting the first voice input in other voice interaction devices, which are similar to lighting, cooling, etc., and do not affect the interaction process between the first voice interaction device and the user. Operations, such as operations that do not make a sound. In this case, if the part of the voice interaction device is directly switched to a state similar to shutting down, the user experience may be affected. For example, before the user sends out the voice command "Ding Dong Ding Dong, help me play Dao Xiang" in a dark space, the smart lamp 111 in the dark space performs the lighting operation. At this time, if the first voice interaction device is determined In the case of a smart refrigerator 114, issuing an instruction to the smart lamp 111 to switch the smart lamp 111 to a state of not performing any operation (that is, a state of not performing a lighting operation) will undoubtedly bring a poor user experience.
为了避免上述缺陷,如图2C所示,该操作S204发出的非唤醒指令例如可以包括休眠指令和睡眠指令,且多个语音交互装置发送的参数信息还可以包括语音交互装置的操作信息,以用于表征语音交互装置在采集得到第一语音输入时执行的操作。相应地,操作S204具体例如可以包括操作S214~操作S224。In order to avoid the above-mentioned defects, as shown in FIG. 2C, the non-wake-up instruction issued by the operation S204 may include a sleep instruction and a sleep instruction, and the parameter information sent by multiple voice interaction devices may also include operation information of the voice interaction device. To characterize the operation performed by the voice interaction device when the first voice input is collected. Correspondingly, operation S204 may specifically include operation S214 to operation S224, for example.
在操作S214,根据其他语音交互装置的操作信息,确定其他语音交互装置在采集第一语音输入时是否执行第一操作;在操作S224,向执行第一操作的其他语音交互装置发送睡眠指令,向未执行第一操作的其他语音交互装置发送休眠指令。其中,第一操作具体可以是不影响第一语音交互装置与用户交互过程的操作,该第一操作包括的操作类型可以根据实际需求进行设定。在其他语音交互装置接收到睡眠指令时,可以响应于睡眠指令切换到睡眠状态,该睡眠状态具体可以是能够执行第一操作,能够采集语音输入,但对采集到的用户的语音输入不作响应的状态。在其他语音交互装置接收到休眠指令时,可以响应于休眠指令切换到休眠状态。休眠状态可以是不执行任何操作的状态,例如可以为关机状态。In operation S214, according to the operation information of other voice interaction devices, it is determined whether the other voice interaction device performs the first operation when collecting the first voice input; in operation S224, the sleep instruction is sent to other voice interaction devices that perform the first operation, and Other voice interaction devices that have not performed the first operation send a sleep instruction. The first operation may specifically be an operation that does not affect the interaction process between the first voice interaction device and the user, and the type of operation included in the first operation may be set according to actual requirements. When another voice interaction device receives a sleep instruction, it can switch to the sleep state in response to the sleep instruction. The sleep state may specifically be able to perform the first operation and collect voice input, but does not respond to the collected voice input of the user status. When other voice interaction devices receive the sleep instruction, they can switch to the sleep state in response to the sleep instruction. The sleep state may be a state in which no operation is performed, for example, it may be a shutdown state.
综上可知,本公开的控制装置的操作方法通过接收的参数信息确定多个语音交互装置中的唯一的装置作为被唤醒的装置,用于与用户进行交互,相较于现有技术中多个语音交互装置均被唤醒的技术方案,可以 避免多个语音交互装置同时与用户进行交互导致的交互场景嘈杂混乱的缺陷,并因此提高用户体验。In summary, the operating method of the control device of the present disclosure determines the only device among the multiple voice interaction devices as the awakened device through the received parameter information, which is used to interact with the user, compared with the multiple voice interaction devices in the prior art. The technical solution in which voice interaction devices are all awakened can avoid the defect of noisy and chaotic interaction scenes caused by multiple voice interaction devices interacting with the user at the same time, and thus improve user experience.
图3示意性示出了根据本公开第二实施例的控制装置的操作方法的流程图。Fig. 3 schematically shows a flowchart of an operation method of a control device according to a second embodiment of the present disclosure.
根据本公开的实施例的控制装置的操作方法除了图2A描述的操作S201~操作S204外,如图3所示,还可以包括操作S305~操作S307。In addition to operations S201 to S204 described in FIG. 2A, the operation method of the control device according to an embodiment of the present disclosure may further include operations S305 to S307 as shown in FIG. 3.
在操作S305,接收第一语音交互装置在采集到用户的第二语音输入的情况下发送的第一语音信息。其中,第一语音信息与第二语音输入相对应。In operation S305, the first voice information sent by the first voice interaction device when the user's second voice input is collected is received. Wherein, the first voice information corresponds to the second voice input.
其中第二语音输入具体可以是与用于向多个语音交互装置发送指令的语音对应的语音输入,其中的指令具体可以是类似于“关闭所有设备”、或者“我要出门了”等用户发出的、多个语音交互装置可以通用的、且需要多个语音交互装置响应的语音指令,由于该语音指令需要多个语音交互装置协同工作才能达到用户想要的效果,因此,若仅第一语音交互装置响应于该第二语音输入对应的第二语音信息执行相应操作,则不能很好的满足用户的需求。The second voice input may specifically be a voice input corresponding to the voice used to send instructions to multiple voice interaction devices, and the instruction may specifically be similar to "turn off all devices" or "I'm going out", etc. Multiple voice interaction devices can be universal and require multiple voice interaction devices to respond to voice instructions. Because the voice instruction requires multiple voice interaction devices to work together to achieve the effect that the user wants, if only the first voice The interaction device performs a corresponding operation in response to the second voice information corresponding to the second voice input, which cannot well meet the needs of the user.
根据本公开的实施例,在第一语音交互装置切换至唤醒状态后,即可实时的采集用户的语音输入,并响应于用户的语音输入,执行相应的操作。考虑到上述问题,当采集到的用户的语音输入为第二语音输入时,先由第一语音交互装置确定对应的第一语音信息是否为通用语音信息,即多个语音交互装置可以通用的、且需要多个语音交互装置共同响应的语音信息;若是通用语音信息,则应该向控制装置发送该第二语音输入对应的第一语音信息,以通知控制装置该语音指令需要多个语音交互装置共同完成。具体地,云端系统或第一语音交互装置预先存储有通用语音信息列表,以作为确定是否为通用语音信息的参考。According to the embodiments of the present disclosure, after the first voice interaction device is switched to the awake state, the user's voice input can be collected in real time, and corresponding operations can be performed in response to the user's voice input. Taking into account the above problems, when the collected voice input of the user is the second voice input, the first voice interaction device first determines whether the corresponding first voice information is general voice information, that is, multiple voice interaction devices can be universal, And the voice information that multiple voice interaction devices respond together is required; if it is general voice information, the first voice information corresponding to the second voice input should be sent to the control device to notify the control device that the voice command requires multiple voice interaction devices to share carry out. Specifically, the cloud system or the first voice interaction device pre-stores a list of general voice information as a reference for determining whether it is general voice information.
其中的第一语音信息具体例如可以是与第二语音输入对应的,能够表征该第二语音输入的、电子设备能够识别的信息,例如可以是将第二语音输入转换得到的二进制编码或字符序列等;或者对语音输入进行识别处理后,再转换得到的二进制编码或字符序列等。The first voice information may specifically correspond to the second voice input, and can characterize the second voice input and the information that can be recognized by the electronic device. For example, it may be a binary code or character sequence obtained by converting the second voice input. And so on; or after the speech input is recognized and processed, then the binary code or character sequence obtained is converted.
在操作S306,确定第一语音信息是否为通用语音信息。In operation S306, it is determined whether the first voice information is general voice information.
根据本公开的实施例,该操作S306具体可以是将第一语音信息与云端系统中存储的或控制装置预存储的通用语音信息列表进行比对,若该第一语音信息是通用语音列表中的信息,则确定该第一语音信息是通用语音信息;或者,操作S306具体还可以通过以下操作实现:将第一语音信息作为预训练得到的深度学习模型的输入,输出得到的结果即为二分类结果,可以表征是通用语言信息或不是通用语言信息。可以理解的是,上述方法仅为实现操作S306的示例,本公开对此不作限定。还可以理解的是,该控制装置通过再次判断第一语音信息是否为通用语音信息,可以避免控制装置与第一语音交互装置存储的通用语音信息存在差异的情况导致的判断结果不准确的缺陷。According to an embodiment of the present disclosure, the operation S306 may specifically be to compare the first voice information with a general voice information list stored in the cloud system or pre-stored by the control device, if the first voice information is in the general voice list Information, it is determined that the first voice information is general voice information; or, operation S306 can also be specifically implemented by the following operations: use the first voice information as the input of the pre-trained deep learning model, and the output result is the binary classification As a result, it can be characterized as universal language information or not universal language information. It can be understood that the foregoing method is only an example for implementing operation S306, which is not limited in the present disclosure. It is also understandable that the control device can avoid the defect of inaccurate judgment result caused by the difference between the general voice information stored in the control device and the first voice interaction device by determining whether the first voice information is general voice information again.
在操作S307,在第一语音信息为通用语音信息的情况下,向多个语音交互装置发送第一语音信息,以使多个语音交互装置执行与第二语音输入相对应的操作。In operation S307, in a case where the first voice information is general voice information, the first voice information is sent to multiple voice interaction devices, so that the multiple voice interaction devices perform an operation corresponding to the second voice input.
考虑到该属于通用语音信息的第一语音信息对应的第二语音输入的语音指令需要多个语音交互装置协同操作才能满足用户的需求,因此,在确定了第一语音信息是通用语音信息的情况下,将该第一语音信息发送至多个语音交互装置,通过多个语音交互装置响应于该第一语音信息的操作即可完成与用户需求相同的活动,满足用户需求,例如当第二语音输入为“我要出门”或“关闭所有设备”时,能够表征用户的意图为关闭所有语音交互装置,则该多个语音交互装置通过执行与第一语音信息对应的操作,例如关机操作,即可满足用户的需求,从而在保证交互场景不嘈杂混乱的同时,可使得语音交互装置的响应操作更为符合用户需求。Considering that the second voice input corresponding to the first voice information belonging to the general voice information requires the cooperative operation of multiple voice interaction devices to meet the needs of the user, therefore, when it is determined that the first voice information is general voice information Next, the first voice information is sent to multiple voice interaction devices, and the multiple voice interaction devices respond to the operations of the first voice information to complete the same activities as the user needs, and meet the user needs, for example, when the second voice input When it is "I want to go out" or "Turn off all devices", it can indicate that the user's intention is to turn off all voice interaction devices, and the multiple voice interaction devices can perform operations corresponding to the first voice information, such as shutdown operations. To meet the needs of users, while ensuring that the interactive scene is not noisy and chaotic, the response operation of the voice interactive device can be made more in line with user needs.
根据本公开的实施例,在上述应用场景中,为了避免本身处于关机状态的语音交互装置在接收到第一语音信息后因再次执行对应的操作而开机。语音交互装置在接收到所述第一语音信息后,例如还可以根据当前状态来确定是否执行与第二语音输入对应的操作。若当前状态与第一语音信息相匹配,则执行与第二语音输入对应的操作,若当前状态与第一语音信息不匹配,则不执行与第二语音输入对应的操作。According to the embodiments of the present disclosure, in the above-mentioned application scenario, in order to prevent the voice interaction device itself from being turned off after receiving the first voice information, the voice interaction device may be turned on by performing a corresponding operation again. After receiving the first voice information, the voice interaction apparatus may determine whether to perform an operation corresponding to the second voice input, for example, according to the current state. If the current state matches the first voice information, the operation corresponding to the second voice input is performed, and if the current state does not match the first voice information, the operation corresponding to the second voice input is not performed.
图4示意性示出了根据本公开第三实施例的控制装置的操作方法的 流程图。Fig. 4 schematically shows a flowchart of the operation method of the control device according to the third embodiment of the present disclosure.
根据本公开的实施例,在第一语音交互装置在采集到用户的第三语音输入或在预设时段内未采集到用户的语音输入的情况下,考虑到可能用户此时不再需要与该第一语音交互装置进行交互。或者,在预设时段内未采集到用户的语音输入的情况下,还可以向用户发出类似于“主人,主人,您还在听吗?”或“主人,您还在吗?”的询问语音,在发出询问语音后依旧未采集到用户的语音输入的情况下,可以确定用户此时不再需要与该第一语音交互装置进行交互。其中,第三语音输入具体可以是用户发出的类似于“休眠”等指令对应的语音输入。According to an embodiment of the present disclosure, when the first voice interaction device collects the user's third voice input or does not collect the user's voice input within a preset time period, it is considered that the user may no longer need to interact with the user at this time. The first voice interaction device interacts. Or, in the case that the user’s voice input is not collected within the preset time period, the user can also be issued a questioning voice similar to "Master, master, are you still listening?" or "Master, are you still there?" If the user’s voice input is still not collected after the inquiry voice is issued, it can be determined that the user no longer needs to interact with the first voice interaction device at this time. Wherein, the third voice input may specifically be a voice input corresponding to an instruction similar to "sleep" issued by the user.
此种情况下,考虑到在第一语音交互装置和/或其他语音交互装置接收第一语音输入之前可能在执行播放广播或音乐等操作,而用户往往希望在结束与第一语音交互装置的交互后,第一语音交互装置和/或其他语音交互装置能够继续播放广播或音乐等操作,因此,在第一语音交互装置确定用户不再需要与第一语音交互装置进行交互后,可以向控制装置发送表征结束交互的恢复请求。In this case, considering that operations such as playing broadcast or music may be performed before the first voice interaction device and/or other voice interaction devices receive the first voice input, the user often wishes to end the interaction with the first voice interaction device Later, the first voice interaction device and/or other voice interaction devices can continue to play broadcast or music operations. Therefore, after the first voice interaction device determines that the user no longer needs to interact with the first voice interaction device, the control device Send a resume request that characterizes the end of the interaction.
相应地,本公开实施例的控制装置的操作方法除了图2A描述的操作S201~操作S204外,如图4所示,还可以包括S408~操作S409。在操作S408,接收第一语音交互装置发送的恢复请求,该恢复请求由第一语音交互装置在采集到用户的第三语音输入或预设时段内未采集到用户的语音输入的情况下发送;以及在操作S409,向多个语音交互装置发送恢复指令,以使多个语音交互装置恢复至采集用户的第一语音输入之前的状态。具体即为,使多个语音交互装置恢复至其在采集第一语音输入之前的播放音乐或广播等的状态,以继续执行向用户播放广播或音乐等操作,满足用户需求。Correspondingly, in addition to operations S201 to S204 described in FIG. 2A, as shown in FIG. 4, the operation method of the control device of the embodiment of the present disclosure may further include S408 to S409. In operation S408, a recovery request sent by the first voice interaction device is received, and the recovery request is sent by the first voice interaction device when the user's third voice input is collected or the user's voice input is not collected within a preset time period; And in operation S409, a recovery instruction is sent to the multiple voice interaction devices, so that the multiple voice interaction devices are restored to the state before the first voice input of the user is collected. Specifically, the multiple voice interaction devices are restored to the state of playing music or radio before collecting the first voice input, so as to continue to perform operations such as playing radio or music to the user to meet user needs.
图5A示意性示出了根据本公开第四实施例的控制装置的操作方法的流程图。Fig. 5A schematically shows a flowchart of an operation method of a control device according to a fourth embodiment of the present disclosure.
本公开实施例的控制装置的操作方法除了图2A描述的操作S201~操作S204外,如图5A所示,还可以包括操作S510~操作S512。In addition to operations S201 to S204 described in FIG. 2A, the operation method of the control device of the embodiment of the present disclosure may also include operations S510 to S512 as shown in FIG. 5A.
在操作S510,监听第一语音交互装置的操作,确定第一语音交互装置是否执行第三操作。在操作S511,在第一语音交互装置执行第三操作 的情况下,向第一语音交互装置发送同步请求;在操作S512,接收第一语音交互装置响应于同步请求发送的第三操作的执行进度信息。In operation S510, the operation of the first voice interaction device is monitored, and it is determined whether the first voice interaction device performs a third operation. In operation S511, when the first voice interaction device performs the third operation, a synchronization request is sent to the first voice interaction device; in operation S512, the execution progress of the third operation sent by the first voice interaction device in response to the synchronization request is received information.
根据本公开的实施例,在确定了第一语音交互装置,且第一语音交互装置切换至唤醒状态后,控制装置例如还可以实时的监听所述第一语音交互装置的操作,并实时的确定监听到的操作是否为第三操作。其中第三操作具体可以是响应于用户的语音输入,执行所需的时长大于预设时长的操作,例如可以是播报类的操作,例如播放音乐、朗读电子书、播放广播或播放视频等;或者具有复杂流程类的操作,例如网上购物等。According to an embodiment of the present disclosure, after the first voice interaction device is determined, and the first voice interaction device is switched to the awake state, the control device may, for example, monitor the operation of the first voice interaction device in real time, and determine in real time Whether the monitored operation is the third operation. The third operation may specifically be an operation that takes longer than a preset time to execute in response to the user's voice input, for example, it may be a broadcast operation, such as playing music, reading e-books, playing radio, or playing videos; or Operations with complex processes, such as online shopping.
发送同步请求后,即可接收到第一语音交互装置响应于该同步请求发送的第三操作的执行进度信息,该执行进度信息可以是该第三操作已执行的时长,或者该第三操作已执行的时长占该第三操作执行所需的总时长的比例等。根据本公开的实施例,在接收到该第三操作的执行进度信息后,例如还可以存储并更新已存储的进度信息。After the synchronization request is sent, the execution progress information of the third operation sent by the first voice interaction device in response to the synchronization request can be received. The execution progress information may be the length of time the third operation has been executed, or the third operation has been executed. The ratio of the execution time to the total time required to execute the third operation, etc. According to an embodiment of the present disclosure, after receiving the execution progress information of the third operation, for example, the stored progress information may also be stored and updated.
图5B示意性示出了图5A所示的控制装置的操作方法的应用场景图;图5C示意性示出了根据本公开第五实施例的控制装置的操作方法的流程图。Fig. 5B schematically shows an application scenario diagram of the operation method of the control device shown in Fig. 5A; Fig. 5C schematically shows a flowchart of the operation method of the control device according to the fifth embodiment of the present disclosure.
如图5B所示,本公开的实施例的控制装置的操作方法例如可以应用于家庭场景中,其中,在客厅501、卧室502、卧室503和卧室504中均配置有智能音箱,则图2A中描述的多个语音交互设备即可以是该客厅501、卧室502、卧室503和卧室504中配置的智能音箱,则当用户在客厅501中说出包括唤醒词的语音指令时,该四个智能音箱即可采集到第一语音输入,并通过操作S201~操作S204,将客厅501中的智能音箱唤醒。As shown in FIG. 5B, the operation method of the control device of the embodiment of the present disclosure can be applied to a home scene, for example, in which smart speakers are configured in the living room 501, the bedroom 502, the bedroom 503, and the bedroom 504, then in FIG. 2A The multiple voice interaction devices described may be smart speakers configured in the living room 501, bedroom 502, bedroom 503, and bedroom 504. When the user speaks a voice command including a wake-up word in the living room 501, the four smart speakers Then, the first voice input can be collected, and the smart speaker in the living room 501 can be awakened through operations S201 to S204.
根据本公开的实施例,当用户自图5B中左侧图所示的客厅501位置移动到图5B中右侧图所示的卧室502时,用户一般会需要将卧室502中的智能音箱唤醒,而将客厅501中的智能音箱休眠,因此,会再此说出包括唤醒词的语音指令,则多个语音交互装置中能够采集到该唤醒词的语音输入的装置会向控制装置再次发送参数信息。因此,本公开实施例的控制装置的操作方法除了图2A所示的操作S201~操作S204外,例如还可以包括图5C所示的操作S513:在再次接收到多个语音交互装置 中至少一个语音交互装置分别发送的至少一个参数信息的情况下,重新确定第一语音交互装置。具体可以是重复执行操作S201~操作S203来重新确定第一语音交互装置。在图5B的应用场景中,重新确定的第一语音交互装置即为卧室502中的智能音箱,类似的,在重新确定该第一语音交互装置后,该方法还可以向重新确定的第一语音交互装置(卧室502中的智能音箱)发送唤醒指令,以使重新确定的第一语音交互装置切换至唤醒状态,而向除该重新确定的第一语音交互装置外的接收到该第一语音输入的其他装置发送非唤醒指令,以使之前确定的第一语音交互装置(例如客厅501中的智能音箱)及其他装置切换至非唤醒状态。According to an embodiment of the present disclosure, when the user moves from the position of the living room 501 shown in the left side diagram in FIG. 5B to the bedroom 502 shown in the right side diagram in FIG. 5B, the user generally needs to wake up the smart speaker in the bedroom 502. The smart speaker in the living room 501 is put to sleep, so the voice command including the wake-up word will be spoken again, and the device that can collect the voice input of the wake-up word among the multiple voice interaction devices will send parameter information to the control device again . Therefore, in addition to operations S201 to S204 shown in FIG. 2A, the operation method of the control device of the embodiment of the present disclosure may also include operation S513 shown in FIG. 5C: when at least one voice of the multiple voice interaction devices is received again In the case of at least one piece of parameter information respectively sent by the interaction device, the first voice interaction device is re-determined. Specifically, operations S201 to S203 may be repeatedly performed to re-determine the first voice interaction device. In the application scenario of FIG. 5B, the re-determined first voice interaction device is the smart speaker in the bedroom 502. Similarly, after the first voice interaction device is re-determined, the method can also send a message to the re-determined first voice The interactive device (the smart speaker in the bedroom 502) sends a wake-up instruction to switch the re-determined first voice interaction device to the wake-up state, and the first voice input is received to other than the re-determined first voice interaction device The other devices in the send a non-wake-up instruction to switch the previously determined first voice interaction device (such as the smart speaker in the living room 501) and other devices to the non-wake state.
根据本公开的实施例,在用户再次提供包括唤醒词的语音指令时,之前确定的第一语音交互装置例如还可能由于距离用户过远而采集不到再次提供的语音指令。此种情况下,控制装置的操作方法也可以在重新确定第一语音交互装置后,向当前处于唤醒状态的第一语音交互装置发送休眠指令,以使之前的第一语音交互装置切换至休眠状态,避免额外电能的消耗。According to an embodiment of the present disclosure, when the user provides the voice instruction including the wake-up word again, the previously determined first voice interaction device may also fail to collect the voice instruction provided again because of being too far away from the user. In this case, the operating method of the control device may also send a sleep instruction to the first voice interaction device currently in the awake state after re-determining the first voice interaction device, so that the previous first voice interaction device switches to the sleep state , To avoid the consumption of additional power.
根据本公开的实施例,在用户指示第一语音交互装置执行图5A描述的第三操作,即用户听音乐,或者听广播等持续时间较长的内容时,在自客厅501移动至卧室502后,在唤醒卧室502中的智能音箱,休眠客厅501中的智能音箱的同时,会比较希望卧室502中的智能音箱能够接着客厅501中的智能音箱的执行进度继续执行播放等操作,因此,本公开实施例的控制装置的操作方法还可以包括图5C描述的操作S514~操作S515。According to an embodiment of the present disclosure, when the user instructs the first voice interaction device to perform the third operation described in FIG. 5A, that is, the user listens to music or listens to content with a long duration such as broadcasting, after moving from the living room 501 to the bedroom 502 When waking up the smart speaker in the bedroom 502 and sleeping the smart speaker in the living room 501, it is more hoped that the smart speaker in the bedroom 502 can continue to perform operations such as playing following the execution progress of the smart speaker in the living room 501. Therefore, the present disclosure The operation method of the control device of the embodiment may further include operations S514 to S515 described in FIG. 5C.
在操作S514,接收重新确定后的第一语音交互装置在采集到用户的第四语音输入的情况下发送的获取请求;在操作S515,响应于重新确定后的第一语音交互装置的获取请求,向重新确定后的第一语音交互装置发送所述执行进度信息。In operation S514, an acquisition request sent by the re-determined first voice interaction device when the user’s fourth voice input is collected is received; in operation S515, in response to the re-determined acquisition request of the first voice interaction device, Send the execution progress information to the re-determined first voice interaction device.
其中,考虑到用户希望卧室502中的智能音箱能够接着客厅501中的智能音箱的执行进度继续执行播放等操作,且根据图5A描述的控制装置的操作方法可知,控制装置实时接收有第三操作的执行进度信息,因此,通过上述操作S514~操作S515,在用户可以发出“继续”、或“继 续播放”等指令时,卧室502中的智能音箱可以在采集到与该指令相应的第四语音输入的情况下向控制装置发送获取请求,以获取第三操作的执行进度信息,并根据获取的执行进度信息继续执行第三操作。Among them, considering that the user hopes that the smart speaker in the bedroom 502 can continue to perform operations such as playing following the execution progress of the smart speaker in the living room 501, and according to the operation method of the control device described in FIG. 5A, it can be seen that the control device receives the third operation in real time. Therefore, through the above operations S514 to S515, when the user can issue an instruction such as "continue" or "continue playing", the smart speaker in the bedroom 502 can collect the fourth voice corresponding to the instruction In the case of input, an acquisition request is sent to the control device to acquire the execution progress information of the third operation, and the third operation is continued to be executed according to the acquired execution progress information.
综上可知,本公开实施例的控制装置的操作方法响应于用户的语音输入,可以由重新确定的第一语音交互装置继续执行之前的第一语音交互装置未完成的第三操作,从而可以使得多个语音交互装置构成的智能系统可以为用户提供流畅的服务,避免部分操作的重复执行,因此可以避免浪费用户时间的缺陷,有效提高用户体验。In summary, the operating method of the control device of the embodiment of the present disclosure responds to the user's voice input, and the re-determined first voice interaction device can continue to perform the third operation that was not completed by the previous first voice interaction device, so that The intelligent system composed of multiple voice interaction devices can provide users with smooth services and avoid the repeated execution of some operations, thereby avoiding the defect of wasting user time and effectively improving user experience.
图6A示意性示出了根据本公开第一实施例的语音交互装置的操作方法的流程图,图6B示意性示出了根据本公开第二实施例的语音交互装置的操作方法的流程图。Fig. 6A schematically shows a flow chart of the operation method of the voice interaction device according to the first embodiment of the present disclosure, and Fig. 6B schematically shows the flow chart of the operation method of the voice interaction device according to the second embodiment of the present disclosure.
如图6A所示,该语音交互装置的操作方法包括操作S601。As shown in FIG. 6A, the operation method of the voice interaction device includes operation S601.
在操作S601,向控制装置发送参数信息。In operation S601, parameter information is transmitted to the control device.
根据本公开的实施例,该操作S601具体可以是在语音交互装置采集到用户的第一语音输入的情况下执行的,以使得控制装置根据该参数信息确定语音交互装置是否为第一语音交互装置。According to an embodiment of the present disclosure, the operation S601 may be specifically executed when the voice interaction device collects the user's first voice input, so that the control device determines whether the voice interaction device is the first voice interaction device according to the parameter information .
根据本公开的实施例,第一语音输入具体可以包括预定语音输入和表征用户需求的语音输入,具体详见对图2A中操作S201的描述部分,在此不再赘述。According to an embodiment of the present disclosure, the first voice input may specifically include a predetermined voice input and a voice input that characterizes user needs. For details, please refer to the description of operation S201 in FIG. 2A, which will not be repeated here.
根据本公开的实施例,所述的参数信息例如可以包括语音交互装置的性能参数。此种情况下,如图6B所示,语音交互装置的操作方法在执行操作S601之前,例如还可以包括操作S604,获取性能参数。其中,性能参数具体可以是语音交互装置自本地获取的,或自向该语音交互装置提供服务的服务器或云端获取的。According to an embodiment of the present disclosure, the parameter information may include performance parameters of the voice interaction device, for example. In this case, as shown in FIG. 6B, before performing operation S601, the operation method of the voice interaction apparatus may further include operation S604 to obtain performance parameters. Among them, the performance parameters may be obtained locally by the voice interaction device, or obtained from a server or cloud that provides services to the voice interaction device.
根据本公开的实施例,为了便于控制装置能够更精准地确定唯一的第一语音交互装置,该参数信息例如还可以包括用户的位置信息。则如图6B,该操作方法在执行操作S601之前,还可以包括操作S605:根据采集的用户的第一语音输入,确定用户的位置信息。其中确定用户的位置信息的方法详见对图2C中位置信息的描述,在此不再详述。According to the embodiments of the present disclosure, in order to facilitate the control device to more accurately determine the unique first voice interaction device, the parameter information may further include location information of the user, for example. As shown in FIG. 6B, before performing operation S601, the operation method may further include operation S605: determining the location information of the user according to the collected first voice input of the user. The method for determining the location information of the user is detailed in the description of the location information in FIG. 2C, which will not be described in detail here.
根据本公开的实施例,参数信息例如不仅包括性能参数,还包括用 户的位置信息,则如图6B所示,该语音交互装置的操作方法同时包括操作S604和操作S605。该操作S604可以在操作S605之前或之后执行,本公开对此不作限定,只要操作S604~操作S605均在操作S601之前执行即可。According to the embodiment of the present disclosure, the parameter information includes not only performance parameters, but also user location information. As shown in FIG. 6B, the operation method of the voice interaction device includes operation S604 and operation S605 at the same time. The operation S604 may be performed before or after the operation S605, which is not limited in the present disclosure, as long as the operations S604 to S605 are all performed before the operation S601.
根据本公开的实施例,考虑到在语音交互装置是第一语音交互装置的情况下,会接收到控制装置通过操作S204发送的唤醒指令。则如图6A所示,该语音交互装置的操作方法还可以包括操作S602,接收控制装置发送的唤醒指令,以响应于唤醒指令处于唤醒状态。该操作S602具体可以是:在接收到唤醒指令时,响应于唤醒指令,将当前状态切换至唤醒状态,以与用户进行交互。According to the embodiment of the present disclosure, it is considered that when the voice interaction device is the first voice interaction device, the wake-up instruction sent by the control device through operation S204 is received. As shown in FIG. 6A, the operation method of the voice interaction device may further include operation S602, receiving a wake-up instruction sent by the control device, so as to be in the wake-up state in response to the wake-up instruction. The operation S602 may specifically be: upon receiving the wake-up instruction, in response to the wake-up instruction, switch the current state to the wake-up state to interact with the user.
根据本公开的实施例,考虑到在语音交互装置是第一语音交互装置的情况下,会接收到控制装置通过操作S204发送的非唤醒指令。则如图6A所示,该语音交互装置的操作方法还可以包括操作S603,接收控制装置发送的非唤醒指令,以响应于非唤醒指令处于非唤醒状态。该操作S603具体可以是:在接收到非唤醒指令时,响应于非唤醒指令,先确定当前状态是否为非唤醒状态,若不是,则响应于非唤醒指令,将当前状态切换至唤醒状态,以避免对用于的语音指令做出响应。According to an embodiment of the present disclosure, it is considered that when the voice interaction device is the first voice interaction device, a non-wake-up instruction sent by the control device through operation S204 is received. As shown in FIG. 6A, the operation method of the voice interaction device may further include operation S603, receiving a non-wake-up instruction sent by the control device, so as to be in a non-wake-up state in response to the non-wake-up instruction. The operation S603 may specifically be: when a non-wake-up command is received, in response to the non-wake-up command, first determine whether the current state is a non-wake-up state, if not, switch the current state to the wake-up state in response to the non-wake-up command to Avoid responding to voice commands used.
其中,在语音交互装置不是第一语音交互装置的情况下,接收到的非唤醒指令具体例如可以包括睡眠指令或休眠指令。相应地,为了避免语音交互装置在执行第一操作时,因接收到休眠指令而停止执行第一操作。操作S601发送的参数信息例如还可以包括有语音交互装置的操作信息。因此在操作信息表征语音交互装置执行第一操作的情况下,接收的非唤醒指令为控制装置通过图2C中的操作S224发送的睡眠指令,响应于该睡眠指令语音交互装置处于睡眠状态,即将当前状态切换至睡眠状态。在操作信息表征语音交互装置未执行第一操作的情况下,接收的非唤醒指令为控制装置通过图2C中的操作S224发送的休眠指令,响应于该休眠指令语音交互装置处于休眠状态,即将当前状态切换至休眠状态。其中,睡眠状态是指能够执行第一操作但对采集的用户的语音输入不作响应的状态;休眠状态指不执行任何操作的状态。Wherein, when the voice interaction device is not the first voice interaction device, the received non-wake-up instruction may specifically include a sleep instruction or a hibernation instruction, for example. Correspondingly, in order to prevent the voice interaction device from stopping performing the first operation due to receiving the sleep instruction when performing the first operation. The parameter information sent in operation S601 may also include operation information of the voice interaction device, for example. Therefore, when the operation information indicates that the voice interaction device performs the first operation, the received non-wake-up instruction is the sleep instruction sent by the control device through operation S224 in FIG. 2C. In response to the sleep instruction, the voice interaction device is in the sleep state, i.e. The state switches to sleep state. When the operation information indicates that the voice interaction device does not perform the first operation, the received non-wake-up instruction is the sleep instruction sent by the control device through operation S224 in FIG. 2C. In response to the sleep instruction, the voice interaction device is in a sleep state, i.e. The state switches to sleep state. Among them, the sleep state refers to a state where the first operation can be performed but does not respond to the collected user's voice input; the sleep state refers to a state where no operation is performed.
综上可知,本公开实施例的语音交互装置由控制装置控制切换其工 作状态,而并非在采集到用户的唤醒词后直接切换工作状态至唤醒状态,且在其切换至唤醒状态的情况下,其他语音交互装置处于非唤醒状态,从而可以避免语音交互装置的工作环境嘈杂混乱的缺陷,并因此提高用户体验。且语音交互装置不是第一语音交互装置时,在执行第一操作时,接收到的控制装置的指令为能够继续执行第一操作的睡眠指令,从而可以在一定程度上提高用户体验。In summary, the voice interaction device of the embodiment of the present disclosure is controlled by the control device to switch its working state, instead of directly switching the working state to the wake-up state after the user's wake-up words are collected, and when it is switched to the wake-up state, Other voice interaction devices are in a non-awake state, so that the defect of a noisy and chaotic working environment of the voice interaction device can be avoided, and thus the user experience is improved. And when the voice interaction device is not the first voice interaction device, when the first operation is performed, the received instruction of the control device is a sleep instruction that can continue to perform the first operation, which can improve user experience to a certain extent.
图7A示意性示出了根据本公开第三实施例的语音交互装置的操作方法的流程图。Fig. 7A schematically shows a flowchart of an operation method of a voice interaction device according to a third embodiment of the present disclosure.
根据本公开的实施例,在该语音交互装置为第一语音交互装置的情况下,在切换至唤醒状态后,即可以与用户进行语音交互,以执行与用户的语音指令对应的操作。According to an embodiment of the present disclosure, when the voice interaction device is the first voice interaction device, after switching to the awake state, it can perform voice interaction with the user to perform an operation corresponding to the user's voice instruction.
根据本公开的实施例,考虑到用户的一些语音指令例如“关闭所有设备”、或者“我要出门了”,需要通过多个语音交互装置协同操作才能达到用户想要的效果,因此如图7A所示,本公开实施例的语音交互装置的操作方法在操作S602之后,还包括操作S706~操作S707。在操作S706,在采集到用户的第二语音输入的情况下,确定第二语音输入对应的第一语音信息是否为通用语音信息;以及在操作S707,在确定第二语音输入对应的第一语音信息是通用语音信息的情况下,向控制装置发送该第一语音信息,以使控制装置通过操作S305~操作S307将第一语音信息发送至采集到第一语音输入的多个语音交互装置,使多个语音交互装置执行与第二语音输入相对应的操作。According to the embodiment of the present disclosure, considering some voice commands of the user, such as "turn off all devices" or "I am going out", it is necessary to coordinate operations through multiple voice interaction devices to achieve the desired effect of the user, so as shown in Figure 7A As shown, the operation method of the voice interaction device in the embodiment of the present disclosure further includes operation S706 to operation S707 after operation S602. In operation S706, when the second voice input of the user is collected, it is determined whether the first voice information corresponding to the second voice input is general voice information; and in operation S707, it is determined whether the first voice information corresponding to the second voice input is When the information is general voice information, the first voice information is sent to the control device, so that the control device sends the first voice information to multiple voice interaction devices that have collected the first voice input through operations S305 to S307, so that The multiple voice interaction devices perform operations corresponding to the second voice input.
根据本公开的实施例,上述操作S707具体可以是先识别采集到的语音输入,例如可以是识别得到语音输入的关键词,在识别得到的关键词为预设关键词的情况下,确定采集到的语音输入是第二语音输入,然后确定该第二语音输入对应的第一语音信息是否为通用语音信息,具体可以是将第二语音输入对应的第一语音信息与语音交互装置中或云端系统中存储的通用语音信息列表进行比对,若第二语音输入对应的第一语音信息是通用语音信息列表中的语音信息的情况下,确定该第二语音输入对应的第一语音信息为通用语音信息;若第二语音输入对应的第一语音信息不是通用语音信息列表中的语音信息的情况下,确定该第二语音 输入对应的第一语音信息不是通用语音信息。其中第二语音输入和第一语音信息可以是参考图3中操作S305和操作S306中描述的第二语音输入和第一语音信息,在此不再赘述。According to an embodiment of the present disclosure, the above-mentioned operation S707 may specifically recognize the collected voice input first, for example, may recognize a keyword of the voice input, and if the recognized keyword is a preset keyword, determine the The voice input of is the second voice input, and then it is determined whether the first voice information corresponding to the second voice input is general voice information. Specifically, it can be the first voice information corresponding to the second voice input with the voice interaction device or cloud system The list of general voice information stored in the database is compared, and if the first voice information corresponding to the second voice input is the voice information in the general voice information list, it is determined that the first voice information corresponding to the second voice input is the general voice Information; if the first voice information corresponding to the second voice input is not the voice information in the general voice information list, it is determined that the first voice information corresponding to the second voice input is not general voice information. The second voice input and the first voice information may be the second voice input and the first voice information described in operation S305 and operation S306 in FIG. 3, and details are not described herein again.
根据本公开的实施例,通过上述操作S707将第二语音信息发送给控制装置后,控制装置即可以通过参考图3描述的操作S305~操作S307将第一语音信息发送给多个语音交互装置,以使多个语音交互装置通过执行与第二语音输入对应的操作,例如关机操作,来满足用户的需求,从而在保证交互场景不嘈杂混乱的同时,使得语音交互装置的响应操作更为符合用户需求。According to an embodiment of the present disclosure, after the second voice information is sent to the control device through the above operation S707, the control device can send the first voice information to multiple voice interaction devices through the operations S305 to S307 described with reference to FIG. 3, This allows multiple voice interaction devices to perform operations corresponding to the second voice input, such as a shutdown operation, to meet the user's needs, thereby ensuring that the interaction scene is not noisy and chaotic, and making the response operation of the voice interaction device more in line with the user demand.
图7B示意性示出了根据本公开第四实施例的语音交互装置的操作方法的流程图。Fig. 7B schematically shows a flowchart of the operation method of the voice interaction device according to the fourth embodiment of the present disclosure.
根据本公开的实施例,无论语音交互装置是否为第一语音交互装置,该语音交互装置均可接收控制装置通过操作S307发送的语音信息。相应地,如图7B所示,本公开实施例的语音交互装置的操作方法还可以包括操作S708~操作S709。在操作S708,接收控制装置发送的属于通用语音信息的第二语音信息;在操作S709,根据第二语音信息,执行第二操作,第二操作与第二语音信息对应的语音输入相对应。According to an embodiment of the present disclosure, regardless of whether the voice interaction device is the first voice interaction device, the voice interaction device can receive the voice information sent by the control device through operation S307. Correspondingly, as shown in FIG. 7B, the operation method of the voice interaction device of the embodiment of the present disclosure may further include operation S708 to operation S709. In operation S708, the second voice information belonging to the general voice information sent by the control device is received; in operation S709, a second operation is performed according to the second voice information, and the second operation corresponds to a voice input corresponding to the second voice information.
其中,在语音交互装置是第一语音交互装置的情况下,在图7A描述的操作S707之后执行操作S708~操作S709,此时,第二语音信息即为操作S707中发送的第一语音信息。In the case where the voice interaction device is the first voice interaction device, operations S708 to S709 are performed after operation S707 described in FIG. 7A. At this time, the second voice information is the first voice information sent in operation S707.
在语音交互装置不是第一语音交互装置的情况下,在图6A描述的操作S603之后执行操作S708~操作S709。此时,第二语音信息即为控制装置通过操作S306确定的为通用语音信息的第一语音信息。In the case where the voice interaction device is not the first voice interaction device, operations S708 to S709 are performed after operation S603 described in FIG. 6A. At this time, the second voice information is the first voice information determined by the control device to be general voice information through operation S306.
图8示意性示出了根据本公开第五实施例的语音交互装置的操作方法的流程图。Fig. 8 schematically shows a flowchart of an operation method of a voice interaction device according to a fifth embodiment of the present disclosure.
在语音交互装置是第一语音交互装置,并切换至唤醒状态后,可以响应于用户的语音输入或自行判断后确定用户此时是否还需要与该语音交互装置进行交互。其中,确定用户是否还需要进行交互的实现方式详见上文描述,在此不再赘述。After the voice interaction device is the first voice interaction device and is switched to the awake state, it can be determined whether the user still needs to interact with the voice interaction device in response to the user's voice input or self-determination. Among them, the implementation of determining whether the user still needs to interact is described in detail above, and will not be repeated here.
此种情况下,考虑到在语音交互装置和/或其他语音交互装置接收第 一语音输入之前可能在执行播放广播或音乐等操作,而用户往往希望在结束与语音交互装置的交互后,语音交互装置和/或其他语音交互装置能够继续播放广播或音乐等操作,因此,在语音交互装置确定用户不再需要进行交互后,可以向控制装置发送表征结束交互的恢复请求。因此如图8所示,本公开实施例的操作方法在操作S602之后,还可以包括操作S810,在采集到用户的第三语音输入或在预设时段内未采集到用户的语音输入的情况下,向控制装置发送恢复请求。In this case, considering that the voice interaction device and/or other voice interaction device may perform operations such as playing broadcast or music before receiving the first voice input, and the user often hopes that the voice interaction device will end the interaction with the voice interaction device. The device and/or other voice interaction devices can continue operations such as broadcasting or music. Therefore, after the voice interaction device determines that the user no longer needs to interact, it may send a resume request that characterizes the end of the interaction to the control device. Therefore, as shown in FIG. 8, after operation S602, the operation method of the embodiment of the present disclosure may further include operation S810, when the user's third voice input is collected or the user's voice input is not collected within a preset time period. To send a recovery request to the control device.
相应地,控制装置在接收到恢复请求后,即可通过操作S408~操作S409向多个语音交互装置发送恢复指令,使多个语音交互装置恢复至采集用户的所述第一语音输入之前的状态。具体可以是使多个语音交互装置恢复至采集第一语音输入之前的播放音乐或广播等的状态,以向用户继续播放广播或音乐,满足用户需求。Correspondingly, after receiving the recovery request, the control device can send recovery instructions to multiple voice interaction devices through operations S408 to S409 to restore the multiple voice interaction devices to the state before collecting the user's first voice input . Specifically, the multiple voice interaction devices can be restored to the state of playing music or radio before collecting the first voice input, so as to continue to play radio or music to the user to meet the user's needs.
因此,如图8所示,无论语音交互装置是否为第一语音交互装置,均可执行操作S811,接收控制装置的恢复指令,以响应于恢复指令将当前状态切换至采集第一语音输入之前的状态。其中,在语音交互装置是第一语音交互装置的情况下,操作S811在操作S810之后执行。在语音交互装置不是第一语音交互装置的情况下,操作S811在操作S603之后执行。Therefore, as shown in FIG. 8, regardless of whether the voice interaction device is the first voice interaction device, operation S811 can be performed to receive a recovery instruction from the control device to switch the current state to the one before the first voice input is collected in response to the recovery instruction. status. Wherein, in a case where the voice interaction device is the first voice interaction device, operation S811 is performed after operation S810. In a case where the voice interaction device is not the first voice interaction device, operation S811 is performed after operation S603.
图9A示意性示出了根据本公开第六实施例的语音交互装置的操作方法的流程图。Fig. 9A schematically shows a flowchart of an operation method of a voice interaction device according to a sixth embodiment of the present disclosure.
根据本公开的实施例,在语音交互装置通过操作S602切换至唤醒状态后,如图9A所示,本公开实施例的语音交互装置的操作方法还包括操作S912,在执行第三操作时,响应于控制装置发送的同步请求,向控制装置实时地发送第三操作的执行进度信息,以将第三操作的执行进度实时地更新于控制装置,则在图5B描述的应用场景中,控制装置在重新确定第一语音交互装置后即可执行图5C描述的操作S514~操作S515,以使得该第一语音交互装置在控制装置的控制下切换至非唤醒状态后,可以由重新确定的第一语音交互装置继续执行该第三操作,使得多个语音交互装置构成的智能系统可以为用户提供流畅的服务,避免部分操作的重复执行,并因此可以避免浪费用户时间的缺陷,有效提高用 户体验,详细内容请参见对图5B~图5C的描述,在此不再赘述。According to the embodiment of the present disclosure, after the voice interaction device is switched to the awake state through operation S602, as shown in FIG. 9A, the operation method of the voice interaction device of the embodiment of the present disclosure further includes operation S912. When the third operation is performed, respond In the synchronization request sent by the control device, the execution progress information of the third operation is sent to the control device in real time to update the execution progress of the third operation to the control device in real time. In the application scenario described in FIG. 5B, the control device is After the first voice interaction device is re-determined, operations S514 to S515 described in FIG. 5C can be performed, so that after the first voice interaction device is switched to the non-awake state under the control of the control device, the re-determined first voice The interactive device continues to perform the third operation, so that the intelligent system composed of multiple voice interactive devices can provide users with smooth services, avoid the repeated execution of some operations, and therefore can avoid the defect of wasting user time, and effectively improve user experience. For the content, please refer to the description of FIG. 5B to FIG. 5C, which will not be repeated here.
图9B示意性示出了根据本公开第七实施例的语音交互装置的操作方法的流程图。Fig. 9B schematically shows a flowchart of the operation method of the voice interaction device according to the seventh embodiment of the present disclosure.
根据本公开的实施例,而在该语音交互装置不是第一语音交互装置,并由操作S603切换至休眠状态后,同样可以采集用户再次发出的第一语音指令对应的第一语音输入,且在再次接收到用户的第一语音输入的情况下,通过类似于操作S601的操作重新向控制装置发送参数信息,以使控制装置通过操作S513重新确定第一语音交互装置。According to the embodiment of the present disclosure, after the voice interaction device is not the first voice interaction device and is switched to the sleep state by operation S603, the first voice input corresponding to the first voice instruction issued by the user again can also be collected, and When the user's first voice input is received again, the parameter information is re-transmitted to the control device through an operation similar to operation S601, so that the control device re-determines the first voice interaction device through operation S513.
根据本公开的实施例,在语音交互装置是重新确定后的第一语音交互装置,并响应于控制装置发送的唤醒指令切换至唤醒状态的情况下,如图9B所示,语音交互装置的操作方法还可以包括操作S913~操作S914。According to the embodiment of the present disclosure, in the case where the voice interaction device is the first voice interaction device after re-determination and switches to the wake-up state in response to the wake-up instruction sent by the control device, as shown in FIG. 9B, the operation of the voice interaction device The method may further include operations S913 to S914.
在操作S913,在采集到用户的第四语音输入的情况下,向控制装置发送获取请求;在操作S914,接收控制装置响应于获取请求发送的第三操作的执行进度信息;以及在操作S915,根据所述执行进度信息,执行所述第三操作。其中,第四语音输入具体例如可以是与用户的语音指令“继续”或“继续播放”等对应的语音输入,具体详见对图5C中操作S514的描述,在此不再赘述。In operation S913, when the fourth voice input of the user is collected, an acquisition request is sent to the control device; in operation S914, the execution progress information of the third operation sent by the control device in response to the acquisition request is received; and in operation S915, Perform the third operation according to the execution progress information. Wherein, the fourth voice input may specifically be a voice input corresponding to the user's voice command "continue" or "continue playing". For details, please refer to the description of operation S514 in FIG. 5C, which will not be repeated here.
综上可知,本公开实施例的语音交互装置通过上述操作S 913~操作S915,可以使得重新确定的第一语音交互装置响应于用户的语音指令,继续执行原来的第一语音交互装置执行的第三操作,从而可避免部分操作的重复执行,并因此可以避免浪费用户时间的缺陷,有效提高用户体验。In summary, the voice interaction device of the embodiment of the present disclosure can make the re-determined first voice interaction device continue to execute the first voice interaction device executed by the original first voice interaction device in response to the user's voice instruction through the above operations S913 to S915. Three operations, thereby avoiding repeated execution of some operations, and thus avoiding the defect of wasting user time, and effectively improving user experience.
图10示意性示出了根据本公开实施例的控制装置的结构框图。Fig. 10 schematically shows a structural block diagram of a control device according to an embodiment of the present disclosure.
如图10所示,该控制装置1000包括参数信息接收模块1001、需求信息确定模块1002、第一装置确定模块1003和指令发送模块1004。As shown in FIG. 10, the control device 1000 includes a parameter information receiving module 1001, a demand information determining module 1002, a first device determining module 1003, and an instruction sending module 1004.
参数信息接收模块1001用于接收多个语音交互装置分别发送的多个参数信息(操作S201),该多个参数信息是多个语音交互装置在采集到用户同一时刻的第一语音输入的情况下发送的。The parameter information receiving module 1001 is configured to receive multiple parameter information respectively sent by multiple voice interaction devices (operation S201), and the multiple parameter information is when the multiple voice interaction devices collect the first voice input of the user at the same time Sent.
需求信息确定模块1002用于根据第一语音输入,确定用户的需求信息(操作S202)。其中,第一语音输入包括预定语音输入和表征用户需 求的语音输入。The requirement information determining module 1002 is configured to determine the requirement information of the user according to the first voice input (operation S202). Wherein, the first voice input includes a predetermined voice input and a voice input that characterizes the needs of the user.
第一装置确定模块1003用于根据多个参数信息及需求信息,确定多个语音交互装置中的第一语音交互装置为与用户交互的装置(操作S203)。The first device determining module 1003 is configured to determine the first voice interaction device among the multiple voice interaction devices as the device that interacts with the user according to multiple parameter information and demand information (operation S203).
指令发送模块1004用于向述第一语音交互装置发送唤醒指令,并向多个语音交互装置中除第一语音交互装置外的其他语音交互装置发送非唤醒指令(操作S204)。The instruction sending module 1004 is configured to send a wake-up instruction to the first voice interaction device, and send a non-wake-up instruction to voice interaction devices other than the first voice interaction device among the multiple voice interaction devices (operation S204).
根据本公开的实施例,上述参数信息包括语音交互装置的性能参数,第一装置确定模块1003具体用于:根据多个语音交互装置中每个语音装置的性能参数与需求信息的匹配关系,确定第一语音交互装置。According to an embodiment of the present disclosure, the above-mentioned parameter information includes performance parameters of the voice interaction device, and the first device determining module 1003 is specifically configured to: determine according to the matching relationship between the performance parameters of each voice device in the multiple voice interaction devices and the demand information The first voice interaction device.
根据本公开的实施例,上述参数信息还包括所述用户的位置信息。如图10所示,第一装置确定模块1003可以包括第一确定子模块10031和第二确定子模块10032。第一确定子模块10031用于确定多个语音交互装置中性能参数与用户的需求信息匹配的至少一个第二语音交互装置(操作S213)。第二确定子模块10032用于根据至少一个第二语音交互装置发送的参数信息中的用户的位置信息,确定至少一个第二语音交互装置中的一个为第一语音交互装置(操作S223)。其中,用户的位置信息表征用户相对于语音交互装置的位置。According to an embodiment of the present disclosure, the aforementioned parameter information further includes location information of the user. As shown in FIG. 10, the first device determination module 1003 may include a first determination sub-module 10031 and a second determination sub-module 10032. The first determining submodule 10031 is configured to determine at least one second voice interaction device whose performance parameters match the user's demand information among the multiple voice interaction devices (operation S213). The second determining submodule 10032 is configured to determine one of the at least one second voice interaction device as the first voice interaction device according to the user's location information in the parameter information sent by the at least one second voice interaction device (operation S223). Among them, the location information of the user represents the location of the user relative to the voice interaction device.
根据本公开的实施例,上述参数信息包括语音交互装置的操作信息,所述非唤醒指令包括睡眠指令和休眠指令。如图10所示,上述指令发送模块1004可以包括操作确定子模块10041和指令发送子模块10042。操作确定子模块10041用于根据其他语音交互装置的操作信息,确定其他语音交互装置在采集第一语音输入时是否执行第一操作(操作S214)。指令发送子模块10042用于向执行第一操作的其他语音交互装置发送睡眠指令,向未执行第一操作的其他语音交互装置发送休眠指令(操作S224)。其中,语音交互装置响应于所述睡眠指令处于睡眠状态,所述睡眠状态包括执行所述第一操作且对采集的所述用户的语音输入不作响应的状态;语音交互装置响应于所述休眠指令处于休眠状态,所述休眠状态包括不执行任何操作的状态。According to an embodiment of the present disclosure, the aforementioned parameter information includes operation information of the voice interaction device, and the non-wake-up instruction includes a sleep instruction and a sleep instruction. As shown in FIG. 10, the above-mentioned instruction sending module 1004 may include an operation determining sub-module 10041 and an instruction sending sub-module 10042. The operation determining sub-module 10041 is configured to determine whether the other voice interaction device performs the first operation when collecting the first voice input according to the operation information of the other voice interaction device (operation S214). The instruction sending submodule 10042 is configured to send a sleep instruction to other voice interaction devices that perform the first operation, and send a sleep instruction to other voice interaction devices that do not perform the first operation (operation S224). Wherein, the voice interaction device is in a sleep state in response to the sleep instruction, and the sleep state includes a state in which the first operation is performed and does not respond to the collected voice input of the user; the voice interaction device responds to the sleep instruction In a sleep state, the sleep state includes a state where no operation is performed.
根据本公开的实施例,如图10所示,上述控制装置1000例如还可 以包括第一语音信息接收模块1005、第一语音信息确定模块1006和第一语音信息发送模块1007。第一语音信息接收模块1005用于接收第一语音交互装置在采集到用户的第二语音输入的情况下发送的第一语音信息(操作S305),该第一语音信息与第二语音输入相对应。第一语音信息确定模块1006用于确定第一语音信息是否为通用语音信息(操作S306)。第一语音信息发送模块1007用于在第一语音信息为通用语音信息的情况下,向多个语音交互装置发送第一语音信息(操作S307),以使多个语音交互装置执行与第二语音输入相对应的操作。According to an embodiment of the present disclosure, as shown in Fig. 10, the above-mentioned control device 1000 may further include a first voice information receiving module 1005, a first voice information determining module 1006, and a first voice information sending module 1007, for example. The first voice information receiving module 1005 is configured to receive first voice information sent by the first voice interaction device when the user's second voice input is collected (operation S305), where the first voice information corresponds to the second voice input . The first voice information determining module 1006 is used to determine whether the first voice information is general voice information (operation S306). The first voice information sending module 1007 is configured to send the first voice information to multiple voice interaction devices (operation S307) when the first voice information is general voice information (operation S307), so that the multiple voice interaction devices execute the second voice Enter the corresponding operation.
根据本公开的实施例,如图10所示,上述控制装置1000例如还可以包括恢复请求接收模块1008,该恢复请求接收模块1008用于接收第一语音交互装置发送的恢复请求(操作S408),该恢复请求由第一语音交互装置在采集到用户的第三语音输入或预设时段内未采集到用户的语音输入的情况下发送;相应地,上述指令发送模块1004还用于在恢复请求接收模块1008接收到恢复请求的情况下,向多个语音交互装置发送恢复指令,以使多个语音交互装置恢复至采集第一语音输入之前的状态(操作S409)。According to an embodiment of the present disclosure, as shown in FIG. 10, the aforementioned control device 1000 may, for example, further include a restoration request receiving module 1008, which is configured to receive a restoration request sent by the first voice interaction device (operation S408), The restoration request is sent by the first voice interaction device when the user's third voice input is collected or the user's voice input is not collected within a preset time period; accordingly, the above-mentioned instruction sending module 1004 is also used to receive the restoration request When the module 1008 receives the recovery request, it sends a recovery instruction to the multiple voice interaction devices to restore the multiple voice interaction devices to the state before collecting the first voice input (operation S409).
根据本公开的实施例,如图10所示,上述控制装置1000例如还可以包括操作监听模块1009、同步请求发送模块1010和第一进度信息接收模块1011。操作监听模块1009用于在指令发送模块发送唤醒指令后,监听第一语音交互装置执行的操作,并确定第一语音交互装置是否执行第三操作(操作S510)。同步请求发送模块1010用于在确定第一语音交互装置执行第三操作的情况下,向第一语音交互装置发送同步请求(操作S511)。第一进度信息接收模块1011用于接收第一语音交互装置响应于同步请求发送的第三操作的执行进度信息(操作S512)。According to an embodiment of the present disclosure, as shown in FIG. 10, the aforementioned control device 1000 may further include an operation monitoring module 1009, a synchronization request sending module 1010, and a first progress information receiving module 1011, for example. The operation monitoring module 1009 is configured to monitor the operation performed by the first voice interaction device after the command sending module sends the wake-up instruction, and determine whether the first voice interaction device performs the third operation (operation S510). The synchronization request sending module 1010 is configured to send a synchronization request to the first voice interaction device when it is determined that the first voice interaction device performs the third operation (operation S511). The first progress information receiving module 1011 is configured to receive the execution progress information of the third operation sent by the first voice interaction device in response to the synchronization request (operation S512).
根据本公开的实施例,上述第一装置确定模块1003还用于在参数信息接收模块1001再次接收到多个语音交互装置中至少一个语音交互装置分别发送的至少一个参数信息的情况下,重新确定第一语音交互装置(操作S513)。如图10所示,上述控制装置1000例如还可以包括获取请求接收模块1012和第一进度信息发送模块1013,获取请求接收模块1012用于接收重新确定后的第一语音交互装置在采集到用户的第四语 音输入的情况下发送的获取请求(操作S514)。第一进度信息发送模块1013用于响应于重新确定后的第一语音交互装置的获取请求,向重新确定后的第一语音交互装置发送执行进度信息(操作S515)。According to an embodiment of the present disclosure, the above-mentioned first device determining module 1003 is further configured to re-determine when the parameter information receiving module 1001 again receives at least one parameter information respectively sent by at least one voice interaction device among the multiple voice interaction devices The first voice interaction device (operation S513). As shown in FIG. 10, the above-mentioned control device 1000 may further include, for example, an acquisition request receiving module 1012 and a first progress information sending module 1013. The acquisition request receiving module 1012 is configured to receive the re-determined first voice interaction device that has collected the user’s information. An acquisition request sent in the case of a fourth voice input (operation S514). The first progress information sending module 1013 is configured to send execution progress information to the re-determined first voice interaction device in response to the re-determined acquisition request of the first voice interaction device (operation S515).
根据本公开的实施例的模块、子模块、单元、子单元中的任意多个、或其中任意多个的至少部分功能可以在一个模块中实现。根据本公开实施例的模块、子模块、单元、子单元中的任意一个或多个可以被拆分成多个模块来实现。根据本公开实施例的模块、子模块、单元、子单元中的任意一个或多个可以至少被部分地实现为硬件电路,例如现场可编程门阵列(FPGA)、可编程逻辑阵列(PLA)、片上系统、基板上的系统、封装上的系统、专用集成电路(ASIC),或可以通过对电路进行集成或封装的任何其他的合理方式的硬件或固件来实现,或以软件、硬件以及固件三种实现方式中任意一种或以其中任意几种的适当组合来实现。或者,根据本公开实施例的模块、子模块、单元、子单元中的一个或多个可以至少被部分地实现为计算机程序模块,当该计算机程序模块被运行时,可以执行相应的功能。According to the embodiments of the present disclosure, any number of modules, submodules, units, and subunits, or at least part of the functions of any number of them, may be implemented in one module. Any one or more of the modules, sub-modules, units, and sub-units according to the embodiments of the present disclosure may be split into multiple modules for implementation. Any one or more of the modules, sub-modules, units, and sub-units according to the embodiments of the present disclosure may be at least partially implemented as a hardware circuit, such as a field programmable gate array (FPGA), a programmable logic array (PLA), System-on-chip, system-on-substrate, system-on-package, application-specific integrated circuit (ASIC), or hardware or firmware in any other reasonable way that integrates or encapsulates the circuit, or can be implemented by software, hardware, and firmware. Any one of these implementations or an appropriate combination of any of them can be implemented. Alternatively, one or more of the modules, sub-modules, units, and sub-units according to the embodiments of the present disclosure may be at least partially implemented as a computer program module, and the computer program module may perform corresponding functions when it is executed.
例如,参数信息接收模块1001、需求信息确定模块1002、第一装置确定模块1003、指令发送模块1004、第一语音信息接收模块1005、第一语音信息确定模块1006和第一语音信息发送模块1007、恢复请求接收模块1008、操作监听模块1009、同步请求发送模块1010、第一进度信息接收模块1011、获取请求接收模块1012、第一进度信息发送模块1013、第一确定子模块10031和第二确定子模块10032、操作确定子模块10041以及指令发送子模块10042中的任意多个可以合并在一个模块中实现,或者其中的任意一个模块可以被拆分成多个模块。或者,这些模块中的一个或多个模块的至少部分功能可以与其他模块的至少部分功能相结合,并在一个模块中实现。根据本公开的实施例,参数信息接收模块1001、需求信息确定模块1002、第一装置确定模块1003、指令发送模块1004、第一语音信息接收模块1005、第一语音信息确定模块1006和第一语音信息发送模块1007、恢复请求接收模块1008、操作监听模块1009、同步请求发送模块1010、第一进度信息接收模块1011、获取请求接收模块1012、第一进度信息发送模块1013、第一确定子模块10031 和第二确定子模块10032、操作确定子模块10041以及指令发送子模块10042中的至少一个可以至少被部分地实现为硬件电路,例如现场可编程门阵列(FPGA)、可编程逻辑阵列(PLA)、片上系统、基板上的系统、封装上的系统、专用集成电路(ASIC),或可以通过对电路进行集成或封装的任何其他的合理方式等硬件或固件来实现,或以软件、硬件以及固件三种实现方式中任意一种或以其中任意几种的适当组合来实现。或者,参数信息接收模块1001、需求信息确定模块1002、第一装置确定模块1003、指令发送模块1004、第一语音信息接收模块1005、第一语音信息确定模块1006和第一语音信息发送模块1007、恢复请求接收模块1008、操作监听模块1009、同步请求发送模块1010、第一进度信息接收模块1011、获取请求接收模块1012、第一进度信息发送模块1013、第一确定子模块10031和第二确定子模块10032、操作确定子模块10041以及指令发送子模块10042中的至少一个可以至少被部分地实现为计算机程序模块,当该计算机程序模块被运行时,可以执行相应的功能。For example, the parameter information receiving module 1001, the demand information determining module 1002, the first device determining module 1003, the instruction sending module 1004, the first voice information receiving module 1005, the first voice information determining module 1006, and the first voice information sending module 1007, Recovery request receiving module 1008, operation monitoring module 1009, synchronization request sending module 1010, first progress information receiving module 1011, acquisition request receiving module 1012, first progress information sending module 1013, first determining submodule 10031, and second determining submodule Any number of the module 10032, the operation determination sub-module 10041, and the instruction sending sub-module 10042 may be combined into one module for implementation, or any one of the modules may be split into multiple modules. Or, at least part of the functions of one or more of these modules may be combined with at least part of the functions of other modules and implemented in one module. According to the embodiment of the present disclosure, the parameter information receiving module 1001, the demand information determining module 1002, the first device determining module 1003, the instruction sending module 1004, the first voice information receiving module 1005, the first voice information determining module 1006, and the first voice Information sending module 1007, recovery request receiving module 1008, operation monitoring module 1009, synchronization request sending module 1010, first progress information receiving module 1011, acquisition request receiving module 1012, first progress information sending module 1013, first determining submodule 10031 And at least one of the second determining submodule 10032, the operation determining submodule 10041, and the instruction sending submodule 10042 may be at least partially implemented as a hardware circuit, such as a field programmable gate array (FPGA), a programmable logic array (PLA) , System-on-chip, system-on-substrate, system-on-package, application-specific integrated circuit (ASIC), or can be implemented by hardware or firmware such as any other reasonable way to integrate or package the circuit, or by software, hardware and firmware Any one of the three implementation methods or an appropriate combination of any of them can be implemented. Or, the parameter information receiving module 1001, the demand information determining module 1002, the first device determining module 1003, the instruction sending module 1004, the first voice information receiving module 1005, the first voice information determining module 1006, and the first voice information sending module 1007, Recovery request receiving module 1008, operation monitoring module 1009, synchronization request sending module 1010, first progress information receiving module 1011, acquisition request receiving module 1012, first progress information sending module 1013, first determining submodule 10031, and second determining submodule At least one of the module 10032, the operation determining sub-module 10041, and the instruction sending sub-module 10042 may be at least partially implemented as a computer program module, and when the computer program module is run, it may perform a corresponding function.
图11示意性示出了根据本公开实施例的语音交互装置的结构框图。Fig. 11 schematically shows a structural block diagram of a voice interaction device according to an embodiment of the present disclosure.
如图11所示,该语音交互装置1100包括参数信息发送模块1101、指令接收模块1102和状态切换模块1103。As shown in FIG. 11, the voice interaction device 1100 includes a parameter information sending module 1101, an instruction receiving module 1102, and a state switching module 1103.
参数信息发送模块1101用于在采集到用户的第一语音输入的情况下,向控制装置发送参数信息(操作S601),以确定语音交互装置是否为第一语音交互装置。其中,第一语音输入包括预定语音输入和表征用户需求的语音输入。The parameter information sending module 1101 is configured to send parameter information to the control device when the user's first voice input is collected (operation S601) to determine whether the voice interaction device is the first voice interaction device. Wherein, the first voice input includes a predetermined voice input and a voice input that characterizes user needs.
指令接收模块1102用于在语音交互装置是第一语音交互装置的情况下,接收控制装置发送的唤醒指令。状态切换模块1103用于在指令接收模块1102接收到唤醒指令的情况下,响应于唤醒指令,将当前状态切换为唤醒状态(操作S602)。或者,指令接收模块1102用于在语音交互装置不是第一语音交互装置的情况下,接收控制装置发送的非唤醒指令。状态切换模块1103用于在指令接收模块1102接收到非唤醒指令的情况下,响应于非唤醒指令,将当前状态切换为非唤醒状态(操作S603)。The instruction receiving module 1102 is configured to receive a wake-up instruction sent by the control device when the voice interaction device is the first voice interaction device. The state switching module 1103 is configured to switch the current state to the wake-up state in response to the wake-up instruction when the instruction receiving module 1102 receives the wake-up instruction (operation S602). Alternatively, the instruction receiving module 1102 is configured to receive a non-wake-up instruction sent by the control device when the voice interaction device is not the first voice interaction device. The state switching module 1103 is configured to switch the current state to the non-awakening state in response to the non-awakening instruction when the instruction receiving module 1102 receives the non-awakening instruction (operation S603).
根据本公开的实施例,上述参数信息包括所述语音交互装置的性能参数。如图11所示,上述语音交互装置1100例如还可以包括性能参数 获取模块1104,用于在参数信息发送模块1101向控制装置发送参数信息之前,获取性能参数(操作S604)。According to an embodiment of the present disclosure, the above parameter information includes performance parameters of the voice interaction device. As shown in FIG. 11, the above-mentioned voice interaction device 1100 may, for example, further include a performance parameter obtaining module 1104 for obtaining performance parameters before the parameter information sending module 1101 sends parameter information to the control device (operation S604).
根据本公开的实施例,上述参数信息还包括所述用户的位置信息。如图11所示,上述语音交互装置1100例如还可以包括位置信息确定模块1105,用于根据采集的用户的第一语音输入,确定用户的位置信息(操作S605)。其中,用户的位置信息表征用户相对于语音交互装置的位置。According to an embodiment of the present disclosure, the aforementioned parameter information further includes location information of the user. As shown in FIG. 11, the above-mentioned voice interaction device 1100 may further include, for example, a location information determining module 1105, configured to determine the location information of the user according to the collected first voice input of the user (operation S605). Among them, the location information of the user represents the location of the user relative to the voice interaction device.
根据本公开的实施例,上述参数信息包括语音交互装置的操作信息,所述非唤醒指令包括睡眠指令和休眠指令。在所述操作信息表征语音交互装置执行第一操作的情况下,上述指令接收模块1102接收到的非唤醒指令为睡眠指令,状态切换模块1103响应于睡眠指令将当前状态切换至睡眠状态,睡眠状态包括执行第一操作且对采集的所述用户的语音输入不作响应的状态。或者,在操作信息表征语音交互装置未执行第一操作的情况下,上述指令接收模块1102接收到的非唤醒指令为休眠指令,状态切换模块1103响应于休眠指令将当前状态切换至休眠状态,休眠状态包括不执行任何操作的状态。According to an embodiment of the present disclosure, the aforementioned parameter information includes operation information of the voice interaction device, and the non-wake-up instruction includes a sleep instruction and a sleep instruction. In the case where the operation information indicates that the voice interaction device performs the first operation, the non-wake-up instruction received by the instruction receiving module 1102 is a sleep instruction, and the state switching module 1103 switches the current state to the sleep state in response to the sleep instruction. It includes a state where the first operation is performed and the collected voice input of the user is not responded. Or, when the operation information indicates that the voice interaction device does not perform the first operation, the non-wake-up instruction received by the instruction receiving module 1102 is a sleep instruction, and the state switching module 1103 switches the current state to the sleep state in response to the sleep instruction. The state includes the state where no operation is performed.
根据本公开的实施例,如图11所示,上述语音交互装置1100例如还可以包括第二语音信息确定模块1106和第二语音信息发送模块1107。第二语音信息确定模块1106用于在处于唤醒状态,且采集到用户的第二语音输入的情况下,确定第二语音输入对应的第一语音信息是否为通用语音信息(操作S706)。第二语音信息发送模块1107用于在确定第一语音信息为通用语音信息的情况下,向控制装置发送第一语音信息(操作S707)。并且/或者,如图11所示,上述语音交互装置1100例如还可以包括第二语音信息接收模块1108和操作执行模块1109。第二语音信息接收模块1108用于接收控制装置发送的属于通用语音信息的第二语音信息(操作S708)。操作执行模块1109用于根据第二语音信息,执行第二操作,该第二操作与第二语音信息对应的语音输入相对应(操作S709)。According to an embodiment of the present disclosure, as shown in FIG. 11, the aforementioned voice interaction device 1100 may further include, for example, a second voice information determining module 1106 and a second voice information sending module 1107. The second voice information determining module 1106 is configured to determine whether the first voice information corresponding to the second voice input is general voice information when the user is in an awake state and the second voice input of the user is collected (operation S706). The second voice information sending module 1107 is configured to send the first voice information to the control device when it is determined that the first voice information is general voice information (operation S707). And/or, as shown in FIG. 11, the voice interaction apparatus 1100 may further include a second voice information receiving module 1108 and an operation execution module 1109, for example. The second voice information receiving module 1108 is configured to receive second voice information belonging to general voice information sent by the control device (operation S708). The operation execution module 1109 is configured to perform a second operation according to the second voice information, and the second operation corresponds to a voice input corresponding to the second voice information (operation S709).
根据本公开的实施例,如图11所示,上述语音交互装置1100例如还可以包括恢复请求发送模块1110,用于在处于唤醒状态、且采集到用户的第三语音输入或在预设时段内未采集到用户的语音输入的情况下,向控制装置发送恢复请求(操作S810)。并且/或者,指令接收模块1102 还用于接收控制装置发送的恢复指令;状态切换模块1103还用于响应于恢复指令,将当前状态切换至采集第一语音输入之前的状态(操作S811)。According to an embodiment of the present disclosure, as shown in FIG. 11, the above-mentioned voice interaction device 1100 may, for example, further include a recovery request sending module 1110, which is configured to be in an awake state and collect the user's third voice input or within a preset time period. If the user's voice input is not collected, a recovery request is sent to the control device (operation S810). And/or, the instruction receiving module 1102 is further configured to receive a recovery instruction sent by the control device; the state switching module 1103 is also configured to switch the current state to the state before the first voice input is collected in response to the recovery instruction (operation S811).
根据本公开的实施例,如图11所示,上述语音交互装置1100例如还可以包括第二进度信息发送模块1111,用于在处于唤醒状态,且执行第三操作的情况下,响应于控制装置发送的同步请求,向控制装置发送第三操作的执行进度信息(操作S912)。According to an embodiment of the present disclosure, as shown in FIG. 11, the above-mentioned voice interaction device 1100 may, for example, further include a second progress information sending module 1111, which is configured to respond to the control device when it is in an awake state and the third operation is performed. The sent synchronization request sends the execution progress information of the third operation to the control device (operation S912).
根据本公开的实施例,如图11所示,上述语音交互装置1100例如还可以包括获取请求发送模块1112、第二进度信息接收模块1113和操作执行模块。获取请求发送模块1112用于在处于所述唤醒状态、且采集到用户的第四语音输入的情况下,向控制装置发送获取请求(操作S913)。第二进度信息接收模块1113用于接收控制装置响应于获取请求发送的执行进度信息(操作S914)。操作执行模块用于根据所述执行进度信息,执行所述第三操作(操作S915)。其中,第三操作与第四语音输入相对应,操作执行模块具体可以是上文描述的操作执行模块1109。According to an embodiment of the present disclosure, as shown in FIG. 11, the above-mentioned voice interaction apparatus 1100 may further include an acquisition request sending module 1112, a second progress information receiving module 1113, and an operation execution module, for example. The acquisition request sending module 1112 is configured to send an acquisition request to the control device when the user is in the awake state and the fourth voice input of the user is collected (operation S913). The second progress information receiving module 1113 is configured to receive the execution progress information sent by the control device in response to the acquisition request (operation S914). The operation execution module is configured to execute the third operation according to the execution progress information (operation S915). The third operation corresponds to the fourth voice input, and the operation execution module may specifically be the operation execution module 1109 described above.
根据本公开的实施例的模块、子模块、单元、子单元中的任意多个、或其中任意多个的至少部分功能可以在一个模块中实现。根据本公开实施例的模块、子模块、单元、子单元中的任意一个或多个可以被拆分成多个模块来实现。根据本公开实施例的模块、子模块、单元、子单元中的任意一个或多个可以至少被部分地实现为硬件电路,例如现场可编程门阵列(FPGA)、可编程逻辑阵列(PLA)、片上系统、基板上的系统、封装上的系统、专用集成电路(ASIC),或可以通过对电路进行集成或封装的任何其他的合理方式的硬件或固件来实现,或以软件、硬件以及固件三种实现方式中任意一种或以其中任意几种的适当组合来实现。或者,根据本公开实施例的模块、子模块、单元、子单元中的一个或多个可以至少被部分地实现为计算机程序模块,当该计算机程序模块被运行时,可以执行相应的功能。According to the embodiments of the present disclosure, any number of modules, submodules, units, and subunits, or at least part of the functions of any number of them, may be implemented in one module. Any one or more of the modules, sub-modules, units, and sub-units according to the embodiments of the present disclosure may be split into multiple modules for implementation. Any one or more of the modules, sub-modules, units, and sub-units according to the embodiments of the present disclosure may be at least partially implemented as a hardware circuit, such as a field programmable gate array (FPGA), a programmable logic array (PLA), System-on-chip, system-on-substrate, system-on-package, application-specific integrated circuit (ASIC), or hardware or firmware in any other reasonable way that integrates or encapsulates the circuit, or can be implemented by software, hardware, and firmware. Any one of these implementations or an appropriate combination of any of them can be implemented. Alternatively, one or more of the modules, sub-modules, units, and sub-units according to the embodiments of the present disclosure may be at least partially implemented as a computer program module, and the computer program module may perform corresponding functions when it is executed.
例如,参数信息发送模块1101、指令接收模块1102、状态切换模块1103、性能参数获取模块1104、位置信息确定模块1105、第二语音信息确定模块1106、第二语音信息发送模块1107、第二语音信息接收模块1108、操作执行模块1109、恢复请求发送模块1110、第二进度信息发送 模块1111、获取请求发送模块1112以及第二进度信息接收模块1113中的任意多个可以合并在一个模块中实现,或者其中的任意一个模块可以被拆分成多个模块。或者,这些模块中的一个或多个模块的至少部分功能可以与其他模块的至少部分功能相结合,并在一个模块中实现。根据本公开的实施例,参数信息发送模块1101、指令接收模块1102、状态切换模块1103、性能参数获取模块1104、位置信息确定模块1105、第二语音信息确定模块1106、第二语音信息发送模块1107、第二语音信息接收模块1108、操作执行模块1109、恢复请求发送模块1110、第二进度信息发送模块1111、获取请求发送模块1112以及第二进度信息接收模块1113中的至少一个可以至少被部分地实现为硬件电路,例如现场可编程门阵列(FPGA)、可编程逻辑阵列(PLA)、片上系统、基板上的系统、封装上的系统、专用集成电路(ASIC),或可以通过对电路进行集成或封装的任何其他的合理方式等硬件或固件来实现,或以软件、硬件以及固件三种实现方式中任意一种或以其中任意几种的适当组合来实现。或者,参数信息发送模块1101、指令接收模块1102、状态切换模块1103、性能参数获取模块1104、位置信息确定模块1105、第二语音信息确定模块1106、第二语音信息发送模块1107、第二语音信息接收模块1108、操作执行模块1109、恢复请求发送模块1110、第二进度信息发送模块1111、获取请求发送模块1112以及第二进度信息接收模块1113中的至少一个可以至少被部分地实现为计算机程序模块,当该计算机程序模块被运行时,可以执行相应的功能。For example, the parameter information sending module 1101, the instruction receiving module 1102, the state switching module 1103, the performance parameter obtaining module 1104, the location information determining module 1105, the second voice information determining module 1106, the second voice information sending module 1107, the second voice information Any number of the receiving module 1108, the operation execution module 1109, the recovery request sending module 1110, the second progress information sending module 1111, the acquisition request sending module 1112, and the second progress information receiving module 1113 can be combined into one module for implementation, or Any one of these modules can be split into multiple modules. Or, at least part of the functions of one or more of these modules may be combined with at least part of the functions of other modules and implemented in one module. According to the embodiment of the present disclosure, the parameter information sending module 1101, the instruction receiving module 1102, the state switching module 1103, the performance parameter acquiring module 1104, the position information determining module 1105, the second voice information determining module 1106, and the second voice information sending module 1107 At least one of the second voice information receiving module 1108, the operation execution module 1109, the restoration request sending module 1110, the second progress information sending module 1111, the acquisition request sending module 1112, and the second progress information receiving module 1113 may be at least partially Implemented as a hardware circuit, such as field programmable gate array (FPGA), programmable logic array (PLA), system on chip, system on substrate, system on package, application specific integrated circuit (ASIC), or can be integrated by circuit Or encapsulated in any other reasonable way such as hardware or firmware, or implemented in any one of the three implementation ways of software, hardware, and firmware, or an appropriate combination of any of them. Or, the parameter information sending module 1101, the instruction receiving module 1102, the state switching module 1103, the performance parameter acquiring module 1104, the location information determining module 1105, the second voice information determining module 1106, the second voice information sending module 1107, the second voice information At least one of the receiving module 1108, the operation execution module 1109, the recovery request sending module 1110, the second progress information sending module 1111, the acquisition request sending module 1112, and the second progress information receiving module 1113 may be at least partially implemented as a computer program module When the computer program module is running, it can execute the corresponding function.
根据本公开的实施例,还提供了一种电子设备,该电子设备可以用于执行图2A~图5C描述的控制装置的操作方法,还可以用于执行参考图6A~图9B描述的语音交互装置的操作方法。相应地,该电子设备既包括参考图10描述的控制装置,也包括图11描述的语音交互装置,该电子设备可以是在参考图1描述的多个语音交互装置中的任意一个装置中集成控制装置形成的电子设备,或者,控制装置和语音交互装置可以是该电子设备中的两个功能模块,且该两个功能模块可以进行交互,在此不再赘述。According to an embodiment of the present disclosure, an electronic device is also provided, which can be used to perform the operation method of the control device described in FIGS. 2A to 5C, and can also be used to perform the voice interaction described with reference to FIGS. 6A to 9B How to operate the device. Correspondingly, the electronic device includes both the control device described with reference to FIG. 10 and the voice interaction device described in FIG. 11. The electronic device may be integrated and controlled in any one of the multiple voice interaction devices described with reference to FIG. The electronic device formed by the device, or the control device and the voice interaction device may be two functional modules in the electronic device, and the two functional modules can interact, which will not be repeated here.
图12示意性示出了根据本公开实施例的适于执行控制装置的操作 方法,或语音交互装置的操作方法的电子设备的方框图。图12示出的电子设备仅仅是一个示例,不应对本公开实施例的功能和使用范围带来任何限制。Fig. 12 schematically shows a block diagram of an electronic device suitable for performing an operating method of a control device or an operating method of a voice interaction device according to an embodiment of the present disclosure. The electronic device shown in FIG. 12 is only an example, and should not bring any limitation to the function and scope of use of the embodiments of the present disclosure.
如图12所示,根据本公开实施例的电子设备1200包括处理器1201,其可以根据存储在只读存储器(ROM)1202中的程序或者从存储部分1208加载到随机访问存储器(RAM)1203中的程序而执行各种适当的动作和处理。处理器1201例如可以包括通用微处理器(例如CPU)、指令集处理器和/或相关芯片组和/或专用微处理器(例如,专用集成电路(ASIC)),等等。处理器1201还可以包括用于缓存用途的板载存储器。处理器1201可以包括用于执行根据本公开实施例的方法流程的不同动作的单一处理单元或者是多个处理单元。As shown in FIG. 12, an electronic device 1200 according to an embodiment of the present disclosure includes a processor 1201, which can be loaded into a random access memory (RAM) 1203 according to a program stored in a read only memory (ROM) 1202 or from a storage part 1208 The program executes various appropriate actions and processing. The processor 1201 may include, for example, a general-purpose microprocessor (for example, a CPU), an instruction set processor and/or a related chipset and/or a special purpose microprocessor (for example, an application specific integrated circuit (ASIC)), and so on. The processor 1201 may also include on-board memory for caching purposes. The processor 1201 may include a single processing unit or multiple processing units for performing different actions of a method flow according to an embodiment of the present disclosure.
在RAM 1203中,存储有电子设备1200操作所需的各种程序和数据。处理器1201、ROM 1202以及RAM 1203通过总线1204彼此相连。处理器1201通过执行ROM 1202和/或RAM 1203中的程序来执行根据本公开实施例的方法流程的各种操作。需要注意,所述程序也可以存储在除ROM 1202和RAM 1203以外的一个或多个存储器中。处理器1201也可以通过执行存储在所述一个或多个存储器中的程序来执行根据本公开实施例的方法流程的各种操作。In the RAM 1203, various programs and data required for the operation of the electronic device 1200 are stored. The processor 1201, the ROM 1202, and the RAM 1203 are connected to each other through a bus 1204. The processor 1201 executes various operations of the method flow according to the embodiments of the present disclosure by executing programs in the ROM 1202 and/or RAM 1203. It should be noted that the program may also be stored in one or more memories other than the ROM 1202 and RAM 1203. The processor 1201 may also execute various operations of the method flow according to the embodiments of the present disclosure by executing programs stored in the one or more memories.
根据本公开的实施例,电子设备1200还可以包括输入/输出(I/O)接口1205,输入/输出(I/O)接口1205也连接至总线1204。电子设备1200还可以包括连接至I/O接口1205的以下部件中的一项或多项:包括键盘、鼠标等的输入部分1206;包括诸如阴极射线管(CRT)、液晶显示器(LCD)等以及扬声器等的输出部分1207;包括硬盘等的存储部分1208;以及包括诸如LAN卡、调制解调器等的网络接口卡的通信部分1209。通信部分1209经由诸如因特网的网络执行通信处理。驱动器1210也根据需要连接至I/O接口1205。可拆卸介质1211,诸如磁盘、光盘、磁光盘、半导体存储器等等,根据需要安装在驱动器1210上,以便于从其上读出的计算机程序根据需要被安装入存储部分1208。According to an embodiment of the present disclosure, the electronic device 1200 may further include an input/output (I/O) interface 1205, and the input/output (I/O) interface 1205 is also connected to the bus 1204. The electronic device 1200 may also include one or more of the following components connected to the I/O interface 1205: an input part 1206 including a keyboard, a mouse, etc.; including a cathode ray tube (CRT), a liquid crystal display (LCD), etc., and An output section 1207 of a speaker and the like; a storage section 1208 including a hard disk and the like; and a communication section 1209 including a network interface card such as a LAN card, a modem, and the like. The communication section 1209 performs communication processing via a network such as the Internet. The driver 1210 is also connected to the I/O interface 1205 as needed. A removable medium 1211, such as a magnetic disk, an optical disk, a magneto-optical disk, a semiconductor memory, etc., is installed on the drive 1210 as needed, so that the computer program read therefrom is installed into the storage portion 1208 as needed.
根据本公开的实施例,根据本公开实施例的方法流程可以被实现为计算机软件程序。例如,本公开的实施例包括一种计算机程序产品,其 包括承载在计算机可读存储介质上的计算机程序,该计算机程序包含用于执行流程图所示的方法的程序代码。在这样的实施例中,该计算机程序可以通过通信部分1209从网络上被下载和安装,和/或从可拆卸介质1211被安装。在该计算机程序被处理器1201执行时,执行本公开实施例的系统中限定的上述功能。根据本公开的实施例,上文描述的系统、设备、装置、模块、单元等可以通过计算机程序模块来实现。According to the embodiment of the present disclosure, the method flow according to the embodiment of the present disclosure may be implemented as a computer software program. For example, an embodiment of the present disclosure includes a computer program product, which includes a computer program carried on a computer-readable storage medium, and the computer program contains program code for executing the method shown in the flowchart. In such an embodiment, the computer program may be downloaded and installed from the network through the communication part 1209, and/or installed from the removable medium 1211. When the computer program is executed by the processor 1201, the above-mentioned functions defined in the system of the embodiment of the present disclosure are executed. According to the embodiments of the present disclosure, the above-described systems, devices, devices, modules, units, etc. may be implemented by computer program modules.
本公开还提供了一种计算机可读存储介质,该计算机可读存储介质可以是上述实施例中描述的设备/装置/系统中所包含的;也可以是单独存在,而未装配入该设备/装置/系统中。上述计算机可读存储介质承载有一个或者多个程序,当上述一个或者多个程序被执行时,实现根据本公开实施例的方法。The present disclosure also provides a computer-readable storage medium. The computer-readable storage medium may be included in the device/device/system described in the above embodiment; or it may exist alone without being assembled into the device/ In the device/system. The aforementioned computer-readable storage medium carries one or more programs, and when the aforementioned one or more programs are executed, the method according to the embodiments of the present disclosure is implemented.
根据本公开的实施例,计算机可读存储介质可以是非易失性的计算机可读存储介质,例如可以包括但不限于:便携式计算机磁盘、硬盘、随机访问存储器(RAM)、只读存储器(ROM)、可擦式可编程只读存储器(EPROM或闪存)、便携式紧凑磁盘只读存储器(CD-ROM)、光存储器件、磁存储器件、或者上述的任意合适的组合。在本公开中,计算机可读存储介质可以是任何包含或存储程序的有形介质,该程序可以被指令执行系统、装置或者器件使用或者与其结合使用。例如,根据本公开的实施例,计算机可读存储介质可以包括上文描述的ROM 1202和/或RAM 1203和/或ROM 1202和RAM 1203以外的一个或多个存储器。According to an embodiment of the present disclosure, the computer-readable storage medium may be a non-volatile computer-readable storage medium, for example, may include but not limited to: portable computer disk, hard disk, random access memory (RAM), read-only memory (ROM) , Erasable programmable read-only memory (EPROM or flash memory), portable compact disk read-only memory (CD-ROM), optical storage device, magnetic storage device, or any suitable combination of the above. In the present disclosure, a computer-readable storage medium may be any tangible medium that contains or stores a program, and the program may be used by or in combination with an instruction execution system, apparatus, or device. For example, according to an embodiment of the present disclosure, the computer-readable storage medium may include one or more memories other than the ROM 1202 and/or RAM 1203 and/or ROM 1202 and RAM 1203 described above.
附图中的流程图和框图,图示了按照本公开各种实施例的系统、方法和计算机程序产品的可能实现的体系架构、功能和操作。在这点上,流程图或框图中的每个方框可以代表一个模块、程序段、或代码的一部分,上述模块、程序段、或代码的一部分包含一个或多个用于实现规定的逻辑功能的可执行指令。也应当注意,在有些作为替换的实现中,方框中所标注的功能也可以以不同于附图中所标注的顺序发生。例如,两个接连地表示的方框实际上可以基本并行地执行,它们有时也可以按相反的顺序执行,这依所涉及的功能而定。也要注意的是,框图或流程图中的每个方框、以及框图或流程图中的方框的组合,可以用执行规定的功能或操作的专用的基于硬件的系统来实现,或者可以用专用硬件与计 算机指令的组合来实现。The flowcharts and block diagrams in the accompanying drawings illustrate the possible implementation architecture, functions, and operations of the system, method, and computer program product according to various embodiments of the present disclosure. In this regard, each block in the flowchart or block diagram may represent a module, program segment, or part of code, and the above-mentioned module, program segment, or part of code contains one or more for realizing the specified logical function Executable instructions. It should also be noted that, in some alternative implementations, the functions marked in the block may also occur in a different order from the order marked in the drawings. For example, two blocks shown in succession can actually be executed substantially in parallel, or they can sometimes be executed in the reverse order, depending on the functions involved. It should also be noted that each block in the block diagram or flowchart, and the combination of blocks in the block diagram or flowchart, can be implemented by a dedicated hardware-based system that performs the specified functions or operations, or can be It is realized by a combination of dedicated hardware and computer instructions.
本领域技术人员可以理解,本公开的各个实施例和/或权利要求中记载的特征可以进行多种组合或/或结合,即使这样的组合或结合没有明确记载于本公开中。特别地,在不脱离本公开精神和教导的情况下,本公开的各个实施例和/或权利要求中记载的特征可以进行多种组合和/或结合。所有这些组合和/或结合均落入本公开的范围。Those skilled in the art can understand that the features described in the various embodiments of the present disclosure and/or the claims can be combined or/or combined in various ways, even if such combinations or combinations are not explicitly described in the present disclosure. In particular, without departing from the spirit and teachings of the present disclosure, the various embodiments of the present disclosure and/or the features described in the claims can be combined and/or combined in various ways. All these combinations and/or combinations fall within the scope of the present disclosure.
以上对本公开的实施例进行了描述。但是,这些实施例仅仅是为了说明的目的,而并非为了限制本公开的范围。尽管在以上分别描述了各实施例,但是这并不意味着各个实施例中的措施不能有利地结合使用。本公开的范围由所附权利要求及其等同物限定。不脱离本公开的范围,本领域技术人员可以做出多种替代和修改,这些替代和修改都应落在本公开的范围之内。The embodiments of the present disclosure have been described above. However, these examples are for illustrative purposes only, and are not intended to limit the scope of the present disclosure. Although the respective embodiments are described above, this does not mean that the measures in the respective embodiments cannot be advantageously used in combination. The scope of the present disclosure is defined by the appended claims and their equivalents. Without departing from the scope of the present disclosure, those skilled in the art can make various substitutions and modifications, and these substitutions and modifications should fall within the scope of the present disclosure.

Claims (21)

  1. 一种控制装置的操作方法,包括:An operation method of a control device includes:
    接收多个语音交互装置分别发送的多个参数信息,所述多个参数信息是所述多个语音交互装置在采集到用户同一时刻的第一语音输入的情况下发送的;Receiving multiple pieces of parameter information respectively sent by multiple voice interaction devices, the multiple pieces of parameter information being sent by the multiple voice interaction devices when the first voice input of the user at the same time is collected;
    根据所述第一语音输入,确定所述用户的需求信息;Determine the demand information of the user according to the first voice input;
    根据所述多个参数信息及所述需求信息,确定所述多个语音交互装置中的第一语音交互装置为与所述用户交互的装置;以及Determine, according to the multiple parameter information and the demand information, the first voice interaction device among the multiple voice interaction devices as the device that interacts with the user; and
    向所述第一语音交互装置发送唤醒指令,并向所述多个语音交互装置中除所述第一语音交互装置外的其他语音交互装置发送非唤醒指令,Sending a wake-up instruction to the first voice interaction device, and sending a non-wake-up instruction to voice interaction devices other than the first voice interaction device among the multiple voice interaction devices,
    其中,所述第一语音输入包括预定语音输入和表征所述用户需求的语音输入。Wherein, the first voice input includes a predetermined voice input and a voice input characterizing the needs of the user.
  2. 根据权利要求1所述的方法,其中,所述参数信息包括语音交互装置的性能参数,根据所述多个参数信息及所述需求信息,确定所述第一语音交互装置包括:The method according to claim 1, wherein the parameter information includes performance parameters of a voice interaction device, and determining the first voice interaction device according to the multiple parameter information and the demand information comprises:
    根据所述多个语音交互装置中每个语音装置的性能参数与所述需求信息的匹配关系,确定所述第一语音交互装置。The first voice interaction device is determined according to the matching relationship between the performance parameter of each voice device in the multiple voice interaction devices and the demand information.
  3. 根据权利要求2所述的方法,其中,所述参数信息还包括所述用户的位置信息,所述根据所述多个语音交互装置中每个语音装置的性能参数与所述需求信息的匹配关系,确定所述第一语音交互装置包括:The method according to claim 2, wherein the parameter information further includes location information of the user, and the matching relationship between the performance parameter of each voice device in the plurality of voice interaction devices and the demand information , Determining that the first voice interaction device includes:
    确定所述多个语音交互装置中性能参数与所述用户的需求信息匹配的至少一个第二语音交互装置;以及Determining at least one second voice interaction device of the plurality of voice interaction devices whose performance parameters match the user's demand information; and
    根据所述至少一个第二语音交互装置发送的参数信息中的用户的位置信息,确定所述至少一个第二语音交互装置中的一个为所述第一语音交互装置,Determining one of the at least one second voice interaction device as the first voice interaction device according to the user's location information in the parameter information sent by the at least one second voice interaction device,
    其中,所述用户的位置信息表征所述用户相对于语音交互装置的位置。Wherein, the location information of the user represents the location of the user relative to the voice interaction device.
  4. 根据权利要求1所述的方法,其中,所述参数信息包括语音交互装置的操作信息,所述非唤醒指令包括睡眠指令和休眠指令,向所述其他语音交互装置发送非唤醒指令包括:The method of claim 1, wherein the parameter information includes operation information of a voice interaction device, the non-wake-up instruction includes a sleep instruction and a hibernation instruction, and sending a non-wake-up instruction to the other voice interaction device includes:
    根据所述其他语音交互装置的操作信息,确定所述其他语音交互装置在采集所述第一语音输入时是否执行第一操作;以及According to the operation information of the other voice interaction device, determine whether the other voice interaction device performs the first operation when collecting the first voice input; and
    向执行所述第一操作的其他语音交互装置发送所述睡眠指令,向未执行所述第一操作的其他语音交互装置发送所述休眠指令,Sending the sleep instruction to other voice interaction devices that perform the first operation, and send the sleep instruction to other voice interaction devices that do not perform the first operation,
    其中,语音交互装置响应于所述睡眠指令处于睡眠状态,所述睡眠状态包括执行所述第一操作且对采集的所述用户的语音输入不作响应的状态;语音交互装置响应于所述休眠指令处于休眠状态,所述休眠状态包括不执行任何操作的状态。Wherein, the voice interaction device is in a sleep state in response to the sleep instruction, and the sleep state includes a state in which the first operation is performed and does not respond to the collected voice input of the user; the voice interaction device responds to the sleep instruction In a sleep state, the sleep state includes a state where no operation is performed.
  5. 根据权利要求1所述的方法,还包括:The method according to claim 1, further comprising:
    接收所述第一语音交互装置在采集到所述用户的第二语音输入的情况下发送的第一语音信息,所述第一语音信息与所述第二语音输入相对应;Receiving first voice information sent by the first voice interaction device when a second voice input of the user is collected, where the first voice information corresponds to the second voice input;
    确定所述第一语音信息是否为通用语音信息;以及Determining whether the first voice information is general voice information; and
    在所述第一语音信息为所述通用语音信息的情况下,向所述多个语音交互装置发送所述第一语音信息。If the first voice information is the general voice information, sending the first voice information to the multiple voice interaction apparatuses.
  6. 根据权利要求1所述的方法,还包括:The method according to claim 1, further comprising:
    接收所述第一语音交互装置发送的恢复请求,所述恢复请求由所述第一语音交互装置在采集到所述用户的第三语音输入或预设时段内未采集到所述用户的语音输入的情况下发送;以及Receive a recovery request sent by the first voice interaction device, where the recovery request is collected by the first voice interaction device within the third voice input of the user or the user's voice input is not collected within a preset time period Send in case of; and
    向所述多个语音交互装置发送恢复指令,以使所述多个语音交互装置恢复至采集所述第一语音输入之前的状态。Sending a recovery instruction to the multiple voice interaction devices to restore the multiple voice interaction devices to a state before collecting the first voice input.
  7. 根据权利要求1所述的方法,其中,在向所述第一语音交互装置发送唤醒指令之后,所述方法还包括:The method according to claim 1, wherein after sending a wake-up instruction to the first voice interaction device, the method further comprises:
    监听所述第一语音交互装置的操作,确定所述第一语音交互装置是否执行第三操作;Monitor the operation of the first voice interaction device, and determine whether the first voice interaction device performs a third operation;
    在所述第一语音交互装置执行所述第三操作的情况下,向所述第一语音交互装置发送同步请求;以及When the first voice interaction device performs the third operation, sending a synchronization request to the first voice interaction device; and
    接收所述第一语音交互装置响应于所述同步请求发送的所述第三操作的执行进度信息。Receiving execution progress information of the third operation sent by the first voice interaction apparatus in response to the synchronization request.
  8. 根据权利要求7所述的方法,还包括:The method according to claim 7, further comprising:
    在再次接收到所述多个语音交互装置中至少一个语音交互装置分别发送的至少一个参数信息的情况下,重新确定第一语音交互装置;In a case where at least one parameter information respectively sent by at least one voice interaction device among the multiple voice interaction devices is received again, re-determine the first voice interaction device;
    接收所述重新确定后的第一语音交互装置在采集到所述用户的第四语音输入的情况下发送的获取请求;以及Receiving the acquisition request sent by the re-determined first voice interaction device when the user's fourth voice input is collected; and
    响应于所述重新确定后的第一语音交互装置的获取请求,向所述重新确定后的第一语音交互装置发送所述执行进度信息。In response to the acquisition request of the re-determined first voice interaction device, sending the execution progress information to the re-determined first voice interaction device.
  9. 一种语音交互装置的操作方法,包括:An operation method of a voice interaction device includes:
    在采集到用户的第一语音输入的情况下,向控制装置发送参数信息,以确定所述语音交互装置是否为第一语音交互装置;When the user's first voice input is collected, sending parameter information to the control device to determine whether the voice interaction device is the first voice interaction device;
    在所述语音交互装置是第一语音交互装置的情况下,接收所述控制装置发送的唤醒指令,以响应于所述唤醒指令处于唤醒状态;In the case that the voice interaction device is the first voice interaction device, receiving a wake-up instruction sent by the control device to respond to the wake-up instruction to be in a wake-up state;
    在所述语音交互装置不是第一语音交互装置的情况下,接收所述控制装置发送的非唤醒指令,以响应于所述非唤醒指令处于非唤醒状态,其中,所述第一语音输入包括预定语音输入和表征所述用户需求的语音输入。In the case that the voice interaction device is not the first voice interaction device, receiving a non-wake-up instruction sent by the control device in response to the non-wake-up instruction being in a non-wake-up state, wherein the first voice input includes a predetermined Voice input and voice input characterizing the needs of the user.
  10. 根据权利要求9所述的方法,其中,所述参数信息包括所述语音交互装置的性能参数,在向控制装置发送参数信息之前,所述方法还包括:获取所述性能参数。The method according to claim 9, wherein the parameter information includes performance parameters of the voice interaction device, and before sending the parameter information to the control device, the method further comprises: acquiring the performance parameters.
  11. 根据权利要求10所述的方法,其中,所述参数信息还包括所述用户的位置信息,在向控制装置发送参数信息之前,所述方法还包括:The method according to claim 10, wherein the parameter information further includes location information of the user, and before sending the parameter information to the control device, the method further includes:
    根据采集的所述用户的第一语音输入,确定所述用户的位置信息,其中,所述用户的位置信息表征所述用户相对于所述语音交互装置的位置。The location information of the user is determined according to the collected first voice input of the user, wherein the location information of the user represents the location of the user relative to the voice interaction device.
  12. 根据权利要求9所述的方法,其中,所述参数信息包括语音交互装置的操作信息,所述非唤醒指令包括睡眠指令和休眠指令,其中:The method according to claim 9, wherein the parameter information includes operation information of a voice interaction device, and the non-wake-up instruction includes a sleep instruction and a sleep instruction, wherein:
    在所述操作信息表征所述语音交互装置执行第一操作的情况下,接收到的非唤醒指令为所述睡眠指令,以响应于所述睡眠指令处于睡眠状 态,所述睡眠状态包括执行所述第一操作且对采集的所述用户的语音输入不作响应的状态;In the case where the operation information indicates that the voice interaction device performs the first operation, the received non-wake-up instruction is the sleep instruction, in response to the sleep instruction being in the sleep state, the sleep state includes executing the The first operation and the state of not responding to the collected voice input of the user;
    在所述操作信息表征所述语音交互装置未执行第一操作的情况下,接收到的所述非唤醒指令为所述休眠指令,以响应于所述休眠指令处于休眠状态,所述休眠状态包括不执行任何操作的状态。In the case where the operation information indicates that the voice interaction device has not performed the first operation, the received non-wake-up instruction is the sleep instruction, in response to the sleep instruction being in the sleep state, the sleep state includes The state of not performing any operations.
  13. 根据权利要求9所述的方法,其中:The method according to claim 9, wherein:
    所述方法还包括:The method also includes:
    在所述语音交互装置处于唤醒状态、且采集到所述用户的第二语音输入的情况下,确定所述第二语音输入对应的第一语音信息是否为通用语音信息;以及In the case where the voice interaction device is in an awake state and the second voice input of the user is collected, determining whether the first voice information corresponding to the second voice input is general voice information; and
    在确定所述第一语音信息为通用语音信息的情况下,向所述控制装置发送第一语音信息;并且/或者In the case of determining that the first voice information is general voice information, send the first voice information to the control device; and/or
    所述方法还包括:The method also includes:
    接收所述控制装置发送的属于所述通用语音信息的第二语音信息;以及Receiving second voice information belonging to the general voice information sent by the control device; and
    根据所述第二语音信息,执行第二操作,所述第二操作与所述第二语音信息对应的语音输入相对应。Perform a second operation according to the second voice information, and the second operation corresponds to a voice input corresponding to the second voice information.
  14. 根据权利要求9所述的方法,其中:The method according to claim 9, wherein:
    所述方法还包括:在所述语音交互装置处于唤醒状态、且采集到所述用户的第三语音输入或在预设时段内未采集到用户的语音输入的情况下,向所述控制装置发送恢复请求;并且/或者The method further includes: when the voice interaction device is in an awake state and the user's third voice input is collected or the user's voice input is not collected within a preset time period, sending to the control device Reinstatement request; and/or
    所述方法还包括:接收所述控制装置发送的恢复指令,将当前状态切换至采集所述第一语音输入之前的状态。The method further includes: receiving a recovery instruction sent by the control device, and switching the current state to a state before collecting the first voice input.
  15. 根据权利要求9所述的方法,其中,所述方法还包括:The method according to claim 9, wherein the method further comprises:
    在所述语音交互装置处于唤醒状态、且执行第三操作的情况下,响应于所述控制装置发送的同步请求,向所述控制装置发送所述第三操作的执行进度信息。When the voice interaction device is in the awake state and the third operation is performed, in response to the synchronization request sent by the control device, the execution progress information of the third operation is sent to the control device.
  16. 根据权利要求15所述的方法,其中,上述方法还包括:The method according to claim 15, wherein the method further comprises:
    在所述语音交互装置处于唤醒状态、且采集到所述用户的第四语音输入的情况下,向所述控制装置发送获取请求;When the voice interaction device is in an awake state and the fourth voice input of the user is collected, sending an acquisition request to the control device;
    接收所述控制装置响应于所述获取请求发送的所述执行进度信息;以及Receiving the execution progress information sent by the control device in response to the acquisition request; and
    根据所述执行进度信息,执行所述第三操作,Execute the third operation according to the execution progress information,
    其中,所述第三操作与所述第四语音输入相对应。Wherein, the third operation corresponds to the fourth voice input.
  17. 一种控制装置,包括:A control device includes:
    参数信息接收模块,用于接收多个语音交互装置分别发送的多个参数信息,所述多个参数信息是所述多个语音交互装置在采集到用户同一时刻的第一语音输入的情况下发送的;The parameter information receiving module is configured to receive multiple parameter information respectively sent by multiple voice interaction devices, and the multiple parameter information is sent by the multiple voice interaction devices when the first voice input of the user at the same time is collected of;
    需求信息确定模块,用于根据所述第一语音输入,确定所述用户的需求信息;A demand information determination module, configured to determine demand information of the user according to the first voice input;
    第一装置确定模块,用于根据所述多个参数信息及所述需求信息,确定所述多个语音交互装置中的第一语音交互装置为与所述用户交互的装置;以及A first device determining module, configured to determine, according to the multiple parameter information and the demand information, that the first voice interaction device among the multiple voice interaction devices is a device that interacts with the user; and
    指令发送模块,用于向所述第一语音交互装置发送唤醒指令,并向所述多个语音交互装置中除所述第一语音交互装置外的其他语音交互装置发送非唤醒指令,An instruction sending module, configured to send a wake-up instruction to the first voice interaction device, and send a non-wake-up instruction to voice interaction devices other than the first voice interaction device among the multiple voice interaction devices,
    其中,所述第一语音输入包括预定语音输入和表征所述用户需求的语音输入。Wherein, the first voice input includes a predetermined voice input and a voice input characterizing the needs of the user.
  18. 一种语音交互装置,包括:A voice interaction device includes:
    参数信息发送模块,用于在采集到用户的第一语音输入的情况下,向控制装置发送参数信息,以确定所述语音交互装置是否为第一语音交互装置;The parameter information sending module is configured to send parameter information to the control device when the user's first voice input is collected, so as to determine whether the voice interaction device is the first voice interaction device;
    指令接收模块,用于:Command receiving module for:
    在所述语音交互装置是第一语音交互装置的情况下,接收所述控制装置发送的唤醒指令;In a case where the voice interaction device is the first voice interaction device, receiving a wake-up instruction sent by the control device;
    在所述语音交互装置不是第一语音交互装置的情况下,接收所述控制装置发送的非唤醒指令;以及In the case that the voice interaction device is not the first voice interaction device, receiving a non-wake-up instruction sent by the control device; and
    状态切换模块,用于:State switching module for:
    在所述指令接收模块接收到所述唤醒指令的情况下,响应于所述唤醒指令,将当前状态切换为唤醒状态;In the case that the instruction receiving module receives the wake-up instruction, in response to the wake-up instruction, switch the current state to the wake-up state;
    在所述指令接收模块接收到所述非唤醒指令的情况下,响应于所述唤醒指令,将当前状态切换为非唤醒状态,In the case that the instruction receiving module receives the non-wake-up instruction, in response to the wake-up instruction, switch the current state to the non-wake-up state,
    其中,所述第一语音输入包括预定语音输入和表征所述用户需求的语音输入。Wherein, the first voice input includes a predetermined voice input and a voice input characterizing the needs of the user.
  19. 一种电子设备,包括:An electronic device including:
    根据权利要求17所述的控制装置;以及The control device according to claim 17; and
    根据权利要求18所述的语音交互装置。The voice interaction device according to claim 18.
  20. 一种电子设备,包括:An electronic device including:
    一个或多个处理器;One or more processors;
    存储装置,用于存储一个或多个程序,Storage device for storing one or more programs,
    其中,当所述一个或多个程序被所述一个或多个处理器执行时,使得所述一个或多个处理器执行:Wherein, when the one or more programs are executed by the one or more processors, the one or more processors are caused to execute:
    根据权利要求1~8中任意一项所述的方法;并且/或者The method according to any one of claims 1-8; and/or
    根据权利要求9~16中任意一项所述的方法。The method according to any one of claims 9-16.
  21. 一种计算机可读存储介质,其上存储有可执行指令,该指令被处理器执行时使处理器:A computer-readable storage medium with executable instructions stored thereon, which when executed by a processor causes the processor to:
    执行根据权利要求1~8中任意一项所述的方法;并且/或者Perform the method according to any one of claims 1-8; and/or
    执行根据权利要求9~16中任意一项所述的方法。Perform the method according to any one of claims 9-16.
PCT/CN2020/081165 2019-05-09 2020-03-25 Control device and operation method therefor, and speech interaction device and operation method therefor WO2020224346A1 (en)

Applications Claiming Priority (2)

Application Number Priority Date Filing Date Title
CN201910388450.5 2019-05-09
CN201910388450.5A CN111754997B (en) 2019-05-09 2019-05-09 Control device and operation method thereof, and voice interaction device and operation method thereof

Publications (1)

Publication Number Publication Date
WO2020224346A1 true WO2020224346A1 (en) 2020-11-12

Family

ID=72672786

Family Applications (1)

Application Number Title Priority Date Filing Date
PCT/CN2020/081165 WO2020224346A1 (en) 2019-05-09 2020-03-25 Control device and operation method therefor, and speech interaction device and operation method therefor

Country Status (2)

Country Link
CN (1) CN111754997B (en)
WO (1) WO2020224346A1 (en)

Cited By (1)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN115171680A (en) * 2022-06-07 2022-10-11 青岛海尔科技有限公司 Voice interaction method and device of equipment, storage medium and electronic device

Families Citing this family (4)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN113050505B (en) * 2021-03-25 2021-12-24 广东凌霄泵业股份有限公司 Remote control type multifunctional SPA bathtub intelligent controller
CN113113007A (en) * 2021-03-30 2021-07-13 北京金山云网络技术有限公司 Voice data processing method and device, electronic equipment and storage medium
CN113569712B (en) * 2021-07-23 2023-11-14 北京百度网讯科技有限公司 Information interaction method, device, equipment and storage medium
CN116030812B (en) * 2023-03-29 2023-06-16 广东海新智能厨房股份有限公司 Intelligent interconnection voice control method, device, equipment and medium for gas stove

Citations (10)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN103475551A (en) * 2013-09-11 2013-12-25 厦门狄耐克电子科技有限公司 Intelligent home system based on voice recognition
CN106469040A (en) * 2015-08-19 2017-03-01 华为终端(东莞)有限公司 Communication means, server and equipment
CN106782540A (en) * 2017-01-17 2017-05-31 联想(北京)有限公司 Speech ciphering equipment and the voice interactive system including the speech ciphering equipment
CN106951209A (en) * 2017-03-29 2017-07-14 联想(北京)有限公司 A kind of control method, device and electronic equipment
US20180096690A1 (en) * 2016-10-03 2018-04-05 Google Inc. Multi-User Personalization at a Voice Interface Device
CN107919119A (en) * 2017-11-16 2018-04-17 百度在线网络技术(北京)有限公司 Method, apparatus, equipment and the computer-readable medium of more equipment interaction collaborations
WO2018135803A1 (en) * 2017-01-20 2018-07-26 Samsung Electronics Co., Ltd. Voice input processing method and electronic device for supporting the same
CN109215663A (en) * 2018-10-11 2019-01-15 北京小米移动软件有限公司 Equipment awakening method and device
CN109391528A (en) * 2018-08-31 2019-02-26 百度在线网络技术(北京)有限公司 Awakening method, device, equipment and the storage medium of speech-sound intelligent equipment
CN109450750A (en) * 2018-11-30 2019-03-08 广东美的制冷设备有限公司 Sound control method, device, mobile terminal and the household appliance of equipment

Family Cites Families (9)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN105554283B (en) * 2015-12-21 2019-01-15 联想(北京)有限公司 A kind of information processing method and electronic equipment
CN107450879A (en) * 2016-05-30 2017-12-08 中兴通讯股份有限公司 Terminal operation method and device
CN108663942B (en) * 2017-04-01 2021-12-07 青岛有屋科技有限公司 Voice recognition equipment control method, voice recognition equipment and central control server
US11189273B2 (en) * 2017-06-29 2021-11-30 Amazon Technologies, Inc. Hands free always on near field wakeword solution
CN107680589B (en) * 2017-09-05 2021-02-05 百度在线网络技术(北京)有限公司 Voice information interaction method, device and equipment
CN108538298B (en) * 2018-04-04 2021-05-04 科大讯飞股份有限公司 Voice wake-up method and device
CN109377987B (en) * 2018-08-31 2020-07-28 百度在线网络技术(北京)有限公司 Interaction method, device, equipment and storage medium between intelligent voice equipment
CN109274562B (en) * 2018-09-27 2020-08-04 珠海格力电器股份有限公司 Voice instruction execution method and device, intelligent household appliance and medium
CN109192208B (en) * 2018-09-30 2021-07-30 深圳创维-Rgb电子有限公司 Control method, system, device, equipment and medium for electrical equipment

Patent Citations (10)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN103475551A (en) * 2013-09-11 2013-12-25 厦门狄耐克电子科技有限公司 Intelligent home system based on voice recognition
CN106469040A (en) * 2015-08-19 2017-03-01 华为终端(东莞)有限公司 Communication means, server and equipment
US20180096690A1 (en) * 2016-10-03 2018-04-05 Google Inc. Multi-User Personalization at a Voice Interface Device
CN106782540A (en) * 2017-01-17 2017-05-31 联想(北京)有限公司 Speech ciphering equipment and the voice interactive system including the speech ciphering equipment
WO2018135803A1 (en) * 2017-01-20 2018-07-26 Samsung Electronics Co., Ltd. Voice input processing method and electronic device for supporting the same
CN106951209A (en) * 2017-03-29 2017-07-14 联想(北京)有限公司 A kind of control method, device and electronic equipment
CN107919119A (en) * 2017-11-16 2018-04-17 百度在线网络技术(北京)有限公司 Method, apparatus, equipment and the computer-readable medium of more equipment interaction collaborations
CN109391528A (en) * 2018-08-31 2019-02-26 百度在线网络技术(北京)有限公司 Awakening method, device, equipment and the storage medium of speech-sound intelligent equipment
CN109215663A (en) * 2018-10-11 2019-01-15 北京小米移动软件有限公司 Equipment awakening method and device
CN109450750A (en) * 2018-11-30 2019-03-08 广东美的制冷设备有限公司 Sound control method, device, mobile terminal and the household appliance of equipment

Cited By (1)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN115171680A (en) * 2022-06-07 2022-10-11 青岛海尔科技有限公司 Voice interaction method and device of equipment, storage medium and electronic device

Also Published As

Publication number Publication date
CN111754997A (en) 2020-10-09
CN111754997B (en) 2023-08-04

Similar Documents

Publication Publication Date Title
WO2020224346A1 (en) Control device and operation method therefor, and speech interaction device and operation method therefor
US11721342B2 (en) Multi-modal interaction with intelligent assistants in voice command devices
US11825253B2 (en) Electrical meter for identifying devices using power data and network data
US11422772B1 (en) Creating scenes from voice-controllable devices
WO2019205134A1 (en) Smart home voice control method, apparatus, device and system
TWI665584B (en) A voice controlling system and method
WO2019091171A1 (en) Voice control method, device, system, and electronic apparatus for smart home appliance
CN105700389B (en) Intelligent home natural language control method
CN107339786B (en) A kind of system and method for air-conditioning, regulation air-conditioning loudspeaker casting volume
CN113168304A (en) Conditionally assigning various automatic assistant functions to interactions with peripheral assistant control devices
CN112051743A (en) Device control method, conflict processing method, corresponding devices and electronic device
KR102411619B1 (en) Electronic apparatus and the controlling method thereof
US11721343B2 (en) Hub device, multi-device system including the hub device and plurality of devices, and method of operating the same
JP6645438B2 (en) Information processing apparatus, information processing method, and computer program
US20200193982A1 (en) Terminal device and method for controlling thereof
WO2018157542A1 (en) Smart home appliance control method and device
WO2017141530A1 (en) Information processing device, information processing method and program
CN110618614A (en) Control method and device for smart home, storage medium and robot
CN112489413B (en) Control method and system of remote controller, storage medium and electronic equipment
CN111077785A (en) Awakening method, awakening device, terminal and storage medium
WO2019227368A1 (en) Mode control method and apparatus, and readable storage medium and electronic device
WO2018023515A1 (en) Gesture and emotion recognition home control system
CN114373462A (en) Voice interaction equipment and control method and control device thereof
CN109974229B (en) Method and device for determining air conditioner state, electronic equipment and storage medium
CN106128458A (en) A kind of home voice control system based on speech recognition technology and method

Legal Events

Date Code Title Description
121 Ep: the epo has been informed by wipo that ep was designated in this application

Ref document number: 20802122

Country of ref document: EP

Kind code of ref document: A1

NENP Non-entry into the national phase

Ref country code: DE

122 Ep: pct application non-entry in european phase

Ref document number: 20802122

Country of ref document: EP

Kind code of ref document: A1