CN113421559A - Control method based on voice awakening, electronic equipment and controller - Google Patents

Control method based on voice awakening, electronic equipment and controller Download PDF

Info

Publication number
CN113421559A
CN113421559A CN202110611312.6A CN202110611312A CN113421559A CN 113421559 A CN113421559 A CN 113421559A CN 202110611312 A CN202110611312 A CN 202110611312A CN 113421559 A CN113421559 A CN 113421559A
Authority
CN
China
Prior art keywords
voice
electronic device
controller
user
intensity value
Prior art date
Legal status (The legal status is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the status listed.)
Granted
Application number
CN202110611312.6A
Other languages
Chinese (zh)
Other versions
CN113421559B (en
Inventor
张启敏
Current Assignee (The listed assignees may be inaccurate. Google has not performed a legal analysis and makes no representation or warranty as to the accuracy of the list.)
Honor Device Co Ltd
Original Assignee
Honor Device Co Ltd
Priority date (The priority date is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the date listed.)
Filing date
Publication date
Application filed by Honor Device Co Ltd filed Critical Honor Device Co Ltd
Priority to CN202110611312.6A priority Critical patent/CN113421559B/en
Publication of CN113421559A publication Critical patent/CN113421559A/en
Application granted granted Critical
Publication of CN113421559B publication Critical patent/CN113421559B/en
Active legal-status Critical Current
Anticipated expiration legal-status Critical

Links

Images

Classifications

    • GPHYSICS
    • G10MUSICAL INSTRUMENTS; ACOUSTICS
    • G10LSPEECH ANALYSIS TECHNIQUES OR SPEECH SYNTHESIS; SPEECH RECOGNITION; SPEECH OR VOICE PROCESSING TECHNIQUES; SPEECH OR AUDIO CODING OR DECODING
    • G10L15/00Speech recognition
    • G10L15/22Procedures used during a speech recognition process, e.g. man-machine dialogue
    • HELECTRICITY
    • H04ELECTRIC COMMUNICATION TECHNIQUE
    • H04LTRANSMISSION OF DIGITAL INFORMATION, e.g. TELEGRAPHIC COMMUNICATION
    • H04L12/00Data switching networks
    • H04L12/28Data switching networks characterised by path configuration, e.g. LAN [Local Area Networks] or WAN [Wide Area Networks]
    • H04L12/2803Home automation networks
    • H04L12/2816Controlling appliance services of a home automation network by calling their functionalities
    • H04L12/282Controlling appliance services of a home automation network by calling their functionalities based on user interaction within the home
    • HELECTRICITY
    • H04ELECTRIC COMMUNICATION TECHNIQUE
    • H04LTRANSMISSION OF DIGITAL INFORMATION, e.g. TELEGRAPHIC COMMUNICATION
    • H04L12/00Data switching networks
    • H04L12/28Data switching networks characterised by path configuration, e.g. LAN [Local Area Networks] or WAN [Wide Area Networks]
    • H04L12/2803Home automation networks
    • H04L12/2816Controlling appliance services of a home automation network by calling their functionalities
    • H04L12/2821Avoiding conflicts related to the use of home appliances
    • GPHYSICS
    • G10MUSICAL INSTRUMENTS; ACOUSTICS
    • G10LSPEECH ANALYSIS TECHNIQUES OR SPEECH SYNTHESIS; SPEECH RECOGNITION; SPEECH OR VOICE PROCESSING TECHNIQUES; SPEECH OR AUDIO CODING OR DECODING
    • G10L15/00Speech recognition
    • G10L15/22Procedures used during a speech recognition process, e.g. man-machine dialogue
    • G10L2015/223Execution procedure of a spoken command
    • GPHYSICS
    • G10MUSICAL INSTRUMENTS; ACOUSTICS
    • G10LSPEECH ANALYSIS TECHNIQUES OR SPEECH SYNTHESIS; SPEECH RECOGNITION; SPEECH OR VOICE PROCESSING TECHNIQUES; SPEECH OR AUDIO CODING OR DECODING
    • G10L15/00Speech recognition
    • G10L15/22Procedures used during a speech recognition process, e.g. man-machine dialogue
    • G10L2015/225Feedback of the input speech
    • YGENERAL TAGGING OF NEW TECHNOLOGICAL DEVELOPMENTS; GENERAL TAGGING OF CROSS-SECTIONAL TECHNOLOGIES SPANNING OVER SEVERAL SECTIONS OF THE IPC; TECHNICAL SUBJECTS COVERED BY FORMER USPC CROSS-REFERENCE ART COLLECTIONS [XRACs] AND DIGESTS
    • Y02TECHNOLOGIES OR APPLICATIONS FOR MITIGATION OR ADAPTATION AGAINST CLIMATE CHANGE
    • Y02DCLIMATE CHANGE MITIGATION TECHNOLOGIES IN INFORMATION AND COMMUNICATION TECHNOLOGIES [ICT], I.E. INFORMATION AND COMMUNICATION TECHNOLOGIES AIMING AT THE REDUCTION OF THEIR OWN ENERGY USE
    • Y02D30/00Reducing energy consumption in communication networks
    • Y02D30/70Reducing energy consumption in communication networks in wireless communication networks

Landscapes

  • Engineering & Computer Science (AREA)
  • Human Computer Interaction (AREA)
  • Automation & Control Theory (AREA)
  • Computer Networks & Wireless Communication (AREA)
  • Signal Processing (AREA)
  • Computational Linguistics (AREA)
  • Computer Security & Cryptography (AREA)
  • Health & Medical Sciences (AREA)
  • Audiology, Speech & Language Pathology (AREA)
  • Physics & Mathematics (AREA)
  • Acoustics & Sound (AREA)
  • Multimedia (AREA)
  • Selective Calling Equipment (AREA)
  • Telephonic Communication Services (AREA)

Abstract

A control method based on voice awakening, electronic equipment and a controller relate to the technical field of communication. According to the scheme, in a scene that the awakening voices of the electronic devices are the same, when a user awakens one device through voice, the electronic devices respectively collect the awakening voices of the user and report the voice intensity value of the awakening voices of the user to the controller of the intelligent routing end; then, the controller judges which electronic device of the electronic devices meets the wake-up response condition (for example, the voice intensity value is the maximum, the historical wake-up times are the maximum, and/or the electronic device has responded for the last time) based on the voice intensity value sent by each electronic device, and instructs one electronic device meeting the wake-up response condition to output the wake-up response message for response, so that the situation that a plurality of electronic devices respond simultaneously is avoided, and the user experience is improved.

Description

Control method based on voice awakening, electronic equipment and controller
Technical Field
The present application relates to the field of communications technologies, and in particular, to a control method based on voice wakeup, an electronic device, and a controller.
Background
With the rapid development of terminal technology and artificial intelligence, electronic devices (e.g., smart home devices) can generally support voice wake-up, such as a voice engine or a voice assistant running in the background, so as to implement voice interaction between a user and the electronic device.
When a user needs to trigger certain electronic equipment to start a certain function, the user can trigger to wake up the electronic equipment by directly speaking the wake-up keyword. The speech engine of the electronic device may output a response word (e.g., "on") as a response to the user after being awakened. Further, the electronic device may support voice interaction with the user, providing some application services to the user.
However, in a home scene, there are usually a plurality of smart home devices, such as one or more smart speakers, smart screens, etc., where some of the smart home devices may have the same factory-preset wake-up words, in this case, a voice wake-up word spoken by a user may simultaneously wake up the plurality of smart home devices in the home scene, so that a plurality of electronic devices may all output a response word to respond, and perform voice interaction with the user, which may not satisfy a requirement of the user to wake up a certain electronic device.
Disclosure of Invention
The application provides a control method based on voice awakening, electronic equipment and a controller, and solves the problem that the requirement of a user for awakening certain electronic equipment cannot be met under the condition that a plurality of pieces of electronic equipment are identical in awakening language in the prior art.
In order to achieve the purpose, the technical scheme is as follows:
in a first aspect, a control method based on voice wake-up is provided, and is applied to a first electronic device and a second electronic device, where wake-up words of the first electronic device and the second electronic device are the same, and the wake-up word is a first wake-up word; the method comprises the following steps:
the first electronic equipment determines that the collected user voice is matched with the first awakening language; the first electronic equipment determines the intensity value of the user voice as a first voice intensity value; the first electronic equipment sends a first voice intensity value to the controller;
the second electronic equipment determines that the collected user voice is matched with the first awakening language; the second electronic equipment determines the intensity value of the user voice as a second voice intensity value; the second electronic equipment sends a second voice intensity value to the controller;
the controller receives a first voice intensity value and a second voice intensity value; the controller determines that the first voice intensity value is greater than the second voice intensity value; the controller sends a first indication message to the first electronic equipment, wherein the first indication message is used for indicating the first electronic equipment to respond;
and the first electronic equipment responds to the first indication message and outputs a first awakening response word to respond.
According to the scheme, in a scene that the awakening voices of the electronic devices are the same, when a user awakens a certain device through voice, the electronic devices respectively collect the awakening voices of the user and report the voice intensity value of the awakening voices of the user to the controller of the intelligent routing end; and then the controller judges which electronic equipment has the largest voice intensity value based on the voice intensity values sent by the electronic equipment respectively, and instructs the electronic equipment with the largest voice intensity value to output the awakening response words for response, so that the condition that a plurality of electronic equipment respond simultaneously is avoided, and the use experience of a user is improved.
In some embodiments of the first aspect, the voice wake-up based control method may be applied to a plurality of electronic devices, where the plurality of electronic devices include a first electronic device, a second electronic device, and a third electronic device, where the wake-up languages of the plurality of electronic devices are the same, and the wake-up language is the first wake-up language.
In this case, the voice wakeup based control method may further include:
the third electronic equipment determines that the collected user voice is matched with the first awakening language; the third electronic equipment determines the intensity value of the user voice as a third voice intensity value; the third electronic equipment sends a third voice intensity value to the controller;
wherein, the controller receives a first voice intensity value and a second voice intensity value, and comprises: the controller receives a first voice intensity value, a second voice intensity value and a third voice intensity value;
wherein, the controller determines that the first voice intensity value is greater than the second voice intensity value, including: the controller determines the maximum voice intensity value of the first voice intensity value, the second voice intensity value and the third voice intensity value;
wherein, the controller sends a first indication message to the first electronic device, including: the controller sends a first indication message to a target electronic device corresponding to the maximum voice intensity value in the plurality of electronic devices;
wherein, the first electronic device responds to the first indication message and outputs a first wake-up response message to respond, including: and the target electronic equipment responds to the first indication message and outputs a first awakening response word for responding.
According to the control method based on voice awakening, when the collected user voice is matched with the first awakening voice, each electronic device in the plurality of electronic devices sends the voice intensity value corresponding to the user voice to the controller. The controller receives a plurality of voice intensity values sent by a plurality of electronic devices, and compares the voice intensity values, wherein each electronic device in the plurality of electronic devices corresponds to one voice intensity value. The controller sends first indication information to a target electronic device corresponding to the maximum voice intensity value in the plurality of electronic devices, and the first indication information is used for indicating the target electronic device to output a first awakening response word for responding. And the target electronic equipment outputs a first awakening response word to respond according to the first indication message sent by the controller.
According to the scheme, in a scene that the awakening voices of the electronic devices are the same, when a user awakens a certain device through voice, the electronic devices respectively collect the awakening voices of the user and report the voice intensity value of the awakening voices of the user to the controller of the intelligent routing end; then, the controller judges which electronic device of the electronic devices meets the wake-up response condition (for example, the voice intensity value is the maximum, the historical wake-up times are the maximum, and/or the electronic device has responded last time) based on the voice intensity value sent by each electronic device, and instructs one electronic device meeting the wake-up response condition to output a wake-up response message for response, so that the situation that a plurality of electronic devices respond simultaneously is avoided, and the use experience of a user is improved.
In some embodiments of the first aspect, the voice wake-up based control method further comprises:
when at least two voice intensity values in the plurality of voice intensity values are equal and are maximum values, if a target electronic device in the plurality of electronic devices has the maximum voice intensity value and the historical awakening times of the target electronic device in the plurality of electronic devices are the maximum, the controller sends the first indication message to the target electronic device;
or, when at least two of the voice intensity values are equal and are maximum values, if a target electronic device in the electronic devices has the maximum voice intensity value and the target electronic device in the electronic devices has responded last time, the controller sends the first indication message to the target electronic device.
In some embodiments of the first aspect, the first electronic device, in response to the first indication message, outputting a first wake-up answer response, including: and the first electronic equipment responds to the first indication message, starts the voice engine and outputs a first awakening response message for responding.
In some embodiments of the first aspect, the voice wake-up based control method further comprises: the controller sends second indication information to the second electronic device, wherein the second indication information is used for indicating that the second electronic device does not respond.
In some embodiments of the first aspect, the controller may be provided at the routing side. For example, the controller may be an intelligent router.
In some embodiments of the first aspect, the voice wake-up based control method further comprises: the first electronic device and the second electronic device respectively establish wireless connection (such as Bluetooth connection or wireless local area network connection) with the controller and/or respectively establish wired connection with the controller.
For example, a wireless local area network connection is established between a plurality of electronic devices and an intelligent router.
In some embodiments of the first aspect, the determining, by the first electronic device, the intensity value of the user's voice to be a first intensity value comprises: the first electronic equipment converts the collected user voice into an electric signal and obtains a first voice intensity value according to the electric signal.
In some embodiments of the first aspect, the first electronic device sending the first speech strength value to the controller includes:
the first electronic equipment determines that the first voice intensity value is greater than or equal to a preset intensity threshold value;
the first electronic device sends the first voice intensity value to the controller.
In some embodiments of the first aspect, before the first electronic device sends the first speech strength value to the controller, the method further comprises: the first electronic equipment performs user identity authentication according to the collected user voice and preset voiceprint characteristic information;
wherein, first electronic equipment sends first voice intensity value to the controller, includes: and under the condition that the first electronic equipment confirms that the user identity authentication is successful according to the collected user voice and the collected user voice is matched with the first awakening language, the first electronic equipment sends a first voice intensity value to the controller.
In some embodiments of the first aspect, the preset voiceprint feature information may be stored in each of the plurality of electronic devices, the cloud server and/or the controller.
In some embodiments of the first aspect, the sending, by the controller, the first indication message to a target electronic device of the plurality of electronic devices corresponding to the maximum voice intensity value includes:
if the target electronic equipment in the plurality of electronic equipment has the maximum voice intensity value and the distance value between the target electronic equipment and the user in the plurality of electronic equipment is the minimum, the controller sends a first indication message to the target electronic equipment.
In some embodiments of the first aspect, the voice wake-up based control method further comprises: each electronic device in the plurality of electronic devices sends a device identification to the controller; the method comprises the steps that a controller receives equipment identifications sent by a plurality of pieces of electronic equipment, and each piece of electronic equipment corresponds to one equipment identification;
wherein, the controller sends a first indication message to a target electronic device corresponding to the maximum voice intensity value among the plurality of electronic devices, and the method includes: if the target electronic equipment in the plurality of electronic equipment has the maximum voice intensity value, the controller sends a first indication message to the target electronic equipment according to the equipment identification of the target electronic equipment.
In a second aspect, the present application provides a control method based on voice wakeup, which is applied to an electronic device, and includes:
collecting user voice;
if the user voice is matched with a first awakening language preset in the electronic equipment, sending a target voice intensity value corresponding to the user voice to the controller, so that the controller judges whether the electronic equipment meets a preset awakening response condition or not based on the target voice intensity value;
when the electronic equipment meets a preset awakening response condition, receiving a first indication message sent by a controller;
responding to the first indication message, and outputting a first awakening response word for responding;
wherein, the preset wake-up response condition includes: when the controller receives a target voice intensity value in a plurality of voice intensity values sent by a plurality of pieces of electronic equipment, the controller instructs the electronic equipment to output a first awakening response word to respond; the awakening words of the electronic devices are the same.
In some embodiments of the second aspect, the electronic device may start the speech engine in response to the first indication message and respond by the speech engine outputting the first wake-up answer.
In some embodiments of the second aspect, the preset wake-up response condition further includes:
when at least two voice intensity values in the plurality of voice intensity values are equal and are the maximum values, if the historical awakening times of the electronic equipment in the plurality of electronic equipment are the maximum, the controller instructs the electronic equipment to output an awakening response word for responding;
or when at least two of the voice intensity values are equal and are maximum values, if the electronic device in the plurality of electronic devices has responded last time, the controller instructs the electronic device to output a wake-up response word for responding.
In some embodiments of the second aspect, the controller is disposed at the routing side.
In some embodiments of the second aspect, before the electronic device collects the user voice, the voice wake-up based control method further includes: the electronic device establishes a wireless connection and/or a wired connection with the controller.
In some embodiments of the second aspect, after the electronic device collects the user voice, the voice wake-up based control method further includes: and converting the user voice into a target electric signal, and obtaining a target voice intensity value according to the target electric signal.
In some embodiments of the second aspect, the sending the target speech strength value corresponding to the user speech to the controller includes: and sending a target voice intensity value corresponding to the voice of the user to the controller under the condition that the target voice intensity value is greater than or equal to the preset intensity threshold value.
In some embodiments of the second aspect, the sending the target speech strength value corresponding to the user speech to the controller includes: and when the user identity authentication is determined to be successful according to the user voice and the preset voiceprint characteristic information and the user voice is matched with the first awakening language, sending a target voice intensity value corresponding to the user voice to the controller.
In some embodiments of the second aspect, the determining that the user identity authentication is successful according to the user voice and the preset voiceprint feature information, and that the user voice is matched with the first wake-up word includes:
performing user identity authentication according to the user voice and preset voiceprint characteristic information;
if the identity authentication is successful, comparing the user voice with the first awakening language;
and if the text information of the user voice contains the first awakening language or is consistent with the first awakening language, determining that the user voice is matched with the first awakening language.
In some embodiments of the second aspect, the preset voiceprint feature information is stored in each of the plurality of electronic devices, the cloud server and/or the controller.
In some embodiments of the second aspect, the sending the target speech strength value corresponding to the user speech to the controller includes: and sending the equipment identification and the target voice intensity value of the electronic equipment to the controller, wherein the target voice intensity value has a corresponding relation with the equipment identification.
In a third aspect, the present application provides a voice wake-up based control method, applied to a controller, the method including:
if the voice intensity values sent by the electronic devices are received within the preset duration, comparing the voice intensity values, wherein the voice intensity values are corresponding to the voice intensity values when the user voice collected by each electronic device is matched with the preset awakening words, each electronic device corresponds to one voice intensity value, and the preset awakening words of the electronic devices are the same;
and sending a first indication message to a target electronic device with the maximum voice intensity value in the plurality of electronic devices, wherein the first indication message is used for indicating the target electronic device to output a first awakening response word for responding.
The first indication message may be used to instruct the target electronic device to start the speech engine and output a first wake-up response message for response.
In some embodiments of the third aspect, the voice wake-up based control method further comprises:
when at least two voice intensity values in the plurality of voice intensity values are equal and are maximum values, if the controller judges that the historical awakening times of the electronic equipment in the plurality of electronic equipment are the most, the electronic equipment is instructed to output a first awakening response word to respond;
or when at least two of the voice intensity values are equal and are maximum values, if the controller determines that the electronic device has responded for the last time in the plurality of electronic devices, the controller instructs the electronic device to output the first wake-up response word for responding.
In some embodiments of the third aspect, the controller is disposed at the routing side.
In some embodiments of the third aspect, the voice wake-up based control method further comprises: the controller establishes wireless connection and/or wired connection with the plurality of electronic devices respectively.
In some embodiments of the third aspect, the voice wake-up based control method further comprises: the controller sends second indication information to other electronic devices except the target electronic device in the plurality of electronic devices, wherein the second indication information is used for indicating that the other electronic devices do not respond.
In some embodiments of the third aspect, the sending the first indication message to the target electronic device with the largest voice intensity value among the plurality of electronic devices includes:
if the target electronic equipment in the plurality of electronic equipment has the maximum voice intensity value and the distance value between the target electronic equipment and the user in the plurality of electronic equipment is the minimum, the controller sends a first indication message to the target electronic equipment.
In some embodiments of the third aspect, the receiving, within the preset time period, the voice strength values sent by the plurality of electronic devices includes: receiving equipment identifications and voice intensity values sent by a plurality of electronic equipment within a preset time, wherein the voice intensity values have a corresponding relation with the equipment identifications, and each electronic equipment corresponds to one equipment identification;
wherein, the sending the first indication message to the target electronic device with the maximum voice intensity value in the plurality of electronic devices includes: if the target electronic equipment in the plurality of electronic equipment has the maximum voice intensity value, the controller sends a first indication message to the target electronic equipment according to the equipment identification of the target electronic equipment.
In a fourth aspect, the present application provides a voice wake-up based user device comprising means for performing the method of the second aspect. The apparatus may correspond to the method described in the second aspect, and for the description of the units in the apparatus, reference is made to the description of the second aspect, and for brevity, no further description is given here.
The method described in the second aspect may be implemented by hardware, or may be implemented by hardware executing corresponding software. The hardware or software includes one or more modules or units corresponding to the above-described functions. For example, a voice acquisition module or unit, a processing module or unit, a transceiver module or unit, a voice output module or unit, etc.
In a fifth aspect, the present application provides a voice wake-up based control apparatus comprising means for performing the method of the third aspect. The apparatus may correspond to performing the method described in the third aspect, and for the description of the units in the apparatus, reference is made to the description of the third aspect, and for brevity, no further description is provided here.
The method described in the third aspect may be implemented by hardware, or may be implemented by hardware executing corresponding software. The hardware or software includes one or more modules or units corresponding to the above-described functions. Such as a transceiver module or unit, a processing module or unit, etc.
In a sixth aspect, the present application provides an electronic device comprising a processor coupled with a memory, the memory for storing computer programs or instructions, the processor for executing the computer programs or instructions stored by the memory such that the method of the second aspect is performed.
For example, the processor is adapted to execute a computer program or instructions stored by the memory to cause the apparatus to perform the method of the second aspect.
In a seventh aspect, the present application provides a controller, the electronic device comprising a processor coupled with a memory, the memory being configured to store a computer program or instructions, the processor being configured to execute the computer program or instructions stored by the memory, such that the method of the third aspect is performed.
For example, the processor is adapted to execute a computer program or instructions stored by the memory to cause the apparatus to perform the method of the third aspect.
In an eighth aspect, the present application provides a communication system comprising the electronic device in the sixth aspect and the controller in the seventh aspect.
In a ninth aspect, the present application provides a computer readable storage medium having stored thereon a computer program (which may also be referred to as instructions or code) for implementing the method in the first aspect.
The computer program, when executed by a computer, causes the computer to perform the method of the first aspect, for example.
In a tenth aspect, the present application provides a chip comprising a processor. The processor is adapted to read and execute the computer program stored in the memory to perform the method of the first aspect and any possible implementation thereof.
Optionally, the chip further comprises a memory, and the memory is connected with the processor through a circuit or a wire.
In an eleventh aspect, the present application provides a chip system comprising a processor. The processor is adapted to read and execute the computer program stored in the memory to perform the method of the first aspect and any possible implementation thereof.
Optionally, the chip system further comprises a memory, and the memory is connected with the processor through a circuit or a wire.
In a twelfth aspect, the present application provides a computer program product comprising a computer program (also referred to as instructions or code) which, when executed by a computer, causes the computer to carry out the method of the first aspect.
It is understood that the beneficial effects of the second to twelfth aspects can be seen from the description of the first aspect, and are not described herein again.
Drawings
Fig. 1 is a scene architecture diagram applied to a control method based on voice wakeup according to an embodiment of the present application;
fig. 2 is a schematic flowchart of a control method based on voice wakeup according to an embodiment of the present application;
fig. 3 is a schematic diagram of a system architecture applied to a voice wake-up based control method according to an embodiment of the present application;
fig. 4 is a schematic flowchart of a control method based on voice wakeup according to an embodiment of the present application;
fig. 5 is a second schematic flowchart of a voice wake-up based control method according to an embodiment of the present application;
fig. 6 is a third schematic flowchart of a voice wake-up based control method according to an embodiment of the present application;
fig. 7 is a fourth schematic flowchart of a voice wake-up based control method according to an embodiment of the present application;
fig. 8 is a fifth flowchart illustrating a control method based on voice wakeup according to an embodiment of the present application;
fig. 9 is a schematic structural diagram of a user device based on voice wake-up according to an embodiment of the present application;
fig. 10 is a schematic structural diagram of a control device based on voice wake-up according to an embodiment of the present application;
fig. 11 is a hardware schematic diagram of an electronic device according to an embodiment of the present application;
fig. 12 is a hardware schematic diagram of a controller according to an embodiment of the present application.
Detailed Description
In order to make the objects, technical solutions and advantages of the embodiments of the present invention clearer, the technical solutions in the embodiments of the present invention will be clearly and completely described below with reference to the drawings in the embodiments of the present invention, and it is obvious that the described embodiments are some, but not all, embodiments of the present invention. All other embodiments, which can be derived by a person skilled in the art from the embodiments given herein without making any creative effort, shall fall within the protection scope of the present invention.
The term "and/or" herein is an association relationship describing an associated object, meaning that three relationships may exist, e.g., a and/or B, may mean: a exists alone, A and B exist simultaneously, and B exists alone. The symbol "/" herein denotes a relationship in which the associated object is or, for example, a/B denotes a or B.
The terms "first" and "second," and the like, in the description and in the claims herein are used for distinguishing between different objects and not for describing a particular order of the objects. For example, the first speech strength value and the second speech strength value, etc. are used to distinguish between different speech strength values, rather than describing a particular order of speech strength values.
In the embodiments of the present application, words such as "exemplary" or "for example" are used to mean serving as an example, instance, or illustration. Any embodiment or design described herein as "exemplary" or "e.g.," is not necessarily to be construed as preferred or advantageous over other embodiments or designs. Rather, use of the word "exemplary" or "such as" is intended to present concepts related in a concrete fashion.
In the description of the embodiments of the present application, unless otherwise specified, "a plurality" means two or more, for example, a plurality of processing units means two or more processing units, or the like; plural elements means two or more elements, and the like.
At present, a plurality of smart home devices are generally arranged in a home scene, for example, a smart sound box, a smart screen, and the like, wherein a wakeup word preset when some smart home devices leave a factory may be the same, and in this case, a voice wakeup word spoken by a user may simultaneously wake up the plurality of smart home devices in the home scene, so that a plurality of electronic devices may all output a response word to respond, and perform voice interaction with the user.
Fig. 1 shows a scene architecture diagram to which the scheme provided by the embodiment of the present application is applied. As shown in fig. 1, the scene architecture 1 includes a plurality of smart home devices, such as a smart screen 10, a smart sound box 11, a smart sound box 12, and the like. In the scene architecture, the preset wakeup words of the smart home devices leaving the factory are the same, for example, "you are you". When a user wants to wake up a certain smart home device by voice, the user may wake up the smart screen 10, the smart speaker 11 and the smart speaker 12 at the same time after speaking the voice wake-up word "hello, you", which results in that these electronic devices all output a response word (for example, "on") to respond, and perform voice interaction with the user, which may not satisfy the requirement of the user for waking up a certain smart home device.
In view of this, the embodiment of the present application provides a control method based on voice wake-up, an electronic device and a controller, according to the scheme of the present application, in a scene where wake-up languages of a plurality of electronic devices are the same, when a user wakes up a certain device through voice, the plurality of electronic devices respectively collect user wake-up voices, and report a voice intensity value of the user wake-up voices to the controller of an intelligent routing end; then, the controller judges which electronic device of the electronic devices meets the wake-up response condition (for example, the voice intensity value is the maximum, the historical wake-up times are the maximum, and/or the electronic device has responded last time) based on the voice intensity value sent by each electronic device, and instructs one electronic device meeting the wake-up response condition to output a wake-up response message for response, so that the situation that a plurality of electronic devices respond simultaneously is avoided, and the use experience of a user is improved.
First embodiment
In the first embodiment, a specific implementation manner of the communication system provided by the embodiment of the present application is described specifically from the perspective of interaction between the electronic device and the controller. It should be noted that the solution provided in the embodiment of the present application may be applied to a scenario where multiple electronic devices (i.e., two or more electronic devices) having the same preset wake-up word, and for convenience of description, the following takes the electronic device 1 and the electronic device 2 as an example for exemplary description. Fig. 2 is a flowchart illustrating a voice wake-up based control method according to a first embodiment of the present application. Referring to FIG. 2, the control method 100 includes steps S101-S113 described below.
S101, the electronic equipment 1 collects user voice.
The collected user voice may include a wake-up word or a wake-up keyword, and therefore, the user voice including the wake-up word is also referred to as a wake-up voice or a user wake-up voice hereinafter.
The electronic device 1 may be provided with a microphone having a function of collecting or picking up sound, which may collect or pick up sound of the external environment in real time or periodically, for example, collecting user wake-up voice in real time for waking up the electronic device 1 in real time.
And S102, the electronic equipment 1 performs user identity authentication according to the user voice and the preset first voiceprint characteristic information.
For example, if the voice of the user includes the preset voiceprint feature information, the user identity authentication is successful, that is, it is confirmed that the user is an authorized user, so that the related application function of the electronic device 1 is allowed to be used, and the electronic device 1 further continues to perform step S103 described below.
On the other hand, if the voice of the user does not include the preset voiceprint feature information, the user identity authentication fails, that is, it is determined that the user is an unauthorized user, and therefore, the related application function of the electronic device 1 is prohibited from being used, and therefore, the electronic device 1 may not respond.
Alternatively, the preset first voiceprint feature information may be stored in the electronic device 1, the cloud server, and/or the controller. The electronic device 1 may invoke the preset first voiceprint feature information from the local electronic device 1, the cloud server and/or the controller when the user voice is collected.
S103, the electronic device 1 compares the collected user voice with the first wake-up word.
In this embodiment of the application, the electronic device 1 may perform voice recognition on the collected user voice to obtain text information of the user voice, and then compare the text information of the user voice with the first wake-up word to determine whether the user voice is a wake-up voice.
For example, on the one hand, if the text information of the user voice includes the first wake-up word or is consistent with the first wake-up word, it is determined that the user voice matches the first wake-up word, that is, the user voice is a wake-up voice, and the electronic device 1 continues to perform step S104 described below.
On the other hand, if the text information of the user voice does not include the first wake-up word or is inconsistent with the first wake-up word, it is determined that the user voice does not match the first wake-up word, that is, the user voice is not a wake-up voice, and therefore the electronic device 1 may not respond.
In this embodiment, the electronic device 1 may perform user identity authentication according to the user voice and the preset voiceprint feature information. Further, if the identity authentication is successful, the user voice is compared with the first awakening voice, and whether the user voice is the awakening voice is judged. Therefore, under the condition that the user is authenticated after the identity authentication is successful, whether the user voice is the awakening voice is judged, and the privacy safety of the user can be ensured.
It should be noted that, in practical implementation, the embodiment of the present application does not limit the execution sequence in which the electronic device 1 performs user identity authentication according to the user voice and determines whether the user voice is the wake-up voice according to the user voice. For example, the electronic device 1 may perform user identity authentication according to the user voice after judging that the collected user voice is the wake-up voice; of course, the electronic device 1 may also execute the two steps at the same time, and determine whether the user voice is a wake-up voice while performing user identity authentication according to the user voice and the preset voiceprint feature information, which may be specifically determined according to actual use requirements, and the embodiment of the present application is not limited.
And S104, if the voice intensity value is matched with the first voice intensity value, the electronic equipment 1 sends the first voice intensity value corresponding to the user voice to the controller.
Alternatively, the electronic device 1 may establish a wireless connection and/or a wired connection with the controller.
For example, the electronic device 1 may send a wake-up request message to the controller in a Wi-Fi wireless manner, where the wake-up request message includes a first voice intensity value corresponding to the user voice acquired by the electronic device 1.
It should be noted that the wake-up request message further includes a device identifier of the electronic device 1, where the device identifier is used to uniquely indicate the electronic device 1. The device identification has a corresponding relationship with the first voice strength value. In this way, the controller can know that the first voice intensity value is sent by the electronic device 1 according to the corresponding relationship.
In this embodiment of the application, when it is determined that the user identity authentication is successful according to the user voice collected by the electronic device 1 and the preset first voiceprint feature information, and the user voice collected by the electronic device 1 matches the first wake-up word, the electronic device 1 sends a first voice intensity value corresponding to the user voice to the controller.
In the embodiment of the present application, the distance between the user and the device can be measured by using the magnitude of the voice strength value. Wherein, the larger the voice intensity value is, the closer the distance between the user and the device is (i.e. the smaller the distance value is); vice versa, the closer the distance between the user and the device (i.e., the smaller the distance value), the greater the speech strength value. Here, the voice intensity value may be a volume value of the voice of the user (for example, expressed in decibels (dB)), or may be a magnitude of an electrical signal (for example, expressed in voltage values, in millivolts (mV) or volts (V)), which is not limited in this embodiment of the present application.
In this embodiment of the application, the electronic device 1 may convert the user voice collected by the electronic device 1 into a first electrical signal, and obtain a first voice intensity value according to the first electrical signal. The magnitude of the first electrical signal can be a voltage value, and a voice intensity value is obtained.
It is understood that the magnitude of the electrical signal is proportional to the magnitude of the volume value of the user's voice. That is, under the condition that the distance between the user and the electronic device 1 is fixed, the larger the voice intensity of the user is, the larger the decibel value acquired by the electronic device through the microphone is, and the larger the value of the electric signal converted by the electronic device is.
Alternatively, the electronic device 1 may determine whether to report the first voice intensity value to the controller according to whether the collected voice intensity value (e.g., the volume value of the voice of the user or the magnitude of the electric signal) is greater than or equal to a first preset intensity threshold (also referred to as a reporting threshold).
For example, in a case where the voice intensity value obtained by the electronic device 1 is greater than or equal to the first preset intensity threshold, the electronic device 1 may transmit the voice intensity value to the controller. Alternatively, if the magnitude of the speech electric signal obtained by the electronic device 1 is lower than the reporting threshold, the electronic device 1 may not report to the controller.
The first preset intensity threshold is an intensity threshold preset in the electronic device 1, and serves as a reporting threshold, and the intensity threshold may be set according to actual use requirements. In actual implementation, the strength threshold is a trigger threshold, the voice strength value higher than or equal to the reporting threshold can be reported to the controller, and the voice strength value lower than the reporting threshold is not reported, so that the accuracy of voice interaction can be improved.
It should be further noted that, in the embodiment of the present application, the execution time for converting the user voice into the first electrical signal and obtaining the first voice strength value according to the first electrical signal is not limited, for example, after the user voice identity authentication is passed, the electronic device 1 may convert the user voice into the first electrical signal and obtain the first voice strength value; or, after determining that the user voice is the wake-up voice, the electronic device 1 may convert the user voice into the first electrical signal and obtain the first voice strength value. The method can be determined according to actual use requirements, and the embodiment of the application is not limited.
S105, the electronic equipment 2 collects the voice of the user.
The electronic device 2 may convert the user voice collected by the electronic device 2 into a second electrical signal, and obtain a second voice intensity value according to the second electrical signal.
And S106, the electronic equipment 2 performs user identity authentication according to the user voice and the preset second voiceprint characteristic information.
The preset second voiceprint feature information may be stored in the electronic device 2, the cloud server, and/or the controller. The preset second voiceprint feature information may be completely the same as or partially the same as the preset first voiceprint feature information.
S107, the electronic device 2 compares the collected user voice with the first wake-up language.
And S108, if the voice intensity value is matched with the first voice intensity value, the electronic equipment 2 sends the second voice intensity value corresponding to the user voice to the controller.
Wherein the electronic device 2 may establish a wireless connection (e.g., a wireless connection conforming to a Wi-Fi protocol) and/or a wired connection with the controller.
In this embodiment of the application, if the electronic device 2 determines that the user voice is the wake-up voice and the user is the authorized user according to the user voice collected by the electronic device 2, the second voice intensity value corresponding to the user voice is sent to the controller.
Alternatively, the electronic device 2 may report the second voice intensity value to the controller if the collected voice intensity value (e.g., the volume value of the user's voice or the magnitude of the electrical signal) is greater than or equal to a second preset intensity threshold. The second preset intensity threshold is an intensity threshold preset in the electronic device 2, and is used as a reporting threshold, and the intensity threshold may be set according to actual use requirements. It should be noted that, the values of the first preset intensity threshold and the second preset intensity threshold may be the same or different, and may be determined specifically according to actual use requirements, and the embodiment of the present application is not limited.
For a series of operation steps of the electronic device 2 in the steps S105 to S108 are similar to a series of operation steps of the electronic device 1 in the steps S101 to S104, reference may be specifically made to related descriptions, which are not described herein again.
It should be noted that, the embodiment of the present application does not limit the execution sequence of steps S105 to S108 and steps S101 to S104, that is, steps S105 to S108 may be executed first, and then steps S101 to S104 may be executed; alternatively, steps S101 to S104 may be performed first, and then steps S105 to S108 may be performed; steps S101 to S104 and steps S105 to S108 may also be performed simultaneously. Fig. 2 illustrates the exemplary case where steps S101 to S104 are performed first, and then steps S105 to S108 are performed.
As can be seen from the foregoing steps S101 to S108, the electronic device 1 and the electronic device 2 each have a first wake-up word preset therein, and after the user wake-up word is collected, send respective voice intensity values to the controller, the controller determines which device of the electronic device 1 and the electronic device 2 satisfies the wake-up response condition based on the received voice intensity values, and after the electronic device satisfying the wake-up response condition is determined, instructs the electronic device to start the voice engine and output the wake-up response word through the voice engine for response, and the specific processing procedure of the controller refers to the following steps S109 to S111.
And S109, the controller receives a plurality of voice intensity values sent by a plurality of electronic devices in a preset time length.
The plurality of voice intensity values include a first voice intensity value sent by the electronic device 1 and a second voice intensity value sent by the electronic device 2.
The controller may be set at a routing end, and is connected to both the electronic device 1 and the electronic device 2, so that the controller may not only route the packet transmission of the electronic device 1 or the electronic device 2, but also serve as an intelligent voice control center to perform algorithm decision on the user wake-up voice, and thus the controller may also be referred to as an intelligent router.
In the embodiment of the application, when the user speaks the wake-up language, all electronic devices around the user collect user voices, and transmit the received voice strength value to the intelligent routing end. The intelligent routing end serves as a voice control center, algorithm judgment is carried out based on the voice intensity value sent by each electronic device, and which electronic device in each electronic device meets the preset awakening response condition can be judged.
It should be noted that, the time difference when the controller receives the plurality of voice strength values is within the preset time length range, that is, the controller may perform the related determination of the following step S110 or S111 or S112 based on the plurality of voice strength values received within the preset time length (e.g., within 1 second), which may ensure the accuracy and validity of the determination performed by the controller according to the voice strength values. The specific preset duration range can be determined according to actual use requirements, and the embodiment of the application is not limited.
S110, when a first voice intensity value of the plurality of voice intensity values is the maximum, the controller instructs the electronic device 1 to start the voice engine and respond.
In this embodiment of the application, when a first speech intensity value of the plurality of speech intensity values is the maximum, that is, the electronic device 1 satisfies a preset wake-up response condition, the controller may determine that the electronic device 1 is a device that the user desires to wake up, and instruct the electronic device 1 to start the speech engine and respond.
Alternatively, the controller may instruct the other electronic devices other than the electronic device 1 among the plurality of electronic devices not to respond.
It should be noted that the controller may use the magnitude of the voice intensity value, and may measure the distance between the user and the device, and further may measure or determine the device that the user desires to wake up according to the distance between the user and the device. If the first speech intensity value corresponding to the electronic device 1 is greater than the second speech intensity value corresponding to the electronic device 2, it indicates that the electronic device 1 is closer to the user than the electronic device 2, so that it can be determined that the user tends to wake up the electronic device 1 instead of the electronic device 2.
Based on this, the controller may compare based on the electric signal values fed back by the plurality of electronic devices because the electronic device having the largest voice intensity value is closest to the user, and thus, the electronic device having the largest electric signal value is determined as the electronic device to respond to.
According to the scheme provided by the embodiment of the application, when the user awakens one electronic device of the electronic devices through voice, the electronic device closest to the user can respond, the situation that the electronic devices respond simultaneously is avoided, and therefore the user experience is improved.
Alternatively, in some embodiments, when the speech intensity value of electronic device 1 is greater than the speech intensity value of electronic device 2, the controller may not only instruct electronic device 1 to start the speech engine and respond, but also instruct electronic device 2 not to respond.
In this embodiment of the application, as described above, after the electronic device 1 sends the device identifier and the first voice strength value to the controller, correspondingly, the controller may receive the device identifier and the first voice strength value sent by the electronic device 1, and when the controller determines that the first voice strength value is greater than the second voice strength value, the controller may send an indication message to the electronic device 1 corresponding to the device identifier according to the device identifier, instruct the electronic device 1 to start a voice engine, and output a wake-up response word through the voice engine to respond.
In an exemplary scheme, the controller may transmit an indication message to an electronic device having a maximum voice intensity value among the plurality of electronic devices and a minimum distance value from the user, instructing it to start a voice engine and respond.
For example, when the first voice intensity value is greater than the second voice intensity value and the distance value between the electronic device 1 and the user is less than the distance value between the electronic device 2 and the user, the controller instructs the electronic device 1 to start the voice engine and output a wake-up response message through the voice engine to respond. Therefore, which electronic equipment responds can be judged more accurately, and user experience is improved.
It should be noted that, for the distance value between the electronic device and the user, the specific calculation method may refer to a ranging scheme or an indoor positioning scheme disclosed in the prior art, for example, a Received Signal Strength Indication (RSSI) ranging scheme, and details thereof are not repeated herein.
S111, when the electronic device 1 has the maximum speech intensity value and the historical wake-up times of the electronic device 1 are the maximum (for example, the historical wake-up times of the electronic device 1 is greater than the historical wake-up times of the electronic device 2), the controller instructs the electronic device 1 to start the speech engine and respond.
Illustratively, in a case where the voice intensity value of the electronic device 1 is equal to the voice intensity value of the electronic device 2, when the historical wake-up number of the electronic device 1 is greater than the historical wake-up number of the electronic device 2, the controller instructs the electronic device 1 to start the voice engine and respond.
The historical wake-up times of the electronic device connected with the controller can be recorded and stored by the controller.
And S112, when the electronic equipment 1 has responded for the last time, instructing the electronic equipment 1 to start the voice engine and respond when at least two maximum values exist in the plurality of voice intensity values.
Wherein, the device identification of the electronic device which has responded last time can be recorded and stored by the controller.
It should be noted that, the steps S110, S111 and S112 may alternatively be executed.
Alternatively, when the electronic device 1 and the electronic device 2 both have the maximum voice intensity value, and the historical wake-up times of the electronic device 1 are the same as the historical wake-up times of the electronic device 2, and the electronic device 1 has last responded, the controller instructs the electronic device 1 to output a wake-up responder response.
Alternatively, when the electronic device 1 and the electronic device 2 both have the maximum voice intensity value and the historical wake-up times of the electronic device 1 are less than the historical wake-up times of the electronic device 2, the controller instructs the electronic device 2 to output the wake-up answer to respond.
Alternatively, when both the electronic device 1 and the electronic device 2 have the maximum voice intensity value and the electronic device 2 has responded for the last time, the controller instructs the electronic device 2 to output the wake-up answer to respond.
S113, the electronic device 1 starts the speech engine according to the indication message sent by the controller, and outputs a wake-up response message to respond.
In this way, under the instruction of the controller, the electronic device 1 starts the voice engine and outputs the wake-up response message to respond, that is, the electronic device 1 is woken up by the user through voice, and can support voice interaction with the user to provide services for the user.
According to the scheme provided by the embodiment of the application, when the user awakens one electronic device of the electronic devices through voice, the electronic device closest to the user can respond, the situation that the electronic devices respond simultaneously is avoided, and therefore the user experience is improved.
It should be noted that, for convenience of description, in the first embodiment of the present application, the electronic device 1 and the electronic device 2 are taken as examples for illustration, where the electronic device 1 and the electronic device 2 respectively collect user awakening voices, and convert the collected user awakening voices into electrical signals to obtain corresponding voice intensity values; then the electronic equipment 1 and the electronic equipment 2 respectively send the voice intensity values to the controller; and then the controller judges which electronic equipment 1 or 2 meets the awakening response condition based on the voice intensity values sent by the electronic equipment 1 and 2 respectively, and instructs the electronic equipment meeting the awakening response condition to start the voice engine and output an awakening response word through the voice engine for response.
It can be understood that the application scenarios of the control method based on voice wakeup provided in the first embodiment of the present application include, but are not limited to, the electronic device 1 and the electronic device 2, and in practical implementation, the solution of the present application may also be applied to scenarios of three or more electronic devices with the same preset wakeup words. The electronic equipment respectively collects user awakening voices and converts the collected user awakening voices into electric signals to obtain corresponding voice strength values; then the electronic devices respectively send corresponding voice intensity values to the controller; and then the controller judges which electronic equipment meets the awakening response condition based on the voice intensity values sent by the electronic equipment respectively, and instructs one electronic equipment meeting the awakening response condition to start the voice engine and output an awakening response word through the voice engine for response.
In practical implementation, for a plurality of home networking devices, the networking mode may be a Wi-Fi (wireless) mode or a network cable (wired) mode, because these home networking devices are usually provided with corresponding Media Access Control (MAC) addresses or Internet Protocol (IP) addresses at the intelligent routing end, the controller of the intelligent routing end may identify these home networking devices according to the MAC/IP addresses. The home networking devices may interact with the controller of the intelligent routing end, for example, the intensity value of the user wake-up voice is respectively reported to the controller, the controller of the intelligent routing end performs algorithm judgment, the home devices meeting the preset wake-up response condition are determined, and the home devices meeting the preset wake-up response condition are instructed to start a voice engine and respond according to the MAC/IP address of the home networking devices. That is to say, the intelligent routing end can be used as a voice control center to perform algorithm judgment on the user awakening voice.
The system architecture and the specific implementation of the scheme applied by the scheme provided by the embodiment of the present application are exemplarily described below with reference to fig. 3. As shown in fig. 3 (a), the system architecture 2 includes a plurality of smart home devices, such as a smart screen 10, a smart sound box 11, a smart sound box 12, and the like, and a controller 20 disposed at the smart router. The controller 20 may be connected to each of the plurality of smart home devices in a wired manner or a wireless manner (e.g., Wi-Fi).
Wherein, these a plurality of intelligent household equipment all have the pronunciation and gather the function to support the pronunciation function of awakening up. Assume that the preset wakeup words of the multiple smart home devices leaving the factory are the same, for example, "hello, you". As shown in fig. 3 (a), when the user speaks a user wake-up voice "hello, you", the smart screen 10 collects the user wake-up voice, converts the user wake-up voice into an electrical signal, obtains a voice intensity value 1, and then sends the voice intensity value 1 to the controller 20; the smart sound box 11 collects the user awakening voice, converts the user awakening voice into an electric signal, obtains a voice intensity value 2, and then sends the voice intensity value 2 to the controller 20; the smart sound box 12 collects the user awakening voice, converts the user awakening voice into an electric signal, obtains a voice intensity value 3, and then sends the voice intensity value 3 to the controller 20.
Where the specific who of the plurality of home smart devices will be woken up and respond is decided by the controller 20. The controller 20, as an intelligent voice control center, obtains the voice strength values sent by the plurality of intelligent home devices: voice intensity value 1 sent by the smart screen 10, voice intensity value 2 sent by the smart sound box 11, and voice intensity value 3 sent by the smart sound box 12. When the controller 20 determines that the voice intensity value 2 among the voice intensity value 1, the voice intensity value 2, and the voice intensity value 3 is the maximum value, as shown in (b) of fig. 3, the controller 20 determines the smart speaker 11 transmitting the voice intensity value 2 as a device to be responded to, and the controller 20 instructs the smart speaker 11 to start the voice engine and respond, as shown by a solid line in (b) of fig. 3.
Accordingly, after receiving the instruction from the controller 20, the smart sound box 11 starts the voice engine, and outputs a wake-up response word "on" to respond, so as to implement voice wake-up, and further support voice interaction with the user, so as to provide services for the user.
In addition, when the controller 20 determines that the voice intensity value 2 of the voice intensity value 1, the voice intensity value 2 and the voice intensity value 3 is the maximum value, as shown by the dotted line in (b) of fig. 3, the controller 20 indicates that the smart screen 10 and the smart speaker 12 do not respond, and it can be understood that the smart screen 10 and the smart speaker 12 will not start the voice engine and will not output the wake-up answer through the voice engine.
According to the scheme provided by the embodiment of the application, the plurality of intelligent devices of the instant user home networking adopt the unified awakening words, when the user has the requirement of awakening one intelligent device, the user only needs to be close to the expected awakening intelligent device and speak the awakening words to awaken the intelligent device, and the problem that the requirement of awakening one electronic device by the user cannot be met under the condition that the awakening words of the plurality of intelligent devices are the same at present is solved.
According to the communication system provided by the embodiment of the application, in a scene that the awakening voices of the electronic devices are the same, when a user awakens a certain device through voice, the electronic devices respectively collect the awakening voices of the user and report the voice intensity value of the awakening voices of the user to the controller of the intelligent routing terminal; then, the controller judges which electronic device of the electronic devices meets the wake-up response condition (for example, the voice intensity value is the maximum, the historical wake-up times are the maximum, and/or the electronic device has responded last time) based on the voice intensity value sent by each electronic device, and instructs one electronic device meeting the wake-up response condition to start the voice engine and output the wake-up response words through the voice engine to respond, so that the situation that a plurality of electronic devices respond simultaneously is avoided, and the user experience is improved.
Second embodiment
In the second embodiment, a specific implementation of the scheme of the present application is described specifically from the perspective of an electronic device. Fig. 4 is a flowchart illustrating a voice wake-up based control method according to a second embodiment of the present application. Referring to FIG. 4, the control method 200 includes steps S201-S204 described below.
S201, the electronic equipment collects user voice.
S202, if the user voice is matched with a first awakening language preset in the electronic equipment, the electronic equipment sends a target voice intensity value corresponding to the user voice to the controller, so that the controller judges whether the electronic equipment meets a preset awakening response condition or not based on the target voice intensity value.
Assume that the controller receives M voice intensity values, where the M voice intensity values include a target voice intensity value and other voice intensity values, and the other voice intensity values are voice intensity values sent to the controller by other electronic devices except the electronic device when the collected user voice matches the first wake-up word. Wherein M is a positive integer.
S203, when the electronic equipment meets the preset awakening response condition, the electronic equipment receives an indication message sent by the controller.
The indication message is used for indicating the electronic equipment to start the voice engine and outputting a wake-up response message through the voice engine to respond.
And S204, the electronic equipment starts a voice engine of the electronic equipment according to the indication message, and outputs a wake-up response message through the voice engine to respond.
Therefore, when the electronic equipment meets the preset awakening response condition, under the indication of the controller, the electronic equipment starts the voice engine and outputs the awakening response words to respond, namely, the electronic equipment is awakened by the user through voice, and can support voice interaction with the user and provide services for the user.
In an exemplary scheme, the preset wake-up response condition may include: when the target voice intensity value in the M voice intensity values received by the controller is the maximum, the controller instructs the electronic equipment to start the voice engine and outputs a wake-up response word through the voice engine to respond.
In an exemplary scheme, the preset wake-up response condition may further include: and if the controller detects that the target voice intensity value is maximum and the distance value between the electronic equipment and the user is smaller than the distance values between other electronic equipment and the user, the controller instructs the electronic equipment to start the voice engine and respond. Therefore, which electronic equipment responds can be judged more accurately, and user experience is improved.
According to the scheme provided by the embodiment of the application, when the user awakens one electronic device of the electronic devices through voice, the electronic device closest to the user can respond, the situation that the electronic devices respond simultaneously is avoided, and therefore the user experience is improved.
In an exemplary scheme, the preset wake-up response condition may further include: when the target voice intensity value in the M voice intensity values received by the controller is the maximum and the historical awakening times of the electronic equipment are the maximum, the controller instructs the electronic equipment to start the voice engine and output an awakening response word through the voice engine to respond.
The historical wake-up times of the electronic device connected with the controller can be recorded and stored by the controller.
In an exemplary scheme, the preset wake-up response condition may further include: when the target voice intensity value in the M voice intensity values received by the controller is the maximum and the electronic equipment responds the latest, the controller instructs the electronic equipment to start the voice engine and output a wake-up response word through the voice engine to respond.
Wherein, the device identification of the electronic device which has responded last time can be recorded and stored by the controller.
It should be noted that the preset wake-up response conditions are all exemplified lists, and it is understood that, in actual implementation, the embodiment of the present application includes but is not limited to the above-mentioned wake-up response conditions, which may be determined according to actual needs, and the embodiment of the present application is not limited.
Optionally, in the embodiment of the present application, the controller may be disposed at the routing end, and establish a wireless connection and/or a wired connection with the electronic device and other electronic devices. The controller not only can route the message transmission of the electronic equipment and other electronic equipment, but also can be used as an intelligent voice control center to carry out algorithm judgment on the user awakening voice, so the controller can also be called an intelligent router.
In an exemplary scenario, referring to fig. 4, as shown in fig. 5, after the electronic device collects the user voice (i.e., step S201), the control method 200 further includes the following step S205.
S205, the electronic equipment converts the user voice into a target electric signal and obtains a target voice intensity value according to the target electric signal.
In an exemplary scheme, referring to fig. 5, as shown in fig. 6, after the step S205, the step S202 may be specifically implemented by a step S202A.
S202A, if the user voice matches the first wake-up word and the target voice intensity value is greater than or equal to the preset intensity threshold, the electronic device reports the target voice intensity value to the controller.
The preset intensity threshold is a preset intensity threshold in the electronic device, and serves as a reporting threshold, that is, a voice intensity value higher than or equal to the reporting threshold can be reported to the controller, and a voice intensity value lower than the reporting threshold is not reported to the controller, so that accuracy of voice interaction can be improved.
In an exemplary scenario, referring to fig. 4, as shown in fig. 7, after the electronic device collects the user voice (step S201), the control method 200 further includes the following step S206.
And S206, the electronic equipment performs user identity authentication according to the user voice and the preset voiceprint characteristic information.
Further, as shown in fig. 7, after step S206, step S202 may be specifically implemented by step S202B.
S202B, if the authentication is successful according to the user voice and the user voice matches the first wake-up word, the electronic device sends a target voice intensity value corresponding to the user voice to the controller.
Optionally, in this embodiment of the application, the electronic device may perform user identity authentication according to the user voice and preset voiceprint feature information. And if the identity authentication is successful, comparing the user voice with the first awakening language. And if the text information of the user voice contains the first awakening language or is consistent with the first awakening language, determining that the user voice is matched with the first awakening language. And then, under the condition that the authentication is successful according to the user voice and the user voice is matched with the first awakening language, the electronic equipment sends a target voice intensity value corresponding to the user voice to the controller.
Therefore, under the condition that the user is authenticated after the identity authentication is successful, whether the user voice is the awakening voice is judged, and the privacy safety of the user can be ensured.
Of course, the electronic device may also compare the user voice with the first wake-up language, and if the user voice is matched with the first wake-up language, further perform user identity authentication according to the user voice and the preset voiceprint feature information. And then, under the condition that the user voice is matched with the first awakening language and the user voice authentication is successful, the electronic equipment sends a target voice intensity value corresponding to the user voice to the controller.
Therefore, under the condition that the authorized user is confirmed after the identity authentication is successful, the target voice intensity value corresponding to the user voice is sent to the controller, and the privacy safety of the user can be ensured.
In an exemplary scheme, the preset voiceprint feature information may be stored in the electronic device, the cloud server and/or the controller.
In an exemplary scheme, the electronic device may send a device identifier of the electronic device and a target speech strength value to the controller, where the target speech strength value has a corresponding relationship with the device identifier. The device identification uniquely represents the electronic device.
Therefore, when the electronic device meets the preset awakening response condition, the controller can send an indication message to the electronic device corresponding to the device identifier according to the device identifier, instruct the electronic device 1 to start the voice engine and output the awakening response words through the voice engine to respond.
According to the scheme provided by the embodiment of the application, the plurality of intelligent devices of the instant user home networking adopt the unified awakening words, when the user has the requirement of awakening one intelligent device, the user only needs to be close to the expected awakening intelligent device and speak the awakening words to awaken the intelligent device, and the problem that the requirement of awakening one electronic device by the user cannot be met under the condition that the awakening words of the plurality of intelligent devices are the same at present is solved.
The method 200 provided in the second embodiment of the present application can be used to perform the actions performed on the electronic device side in the first embodiment above. For the description of the specific steps and the beneficial effects, reference may be made to the related description in the first embodiment, and for brevity, no further description is provided herein.
In the method provided by the embodiment of the application, the electronic equipment can acquire the voice of a user; if the user voice is matched with a first awakening language preset in the electronic equipment, the electronic equipment sends a target voice intensity value corresponding to the user voice to the controller, so that the controller judges whether the electronic equipment meets a preset awakening response condition or not based on the target voice intensity value; when the electronic equipment meets a preset awakening response condition, the electronic equipment receives an indication message sent by a controller; and the electronic equipment starts a voice engine of the electronic equipment according to the indication message, and outputs the awakening response words through the voice engine to respond. According to the scheme, in a scene that a plurality of electronic devices are identical in awakening voice, when a user awakens a certain device through voice, the plurality of electronic devices respectively collect the user awakening voice and report the voice intensity value of the user awakening voice to the controller of the intelligent routing end; then, the controller judges which electronic device of the electronic devices meets the wake-up response condition (for example, the voice intensity value is the maximum, the historical wake-up times are the maximum, and/or the electronic device has responded last time) based on the voice intensity value sent by each electronic device, and instructs one electronic device meeting the wake-up response condition to start the voice engine and output the wake-up response words through the voice engine to respond, so that the situation that a plurality of electronic devices respond simultaneously is avoided, and the user experience is improved.
Third embodiment
In a third embodiment, a specific implementation of the scheme of the present application is described in the context of a specific controller. Fig. 8 is a flowchart illustrating a voice wake-up based control method according to a third embodiment of the present application. Referring to FIG. 8, the control method 300 includes steps S301-S303 described below.
S301, the controller obtains a first voice intensity value and a second voice intensity value within a preset time length.
The first voice intensity value is a voice intensity value corresponding to the matching of the user voice collected by the electronic device 1 and the first awakening language, and the second voice intensity value is a voice intensity value corresponding to the matching of the user voice collected by the electronic device 2 and the first awakening language;
and S302, the controller determines whether the electronic equipment 1 and the electronic equipment 2 meet a preset awakening response condition according to the first voice intensity value and the second voice intensity value.
S303, when the electronic device 1 meets the preset wake-up response condition, the controller instructs the electronic device 1 to start the speech engine and respond.
For the description of the preset wake-up response condition, reference may be specifically made to the description of the preset wake-up response condition in the first embodiment and the second embodiment, and details are not described herein again.
In the embodiment of the present application, the controller may be disposed at the routing end, and establish a wireless connection and/or a wired connection with the electronic device 1 and the electronic device 2.
In the embodiment of the application, when the user speaks the wake-up language, all electronic devices around the user collect user voices, and transmit the received voice strength value to the intelligent routing end. The intelligent routing end serves as a voice control center, algorithm judgment is carried out based on the voice intensity value sent by each electronic device, and which electronic device in each electronic device meets the preset awakening response condition can be judged.
In an exemplary scheme, when the controller detects that the first voice intensity value is greater than the second voice intensity value, the controller determines that the electronic device 1 satisfies a preset wake-up response condition, and instructs the electronic device 1 to start a voice engine and output a wake-up response message through the voice engine to respond.
In an exemplary scheme, when the controller detects that the first voice intensity value is greater than the second voice intensity value and the first distance value between the electronic device 1 and the user is less than the second distance value between the electronic device 2 and the user, the controller determines that the electronic device 1 meets a preset wake-up response condition and instructs the electronic device 1 to start a voice engine and output a wake-up response word through the voice engine to respond.
Through the scheme provided by the embodiment of the application, when a user wants to awaken a certain electronic device through voice, the electronic device closest to the user can respond, the situation that a plurality of electronic devices respond simultaneously is avoided, and therefore the user use experience is improved.
In an exemplary scheme, when the controller detects that the historical wake-up times of the first electronic device is greater than the historical wake-up times of the second electronic device under the condition that the controller detects that the first voice intensity value is equal to the second voice intensity value, the controller determines that the electronic device 1 meets a preset wake-up response condition, and instructs the electronic device 1 to start a voice engine and output a wake-up response word through the voice engine for responding.
In an exemplary scheme, in the case that the controller detects that the first voice intensity value is equal to the second voice intensity value, when the electronic device 1 is a device that has responded last time, the controller determines that the electronic device 1 satisfies a preset wake-up response condition and instructs the electronic device 1 to start a voice engine and output a wake-up answer to respond through the voice engine.
In an exemplary aspect, the control method 300 further includes: when the first voice intensity value is greater than the second voice intensity value, the controller determines that the electronic device 2 does not meet the preset wake-up response condition and indicates that the electronic device 2 does not respond.
In an exemplary aspect, the control method 300 further includes: under the condition that the first voice intensity value is equal to the second voice intensity value, when the historical awakening times of the electronic equipment 1 are larger than the historical awakening times of the electronic equipment 2 or the latest responded equipment is the electronic equipment 1, the controller determines that the electronic equipment 2 does not meet the preset awakening response condition and indicates that the electronic equipment 2 does not respond.
In an exemplary scheme, the controller may receive the device identifier 1 and a first voice strength value sent by the electronic device 1, where the first voice strength value has a corresponding relationship with the device identifier. Further, when the controller detects that the first voice intensity value is greater than the second voice intensity value, the controller determines that the electronic device 1 meets a preset wake-up response condition, and sends an indication message to the electronic device 1 corresponding to the device identifier 1 according to the device identifier 1, and instructs the electronic device 1 to start a voice engine and output a wake-up response word through the voice engine for response.
It should be noted that, in this embodiment of the application, the controller may determine which electronic device satisfies the preset wake-up response condition based on the first voice strength value and the second voice strength value received within a preset time duration (e.g., within 1 second), so as to ensure accuracy and validity of determination performed by the controller according to the voice strength value. The specific preset duration range can be determined according to actual use requirements, and the embodiment of the application is not limited.
In this way, under the instruction of the controller, the electronic device 1 starts the voice engine and outputs the wake-up response message to respond, that is, the electronic device 1 is woken up by the user through voice, and can support voice interaction with the user to provide services for the user.
For convenience of description, in the third embodiment of the present application, the electronic device 1 and the electronic device 2 are taken as examples for illustration, where the electronic device 1 and the electronic device 2 respectively collect user awakening voices, and convert the collected user awakening voices into electrical signals to obtain corresponding voice intensity values; then the electronic equipment 1 and the electronic equipment 2 respectively send the voice intensity values to the controller; and then the controller judges which electronic equipment 1 or 2 meets the awakening response condition based on the voice intensity value sent by the electronic equipment 1 or 2 respectively, and instructs the electronic equipment meeting the awakening response condition to start the voice engine and respond.
It can be understood that the application scenarios of the control method based on voice wakeup provided in the third embodiment of the present application include, but are not limited to, the electronic device 1 and the electronic device 2, and in actual implementation, the scheme of the present application may also be applied to scenarios of three or more electronic devices with the same preset wakeup words, and the specific implementation steps are similar to the implementation steps in the third embodiment, and are not described again here.
The method 300 provided by the third embodiment of the present application can be used to perform the actions performed by the controller side in the first embodiment above. For the description of the specific steps and the beneficial effects, reference may be made to the related description in the first embodiment, and for brevity, no further description is provided herein.
In the method provided by the embodiment of the application, the controller obtains a first voice intensity value and a second voice intensity value within a preset time period (the first voice intensity value is a voice intensity value corresponding to the user voice collected by the electronic device 1 when the user voice is matched with the first wake-up word, and the second voice intensity value is a voice intensity value corresponding to the user voice collected by the electronic device 2 when the user voice is matched with the first wake-up word); the controller determines whether the electronic equipment 1 and the electronic equipment 2 meet a preset awakening response condition according to the first voice intensity value and the second voice intensity value; when the electronic device 1 satisfies the preset wake-up response condition, the controller instructs the electronic device 1 to start the speech engine and respond. According to the scheme, in a scene that a plurality of electronic devices are identical in awakening voice, when a user awakens a certain device through voice, the plurality of electronic devices respectively collect the user awakening voice and report the voice intensity value of the user awakening voice to the controller of the intelligent routing end; then, the controller judges which electronic device of the electronic devices meets the wake-up response condition (for example, the voice intensity value is the maximum, the historical wake-up times are the maximum, and/or the electronic device has responded last time) based on the voice intensity value sent by each electronic device, and instructs one electronic device meeting the wake-up response condition to start the voice engine and output the wake-up response words through the voice engine to respond, so that the situation that a plurality of electronic devices respond simultaneously is avoided, and the user experience is improved.
It should be further noted that, in the above embodiments of the present application, the controller receives two or more speech intensity values and then performs the determination, and in actual implementation, the embodiments of the present application also support a scenario where the controller receives one speech intensity value to perform the determination, where the controller may determine: and the electronic equipment sending the voice intensity value meets a preset awakening response condition and indicates the electronic equipment to start a voice engine and respond.
According to the scheme provided by the embodiment of the application, the plurality of intelligent devices of the instant user home networking adopt the unified awakening words, when the user has the requirement of awakening one intelligent device, the user only needs to be close to the expected awakening intelligent device and speak the awakening words to awaken the intelligent device, and the problem that the requirement of awakening one electronic device by the user cannot be met under the condition that the awakening words of the plurality of intelligent devices are the same at present is solved.
In the embodiment of the application, when the equipment is newly added on the basis of the existing multiple equipment, the non-perception integrated home voice system can be realized through the embodiment of the application. For example, for complex scenes such as study, kitchen, LOFT two-layer and the like, "integrated intelligent voice awakening" of the whole house at any time and any place can be realized through the embodiment of the application. That is, the scheme can be realized by normal household networking without any configuration of the user.
Compared with the prior art, in the embodiment of the application, each electronic device can upload the voice signal to the intelligent router. The reporting mechanism is not available in the prior art. The reporting mechanism that each electronic device uploads the voice signal to the intelligent router can be analyzed and checked through software codes, log files (logs) and the like.
In addition, compared with the prior art, the difference is that in the embodiment of the application, the intelligent router can make a decision according to the strength of the voice signal reported by each electronic device. The speech strength decision mechanism is not available in the prior art. The voice intensity judgment mechanism of the intelligent router can be analyzed and checked through software codes, logs and the like.
The embodiment of the present application does not particularly limit a specific structure of an execution subject of the method provided by the embodiment of the present application, as long as communication can be performed by the method provided by the embodiment of the present application by running a program in which codes of the method provided by the embodiment of the present application are recorded. For example, an execution main body of the method provided by the embodiment of the present application may be an electronic device or a router device, or a functional module capable of calling a program and executing the program in the electronic device or the router device.
It should be noted that in the embodiments of the present application, "greater than" may be replaced by "greater than or equal to" and "less than or equal to" may be replaced by "less than", or "greater than or equal to" may be replaced by "greater than" and "less than" may be replaced by "less than or equal to".
The various embodiments described herein may be implemented as stand-alone solutions or combined in accordance with inherent logic and are intended to fall within the scope of the present application.
It is to be understood that the methods and operations implemented by the electronic device in the above method embodiments may also be implemented by components (e.g., chips or circuits) that can be used in the electronic device.
Embodiments of the methods provided herein are described above, and embodiments of the apparatus provided herein are described below. It should be understood that the description of the apparatus embodiments corresponds to the description of the method embodiments, and therefore, for brevity, details are not repeated here, since the details that are not described in detail may be referred to the above method embodiments.
The solutions provided by the embodiments of the present application have been described above primarily in terms of method steps. It is understood that, in order to implement the above functions, the electronic device implementing the method includes corresponding hardware structures and/or software modules for performing the respective functions. Those of skill in the art would appreciate that the various illustrative components and algorithm steps described in connection with the embodiments disclosed herein may be implemented as hardware or combinations of hardware and computer software. Whether a function is performed as hardware or computer software drives hardware depends upon the particular application and design constraints imposed on the solution. Skilled artisans may implement the described functionality in varying ways for each particular application, but such implementation decisions should not be interpreted as causing a departure from the scope of the present application.
In the embodiment of the present application, according to the method example, the electronic device may be divided into the functional modules, for example, each functional module may be divided corresponding to each function, or two or more functions may be integrated into one processing module. The integrated module can be realized in a hardware mode, and can also be realized in a software functional module mode. It should be noted that, the division of the modules in the embodiment of the present application is schematic, and is only one logical function division, and other feasible division manners may be available in actual implementation. The following description will be given taking the example of dividing each functional module corresponding to each function.
Fig. 9 is a schematic block diagram of a user device 400 based on voice wake-up according to an embodiment of the present application. The apparatus 400 is used to execute the voice wake-up based control method provided in the embodiments corresponding to fig. 4 to fig. 7, and the apparatus 400 may be used to execute the actions performed by the electronic device in the above method embodiments.
In the embodiment of the present application, the function modules of the electronic device may be divided according to the control method based on voice wakeup provided in the embodiments corresponding to fig. 4 to fig. 7, for example, each function module may be divided corresponding to each function, or two or more functions may be integrated into one processing module. The integrated module can be realized in a hardware mode, and can also be realized in a software functional module mode. The division of the modules in the embodiment of the present application is schematic, and is only a logic function division, and there may be another division manner in actual implementation.
In the case of adopting a method of dividing each function module corresponding to each function, as shown in fig. 9, there is provided a method of dividing function modules by an electronic device, including: a voice acquisition unit 410, a processing unit 420, a transceiving unit 430 and a voice output unit 440. The functions implemented by the unit modules provided by the present application are specifically as follows:
a voice collecting unit 410 for collecting user voice;
the processing unit 420 is configured to determine whether the user voice matches a first wakeup word preset in the electronic device;
the transceiving unit 430 is configured to send a target voice strength value corresponding to the user voice to the controller if the user voice matches a first wake-up word preset in the electronic device, so that the controller determines whether the electronic device meets a preset wake-up response condition based on the target voice strength value;
the transceiving unit 430 is further configured to receive a first indication message sent by the controller when the electronic device meets a preset wake-up response condition;
a voice output unit 440, configured to output a wakeup response message in response to the first indication message;
wherein, the preset wake-up response condition may include: when the target voice intensity value in the plurality of voice intensity values sent by the plurality of pieces of electronic equipment received by the controller is the maximum, the controller instructs the electronic equipment to output a wakeup response message for responding; the awakening words of the electronic devices are the same.
In some embodiments, the voice output unit 440 is specifically configured to start the voice engine in response to the first indication message, and output a wake-up response message through the voice engine to respond.
In some embodiments, the preset wake-up response condition further includes:
when at least two voice intensity values in the plurality of voice intensity values are equal and are the maximum values, if the historical awakening times of the electronic equipment in the plurality of electronic equipment are the maximum, the controller instructs the electronic equipment to output an awakening response word for responding;
or when at least two of the voice intensity values are equal and are maximum values, if the electronic device in the plurality of electronic devices has responded last time, the controller instructs the electronic device to output a wake-up response word for responding.
In some embodiments, the controller is disposed at the routing side.
In some embodiments, the electronic device establishes a wireless connection and/or a wired connection with the controller before the electronic device collects the user's voice.
In some embodiments, the processing unit 420 is further configured to: after the voice collecting unit 410 collects the user voice, the user voice is converted into a target electric signal, and a target voice intensity value is obtained according to the target electric signal.
In some embodiments, the transceiving unit 430 is specifically configured to: and sending a target voice intensity value corresponding to the voice of the user to the controller under the condition that the target voice intensity value is greater than or equal to the preset intensity threshold value.
In some embodiments, the transceiving unit 430 is specifically configured to: and under the condition that the user identity authentication is determined to be successful according to the user voice and the preset voiceprint characteristic information and the user voice is matched with the first awakening language, sending a target voice intensity value corresponding to the user voice to the controller.
In some embodiments, the determining that the user identity authentication is successful according to the user voice and the preset voiceprint feature information, and the user voice is matched with the first wake-up word includes:
performing user identity authentication according to the user voice and preset voiceprint characteristic information;
if the identity authentication is successful, comparing the user voice with the first awakening language;
and if the text information of the user voice contains the first awakening language or is consistent with the first awakening language, determining that the user voice is matched with the first awakening language.
In some embodiments, the preset voiceprint feature information may be stored in each of the plurality of electronic devices, the cloud server and/or the controller.
In some embodiments, the sending the target speech strength value corresponding to the user speech to the controller includes: and sending the equipment identification and the target voice intensity value of the electronic equipment to the controller, wherein the target voice intensity value has a corresponding relation with the equipment identification.
Of course, the electronic device provided in the embodiments of the present application includes, but is not limited to, the above-mentioned unit, and for example, the electronic device may further include a storage unit. The memory unit may be used for storing program codes of the electronic device. All relevant contents of each step related to the above method embodiment may be referred to the functional description of the corresponding functional module, and are not described herein again.
According to the user device based on voice awakening, in a scene that the awakening voices of a plurality of electronic devices are the same, when a user awakens a certain device through the voice, the plurality of electronic devices respectively collect the awakening voices of the user and report the voice intensity value of the awakening voices of the user to a controller of an intelligent routing end; then, the controller judges which electronic device of the electronic devices meets the wake-up response condition (for example, the voice intensity value is the maximum, the historical wake-up times are the maximum, and/or the electronic device has responded last time) based on the voice intensity value sent by each electronic device, and instructs one electronic device meeting the wake-up response condition to output a wake-up response message for response, so that the situation that a plurality of electronic devices respond simultaneously is avoided, and the use experience of a user is improved.
The apparatus 400 according to the embodiment of the present application may correspond to performing the method described in the embodiment of the present application, and the above and other operations and/or functions of the units in the apparatus 400 are respectively for implementing corresponding flows of the method, and are not described herein again for brevity.
Fig. 10 is a schematic block diagram of a device 500 based on voice wake-up provided in an embodiment of the present application. The apparatus 500 is used to execute the voice wake-up based control method provided in the corresponding embodiment of fig. 8, and the apparatus 500 may be used to execute the actions performed by the controller in the above method embodiment.
In the embodiment of the present application, the device 500 may be divided into functional modules according to the voice wake-up based control method provided in the embodiment corresponding to fig. 8, for example, each functional module may be divided corresponding to each function, or two or more functions may be integrated into one processing module. The integrated module can be realized in a hardware mode, and can also be realized in a software functional module mode. The division of the modules in the embodiment of the present application is schematic, and is only a logic function division, and there may be another division manner in actual implementation.
In the case of dividing each function module according to each function, as shown in fig. 10, there is provided a method for dividing a function module by a controller, including: a transceiving unit 510 and a processing unit 520. The functions implemented by the unit modules provided by the present application are specifically as follows:
the transceiving unit 510 is configured to receive voice intensity values sent by a plurality of electronic devices within a preset time period;
the processing unit 520 is configured to compare the voice intensity values, where the voice intensity value is a voice intensity value corresponding to when the user voice collected by each electronic device matches a preset wake-up word, where each electronic device corresponds to one voice intensity value, and the preset wake-up words of the electronic devices are the same;
the transceiving unit 510 is further configured to send a first indication message to a target electronic device with a maximum voice intensity value among the plurality of electronic devices, where the first indication message is used to instruct the target electronic device to output a wake-up response to respond.
The first indication message may be used to instruct the target electronic device to start a speech engine and output a wake-up response message for response.
In some embodiments, the processing unit 520 is further configured to, when at least two of the voice strength values are equal to each other and are maximum values, instruct the electronic device to output a wake-up response message to respond if it is determined that the historical wake-up frequency of the electronic device in the plurality of electronic devices is the maximum;
in some embodiments, the processing unit 520 is further configured to, when at least two of the voice intensity values are equal to each other and are maximum values, instruct the electronic device to output a wake-up response message to respond if it is determined that the electronic device has responded for the last time in the plurality of electronic devices.
In some embodiments, the apparatus 500 may be disposed at a routing end.
In some embodiments, the apparatus 500 establishes wireless connections and/or wired connections with a plurality of electronic devices, respectively.
In some embodiments, the transceiver unit 510 is further configured to transmit second indication information to other electronic devices in the plurality of electronic devices except the target electronic device, where the second indication information indicates that the other electronic devices do not respond.
In some embodiments, the sending the first indication message to the target electronic device with the largest voice intensity value among the plurality of electronic devices includes:
if the target electronic equipment in the plurality of electronic equipment has the maximum voice intensity value and the distance value between the target electronic equipment and the user in the plurality of electronic equipment is the minimum, the controller sends a first indication message to the target electronic equipment.
In some embodiments, the receiving, within the preset time period, the voice intensity values sent by the plurality of electronic devices includes: receiving equipment identifications and voice intensity values sent by a plurality of electronic equipment within a preset time, wherein the voice intensity values have a corresponding relation with the equipment identifications, and each electronic equipment corresponds to one equipment identification;
wherein, the sending the first indication message to the target electronic device with the maximum voice intensity value in the plurality of electronic devices includes: if the target electronic equipment in the plurality of electronic equipment has the maximum voice intensity value, the controller sends a first indication message to the target electronic equipment according to the equipment identification of the target electronic equipment.
Of course, the electronic device provided in the embodiments of the present application includes, but is not limited to, the above-mentioned unit, and for example, the electronic device may further include a storage unit. The memory unit may be used for storing program codes of the electronic device. All relevant contents of each step related to the above method embodiment may be referred to the functional description of the corresponding functional module, and are not described herein again.
According to the user device based on voice awakening, when the collected user voice is matched with the first awakening voice, each electronic device in the plurality of electronic devices sends the voice intensity value corresponding to the user voice to the controller. The controller receives a plurality of voice intensity values sent by a plurality of electronic devices, and compares the voice intensity values, wherein each electronic device in the plurality of electronic devices corresponds to one voice intensity value. The controller sends first indication information to a target electronic device corresponding to the maximum voice intensity value in the plurality of electronic devices, and the first indication information is used for indicating the target electronic device to output a first awakening response word for responding. And the target electronic equipment outputs a first awakening response word to respond according to the first indication message sent by the controller.
According to the scheme, in a scene that the awakening voices of the electronic devices are the same, when a user awakens a certain device through voice, the electronic devices respectively collect the awakening voices of the user and report the voice intensity value of the awakening voices of the user to the controller of the intelligent routing end; then, the controller judges which electronic device of the electronic devices meets the wake-up response condition (for example, the voice intensity value is the maximum, the historical wake-up times are the maximum, and/or the electronic device has responded last time) based on the voice intensity value sent by each electronic device, and instructs one electronic device meeting the wake-up response condition to output a wake-up response message for response, so that the situation that a plurality of electronic devices respond simultaneously is avoided, and the use experience of a user is improved.
The apparatus 500 according to the embodiment of the present application may correspond to performing the method described in the embodiment of the present application, and the above and other operations and/or functions of the units in the apparatus 500 are respectively for implementing corresponding flows of the method, and are not described herein again for brevity.
Fig. 11 is a hardware schematic diagram of an electronic device 600 provided in an embodiment of the present application. As shown in fig. 11, the electronic device 600 may include: processor 601, memory 602, wireless communication module 603, audio module 604, and power module 605, among others, which may communicate via one or more communication buses or signal lines (not shown). The various components of the electronic device are described in detail below with reference to fig. 11.
The processor 601 is a control center of the electronic device, connects various parts of the electronic device using various interfaces and lines, and performs various functions of the electronic device and processes data by running or executing an application program stored in the memory 602 and calling data stored in the memory 602. The processor 601 may include one or more processing units, and different processing units may be independent devices or may be integrated into one or more of the processors 601. The processor 601 may be specifically an integrated control chip, or may be composed of a circuit including various active and/or passive components, and the circuit is configured to execute the functions belonging to the processor 601 described in the embodiments of the present application.
It should be understood that, in the embodiment of the present application, the processor 601 may adopt a Central Processing Unit (CPU). The processor may also be other general purpose processors, Digital Signal Processors (DSPs), Application Specific Integrated Circuits (ASICs), Field Programmable Gate Arrays (FPGAs) or other programmable logic devices, discrete gate or transistor logic devices, discrete hardware components, or the like. A general purpose processor may be a microprocessor or the processor may be any conventional processor or the like. Or the processor 601 may employ one or more integrated circuits for executing related programs to implement the technical solutions provided in the embodiments of the present application.
The memory 602 is used to store application programs and data, and the processor 601 executes various functions and data processing of the electronic device by operating the application programs and data stored in the memory 602. The memory 602 mainly includes a program storage area and a data storage area, wherein the program storage area can store an operating system and an application program required by at least one function; the storage data area may store data created according to use of the electronic device.
In addition, the memory 602 may include read-only memory, random access memory, and may also include non-volatile memory, such as a disk storage device, flash memory device, or other volatile solid state storage device, and provides instructions and data to the processor 601. The memory 602 may store various operating systems. The memory 602 may be independent of the processor 601 and connected to the processor 601 through the communication bus; alternatively, the memory 602 may be integrated with the processor 601.
The wireless communication module 603 may be used to support wireless connection and wireless communication between electronic devices through wireless communication technology. In some embodiments, the wireless communication module 603 may be a Bluetooth chip, or a Wi-Fi chip. For example, the electronic device 600 may establish a wireless connection with a Wi-Fi chip of the intelligent routing end through the Wi-Fi chip, so as to implement wireless communication and service processing between the electronic device 600 and the intelligent routing end through the wireless connection.
The wireless communication module 603 may further include an antenna, and the wireless communication module 603 may receive an electromagnetic wave via the antenna, frequency-modulate and filter an electromagnetic wave signal, and transmit the processed signal to the processor 601. The wireless communication module 603 may also receive a signal to be transmitted from the processor 601, perform frequency modulation and amplification on the signal, and convert the signal into electromagnetic waves through the antenna to radiate the electromagnetic waves.
The audio module 604 may be used to manage audio data, enabling the electronic device to input and output audio streams. The audio module 604 may include a speaker 6041 (or called an earphone or a receiver) for outputting an audio stream, a microphone 6042 (or called a microphone or a microphone), a microphone receiving circuit cooperating with the microphone, and the like. The speaker 6041 may be used to convert an audio electric signal into a sound signal and play it. The microphone 6042 may be used to convert a sound signal into an audio electric signal.
The power module 605 may provide power to various components, for example, the power module 605 may include a battery and a power management chip, and the battery may be logically connected to the processor 601 through the power management chip, so as to implement functions of managing charging, discharging, and power consumption through the power module 605.
In some embodiments, the electronic device may or may not also include a display (or display screen). The display may be used to display a display interface of the APP, such as a currently playing song, etc. The display includes a display panel, and the display panel may adopt a Liquid Crystal Display (LCD), an organic light-emitting diode (OLED), an active-matrix organic light-emitting diode (active-matrix organic light-emitting diode, AMOLED), a flexible light-emitting diode (FLED), a quantum dot light-emitting diode (QLED), and the like. In some embodiments, a touch sensor may be disposed in the display to form a touch screen, which is not limited in this application. The touch sensor is used to detect a touch operation applied thereto or nearby. The touch sensor may communicate the detected touch operation to the processor 601 to determine the touch event type. The processor 601 may provide visual output related to touch operations through a display.
In some embodiments, the electronic device may further include more devices, such as a USB interface, a sensor, and the like, which are not described herein again. The various components shown in fig. 11 may be implemented in hardware, software, or a combination of hardware and software, including one or more signal processing or application specific integrated circuits. It is to be understood that the illustrated structure of the embodiments of the present application does not constitute a specific limitation to electronic devices. The electronic device may also include more or fewer components than shown, or combine certain components, or split certain components, or a different arrangement of components.
Optionally, the electronic device 600 may further include a bus. The memory 602 and the wireless communication module 603 may be connected to the processor 601 through a bus. The bus may be a Peripheral Component Interconnect (PCI) bus, an Extended Industry Standard Architecture (EISA) bus, or the like. The bus may be divided into an address bus, a data bus, a control bus, etc.
In the embodiment of the present application, the processor 601 in the electronic device 600 shown in fig. 11 may correspond to the processing unit 420 in the apparatus 400 in fig. 9, the microphone 6042 in the electronic device 600 may correspond to the voice acquisition unit 410 in the apparatus 400, the speaker 6041 may correspond to the voice output unit 440 in the apparatus 400, and the wireless communication module 603 may correspond to the transceiving unit 430 in the apparatus 400. Moreover, the above and other operations and/or functions of each unit in the electronic device 600 are respectively for implementing corresponding flows of the method, and are not described herein again for brevity.
When the electronic device 600 is running, the electronic device 600 executes computer-executable instructions in the memory 602 to perform the operational steps of any of the methods of fig. 4-7 described above by the electronic device 600.
Optionally, the electronic device in the embodiment of the present application may be a mobile terminal that supports a voice wake-up function, and may also be a non-mobile terminal that supports the voice wake-up function. For example, the mobile terminal may be a smart speaker, a mobile phone, a tablet computer, a laptop computer, a palmtop computer, a vehicle-mounted terminal, various devices such as a wearable device (e.g., a smart watch), an ultra-mobile personal computer (UMPC), a netbook, or a Personal Digital Assistant (PDA) that support a voice wake-up function, and the non-mobile terminal may be various devices such as a Personal Computer (PC), a smart screen, a teller machine, or a self-service machine that support a voice wake-up function, which is not limited in this embodiment.
Embodiments of the present application also provide a computer-readable storage medium having instructions stored therein; when it is run on the electronic device, it causes the electronic device to perform the voice wake-up based control method as described above.
Embodiments of the present application also provide a computer program product comprising instructions, which, when run on an electronic device, cause the electronic device to perform the above-mentioned voice wake-up based control method.
An embodiment of the present application further provides a chip system, where the chip system includes a processor, and is used to support an electronic device to implement the above control method based on voice wakeup. In one possible design, the system-on-chip further includes a memory. The memory is used for storing program instructions and data of the electronic equipment. Of course, the memory may not be in the system-on-chip. The chip system may be formed by a chip, and may also include a chip and other discrete devices, which is not specifically limited in this embodiment of the present application.
Fig. 12 is a schematic structural diagram of a controller 700 provided in an embodiment of the present application. The controller 700 includes: processor 710, memory 720, communication interface 730, bus 740.
The processor 710 may be coupled to the memory 720. The memory 720 may be used for storing the program codes and data. Therefore, the memory 720 may be a storage unit inside the processor 710, an external storage unit independent of the processor 710, such as an independent server or cloud storage, or a component including a storage unit inside the processor 710 and an external storage unit independent of the processor 710.
Optionally, the controller 700 may also include a bus 740. The memory 720 and the communication interface 730 may be connected to the processor 710 via a bus 740. The bus 740 may be a Peripheral Component Interconnect (PCI) bus, an Extended Industry Standard Architecture (EISA) bus, or the like. The bus 740 may be divided into an address bus, a data bus, a control bus, and the like. For ease of illustration, only one line is shown in FIG. 12, but this does not represent only one bus or one type of bus.
It should be understood that, in the embodiment of the present application, the processor 710 may employ a Central Processing Unit (CPU). The processor may also be other general purpose processors, Digital Signal Processors (DSPs), Application Specific Integrated Circuits (ASICs), Field Programmable Gate Arrays (FPGAs) or other programmable logic devices, discrete gate or transistor logic devices, discrete hardware components, or the like. A general purpose processor may be a microprocessor or the processor may be any conventional processor or the like. Or the processor 710 may employ one or more integrated circuits for executing related programs to implement the technical solutions provided in the embodiments of the present application.
The memory 720 may include both read-only memory and random-access memory, and provides instructions and data to the processor 710. A portion of the processor 710 may also include non-volatile random access memory. For example, the processor 710 may also store information of the device type.
When the controller 700 is running, the processor 710 executes the computer-executable instructions in the memory 720 to perform the operational steps of the method illustrated in FIG. 8 described above.
In the embodiment of the present application, the processor 610 in the controller 700 shown in fig. 12 may correspond to the processing unit 520 in the apparatus 500 in fig. 10, and the communication interface 730 in the controller 700 may correspond to the transceiver unit 510 in the apparatus 500. Moreover, the above and other operations and/or functions of each unit in the controller 700 are respectively for implementing corresponding flows of the method, and are not described herein again for brevity.
Alternatively, the controller 700 may be an intelligent router, or any other possible controller, and the embodiment of the present application does not specifically limit the device type of the controller 700.
It should be understood that the controller 700 according to the embodiment of the present application may correspond to the apparatus 500 in the embodiment of the present application. The above and other operations and/or functions of the units in the apparatus 500 are respectively used for implementing the corresponding flows of the above method, and are not described herein again for brevity.
Embodiments of the present application also provide a computer-readable storage medium having instructions stored therein; which when run on the controller causes the controller to perform the voice wake-up based control method as described above.
Embodiments of the present application also provide a computer program product comprising instructions which, when run on a controller, cause the controller to perform the voice wake-up based control method as described above.
An embodiment of the present application further provides a chip system, where the chip system includes a processor, and is configured to support the controller to implement the above control method based on voice wakeup. In one possible design, the system-on-chip further includes a memory. The memory is used for storing program instructions and data of the controller. Of course, the memory may not be in the system-on-chip. The chip system may be formed by a chip, and may also include a chip and other discrete devices, which is not specifically limited in this embodiment of the present application.
In the above embodiments, all or part of the implementation may be realized by software, hardware, firmware or any combination thereof. When implemented using a software program, may take the form of a computer program product, either entirely or partially. The computer program product includes one or more computer instructions. The procedures or functions according to the embodiments of the present application are all or partially generated when the computer program instructions are loaded and executed on a computer. The computer may be a general purpose computer, a special purpose computer, a network of computers, or other programmable device. The computer instructions may be stored in a computer readable storage medium or transmitted from one computer readable storage medium to another, e.g., the computer instructions may be transmitted from one website, computer, server, or data center to another website, computer, server, or data center via wired (e.g., coaxial cable, fiber optic, Digital Subscriber Line (DSL)) or wireless (e.g., infrared, wireless, microwave, etc.) means. The computer-readable storage medium can be any available medium that can be accessed by a computer or a data electronics device including one or more available media integrated servers, data centers, and the like. The usable medium may be a magnetic medium (e.g., floppy disk, hard disk, magnetic tape), an optical medium (e.g., DVD), or a semiconductor medium (e.g., Solid State Disk (SSD)), among others.
Through the above description of the embodiments, it is clear to those skilled in the art that, for convenience and simplicity of description, the foregoing division of the functional modules is merely used as an example, and in practical applications, the above function distribution may be completed by different functional modules according to needs, that is, the internal structure of the device may be divided into different functional modules to complete all or part of the above described functions.
In the several embodiments provided in the present application, it should be understood that the disclosed apparatus and method may be implemented in other ways. For example, the above-described device embodiments are merely illustrative, and for example, the division of the modules or units is only one logical functional division, and there may be other divisions when actually implemented, for example, a plurality of units or components may be combined or may be integrated into another device, or some features may be omitted, or not executed. In addition, the shown or discussed mutual coupling or direct coupling or communication connection may be an indirect coupling or communication connection through some interfaces, devices or units, and may be in an electrical, mechanical or other form.
The units described as separate parts may or may not be physically separate, and parts displayed as units may be one physical unit or a plurality of physical units, that is, may be located in one place, or may be distributed in a plurality of different places. Some or all of the units can be selected according to actual needs to achieve the purpose of the solution of the embodiment.
In addition, functional units in the embodiments of the present application may be integrated into one processing unit, or each unit may exist alone physically, or two or more units are integrated into one unit. The integrated unit can be realized in a form of hardware, and can also be realized in a form of a software functional unit.
The integrated unit, if implemented in the form of a software functional unit and sold or used as a stand-alone product, may be stored in a readable storage medium. Based on such understanding, the technical solutions of the embodiments of the present application may be essentially or partially contributed to by the prior art, or all or part of the technical solutions may be embodied in the form of a software product, where the software product is stored in a storage medium and includes several instructions to enable a device (which may be a single chip, a chip, or the like) or a processor (processor) to execute all or part of the steps of the methods described in the embodiments of the present application. And the aforementioned storage medium includes: various media capable of storing program codes, such as a usb disk, a removable hard disk, a read-only memory (ROM), a Random Access Memory (RAM), a magnetic disk, or an optical disk.
The above description is only for the specific embodiments of the present application, but the scope of the present application is not limited thereto, and any person skilled in the art can easily conceive of the changes or substitutions within the technical scope of the present application, and shall be covered by the scope of the present application. Therefore, the protection scope of the present application shall be subject to the protection scope of the claims.

Claims (32)

1. A control method based on voice wake-up is characterized in that the control method is applied to a first electronic device and a second electronic device, the wake-up words of the first electronic device and the second electronic device are the same, and the wake-up word is a first wake-up word; the method comprises the following steps:
the first electronic equipment determines that the collected user voice is matched with the first awakening language;
the first electronic equipment determines the intensity value of the user voice to be a first voice intensity value;
the first electronic equipment sends the first voice intensity value to a controller;
the second electronic equipment determines that the collected user voice is matched with the first awakening language;
the second electronic equipment determines the intensity value of the user voice to be a second voice intensity value;
the second electronic device sends the second voice intensity value to the controller;
the controller receives the first voice strength value and the second voice strength value;
the controller determines that the first voice intensity value is greater than the second voice intensity value;
the controller sends a first indication message to the first electronic device, wherein the first indication message is used for indicating the first electronic device to respond;
and the first electronic equipment responds to the first indication message and outputs a first awakening response word for responding.
2. The method according to claim 1, wherein the method is applied to a plurality of electronic devices, the plurality of electronic devices include the first electronic device, the second electronic device and a third electronic device, the plurality of electronic devices have the same wake-up language, and the wake-up language is the first wake-up language;
the method further comprises the following steps:
the third electronic equipment determines that the collected user voice is matched with the first awakening language;
the third electronic equipment determines the intensity value of the user voice to be a third voice intensity value;
the third electronic device sends the third voice intensity value to the controller;
wherein the controller receives the first voice strength value and the second voice strength value, including: the controller receives the first voice strength value, the second voice strength value and the third voice strength value;
wherein the controller determines that the first speech strength value is greater than the second speech strength value, including: the controller determines a maximum voice intensity value of the first, second, and third voice intensity values;
wherein the controller sends a first indication message to the first electronic device, including: the controller sends the first indication message to a target electronic device corresponding to the maximum voice intensity value in the plurality of electronic devices;
wherein, the first electronic device responds to the first indication message and outputs a first wake-up response message to respond, including: and the target electronic equipment responds to the first indication message and outputs the first awakening response words for responding.
3. The method according to claim 1 or 2, characterized in that the method further comprises:
when at least two voice intensity values in the plurality of voice intensity values are equal and are maximum values, if a target electronic device in the plurality of electronic devices has the maximum voice intensity value and the historical awakening times of the target electronic device in the plurality of electronic devices are the maximum, the controller sends the first indication message to the target electronic device; alternatively, the first and second electrodes may be,
when at least two of the voice intensity values are equal and are maximum values, if a target electronic device in the electronic devices has the maximum voice intensity value and the target electronic device in the electronic devices has responded last time, the controller sends the first indication message to the target electronic device.
4. The method of any of claims 1-3, wherein the first electronic device, in response to the first indication message, outputting a first wake-up answer response, comprises:
and the first electronic equipment responds to the first indication message, starts a voice engine and outputs the first awakening response message for responding.
5. The method according to any one of claims 1 to 4, further comprising:
the controller sends second indication information to the second electronic device, wherein the second indication information is used for indicating that the second electronic device does not respond.
6. The method according to any of claims 1 to 4, wherein the controller is provided at a routing side.
7. The method according to any one of claims 1 to 6, further comprising:
the first electronic device and the second electronic device are respectively in wireless connection and/or wired connection with the controller.
8. The method of any of claims 1-7, wherein the first electronic device determining the intensity value of the user speech to be a first intensity value comprises:
the first electronic equipment converts the collected user voice into an electric signal, and the first voice strength value is obtained according to the electric signal.
9. The method of any of claims 1-5, wherein the first electronic device sending the first voice intensity value to a controller comprises:
the first electronic equipment determines that the first voice intensity value is greater than or equal to a preset intensity threshold value;
the first electronic device sends the first voice intensity value to the controller.
10. The method of any of claims 1-7, wherein prior to the first electronic device sending the first speech strength value to a controller, the method further comprises:
the first electronic equipment performs user identity authentication according to the collected user voice and preset voiceprint characteristic information;
wherein the first electronic device sends the first voice intensity value to a controller, including:
and under the condition that the first electronic equipment confirms that the user identity authentication is successful according to the collected user voice and the collected user voice is matched with the first awakening language, the first electronic equipment sends the first voice intensity value to the controller.
11. The method of claim 10,
the preset voiceprint feature information is stored in each of the plurality of electronic devices, the cloud server and/or the controller.
12. A control method based on voice wake-up is applied to electronic equipment, and is characterized in that the method comprises the following steps:
collecting user voice;
if the user voice is matched with a first awakening language preset in the electronic equipment, sending a target voice intensity value corresponding to the user voice to a controller, so that the controller judges whether the electronic equipment meets a preset awakening response condition or not based on the target voice intensity value;
when the electronic equipment meets a preset awakening response condition, receiving a first indication message sent by the controller;
responding to the first indication message, and outputting a first awakening response word for responding;
wherein the preset wake-up response condition includes: when the target voice intensity value in the plurality of voice intensity values sent by the plurality of pieces of electronic equipment received by the controller is the maximum, the controller instructs the electronic equipment to output the first awakening response word for responding; the awakening words of the electronic devices are the same.
13. The method of claim 12, wherein the preset wake response condition further comprises:
when at least two voice intensity values in the plurality of voice intensity values are equal and are maximum values, if the historical awakening times of the electronic equipment in the plurality of electronic equipment are the maximum, the controller instructs the electronic equipment to start a voice engine and respond; alternatively, the first and second electrodes may be,
when at least two of the voice intensity values are equal and are maximum values, if the electronic device in the electronic devices has responded for the last time, the controller instructs the electronic device to start a voice engine and respond.
14. The method according to claim 12 or 13, wherein the controller is provided at a routing end, and the controller establishes a wireless connection and/or a wired connection with the electronic device.
15. The control method according to any one of claims 12 to 14, wherein outputting a first wake-up response in response to the first indication message comprises:
and responding to the first indication message, starting a voice engine, and outputting the first awakening response message for responding.
16. The method of any of claims 12 to 15, wherein after said capturing user speech, the method further comprises:
and converting the user voice into a target electric signal, and obtaining the target voice intensity value according to the target electric signal.
17. The method according to any one of claims 12 to 16, wherein said sending a target speech intensity value corresponding to the user speech to a controller comprises:
and sending a target voice intensity value corresponding to the user voice to the controller under the condition that the target voice intensity value is greater than or equal to a preset intensity threshold value.
18. The method according to any one of claims 12 to 17, wherein said sending a target speech strength value corresponding to the user speech to a controller comprises:
and when the user identity authentication is determined to be successful according to the user voice and the preset voiceprint characteristic information and the user voice is matched with the first awakening language, sending a target voice intensity value corresponding to the user voice to the controller.
19. The method of claim 18, wherein the determining that the user identity authentication is successful according to the user voice and preset voiceprint feature information, and the user voice is matched with the first wake-up word, comprises:
performing user identity authentication according to the user voice and preset voiceprint characteristic information;
if the identity authentication is successful, comparing the user voice with the first awakening language;
and if the text information of the user voice contains the first awakening language or is consistent with the first awakening language, determining that the user voice is matched with the first awakening language.
20. The method of claim 18 or 19,
the preset voiceprint feature information is stored in each of the plurality of electronic devices, a cloud server and/or the controller.
21. The method according to any one of claims 12 to 20, wherein said sending a target speech intensity value corresponding to the user speech to a controller comprises:
and sending the equipment identifier of the electronic equipment and the target voice strength value to the controller, wherein the target voice strength value has a corresponding relation with the equipment identifier.
22. A control method based on voice wake-up is applied to a controller, and is characterized in that the method comprises the following steps:
if the voice intensity values sent by the electronic devices are received within a preset time length, comparing the voice intensity values, wherein the voice intensity values are corresponding to the voice intensity values when the user voice collected by each electronic device is matched with a preset awakening language, each electronic device corresponds to one voice intensity value, and the preset awakening languages of the electronic devices are the same;
and sending a first indication message to a target electronic device with the maximum voice intensity value in the plurality of electronic devices, wherein the first indication message is used for indicating the target electronic device to output a wake-up response message for responding.
23. The method of claim 22, further comprising:
when at least two voice intensity values in the plurality of voice intensity values are equal and are maximum values, if the controller judges that the historical awakening times of the electronic equipment in the plurality of electronic equipment are the most, indicating the electronic equipment to start a voice engine and respond; alternatively, the first and second electrodes may be,
when at least two of the voice intensity values are equal and are maximum values, if the controller judges that the electronic equipment in the electronic equipment has responded for the last time, the electronic equipment is instructed to start a voice engine and respond.
24. The method according to claim 22 or 23, wherein the controller is provided at a routing end, and the controller establishes a wireless connection and/or a wired connection with the plurality of electronic devices, respectively.
25. The method of any one of claims 22 to 24, further comprising:
the controller sends second indication information to other electronic devices except the target electronic device in the plurality of electronic devices, wherein the second indication information is used for indicating that the other electronic devices do not respond.
26. The method according to any one of claims 22 to 25, wherein said sending a first indication message to a target electronic device of the plurality of electronic devices having a maximum speech intensity value comprises:
if the target electronic device in the plurality of electronic devices has the maximum voice intensity value and the distance value between the target electronic device and the user in the plurality of electronic devices is the minimum, the controller sends the first indication message to the target electronic device.
27. The method according to any one of claims 22 to 26, wherein receiving the voice intensity values sent by the plurality of electronic devices within the preset time period comprises:
receiving device identifications and voice intensity values sent by the electronic devices within a preset time, wherein the voice intensity values have a corresponding relation with the device identifications, and each electronic device corresponds to one device identification;
wherein the sending the first indication message to the target electronic device with the largest voice intensity value among the plurality of electronic devices comprises:
if the target electronic device in the plurality of electronic devices has the maximum voice intensity value, the controller sends the first indication message to the target electronic device according to the device identifier of the target electronic device.
28. An electronic device comprising a processor coupled with a memory, the processor being configured to execute computer programs or instructions stored in the memory to cause the electronic device to implement the method of any of claims 12-21.
29. A controller comprising a processor coupled with a memory, the processor being configured to execute a computer program or instructions stored in the memory to cause the controller to implement the method of any of claims 22 to 27.
30. A communication system comprising at least one electronic device according to claim 28 and a controller according to claim 29.
31. A computer-readable storage medium, characterized in that it stores a computer program for implementing the method of any one of claims 12 to 21 when the computer program runs on an electronic device and/or for implementing the method of any one of claims 22 to 27 when the computer program runs on a controller.
32. A chip system, wherein the chip system is coupled with a memory, and the chip system is configured to read and execute a computer program stored in the memory to implement the method according to any one of claims 12 to 21, and/or the method according to any one of claims 22 to 27.
CN202110611312.6A 2021-06-01 2021-06-01 Control method based on voice awakening, electronic equipment and controller Active CN113421559B (en)

Priority Applications (1)

Application Number Priority Date Filing Date Title
CN202110611312.6A CN113421559B (en) 2021-06-01 2021-06-01 Control method based on voice awakening, electronic equipment and controller

Applications Claiming Priority (1)

Application Number Priority Date Filing Date Title
CN202110611312.6A CN113421559B (en) 2021-06-01 2021-06-01 Control method based on voice awakening, electronic equipment and controller

Publications (2)

Publication Number Publication Date
CN113421559A true CN113421559A (en) 2021-09-21
CN113421559B CN113421559B (en) 2023-04-07

Family

ID=77713650

Family Applications (1)

Application Number Title Priority Date Filing Date
CN202110611312.6A Active CN113421559B (en) 2021-06-01 2021-06-01 Control method based on voice awakening, electronic equipment and controller

Country Status (1)

Country Link
CN (1) CN113421559B (en)

Cited By (2)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
WO2023097524A1 (en) * 2021-11-30 2023-06-08 华为技术有限公司 Device control method and apparatus
CN117437919A (en) * 2023-12-18 2024-01-23 美智纵横科技有限责任公司 Voice interaction method, device, electronic equipment and readable storage medium

Citations (6)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN106356060A (en) * 2016-08-23 2017-01-25 北京小米移动软件有限公司 Voice communication method and device
CN110223684A (en) * 2019-05-16 2019-09-10 华为技术有限公司 A kind of voice awakening method and equipment
CN110322878A (en) * 2019-07-01 2019-10-11 华为技术有限公司 A kind of sound control method, electronic equipment and system
CN110459221A (en) * 2019-08-27 2019-11-15 苏州思必驰信息科技有限公司 The method and apparatus of more equipment collaboration interactive voices
CN111091828A (en) * 2019-12-31 2020-05-01 华为技术有限公司 Voice wake-up method, device and system
CN111369988A (en) * 2018-12-26 2020-07-03 华为终端有限公司 Voice awakening method and electronic equipment

Patent Citations (7)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN106356060A (en) * 2016-08-23 2017-01-25 北京小米移动软件有限公司 Voice communication method and device
CN111369988A (en) * 2018-12-26 2020-07-03 华为终端有限公司 Voice awakening method and electronic equipment
CN110223684A (en) * 2019-05-16 2019-09-10 华为技术有限公司 A kind of voice awakening method and equipment
CN110322878A (en) * 2019-07-01 2019-10-11 华为技术有限公司 A kind of sound control method, electronic equipment and system
CN112289313A (en) * 2019-07-01 2021-01-29 华为技术有限公司 Voice control method, electronic equipment and system
CN110459221A (en) * 2019-08-27 2019-11-15 苏州思必驰信息科技有限公司 The method and apparatus of more equipment collaboration interactive voices
CN111091828A (en) * 2019-12-31 2020-05-01 华为技术有限公司 Voice wake-up method, device and system

Cited By (3)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
WO2023097524A1 (en) * 2021-11-30 2023-06-08 华为技术有限公司 Device control method and apparatus
CN117437919A (en) * 2023-12-18 2024-01-23 美智纵横科技有限责任公司 Voice interaction method, device, electronic equipment and readable storage medium
CN117437919B (en) * 2023-12-18 2024-03-01 美智纵横科技有限责任公司 Voice interaction method, device, electronic equipment and readable storage medium

Also Published As

Publication number Publication date
CN113421559B (en) 2023-04-07

Similar Documents

Publication Publication Date Title
WO2018147687A1 (en) Method and apparatus for managing voice-based interaction in internet of things network system
CN113421559B (en) Control method based on voice awakening, electronic equipment and controller
CN107992728B (en) Face verification method and device
CN103456306A (en) Method and apparatus for executing voice command in electronic device
CN107346397B (en) Information processing method and related product
EP3852102B1 (en) Voice assistant proxy for voice assistant servers
CN109672775B (en) Method, device and terminal for adjusting awakening sensitivity
CN108156537B (en) Remote operation method of mobile terminal and mobile terminal
CN114550343A (en) Intelligent lock control method and device based on UWB, medium and electronic equipment
CN106604277B (en) Wireless network access method and mobile terminal
CN110175016A (en) Start the method for voice assistant and the electronic device with voice assistant
CN112230877A (en) Voice operation method and device, storage medium and electronic equipment
WO2019019837A1 (en) Biological identification method and related product
CN115174418A (en) Communication environment safety early warning method and device, electronic equipment and storage medium
CN110418311B (en) Interconnection method and device based on multiple terminals and terminal
CN109151779B (en) Neighbor Awareness Network (NAN) access method and related product
US20240056442A1 (en) Apparatus and methods for continuous real-time authentication
CN108600559A (en) Control method, device, storage medium and the electronic equipment of silent mode
CN106095566B (en) Response control method and mobile terminal
CN116861198A (en) Data processing method, device and storage medium
US9843683B2 (en) Configuration method for sound collection system for meeting using terminals and server apparatus
CN107025389B (en) Fingerprint input method and terminal
CN108702410B (en) Contextual model control method and mobile terminal
CN106572463B (en) Access control method and mobile terminal
CN111653021B (en) WiFi-Aware network-based user identity authentication method and device and storage medium

Legal Events

Date Code Title Description
PB01 Publication
PB01 Publication
SE01 Entry into force of request for substantive examination
SE01 Entry into force of request for substantive examination
GR01 Patent grant
GR01 Patent grant