WO2023005844A1 - Device wake-up method, related apparatus, and communication system - Google Patents

Device wake-up method, related apparatus, and communication system

Info

Publication number
WO2023005844A1
Authority
WO
WIPO (PCT)
Prior art keywords
wake
electronic device
image
smart glasses
instruction
Prior art date
Application number
PCT/CN2022/107411
Other languages
French (fr)
Chinese (zh)
Inventor
闻琛
李明雨
赵伟
冯晓兵
陈航
曾旺
刘杰
Original Assignee
华为技术有限公司
Priority date (The priority date is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the date listed.)
Filing date
Publication date
Application filed by 华为技术有限公司
Publication of WO2023005844A1

Classifications

    • G PHYSICS
    • G05 CONTROLLING; REGULATING
    • G05B CONTROL OR REGULATING SYSTEMS IN GENERAL; FUNCTIONAL ELEMENTS OF SUCH SYSTEMS; MONITORING OR TESTING ARRANGEMENTS FOR SUCH SYSTEMS OR ELEMENTS
    • G05B15/00 Systems controlled by a computer
    • G05B15/02 Systems controlled by a computer electric
    • G PHYSICS
    • G06 COMPUTING; CALCULATING OR COUNTING
    • G06T IMAGE DATA PROCESSING OR GENERATION, IN GENERAL
    • G06T7/00 Image analysis
    • G06T7/70 Determining position or orientation of objects or cameras
    • G PHYSICS
    • G10 MUSICAL INSTRUMENTS; ACOUSTICS
    • G10L SPEECH ANALYSIS OR SYNTHESIS; SPEECH RECOGNITION; SPEECH OR VOICE PROCESSING; SPEECH OR AUDIO CODING OR DECODING
    • G10L15/00 Speech recognition
    • G10L15/22 Procedures used during a speech recognition process, e.g. man-machine dialogue
    • H ELECTRICITY
    • H04 ELECTRIC COMMUNICATION TECHNIQUE
    • H04N PICTORIAL COMMUNICATION, e.g. TELEVISION
    • H04N23/00 Cameras or camera modules comprising electronic image sensors; Control thereof
    • H04N23/60 Control of cameras or camera modules
    • H04N23/61 Control of cameras or camera modules based on recognised objects

Definitions

  • the present application relates to the field of terminal technology, and in particular to a method for waking up a device, a related device and a communication system.
  • the present application provides a method for waking up a device, a related device, and a communication system, which can effectively reduce false wake-ups and give users a better experience when using the voice interaction function of electronic devices.
  • the present application provides a system for waking up a device.
  • the device wake-up system includes an image acquisition device and multiple electronic devices.
  • the image acquisition device can be used to detect the first user input, and when the first user input is detected, acquire the first image.
  • the image acquisition device can also be used to select a target electronic device included in the first image from multiple electronic devices, and send a wake-up instruction to the target electronic device; the wake-up instruction is used to trigger the target electronic device to enter a wake-up state.
  • the target electronic device may be configured to enter a wake-up state in response to the received wake-up instruction.
  • the above-mentioned multiple electronic devices may be electronic devices with a voice interaction function, and the voice interaction function is turned on. Having a voice interaction function may mean that the electronic device can recognize a user's voice command and perform an operation corresponding to the voice command.
  • the image acquisition device can determine the target electronic device through the image collected by itself, and instruct the target electronic device to enter the wake-up state.
  • the above-mentioned target electronic device is the electronic device that the image acquisition device determines the user wishes to wake up. That is to say, the user can use the image acquisition device to wake up the electronic device the user wants to wake up.
  • the target electronic device may enter a wake-up state to respond to the user's voice command. In this way, false wakeups can be reduced, and a better user experience can be brought to the user when using the voice interaction function of the electronic device.
  • the image acquisition device may be smart glasses.
  • the images collected by the smart glasses are the images within the user's field of vision.
  • the smart glasses can more accurately determine which electronic device the user wishes to wake up based on the images collected by themselves.
  • the above-mentioned first user input may be voice input including a wake-up word.
  • the above-mentioned first user input may be a user operation acting on the first position of the image capture device.
  • the image acquisition device may perform image acquisition to obtain the above-mentioned first image, and determine the target electronic device according to the above-mentioned first image.
  • the above-mentioned multiple electronic devices can also monitor the wake-up word.
  • the above-mentioned multiple electronic devices may detect whether the above-mentioned image acquisition device exists in the device wake-up system.
  • the electronic devices that exist in the above device wake-up system may exist in the local device list.
  • the local device list may be stored in one or more electronic devices in the device wake-up system.
  • the local device list may also be stored in a cloud server. All electronic devices in the local device list can obtain the local device list. That is, all the electronic devices in the local device list can determine which electronic devices are contained in the above-mentioned device wake-up system.
  • the plurality of electronic devices may determine whether the image capture device exists in the wake-up system of the device by determining whether the image capture device is included in the local device list. If the above-mentioned local device list includes the above-mentioned image collection device, the above-mentioned multiple electronic devices may determine that the above-mentioned image collection device exists in the above-mentioned device wake-up system. If it is determined that the image capture device exists in the wake-up system of the device, the multiple electronic devices may further determine whether the image capture device is in a wearing state. If it is determined that the image capture device is in the wearing state, the above-mentioned multiple electronic devices may wait for a wake-up instruction instead of entering the wake-up state immediately.
  • When an electronic device among the plurality of electronic devices receives a wake-up instruction, this electronic device may enter a wake-up state. While waiting for the wake-up instruction, the plurality of electronic devices may not respond to any wake-up words, voice commands, and the like that they hear.
  • the above-mentioned image acquisition device may be smart glasses.
  • In some scenarios, before receiving the above-mentioned wake-up instruction, the target electronic device hears the wake-up word but no voice command (for example, when the user speaks only the wake-up word). Then, when entering the wake-up state, the target electronic device may output a voice response to the wake-up word.
  • the voice response to the wake-up word may be, for example, "I am".
  • In other scenarios, before receiving the above-mentioned wake-up instruction, the target electronic device hears neither the wake-up word nor a voice command (for example, when the user does not say the wake-up word and instead wakes up the device through the user operation acting on the above-mentioned first position).
  • the target electronic device may also output a voice response to the wake-up word. That is to say, when entering the wake-up state but no voice command is heard, the target electronic device can output a voice response to the above-mentioned wake-up word to remind the user that the target electronic device has entered the wake-up state. In this way, the user can know which electronic device is awakened, and then instructs the electronic device in the awakened state to perform corresponding operations through voice instructions.
  • the target electronic device can recognize the voice command and execute the user operation corresponding to the voice command.
  • If the target electronic device hears a voice command before receiving the wake-up instruction, the target electronic device may, after entering the wake-up state, directly output a voice response to the voice command and execute the operation corresponding to the voice command. For example, in the scenario where the user speaks the wake-up word and the voice command in one utterance, or in the scenario where the user speaks the voice command while or before performing the user operation on the above-mentioned first position, the target electronic device may have heard the voice command before receiving the wake-up instruction. The target electronic device may detect whether the sound signal collected during a first time period before and after receiving the wake-up instruction contains the voice command. In this way, when the user speaks the voice command before the target electronic device receives the wake-up instruction, the target electronic device can avoid failing to respond merely because it did not detect the user's voice command.
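The buffered-command check described above can be sketched roughly as follows. This is only an illustration under stated assumptions: the `Utterance` record, the function name `find_command_in_window`, and the symmetric window of `window_s` seconds around the arrival of the wake-up instruction are all hypothetical; the source only says that a "first time period" before and after the wake-up instruction is inspected.

```python
from dataclasses import dataclass
from typing import List, Optional


@dataclass
class Utterance:
    """A recognized voice command and the time (in seconds) it was heard."""
    text: str
    heard_at: float


def find_command_in_window(
    buffer: List[Utterance], wake_received_at: float, window_s: float
) -> Optional[str]:
    """Return the latest command heard within window_s seconds before or
    after the wake-up instruction arrived, or None if there is none.

    The symmetric window is an assumption made for this sketch.
    """
    start = wake_received_at - window_s
    end = wake_received_at + window_s
    candidates = [u for u in buffer if start <= u.heard_at <= end]
    if not candidates:
        return None
    return max(candidates, key=lambda u: u.heard_at).text
```

If the function returns `None`, the device would fall back to the plain wake-up word response; otherwise it can act on the returned command directly.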
  • If the above-mentioned multiple electronic devices determine that the above-mentioned image capture device does not exist in the device wake-up system (that is, the image capture device is not included in the local device list), or determine that the image capture device exists in the device wake-up system but is not in the wearing state, the multiple electronic devices can negotiate to select one electronic device.
  • An electronic device selected through the above negotiation may enter a wake-up state. Other electronic devices may not enter the wake-up state.
  • the plurality of electronic devices may compare the strength of the sound signal containing the wake-up word that each of them received, and negotiate to select the electronic device that received that signal with the highest strength. The selected electronic device may enter the wake-up state.
  • these multiple electronic devices can determine whether the user will wake up the device through the image acquisition device.
  • the plurality of electronic devices may wait for a wake-up instruction, and enter a wake-up state after receiving the wake-up instruction. In this way, the plurality of electronic devices will not all enter the wake-up state after hearing the wake-up word, which would otherwise cause false wake-ups.
  • the electronic device that has received the wake-up instruction is most likely to be the electronic device that the user wishes to wake up. This can bring a better user experience for the user to use the voice interaction function of the electronic device.
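The decision described above (wait for a wake-up instruction when a worn image acquisition device is present, otherwise negotiate by wake-up word signal strength) could look roughly like the sketch below. The function names, the device identifiers, and the representation of signal strength as a plain number are assumptions for illustration; the source does not specify how the negotiation messages are exchanged.

```python
from typing import Dict, List, Optional


def should_wait_for_wake_instruction(
    local_device_list: List[str], glasses_id: str, glasses_worn: bool
) -> bool:
    """Wait only if the image acquisition device is in the local device
    list and is currently being worn."""
    return glasses_id in local_device_list and glasses_worn


def negotiate_by_signal_strength(strengths: Dict[str, float]) -> Optional[str]:
    """Pick the device that received the wake-up word with the highest
    signal strength. `strengths` maps a (hypothetical) device id to the
    strength it measured."""
    if not strengths:
        return None
    return max(strengths, key=strengths.get)
```

In this sketch every device would evaluate `should_wait_for_wake_instruction` first and only take part in the negotiation when it returns `False`.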
  • the above-mentioned image acquisition device may be smart glasses.
  • the above-mentioned first position may be a position on the temple of the smart glasses.
  • the specific implementation by which the above-mentioned image acquisition device selects, from multiple electronic devices, the target electronic device contained in the first image may be: determining at least one of the type, recognition accuracy, and viewing angle deviation of the electronic device contained in the first image; the recognition accuracy indicates the accuracy of the recognition result for the type of electronic device contained in the first image, and the viewing angle deviation indicates the distance between the position of the electronic device in the first image and the center of the first image;
  • the priority is determined according to one or more of the type, recognition accuracy, and viewing angle deviation; the rank of the electronic device's type in a wake-up ordering determined by type is positively correlated with the priority of the electronic device, the recognition accuracy of the electronic device is positively correlated with the priority of the electronic device, and the viewing angle deviation of the electronic device is negatively correlated with the priority of the electronic device.
  • For example, if the type of the electronic device and the values of characteristics such as the recognition accuracy remain unchanged, the smaller the viewing angle deviation of the electronic device, the higher its priority.
  • the electronic device that the user wishes to wake up can be accurately determined according to one or more of the above types, recognition accuracy and viewing angle deviation. In this way, false wakeups can be effectively reduced, and better user experience can be brought to the user when using the voice interaction function of the electronic device.
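One way to picture the priority rule above is a simple weighted score. The sketch below is an assumption-laden illustration, not the claimed method: the `TYPE_RANK` table, the linear scoring formula, the weights, and the normalization of every feature to the range 0 to 1 are all hypothetical. The source only states the direction of each correlation.

```python
from dataclasses import dataclass
from typing import List, Optional

# Hypothetical type-based wake-up ordering: a higher rank means the type is preferred.
TYPE_RANK = {"tv": 3, "speaker": 2, "lamp": 1}


@dataclass
class DetectedDevice:
    device_id: str
    device_type: str
    recognition_accuracy: float  # 0..1, confidence of the type recognition
    view_deviation: float        # 0..1, normalized distance from the image center


def priority(d: DetectedDevice, w_type: float = 1.0,
             w_acc: float = 1.0, w_dev: float = 1.0) -> float:
    """Higher type rank and accuracy raise the score; larger deviation lowers it."""
    type_score = TYPE_RANK.get(d.device_type, 0) / max(TYPE_RANK.values())
    return w_type * type_score + w_acc * d.recognition_accuracy - w_dev * d.view_deviation


def select_target(devices: List[DetectedDevice]) -> Optional[DetectedDevice]:
    """Return the detected device with the highest priority, if any."""
    if not devices:
        return None
    return max(devices, key=priority)
```

With equal type and recognition accuracy, the device closer to the image center always scores higher in this sketch, which matches the example given in the text.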
  • the present application provides a device wake-up system.
  • the device wake-up system may include an image acquisition device and a processing device.
  • the image acquisition device can be used to detect the first user input, and when the first user input is detected, acquire the first image.
  • the image acquisition device may also be configured to send a first instruction to the processing device, the first instruction may include the first image, and the first instruction may be used to instruct the processing device to select a target electronic device included in the first image from multiple electronic devices.
  • the processing device may be configured to respond to the first instruction, select a target electronic device included in the first image from multiple electronic devices, and send a wake-up instruction to the target electronic device.
  • the wake-up instruction can be used to trigger the target electronic device to enter the wake-up state.
  • the above-mentioned multiple electronic devices may be electronic devices with a voice interaction function, and the voice interaction function is turned on. Having a voice interaction function may mean that the electronic device can recognize a user's voice command and perform an operation corresponding to the voice command.
  • the image acquisition device can collect images when the user needs to wake up the device, and send the acquired images to the processing device.
  • the processing device may determine the target electronic device through the image from the image acquisition device, and instruct the target electronic device to enter a wake-up state.
  • the above-mentioned target electronic device is the electronic device that is determined, by means of the image acquisition device, to be the one the user wishes to wake up. That is to say, the user can wake up the electronic device the user wants to wake up by means of the image acquisition device and the processing device.
  • the above-mentioned target electronic device may enter a wake-up state to respond to the user's voice command. In this way, false wakeups can be reduced, and a better user experience can be brought to the user when using the voice interaction function of the electronic device.
  • the image acquisition device does not need to perform the operation of determining the target electronic device, which can save the power consumption of the image acquisition device.
  • the aforementioned processing device may be an electronic device with strong computing power, such as a mobile phone, a cloud server, and the like.
  • the above-mentioned image acquisition device may be smart glasses.
  • the images collected by the smart glasses are the images within the user's field of vision.
  • the smart glasses can more accurately determine which electronic device the user wishes to wake up based on the images collected by themselves.
  • the first user input is a voice input including a wake-up word; or, the first user input is a user operation acting on the first position of the image capture device.
  • the image acquisition device may perform image acquisition to obtain the above-mentioned first image, and determine the target electronic device according to the above-mentioned first image.
  • the above-mentioned device wake-up system may also include the above-mentioned multiple electronic devices.
  • When the user speaks the wake-up word, in addition to the image acquisition device, the above-mentioned multiple electronic devices can also hear the wake-up word.
  • the above-mentioned multiple electronic devices may detect whether the above-mentioned image acquisition device exists in the device wake-up system.
  • the electronic devices that exist in the above device wake-up system may exist in the local device list.
  • the local device list may be stored in one or more electronic devices in the device wake-up system.
  • the local device list may also be stored in a cloud server. All electronic devices in the local device list can obtain the local device list. That is, all the electronic devices in the local device list can determine which electronic devices are contained in the above-mentioned device wake-up system.
  • the plurality of electronic devices may determine whether the image capture device exists in the wake-up system of the device by determining whether the image capture device is included in the local device list. If the above-mentioned local device list includes the above-mentioned image collection device, the above-mentioned multiple electronic devices may determine that the above-mentioned image collection device exists in the above-mentioned device wake-up system. If it is determined that the image capture device exists in the wake-up system of the device, the multiple electronic devices may further determine whether the image capture device is in a wearing state. If it is determined that the image capture device is in the wearing state, the above-mentioned multiple electronic devices may wait for a wake-up instruction instead of entering the wake-up state immediately.
  • When an electronic device among the plurality of electronic devices receives a wake-up instruction, this electronic device may enter a wake-up state. While waiting for the wake-up instruction, the plurality of electronic devices may not respond to any wake-up words, voice commands, and the like that they hear.
  • the multiple electronic devices can determine whether the user will wake up the device through the image acquisition device.
  • the plurality of electronic devices may wait for a wake-up instruction, and enter a wake-up state after receiving the wake-up instruction. In this way, the plurality of electronic devices will not all enter the wake-up state after hearing the wake-up word, which would otherwise cause false wake-ups.
  • the electronic device that has received the wake-up instruction is most likely to be the electronic device that the user wishes to wake up. This can bring a better user experience for the user to use the voice interaction function of the electronic device.
  • the specific method by which the above-mentioned processing device selects, from the plurality of electronic devices, the target electronic device included in the first image may be: determining at least one of the type, recognition accuracy, and viewing angle deviation of the electronic device included in the first image; the recognition accuracy indicates the accuracy of the recognition result for the type of electronic device contained in the first image, and the viewing angle deviation indicates the distance between the position of the electronic device in the first image and the center of the first image.
  • the priority is determined according to one or more of the type, recognition accuracy, and viewing angle deviation; the rank of the electronic device's type in a wake-up ordering determined by type is positively correlated with the priority of the electronic device, the recognition accuracy of the electronic device is positively correlated with the priority of the electronic device, and the viewing angle deviation of the electronic device is negatively correlated with the priority of the electronic device.
  • For example, if the type of the electronic device and the values of characteristics such as the recognition accuracy remain unchanged, the smaller the viewing angle deviation of the electronic device, the higher its priority.
  • the above-mentioned processing device may be one of the above-mentioned multiple electronic devices. That is, the above-mentioned processing device may be an electronic device with a voice interaction function enabled. If the processing device determines that it is the target electronic device according to the first image from the image acquisition device, the processing device may enter a wake-up state. That is, the processing device does not need to send a wake-up instruction.
  • the image acquisition device may identify the electronic device contained in the first image, and send information of the identified electronic device to the processing device.
  • the above-mentioned information of the electronic device may include one or more of the following: type, recognition accuracy, and viewing angle deviation.
  • the processing device may determine the priority of the electronic device included in the first image according to the information of the electronic device, and select the electronic device with the highest priority included in the first image from the local device list. The selected electronic device is the target electronic device.
  • the image acquisition device may determine the priority of the electronic devices contained in the first image, and send the priority to the processing device.
  • the processing device may, according to the priorities of the electronic devices contained in the first image, select the electronic device with the highest priority contained in the first image from the local device list.
  • the selected electronic device is the target electronic device.
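The processing-device side of this flow, including the case where the processing device is itself the target, might be sketched as follows. The function name `handle_first_instruction`, the dictionary of priorities keyed by device id, and the two callbacks are hypothetical stand-ins; the source does not define a message format or transport.

```python
from typing import Callable, Dict, List, Optional


def handle_first_instruction(
    priorities: Dict[str, float],
    local_device_list: List[str],
    self_id: str,
    send_wake_up: Callable[[str], None],
    enter_wake_state: Callable[[], None],
) -> Optional[str]:
    """Select the highest-priority device that is also in the local device list.

    If the processing device is itself the target, it enters the wake-up
    state directly and sends no wake-up instruction; otherwise it sends
    the wake-up instruction to the target.
    """
    known = {dev: p for dev, p in priorities.items() if dev in local_device_list}
    if not known:
        return None
    target = max(known, key=known.get)
    if target == self_id:
        enter_wake_state()
    else:
        send_wake_up(target)
    return target
```

Passing the send and wake actions as callbacks keeps the sketch independent of any particular network or device API.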
  • the present application provides a method for waking up a device.
  • a first image is acquired, a target electronic device included in the first image is selected from a plurality of electronic devices, and a wake-up instruction is sent to the target electronic device.
  • the wake-up instruction can be used to trigger the target electronic device to enter the wake-up state.
  • the method in the above third aspect may be executed by an image acquisition device.
  • the above-mentioned process of acquiring the first image may be: when the first user input is detected, the image acquisition device acquires the first image.
  • the aforementioned detection of the first user input may specifically be detection of a wake-up word, or detection of a user operation acting on the first position of the image capture device.
  • the above-mentioned image acquisition device may be smart glasses.
  • the image acquisition apparatus may determine whether the user needs to wake up the device by detecting the above-mentioned first user input.
  • the image acquisition device may perform image acquisition. That is, the above-mentioned first image may be acquired by the image acquisition device when it is determined that the user needs to wake up the device.
  • the image acquisition device can determine the target electronic device through the images it has collected, and instruct the target electronic device to enter the wake-up state.
  • the above-mentioned target electronic device is the electronic device that the image acquisition device determines the user wishes to wake up. That is to say, the user can use the image acquisition device to wake up the electronic device the user wants to wake up.
  • the target electronic device may enter a wake-up state to respond to the user's voice command. In this way, false wakeups can be reduced, and a better user experience can be brought to the user when using the voice interaction function of the electronic device.
  • the method in the third aspect above may be executed by a processing device.
  • the above-mentioned process of acquiring the first image may be: receiving a first instruction from an image acquisition device.
  • the first instruction may include a first image captured by an image capture device.
  • the first instruction may be used to instruct the processing device to select a target electronic device included in the first image from multiple electronic devices.
  • the processing device can determine the target electronic device according to the image from the image acquisition device, and instruct the target electronic device to enter the wake-up state.
  • the above-mentioned target electronic device is the electronic device that is determined, by means of the image acquisition device, to be the one the user wishes to wake up. That is to say, in the scenario where the user speaks the wake-up word to wake up the device, the target electronic device can enter the wake-up state, while other electronic devices with the voice interaction function enabled will not enter the wake-up state. In this way, false wake-ups can be reduced, and a better user experience can be brought to the user when using the voice interaction function of the electronic device.
  • the specific method for selecting the target electronic device contained in the first image from the above-mentioned multiple electronic devices may be: determining at least one of the type, recognition accuracy, and viewing angle deviation of the electronic device contained in the first image; the recognition accuracy indicates the accuracy of the recognition result for the type of electronic device contained in the first image, and the viewing angle deviation indicates the distance between the position of the electronic device in the first image and the center of the first image.
  • the priority is determined according to one or more of the type, recognition accuracy, and viewing angle deviation; the rank of the electronic device's type in a wake-up ordering determined by type is positively correlated with the priority of the electronic device, the recognition accuracy of the electronic device is positively correlated with the priority of the electronic device, and the viewing angle deviation of the electronic device is negatively correlated with the priority of the electronic device.
  • For example, if the type of the electronic device and the values of characteristics such as the recognition accuracy remain unchanged, the smaller the viewing angle deviation of the electronic device, the higher its priority.
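The viewing angle deviation is described only as the distance between the device's position in the first image and the image center. A minimal sketch of one possible computation follows; the bounding-box input, the function name `view_deviation`, and the normalization by the image half-diagonal are assumptions added here so that the value falls between 0 and 1.

```python
import math
from typing import Tuple


def view_deviation(
    box: Tuple[float, float, float, float], image_size: Tuple[float, float]
) -> float:
    """Distance from the center of a detected device's bounding box to the
    image center, divided by the image half-diagonal.

    box is (x_min, y_min, x_max, y_max) in pixels; image_size is
    (width, height) in pixels. The normalization is an assumption.
    """
    x_min, y_min, x_max, y_max = box
    width, height = image_size
    box_cx, box_cy = (x_min + x_max) / 2, (y_min + y_max) / 2
    img_cx, img_cy = width / 2, height / 2
    half_diagonal = math.hypot(img_cx, img_cy)
    return math.hypot(box_cx - img_cx, box_cy - img_cy) / half_diagonal
```

A device whose bounding box is centered in the image yields 0, and the value grows as the device moves toward a corner of the frame.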
  • the present application provides a method for waking up a device.
  • the image capture device can capture a first image.
  • the image acquisition device may send a first instruction to the processing device.
  • the first instruction may include a first image.
  • the first instruction may be used to instruct the processing device to select a target electronic device included in the first image from multiple electronic devices.
  • the target electronic device may be an object to which the processing device sends a wake-up instruction.
  • the wake-up instruction can be used to trigger the target electronic device to enter the wake-up state.
  • the above-mentioned first user input may be a voice input including a wake-up word, or may be a user operation acting on the first position of the above-mentioned image acquisition device.
  • the above-mentioned image acquisition device may be smart glasses.
  • the image acquisition device may perform image acquisition when the user needs to wake up the device, and send the acquired image to the processing device.
  • the image capture device may instruct the processing device to determine the target electronic device.
  • the above-mentioned target electronic device may enter a wake-up state to respond to the user's voice command.
  • electronic devices with other voice interaction functions enabled may not enter the wake-up state. In this way, false wakeups can be reduced, and a better user experience can be brought to the user when using the voice interaction function of the electronic device.
  • the image acquisition device does not need to perform the operation of determining the target electronic device, which can save the power consumption of the image acquisition device.
  • the aforementioned processing device may be an electronic device with strong computing power, such as a mobile phone, a cloud server, and the like.
  • the present application provides a method for waking up a device.
  • the first electronic device can monitor the wake-up word.
  • the first electronic device may detect whether there are smart glasses in the device wake-up system, and whether the smart glasses are in a wearing state. If there are smart glasses in the device wake-up system, and the smart glasses are in a wearing state, the first electronic device may wait to receive a wake-up instruction.
  • the wake-up instruction can be used to trigger the first electronic device to enter the wake-up state.
  • the first electronic device enters into a wake-up state upon receiving the wake-up instruction.
  • the electronic devices that exist in the above device wake-up system may exist in the local device list.
  • the local device list may be stored in one or more electronic devices in the device wake-up system.
  • the local device list can also be stored in the cloud server. All electronic devices in the local device list can obtain the local device list. That is, all the electronic devices in the local device list can determine which electronic devices are contained in the above-mentioned device wake-up system.
  • the first electronic device may determine whether smart glasses exist in the device wake-up system by determining whether the smart glasses are included in the local device list. If the above-mentioned local device list includes smart glasses, the first electronic device may determine that there are smart glasses in the device wake-up system. If it is determined that there are smart glasses in the device wake-up system, the first electronic device may further determine whether the smart glasses are in a wearing state. If it is determined that the smart glasses are in the wearing state, the first electronic device may wait for a wake-up instruction without immediately entering the wake-up state. When the first electronic device receives the wake-up instruction, the first electronic device may enter the wake-up state. While waiting for the wake-up instruction, the first electronic device may not respond to any wake-up words, voice commands, and the like that it hears.
  • the above-mentioned first electronic device is determined as the target electronic device and receives a wake-up instruction.
  • the target electronic device detects the wake-up word but does not detect a voice command (for example, in a scenario where the user speaks only the wake-up word). Then, when entering the wake-up state, the above-mentioned target electronic device may output a voice response to the above-mentioned wake-up word.
  • the voice response to the wake word may be, for example, "I am".
  • before receiving the above-mentioned wake-up instruction, the target electronic device detects neither the wake-up word nor a voice command (for example, in a scenario where the user does not say the wake-up word and device wake-up is instead triggered by the user operation acting on the above-mentioned first position). Then, when entering the wake-up state, the target electronic device may also output a voice response to the wake-up word. That is to say, when it enters the wake-up state without having detected a voice command, the target electronic device can output a voice response to the above-mentioned wake-up word to remind the user that the target electronic device has entered the wake-up state.
  • the user can know which electronic device is awakened, and then instructs the electronic device in the awakened state to perform corresponding operations through voice instructions.
  • the target electronic device can recognize the voice command and execute the user operation corresponding to the voice command.
  • the above-mentioned first electronic device is determined as the target electronic device and receives a wake-up instruction. If the target electronic device detects the voice command before receiving the wake-up instruction, then the target electronic device can directly output a voice response to the voice command after entering the wake-up state, and execute the operation corresponding to the voice command. For example, in the scenario where the user speaks the wake-up word and the voice command together, or in the scenario where the user speaks the voice command while or before performing the user operation on the above-mentioned first position, the target electronic device may have detected the voice command before receiving the wake-up instruction.
  • the target electronic device may detect whether the sound signal collected during the first time period before receiving the wake-up instruction, and the sound signal collected after receiving the wake-up instruction, contain the voice instruction. This avoids the situation in which the user speaks the voice command before the target electronic device receives the wake-up instruction and the target electronic device fails to respond because it did not detect the user's voice command.
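  • The choice between answering the wake-up word and executing a voice command can be sketched as follows. This is a hedged illustration only: `Utterance`, `respond_on_wake`, and the default length `first_period_s` of the "first time period" are hypothetical, and real speech detection is replaced by a list of already-recognized commands.

```python
from dataclasses import dataclass
from typing import List

WAKE_WORD_RESPONSE = "I am"  # example response quoted in the description


@dataclass
class Utterance:
    timestamp: float  # seconds; when the voice command was detected
    command: str      # recognized voice command text


def respond_on_wake(instruction_time: float,
                    heard: List[Utterance],
                    first_period_s: float = 3.0) -> str:
    """Decide what the target device outputs when it enters the wake-up state.

    Voice commands detected no earlier than `first_period_s` seconds before
    the wake-up instruction (or at any time after it) are acted on.
    If none is found, the device answers as if only the wake-up word was said.
    """
    earliest = instruction_time - first_period_s
    for utterance in heard:
        if utterance.timestamp >= earliest:
            return f"execute: {utterance.command}"
    return WAKE_WORD_RESPONSE
```

  • With the assumed 3-second period, a command spoken 1.5 s before the wake-up instruction is executed, while one spoken 8 s earlier is ignored and the device simply answers the wake-up word.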
  • the first electronic device and other electronic devices whose voice interaction function is enabled may negotiate to select an electronic device.
  • An electronic device selected through the above negotiation may enter a wake-up state. Other electronic devices may not enter the wake-up state.
  • the first electronic device and other electronic devices with the voice interaction function enabled may negotiate, according to the strength of the received sound signal containing the wake-up word, to select the electronic device that receives the sound signal containing the wake-up word with the highest strength.
  • The electronic device that receives the sound signal containing the wake-up word with the highest strength may enter the wake-up state.
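  • A minimal sketch of this strength-based negotiation is given below. It assumes that every device has already shared its measured signal strength with the others; the function name, the dictionary layout, and the tie-breaking rule (lowest device identifier) are illustrative assumptions that do not come from the description.

```python
from typing import Dict, Optional


def select_device_to_wake(strengths: Dict[str, float]) -> Optional[str]:
    """Pick the device that received the wake-up word with the highest strength.

    `strengths` maps a device identifier to the strength of the sound signal
    containing the wake-up word as measured by that device.
    """
    if not strengths:
        return None
    # Iterate in sorted order so that ties resolve to the same device on
    # every participant (max() returns the first maximal element).
    return max(sorted(strengths), key=lambda device_id: strengths[device_id])
```

  • Each device can run the same function on the same shared measurements and enter the wake-up state only if the returned identifier is its own.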
  • the multiple electronic devices can determine whether the user will wake up the device through the smart glasses.
  • the plurality of electronic devices may wait for a wake-up instruction, and enter the wake-up state after receiving the wake-up instruction. In this way, the plurality of electronic devices will not all enter the wake-up state upon detecting the wake-up word, which avoids false wake-ups.
  • the electronic device that has received the wake-up instruction is most likely to be the electronic device that the user wishes to wake up. This can bring a better user experience for the user to use the voice interaction function of the electronic device.
  • the present application provides an electronic device.
  • the electronic device can include memory and a processor.
  • memory can be used to store computer programs.
  • the processor may be used to invoke a computer program, so that the electronic device executes any possible implementation manner in the third aspect, the fourth aspect, or the fifth aspect.
  • the present application provides a chip applied to an electronic device. The chip includes one or more processors, and the processors are used to invoke computer instructions so that the electronic device executes any possible implementation manner in the third aspect, the fourth aspect, or the fifth aspect.
  • the present application provides a computer program product containing instructions. When the above-mentioned computer program product is run on an electronic device, the electronic device is caused to execute any possible implementation manner in the third aspect, the fourth aspect, or the fifth aspect.
  • the present application provides a computer-readable storage medium including instructions. When the above-mentioned instructions are run on an electronic device, the electronic device is caused to execute any possible implementation manner in the third aspect, the fourth aspect, or the fifth aspect.
  • the electronic device provided in the sixth aspect, the chip provided in the seventh aspect, the computer program product provided in the eighth aspect, and the computer-readable storage medium provided in the ninth aspect are all used to execute the method. Therefore, the beneficial effects that it can achieve can refer to the beneficial effects in the corresponding method, and will not be repeated here.
  • FIG. 1 is a schematic structural diagram of an electronic device provided in an embodiment of the present application.
  • FIG. 2 is a schematic diagram of a device wake-up scenario provided by an embodiment of the present application.
  • FIG. 3 is a schematic structural diagram of a communication system provided by an embodiment of the present application.
  • FIG. 4 is a schematic structural diagram of another communication system provided by an embodiment of the present application.
  • FIG. 5 is a schematic diagram of a device wake-up scenario provided by an embodiment of the present application.
  • FIG. 6 is a schematic diagram of images collected by smart glasses provided in an embodiment of the present application.
  • FIG. 7 is a schematic diagram of a device wake-up scenario provided by an embodiment of the present application.
  • FIG. 8 is a schematic diagram of images collected by smart glasses provided in an embodiment of the present application.
  • FIG. 9A and FIG. 9B are schematic diagrams of a device wake-up scenario provided by an embodiment of the present application.
  • FIG. 10 is a flowchart of a method for an electronic device to enter a wake-up state according to an embodiment of the present application.
  • FIG. 11 is a schematic structural diagram of smart glasses provided by an embodiment of the present application.
  • FIG. 12 is a flowchart of a method for waking up a device provided by an embodiment of the present application.
  • the terms “first” and “second” are used for descriptive purposes only, and cannot be understood as indicating or implying relative importance or implicitly specifying the quantity of the indicated technical features. Therefore, a feature defined as “first” or “second” may explicitly or implicitly include one or more of such features. In the description of the embodiments of the present application, unless otherwise specified, “multiple” means two or more.
  • embodiments of the present application provide a device wake-up method and a related device.
  • the electronic device involved in the embodiment of the present application is firstly introduced below.
  • FIG. 1 exemplarily shows a schematic structural diagram of an electronic device 100 .
  • the electronic device 100 may include a processor 110, an external memory interface 120, an internal memory 121, a universal serial bus (universal serial bus, USB) interface 130, a charging management module 140, a power management module 141, and a battery 142 , antenna 1, antenna 2, mobile communication module 150, wireless communication module 160, audio module 170, speaker 170A, receiver 170B, microphone 170C, earphone jack 170D, sensor module 180, button 190, motor 191, indicator 192, camera 193 , a display screen 194, and a subscriber identification module (subscriber identification module, SIM) card interface 195, etc.
  • the sensor module 180 may include a pressure sensor 180A, a gyroscope sensor 180B, an air pressure sensor 180C, a magnetic sensor 180D, an acceleration sensor 180E, a distance sensor 180F, a proximity light sensor 180G, a fingerprint sensor 180H, a temperature sensor 180J, a touch sensor 180K, an ambient light sensor 180L, bone conduction sensor 180M, etc.
  • the structure illustrated in the embodiment of the present application does not constitute a specific limitation on the electronic device 100 .
  • the electronic device 100 may include more or fewer components than shown in the figure, or combine certain components, or separate certain components, or arrange different components.
  • the illustrated components can be realized in hardware, software or a combination of software and hardware.
  • the processor 110 may include one or more processing units, for example: the processor 110 may include an application processor (AP), a modem processor, a graphics processing unit (GPU), an image signal processor (ISP), a controller, a memory, a video codec, a digital signal processor (DSP), a baseband processor, and/or a neural-network processing unit (NPU), and the like. Different processing units may be independent devices, or may be integrated in one or more processors.
  • the controller may be the nerve center and command center of the electronic device 100 .
  • the controller can generate an operation control signal according to the instruction opcode and timing signal, and complete the control of fetching and executing the instruction.
  • the processor 110 may include a voice wake-up module and a voice command recognition module.
  • the voice wake-up module and the voice command recognition module can be integrated in different processor chips and executed by different chips.
  • the voice wake-up module can be integrated in a coprocessor or DSP chip with low power consumption, and the voice command recognition module can be integrated in an AP or NPU or other chips.
  • the voice wake-up module and the voice command recognition module can be integrated in the same processor chip, and the same chip performs the related functions.
  • both the voice wake-up module and the voice command recognition module can be integrated in an AP chip or an NPU or other chips.
  • the processor 110 may also include a voice command execution module. After the speech instruction recognition module recognizes the speech instruction, the speech instruction execution module can execute the operation corresponding to the speech instruction. For example, play music, make calls, send text messages, and more.
  • the electronic device including the above-mentioned voice wake-up module, voice command recognition module and voice command execution module is an electronic device with voice interaction capability.
  • the aforementioned voice interaction capability may mean that the electronic device can respond to a user's voice command and perform an operation corresponding to the voice command.
  • a memory may also be provided in the processor 110 for storing instructions and data.
  • the memory in processor 110 is a cache memory.
  • the memory may hold instructions or data that the processor 110 has just used or uses cyclically. If the processor 110 needs to use the instructions or data again, it can fetch them directly from the memory. Repeated access is avoided, and the waiting time of the processor 110 is reduced, thus improving the efficiency of the system.
  • the USB interface 130 is an interface conforming to the USB standard specification, specifically, it can be a Mini USB interface, a Micro USB interface, a USB Type C interface, and the like.
  • the USB interface 130 can be used to connect a charger to charge the electronic device 100 , and can also be used to transmit data between the electronic device 100 and peripheral devices. It can also be used to connect headphones and play audio through them. This interface can also be used to connect other electronic devices, such as AR devices.
  • the charging management module 140 is configured to receive a charging input from a charger.
  • the charger may be a wireless charger or a wired charger.
  • the charging management module 140 can receive charging input from the wired charger through the USB interface 130 .
  • the charging management module 140 may receive a wireless charging input through a wireless charging coil of the electronic device 100. While the charging management module 140 is charging the battery 142, it can also supply power to the electronic device 100 through the power management module 141.
  • the power management module 141 is used for connecting the battery 142 , the charging management module 140 and the processor 110 .
  • the power management module 141 receives the input from the battery 142 and/or the charging management module 140 to provide power for the processor 110 , the internal memory 121 , the external memory, the display screen 194 , the camera 193 , and the wireless communication module 160 .
  • the power management module 141 may also be disposed in the processor 110 .
  • the power management module 141 and the charging management module 140 may also be set in the same device.
  • the wireless communication function of the electronic device 100 can be realized by the antenna 1 , the antenna 2 , the mobile communication module 150 , the wireless communication module 160 , a modem processor, a baseband processor, and the like.
  • Antenna 1 and Antenna 2 are used to transmit and receive electromagnetic wave signals.
  • Each antenna in electronic device 100 may be used to cover single or multiple communication frequency bands. Different antennas can also be multiplexed to improve the utilization of the antennas.
  • Antenna 1 can be multiplexed as a diversity antenna of a wireless local area network.
  • the antenna may be used in conjunction with a tuning switch.
  • the mobile communication module 150 can provide wireless communication solutions including 2G/3G/4G/5G applied on the electronic device 100 .
  • the mobile communication module 150 may include at least one filter, switch, power amplifier, low noise amplifier (low noise amplifier, LNA) and the like.
  • the mobile communication module 150 can receive electromagnetic waves through the antenna 1, filter and amplify the received electromagnetic waves, and send them to the modem processor for demodulation.
  • the mobile communication module 150 can also amplify the signals modulated by the modem processor, and convert them into electromagnetic waves through the antenna 1 for radiation.
  • at least part of the functional modules of the mobile communication module 150 may be set in the processor 110 .
  • at least part of the functional modules of the mobile communication module 150 and at least part of the modules of the processor 110 may be set in the same device.
  • the wireless communication module 160 can provide wireless communication solutions applied on the electronic device 100, including wireless local area networks (WLAN) (such as a wireless fidelity (Wi-Fi) network), Bluetooth (BT), global navigation satellite system (GNSS), frequency modulation (FM), near field communication (NFC), infrared (IR) technology, and the like.
  • the wireless communication module 160 may be one or more devices integrating at least one communication processing module.
  • the wireless communication module 160 receives electromagnetic waves via the antenna 2 , frequency-modulates and filters the electromagnetic wave signals, and sends the processed signals to the processor 110 .
  • the wireless communication module 160 can also receive the signal to be sent from the processor 110 , frequency-modulate it, amplify it, and convert it into electromagnetic waves through the antenna 2 for radiation.
  • the electronic device 100 realizes the display function through the GPU, the display screen 194 , and the application processor.
  • the GPU is a microprocessor for image processing, and is connected to the display screen 194 and the application processor. GPUs are used to perform mathematical and geometric calculations for graphics rendering.
  • Processor 110 may include one or more GPUs that execute program instructions to generate or change display information.
  • the display screen 194 is used to display images, videos and the like.
  • the electronic device 100 may include 1 or N display screens 194 , where N is a positive integer greater than 1.
  • the electronic device 100 can realize the shooting function through the ISP, the camera 193 , the video codec, the GPU, the display screen 194 and the application processor.
  • the ISP is used for processing the data fed back by the camera 193 .
  • the light is transmitted to the photosensitive element of the camera through the lens, and the light signal is converted into an electrical signal, and the photosensitive element of the camera transmits the electrical signal to the ISP for processing, and converts it into an image visible to the naked eye.
  • ISP can also perform algorithm optimization on image noise, brightness, and skin color.
  • ISP can also optimize the exposure, color temperature and other parameters of the shooting scene.
  • the ISP may be located in the camera 193 .
  • Camera 193 is used to capture still images or video.
  • the object generates an optical image through the lens and projects it to the photosensitive element.
  • the photosensitive element may be a charge coupled device (CCD) or a complementary metal-oxide-semiconductor (CMOS) phototransistor.
  • the photosensitive element converts the light signal into an electrical signal, and then transmits the electrical signal to the ISP to convert it into a digital image signal.
  • the ISP outputs the digital image signal to the DSP for processing.
  • DSP converts digital image signals into standard RGB, YUV and other image signals.
  • the electronic device 100 may include 1 or N cameras 193 , where N is a positive integer greater than 1.
  • Digital signal processors are used to process digital signals. In addition to digital image signals, they can also process other digital signals. For example, when the electronic device 100 selects a frequency point, the digital signal processor is used to perform Fourier transform on the energy of the frequency point.
  • Video codecs are used to compress or decompress digital video.
  • the electronic device 100 may support one or more video codecs.
  • the electronic device 100 can play or record videos in various encoding formats, for example: moving picture experts group (moving picture experts group, MPEG) 1, MPEG2, MPEG3, MPEG4 and so on.
  • the NPU is a neural-network (NN) computing processor.
  • Applications such as intelligent cognition of the electronic device 100 can be realized through the NPU, such as image recognition, face recognition, speech recognition, text understanding, and the like.
  • the external memory interface 120 can be used to connect an external memory card, such as a Micro SD card, so as to expand the storage capacity of the electronic device 100.
  • the external memory card communicates with the processor 110 through the external memory interface 120 to implement a data storage function. Such as saving music, video and other files in the external memory card.
  • the internal memory 121 may be used to store computer-executable program codes including instructions.
  • the processor 110 executes various functional applications and data processing of the electronic device 100 by executing instructions stored in the internal memory 121 .
  • the internal memory 121 may include an area for storing programs and an area for storing data.
  • the stored program area can store an operating system, at least one application program required by a function (such as a sound playing function, an image playing function, etc.) and the like.
  • the storage data area can store data created during the use of the electronic device 100 (such as audio data, phonebook, etc.) and the like.
  • the internal memory 121 may include a high-speed random access memory, and may also include a non-volatile memory, such as at least one magnetic disk storage device, flash memory device, universal flash storage (universal flash storage, UFS) and the like.
  • the electronic device 100 can implement audio functions through the audio module 170 , the speaker 170A, the receiver 170B, the microphone 170C, the earphone interface 170D, and the application processor. Such as music playback, recording, etc.
  • the audio module 170 is used to convert digital audio information into analog audio signal output, and is also used to convert analog audio input into digital audio signal.
  • the audio module 170 may also be used to encode and decode audio signals.
  • the audio module 170 may be set in the processor 110 , or some functional modules of the audio module 170 may be set in the processor 110 .
  • Speaker 170A, also referred to as a "horn", is used to convert audio electrical signals into sound signals.
  • A user can listen to music or to hands-free calls through the speaker 170A of the electronic device 100.
  • Receiver 170B, also called an “earpiece”, is used to convert audio electrical signals into sound signals.
  • the microphone 170C, also called a “mic” or “mike”, is used to convert sound signals into electrical signals. When making a phone call or sending a voice message, the user can speak with the mouth close to the microphone 170C, so that the sound signal is input to the microphone 170C.
  • the electronic device 100 may be provided with at least one microphone 170C. In some other embodiments, the electronic device 100 may be provided with two microphones 170C, which may also implement a noise reduction function in addition to collecting sound signals. In some other embodiments, the electronic device 100 can also be provided with three, four or more microphones 170C to collect sound signals, reduce noise, identify sound sources, and realize directional recording functions, etc.
  • microphone 170C may interface with a low power processor.
  • a voice wake-up module may be integrated in the low-power processor.
  • the microphone 170C can send the collected sound signal to the low power consumption processor.
  • the voice wake-up module in the low-power processor can detect whether the voice signal contains a preset wake-up word. If included, the low-power processor can wake up the application processor.
  • a voice command recognition module and a voice command execution module may be integrated in the application processor. When the application processor is woken up, the sound signal collected by the microphone 170C can be sent to the application processor through the above-mentioned low power consumption processor.
  • the speech command recognition module in the application processor can recognize the speech command in the sound signal. Further, the voice instruction executing module can execute the operation corresponding to the voice instruction.
  • the microphone 170C and the above-mentioned low-power processor can be in working state in real time.
  • the sound signal collected by the microphone 170C needs to be judged by the low-power processor first whether it contains a preset wake-up word.
  • the application processor is only woken up when the sound signal contains a preset wakeup word. This can save power consumption of the electronic device 100 .
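  • The two-stage arrangement described above, in which an always-on low-power processor gates the application processor, can be sketched as follows. This is an illustrative model only: the class names are hypothetical, sound signals are represented as text strings, keyword spotting is replaced by a plain substring match, and the wake-up word "hello device" used in the usage note is invented for the example.

```python
class ApplicationProcessor:
    """Stand-in for the AP hosting voice command recognition and execution."""

    def __init__(self):
        self.awake = False
        self.received = []  # sound signals forwarded while awake

    def wake(self):
        self.awake = True

    def handle(self, sound: str):
        self.received.append(sound)


class LowPowerProcessor:
    """Stand-in for the always-on processor hosting the voice wake-up module."""

    def __init__(self, ap: ApplicationProcessor, wake_word: str):
        self.ap = ap
        self.wake_word = wake_word

    def on_sound(self, sound: str):
        if not self.ap.awake:
            # Substring match is a placeholder for real wake-word detection.
            if self.wake_word not in sound:
                return  # the application processor stays asleep, saving power
            self.ap.wake()
        # Once the AP is awake, collected sound is forwarded to it.
        self.ap.handle(sound)
```

  • For example, with `LowPowerProcessor(ap, "hello device")`, a sound signal without the wake-up word never reaches the application processor, while later signals are forwarded once the wake-up word has been detected.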
  • the earphone interface 170D is used for connecting wired earphones.
  • the earphone interface 170D can be a USB interface 130, or a 3.5mm open mobile terminal platform (OMTP) standard interface, or a cellular telecommunications industry association of the USA (CTIA) standard interface.
  • the pressure sensor 180A is used to sense the pressure signal and convert the pressure signal into an electrical signal.
  • pressure sensor 180A may be disposed on display screen 194 .
  • there are many types of pressure sensors 180A, such as resistive pressure sensors, inductive pressure sensors, and capacitive pressure sensors.
  • a capacitive pressure sensor may be comprised of at least two parallel plates with conductive material.
  • the electronic device 100 determines the intensity of pressure according to the change in capacitance.
  • the electronic device 100 detects the intensity of the touch operation according to the pressure sensor 180A.
  • the electronic device 100 may also calculate the touched position according to the detection signal of the pressure sensor 180A.
  • touch operations acting on the same touch position but with different touch operation intensities may correspond to different operation instructions. For example: when a touch operation with a touch operation intensity less than the first pressure threshold acts on the short message application icon, an instruction to view short messages is executed. When a touch operation whose intensity is greater than or equal to the first pressure threshold acts on the icon of the short message application, the instruction of creating a new short message is executed.
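  • The threshold rule in the short message example can be written as a one-line mapping. This is a sketch under stated assumptions: the function name, the returned instruction labels, and the normalized threshold value 0.5 are hypothetical; the description only states that a first pressure threshold exists.

```python
FIRST_PRESSURE_THRESHOLD = 0.5  # hypothetical normalized value, not from the description


def sms_icon_instruction(intensity: float,
                         threshold: float = FIRST_PRESSURE_THRESHOLD) -> str:
    """Map the intensity of a touch on the short message icon to an instruction."""
    if intensity < threshold:
        return "view_short_messages"
    # Intensity greater than or equal to the threshold creates a new message.
    return "create_new_short_message"
```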
  • the gyro sensor 180B can be used to determine the motion posture of the electronic device 100 .
  • the angular velocity of the electronic device 100 around three axes (i.e., the x, y and z axes) may be determined by the gyro sensor 180B.
  • the air pressure sensor 180C is used to measure air pressure.
  • the magnetic sensor 180D includes a Hall sensor.
  • the electronic device 100 may use the magnetic sensor 180D to detect the opening and closing of the flip leather case.
  • the acceleration sensor 180E can detect the acceleration of the electronic device 100 in various directions (generally along three axes). When the electronic device 100 is stationary, the magnitude and direction of gravity can be detected. The acceleration sensor 180E can also be used to identify the posture of the electronic device 100, and can be used in applications such as switching between landscape and portrait display, pedometers, and the like.
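  • As an illustration of how a gravity reading could drive switching between landscape and portrait display, the following sketch compares the gravity components along two device axes. The axis convention (x along the short edge, y along the long edge) and the function name are assumptions made for this example and are not specified in the description.

```python
def screen_orientation(ax: float, ay: float) -> str:
    """Classify orientation from gravity components measured while stationary.

    ax: acceleration along the device's short edge, in m/s^2 (assumed axis)
    ay: acceleration along the device's long edge, in m/s^2 (assumed axis)
    """
    # Gravity mostly along the long edge means the device is held upright.
    return "portrait" if abs(ay) >= abs(ax) else "landscape"
```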
  • the distance sensor 180F is used to measure the distance.
  • the electronic device 100 may measure the distance by infrared or laser. In some embodiments, when shooting a scene, the electronic device 100 may use the distance sensor 180F for distance measurement to achieve fast focusing.
  • Proximity light sensor 180G may include, for example, light emitting diodes (LEDs) and light detectors, such as photodiodes.
  • the light emitting diodes may be infrared light emitting diodes.
  • the electronic device 100 emits infrared light through the light emitting diode.
  • Electronic device 100 uses photodiodes to detect infrared reflected light from nearby objects. When sufficient reflected light is detected, it may be determined that there is an object near the electronic device 100 . When insufficient reflected light is detected, the electronic device 100 may determine that there is no object near the electronic device 100 .
  • the ambient light sensor 180L is used for sensing ambient light brightness.
  • the electronic device 100 can adaptively adjust the brightness of the display screen 194 according to the perceived ambient light brightness.
  • the ambient light sensor 180L can also be used to automatically adjust the white balance when taking pictures.
  • the ambient light sensor 180L can also cooperate with the proximity light sensor 180G to detect whether the electronic device 100 is in the pocket, so as to prevent accidental touch.
  • the fingerprint sensor 180H is used to collect fingerprints.
  • the electronic device 100 can use the collected fingerprint characteristics to implement fingerprint unlocking, access to application locks, take pictures with fingerprints, answer incoming calls with fingerprints, and the like.
  • the temperature sensor 180J is used to detect temperature.
  • the electronic device 100 uses the temperature detected by the temperature sensor 180J to implement a temperature treatment strategy. For example, when the temperature reported by the temperature sensor 180J exceeds the threshold, the electronic device 100 may reduce the performance of the processor located near the temperature sensor 180J, so as to reduce power consumption and implement thermal protection. In other embodiments, when the temperature is lower than another threshold, the electronic device 100 heats the battery 142 to prevent the electronic device 100 from being shut down abnormally due to the low temperature.
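  • The temperature handling strategy above amounts to a comparison against two thresholds. The sketch below is illustrative only: the function name, the action labels, and the default threshold values in degrees Celsius are hypothetical and are not taken from the description.

```python
def thermal_action(temp_c: float,
                   high_threshold_c: float = 45.0,
                   low_threshold_c: float = 0.0) -> str:
    """Choose a temperature handling action for a reported temperature."""
    if temp_c > high_threshold_c:
        # Too hot: throttle the processor near the sensor for thermal protection.
        return "reduce_processor_performance"
    if temp_c < low_threshold_c:
        # Too cold: heat the battery to avoid an abnormal shutdown.
        return "heat_battery"
    return "no_action"
```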
  • Touch sensor 180K is also known as a “touch panel”.
  • the touch sensor 180K can be disposed on the display screen 194, and the touch sensor 180K and the display screen 194 together form a touchscreen, also called a “touch screen”.
  • the touch sensor 180K is used to detect a touch operation on or near it.
  • the touch sensor can pass the detected touch operation to the application processor to determine the type of touch event.
  • Visual output related to the touch operation can be provided through the display screen 194 .
  • the touch sensor 180K may also be disposed on the surface of the electronic device 100 , which is different from the position of the display screen 194 .
  • the bone conduction sensor 180M can acquire vibration signals.
  • the bone conduction sensor 180M can acquire the vibration signal of the vibrating bone of the human vocal part.
  • the bone conduction sensor 180M can also contact the human pulse and receive the blood pressure beating signal.
  • the bone conduction sensor 180M can also be disposed in the earphone, combined into a bone conduction earphone.
  • the audio module 170 can parse out the voice signal based on the vibration signal of the vibrating bone of the vocal part acquired by the bone conduction sensor 180M, so as to realize the voice function.
  • the keys 190 include a power key, a volume key and the like.
  • the key 190 may be a mechanical key. It can also be a touch button.
  • the electronic device 100 can receive key input and generate key signal input related to user settings and function control of the electronic device 100 .
  • the motor 191 can generate a vibrating reminder.
  • the indicator 192 can be an indicator light, and can be used to indicate charging status, power change, and can also be used to indicate messages, missed calls, notifications, and the like.
  • the SIM card interface 195 is used for connecting a SIM card.
  • a SIM card can be brought into contact with, or separated from, the electronic device 100 by inserting it into the SIM card interface 195 or pulling it out of the SIM card interface 195 .
  • the electronic device 100 may support 1 or N SIM card interfaces, where N is a positive integer greater than 1.
  • the SIM card interface 195 can support a Nano SIM card, a Micro SIM card, a SIM card, and the like. Multiple cards can be inserted into the same SIM card interface 195 at the same time. The types of the multiple cards may be the same or different.
  • the SIM card interface 195 is also compatible with different types of SIM cards.
  • the SIM card interface 195 is also compatible with external memory cards.
  • the electronic device 100 interacts with the network through the SIM card to implement functions such as calling and data communication.
  • the electronic device 100 adopts an eSIM, that is, an embedded SIM card.
  • the eSIM card can be embedded in the electronic device 100 and cannot be separated from the electronic device 100 .
  • the electronic device 100 may be a mobile phone, a tablet computer, a notebook computer, a speaker, a TV, a router, a wearable device (such as smart glasses, a smart watch, or a smart bracelet), a smart home device (such as a refrigerator, a washing machine, an air conditioner, or a lamp), or the like.
  • Speech recognition applications such as voice assistant applications
  • An electronic device installed with a voice assistant application has a voice interaction function.
  • the voice interaction function when the voice interaction function is turned on, the electronic device can collect the sound in the environment in real time, and detect whether the sound contains a wake-up word.
  • a wake word can be used to wake up an electronic device.
  • the aforementioned waking up of the electronic device may mean triggering the electronic device to invoke a processor (such as an application processor) that integrates a voice command recognition module and a voice command execution module, so as to recognize the voice command in the collected sound and execute the operation corresponding to the voice command.
  • the electronic device is in a sleep state.
  • the sleep state may indicate that the application processor of the electronic device is in a sleep state.
  • the microphone and the low-power processor of the electronic device can be in a working state in real time.
  • the application processor of the electronic device can be woken up to perform the operation corresponding to the voice command (for example, the speaker is woken up when it is in a sleep state and plays music according to the user's voice command).
  • the application processor of the electronic device is in a working state. Wherein, the microphone and the low-power processor of the electronic device can be in working state in real time.
  • the application processor of the electronic device can receive the sound collected by the microphone, recognize the voice command contained in the sound and perform the operation corresponding to the voice command (for example, the speaker detects the wake-up word while playing music and turns on the air conditioner according to the user's voice command).
  • a room includes multiple electronic devices with voice interaction functions.
  • a room includes a mobile phone, a speaker and a TV.
  • the mobile phone, speakers, and TV all have voice interaction functions, and the voice interaction functions are all turned on.
  • The user wants to instruct the speaker to play music through a voice command.
  • the user can say "Xiaoyi Xiaoyi, I want to listen to a song".
  • "Xiaoyi Xiaoyi" is the default wake-up word.
  • "I want to listen to a song" is the voice command. Since the mobile phone, the sound box and the TV can all collect sounds in the environment, all of these electronic devices can detect the wake-up word.
  • the plurality of electronic devices can all be woken up. When awakened, the plurality of electronic devices can recognize the above-mentioned voice command "I want to listen to a song", and perform the operation corresponding to the voice command, that is, play music.
  • device priority rankings are stored in multiple electronic devices with a voice interaction function.
  • the priority ranking of the devices may be: sound box>television>tablet computer>mobile phone.
  • These multiple electronic devices can communicate with each other after detecting the wake-up word (such as "Xiaoyi Xiaoyi"), and determine, according to the device priority ranking, the highest-ranked electronic device among them, such as a TV.
  • the TV may respond to the above-mentioned wake-up word, and wake up the application processor to execute the user's voice command.
  • the method for the television to respond to the wake-up word may be, for example, to answer "I am" by voice.
  • the lower-ranked electronic devices among the above multiple electronic devices will not respond to the above wake-up word.
  • This embodiment of the present application does not specifically limit the prioritization of the foregoing devices.
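  • The priority-based negotiation described above can be sketched as follows. This is a minimal sketch under stated assumptions: the function name `pick_responder` and the representation of devices as plain strings are hypothetical, and the ranking is the example ranking given above rather than a required one.

```python
from typing import Optional, Sequence

# Example ranking from the text: sound box > television > tablet computer > mobile phone.
DEFAULT_RANKING = ("sound box", "television", "tablet computer", "mobile phone")


def pick_responder(heard: Sequence[str],
                   ranking: Sequence[str] = DEFAULT_RANKING) -> Optional[str]:
    """Return the highest-ranked device among those that detected the wake-up word.

    Devices that are not in the ranking are ignored; None is returned when
    no ranked device detected the wake-up word.
    """
    heard_set = set(heard)
    for device in ranking:
        if device in heard_set:
            return device
    return None
```

  • For example, if a television, a tablet computer and a mobile phone detect the wake-up word, `pick_responder` returns the television, and the other two devices do not respond.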
  • the above-mentioned method for the plurality of electronic devices to communicate with each other after detecting the wake-up word may be a Bluetooth-based communication method. In that case, the multiple electronic devices are within Bluetooth communication range of one another. When the user speaks the wake-up word, the multiple electronic devices can all detect the wake-up word.
  • the embodiment of the present application does not limit the communication method when the above-mentioned multiple electronic devices negotiate which electronic device should respond to the above-mentioned wake-up word.
  • the above method can reduce cases in which multiple electronic devices all respond to the wake-up word.
  • However, the electronic device that responds to the wake-up word, as determined through negotiation among the plurality of electronic devices, is not necessarily the electronic device that the user wishes to wake up. That is to say, the above method has difficulty meeting the actual needs of users, and false wake-ups may still occur when electronic devices are woken up by the above-mentioned method.
  • An embodiment of the present application provides a method for waking up a device.
  • the user can use the smart glasses to wake up the electronic device that he wants to wake up.
  • the smart glasses can collect images.
  • the collected images are images within the user's field of view.
  • the smart glasses can perform image recognition processing on the image to determine the type of electronic equipment contained in the image.
  • the smart glasses can use a ranking algorithm to prioritize the electronic devices contained in this image.
  • the smart glasses can acquire a local device list, and send a wake-up instruction to the above-mentioned electronic device with the highest priority and existing in the local device list.
  • An electronic device with a voice interaction function can be woken up after receiving the wake-up command, while other electronic devices that have not received the wake-up command do not respond to the user's voice command.
  • when a user wishes to wake up an electronic device, the user usually looks at the electronic device and speaks a voice command. Then, if the user wears smart glasses, the smart glasses can collect images within the user's field of vision, and judge which electronic device the user wants to wake up based on the images. When the electronic device that the user wishes to wake up is determined, the smart glasses may send a wake-up instruction to the electronic device. When the wake-up instruction is received, the electronic device can be woken up, recognize the user's voice command and execute the operation corresponding to the voice command.
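  • The selection step of the method above (choosing the highest-priority device that was recognized in the image and that also exists in the local device list) can be sketched as follows. The function name and the use of string identifiers are assumptions for illustration; image recognition and the ranking algorithm themselves are outside this sketch and are represented only by their already-sorted output.

```python
from typing import Iterable, Optional, Sequence


def choose_wake_target(ranked_in_image: Sequence[str],
                       local_device_list: Iterable[str]) -> Optional[str]:
    """Return the first device, in priority order, that is also in the local device list.

    ranked_in_image is assumed to be sorted from highest to lowest priority.
    Returns None when no recognized device is in the local device list.
    """
    local = set(local_device_list)
    for device in ranked_in_image:
        if device in local:
            return device
    return None
```

  • A device that appears in the image but is not in the local device list (for example, a device belonging to someone else) is skipped, so the wake-up instruction is only ever addressed to a listed device.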
  • the above method involves communication between smart glasses and other electronic devices.
  • a communication system provided in the present application is introduced below.
  • FIG. 3 exemplarily shows a schematic diagram of the communication system 10 .
  • the communication system 10 may include a plurality of electronic devices, and a communication connection 108 may be established between the plurality of electronic devices.
  • the communication system 10 may include smart glasses 101 , a mobile phone 102 , a headset 103 , a tablet computer 104 , a router 105 , a speaker 106 and a TV 107 .
  • the communication system 10 may also include other types of electronic devices.
  • Examples include augmented reality (AR) devices, virtual reality (VR) devices, artificial intelligence (AI) devices, car machines, game consoles, and other smart wearable devices. The communication system 10 may also include Internet of Things (IoT) devices or smart home devices such as smart water heaters, smart lamps, smart air conditioners, and the like. This embodiment of the present application does not limit this.
  • a communication connection 108 may be established between electronic devices, and the communication connection 108 may be a near field communication connection.
  • the near field communication connection may be a wired connection, such as a universal serial bus (USB) connection, or a wireless connection, such as a Bluetooth communication connection, a Wi-Fi communication connection, a Wi-Fi peer-to-peer (Wi-Fi P2P) communication connection, and so on.
  • the embodiment of the present application does not limit the specific manner of the foregoing near field communication connection.
  • the local device list may include electronic devices connected to the same communication network, for example, the electronic devices in a family that are connected to the same home Wi-Fi network. Communication connections are established between the multiple electronic devices in the local device list.
  • the above communication system 10 has a local device list.
  • the electronic devices included in the communication system 10 are the electronic devices in the local device list. That is, the electronic device that joins the aforementioned communication connection 108 can be added to the aforementioned local device list. An electronic device that exits the communication connection 108 may be removed from the local device list.
  • the local device list may be stored in one or more electronic devices included in the communication system 10, or may be stored in a cloud server.
  • All electronic devices in the communication system 10 can acquire and update the local device list. For example, any electronic device in the communication system 10 may update the local device list according to the detection of electronic devices joining or leaving the communication connection 108 . If the aforementioned local device list is stored in multiple electronic devices in the communication system 10, the local device list can be updated synchronously among the multiple electronic devices. In this way, the local device lists acquired by the electronic devices in the communication system 10 are consistent.
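  • The maintenance of the local device list described above can be sketched as follows. This is a hedged illustration only: the class `LocalDeviceList`, its method names, and the version counter used to keep copies consistent are all assumptions, since the present application does not specify how the list is stored or synchronized.

```python
from typing import FrozenSet, Set


class LocalDeviceList:
    """One device's copy of the local device list (hypothetical structure)."""

    def __init__(self) -> None:
        self._devices: Set[str] = set()
        self.version = 0  # assumed counter, incremented on every change

    def join(self, device: str) -> None:
        """Add a device that joined the communication connection."""
        if device not in self._devices:
            self._devices.add(device)
            self.version += 1

    def leave(self, device: str) -> None:
        """Remove a device that exited the communication connection."""
        if device in self._devices:
            self._devices.remove(device)
            self.version += 1

    def devices(self) -> FrozenSet[str]:
        return frozenset(self._devices)

    def sync_from(self, other: "LocalDeviceList") -> None:
        """Adopt the other copy when it is newer, so that copies stay consistent."""
        if other.version > self.version:
            self._devices = set(other._devices)
            self.version = other.version
```

  • In this sketch a single copy is assumed to receive all updates and the other copies pull from it; concurrent updates on several copies would need a real conflict-resolution scheme, which is not covered here.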
  • the local device list may be created by an electronic device in the communication system 10 described above.
  • the one electronic device may be, for example, the mobile phone 102 .
  • the electronic devices added to the aforementioned local device list may be electronic devices that have undergone trusted identity authentication.
  • the above-mentioned trusted identity authentication may be realized by an electronic device (such as the mobile phone 102) existing in the local device list.
  • the mobile phone 102 may add the speaker 106 to the local device list in response to a user operation agreeing to add the speaker 106 to the local device list.
  • the embodiment of the present application does not limit the specific implementation manner of the above-mentioned trusted identity authentication.
  • the above process of trusted identity authentication may be, for example, a process of configuring network access for the electronic device (network provisioning).
  • the embodiment of the present application does not limit the communication connection manners of the electronic devices in the above-mentioned communication system 10 .
  • an electronic device in the communication system 10 may establish a communication connection 108 as shown in FIG. 4 .
  • the mobile phone 102 , the tablet computer 104 , the sound box 106 and the TV 107 can establish a Wi-Fi communication connection with the router 105 .
  • the aforementioned electronic devices that establish a Wi-Fi communication connection with the router 105 can access the network through the router 105 to realize the function of surfing the Internet. That is to say, the mobile phone 102, the tablet computer 104, the router 105, the sound box 106 and the TV 107 are in the same local area network (such as a home Wi-Fi network).
  • the electronic devices in the local device list may include the electronic devices in this one local area network.
  • the smart glasses 101 and the earphone 103 can establish a Bluetooth communication connection with the mobile phone 102 .
  • the mobile phone 102 can update the local device list.
  • the mobile phone 102 may add the smart glasses 101 and the earphone 103 to the local device list.
  • the electronic devices in the local device list may include smart glasses 101 , mobile phone 102 , earphone 103 , tablet computer 104 , router 105 , speaker 106 and TV 107 shown in FIG. 4 .
  • the mobile phone 102 can update the local device list. Specifically, the mobile phone 102 may remove the smart glasses 101 from the local device list. If the mobile phone 102 ends the Wi-Fi communication connection with the router 105 (such as after the user goes out with the mobile phone), and the router 105 is in the above-mentioned local area network, the router 105 can update the above-mentioned local device list. Specifically, the router 105 may remove the mobile phone 102 from the local device list.
  • it can be detected whether the electronic devices that joined the communication connection 108 through the mobile phone 102 (such as the smart glasses 101 and the earphone 103) are connected to any electronic device in the communication system 10 other than the mobile phone 102. If it is detected that the smart glasses 101 and the earphone 103 are only connected to the mobile phone 102, the smart glasses 101 and the earphone 103 may be removed from the local device list. If it is detected that the smart glasses 101 and the earphone 103 are also connected to other electronic devices (such as the tablet computer 104), the smart glasses 101 and the earphone 103 may remain in the local device list.
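  • The check described above, for devices that joined the communication connection 108 through a device that has since left, can be sketched as follows. The function name, the `links` mapping (each device mapped to the set of devices it is directly connected to), and the string identifiers are assumptions made for this illustration.

```python
from typing import Dict, Set


def prune_after_leave(device_list: Set[str],
                      links: Dict[str, Set[str]],
                      departed: str) -> Set[str]:
    """Return the local device list after `departed` has left.

    A device that was linked to `departed` is kept only if it is also linked
    to at least one other device that remains in the list. Devices that were
    never linked to `departed` are kept unchanged.
    """
    remaining = set(device_list) - {departed}
    kept = set()
    for device in remaining:
        peers = links.get(device, set())
        if departed not in peers:
            kept.add(device)
        elif (peers - {departed}) & remaining:
            kept.add(device)
    return kept
```

  • For example, when the mobile phone 102 leaves, smart glasses 101 that are connected only to the mobile phone 102 are removed, while an earphone 103 that is also connected to the tablet computer 104 stays in the list.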
  • the electronic devices in the local device list may include the electronic devices in the local area network established by the router 105 (such as the mobile phone 102, the tablet computer 104, the sound box 106, and the television 107), and electronic devices connected to the electronic devices in that local area network through other wireless connection methods (such as the smart glasses 101 and the earphone 103).
  • an application for controlling other electronic devices is installed in the mobile phone 102 .
  • the APP can be, for example, a smart home APP.
  • the electronic devices in the local device list can be the electronic devices (such as router 105, speaker 106, TV 107) that the mobile phone 102 can control through the smart home APP, and other electronic devices that are connected to the mobile phone 102 but cannot be controlled through the smart home APP (such as smart glasses 101, earphones 103).
  • the mobile phone 102 and the electronic devices that can be controlled by the smart home APP are connected to the router 105.
  • the mobile phone 102 can send control instructions to the aforementioned electronic devices that can be controlled by the smart home APP through the router 105 .
  • the mobile phone 102 can also directly communicate with the above-mentioned electronic devices that can be controlled by the smart home APP without forwarding by the router 105 .
  • the embodiment of the present application does not limit the implementation manner in which the mobile phone 102 controls other electronic devices through the smart home APP.
  • FIG. 3 and FIG. 4 are only exemplary descriptions of the embodiments of the present application, and should not limit the present application.
  • the following specifically introduces a schematic diagram of a scenario where a user wakes up an electronic device by means of smart glasses involved in the embodiment of the present application.
  • electronic devices in a family may include smart glasses 101 , mobile phones 102 , routers 105 , speakers 106 , and televisions 107 .
  • the mobile phone 102 , the sound box 106 and the TV 107 all establish Wi-Fi communication connections with the router 105 .
  • a user wears smart glasses 101 .
  • the smart glasses 101 establish a Bluetooth communication connection with the mobile phone 102 . It can be seen from the foregoing embodiments that the smart glasses 101, the mobile phone 102, the router 105, the sound box 106, and the television 107 can form a communication system.
  • the electronic devices in the local device list of the communication system include smart glasses 101 , mobile phones 102 , routers 105 , speakers 106 , and televisions 107 .
  • the user wishes to wake up the speaker 106 and instruct the speaker 106 to play music through a voice command.
  • the user wears the smart glasses 101 and looks at the speaker 106 and says "Xiaoyi Xiaoyi, I want to listen to a song".
  • the mobile phone 102, the sound box 106 and the TV 107 are all electronic devices that have a voice interaction function, and the voice interaction function is turned on.
  • the wake-up words used to wake up the mobile phone 102, the speaker 106 and the TV 107 are the same, for example, "Xiaoyi Xiaoyi”.
  • the mobile phone 102, the sound box 106 and the TV 107 can all collect the voice input of the user through the microphone.
  • the voice input above includes the wake-up word "Xiaoyi Xiaoyi" and the voice command "I want to listen to a song”.
  • the mobile phone 102, the speaker 106 and the TV 107 may all enter the pre-wake-up state from the wake-up word monitoring state.
  • the above-mentioned wake-up word monitoring state may be a state in which the electronic device collects ambient sound and identifies whether the wake-up word is included in the ambient sound.
  • the microphone and the low-power processor of the electronic device can work in real time.
  • the microphone can be used to collect ambient sound.
  • a low-power processor can be used to identify whether the wake word is contained in ambient sounds.
  • the aforementioned pre-wake-up state may be a state in which the electronic device, after detecting the wake-up word, detects whether there are smart glasses in the local device list and whether the smart glasses are worn. That is to say, upon detecting the wake-up word, the mobile phone 102, the sound box 106 and the TV 107 can detect whether the smart glasses 101 exist in the local device list and whether the smart glasses 101 are worn, instead of being woken up immediately in response to the wake-up word.
  • the electronic device may wait for receiving a wake-up instruction, and does not respond to the monitored wake-up words, voice instructions, and the like.
  • the electronic device may enter the wake-up state.
  • the mobile phone 102, the sound box 106 and the TV 107 can all enter the wake-up state from the above-mentioned pre-wake-up state.
  • the mobile phone 102, the sound box 106 and the TV 107 can communicate with each other, negotiate and determine an electronic device to respond to the wake-up word.
  • the selected electronic device can enter the wake-up state from the above-mentioned pre-wake-up state.
  • Other electronic devices can enter the above-mentioned wake-up word monitoring state again from the above-mentioned pre-wake-up state. That is, other electronic devices do not respond to the above-mentioned wake-up word.
  • the aforementioned wake-up state may indicate that the voice recognition application of the electronic device is in a state of being woken up.
  • the electronic device can start a speech recognition application. Specifically, the electronic device can start the application processor to recognize the voice command, and execute the operation corresponding to the voice command. It should be noted that, in the aforementioned wake-up state, the electronic device may also monitor in real time whether the ambient sound contains a wake-up word. In a possible implementation, after the electronic device enters the wake-up state, if no voice command is recognized in the ambient sound within a preset time period, the electronic device may enter the wake-up word monitoring state from the wake-up state.
  • the mobile phone 102, the sound box 106 and the TV 107 can all enter the wake-up state from the above-mentioned pre-wake-up state.
  • the mobile phone 102, the sound box 106 and the TV 107 can communicate with each other, negotiate and determine an electronic device to respond to the wake-up word.
  • If the smart glasses 101 do not exist in the above local device list, or if the smart glasses 101 exist in the local device list but are not worn, the user cannot use the smart glasses 101 to wake up the electronic device that the user wishes to wake up. Then, the electronic devices with a voice interaction function may all be woken up after detecting the wake-up word, or may negotiate and determine the electronic device that the user most likely wishes to wake up, and that electronic device responds to the wake-up word.
  • the mobile phone 102, the sound box 106 and the TV 107 can wait for a wake-up instruction.
  • the mobile phone 102, the sound box 106 and the TV 107 may wait for a wake-up instruction within a preset time period. If a wake-up instruction is received within a preset time period, the electronic device that receives the wake-up instruction may enter a wake-up state from a pre-wake-up state. If no wake-up instruction is received within the preset time period, the electronic device may enter the wake-up word monitoring state again from the pre-wake-up state.
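  • The transitions among the wake-up word monitoring state, the pre-wake-up state and the wake-up state described above can be sketched as a small state machine. This is an illustrative sketch: the class and method names, the 2-second default for the preset time period, and the use of an explicit `now` timestamp are assumptions. When no worn smart glasses are found, the sketch wakes the device directly and omits the alternative of negotiating among devices.

```python
from enum import Enum, auto
from typing import Optional


class State(Enum):
    MONITORING = auto()  # wake-up word monitoring state
    PRE_WAKE = auto()    # pre-wake-up state, waiting for a wake-up instruction
    AWAKE = auto()       # wake-up state


class VoiceDevice:
    def __init__(self, name: str, wait_s: float = 2.0) -> None:
        self.name = name
        self.wait_s = wait_s  # assumed value for the preset time period
        self.state = State.MONITORING
        self._deadline: Optional[float] = None

    def on_wake_word(self, now: float, glasses_listed: bool, glasses_worn: bool) -> None:
        """Handle detection of the wake-up word."""
        if glasses_listed and glasses_worn:
            # Worn smart glasses are in the local device list: wait for an instruction.
            self.state = State.PRE_WAKE
            self._deadline = now + self.wait_s
        else:
            # No usable smart glasses: wake directly (negotiation omitted in this sketch).
            self.state = State.AWAKE
            self._deadline = None

    def on_wake_instruction(self, now: float) -> None:
        """Enter the wake-up state if the instruction arrives within the waiting period."""
        if self.state is State.PRE_WAKE and self._deadline is not None and now <= self._deadline:
            self.state = State.AWAKE
            self._deadline = None

    def tick(self, now: float) -> None:
        """Return to wake-up word monitoring once the waiting period has expired."""
        if self.state is State.PRE_WAKE and self._deadline is not None and now > self._deadline:
            self.state = State.MONITORING
            self._deadline = None
```

  • In the scenario above, the speaker 106 and the TV 107 would both call `on_wake_word`, only the speaker 106 would later receive `on_wake_instruction`, and the TV 107 would fall back to monitoring on its next `tick` after the waiting period.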
  • When the smart glasses 101 are in the wearing state, they may detect whether the user needs to wake up other electronic devices.
  • the smart glasses 101 have a microphone and a processor with low power consumption.
  • the microphone and the low power consumption processor of the smart glasses 101 may be in the working state.
  • the smart glasses 101 can collect ambient sound through a microphone, and use a low-power processor to identify whether the ambient sound contains a wake-up word. When the wake-up word is detected, the smart glasses 101 may determine that the user needs to wake up other electronic devices. Further, the smart glasses 101 can collect images through a camera. The collected images are images within the user's field of view. The smart glasses can perform image recognition processing on the image to determine the type of electronic equipment contained in the image.
  • the image collected by the smart glasses 101 can be as shown in FIG. 6 .
  • Electronic devices in the image include speakers 106 and television 107 .
  • the speaker 106 is located in the center of the image.
  • Television 107 is located on the right edge of the image.
  • the smart glasses can use a sorting algorithm to prioritize the electronic devices in this image. For example, the result of the above priority sorting is that the priority of the sound box 106 is higher than that of the TV 107 .
  • the smart glasses 101 may obtain a local device list, and send a wake-up instruction to the above-mentioned electronic device with the highest priority and existing in the local device list. Since the speaker 106 and the TV 107 both exist in the local device list, and in the above priority sorting, the priority of the speaker 106 is higher than that of the TV 107, the smart glasses 101 can determine that the speaker 106 is the electronic device that the user wants to wake up. Then, the smart glasses 101 may send a wake-up instruction to the speaker 106 .
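  • The present application does not specify the sorting algorithm. Purely as one possible illustration, consistent with the example in which the speaker 106 at the center of the image ranks above the TV 107 at the right edge, the following sketch ranks recognized devices by the distance of their bounding-box centers from the image center. The `Detection` type, the normalized coordinates, and the distance heuristic are all assumptions and should not be read as the method of the present application.

```python
from dataclasses import dataclass
from math import hypot
from typing import List, Sequence


@dataclass(frozen=True)
class Detection:
    device: str
    cx: float  # bounding-box center x, normalized to [0, 1]
    cy: float  # bounding-box center y, normalized to [0, 1]


def rank_by_center_distance(detections: Sequence[Detection]) -> List[Detection]:
    """Sort detections so that the one closest to the image center comes first.

    Assumed heuristic: the user tends to look straight at the device to be
    woken up, so that device tends to appear near the center of the image.
    """
    return sorted(detections, key=lambda d: hypot(d.cx - 0.5, d.cy - 0.5))
```

  • Other signals, such as the size of the device in the image or the recognition confidence, could equally be used to rank the devices; the heuristic here was chosen only because it is simple to state.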
  • the speaker 106 may enter the wake-up state from the above-mentioned pre-wake-up state.
  • the speaker 106 can use the voice command recognition module to recognize the information contained in the sound after the wake-up word.
  • the voice after the above-mentioned wake-up word contains the voice instruction "I want to listen to a song".
  • the voice command recognition module of the speaker 106 can recognize the voice command.
  • the speaker 106 can execute the operation corresponding to the voice command through the voice command execution module, that is, play music.
  • the speaker 106 can answer "no problem" by voice, and start playing music.
  • the embodiment of the present application does not limit the content of the voice answer of the above-mentioned speaker 106 .
  • the smart glasses 101 may send the result of prioritization of the electronic devices included in the image to the mobile phone 102 .
  • the mobile phone 102 may acquire the local device list, and determine the electronic device that the user wishes to wake up according to the local device list and the result of the prioritization.
  • the above-mentioned electronic device that the user wishes to wake up is the above-mentioned electronic device with the highest priority and existing in the local device list.
  • the mobile phone can send a wake-up instruction to the electronic device that the user wishes to wake up.
  • the mobile phone 102 may send a wake-up instruction to the electronic device that the user wishes to wake up through the router 105, or directly send a wake-up instruction to the electronic device that the user wishes to wake up.
  • the embodiment of the present application does not limit the method for the smart glasses 101 or the mobile phone 102 to send a wake-up instruction to the electronic device that the user wishes to wake up.
  • the above-mentioned process of performing image recognition processing on the image and/or the process of prioritizing the electronic devices in the image by using a sorting algorithm may also be implemented by the mobile phone 102 .
  • the above method can effectively save computing resources and power consumption of the smart glasses 101 .
  • the smart glasses 101 may be connected to the router 105 .
  • When the smart glasses 101 detect that the user needs to wake up other electronic devices, they can collect images.
  • the process of sending the wake-up instruction can also be realized by the router 105 . This can effectively save computing resources and power consumption of the smart glasses 101 .
  • Smart glasses can be used to determine which electronic device the user wishes to wake up.
  • the smart glasses (or an electronic device such as a mobile phone or a router) may send a wake-up instruction to the electronic device that the user wishes to wake up.
  • the electronic device that receives the wake-up instruction can enter the wake-up state.
  • the electronic device that monitors the wake-up word but does not receive the wake-up instruction does not enter the wake-up state.
  • the user can use the smart glasses to wake up the electronic device he wants to wake up. This can effectively reduce the situation of false wake-up, and bring a better user experience for the user to use the voice interaction function of the electronic device.
  • FIG. 7 and FIG. 8 exemplarily show another scenario where a user wakes up an electronic device by means of smart glasses according to an embodiment of the present application.
  • electronic devices in a family may include smart glasses 101 , mobile phones 102 , routers 105 , speakers 106 , and televisions 107 .
  • the user has awakened the speaker 106 by means of the smart glasses 101 , and instructed the speaker 106 to play music through a voice command. Further, the user wishes to wake up the mobile phone 102 and instruct the mobile phone 102 to send a short message through a voice command. As shown in FIG. 7 , the speaker 106 has played music in response to the user's voice instruction for playing music (such as "I want to listen to a song").
  • the user wears the smart glasses 101 and looks at the mobile phone 102 and says "Xiaoyi Xiaoyi, send a text message to Lao Zhang".
  • the mobile phone 102, the sound box 106 and the TV 107 can all collect the voice input of the user through the microphone.
  • the above-mentioned voice input includes the wake-up word "Xiaoyi Xiaoyi" and the voice command "send a text message to Lao Zhang".
  • the mobile phone 102, the speaker 106, and the TV 107 are all in a wake-up word monitoring state.
  • the mobile phone 102, the speaker 106 and the TV 107 can all enter the pre-wake-up state from the wake-up word monitoring state.
  • the speaker 106 has not exited the wake-up state after receiving the wake-up instruction in the scene shown in FIG. 5 and entering the wake-up state. That is, before the wake-up word shown in FIG. 7 is detected, the mobile phone 102 and the TV 107 are in the wake-up word monitoring state, and the speaker 106 is in the wake-up state. When the wake-up word is detected, the mobile phone 102 and the TV 107 can enter the pre-wake-up state from the wake-up word monitoring state, and the speaker 106 can enter the pre-wake-up state from the wake-up state.
  • the mobile phone 102, the sound box 106 and the TV 107 can all obtain the local device list, and detect that the smart glasses 101 exist in the local device list and the smart glasses 101 are in the wearing state. Then, the mobile phone 102, the sound box 106 and the TV 107 can wait for the wake-up instruction, instead of entering the wake-up state immediately.
  • When the smart glasses 101 are in the wearing state, they may detect whether the user needs to wake up other electronic devices.
  • the method for determining which electronic device in the local device list is the electronic device that the user wants to wake up can refer to the embodiment shown in FIG. 5 above.
  • the smart glasses 101 can collect images.
  • the image is the image within the user's field of view.
  • the electronic devices in this image include a mobile phone 102 .
  • the mobile phone 102 is the electronic device that the user wishes to wake up.
  • Cell phone 102 may receive a wake-up instruction.
  • the mobile phone 102 can enter into the wake-up state, and recognize the information contained in the sound after the wake-up word.
  • the voice after the above-mentioned wake-up word contains the voice instruction "send a text message to Lao Zhang".
  • the voice command recognition module of the mobile phone 102 can recognize the voice command.
  • the mobile phone 102 can execute the operation corresponding to the voice command through the voice command execution module, that is, send a short message.
  • the mobile phone 102 may search whether there is a contact information named "Lao Zhang" in the contacts application. If it exists, the mobile phone 102 can answer "OK, please say the content of the short message" by voice.
  • the embodiment of the present application does not limit the content of the above-mentioned voice answer of the mobile phone 102 .
  • the speaker 106 and the TV 107 may enter the wake-up word monitoring state from the pre-wake-up state if they do not receive a wake-up instruction within a preset period of time after entering the pre-wake-up state.
  • the smart glasses 101 can send the collected images to the mobile phone 102, and instruct the mobile phone 102 to determine which electronic device is the electronic device that the user wishes to wake up.
  • the mobile phone 102 may perform image recognition processing on the image, and use a sorting algorithm to prioritize the identified electronic devices contained in the image, so as to determine the electronic device that the user wishes to wake up. If the mobile phone 102 determines that the electronic device that the user wants to wake up is itself (ie, the mobile phone 102), the mobile phone 102 can enter the wake-up state without sending a wake-up command. If the mobile phone 102 determines that the electronic device that the user wishes to wake up is not itself, the mobile phone 102 may send a wake-up instruction to the determined electronic device that the user wishes to wake up.
  • When wearing the smart glasses 101, the user can touch a preset position of the smart glasses 101 (such as a position on the temple) to trigger the smart glasses 101 to recognize the electronic device that the user wants to wake up and to send a wake-up instruction to that electronic device.
  • The user can speak the voice command directly after touching the preset position of the smart glasses 101, without saying the wake-up word.
  • the electronic device that receives the wake-up instruction can recognize the user's voice instruction and execute the operation corresponding to the voice instruction.
  • the above method can not only reduce false wake-ups, but also simplify user operations for users to control electronic devices through voice, and improve user experience.
  • FIG. 9A and FIG. 9B exemplarily show another scenario where a user wakes up an electronic device by means of smart glasses according to an embodiment of the present application.
  • electronic devices in a family may include smart glasses 101 , mobile phones 102 , routers 105 , speakers 106 , and televisions 107 .
  • the user wishes to wake up the speaker 106 and instruct the speaker 106 to play music through a voice command.
  • the user may wear the smart glasses 101, look at the sound box 106 and touch a preset position of the smart glasses 101 (such as a position on the temple).
  • When the smart glasses 101 are in the wearing state, they may detect whether the user needs to wake up other electronic devices.
  • the smart glasses 101 may determine that the user needs to wake up other electronic devices. Further, the smart glasses 101 can collect images through a camera. The image is the image within the user's field of vision. The smart glasses can perform image recognition processing on the image to determine the type of electronic equipment contained in the image. Further, the smart glasses can use a sorting algorithm to prioritize the electronic devices in the image.
  • the images collected by the smart glasses 101 in the scene shown in FIG. 9A may refer to the aforementioned images shown in FIG. 6 (including the sound box 106 and the TV 107 ). For example, the result of the above priority sorting is that the priority of the sound box 106 is higher than that of the TV 107 .
  • the embodiment of the present application does not limit the preset position of the above-mentioned smart glasses 101 .
  • the smart glasses 101 may obtain a local device list, and send a wake-up instruction to the above-mentioned electronic device with the highest priority and existing in the local device list. Since the speaker 106 and the TV 107 both exist in the local device list, and in the above priority sorting, the priority of the speaker 106 is higher than that of the TV 107, the smart glasses 101 can determine that the speaker 106 is the electronic device that the user wants to wake up. Then, the smart glasses 101 may send a wake-up instruction to the speaker 106 .
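The selection step described above (choose the highest-priority recognized device that also exists in the local device list) can be sketched as follows. This is only an illustrative sketch: the function name `pick_wake_target` and the use of plain strings as device identifiers are assumptions, not part of the embodiment.

```python
from typing import Optional, Sequence


def pick_wake_target(ranked_devices: Sequence[str],
                     local_device_list: Sequence[str]) -> Optional[str]:
    """Return the highest-priority recognized device that is in the local device list.

    ranked_devices is assumed to be ordered from highest to lowest priority.
    Returns None when no recognized device is in the local device list.
    """
    local = set(local_device_list)
    for device in ranked_devices:
        if device in local:
            return device
    return None


# Hypothetical identifiers mirroring the scene of FIG. 9A.
local_devices = ["mobile phone 102", "router 105", "speaker 106", "TV 107"]
target = pick_wake_target(["speaker 106", "TV 107"], local_devices)
```

In this example the wake-up instruction would be sent to `target`, which is the speaker 106.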
  • the electronic device may directly enter the wake-up state.
  • the mobile phone 102 , the speaker 106 and the TV 107 are all in the wake-up word monitoring state.
  • the speaker 106 can enter the wake-up state.
  • the speaker 106 can recognize the information in the ambient sound after receiving the wake-up command through the voice command recognition module. If the voice command is not recognized from the ambient sound, the speaker 106 can voice answer "I am here" to remind the user that it has been awakened and can execute the user's voice command. The embodiment of the present application does not limit the content of the voice answer after the speaker 106 receives the wake-up instruction. If a voice instruction is recognized from the ambient sound, such as "I want to listen to a song", the speaker 106 can answer "no problem" and start playing music.
  • the user can issue a voice command "I want to listen to a song" to the speaker 106.
  • the speaker 106 can recognize the voice command through the voice command recognition module, and execute the operation corresponding to the voice command through the voice command execution module, that is, play music. For example, after the speaker 106 recognizes the voice instruction "I want to listen to a song", it can voice answer "no problem" and start playing music.
  • the speaker 106 can use the voice command recognition module to recognize the information in the ambient sound collected from time A before receiving the wake-up command.
  • the user may say the voice command "I want to listen to a song" while looking at the speaker 106 and touching the preset position of the smart glasses 101 . That is to say, the user may speak the voice command before the speaker 106 receives the wake-up command. Then, the speaker 106 recognizes the voice command from the ambient sound collected within a period of time before receiving the wake-up command, which can reduce the occurrence of missed voice command recognition and improve user experience.
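One way to recognize a voice command spoken shortly before the wake-up command arrives is to keep a bounded buffer of recent audio frames. The sketch below assumes a fixed-size ring buffer; the class name `PreRollBuffer`, the frame representation, and the buffer length are hypothetical and chosen only for illustration.

```python
from collections import deque


class PreRollBuffer:
    """Keeps only the most recent audio frames collected before a wake-up command."""

    def __init__(self, max_frames: int):
        # deque with maxlen silently drops the oldest frame when full.
        self._frames = deque(maxlen=max_frames)

    def push(self, frame):
        self._frames.append(frame)

    def snapshot(self):
        """Return the buffered frames, oldest first."""
        return list(self._frames)


buf = PreRollBuffer(max_frames=3)
for frame in ["f1", "f2", "f3", "f4"]:  # frames collected before the wake-up command
    buf.push(frame)

# On receiving the wake-up command, recognition starts from the buffered
# frames and continues with frames collected afterwards ("f5" here).
audio_for_recognition = buf.snapshot() + ["f5"]
```

The buffer length bounds how far back before the wake-up command the speaker can look, which trades memory against the chance of missing an early voice command.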
  • the smart glasses 101 can be connected to the mobile phone 102 or to the router 105 .
  • the above-mentioned process of performing image recognition processing on an image and/or the process of prioritizing electronic devices in an image by using a sorting algorithm may also be implemented by the mobile phone 102 or by the router 105 . This can effectively save computing resources and power consumption of the smart glasses 101 .
  • the smart glasses can determine which electronic device in the local device list is the electronic device that the user wishes to wake up, and wake up the electronic device that the user wishes to wake up.
  • the user can directly issue voice commands without uttering the wake-up word. This can not only reduce false wake-ups, but also simplify user operations for users to control electronic devices through voice, and improve user experience.
  • the user when wearing the smart glasses 101 , the user can trigger the smart glasses 101 to identify the electronic device that the user wants to wake up by touching a preset position of the smart glasses 101 (such as a position on the temple). Moreover, the user can speak the wake-up word before speaking the voice command. Then, the electronic device that receives the wake-up instruction can enter the wake-up state. If the wake-up word is monitored, the electronic device in the wake-up state can recognize whether the sound after the wake-up word contains a voice command through the voice command recognition module. Further, when the voice command is recognized, the electronic device can execute the operation corresponding to the voice command through the voice command executing module.
  • the smart glasses 101 may collect an image, and determine the electronic device that the user wishes to wake up according to the image.
  • the smart glasses 101 can collect an image, and perform image recognition processing on the image.
  • the smart glasses 101 may send an indication message to the electronic devices in the local device list (such as the mobile phone 102, the sound box 106, the TV 107, etc.).
  • the indication message may be used to indicate that the smart glasses 101 will not send a wake-up instruction.
  • the electronic device such as mobile phone 102, speaker 106, TV 107, etc.
  • the electronic device with the voice interaction function enabled can wait for a wake-up instruction.
  • the electronic devices with the voice interaction function enabled may all enter the wake-up state, or one of the electronic devices with the voice interaction function enabled enters the wake-up state.
  • the electronic device with the voice interaction function enabled may negotiate and select the electronic device that receives the sound signal containing the wake-up word with the highest intensity according to the strength of the received sound signal containing the wake-up word.
  • the electronic device that receives the sound signal containing the wake-up word with the highest intensity may enter the wake-up state.
  • Other devices can enter the wake-up word monitoring state from the pre-wake-up state.
  • When a user who is not wearing smart glasses wakes up a device through the wake-up word, and there is no electronic device with the voice interaction function enabled in the field of view of the user wearing the smart glasses, at least one of the electronic devices with the voice interaction function enabled may enter the wake-up state in response to the wake-up word. In this way, when the user who does not wear the smart glasses is in the same environment as the user who wears the smart glasses, the situation in which the user who does not wear the smart glasses cannot wake up a device through the wake-up word can be avoided.
  • an electronic device having a voice interaction function and the voice interaction function is turned on can always monitor whether the ambient sound contains a wake-up word.
  • the electronic device may enter a pre-wake-up state.
  • The method by which the electronic device monitors whether the ambient sound contains the wake-up word is described in detail below.
  • the electronic device can collect ambient sound through a microphone.
  • the wake-up voice such as "Xiao Yi Xiao Yi"
  • the electronic device can separate the user's wake-up voice from the ambient sound.
  • the electronic device may use an acoustic model to decode a phoneme sequence from the user's wake-up voice.
  • the electronic device can determine whether the decoded phoneme sequence matches the stored wake-up word phoneme sequence. If yes, it indicates that the wake-up voice includes a wake-up word.
  • the electronic device may enter a pre-wake-up state.
  • the embodiment of the present application does not specifically limit the above wake-up voice.
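The phoneme-matching check described above can be sketched as a contiguous subsequence search. This is a minimal sketch, assuming the acoustic model has already produced a list of phoneme labels; the phoneme labels and the function name `contains_wake_word` are hypothetical, and a real system might use a tolerant rather than exact match.

```python
from typing import Sequence


def contains_wake_word(decoded: Sequence[str], wake_word: Sequence[str]) -> bool:
    """Return True if the stored wake-up word phoneme sequence appears
    as a contiguous run inside the decoded phoneme sequence."""
    n = len(wake_word)
    if n == 0:
        return False
    return any(
        list(decoded[i:i + n]) == list(wake_word)
        for i in range(len(decoded) - n + 1)
    )


# Hypothetical phoneme labels for the wake-up word "Xiao Yi Xiao Yi".
stored = ["x", "iao", "y", "i", "x", "iao", "y", "i"]
decoded_ok = ["sil"] + stored + ["sil"]
decoded_other = ["n", "i", "h", "ao"]
```

When the function returns True for the decoded sequence, the electronic device may enter the pre-wake-up state.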
  • the electronic device may collect ambient sound through a microphone.
  • the wake-up voice such as "Xiao Yi Xiao Yi"
  • the electronic device can separate the user's wake-up voice from the ambient sound.
  • the electronic device may use an acoustic model to decode a phoneme sequence from the user's wake-up voice.
  • the electronic device can further decode text information from the decoded phoneme sequence.
  • the electronic device can determine whether the text information decoded from the wake-up voice contains the stored wake-up word text. If yes, it indicates that the wake-up voice includes a wake-up word. When it is determined that the ambient sound contains a wake-up word, the electronic device may enter a pre-wake-up state.
  • the electronic device may extract a wake-up word and a voiceprint feature of the user from the user's wake-up voice.
  • the electronic device may enter a pre-wake-up state. This ensures that only a specific user can wake up the electronic device and control it through voice commands, which improves the information security of the electronic device.
  • the embodiment of the present application does not limit the specific method for the electronic device to monitor whether the ambient sound contains the wake-up word.
  • the electronic device in the wake-up word monitoring state or in the pre-wake-up state may enter the wake-up state after receiving the wake-up instruction.
  • the above wake-up instruction may be sent by smart glasses or other electronic devices that establish a communication connection with the smart glasses.
  • the above wake-up instruction can be used to instruct the electronic device to enter the wake-up state.
  • the electronic device can start a speech recognition application. Specifically, starting the voice recognition application may start the voice command recognition module and the voice command execution module in the application processor of the electronic device. The electronic device can recognize the user's voice command in the ambient sound through the voice command recognition module, and execute the operation corresponding to the voice command through the voice command execution module. In the wake-up state, the electronic device can also monitor in real time whether the ambient sound contains the wake-up word. If the electronic device in the wake-up state monitors the wake-up word, the electronic device may enter the pre-wake-up state from the wake-up state.
  • the electronic device in the wake-up word monitoring state and in the wake-up state can monitor whether the wake-up word is included in the ambient sound in real time.
  • the electronic device in the wake-up word monitoring state cannot recognize the user's voice command or perform the operation corresponding to the voice command.
  • the application processor of the electronic device is in a dormant state.
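The three device states and the transitions described in this section can be summarized as a small state machine. The sketch below is illustrative only: the enum and function names are hypothetical, and leaving the state unchanged for any combination not mentioned in the text is an assumption.

```python
from enum import Enum, auto


class State(Enum):
    WAKE_WORD_MONITORING = auto()
    PRE_WAKE_UP = auto()
    WAKE_UP = auto()


class Event(Enum):
    WAKE_WORD_DETECTED = auto()
    WAKE_INSTRUCTION_RECEIVED = auto()
    WAIT_TIMED_OUT = auto()  # no wake-up instruction within the preset period


# Transitions taken from the description; anything else keeps the current state.
TRANSITIONS = {
    (State.WAKE_WORD_MONITORING, Event.WAKE_WORD_DETECTED): State.PRE_WAKE_UP,
    (State.WAKE_UP, Event.WAKE_WORD_DETECTED): State.PRE_WAKE_UP,
    (State.WAKE_WORD_MONITORING, Event.WAKE_INSTRUCTION_RECEIVED): State.WAKE_UP,
    (State.PRE_WAKE_UP, Event.WAKE_INSTRUCTION_RECEIVED): State.WAKE_UP,
    (State.PRE_WAKE_UP, Event.WAIT_TIMED_OUT): State.WAKE_WORD_MONITORING,
}


def next_state(state: State, event: Event) -> State:
    """Return the state reached from `state` when `event` occurs."""
    return TRANSITIONS.get((state, event), state)
```

For example, a speaker in the wake-up word monitoring state that detects the wake-up word moves to the pre-wake-up state, and then either enters the wake-up state on a wake-up instruction or falls back to monitoring on timeout.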
  • After an electronic device whose voice interaction function is turned on detects the wake-up word, it can determine whether the smart glasses exist in the local device list and whether the smart glasses are in the wearing state, and thereby decide whether to enter the wake-up state immediately. It can be understood that when the smart glasses exist in the local device list and the smart glasses are in the wearing state, the possibility of the user waking up the electronic device by means of the smart glasses is high. If the smart glasses do not exist in the local device list (for example, the user does not have smart glasses) or the smart glasses exist in the local device list but the smart glasses are not in the wearing state, the possibility of the user waking up the electronic device by means of the smart glasses is low. Then, the electronic device that detects the wake-up word and enters the pre-wake-up state can determine whether to enter the wake-up state according to the method of directly waking up the electronic device through the wake-up word provided in the embodiment of the present application.
  • The following describes the flow chart of the method, provided by the embodiment of the present application, by which the electronic device determines whether to enter the wake-up state according to whether smart glasses are present.
  • The method may include steps S101-S106, wherein:
  • the electronic device monitors a wake-up word.
  • the electronic device can be any of the electronic devices that exist in the local device list and whose voice interaction function is turned on.
  • the implementation method for the electronic device to collect ambient sound and identify whether there is a wake-up word in the ambient sound may refer to the foregoing embodiments, and details are not repeated here.
  • the electronic device may enter a pre-wake-up state.
  • the electronic device queries whether the smart glasses exist in the local device list.
  • the electronic device can acquire a local device list.
  • For example, electronic devices such as mobile phones, speakers, and TVs are all connected to the router and to the same home Wi-Fi. The phones, speakers, TVs, and router all exist in one local device list. If the smart glasses establish a Bluetooth communication connection with the mobile phone, the smart glasses can be added to the local device list. In this way, an electronic device (such as a mobile phone, a speaker, or a TV) can find the smart glasses when querying the local device list.
  • If the user does not have smart glasses, or the smart glasses have not established a communication connection with any electronic device in the local device list (for example, the smart glasses are turned off), there are no smart glasses in the local device list.
  • the electronic device may determine whether the smart glasses are in a wearing state.
  • the images collected when the smart glasses are in the wearing state may be equivalent to images within the user's field of view. If the smart glasses are not being worn, the images collected by the smart glasses cannot be considered as images within the user's field of vision. That is to say, the smart glasses in the wearing state can more accurately determine the electronic device that the user wishes to wake up.
  • the electronic device can further determine whether the smart glasses are in a wearing state.
  • the smart glasses establish a Bluetooth communication connection with the mobile phone.
  • Electronic devices such as speakers and TVs can obtain the wearing status of the smart glasses through the mobile phone.
  • the mobile phone can send a message to the smart glasses to ask whether the smart glasses are in the wearing state.
  • the smart glasses may send the wearing detection result to the mobile phone.
  • electronic devices such as mobile phones, speakers, and televisions can obtain the wearing status of the smart glasses.
  • step S103 is optional. Exemplarily, after the electronic device determines that smart glasses exist in the local device list, it may directly execute the following step S104.
  • the electronic device may determine whether a wake-up instruction is received within a preset time period.
  • the electronic device may wait for a wake-up instruction.
  • the smart glasses may send a wake-up instruction to the electronic device that the user wishes to wake up.
  • the electronic device may wait for a preset period of time after determining that the smart glasses are in the wearing state. If the wake-up instruction is received within the preset time period, the electronic device may execute the following step S105. If no wake-up instruction is received within the preset time period, the electronic device may execute the following step S106.
  • the embodiment of the present application does not limit the length of the foregoing preset time period.
  • the electronic device may enter the wake-up state.
  • If the electronic device receives the wake-up instruction within the preset time period, the electronic device is the target wake-up device (i.e., the electronic device that the user wishes to wake up). In response to the wake-up instruction, the electronic device may enter the wake-up state.
  • the electronic device may enter a wake-up state.
  • Multiple electronic devices that monitor the wake-up word can communicate with each other and negotiate to determine that one of the electronic devices enters the wake-up state, while the other electronic devices may not enter the wake-up state.
  • the multiple electronic devices may determine the intensity of the sound signal corresponding to the wake-up word that they have monitored. Understandably, the greater the intensity of the sound signal received by an electronic device, the closer the distance between the electronic device and the user. These multiple electronic devices can all send information including the intensity of the sound signal they receive to each other. Further, the multiple electronic devices may negotiate to determine the electronic device with the highest strength of the received sound signal. The electronic device that receives the sound signal with the highest intensity may enter the wake-up state, and other electronic devices may not enter the wake-up state.
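The intensity-based negotiation described above can be sketched as a simple election over the exchanged intensity reports. The function name `elect_by_intensity`, the intensity values, and the tie-breaking rule (lexicographically smallest device name) are assumptions made for illustration; the text does not say how ties are resolved.

```python
from typing import Dict


def elect_by_intensity(intensities: Dict[str, float]) -> str:
    """Return the device that received the wake-up word with the highest intensity.

    Every device can run this on the same exchanged reports and reach the
    same answer. Ties are broken by device name (an assumption).
    """
    if not intensities:
        raise ValueError("no intensity reports to compare")
    # max() keeps the first maximal item, so iterating names in sorted
    # order makes the smallest name win a tie.
    return max(sorted(intensities), key=intensities.__getitem__)


# Hypothetical intensity reports exchanged between the devices.
reports = {"mobile phone 102": 41.0, "speaker 106": 63.5, "TV 107": 58.2}
winner = elect_by_intensity(reports)
```

The elected device may enter the wake-up state, and the remaining devices may stay out of the wake-up state.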
  • the method of responding to the monitored wake-up word is not limited.
  • the specific implementation method can refer to the implementation method provided by the embodiment of the present application, when there are multiple electronic devices with the voice interaction function enabled in an environment, the multiple electronic devices respond to the monitored wake-up words. This embodiment of the present application does not describe it in detail.
  • the electronic device may enter a wake-up word monitoring state.
  • If the electronic device determines that the smart glasses are in the wearing state but does not receive a wake-up instruction within the preset time period, the electronic device is not the target wake-up device.
  • the electronic device can enter a wake-up word monitoring state from a pre-wake-up state.
  • When the electronic device with the voice interaction function enabled detects the wake-up word, it can first determine whether the user will wake up the device with the help of smart glasses. If it is determined that the user will use the smart glasses to wake up the device, the electronic device may wait for a wake-up instruction. If it is determined that the user will not use the smart glasses to wake up the device, the electronic device may respond to the monitored wake-up word. In this way, when the user wears the smart glasses, the electronic device that the user wishes to wake up can be woken up by means of the smart glasses. When the user is not wearing the smart glasses, the electronic device can be directly woken up through the wake-up word.
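The decision flow of steps S101-S106 can be condensed into a single function. This is only a sketch: the names `Outcome` and `decide_after_wake_word` are hypothetical, and `instruction_received` stands for whether a wake-up instruction arrived within the preset time period after the wake-up word was detected.

```python
from enum import Enum


class Outcome(Enum):
    ENTER_WAKE_UP = "enter the wake-up state"
    BACK_TO_MONITORING = "return to the wake-up word monitoring state"
    RESPOND_TO_WAKE_WORD = "respond to the wake-up word directly"


def decide_after_wake_word(glasses_in_list: bool,
                           glasses_worn: bool,
                           instruction_received: bool) -> Outcome:
    """Decide what a device in the pre-wake-up state does next."""
    # No usable smart glasses: fall back to the ordinary wake-up word response.
    if not (glasses_in_list and glasses_worn):
        return Outcome.RESPOND_TO_WAKE_WORD
    # Glasses are worn: only the device that receives the instruction wakes up.
    if instruction_received:
        return Outcome.ENTER_WAKE_UP
    return Outcome.BACK_TO_MONITORING
```

Because step S103 is optional, a variant of this sketch could skip the `glasses_worn` check and wait for the wake-up instruction whenever the smart glasses are in the local device list.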
  • an electronic device with a voice interaction function may have a smart wake-up switch.
  • the smart home APP for controlling the above-mentioned electronic device with voice interaction function has a smart wake-up control.
  • the smart wake-up control can be used to turn off or turn on the above-mentioned smart wake-up switch.
  • the electronic device may execute the method shown in FIG. 10 after listening to the wake-up word. In this way, whether the user wears the smart glasses or not, the electronic device can be conveniently woken up. Among them, with the help of smart glasses, users can more accurately wake up the electronic devices they want to wake up.
  • the smart glasses may include a user behavior recognition module 1101 , an image collection module 1102 , an image recognition module 1103 , a device priority determination module 1104 and a device wakeup module 1105 .
  • The multiple modules can be coupled to each other via a bus, wherein:
  • the user behavior recognition module 1101 can be used to detect whether the user needs to wake up other electronic devices.
  • the user behavior recognition module 1101 may include, but is not limited to: a pressure sensor, a voice recognition sensor, and an inclination sensor.
  • the user behavior recognition module 1101 may use a voice recognition sensor to recognize whether the collected ambient sound contains a preset wake-up word. When listening to the preset wake-up word in the ambient sound, the user behavior recognition module 1101 can determine that the user needs to wake up other electronic devices. Then, the smart glasses can collect images through the image collection module 1102 .
  • the aforementioned preset wake-up words may be stored in the smart glasses. If the wake-up word used to wake up the electronic device whose voice interaction function is enabled in the same local device list as the smart glasses is updated (for example, the user resets the wake-up word), the wake-up word stored in the smart glasses can also be updated synchronously.
  • the smart glasses establish a communication connection with the mobile phone.
  • a smart home APP that controls electronic devices such as speakers and TVs is installed in the mobile phone.
  • the wake-up word for waking up the speaker may be modified in response to a user operation acting on the smart home APP for resetting the wake-up word for the speaker.
  • a modified wake-up word for waking up the speaker can be stored in the mobile phone.
  • the smart glasses can obtain the above-mentioned modified wake-up word for waking up the speaker from the mobile phone.
  • the embodiment of the present application does not limit the method for the smart glasses to acquire the wake-up word for waking up the electronic device.
  • the user behavior recognition module 1101 may use a pressure sensor to detect whether there is a user's touch operation on a preset position of the smart glasses. For example, when detecting a user operation in which a position on the temple is touched twice, the user behavior recognition module 1101 may determine that the user needs to wake up other electronic devices. Then, the smart glasses can collect images through the image collection module 1102 .
  • the embodiment of the present application does not limit the specific implementation method for the user behavior identification module 1101 to detect whether the user needs to wake up other electronic devices.
  • the user behavior recognition module 1101 may also detect whether the user needs to wake up the electronic device by judging whether the user blinks in a preset manner.
  • the smart glasses may also include a wearing detection module (not shown in FIG. 11 ). Before the detection by the user behavior recognition module 1101, the wearing detection module can detect whether the smart glasses are in the wearing state. If it is detected that the smart glasses are in the wearing state, the user behavior recognition module 1101 may perform the detection. If it is detected that the smart glasses are not being worn, the smart glasses may be in a dormant state. The fact that the smart glasses are in a dormant state may indicate that all components in the smart glasses except the wearing detection module are in a dormant state. This saves power consumption of the smart glasses.
  • For the method by which the wearing detection module detects whether the smart glasses are in the wearing state, reference may be made to the foregoing embodiments. Details are not repeated here.
  • the image capture module 1102 can be used to capture images.
  • the image acquisition module 1102 may include but is not limited to a camera.
  • the images collected by the smart glasses through the image acquisition module 1102 may be equivalent to the images within the user's field of vision (such as the images shown in FIGS. 6 and 8 in the foregoing embodiments).
  • the embodiment of the present application does not limit the installation position of the camera in the image acquisition module 1102 on the smart glasses.
  • the image recognition module 1103 can be used to perform image recognition processing on the image to determine the electronic equipment included in the image.
  • the image recognition module 1103 may include a device recognition model.
  • the device recognition model may be a neural network model.
  • the device recognition model can be obtained through off-line training.
  • the input of the device recognition model may include an image.
  • One or more electronic devices may be included in the image.
  • the output of the device recognition model may include but not limited to the following features: type of electronic device, recognition accuracy, and viewing angle deviation.
  • Smart glasses can store trained device recognition models before leaving the factory.
  • the above device identification model can be updated.
  • the smart glasses can obtain an updated device recognition model from the server used to train the above device recognition model.
  • the image recognition module 1103 can use the device recognition model to determine the type of electronic device in the image, recognition accuracy, viewing angle deviation and other characteristics.
  • the type of the electronic device may include the category of the electronic device and the specific model of the electronic device.
  • For example, the image recognition module 1103 determines that the type of the speaker 106 shown in FIG. 6 is speaker Sound X, and that the type of the TV 107 is smart screen S Pro 65.
  • the sound box is a category of the sound box 106, and Sound X is a specific model of the sound box 106.
  • the recognition accuracy may represent the accuracy of recognizing the type of an electronic device in the image.
  • the viewing angle deviation can be used to represent the distance between the electronic device and the center of the user's field of view within the user's field of view. The smaller the viewing angle deviation of the electronic device is, the closer the position of the electronic device is to the center of the user's field of view. The larger the viewing angle deviation of the electronic device is, the closer the position of the electronic device is to the edge of the user's field of view.
  • the viewing angle deviation of the electronic device can be determined through the position of the electronic device in the image collected by the above-mentioned image collection module 1102 .
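The text states only that the viewing angle deviation is determined from the position of the electronic device in the collected image. One plausible way to compute it, shown below as an assumption rather than the embodiment's method, is the distance between the device's bounding-box center and the image center, normalized by the half-diagonal of the image so that 0 means centered and 1 means at a corner. The function name and the bounding-box format are hypothetical.

```python
import math
from typing import Tuple


def viewing_angle_deviation(box: Tuple[float, float, float, float],
                            image_size: Tuple[float, float]) -> float:
    """Normalized distance from the box center to the image center.

    box is (x_min, y_min, x_max, y_max) in pixels; image_size is (width, height).
    """
    x_min, y_min, x_max, y_max = box
    width, height = image_size
    center_x = (x_min + x_max) / 2.0
    center_y = (y_min + y_max) / 2.0
    dx = center_x - width / 2.0
    dy = center_y - height / 2.0
    half_diagonal = math.hypot(width / 2.0, height / 2.0)
    return math.hypot(dx, dy) / half_diagonal


# A device centered in a 640x480 image, and one at the top-left corner.
centered = viewing_angle_deviation((300, 220, 340, 260), (640, 480))
corner = viewing_angle_deviation((0, 0, 0, 0), (640, 480))
```

With this definition, a smaller value means the device is closer to the center of the user's field of view, which matches the interpretation given above.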
  • the embodiment of the present application does not limit the specific training method of the above-mentioned device recognition model.
  • the output of the above-mentioned device recognition model may also include but not limited to the following features: electronic device category, recognition accuracy, and viewing angle deviation. That is, when the image recognition module 1103 recognizes the electronic device included in the image, it may only recognize the category of the electronic device (such as the electronic device whose category is a sound box), without being accurate to the model of the electronic device. Then in the subsequent process, the device priority determining module 1104 may also determine the priority of the electronic device according to the category of the electronic device, recognition accuracy and viewing angle deviation.
  • the image recognition module 1103 may transfer these features to the device priority determination module 1104 .
  • the device priority determination module 1104 may be used to prioritize the electronic devices included in the image determined by the image recognition module 1103 .
  • the device priority determination module 1104 may use a ranking algorithm to prioritize electronic devices included in the image.
  • Y may represent the priority of the electronic device. The larger the value of Y, the higher the priority of the electronic device.
  • type may represent a type priority value determined according to the type of the electronic device.
  • Another feature in the sorting algorithm can represent the recognition accuracy of the electronic device.
  • A further feature may represent the viewing angle deviation of the electronic device.
  • Three weights may respectively correspond to the type priority value, the recognition accuracy, and the viewing angle deviation of the electronic device.
  • The three weights are all positive numbers less than 1. The sum of the three weights may be 1.
  • The values of the three weights can be set according to empirical values.
  • The values of the three weights can be updated according to an optimization algorithm, so that the electronic device with the highest priority is the electronic device that the user wishes to wake up.
  • The embodiment of the present application does not specifically limit the values of the three weights.
  • Multiple electronic devices with voice interaction functions may be given wake-up priorities according to their categories.
  • For example, the wake-up priority order by category may be speaker > TV > tablet > mobile phone. The value of the feature type in the above ranking algorithm may then be ordered as follows: type of speaker > type of TV > type of tablet computer > type of mobile phone.
  • the embodiment of the present application does not limit the above-mentioned wake-up priority sorting determined according to the type of the electronic device.
  • The higher the recognition accuracy of an electronic device, the greater the probability that it matches an electronic device in the local device list, and the greater the probability that it is successfully woken up.
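The weighted ranking described above can be sketched as follows. This is a minimal illustration, not the applicant's implementation: the numeric type priority values, the default weights, the `max_angle` normalization, and the way the viewing angle deviation is turned into a score (smaller deviation gives a larger contribution) are all assumptions. The source only states that three weighted features are combined and that a larger Y means a higher priority.

```python
from dataclasses import dataclass

# Hypothetical type priority values. The text only fixes the ordering
# speaker > TV > tablet > mobile phone, not the numbers.
TYPE_PRIORITY = {"speaker": 1.0, "tv": 0.75, "tablet": 0.5, "phone": 0.25}


@dataclass
class Detection:
    name: str
    category: str
    accuracy: float         # recognition accuracy in [0, 1]
    angle_deviation: float  # viewing angle deviation in degrees


def priority(d, weights=(0.3, 0.3, 0.4), max_angle=90.0):
    """Weighted sum of the three features; weights are positive and sum to 1."""
    w_type, w_acc, w_angle = weights
    # Assumption: a device closer to the centre of the field of view should
    # rank higher, so the deviation is mapped to a closeness score in [0, 1].
    closeness = 1.0 - min(abs(d.angle_deviation), max_angle) / max_angle
    return (w_type * TYPE_PRIORITY[d.category]
            + w_acc * d.accuracy
            + w_angle * closeness)


def rank(detections, weights=(0.3, 0.3, 0.4)):
    """Return detections sorted from highest to lowest priority."""
    return sorted(detections, key=lambda d: priority(d, weights), reverse=True)


devices = [
    Detection("TV 107", "tv", accuracy=0.95, angle_deviation=30.0),
    Detection("speaker 106", "speaker", accuracy=0.90, angle_deviation=10.0),
]
ranked_names = [d.name for d in rank(devices)]
```

With these illustrative numbers the speaker ranks above the TV; different weights or feature values can change the order.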
  • the above-mentioned image recognition module 1103 may also extract more features of the electronic device contained in the image from the image.
  • The device priority determination module 1104 may determine the priority of each electronic device according to more or fewer features.
  • the device priority determining module 1104 may also determine the priority of the electronic device according to one or more of the three characteristics of the electronic device category, recognition accuracy, and viewing angle deviation.
  • the wake-up words used by the user to wake up different electronic devices may be different.
  • The device priority determination module 1104 can first screen the electronic devices contained in the image. If the wake-up word used to wake up an electronic device does not match the wake-up word detected by the smart glasses, the device priority determination module 1104 may exclude this electronic device. The device priority determination module 1104 may then prioritize the electronic devices in the image that have not been excluded. Optionally, instead of excluding an electronic device whose wake-up word does not match the wake-up word detected by the smart glasses, the device priority determination module 1104 may assign this electronic device the lowest priority. In this way, it is possible to avoid the situation in which the smart glasses cannot wake up the electronic device that the user wishes to wake up because of an error in device recognition based on the collected image.
  • the aforementioned screening of electronic devices included in the image may also be implemented by the aforementioned image recognition module 1103 .
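The wake-up word screening described above could look roughly like this. The function name, the `wake_words` mapping, and the example phrases are hypothetical; the two modes correspond to the two options in the text (excluding a non-matching device, or keeping it at the lowest priority).

```python
def screen_by_wake_word(candidates, detected_wake_word, wake_words, demote=False):
    """Screen candidates (device names in descending priority order).

    wake_words maps a device name to the set of wake-up words that wake it.
    With demote=False, non-matching devices are excluded.
    With demote=True, they are kept but moved to the end (lowest priority).
    """
    matching = [c for c in candidates
                if detected_wake_word in wake_words.get(c, ())]
    if not demote:
        return matching
    non_matching = [c for c in candidates if c not in matching]
    return matching + non_matching


# Illustrative data only; the wake-up words are made up for this sketch.
wake_words = {
    "tv": {"hello tv"},
    "speaker": {"hello speaker"},
    "tablet": {"hello speaker"},
}
candidates = ["tv", "speaker", "tablet"]
excluded = screen_by_wake_word(candidates, "hello speaker", wake_words)
demoted = screen_by_wake_word(candidates, "hello speaker", wake_words, demote=True)
```

Demoting rather than excluding keeps a fallback candidate available if recognition or wake-up word matching was wrong.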
  • the image recognition module 1103 performs image recognition on the image shown in FIG. 6 , and transmits the types, recognition accuracy, and viewing angle deviation of the sound box 106 and the TV 107 in the image to the device priority determination module 1104 .
  • the device priority determination module 1104 performs priority sorting on the two electronic devices, and the priority list shown in the following table 1 can be obtained:
  • the priority of the speaker 106 is higher than that of the TV 107 .
  • the electronic device may be represented by its type (such as a sound box Sound X).
  • the embodiment of the present application does not limit the content of the electronic device in the priority list.
  • the device priority determination module 1104 may transmit the above priority list to the device wakeup module 1105 .
  • the device wake-up module 1105 can be used to determine the target wake-up device (that is, the electronic device that the user wants to wake up) according to the local device list and the priority of the electronic devices included in the image, and send a wake-up instruction to the target wake-up device.
  • The device wake-up module 1105 may match the electronic devices in the priority list against the electronic devices in the local device list in descending order of priority. According to the priority list and the local device list, the device wake-up module 1105 may determine the electronic device that has the highest priority in the priority list and also exists in the local device list as the target wake-up device. The device wake-up module 1105 may send a wake-up instruction to the target wake-up device. The wake-up instruction may be sent directly by the smart glasses to the target wake-up device. Alternatively, the wake-up instruction may be forwarded to the target wake-up device via an electronic device connected to the smart glasses, such as a mobile phone or a router. This embodiment of the present application does not limit this.
  • the device wake-up module 1105 of the smart glasses can obtain the local device list shown in Table 2 below:
  • the device wake-up module 1105 can determine the speaker 106 as the target wake-up device, and send a wake-up instruction to the speaker 106 .
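The matching of the priority list against the local device list can be sketched as a single pass in descending priority order. The function name and the device names are illustrative, since the contents of Tables 1 and 2 are not reproduced here.

```python
def select_target(priority_list, local_devices):
    """Return the highest-priority device that also exists in the local
    device list, or None if no recognized device is in that list.

    priority_list is ordered from highest to lowest priority.
    """
    local = set(local_devices)
    for device in priority_list:
        if device in local:
            return device
    return None


# Illustrative lists only.
priority_list = ["speaker Sound X", "TV"]
local_devices = ["mobile phone", "TV", "speaker Sound X"]
target = select_target(priority_list, local_devices)
```

Returning `None` leaves it to the caller to decide what happens when nothing in the image belongs to the local device list; the source does not specify that case.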
  • the local device list includes multiple electronic devices of the same type.
  • the local device list includes the speaker 106 .
  • the speaker 106 is specifically a speaker Sound X.
  • The local device list also includes another speaker Sound X. That is, the local device list includes two Sound X speakers. If the target wake-up device determined by the device wake-up module 1105 is the speaker Sound X, the device wake-up module 1105 may send an indication message to both Sound X speakers in the local device list. The indication message may be used to instruct the two Sound X speakers to negotiate which one of them enters the wake-up state.
  • When receiving the indication message, the two Sound X speakers can determine which one enters the wake-up state according to the intensity of the sound signal containing the wake-up word that each of them received. It can be understood that the higher the intensity of the received sound signal containing the wake-up word, the closer the electronic device is to the user, and the more likely it is the electronic device that the user wishes to wake up. The Sound X speaker that received the sound signal containing the wake-up word with the highest intensity can then enter the wake-up state.
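The negotiation between same-type devices can be sketched as picking the device that received the wake-up word at the highest intensity. How the devices actually exchange their measurements is not described in the text; here the measurements are assumed to be already collected into one mapping, and the device identifiers and dB values are made up.

```python
def negotiate_wake(intensities):
    """Decide which device enters the wake-up state.

    intensities maps a device identifier to the intensity of the sound signal
    containing the wake-up word that the device received.
    Returns a mapping of device identifier to True (wake up) or False.
    On a tie, the first device in iteration order wins (an arbitrary choice).
    """
    winner = max(intensities, key=intensities.get)
    return {device: device == winner for device in intensities}


# Illustrative intensities in dB relative to full scale (higher is louder).
decision = negotiate_wake({
    "sound_x_living_room": -20.0,
    "sound_x_bedroom": -35.0,
})
```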
  • The smart glasses can determine which electronic device the user wants to wake up by collecting images within the user's field of vision. In this way, the user can use the smart glasses to wake up the intended electronic device, which reduces false wake-ups.
  • the smart glasses may only include the above-mentioned user behavior recognition module 1101 , image collection module 1102 , image recognition module 1103 and device priority determination module 1104 .
  • The above-mentioned device wake-up module 1105 may be included in an electronic device connected to the smart glasses, such as a mobile phone or a router.
  • the device wake-up module 1105 is included in the mobile phone as an example for illustration.
  • The smart glasses can establish a Bluetooth communication connection with a mobile phone. When the smart glasses obtain the priority list through the device priority determination module 1104, they can send the priority list to the mobile phone.
  • The device wake-up module 1105 in the mobile phone can determine the target wake-up device, and send a wake-up instruction to the target wake-up device.
  • For the method by which the device wake-up module 1105 determines the target wake-up device, reference may be made to the foregoing embodiments.
  • In this way, the smart glasses need not determine the target wake-up device or send a wake-up instruction to it. This reduces the requirements on the computing power and storage capacity of the smart glasses, and saves their power.
  • the smart glasses may only include the above-mentioned user behavior recognition module 1101 , image collection module 1102 and image recognition module 1103 .
  • The above device priority determination module 1104 and device wake-up module 1105 may be included in an electronic device connected to the smart glasses, such as a mobile phone or a router.
  • the device priority determination module 1104 and the device wake-up module 1105 are included in the mobile phone as an example for illustration.
  • Smart glasses can establish a Bluetooth communication connection with a mobile phone.
  • When the smart glasses determine features such as the type, recognition accuracy, and viewing angle deviation of the electronic devices in the image through the image recognition module 1103, they can send these features to the mobile phone.
  • the mobile phone can determine the priority of electronic devices in the image through the device priority determination module 1104 to obtain a priority list.
  • the device wake-up module 1105 in the mobile phone can determine the target wake-up device, and send a wake-up instruction to the target wake-up device.
  • In this way, the smart glasses need not determine the priority of the electronic devices or the target wake-up device, or send a wake-up instruction to the target wake-up device. This reduces the requirements on the computing power and storage capacity of the smart glasses, and saves their power.
  • the smart glasses may only include the above-mentioned user behavior recognition module 1101 and image collection module 1102.
  • The above image recognition module 1103, device priority determination module 1104, and device wake-up module 1105 may be included in an electronic device connected to the smart glasses, such as a mobile phone or a router.
  • the image recognition module 1103, the device priority determination module 1104 and the device wakeup module 1105 are included in the mobile phone as an example for illustration.
  • Smart glasses can establish a Bluetooth communication connection with a mobile phone. When the smart glasses acquire an image through the image acquisition module 1102, the image can be sent to the mobile phone.
  • When receiving the image, the mobile phone can determine the target wake-up device through the image recognition module 1103, the device priority determination module 1104, and the device wake-up module 1105, and send a wake-up instruction to the target wake-up device.
  • For the specific method by which the mobile phone determines the target wake-up device, reference may be made to the foregoing embodiments. Details are not repeated here.
  • In this way, the smart glasses need not store the device recognition model, perform image recognition on the image, determine the priority of the electronic devices or the target wake-up device, or send a wake-up instruction to the target wake-up device. This reduces the requirements on the computing power and storage capacity of the smart glasses, and saves their power.
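The alternative splits of modules between the smart glasses and a connected device (for example, a mobile phone) can be summarized in a small sketch. The module names follow modules 1101 to 1105 in the text; the function names and the idea of describing a split by a count are assumptions made only for illustration.

```python
# Modules 1101-1105 in pipeline order.
MODULES = [
    "user_behavior_recognition",      # 1101
    "image_collection",               # 1102
    "image_recognition",              # 1103
    "device_priority_determination",  # 1104
    "device_wakeup",                  # 1105
]

# What the glasses hand over to the connected device for each split.
HANDOVER = {
    5: None,                # everything runs on the glasses
    4: "priority list",     # phone runs 1105
    3: "device features",   # phone runs 1104 and 1105
    2: "image",             # phone runs 1103, 1104 and 1105
}


def split_modules(on_glasses):
    """Return which modules run where when the first `on_glasses` modules
    stay on the smart glasses and the rest run on the connected device."""
    if on_glasses not in HANDOVER:
        raise ValueError("the text describes splits keeping 2 to 5 modules on the glasses")
    return {
        "smart_glasses": MODULES[:on_glasses],
        "connected_device": MODULES[on_glasses:],
        "handover": HANDOVER[on_glasses],
    }


image_only = split_modules(2)
```

The fewer modules the glasses keep, the lower their compute, storage, and power requirements, at the cost of sending more data (ultimately the raw image) over the connection.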
  • the above operation of judging whether the user needs to wake up other electronic devices may also be completed by other electronic devices (such as mobile phones) connected to the smart glasses.
  • The mobile phone can send an instruction to the smart glasses to collect an image. After the smart glasses capture the image, they can send it to the mobile phone. Based on this image, the mobile phone can determine the target wake-up device.
  • the target wake-up device may also be determined according to the image.
  • FIG. 12 exemplarily shows a flow chart of a method for waking up a device provided by an embodiment of the present application.
  • The method may include steps S201 to S207, where:
  • S201. The smart glasses detect that the user needs to wake up other electronic devices.
  • When the smart glasses are being worn, they can detect whether the user needs to wake up other electronic devices (such as mobile phones, speakers, and TVs).
  • the smart glasses may include the user behavior recognition module 1101 of the foregoing embodiments.
  • the smart glasses can detect whether the user needs to wake up other electronic devices through the user behavior recognition module 1101 .
  • For a specific implementation method, reference may be made to the foregoing embodiments. Details are not repeated here.
  • S202. The smart glasses collect an image, and determine the type, recognition accuracy, and viewing angle deviation of the electronic devices included in the image.
  • the smart glasses can capture images.
  • the smart glasses may include the image acquisition module 1102 and the image recognition module 1103 of the foregoing embodiments.
  • the smart glasses can collect images through the image collection module 1102 .
  • the image is the image within the user's field of vision.
  • the smart glasses can use the image recognition module 1103 to determine the type, recognition accuracy and viewing angle deviation of the electronic device included in the image.
  • S203. The smart glasses send the type, recognition accuracy, and viewing angle deviation of the electronic devices to the mobile phone.
  • S204. The mobile phone prioritizes the electronic devices included in the image.
  • the priority of the electronic device may be determined according to one or more of the following: type of electronic device, recognition accuracy, and viewing angle deviation.
  • the mobile phone may include the device priority determining module 1104 of the foregoing embodiments.
  • the mobile phone may prioritize the electronic devices contained in the image through the device priority determining module 1104, and obtain a priority ranking result.
  • the result of this prioritization can be the priority list of the foregoing embodiment.
  • S205. The mobile phone obtains the local device list, and determines the electronic device that has the highest priority in the prioritization result and also exists in the local device list as the target wake-up device.
  • the mobile phone may include the device wake-up module 1105 of the foregoing embodiments.
  • the mobile phone can determine the target wake-up device through the device wake-up module 1105, and execute the following step S206.
  • S206. The mobile phone sends a wake-up instruction to the target wake-up device.
  • If the mobile phone is itself the target wake-up device, it can enter the wake-up state upon determining this.
  • If the target wake-up device is an electronic device other than the mobile phone, the mobile phone may send the wake-up instruction directly to the target wake-up device.
  • both the mobile phone and the target wake-up device are connected to the router.
  • the mobile phone can send a wake-up command to the router.
  • the router can send a wake-up instruction to the target wake-up device.
  • S207. After receiving the wake-up instruction, the target wake-up device enters the wake-up state, recognizes the voice command, and executes the operation corresponding to the voice command.
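The steps of FIG. 12 can be strung together in a compact sketch. Everything here is a stand-in: image capture and recognition are replaced by a pre-built list of recognized devices, the Bluetooth hand-off is a plain function call, and the scoring function is supplied by the caller. The sketch only illustrates the order of the steps, not any actual implementation.

```python
def run_wake_flow(user_needs_wake, recognized, local_devices, score):
    """Walk through the wake-up flow and return the name of the device that
    would enter the wake-up state, or None.

    recognized: list of dicts with a "name" key plus feature values
                (stand-in for the output of image recognition).
    local_devices: collection of device names in the local device list.
    score: function mapping one recognized device to its priority value.
    """
    # Detection step on the glasses: does the user need to wake a device?
    if not user_needs_wake:
        return None
    # Image capture and recognition on the glasses (stubbed).
    features = [dict(d) for d in recognized]
    # Hand-off of the features from the glasses to the phone (plain call here).
    # Prioritization on the phone.
    ranked = sorted(features, key=score, reverse=True)
    # Target selection on the phone: best-ranked device in the local list.
    target = next((d["name"] for d in ranked if d["name"] in local_devices), None)
    # Sending the wake-up instruction and the target entering the wake-up
    # state are modelled simply as returning the target's name.
    return target


recognized = [
    {"name": "tv", "accuracy": 0.80},
    {"name": "speaker", "accuracy": 0.90},
    {"name": "lamp", "accuracy": 0.99},  # recognized, but not in the local list
]
woken = run_wake_flow(True, recognized, {"tv", "speaker"},
                      score=lambda d: d["accuracy"])
```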
  • The smart glasses can also send the prioritization result to another electronic device connected to them (such as a router), and that electronic device determines the target wake-up device and sends a wake-up instruction to it.
  • After the smart glasses determine the type, recognition accuracy, and viewing angle deviation of the electronic devices contained in the image, they can also prioritize those electronic devices, determine the target wake-up device, and send a wake-up instruction to the target wake-up device. That is to say, the above steps S204, S205, and S206 can all be completed by the smart glasses.
  • the smart glasses can send the collected images to the mobile phone.
  • The mobile phone can then identify the electronic devices contained in the image. That is, the identification of the electronic devices contained in the image in the above step S202 may be performed by the mobile phone. This reduces the requirements on the computing power and storage capacity of the smart glasses, and saves their power.
  • the user can use the smart glasses to wake up the electronic device he wants to wake up.
  • The method can effectively reduce false wake-ups and give the user a better experience when using the voice interaction function of electronic devices.
  • the user can also use other types of image acquisition devices to assist in realizing device wake-up.
  • the image acquisition device may be a surveillance camera or the like.
  • the above image acquisition device can detect the first user input, and when the first user input is detected, acquire the first image. By detecting the above-mentioned first user input, the image acquisition device can determine whether the user needs to wake up the device. When it is determined that the user needs to wake up the device, the image acquisition device may perform image acquisition to obtain the above-mentioned first image. It can be understood that, in the case that the user needs to wake up the device, it is more likely that the above-mentioned first image contains the electronic device that the user wants to wake up.
  • the above-mentioned image acquisition apparatus may select a target electronic device included in the first image from a plurality of electronic devices.
  • the image acquisition device may first identify the electronic equipment included in the first image, and obtain information about the electronic equipment included in the first image.
  • the above information of the electronic device may include but not limited to type, recognition accuracy and viewing angle deviation.
  • the image acquisition device may sort the electronic devices included in the first image according to the above electronic device information to obtain the priority of the electronic devices included in the first image. Further, the image acquisition device may determine whether the electronic device included in the first image exists in the device wake-up system.
  • the electronic devices in the above-mentioned device wake-up system may exist in the local device list.
  • the local device list may be stored in one or more electronic devices in the device wake-up system.
  • the local device list may also be stored in a cloud server. All electronic devices in the local device list can obtain the local device list and update the local device list.
  • An electronic device can be added to or deleted from the local device list by electronic devices already present in the local device list.
  • an electronic device establishes a communication connection with another electronic device in the local device list, and completes the trusted identity authentication indicated by the other electronic device.
  • The other electronic device may update the local device list, adding the one electronic device to the local device list. That is, this electronic device can be added to the above-mentioned device wake-up system.
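Maintaining the local device list can be sketched as a small class in which only existing members may add or remove devices, and a new device is added only after it has connected to a member and passed that member's trusted identity authentication. The class and method names are hypothetical, and the authentication itself is reduced to a boolean flag.

```python
class LocalDeviceList:
    """Toy model of the local device list of a device wake-up system."""

    def __init__(self, members):
        self._members = set(members)

    def __contains__(self, device):
        return device in self._members

    def add(self, new_device, via, connected, authenticated):
        """Add new_device on behalf of the existing member `via`.

        Returns True if the device was added, False if the connection or
        the trusted identity authentication did not succeed.
        """
        if via not in self._members:
            raise PermissionError("only devices already in the list may update it")
        if not (connected and authenticated):
            return False
        self._members.add(new_device)
        return True

    def remove(self, device, via):
        """Remove a device on behalf of the existing member `via`."""
        if via not in self._members:
            raise PermissionError("only devices already in the list may update it")
        self._members.discard(device)


devices = LocalDeviceList({"mobile phone", "router"})
added = devices.add("smart glasses", via="mobile phone",
                    connected=True, authenticated=True)
rejected = devices.add("unknown speaker", via="router",
                       connected=True, authenticated=False)
```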
  • the image acquisition device may acquire the local device list, and determine whether the electronic device included in the first image exists in the local device list.
  • Among the electronic devices in the first image that exist in the local device list, the image acquisition device may determine the one with the highest priority as the target electronic device, and instruct the target electronic device to enter a wake-up state.

Landscapes

  • Engineering & Computer Science (AREA)
  • Physics & Mathematics (AREA)
  • General Physics & Mathematics (AREA)
  • Multimedia (AREA)
  • Acoustics & Sound (AREA)
  • Human Computer Interaction (AREA)
  • Audiology, Speech & Language Pathology (AREA)
  • Health & Medical Sciences (AREA)
  • Signal Processing (AREA)
  • General Engineering & Computer Science (AREA)
  • Computational Linguistics (AREA)
  • Automation & Control Theory (AREA)
  • Computer Vision & Pattern Recognition (AREA)
  • Theoretical Computer Science (AREA)
  • Selective Calling Equipment (AREA)

Abstract

A device wake-up method, a related apparatus, and a communication system (10). In the method, when it is detected that a user needs to wake up other electronic devices (100), smart glasses (101) may acquire an image, the image being an image within the field of view of the user; the smart glasses (101) may determine, according to the image, a target device to be woken up and send a wake-up instruction to the target device to be woken up; an electronic device (100) receiving the wake-up instruction may enter a wake-up state. The method can effectively reduce false wake-ups and give the user a better experience when using the voice interaction functions of the electronic devices (100).

Description

Device wake-up method, related apparatus, and communication system
This application claims priority to Chinese patent application No. 202110844001.4, entitled "Device wake-up method, related apparatus, and communication system", filed with the China Patent Office on July 26, 2021, the entire contents of which are incorporated herein by reference.
Technical field
The present application relates to the field of terminal technology, and in particular to a device wake-up method, a related apparatus, and a communication system.
Background
As electronic devices become more intelligent, more and more of them offer voice interaction. A user can wake up an electronic device with a wake-up word without touching it, and instruct it through voice commands to complete corresponding tasks. However, in a scenario where multiple electronic devices with voice interaction functions (especially multiple devices of the same brand) are in one room, if the user speaks the wake-up word, these devices may all detect the wake-up word and all be woken up. Such false wake-ups disturb the user and degrade the user experience.
Summary
The present application provides a device wake-up method, a related apparatus, and a communication system, which can effectively reduce false wake-ups and give the user a better experience when using the voice interaction function of electronic devices.
In a first aspect, the present application provides a device wake-up system. The device wake-up system includes an image acquisition device and multiple electronic devices. The image acquisition device may be configured to detect a first user input and, when the first user input is detected, acquire a first image. The image acquisition device may further be configured to select, from the multiple electronic devices, a target electronic device contained in the first image, and send a wake-up instruction to the target electronic device; the wake-up instruction is used to trigger the target electronic device to enter a wake-up state. The target electronic device may be configured to enter the wake-up state in response to the received wake-up instruction.
The above-mentioned multiple electronic devices may be electronic devices that have a voice interaction function and have that function enabled. Having a voice interaction function may mean that an electronic device can recognize a user's voice command and perform the operation corresponding to the voice command.
As can be seen from the above device wake-up system, the image acquisition device can determine the target electronic device from the image it collects, and instruct the target electronic device to enter the wake-up state. The target electronic device is the electronic device that the image acquisition device determines the user wishes to wake up. That is to say, the user can use the image acquisition device to wake up the intended electronic device. In a scenario where the user speaks a wake-up word to wake up a device, the target electronic device may enter the wake-up state to respond to the user's voice command. This reduces false wake-ups and gives the user a better experience when using the voice interaction function of electronic devices.
With reference to the first aspect, in some embodiments, the image acquisition device may be smart glasses.
It can be understood that when a user wishes to wake up an electronic device, the user usually looks at that device and speaks a voice command. If the user is wearing smart glasses, the image collected by the smart glasses is therefore an image within the user's field of vision. Based on the image they collect, the smart glasses can more accurately determine which electronic device the user wishes to wake up.
With reference to the first aspect, in some embodiments, the first user input may be a voice input containing a wake-up word. Alternatively, the first user input may be a user operation acting on a first position of the image acquisition device.
When the wake-up word is detected, the image acquisition device may perform image acquisition to obtain the first image, and determine the target electronic device according to the first image. When the user speaks the wake-up word, the multiple electronic devices, as well as the image acquisition device, may detect it. Upon detecting the wake-up word, the multiple electronic devices may check whether the image acquisition device exists in the device wake-up system. In a possible implementation, the electronic devices in the device wake-up system may exist in a local device list. The local device list may be stored in one or more electronic devices in the device wake-up system. Optionally, the local device list may also be stored in a cloud server. Every electronic device in the local device list can obtain the list; that is, every electronic device in the list can determine which electronic devices the device wake-up system contains.
The multiple electronic devices may determine whether the image acquisition device exists in the device wake-up system by determining whether the local device list contains the image acquisition device. If the local device list contains the image acquisition device, the multiple electronic devices may determine that the image acquisition device exists in the device wake-up system. If so, the multiple electronic devices may further determine whether the image acquisition device is being worn. If the image acquisition device is being worn, the multiple electronic devices may wait for a wake-up instruction instead of entering the wake-up state immediately. When one of the multiple electronic devices receives a wake-up instruction, that electronic device may enter the wake-up state. While waiting for the wake-up instruction, the multiple electronic devices may refrain from responding to any wake-up words or voice commands they detect. The image acquisition device may be smart glasses.
In some embodiments, before receiving the wake-up instruction, the target electronic device detects the wake-up word but no voice command (for example, in a scenario where the user speaks only the wake-up word). In that case, upon entering the wake-up state, the target electronic device may output a voice response to the wake-up word, for example "I'm here". Alternatively, before receiving the wake-up instruction, the target electronic device detects neither the wake-up word nor a voice command (for example, in a scenario where the user does not speak the wake-up word and instead wakes up the device through a user operation acting on the first position). In that case, upon entering the wake-up state, the target electronic device may also output a voice response to the wake-up word. That is to say, whenever the target electronic device enters the wake-up state without having detected a voice command, it can output a voice response to the wake-up word to inform the user that it has entered the wake-up state. In this way, the user knows which electronic device has been woken up, and can then use voice commands to instruct that device to perform corresponding operations. Once in the wake-up state, the target electronic device can recognize a voice command and execute the operation corresponding to the voice command.
In some embodiments, if the target electronic device detects a voice command before receiving the wake-up instruction, it may, after entering the wake-up state, directly output a voice response to that voice command and execute the operation corresponding to it. For example, in a scenario where the user speaks the wake-up word and the voice command in one utterance, or where the user speaks the voice command while or before performing the user operation on the first position, the target electronic device may detect the voice command before receiving the wake-up instruction. The target electronic device may check whether the sound signal collected within a first time period before receiving the wake-up instruction, and the sound signal collected after receiving it, contain a voice command. This reduces cases in which the target electronic device fails to respond to a voice command that the user spoke before the device received the wake-up instruction because the command went undetected.
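One way to make the sound collected in the first time period before the wake-up instruction available is a short look-back buffer, sketched below. This is an assumption about how such a check could be supported, not something the application specifies; the class name, the timestamped string frames, and the window length are illustrative.

```python
from collections import deque


class LookbackBuffer:
    """Keeps only the audio frames from the most recent `window_s` seconds."""

    def __init__(self, window_s):
        self.window_s = window_s
        self._frames = deque()  # (timestamp in seconds, frame) pairs

    def push(self, ts, frame):
        """Store a frame and drop frames older than the look-back window."""
        self._frames.append((ts, frame))
        while self._frames and self._frames[0][0] < ts - self.window_s:
            self._frames.popleft()

    def since(self, wake_ts):
        """Frames collected within the window before wake_ts, plus any later
        frames; call this when the wake-up instruction arrives at wake_ts."""
        return [f for ts, f in self._frames if ts >= wake_ts - self.window_s]


buf = LookbackBuffer(window_s=2.0)
for ts, frame in [(0.0, "a"), (1.0, "b"), (2.5, "c"), (3.0, "d")]:
    buf.push(ts, frame)
# Wake-up instruction arrives at t = 3.0 s: frames from t >= 1.0 s are kept.
to_scan = buf.since(3.0)
```

The frames returned by `since` would then be passed to whatever voice command recognizer the device uses.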
In some embodiments, if the multiple electronic devices determine that the image acquisition device does not exist in the device wake-up system (that is, the local device list does not contain the image acquisition device), or that the image acquisition device exists in the device wake-up system but is not being worn, the multiple electronic devices may negotiate to select one electronic device. The selected electronic device may enter the wake-up state; the other electronic devices may remain out of the wake-up state. In one possible implementation, the multiple electronic devices may negotiate, based on the strength of the sound signal containing the wake-up word that each of them received, to select the electronic device that received the strongest such signal. That electronic device may enter the wake-up state.
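The signal-strength negotiation can be sketched as a simple maximum over reported strengths. The function name, the dictionary of device identifiers to strengths, and the tie-breaking rule are assumptions; the text only states that the device with the strongest signal is selected.

```python
from typing import Dict


def elect_by_signal_strength(strengths: Dict[str, float]) -> str:
    """Pick the device that received the wake-up word with the strongest signal.

    strengths maps a device identifier to the measured strength of the sound
    signal containing the wake-up word (units are not specified in the text).
    Ties are broken by the smallest identifier so that every device reaches
    the same result; this rule is an assumption of the sketch.
    """
    if not strengths:
        raise ValueError("no device reported a signal strength")
    # max() returns the first maximal element, so iterating in sorted order
    # makes the smallest identifier win a tie.
    return max(sorted(strengths), key=lambda device: strengths[device])
```

If every device runs the same function on the same shared measurements, each can decide locally whether it is the one that should enter the wake-up state.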
As the above embodiments show, when multiple electronic devices with the voice interaction function enabled detect the wake-up word, they can determine whether the user will wake up a device through the image acquisition device. If so, the multiple electronic devices may wait for a wake-up instruction and enter the wake-up state only after receiving it. This prevents all of these electronic devices from entering the wake-up state upon detecting the wake-up word, which would be a false wake-up. Moreover, the electronic device that receives the wake-up instruction is the one the user most likely wishes to wake up. This gives the user a better experience with the voice interaction function of the electronic devices.
In some embodiments, the image acquisition device may be smart glasses. The first position may be a position on a temple of the smart glasses.
With reference to the first aspect, in some embodiments, the image acquisition device may select the target electronic device contained in the first image from the multiple electronic devices as follows: determine at least one of the type, the recognition accuracy, and the viewing-angle deviation of each electronic device contained in the first image, where the recognition accuracy indicates how accurate the recognition result for the type of the electronic device is, and the viewing-angle deviation indicates the distance between the position of the electronic device in the first image and the center of the first image;
and determine, as the target electronic device, the electronic device among the multiple electronic devices that is contained in the first image and has the highest priority. The priority is determined from one or more of the type, the recognition accuracy, and the viewing-angle deviation: the position of the electronic device's type in a type-based wake-up ordering is positively correlated with the priority, the recognition accuracy is positively correlated with the priority, and the viewing-angle deviation is negatively correlated with the priority.
It can be understood that, with the recognition accuracy and the viewing-angle deviation held constant, the earlier the type of an electronic device appears in the type-based wake-up ordering, the higher the priority of that device. With the type and the viewing-angle deviation held constant, the higher the recognition accuracy, the higher the priority. With the type and the recognition accuracy held constant, the smaller the viewing-angle deviation, the higher the priority.
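The monotonic relationships above can be illustrated with a simple additive score. This is a sketch under loud assumptions: the application does not give a formula, so the equal weighting, the `TYPE_ORDER` list, the normalization of the viewing-angle deviation by the half-diagonal of the image, and all names below are hypothetical.

```python
import math
from dataclasses import dataclass
from typing import Optional, Sequence, Set, Tuple

# Hypothetical type-based wake-up ordering; earlier entries rank higher.
TYPE_ORDER = ["smart_speaker", "tv", "phone"]


@dataclass
class Detection:
    device_type: str             # recognized type; must appear in TYPE_ORDER
    accuracy: float              # recognition accuracy in [0, 1]
    center: Tuple[float, float]  # pixel position of the device in the image


def view_deviation(det: Detection, image_size: Tuple[int, int]) -> float:
    """Distance from the image center, normalized to [0, 1] by the half-diagonal."""
    width, height = image_size
    dx = det.center[0] - width / 2
    dy = det.center[1] - height / 2
    return math.hypot(dx, dy) / math.hypot(width / 2, height / 2)


def priority(det: Detection, image_size: Tuple[int, int]) -> float:
    """Rises with type rank and accuracy, falls with viewing-angle deviation."""
    type_score = 1.0 - TYPE_ORDER.index(det.device_type) / len(TYPE_ORDER)
    return type_score + det.accuracy - view_deviation(det, image_size)


def pick_target(detections: Sequence[Detection], image_size: Tuple[int, int],
                local_types: Set[str]) -> Optional[Detection]:
    """Choose the highest-priority detection whose type is in the local device list."""
    candidates = [d for d in detections if d.device_type in local_types]
    if not candidates:
        return None
    return max(candidates, key=lambda d: priority(d, image_size))
```

Any scoring function that preserves the three stated correlations would serve equally well; the additive form is chosen here only because it makes each relationship easy to check in isolation.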
As the above embodiments show, the higher the recognition accuracy of an electronic device, the more likely it is to match an electronic device in the local device list, and hence the more likely it is to be woken up successfully. The smaller the viewing-angle deviation of an electronic device, the closer it is to the center of the user's field of view, and hence the more likely it is the device the user wishes to wake up. The electronic device that the user wishes to wake up can therefore be determined accurately from one or more of the type, the recognition accuracy, and the viewing-angle deviation. This effectively reduces false wake-ups and gives the user a better experience with the voice interaction function of the electronic devices.
In a second aspect, the present application provides a device wake-up system. The device wake-up system may include an image acquisition device and a processing device. The image acquisition device may be configured to detect a first user input and, upon detecting it, to capture a first image. The image acquisition device may further be configured to send a first instruction to the processing device; the first instruction may include the first image and may instruct the processing device to select, from multiple electronic devices, the target electronic device contained in the first image. The processing device may be configured to respond to the first instruction by selecting the target electronic device contained in the first image from the multiple electronic devices and sending a wake-up instruction to the target electronic device. The wake-up instruction may be used to trigger the target electronic device to enter the wake-up state.
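The message flow of this system can be sketched with three small classes. All class and method names are hypothetical, the recognizer is a stand-in callable assumed to return device names with the best candidate first, and the transport between devices is reduced to direct method calls.

```python
from dataclasses import dataclass
from typing import Callable, Dict, List, Optional


@dataclass
class FirstInstruction:
    """Message from the image acquisition device; carries the first image."""
    image: bytes


@dataclass
class ElectronicDevice:
    name: str
    awake: bool = False

    def on_wake_instruction(self) -> None:
        # Receiving the wake-up instruction triggers the wake-up state.
        self.awake = True


class ProcessingDevice:
    def __init__(self, devices: Dict[str, ElectronicDevice],
                 recognize: Callable[[bytes], List[str]]) -> None:
        self.devices = devices
        # Hypothetical recognizer: names of devices found in the image, best first.
        self.recognize = recognize

    def handle(self, instruction: FirstInstruction) -> Optional[str]:
        found = [n for n in self.recognize(instruction.image) if n in self.devices]
        if not found:
            return None  # nothing in the image matches a known device
        target = found[0]
        self.devices[target].on_wake_instruction()
        return target


class ImageAcquisitionDevice:
    def __init__(self, processor: ProcessingDevice,
                 capture: Callable[[], bytes]) -> None:
        self.processor = processor
        self.capture = capture  # hypothetical camera callback

    def on_first_user_input(self) -> Optional[str]:
        # Capture the first image and delegate target selection.
        return self.processor.handle(FirstInstruction(self.capture()))
```

In this arrangement the image acquisition device only captures and forwards the image; the selection logic lives entirely in the processing device.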
The multiple electronic devices may be electronic devices that have a voice interaction function with that function enabled. Having a voice interaction function means that the electronic device can recognize a user's voice command and perform the operation corresponding to it.
As the device wake-up system described above shows, the image acquisition device may capture an image when the user needs to wake up a device and send the captured image to the processing device. The processing device may determine the target electronic device from that image and instruct it to enter the wake-up state. The target electronic device is the electronic device that the user wishes to wake up, as determined from the image captured by the image acquisition device. In other words, the user can wake up the desired electronic device with the help of the image acquisition device and the processing device. When the user speaks the wake-up word to wake up a device, the target electronic device may enter the wake-up state to respond to the user's voice command. This reduces false wake-ups and gives the user a better experience with the voice interaction function of the electronic devices.
It can be understood that the image acquisition device need not perform the operation of determining the target electronic device, which reduces its power consumption. The processing device may be an electronic device with strong computing power, such as a mobile phone or a cloud server.
With reference to the second aspect, in some embodiments, the image acquisition device may be smart glasses.
It can be understood that a user who wishes to wake up an electronic device usually looks at that device and speaks a voice command. If the user is wearing smart glasses, the image captured by the smart glasses is therefore an image of the user's field of view. From the images they capture, the smart glasses can determine more accurately which electronic device the user wishes to wake up.
With reference to the second aspect, in some embodiments, the first user input is a voice input containing the wake-up word; alternatively, the first user input is a user operation acting on a first position of the image acquisition device.
When the wake-up word is detected, the image acquisition device may capture an image to obtain the first image, and the target electronic device is determined according to the first image. In addition to the image acquisition device and the processing device, the device wake-up system may also include the multiple electronic devices. When the user speaks the wake-up word, not only the image acquisition device but also the multiple electronic devices can detect it.
Upon detecting the wake-up word, the multiple electronic devices may check whether the image acquisition device exists in the device wake-up system. In one possible implementation, the electronic devices in the device wake-up system are recorded in a local device list. The local device list may be stored in one or more electronic devices of the device wake-up system. Optionally, it may also be stored in a cloud server. Every electronic device in the local device list can obtain the list; that is, every electronic device in the list can determine which electronic devices the device wake-up system contains.
The multiple electronic devices may determine whether the image acquisition device exists in the device wake-up system by checking whether the local device list contains it. If the local device list contains the image acquisition device, the multiple electronic devices may determine that it exists in the device wake-up system. In that case, they may further determine whether the image acquisition device is being worn. If it is being worn, the multiple electronic devices may wait for a wake-up instruction instead of entering the wake-up state immediately. When one of the multiple electronic devices receives the wake-up instruction, that electronic device may enter the wake-up state. While waiting for the wake-up instruction, the multiple electronic devices may refrain from responding to any wake-up word or voice command they detect.
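The decision each electronic device makes on detecting the wake-up word can be sketched as a lookup in the local device list. The dictionary layout, the `worn` flag, the default identifier `smart_glasses`, and the `Action` names are assumptions; the fallback to negotiation follows the embodiment in which no worn image acquisition device is found.

```python
from enum import Enum
from typing import Dict


class Action(Enum):
    WAIT_FOR_WAKE_INSTRUCTION = "wait"
    NEGOTIATE = "negotiate"


def on_wake_word(local_device_list: Dict[str, dict],
                 capture_device_id: str = "smart_glasses") -> Action:
    """Decide what a device does after detecting the wake-up word.

    local_device_list maps device identifiers to a status record; the
    record's "worn" flag is a hypothetical field for the wearing state.
    """
    entry = local_device_list.get(capture_device_id)
    if entry is not None and entry.get("worn", False):
        # The user is expected to wake a device via the image acquisition
        # device, so stay out of the wake-up state and wait.
        return Action.WAIT_FOR_WAKE_INSTRUCTION
    # No image acquisition device, or it is not being worn.
    return Action.NEGOTIATE
```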
As the above embodiments show, when multiple electronic devices with the voice interaction function enabled detect the wake-up word, they can determine whether the user will wake up a device through the image acquisition device. If so, the multiple electronic devices may wait for a wake-up instruction and enter the wake-up state only after receiving it. This prevents all of these electronic devices from entering the wake-up state upon detecting the wake-up word, which would be a false wake-up. Moreover, the electronic device that receives the wake-up instruction is the one the user most likely wishes to wake up. This gives the user a better experience with the voice interaction function of the electronic devices.
With reference to the second aspect, in some embodiments, the processing device may select the target electronic device contained in the first image from the multiple electronic devices as follows: determine at least one of the type, the recognition accuracy, and the viewing-angle deviation of each electronic device contained in the first image, where the recognition accuracy indicates how accurate the recognition result for the type of the electronic device is, and the viewing-angle deviation indicates the distance between the position of the electronic device in the first image and the center of the first image; and determine, as the target electronic device, the electronic device among the multiple electronic devices that is contained in the first image and has the highest priority. The priority is determined from one or more of the type, the recognition accuracy, and the viewing-angle deviation: the position of the electronic device's type in a type-based wake-up ordering is positively correlated with the priority, the recognition accuracy is positively correlated with the priority, and the viewing-angle deviation is negatively correlated with the priority.
It can be understood that, with the recognition accuracy and the viewing-angle deviation held constant, the earlier the type of an electronic device appears in the type-based wake-up ordering, the higher the priority of that device. With the type and the viewing-angle deviation held constant, the higher the recognition accuracy, the higher the priority. With the type and the recognition accuracy held constant, the smaller the viewing-angle deviation, the higher the priority.
With reference to the second aspect, in some embodiments, the processing device may be one of the multiple electronic devices; that is, the processing device may be an electronic device with the voice interaction function enabled. If the processing device determines from the first image received from the image acquisition device that it is itself the target electronic device, it may enter the wake-up state directly, without sending a wake-up instruction.
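The self-target case can be sketched as a single branch in the dispatch step. The function name, the callback parameters, and the returned labels are hypothetical and serve only to make the two paths visible.

```python
from typing import Callable


def dispatch_wake(self_id: str, target_id: str,
                  send_wake_instruction: Callable[[str], None],
                  enter_wake_state: Callable[[], None]) -> str:
    """Wake the target, skipping the wake-up instruction when the target is this device."""
    if target_id == self_id:
        # The processing device is itself the target: no message is needed.
        enter_wake_state()
        return "woke_self"
    # Otherwise send the wake-up instruction to the selected device.
    send_wake_instruction(target_id)
    return "sent"
```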
With reference to the second aspect, in some embodiments, after capturing the first image, the image acquisition device may recognize the electronic devices contained in the first image and send information about the recognized electronic devices to the processing device. This information may include one or more of the following: type, recognition accuracy, and viewing-angle deviation. The processing device may determine the priorities of the electronic devices contained in the first image from this information and select, from the local device list, the electronic device that is contained in the first image and has the highest priority. The selected electronic device is the target electronic device.
With reference to the second aspect, in some embodiments, after capturing the first image, the image acquisition device may determine the priorities of the electronic devices contained in the first image and send these priorities to the processing device. The processing device may then select, from the local device list, the electronic device that is contained in the first image and has the highest priority. The selected electronic device is the target electronic device.
In a third aspect, the present application provides a device wake-up method. In this method, a first image is obtained, the target electronic device contained in the first image is selected from multiple electronic devices, and a wake-up instruction is sent to the target electronic device. The wake-up instruction may be used to trigger the target electronic device to enter the wake-up state.
With reference to the third aspect, in some embodiments, the method of the third aspect may be performed by an image acquisition device. In this case, obtaining the first image may be: when a first user input is detected, the image acquisition device captures the first image. Detecting the first user input may specifically be detecting the wake-up word, or detecting a user operation acting on a first position of the image acquisition device. The image acquisition device may be smart glasses.
It can be understood that the image acquisition device may determine whether the user needs to wake up a device by detecting the first user input. When it determines that the user needs to wake up a device, the image acquisition device may capture an image. That is, the first image may be captured by the image acquisition device once it has determined that the user needs to wake up a device.
As the above embodiments show, the image acquisition device may determine the target electronic device from the image it captures and instruct the target electronic device to enter the wake-up state. The target electronic device is the electronic device that the image acquisition device determines the user wishes to wake up. In other words, the user can wake up the desired electronic device with the help of the image acquisition device. When the user speaks the wake-up word to wake up a device, the target electronic device may enter the wake-up state to respond to the user's voice command. This reduces false wake-ups and gives the user a better experience with the voice interaction function of the electronic devices.
With reference to the third aspect, in some embodiments, the method of the third aspect may be performed by a processing device. In this case, obtaining the first image may be: receiving a first instruction from an image acquisition device. The first instruction may include the first image captured by the image acquisition device and may instruct the processing device to select, from multiple electronic devices, the target electronic device contained in the first image.
As the above embodiments show, the processing device may determine the target electronic device from the image received from the image acquisition device and instruct the target electronic device to enter the wake-up state. The target electronic device is the electronic device that the user wishes to wake up, as determined from the image captured by the image acquisition device. In other words, when the user speaks the wake-up word to wake up a device, the target electronic device may enter the wake-up state while the other electronic devices with the voice interaction function enabled do not. This reduces false wake-ups and gives the user a better experience with the voice interaction function of the electronic devices.
With reference to the third aspect, in some embodiments, the target electronic device contained in the first image may be selected from the multiple electronic devices as follows: determine at least one of the type, the recognition accuracy, and the viewing-angle deviation of each electronic device contained in the first image, where the recognition accuracy indicates how accurate the recognition result for the type of the electronic device is, and the viewing-angle deviation indicates the distance between the position of the electronic device in the first image and the center of the first image; and determine, as the target electronic device, the electronic device among the multiple electronic devices that is contained in the first image and has the highest priority. The priority is determined from one or more of the type, the recognition accuracy, and the viewing-angle deviation: the position of the electronic device's type in a type-based wake-up ordering is positively correlated with the priority, the recognition accuracy is positively correlated with the priority, and the viewing-angle deviation is negatively correlated with the priority.
It can be understood that, with the recognition accuracy and the viewing-angle deviation held constant, the earlier the type of an electronic device appears in the type-based wake-up ordering, the higher the priority of that device. With the type and the viewing-angle deviation held constant, the higher the recognition accuracy, the higher the priority. With the type and the recognition accuracy held constant, the smaller the viewing-angle deviation, the higher the priority.
In a fourth aspect, the present application provides a device wake-up method. In this method, when a first user input is detected, the image acquisition device may capture a first image. The image acquisition device may send a first instruction to a processing device. The first instruction may include the first image and may instruct the processing device to select, from multiple electronic devices, the target electronic device contained in the first image. The target electronic device may be the device to which the processing device sends a wake-up instruction. The wake-up instruction may be used to trigger the target electronic device to enter the wake-up state.
The first user input may be a voice input containing the wake-up word, or a user operation acting on a first position of the image acquisition device.
In some embodiments, the image acquisition device may be smart glasses.
As the above embodiments show, the image acquisition device may capture an image when the user needs to wake up a device and send the captured image to the processing device. The image acquisition device may instruct the processing device to determine the target electronic device. Thus, when the user speaks the wake-up word to wake up a device, the target electronic device may enter the wake-up state to respond to the user's voice command, while the other electronic devices with the voice interaction function enabled may remain out of the wake-up state. This reduces false wake-ups and gives the user a better experience with the voice interaction function of the electronic devices.
It can be understood that the image acquisition device need not perform the operation of determining the target electronic device, which reduces its power consumption. The processing device may be an electronic device with strong computing power, such as a mobile phone or a cloud server.
In a fifth aspect, the present application provides a device wake-up method. In this method, a first electronic device may detect the wake-up word. In response to the wake-up word, the first electronic device may check whether smart glasses exist in the device wake-up system and whether the smart glasses are being worn. If smart glasses exist in the device wake-up system and are being worn, the first electronic device may wait to receive a wake-up instruction. The wake-up instruction may be used to trigger the first electronic device to enter the wake-up state. Upon receiving the wake-up instruction, the first electronic device enters the wake-up state.
In a possible implementation, the electronic devices in the device wake-up system may be recorded in a local device list. The local device list may be stored in one or more electronic devices in the device wake-up system. Optionally, the local device list may also be stored in a cloud server. Every electronic device in the local device list can obtain the list. That is, every electronic device in the local device list can determine which electronic devices the device wake-up system contains.
When the wake-up word is detected, the first electronic device may determine whether smart glasses exist in the device wake-up system by determining whether the local device list includes the image acquisition device (the smart glasses in this aspect). If the local device list includes smart glasses, the first electronic device may determine that smart glasses exist in the device wake-up system. If so, the first electronic device may further determine whether the smart glasses are being worn. If the smart glasses are being worn, the first electronic device may wait for a wake-up instruction instead of entering the wake-up state immediately. When the first electronic device receives the wake-up instruction, it may enter the wake-up state. While waiting for the wake-up instruction, the first electronic device may not respond to any wake-up word or voice instruction it detects.
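The check described above can be sketched as a small decision function. This is only an illustrative sketch: the `DeviceEntry` record, its `kind` and `is_worn` fields, and the `Action` names are hypothetical and do not come from the application, which does not specify how the local device list or the wearing state is represented. The `NEGOTIATE` outcome stands for the case in which no worn smart glasses are found, which the application handles by negotiation among the devices.

```python
from dataclasses import dataclass
from enum import Enum


class Action(Enum):
    WAIT_FOR_WAKE_INSTRUCTION = "wait"   # do not wake yet; wait for the wake-up instruction
    NEGOTIATE = "negotiate"              # no worn smart glasses; devices negotiate instead


@dataclass
class DeviceEntry:
    """One entry of the local device list (hypothetical layout)."""
    device_id: str
    kind: str              # hypothetical type tag, e.g. "smart_glasses" or "speaker"
    is_worn: bool = False  # only meaningful for wearable devices


def on_wake_word(local_device_list):
    """Decide what the first electronic device does after detecting the wake-up word."""
    glasses = [d for d in local_device_list if d.kind == "smart_glasses"]
    # Smart glasses must both exist in the system and be in the wearing state.
    if any(d.is_worn for d in glasses):
        return Action.WAIT_FOR_WAKE_INSTRUCTION
    return Action.NEGOTIATE
```

While the result is `WAIT_FOR_WAKE_INSTRUCTION`, a device following this sketch would ignore further wake-up words and voice instructions until the wake-up instruction arrives.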
In some embodiments, the first electronic device is determined to be the target electronic device and receives the wake-up instruction. Before receiving the wake-up instruction, the target electronic device may have detected the wake-up word but no voice instruction (for example, in a scenario in which the user speaks only the wake-up word). In that case, on entering the wake-up state, the target electronic device may output a voice response to the wake-up word, for example "I'm here". Alternatively, before receiving the wake-up instruction, the target electronic device may have detected neither the wake-up word nor a voice instruction (for example, in a scenario in which the user does not speak the wake-up word and instead wakes up the device through a user operation on the first position). In that case, on entering the wake-up state, the target electronic device may also output the voice response to the wake-up word. In other words, whenever the target electronic device enters the wake-up state without having detected a voice instruction, it may output the voice response to the wake-up word to notify the user that it has entered the wake-up state. The user thus knows which electronic device has been woken up and can then use voice instructions to direct that device to perform operations. After entering the wake-up state, the target electronic device may recognize a voice instruction and perform the operation corresponding to that voice instruction.
In some embodiments, the first electronic device is determined to be the target electronic device and receives the wake-up instruction. If the target electronic device detected a voice instruction before receiving the wake-up instruction, it may, after entering the wake-up state, directly output a voice response to that voice instruction and perform the corresponding operation. For example, in a scenario in which the user speaks the wake-up word and the voice instruction in one utterance, or in a scenario in which the user speaks the voice instruction while or before performing the user operation on the first position, the target electronic device may have detected the voice instruction before receiving the wake-up instruction. The target electronic device may check whether the sound signals collected within a first time period before receiving the wake-up instruction, and those collected after receiving it, contain a voice instruction. This reduces cases in which the user speaks a voice instruction before the target electronic device receives the wake-up instruction and the device fails to respond because it did not detect that voice instruction.
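The behavior of the target electronic device on entering the wake-up state can be sketched as follows. This is a simplified illustration under stated assumptions: the `AudioHistory` class, its `window_s` parameter (standing in for the "first time period"), and the idea of storing an already recognized instruction per audio segment are all hypothetical. A real device would buffer raw sound signals and run recognition on them, and would also keep listening after the wake-up instruction, which this sketch omits.

```python
from collections import deque


class AudioHistory:
    """Keeps recent audio segments so speech heard before the wake-up instruction is not lost."""

    def __init__(self, window_s):
        self.window_s = window_s   # length of the "first time period", in seconds (assumed)
        self._segments = deque()   # (timestamp, recognized instruction or None)

    def add(self, timestamp, instruction):
        self._segments.append((timestamp, instruction))
        # Drop segments that are older than the window relative to the newest one.
        while self._segments and timestamp - self._segments[0][0] > self.window_s:
            self._segments.popleft()

    def instruction_before(self, wake_time):
        """Return the latest instruction heard within window_s before wake_time, if any."""
        for ts, instruction in reversed(self._segments):
            if instruction is not None and wake_time - self.window_s <= ts <= wake_time:
                return instruction
        return None


def response_on_wake(history, wake_time):
    """Choose the device's first action after it receives the wake-up instruction."""
    instruction = history.instruction_before(wake_time)
    if instruction is None:
        # No voice instruction heard: answer the wake-up word so the user knows who woke up.
        return ("reply", "I'm here")
    # A voice instruction was already heard: respond to it directly.
    return ("execute", instruction)
```

With a 5-second window, an instruction recorded one second before the wake-up instruction is executed directly, while a device that heard nothing replies with the wake-up acknowledgement.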
In some embodiments, if the first electronic device determines that no smart glasses exist in the device wake-up system (that is, the local device list does not include smart glasses), or determines that smart glasses exist in the device wake-up system but are not being worn, the first electronic device and the other electronic devices whose voice interaction function is enabled may negotiate to select one electronic device. The selected electronic device may enter the wake-up state, and the other electronic devices may not. In a possible implementation, the first electronic device and the other electronic devices whose voice interaction function is enabled may compare the strength of the sound signal containing the wake-up word as received by each device, and select the device that received the strongest signal. That electronic device may enter the wake-up state.
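The strength-based selection can be sketched in a few lines. The application does not specify the negotiation protocol, the unit of signal strength, or how ties are resolved, so the dictionary input and the tie-break by device identifier below are assumptions made only for illustration.

```python
def select_device_to_wake(strengths):
    """Pick the device that received the wake-up word with the highest signal strength.

    strengths: dict mapping a device identifier to the strength of the received
    sound signal containing the wake-up word (unit left unspecified).
    Ties are broken by the smallest device identifier (an assumption), so that
    every device running this function reaches the same result.
    """
    if not strengths:
        return None
    # sorted() fixes the iteration order; max() keeps the first maximal element.
    return max(sorted(strengths), key=lambda device_id: strengths[device_id])
```

Because the function is deterministic, each device could evaluate it locally on the shared strength values and enter the wake-up state only if its own identifier is returned.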
As can be seen from the foregoing embodiments, when multiple electronic devices whose voice interaction function is enabled detect the wake-up word, these devices can determine whether the user will wake up a device through the smart glasses. If they determine that the user will do so, they may wait for a wake-up instruction and enter the wake-up state only after receiving it. In this way, the devices do not all enter the wake-up state after detecting the wake-up word, which avoids false wake-ups. In addition, the electronic device that receives the wake-up instruction is the one the user most likely intends to wake up. This gives the user a better experience when using the voice interaction function of electronic devices.
In a sixth aspect, the present application provides an electronic device. The electronic device may include a memory and a processor. The memory may be configured to store a computer program. The processor may be configured to invoke the computer program, so that the electronic device performs any possible implementation of the third aspect, the fourth aspect, or the fifth aspect.
In a seventh aspect, the present application provides a chip applied to an electronic device. The chip includes one or more processors, and the processors are configured to invoke computer instructions so that the electronic device performs any possible implementation of the third aspect, the fourth aspect, or the fifth aspect.
In an eighth aspect, the present application provides a computer program product containing instructions. When the computer program product runs on an electronic device, the electronic device performs any possible implementation of the third aspect, the fourth aspect, or the fifth aspect.
In a ninth aspect, the present application provides a computer-readable storage medium including instructions. When the instructions run on an electronic device, the electronic device performs any possible implementation of the third aspect, the fourth aspect, or the fifth aspect.
It can be understood that the electronic device provided in the sixth aspect, the chip provided in the seventh aspect, the computer program product provided in the eighth aspect, and the computer-readable storage medium provided in the ninth aspect are all configured to perform the methods provided in the embodiments of the present application. For their beneficial effects, refer to the beneficial effects of the corresponding methods; details are not repeated here.
Brief Description of the Drawings
FIG. 1 is a schematic structural diagram of an electronic device according to an embodiment of the present application;
FIG. 2 is a schematic diagram of a device wake-up scenario according to an embodiment of the present application;
FIG. 3 is a schematic structural diagram of a communication system according to an embodiment of the present application;
FIG. 4 is a schematic structural diagram of another communication system according to an embodiment of the present application;
FIG. 5 is a schematic diagram of a device wake-up scenario according to an embodiment of the present application;
FIG. 6 is a schematic diagram of an image captured by smart glasses according to an embodiment of the present application;
FIG. 7 is a schematic diagram of a device wake-up scenario according to an embodiment of the present application;
FIG. 8 is a schematic diagram of an image captured by smart glasses according to an embodiment of the present application;
FIG. 9A and FIG. 9B are schematic diagrams of a device wake-up scenario according to an embodiment of the present application;
FIG. 10 is a flowchart of a method for an electronic device to enter the wake-up state according to an embodiment of the present application;
FIG. 11 is a schematic structural diagram of smart glasses according to an embodiment of the present application;
FIG. 12 is a flowchart of a device wake-up method according to an embodiment of the present application.
Detailed Description
The technical solutions in the embodiments of the present application are described clearly and in detail below with reference to the accompanying drawings. In the description of the embodiments of the present application, unless otherwise specified, "/" means "or"; for example, A/B may mean A or B. The term "and/or" merely describes an association between associated objects and indicates that three relationships are possible; for example, "A and/or B" may mean that A exists alone, that both A and B exist, or that B exists alone. In addition, in the description of the embodiments of the present application, "multiple" means two or more.
Hereinafter, the terms "first" and "second" are used for descriptive purposes only and shall not be understood as indicating or implying relative importance or as implicitly specifying the number of the indicated technical features. A feature qualified by "first" or "second" may therefore explicitly or implicitly include one or more such features. In the description of the embodiments of the present application, unless otherwise specified, "multiple" means two or more.
To reduce false wake-ups when electronic devices are woken up by voice, the embodiments of the present application provide a device wake-up method and a related apparatus. The electronic device involved in the embodiments of the present application is introduced first.
FIG. 1 shows an example schematic structural diagram of an electronic device 100.
As shown in FIG. 1, the electronic device 100 may include a processor 110, an external memory interface 120, an internal memory 121, a universal serial bus (USB) interface 130, a charging management module 140, a power management module 141, a battery 142, an antenna 1, an antenna 2, a mobile communication module 150, a wireless communication module 160, an audio module 170, a speaker 170A, a receiver 170B, a microphone 170C, an earphone jack 170D, a sensor module 180, a button 190, a motor 191, an indicator 192, a camera 193, a display screen 194, a subscriber identification module (SIM) card interface 195, and the like. The sensor module 180 may include a pressure sensor 180A, a gyroscope sensor 180B, a barometric pressure sensor 180C, a magnetic sensor 180D, an acceleration sensor 180E, a distance sensor 180F, a proximity light sensor 180G, a fingerprint sensor 180H, a temperature sensor 180J, a touch sensor 180K, an ambient light sensor 180L, a bone conduction sensor 180M, and the like.
It can be understood that the structure illustrated in this embodiment of the present application does not constitute a specific limitation on the electronic device 100. In other embodiments of the present application, the electronic device 100 may include more or fewer components than shown, combine some components, split some components, or arrange the components differently. The illustrated components may be implemented in hardware, software, or a combination of software and hardware.
The processor 110 may include one or more processing units. For example, the processor 110 may include an application processor (AP), a modem processor, a graphics processing unit (GPU), an image signal processor (ISP), a controller, a memory, a video codec, a digital signal processor (DSP), a baseband processor, and/or a neural-network processing unit (NPU). Different processing units may be independent devices or may be integrated into one or more processors.
The controller may be the nerve center and command center of the electronic device 100. The controller may generate operation control signals based on instruction opcodes and timing signals to control instruction fetching and instruction execution.
In some embodiments, the processor 110 may include a voice wake-up module and a voice instruction recognition module. The two modules may be integrated in different processor chips and executed by different chips. For example, the voice wake-up module may be integrated in a low-power coprocessor or DSP chip, and the voice instruction recognition module may be integrated in an AP, an NPU, or another chip. In this way, the chip hosting the voice instruction recognition module is started to trigger voice instruction recognition only after the voice wake-up module recognizes the preset wake-up word, which saves power on the electronic device. Alternatively, the voice wake-up module and the voice instruction recognition module may be integrated in the same processor chip, which performs both functions. For example, both modules may be integrated in an AP chip, an NPU, or another chip.
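The power-saving arrangement described above, in which the instruction recognizer is started only after the wake-up word is recognized, can be sketched as a two-stage pipeline. The sketch is purely illustrative: the `TwoStagePipeline` class is hypothetical, text strings stand in for sound signals, and simple substring matching stands in for wake-word detection, none of which is specified by the application.

```python
class TwoStagePipeline:
    """Low-power wake-word stage gating a more expensive instruction recognizer."""

    def __init__(self, wake_word, recognizer):
        self.wake_word = wake_word
        self.recognizer = recognizer      # callable standing in for the recognition chip
        self.recognizer_active = False    # the expensive stage starts powered down

    def feed(self, utterance):
        """Process one utterance; return the recognizer's result, or None if not awake."""
        if not self.recognizer_active:
            # Only the cheap wake-word check runs while the recognizer is inactive.
            if self.wake_word in utterance:
                self.recognizer_active = True
            # In this simplified sketch the waking utterance itself is not recognized.
            return None
        return self.recognizer(utterance)
```

Utterances fed before the wake-up word never reach the recognizer, which is the source of the power saving in this arrangement.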
The processor 110 may further include a voice instruction execution module. After the voice instruction recognition module recognizes a voice instruction, the voice instruction execution module may perform the operation corresponding to the voice instruction, for example, playing music, making a call, or sending a text message.
It can be understood that an electronic device that includes the voice wake-up module, the voice instruction recognition module, and the voice instruction execution module is an electronic device with voice interaction capability. Having voice interaction capability means that the electronic device can respond to a user's voice instruction and perform the operation corresponding to that voice instruction.
A memory may also be provided in the processor 110 to store instructions and data. In some embodiments, the memory in the processor 110 is a cache. The memory may hold instructions or data that the processor 110 has just used or uses repeatedly. If the processor 110 needs the instructions or data again, it can fetch them directly from this memory. This avoids repeated accesses and reduces the waiting time of the processor 110, thereby improving system efficiency.
The USB interface 130 is an interface that conforms to the USB standard specification, and may specifically be a Mini USB interface, a Micro USB interface, a USB Type-C interface, or the like. The USB interface 130 may be used to connect a charger to charge the electronic device 100, or to transfer data between the electronic device 100 and peripheral devices. It may also be used to connect earphones and play audio through them. The interface may further be used to connect other electronic devices, such as AR devices.
The charging management module 140 is configured to receive charging input from a charger. The charger may be a wireless charger or a wired charger. In some wired charging embodiments, the charging management module 140 may receive charging input from a wired charger through the USB interface 130. In some wireless charging embodiments, the charging management module 140 may receive wireless charging input through a wireless charging coil of the electronic device 100. While charging the battery 142, the charging management module 140 may also supply power to the electronic device through the power management module 141.
The power management module 141 is configured to connect the battery 142, the charging management module 140, and the processor 110. The power management module 141 receives input from the battery 142 and/or the charging management module 140, and supplies power to the processor 110, the internal memory 121, the external memory, the display screen 194, the camera 193, the wireless communication module 160, and the like. In some other embodiments, the power management module 141 may also be provided in the processor 110. In still other embodiments, the power management module 141 and the charging management module 140 may be provided in the same device.
The wireless communication function of the electronic device 100 may be implemented by the antenna 1, the antenna 2, the mobile communication module 150, the wireless communication module 160, the modem processor, the baseband processor, and the like.
The antenna 1 and the antenna 2 are configured to transmit and receive electromagnetic wave signals. Each antenna in the electronic device 100 may cover one or more communication frequency bands. Different antennas may also be multiplexed to improve antenna utilization. For example, the antenna 1 may be multiplexed as a diversity antenna of a wireless local area network. In some other embodiments, an antenna may be used in combination with a tuning switch.
The mobile communication module 150 may provide wireless communication solutions applied to the electronic device 100, including 2G/3G/4G/5G. The mobile communication module 150 may include at least one filter, a switch, a power amplifier, a low noise amplifier (LNA), and the like. The mobile communication module 150 may receive electromagnetic waves through the antenna 1, filter and amplify the received electromagnetic waves, and pass them to the modem processor for demodulation. The mobile communication module 150 may also amplify signals modulated by the modem processor and convert them into electromagnetic waves radiated through the antenna 1. In some embodiments, at least some functional modules of the mobile communication module 150 may be provided in the processor 110. In some embodiments, at least some functional modules of the mobile communication module 150 and at least some modules of the processor 110 may be provided in the same device.
The wireless communication module 160 may provide wireless communication solutions applied to the electronic device 100, including wireless local area network (WLAN) (such as a wireless fidelity (Wi-Fi) network), Bluetooth (BT), global navigation satellite system (GNSS), frequency modulation (FM), near field communication (NFC), and infrared (IR) technologies. The wireless communication module 160 may be one or more devices integrating at least one communication processing module. The wireless communication module 160 receives electromagnetic waves through the antenna 2, performs frequency modulation and filtering on the electromagnetic wave signals, and sends the processed signals to the processor 110. The wireless communication module 160 may also receive signals to be sent from the processor 110, perform frequency modulation and amplification on them, and convert them into electromagnetic waves radiated through the antenna 2.
The electronic device 100 implements the display function through the GPU, the display screen 194, the application processor, and the like. The GPU is a microprocessor for image processing and is connected to the display screen 194 and the application processor. The GPU performs mathematical and geometric calculations for graphics rendering. The processor 110 may include one or more GPUs, which execute program instructions to generate or change display information.
The display screen 194 is configured to display images, videos, and the like. In some embodiments, the electronic device 100 may include 1 or N display screens 194, where N is a positive integer greater than 1.
The electronic device 100 may implement the shooting function through the ISP, the camera 193, the video codec, the GPU, the display screen 194, the application processor, and the like.
The ISP is configured to process data fed back by the camera 193. For example, when a photo is taken, the shutter opens, light passes through the lens onto the photosensitive element of the camera, the optical signal is converted into an electrical signal, and the photosensitive element passes the electrical signal to the ISP, which converts it into an image visible to the naked eye. The ISP may also algorithmically optimize the noise, brightness, and skin tone of the image, and may optimize parameters of the shooting scene such as exposure and color temperature. In some embodiments, the ISP may be provided in the camera 193.
The camera 193 is configured to capture still images or video. An object forms an optical image through the lens, and the image is projected onto the photosensitive element. The photosensitive element may be a charge coupled device (CCD) or a complementary metal-oxide-semiconductor (CMOS) phototransistor. The photosensitive element converts the optical signal into an electrical signal and passes it to the ISP, which converts it into a digital image signal. The ISP outputs the digital image signal to the DSP for processing. The DSP converts the digital image signal into an image signal in a standard format such as RGB or YUV. In some embodiments, the electronic device 100 may include 1 or N cameras 193, where N is a positive integer greater than 1.
The digital signal processor is configured to process digital signals. In addition to digital image signals, it can process other digital signals. For example, when the electronic device 100 selects a frequency point, the digital signal processor is used to perform a Fourier transform or the like on the frequency point energy.
The video codec is configured to compress or decompress digital video. The electronic device 100 may support one or more video codecs. In this way, the electronic device 100 can play or record video in multiple encoding formats, for example, moving picture experts group (MPEG) 1, MPEG2, MPEG3, and MPEG4.
The NPU is a neural-network (NN) computing processor. By drawing on the structure of biological neural networks, for example the way signals are passed between neurons in the human brain, it processes input information quickly and can also learn continuously. The NPU enables applications such as intelligent cognition on the electronic device 100, for example, image recognition, face recognition, speech recognition, and text understanding.
The external memory interface 120 may be used to connect an external memory card, such as a Micro SD card, to expand the storage capacity of the electronic device 100. The external memory card communicates with the processor 110 through the external memory interface 120 to implement the data storage function, for example, to save files such as music and videos on the external memory card.
The internal memory 121 may be used to store computer-executable program code, which includes instructions. The processor 110 runs the instructions stored in the internal memory 121 to perform the various functional applications and data processing of the electronic device 100. The internal memory 121 may include a program storage area and a data storage area. The program storage area may store the operating system and the application programs required by at least one function (such as a sound playback function or an image playback function). The data storage area may store data created during use of the electronic device 100 (such as audio data and a phone book). In addition, the internal memory 121 may include high-speed random access memory, and may also include non-volatile memory, such as at least one magnetic disk storage device, a flash memory device, or universal flash storage (UFS).
The electronic device 100 may implement audio functions, such as music playback and recording, through the audio module 170, the speaker 170A, the receiver 170B, the microphone 170C, the earphone interface 170D, the application processor, and the like.
The audio module 170 is used to convert digital audio information into an analog audio signal for output, and to convert an analog audio input into a digital audio signal. The audio module 170 may also be used to encode and decode audio signals. In some embodiments, the audio module 170 may be disposed in the processor 110, or some functional modules of the audio module 170 may be disposed in the processor 110.
The speaker 170A, also called a "loudspeaker", is used to convert an audio electrical signal into a sound signal. Through the speaker 170A, the user can listen to music or to a hands-free call on the electronic device 100.
The receiver 170B, also called an "earpiece", is used to convert an audio electrical signal into a sound signal.
The microphone 170C, also called a "mic" or "sound transducer", is used to convert a sound signal into an electrical signal. When making a call or sending a voice message, the user can speak with the mouth close to the microphone 170C to input the sound signal into the microphone 170C. The electronic device 100 may be provided with at least one microphone 170C. In other embodiments, the electronic device 100 may be provided with two microphones 170C, which can implement noise reduction in addition to collecting sound signals. In still other embodiments, the electronic device 100 may be provided with three, four, or more microphones 170C to collect sound signals, reduce noise, identify the sound source, implement directional recording, and so on.
In some embodiments, the microphone 170C may be connected to a low-power processor, in which a voice wake-up module may be integrated. The microphone 170C may send the collected sound signal to the low-power processor. The voice wake-up module in the low-power processor may detect whether the sound signal contains a preset wake-up word. If it does, the low-power processor may wake up the application processor. A voice instruction recognition module and a voice instruction execution module may be integrated in the application processor. Once the application processor has been woken up, the sound signal collected by the microphone 170C may be sent to the application processor via the low-power processor. The voice instruction recognition module in the application processor may recognize the voice instruction in the sound signal, and the voice instruction execution module may then perform the operation corresponding to the voice instruction.
When the voice interaction function of the electronic device 100 is turned on, the microphone 170C and the low-power processor may remain in the working state in real time. The sound signal collected by the microphone 170C is first checked by the low-power processor for the preset wake-up word, and the application processor is woken up only when the sound signal contains that wake-up word. This reduces the power consumption of the electronic device 100.
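The two-stage gating described above can be sketched as follows. This is a minimal illustration rather than the implementation in the application: sound is modeled as already-transcribed text, and the class and method names (`LowPowerProcessor`, `ApplicationProcessor`, `on_sound`, `handle`) are hypothetical.

```python
from dataclasses import dataclass, field

WAKE_WORD = "xiaoyi xiaoyi"  # preset wake-up word used as the example in the description


@dataclass
class ApplicationProcessor:
    awake: bool = False
    handled: list = field(default_factory=list)

    def handle(self, utterance: str) -> None:
        # Stand-in for the voice instruction recognition and execution modules.
        self.handled.append(utterance)


@dataclass
class LowPowerProcessor:
    app: ApplicationProcessor

    def on_sound(self, text: str) -> bool:
        """Return True if the sound was forwarded to the application processor."""
        normalized = text.lower().strip()
        if not self.app.awake:
            if not normalized.startswith(WAKE_WORD):
                return False  # no wake-up word: the application processor stays asleep
            self.app.awake = True
            # Strip the wake-up word so only the voice instruction is forwarded.
            normalized = normalized[len(WAKE_WORD):].lstrip(" ,")
        self.app.handle(normalized)
        return True
```

Only the inexpensive wake-up-word check runs continuously; the recognition and execution path is exercised once the application processor has been woken.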
The earphone interface 170D is used to connect wired earphones. The earphone interface 170D may be the USB interface 130, a 3.5 mm open mobile terminal platform (OMTP) standard interface, or a cellular telecommunications industry association of the USA (CTIA) standard interface.
The pressure sensor 180A is used to sense a pressure signal and can convert the pressure signal into an electrical signal. In some embodiments, the pressure sensor 180A may be disposed on the display screen 194. There are many types of pressure sensor 180A, such as resistive, inductive, and capacitive pressure sensors. A capacitive pressure sensor may include at least two parallel plates made of conductive material. When a force acts on the pressure sensor 180A, the capacitance between the electrodes changes, and the electronic device 100 determines the intensity of the pressure from the change in capacitance. When a touch operation acts on the display screen 194, the electronic device 100 detects the intensity of the touch operation by means of the pressure sensor 180A. The electronic device 100 may also calculate the touch position from the detection signal of the pressure sensor 180A. In some embodiments, touch operations that act on the same touch position but with different intensities may correspond to different operation instructions. For example, when a touch operation whose intensity is less than a first pressure threshold acts on the short message application icon, an instruction to view short messages is executed. When a touch operation whose intensity is greater than or equal to the first pressure threshold acts on the short message application icon, an instruction to create a new short message is executed.
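The threshold rule in the short message example can be written out as a small sketch. The function name, the instruction strings, and the normalized threshold value are assumptions made for illustration; the description does not give a concrete value for the first pressure threshold.

```python
FIRST_PRESSURE_THRESHOLD = 0.5  # hypothetical normalized value, not from the description


def sms_icon_instruction(intensity: float,
                         threshold: float = FIRST_PRESSURE_THRESHOLD) -> str:
    """Map the touch intensity on the short message icon to an operation instruction."""
    if intensity < threshold:
        return "view_short_messages"
    # Intensity greater than or equal to the first pressure threshold.
    return "create_short_message"
```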
The gyroscope sensor 180B may be used to determine the motion posture of the electronic device 100. In some embodiments, the angular velocity of the electronic device 100 around three axes (namely the x, y, and z axes) may be determined by the gyroscope sensor 180B.
The air pressure sensor 180C is used to measure air pressure.
The magnetic sensor 180D includes a Hall sensor. The electronic device 100 may use the magnetic sensor 180D to detect the opening and closing of a flip cover.
The acceleration sensor 180E can detect the magnitude of the acceleration of the electronic device 100 in various directions (generally along three axes). When the electronic device 100 is stationary, it can detect the magnitude and direction of gravity. It can also be used to identify the posture of the electronic device, for applications such as switching between landscape and portrait display and pedometers.
The distance sensor 180F is used to measure distance. The electronic device 100 may measure distance by infrared or laser. In some embodiments, when shooting a scene, the electronic device 100 may use the distance sensor 180F to measure distance in order to focus quickly.
The proximity light sensor 180G may include, for example, a light-emitting diode (LED) and a light detector such as a photodiode. The light-emitting diode may be an infrared light-emitting diode. The electronic device 100 emits infrared light outward through the light-emitting diode and uses the photodiode to detect infrared light reflected from nearby objects. When sufficient reflected light is detected, the electronic device 100 may determine that there is an object nearby. When insufficient reflected light is detected, the electronic device 100 may determine that there is no object nearby.
The ambient light sensor 180L is used to sense the ambient light brightness. The electronic device 100 may adaptively adjust the brightness of the display screen 194 according to the sensed ambient light brightness. The ambient light sensor 180L may also be used to automatically adjust the white balance when taking pictures. The ambient light sensor 180L may also cooperate with the proximity light sensor 180G to detect whether the electronic device 100 is in a pocket, so as to prevent accidental touches.
The fingerprint sensor 180H is used to collect fingerprints. The electronic device 100 may use the characteristics of the collected fingerprint to implement fingerprint unlocking, access to an application lock, fingerprint-triggered photographing, fingerprint-triggered call answering, and the like.
The temperature sensor 180J is used to detect temperature. In some embodiments, the electronic device 100 executes a temperature handling policy based on the temperature detected by the temperature sensor 180J. For example, when the temperature reported by the temperature sensor 180J exceeds a threshold, the electronic device 100 reduces the performance of a processor located near the temperature sensor 180J in order to lower power consumption and implement thermal protection. In other embodiments, when the temperature is lower than another threshold, the electronic device 100 heats the battery 142 to prevent the low temperature from causing the electronic device 100 to shut down abnormally.
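A temperature handling policy of this kind can be sketched as a simple two-threshold rule. The threshold values and action names below are hypothetical, since the description does not specify them.

```python
def thermal_action(temp_c: float, high_c: float = 45.0, low_c: float = 0.0) -> str:
    """Choose an action from the reported temperature.

    high_c and low_c are illustrative thresholds, not values from the description.
    """
    if temp_c > high_c:
        return "throttle_processor"  # reduce performance for thermal protection
    if temp_c < low_c:
        return "heat_battery"        # avoid an abnormal low-temperature shutdown
    return "none"
```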
The touch sensor 180K is also called a "touch panel". The touch sensor 180K may be disposed on the display screen 194, and the touch sensor 180K and the display screen 194 together form a touchscreen. The touch sensor 180K is used to detect a touch operation acting on or near it. The touch sensor may pass the detected touch operation to the application processor to determine the type of touch event. Visual output related to the touch operation may be provided through the display screen 194. In other embodiments, the touch sensor 180K may also be disposed on the surface of the electronic device 100 at a position different from that of the display screen 194.
The bone conduction sensor 180M can acquire vibration signals. In some embodiments, the bone conduction sensor 180M can acquire the vibration signal of the bone that vibrates when a person speaks. The bone conduction sensor 180M can also be placed in contact with the human pulse to receive a blood-pressure pulse signal. In some embodiments, the bone conduction sensor 180M may also be disposed in an earphone to form a bone conduction earphone. The audio module 170 may parse a voice signal from the vibration signal of the vocal vibrating bone acquired by the bone conduction sensor 180M, thereby implementing a voice function.
The keys 190 include a power key, volume keys, and the like. The keys 190 may be mechanical keys or touch keys. The electronic device 100 may receive key input and generate key signal input related to user settings and function control of the electronic device 100.
The motor 191 can generate vibration alerts.
The indicator 192 may be an indicator light, which may be used to indicate the charging status and changes in battery level, and may also be used to indicate messages, missed calls, notifications, and the like.
The SIM card interface 195 is used to connect a SIM card. A SIM card can be brought into contact with or separated from the electronic device 100 by being inserted into or pulled out of the SIM card interface 195. The electronic device 100 may support one or N SIM card interfaces, where N is a positive integer greater than 1. The SIM card interface 195 may support Nano SIM cards, Micro SIM cards, SIM cards, and the like. Multiple cards may be inserted into the same SIM card interface 195 at the same time, and the types of these cards may be the same or different. The SIM card interface 195 may also be compatible with different types of SIM cards and with external memory cards. The electronic device 100 interacts with the network through the SIM card to implement functions such as calls and data communication. In some embodiments, the electronic device 100 uses an eSIM, that is, an embedded SIM card. The eSIM card may be embedded in the electronic device 100 and cannot be separated from it.
The electronic device 100 may be a mobile phone, a tablet computer, a notebook computer, a speaker, a television, a router, a wearable device (such as smart glasses, a smart watch, or a smart band), a smart home device (such as a refrigerator, a washing machine, an air conditioner, or a lamp), and so on. The embodiments of the present application do not limit the specific type of the electronic device 100.
Many electronic devices now have a speech recognition application installed, such as a voice assistant application. An electronic device with a voice assistant application installed has a voice interaction function. When the voice interaction function is turned on, the electronic device may collect sound in the environment in real time and detect whether the sound contains a wake-up word. The wake-up word can be used to wake up the electronic device. Here, waking up the electronic device means triggering the electronic device to invoke a processor (such as the application processor) in which a voice instruction recognition module and a voice instruction execution module are integrated, so as to recognize the voice instruction in the collected sound and perform the operation corresponding to the voice instruction.
In some embodiments, the electronic device is in a sleep state, which may mean that the application processor of the electronic device is in a sleep state. While the device is in the sleep state, its microphone and low-power processor may remain in the working state in real time. When a wake-up word is detected in the ambient sound, the application processor of the electronic device may be woken up to perform the operation corresponding to the voice instruction (for example, a speaker in the sleep state is woken up and plays music according to the user's voice instruction). In other embodiments, the application processor of the electronic device is in the working state, and the microphone and low-power processor of the electronic device may likewise remain in the working state in real time. When a wake-up word is detected in the ambient sound, the application processor of the electronic device may receive the sound collected by the microphone, recognize the voice instruction contained in the sound, and perform the operation corresponding to the voice instruction (for example, a speaker that is playing music detects the wake-up word and turns on the air conditioner according to the user's voice instruction).
In some embodiments, a room contains multiple electronic devices with a voice interaction function. As shown in FIG. 2, a room contains a mobile phone, a speaker, and a television. The mobile phone, the speaker, and the television all have a voice interaction function, and the function is turned on in each of them. The user wishes to instruct the speaker, by voice, to play music. The user may say "Xiaoyi Xiaoyi, I want to listen to a song", where "Xiaoyi Xiaoyi" is the preset wake-up word and "I want to listen to a song" is the voice instruction. Because the mobile phone, the speaker, and the television can all collect sound in the environment, all of these electronic devices can detect the wake-up word, and all of them can therefore be woken up. Once woken up, each of them can recognize the voice instruction "I want to listen to a song" and perform the corresponding operation, namely playing music.
As shown in FIG. 2, when the user says "Xiaoyi Xiaoyi, I want to listen to a song", the mobile phone may answer "No problem" by voice and invoke a music application to play music. The speaker may answer "No problem" by voice and invoke a music application to play music. The television may also answer "No problem" and invoke a music application to play music. The electronic device that the user wishes to wake up is the speaker, so the mobile phone and the television have been woken up by mistake. Such false wake-ups disturb the user and degrade the experience of using the voice interaction function of the electronic devices.
In one possible implementation, a device priority ordering is stored in each of multiple electronic devices that have a voice interaction function. The device priority ordering may be: speaker > television > tablet computer > mobile phone. After detecting the wake-up word (such as "Xiaoyi Xiaoyi"), these electronic devices may communicate with one another and, according to the device priority ordering, determine which of them ranks first, for example the television. The television may then respond to the wake-up word and wake up its application processor to execute the user's voice instruction. The television may respond to the wake-up word by, for example, answering "I'm here" by voice. The lower-ranked electronic devices among them do not respond to the wake-up word.
The embodiments of the present application do not specifically limit the device priority ordering.
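The priority-based arbitration can be sketched as below, using the example ordering given above. The function name and the device identifiers are hypothetical, and the message exchange between the devices is abstracted into a single list of the devices that detected the wake-up word.

```python
# Example ordering from the description: speaker > television > tablet computer > mobile phone.
DEVICE_PRIORITY = ["speaker", "tv", "tablet", "phone"]


def elect_responder(heard_by):
    """Return the highest-priority device among those that detected the wake-up word."""
    rank = {device: index for index, device in enumerate(DEVICE_PRIORITY)}
    candidates = [device for device in heard_by if device in rank]
    if not candidates:
        return None
    return min(candidates, key=rank.__getitem__)
```

For instance, `elect_responder(["phone", "tv", "tablet"])` selects the television, because the speaker is not among the devices that heard the wake-up word. The result depends only on the stored ordering, not on which device the user actually intended.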
The method by which these electronic devices communicate with one another after detecting the wake-up word may be a Bluetooth-based communication method. In that case, the distances between the electronic devices are within Bluetooth communication range, and when the user speaks the wake-up word, all of these electronic devices can detect it. The embodiments of the present application do not limit the communication method used when these electronic devices negotiate which of them responds to the wake-up word.
The above method can reduce the cases in which multiple electronic devices all respond to the wake-up word. However, the electronic device that is selected through negotiation according to the device priority ordering is not necessarily the one that the user wishes to wake up. In other words, the above method struggles to meet the user's actual needs, and waking up an electronic device in this way may still result in false wake-ups.
An embodiment of the present application provides a device wake-up method. In this method, the user can use smart glasses to wake up the electronic device that the user wishes to wake up. Specifically, while the smart glasses are being worn, they may detect whether the user needs to wake up another electronic device. If so, the smart glasses may capture an image, which is an image of the user's field of view. The smart glasses may perform image recognition on the image to determine the types of electronic devices it contains, and may use a ranking algorithm to prioritize those electronic devices. The smart glasses may obtain a local device list and send a wake-up instruction to the electronic device that ranks highest and is also present in the local device list. An electronic device with a voice interaction function can be woken up after receiving the wake-up instruction, while other electronic devices that have not received the wake-up instruction do not respond to the user's voice instruction.
It can be understood that when a user wishes to wake up an electronic device, the user usually looks at that device and speaks a voice instruction. Therefore, if the user is wearing smart glasses, the smart glasses can capture an image of the user's field of view and determine from it which electronic device the user wishes to wake up. Once that electronic device has been determined, the smart glasses may send it a wake-up instruction. On receiving the wake-up instruction, the electronic device can be woken up, recognize the user's voice instruction, and perform the corresponding operation.
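The selection step performed by the smart glasses can be sketched as follows. This passage does not specify the ranking algorithm, so the sketch assumes, purely for illustration, that devices closer to the center of the captured image rank higher. The `Detection` type, its fields, and the function name are hypothetical.

```python
from dataclasses import dataclass
from typing import Iterable, Optional, Set


@dataclass(frozen=True)
class Detection:
    device_type: str      # device type produced by image recognition
    center_offset: float  # assumed ranking feature: distance from the image center, 0.0 to 1.0


def pick_wake_target(detections: Iterable[Detection],
                     local_device_list: Set[str]) -> Optional[str]:
    """Return the highest-ranked detected device that is also in the local device list."""
    ranked = sorted(detections, key=lambda d: d.center_offset)
    for detection in ranked:
        if detection.device_type in local_device_list:
            return detection.device_type  # the wake-up instruction would be sent to this device
    return None
```

A device that ranks first in the image but is absent from the local device list is skipped, which mirrors the condition that the wake-up target must be both highest-ranked and present in the list.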
As the above method shows, in a scenario with multiple electronic devices that have a voice interaction function, the user can use smart glasses to wake up the electronic device that the user wishes to wake up. This can effectively reduce false wake-ups and give the user a better experience of the voice interaction function of electronic devices.
The above method involves communication between the smart glasses and other electronic devices. To facilitate understanding of the device wake-up method provided in the present application, a communication system provided in the present application is introduced below.
FIG. 3 shows an example schematic diagram of the communication system 10.
The communication system 10 may include multiple electronic devices, between which a communication connection 108 may be established. For example, as shown in FIG. 3, the communication system 10 may include smart glasses 101, a mobile phone 102, an earphone 103, a tablet computer 104, a router 105, a speaker 106, and a television 107. The communication system 10 is not limited to the electronic devices shown in FIG. 3 and may also include other types of electronic devices, for example desktop computers, laptop computers, handheld computers, augmented reality (AR) devices, virtual reality (VR) devices, artificial intelligence (AI) devices, in-vehicle head units, game consoles, and other smart wearable devices. It may also include Internet of Things (IoT) devices or smart home devices such as smart water heaters, smart lamps, and smart air conditioners. The embodiments of the present application do not limit this. For the structure of these electronic devices, refer to the schematic structural diagram of the electronic device 100 shown in FIG. 1.
In the communication system 10, a communication connection 108 may be established between the electronic devices, and the communication connection 108 may be a near-field communication connection. The near-field communication connection may be a wired connection, such as a universal serial bus (USB) connection, or a wireless connection, such as a Bluetooth connection, a Wi-Fi connection, or a wireless fidelity peer-to-peer (Wi-Fi P2P) connection. The embodiments of the present application do not limit the specific form of the near-field communication connection.
Based on the communication system 10 shown in FIG. 3, the local device list involved in the embodiments of the present application is introduced here.
In some embodiments, the local device list may contain electronic devices that are connected to the same communication network, for example the electronic devices in a household that are connected to the same home Wi-Fi. Communication connections are established between the electronic devices in the local device list. For example, the communication system 10 has one local device list, and the electronic devices included in the communication system 10 are the electronic devices in that list. That is, an electronic device that joins the communication connection 108 may be added to the local device list, and an electronic device that leaves the communication connection 108 may be removed from it. The local device list may be stored in one or more electronic devices included in the communication system 10, or in a cloud server.
All electronic devices in the communication system 10 can obtain and update the local device list. For example, any electronic device in the communication system 10 may update the local device list when it detects an electronic device joining or leaving the communication connection 108. If the local device list is stored in multiple electronic devices in the communication system 10, it may be updated synchronously in all of them. In this way, every electronic device in the communication system 10 obtains the same local device list.
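One simple way to picture the synchronized update is a structure that keeps one copy of the list per holding device and applies each join or leave to every copy. This is a sketch under that assumption only; the class and method names are hypothetical, and a real system would propagate the updates over the communication connection 108 rather than through shared memory.

```python
class SharedLocalDeviceList:
    """Keeps one copy of the local device list per holder and updates all copies together."""

    def __init__(self, holders):
        self._copies = {holder: set() for holder in holders}

    def join(self, device):
        # A device joined the communication connection: add it to every copy.
        for copy in self._copies.values():
            copy.add(device)

    def leave(self, device):
        # A device left the communication connection: remove it from every copy.
        for copy in self._copies.values():
            copy.discard(device)

    def get(self, holder):
        # Return a read-only snapshot of the copy stored on the given holder.
        return frozenset(self._copies[holder])
```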
In some embodiments, the local device list may be created by one electronic device in the communication system 10, for example the mobile phone 102. An electronic device that is added to the local device list may be one that has passed trusted identity authentication. The trusted identity authentication may be performed by an electronic device that is already in the local device list (such as the mobile phone 102). For example, in response to a user operation agreeing to add the speaker 106 to the local device list, the mobile phone 102 may add the speaker 106 to the local device list. The embodiments of the present application do not limit the specific implementation of the trusted identity authentication, which may be, for example, the process of provisioning an electronic device onto the network.
The embodiments of the present application do not limit the manner in which the electronic devices in the communication system 10 are connected for communication.
In one possible implementation, the electronic devices in the communication system 10 may establish the communication connection 108 shown in FIG. 4. The mobile phone 102, the tablet computer 104, the speaker 106, and the television 107 may each establish a Wi-Fi connection with the router 105. The electronic devices that establish a Wi-Fi connection with the router 105 can access the network through the router 105. In other words, the mobile phone 102, the tablet computer 104, the router 105, the speaker 106, and the television 107 are in the same local area network (for example, the same home Wi-Fi), and the local device list may contain the electronic devices in this local area network. The smart glasses 101 and the earphone 103 may establish Bluetooth connections with the mobile phone 102. When the smart glasses 101 and the earphone 103 are connected to the mobile phone 102 via Bluetooth and the mobile phone 102 is in the local area network, the mobile phone 102 may update the local device list. Specifically, the mobile phone 102 may add the smart glasses 101 and the earphone 103 to the local device list. The electronic devices in the local device list may then include the smart glasses 101, mobile phone 102, earphone 103, tablet computer 104, router 105, speaker 106, and television 107 shown in FIG. 4.
If the smart glasses 101 end the Bluetooth connection with the mobile phone 102 while the mobile phone 102 is in the local area network, the mobile phone 102 may update the local device list. Specifically, the mobile phone 102 may remove the smart glasses 101 from the local device list. If the mobile phone 102 ends its Wi-Fi connection with the router 105 (for example, after the user leaves home with the phone) while the router 105 is in the local area network, the router 105 may update the local device list. Specifically, the router 105 may remove the mobile phone 102 from the local device list. After the mobile phone 102 has been removed, an electronic device that is still in the local device list may detect whether the electronic devices that joined the communication connection 108 through the mobile phone 102 (such as the smart glasses 101 and the earphone 103) are connected to any electronic device in the communication system 10 other than the mobile phone 102. If the smart glasses 101 and the earphone 103 are found to be connected only to the mobile phone 102, they may be removed from the local device list. If they are found to be connected to another electronic device as well (such as the tablet computer 104), they may remain in the local device list.
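The list maintenance described above can be sketched as follows. This is a minimal illustration, not the implementation of the application: the class `LocalDeviceList`, its method names, the device labels, and the `"lan"` marker for direct Wi-Fi membership are all hypothetical.

```python
class LocalDeviceList:
    """Hypothetical model of the local device list described above."""

    def __init__(self):
        # device name -> set of links that keep it in the list
        # ("lan" marks a direct Wi-Fi connection to the router's network)
        self._links = {}

    def add(self, device, via="lan"):
        """Record that `device` is reachable through `via`."""
        self._links.setdefault(device, set()).add(via)

    def remove_link(self, device, via):
        """Drop one link; remove the device if no link remains, then cascade."""
        links = self._links.get(device)
        if links is None:
            return
        links.discard(via)
        if links:
            return  # still connected through another device
        del self._links[device]
        # Devices that joined through `device` lose that link as well.
        dependents = [d for d, l in self._links.items() if device in l]
        for dependent in dependents:
            self.remove_link(dependent, device)

    def devices(self):
        return sorted(self._links)


devices = LocalDeviceList()
for name in ("router", "phone", "tablet", "speaker", "tv"):
    devices.add(name)                       # members of the local area network
devices.add("glasses", via="phone")         # Bluetooth through the phone
devices.add("earphones", via="phone")
devices.add("earphones", via="tablet")      # also connected to the tablet
devices.remove_link("phone", "lan")         # the user leaves home with the phone
print(devices.devices())
```

In this run the glasses leave the list together with the phone, while the earphones stay because they are still connected to the tablet.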
As can be seen from the above implementation, the electronic devices in the local device list may be the electronic devices in the local area network established by the router 105 (such as the mobile phone 102, the tablet computer 104, the speaker 106, and the television 107), together with the electronic devices that are connected to devices in that local area network by other wireless means (such as the smart glasses 101 and the earphone 103).
In some embodiments, an application (APP) for controlling other electronic devices is installed on the mobile phone 102. The APP may be, for example, a smart home APP. The electronic devices in the local device list may be the electronic devices that the mobile phone 102 can control through the smart home APP (such as the router 105, the speaker 106, and the television 107), together with other electronic devices that are connected to the mobile phone 102 but cannot be controlled through the smart home APP (such as the smart glasses 101 and the earphone 103). In a possible implementation, the mobile phone 102 and the electronic devices controllable through the smart home APP are all connected to the router 105. In response to a user operation on the smart home APP, the mobile phone 102 may send a control instruction through the router 105 to an electronic device controllable through the smart home APP. Optionally, the mobile phone 102 may also communicate with such a device directly, without forwarding by the router 105. The embodiments of this application do not limit how the mobile phone 102 controls other electronic devices through the smart home APP.
It can be understood that the schematic structural diagrams of the communication system shown in FIG. 3 and FIG. 4 are merely exemplary illustrations of the embodiments of this application and should not be construed as limiting this application.
The following describes in detail scenarios, according to the embodiments of this application, in which a user wakes up an electronic device with the help of smart glasses.
As shown in FIG. 5, the electronic devices in a home may include the smart glasses 101, the mobile phone 102, the router 105, the speaker 106, and the television 107. The mobile phone 102, the speaker 106, and the television 107 each establish a Wi-Fi connection with the router 105. The user wears the smart glasses 101, which establish a Bluetooth connection with the mobile phone 102. As described in the foregoing embodiments, the smart glasses 101, the mobile phone 102, the router 105, the speaker 106, and the television 107 may form a communication system. The local device list of this communication system includes the smart glasses 101, the mobile phone 102, the router 105, the speaker 106, and the television 107.
In the scenario shown in FIG. 5, the user wishes to wake up the speaker 106 and instruct it by voice to play music. The user, wearing the smart glasses 101, looks at the speaker 106 and says "Xiaoyi Xiaoyi, I want to listen to a song". The mobile phone 102, the speaker 106, and the television 107 are all electronic devices that have a voice interaction function and have that function enabled. The wake-up word for the mobile phone 102, the speaker 106, and the television 107 is the same, for example "Xiaoyi Xiaoyi". The mobile phone 102, the speaker 106, and the television 107 can each collect the user's voice input through a microphone. The voice input includes the wake-up word "Xiaoyi Xiaoyi" and the voice instruction "I want to listen to a song".
In a possible implementation, upon detecting the wake-up word, the mobile phone 102, the speaker 106, and the television 107 may each enter a pre-wake-up state from the wake-up word monitoring state.
The wake-up word monitoring state may be a state in which an electronic device collects ambient sound and identifies whether the ambient sound contains the wake-up word. In this state, the microphone and the low-power processor of the electronic device may operate continuously: the microphone collects ambient sound, and the low-power processor identifies whether the ambient sound contains the wake-up word.
The pre-wake-up state may be a state in which, after detecting the wake-up word, an electronic device checks whether smart glasses are present in the local device list and whether the smart glasses are being worn. In other words, upon detecting the wake-up word, the mobile phone 102, the speaker 106, and the television 107 may check whether the smart glasses 101 are present in the local device list and whether the smart glasses 101 are being worn, rather than waking up immediately in response to the wake-up word. In the pre-wake-up state, an electronic device may wait for a wake-up instruction and does not respond to the detected wake-up word, voice instruction, or the like.
It should be noted that an electronic device that is not in the wake-up state (for example, one that is in the wake-up word monitoring state) may enter the wake-up state upon receiving a wake-up instruction.
If it is detected that the smart glasses 101 are not present in the local device list, for example because there are no smart glasses 101 in the home or the smart glasses 101 are not connected to the mobile phone 102, the mobile phone 102, the speaker 106, and the television 107 may each enter the wake-up state from the pre-wake-up state. Alternatively, the mobile phone 102, the speaker 106, and the television 107 may communicate with one another and negotiate to determine a single electronic device that responds to the wake-up word. The selected electronic device may enter the wake-up state from the pre-wake-up state. The other electronic devices may return from the pre-wake-up state to the wake-up word monitoring state; that is, they do not respond to the wake-up word.
The wake-up state may indicate that the voice recognition application of an electronic device has been woken up. In the wake-up state, the electronic device may start the voice recognition application. Specifically, the electronic device may start an application processor to recognize a voice instruction and perform the operation corresponding to that instruction. It should be noted that, in the wake-up state, the electronic device may also continue to monitor in real time whether the ambient sound contains the wake-up word. In a possible implementation, if the electronic device does not recognize a voice instruction in the ambient sound within a preset time period after entering the wake-up state, it may return from the wake-up state to the wake-up word monitoring state.
If it is detected that the smart glasses 101 are present in the local device list but are not being worn, the mobile phone 102, the speaker 106, and the television 107 may each enter the wake-up state from the pre-wake-up state. Alternatively, the mobile phone 102, the speaker 106, and the television 107 may communicate with one another and negotiate to determine a single electronic device that responds to the wake-up word.
It can be understood that, when the smart glasses 101 are not present in the local device list, or when they are present but are not being worn, the user cannot use the smart glasses 101 to wake up the electronic device that the user wishes to wake up. In that case, the electronic devices with a voice interaction function may all be woken up after detecting the wake-up word, or they may negotiate to determine the electronic device that the user most likely wishes to wake up and let that device respond to the wake-up word.
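The check performed in the pre-wake-up state can be summarized in a short sketch. The names `PreWakeAction` and `pre_wake_action`, and the device labels, are hypothetical and serve only to illustrate the branching described above.

```python
from enum import Enum


class PreWakeAction(Enum):
    # Wake up directly, or negotiate with the other devices to pick one responder.
    WAKE_OR_NEGOTIATE = "wake_or_negotiate"
    # Stay in the pre-wake-up state and wait for a wake-up instruction.
    WAIT_FOR_INSTRUCTION = "wait_for_instruction"


def pre_wake_action(local_devices, glasses_id, glasses_worn):
    """Decide what a device does after it has detected the wake-up word."""
    if glasses_id not in local_devices:
        # No smart glasses in the local device list.
        return PreWakeAction.WAKE_OR_NEGOTIATE
    if not glasses_worn:
        # Glasses are listed but nobody is wearing them.
        return PreWakeAction.WAKE_OR_NEGOTIATE
    return PreWakeAction.WAIT_FOR_INSTRUCTION


home = {"glasses", "phone", "router", "speaker", "tv"}
print(pre_wake_action(home, "glasses", glasses_worn=True))
print(pre_wake_action(home, "glasses", glasses_worn=False))
print(pre_wake_action(home - {"glasses"}, "glasses", glasses_worn=False))
```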
If it is detected that the smart glasses 101 are present in the local device list and are being worn, the mobile phone 102, the speaker 106, and the television 107 may wait for a wake-up instruction for a preset time period. An electronic device that receives a wake-up instruction within the preset time period may enter the wake-up state from the pre-wake-up state. An electronic device that does not receive a wake-up instruction within the preset time period may return from the pre-wake-up state to the wake-up word monitoring state.
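The transitions among the three states for the case in which the glasses are worn can be sketched as a small state machine. This is an illustrative sketch only: the class `VoiceDevice`, the method names, and the 2-second waiting period are assumptions, since the text does not specify the length of the preset time period. Time is passed in explicitly so that the example is deterministic.

```python
from enum import Enum, auto


class State(Enum):
    MONITORING = auto()   # wake-up word monitoring state
    PRE_WAKE = auto()     # pre-wake-up state
    AWAKE = auto()        # wake-up state


class VoiceDevice:
    def __init__(self, wait_timeout_s=2.0):
        self.state = State.MONITORING
        self.wait_timeout_s = wait_timeout_s  # assumed value for the preset period
        self._pre_wake_since = None

    def on_wake_word(self, now):
        """Wake-up word detected: start waiting for a wake-up instruction."""
        self.state = State.PRE_WAKE
        self._pre_wake_since = now

    def on_wake_instruction(self):
        """Wake-up instruction received: enter the wake-up state."""
        self.state = State.AWAKE
        self._pre_wake_since = None

    def tick(self, now):
        """Return to monitoring if the waiting period has elapsed."""
        if (self.state is State.PRE_WAKE
                and now - self._pre_wake_since >= self.wait_timeout_s):
            self.state = State.MONITORING
            self._pre_wake_since = None


speaker, tv = VoiceDevice(), VoiceDevice()
speaker.on_wake_word(now=0.0)
tv.on_wake_word(now=0.0)
speaker.on_wake_instruction()   # only the speaker receives the instruction
tv.tick(now=2.5)                # the television times out
print(speaker.state, tv.state)
```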
While being worn, the smart glasses 101 may detect whether the user needs to wake up another electronic device.
In a possible implementation, the smart glasses 101 have a microphone and a low-power processor, both of which may be active while the glasses are worn. The smart glasses 101 may collect ambient sound through the microphone and use the low-power processor to identify whether the ambient sound contains the wake-up word. Upon detecting the wake-up word, the smart glasses 101 may determine that the user needs to wake up another electronic device. The smart glasses 101 may then capture an image through a camera. The captured image is an image of the user's field of view. The smart glasses 101 may perform image recognition on the image to determine the types of electronic devices it contains.
In the scenario above, in which the user wears the smart glasses 101, looks at the speaker 106, and says "Xiaoyi Xiaoyi, I want to listen to a song", the image captured by the smart glasses 101 may be as shown in FIG. 6. The electronic devices in the image include the speaker 106 and the television 107. The speaker 106 is located at the center of the image, and the television 107 is located at its right edge. The smart glasses 101 may use a sorting algorithm to rank the electronic devices in the image by priority. For example, the ranking may give the speaker 106 a higher priority than the television 107.
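The sorting algorithm itself is not described in this passage. Purely as an illustration of how a ranking consistent with FIG. 6 could arise, the sketch below orders detected devices by the distance of their center points from the image center. This heuristic, the function `rank_by_centrality`, and the pixel coordinates are assumptions and should not be read as the algorithm of the application.

```python
import math


def rank_by_centrality(detections, image_width, image_height):
    """Order device names from closest to farthest from the image center.

    `detections` maps a device name to the (x, y) center of its bounding box.
    """
    cx, cy = image_width / 2, image_height / 2

    def distance(item):
        _, (x, y) = item
        return math.hypot(x - cx, y - cy)

    return [name for name, _ in sorted(detections.items(), key=distance)]


# Assumed coordinates: speaker near the center, television near the right edge.
detections = {"tv": (1850, 540), "speaker": (960, 540)}
ranking = rank_by_centrality(detections, image_width=1920, image_height=1080)
print(ranking)
```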
Further, the smart glasses 101 may obtain the local device list and send a wake-up instruction to the electronic device that has the highest priority in the ranking and is present in the local device list. Because the speaker 106 and the television 107 are both present in the local device list and the speaker 106 ranks higher than the television 107, the smart glasses 101 may determine that the speaker 106 is the electronic device the user wishes to wake up. The smart glasses 101 may then send a wake-up instruction to the speaker 106.
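The selection rule stated above (highest priority among the devices that are also in the local device list) can be sketched as follows. The function name `select_wake_target` and the device labels are hypothetical.

```python
def select_wake_target(ranked_devices, local_device_list):
    """Return the highest-priority device that is in the local device list.

    `ranked_devices` is ordered from highest to lowest priority.
    Returns None if no recognized device is in the list.
    """
    for device in ranked_devices:
        if device in local_device_list:
            return device
    return None


local_devices = {"glasses", "phone", "router", "speaker", "tv"}
print(select_wake_target(["speaker", "tv"], local_devices))
# A recognized device that is not in the list (hypothetical) is skipped.
print(select_wake_target(["neighbor_tv", "tv"], local_devices))
```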
As shown in FIG. 5, upon receiving the wake-up instruction, the speaker 106 may enter the wake-up state from the pre-wake-up state. The speaker 106 may use a voice instruction recognition module to recognize the information contained in the sound that follows the wake-up word. That sound contains the voice instruction "I want to listen to a song", which the voice instruction recognition module of the speaker 106 can recognize. The speaker 106 may then perform the corresponding operation, namely playing music, through a voice instruction execution module. For example, the speaker 106 may reply by voice "No problem" and start playing music. The embodiments of this application do not limit the content of the voice reply of the speaker 106.
The method by which the smart glasses perform image recognition on the image and the method of ranking the electronic devices in the image by priority with a sorting algorithm are described in detail in subsequent embodiments and are not elaborated here.
In a possible implementation, because the smart glasses 101 are connected to the mobile phone 102, the smart glasses 101 may send the priority ranking of the electronic devices in the image to the mobile phone 102. The mobile phone 102 may obtain the local device list and determine, from the local device list and the ranking, the electronic device that the user wishes to wake up, that is, the electronic device that has the highest priority and is present in the local device list. The mobile phone 102 may send a wake-up instruction to that electronic device, either through the router 105 or directly.
The embodiments of this application do not limit how the smart glasses 101 or the mobile phone 102 send the wake-up instruction to the electronic device that the user wishes to wake up.
Optionally, the image recognition and/or the priority ranking of the electronic devices in the image may also be performed by the mobile phone 102. This can effectively save computing resources and power on the smart glasses 101.
In another possible implementation, the smart glasses 101 may be connected to the router 105. When the smart glasses 101 detect that the user needs to wake up another electronic device, they may capture an image. Once the image is obtained, any of the following may be performed by the router 105 instead: the image recognition, the priority ranking of the electronic devices in the image, and the determination of the electronic device that the user wishes to wake up together with the sending of the wake-up instruction to that device. This can effectively save computing resources and power on the smart glasses 101.
As can be seen from the scenario shown in FIG. 5 and FIG. 6, when the smart glasses are being worn, an electronic device with a voice interaction function does not enter the wake-up state immediately after detecting the wake-up word. The smart glasses can be used to determine which electronic device the user wishes to wake up. Once that device is determined, the smart glasses, or an electronic device connected to the smart glasses (such as a mobile phone or a router), may send a wake-up instruction to it. An electronic device that receives the wake-up instruction may enter the wake-up state, whereas an electronic device that detects the wake-up word but receives no wake-up instruction does not. With this device wake-up method, the user can use the smart glasses to wake up exactly the electronic device that the user wishes to wake up. This can effectively reduce false wake-ups and improve the user's experience of the voice interaction function.
FIG. 7 and FIG. 8 show, by way of example, another scenario according to the embodiments of this application in which a user wakes up an electronic device with the help of smart glasses.
As shown in FIG. 7, the electronic devices in a home may include the smart glasses 101, the mobile phone 102, the router 105, the speaker 106, and the television 107. For the connections among these electronic devices, refer to the description of the embodiment shown in FIG. 5; details are not repeated here.
In the scenario shown in FIG. 7, the user has already woken up the speaker 106 with the help of the smart glasses 101 and instructed it by voice to play music. The user now wishes to wake up the mobile phone 102 and instruct it by voice to send a text message. As shown in FIG. 7, the speaker 106 is playing music in response to the user's earlier voice instruction (such as "I want to listen to a song"). The user, wearing the smart glasses 101, looks at the mobile phone 102 and says "Xiaoyi Xiaoyi, send a text message to Lao Zhang". The mobile phone 102, the speaker 106, and the television 107 can each collect the user's voice input through a microphone. The voice input includes the wake-up word "Xiaoyi Xiaoyi" and the voice instruction "send a text message to Lao Zhang".
In a possible implementation, before the wake-up word is detected, the mobile phone 102, the speaker 106, and the television 107 are all in the wake-up word monitoring state. Upon detecting the wake-up word, each of them may enter the pre-wake-up state from the wake-up word monitoring state.
In another possible implementation, the speaker 106 has not yet left the wake-up state that it entered upon receiving the wake-up instruction in the scenario shown in FIG. 5. That is, before the wake-up word shown in FIG. 7 is detected, the mobile phone 102 and the television 107 are in the wake-up word monitoring state and the speaker 106 is in the wake-up state. Upon detecting the wake-up word, the mobile phone 102 and the television 107 may enter the pre-wake-up state from the wake-up word monitoring state, and the speaker 106 may enter the pre-wake-up state from the wake-up state.
In the pre-wake-up state, the mobile phone 102, the speaker 106, and the television 107 may each obtain the local device list and detect that the smart glasses 101 are present in the list and are being worn. The mobile phone 102, the speaker 106, and the television 107 may therefore wait for a wake-up instruction instead of entering the wake-up state immediately.
While being worn, the smart glasses 101 may detect whether the user needs to wake up another electronic device.
For the method by which the smart glasses 101, after detecting that the user needs to wake up another electronic device, determine which electronic device in the local device list the user wishes to wake up, refer to the embodiment shown in FIG. 5.
The smart glasses 101 may capture an image, which is an image of the user's field of view. As shown in FIG. 8, the electronic devices in the image include the mobile phone 102. Following the method of the foregoing embodiments, the mobile phone 102 is determined to be the electronic device that the user wishes to wake up.
The mobile phone 102 may receive the wake-up instruction. Upon receiving it, the mobile phone 102 may enter the wake-up state and recognize the information contained in the sound that follows the wake-up word. That sound contains the voice instruction "send a text message to Lao Zhang", which the voice instruction recognition module of the mobile phone 102 can recognize. The mobile phone 102 may then perform the corresponding operation, namely sending a text message, through a voice instruction execution module. For example, the mobile phone 102 may search the contacts application for a contact named "Lao Zhang". If such a contact exists, the mobile phone 102 may reply by voice "OK, please say the content of the message". The embodiments of this application do not limit the content of the voice reply of the mobile phone 102.
Because the speaker 106 and the television 107 do not receive a wake-up instruction within the preset time period after entering the pre-wake-up state, they may return from the pre-wake-up state to the wake-up word monitoring state.
In some embodiments, the smart glasses 101 may send the captured image to the mobile phone 102 and instruct the mobile phone 102 to determine which electronic device the user wishes to wake up. The mobile phone 102 may perform image recognition on the image and use a sorting algorithm to rank the recognized electronic devices by priority, thereby determining the electronic device that the user wishes to wake up. If the mobile phone 102 determines that this device is itself, it may enter the wake-up state without sending a wake-up instruction. Otherwise, the mobile phone 102 may send a wake-up instruction to the device it has determined.
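The two outcomes on the mobile phone can be sketched as a small dispatch function. The name `dispatch_wake`, the callback parameters, and the returned labels are hypothetical; the callbacks stand in for whatever mechanism the phone actually uses to wake itself or to send the instruction.

```python
def dispatch_wake(self_id, target_id, enter_wake_state, send_wake_instruction):
    """Wake this device directly if it is the target, else notify the target."""
    if target_id is None:
        return "no_target"
    if target_id == self_id:
        enter_wake_state()               # no wake-up instruction needs to be sent
        return "woke_self"
    send_wake_instruction(target_id)     # via the router or directly
    return "sent"


woken, sent = [], []
result_self = dispatch_wake("phone", "phone",
                            lambda: woken.append("phone"), sent.append)
result_other = dispatch_wake("phone", "speaker",
                             lambda: woken.append("phone"), sent.append)
print(result_self, result_other, woken, sent)
```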
As can be seen from the scenario shown in FIG. 7 and FIG. 8, the user can use the smart glasses to wake up the electronic device that the user wishes to wake up. This can effectively reduce false wake-ups and improve the user's experience of the voice interaction function.
In some embodiments, while wearing the smart glasses 101, the user may touch a preset position on the smart glasses 101 (for example, a position on a temple) to trigger the smart glasses 101 to identify the electronic device that the user wishes to wake up and to send a wake-up instruction to that device. After touching the preset position, the user may speak the voice instruction directly, without saying the wake-up word. The electronic device that receives the wake-up instruction may recognize the user's voice instruction and perform the corresponding operation. This method not only reduces false wake-ups but also simplifies the operations needed to control an electronic device by voice, improving the user experience.
FIG. 9A and FIG. 9B show, by way of example, another scenario according to the embodiments of this application in which a user wakes up an electronic device with the help of smart glasses.
As shown in FIG. 9A, the electronic devices in a home may include the smart glasses 101, the mobile phone 102, the router 105, the speaker 106, and the television 107. For the connections among these electronic devices, refer to the description of the embodiment shown in FIG. 5; details are not repeated here.
In the scenario shown in FIG. 9A and FIG. 9B, the user wishes to wake up the speaker 106 and instruct it by voice to play music. The user may wear the smart glasses 101, look at the speaker 106, and touch a preset position on the smart glasses 101 (for example, a position on a temple).
While being worn, the smart glasses 101 may detect whether the user needs to wake up another electronic device.
In a possible implementation, upon detecting a touch operation at the preset position on the smart glasses 101, the smart glasses 101 may determine that the user needs to wake up another electronic device. The smart glasses 101 may then capture an image through a camera. The image is an image of the user's field of view. The smart glasses 101 may perform image recognition on the image to determine the types of electronic devices it contains, and may then use a sorting algorithm to rank the electronic devices in the image by priority. For the image captured by the smart glasses 101 in the scenario shown in FIG. 9A, refer to the image shown in FIG. 6 (which contains the speaker 106 and the television 107). For example, the ranking may give the speaker 106 a higher priority than the television 107.
本申请实施例对上述智能眼镜101的预设位置不作限定。The embodiment of the present application does not limit the preset position of the above-mentioned smart glasses 101 .
智能眼镜101可以获取本地设备列表,并向上述优先级排序最高且存在与本地设备列表中的电子设备发送唤醒指令。由于音箱106和电视107均存在于本地设备列表,且在上述优先级排序中,音箱106的优先级高于电视107的优先级,智能眼镜101可以确定音箱106为用户希望唤醒的电子设备。那么,智能眼镜101可以向音箱106发送唤醒指令。The smart glasses 101 may obtain a local device list, and send a wake-up instruction to the above-mentioned electronic device with the highest priority and existing in the local device list. Since the speaker 106 and the TV 107 both exist in the local device list, and in the above priority sorting, the priority of the speaker 106 is higher than that of the TV 107, the smart glasses 101 can determine that the speaker 106 is the electronic device that the user wants to wake up. Then, the smart glasses 101 may send a wake-up instruction to the speaker 106 .
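The selection step described above can be sketched as follows. This is a minimal illustration, not the disclosed implementation: the function name `select_wake_target`, the string device identifiers, and the choice to skip recognized devices that are missing from the local device list are all assumptions made for the example.

```python
from typing import Optional, Sequence, Set


def select_wake_target(ranked_devices: Sequence[str],
                       local_device_list: Set[str]) -> Optional[str]:
    """Pick the device that should receive the wake-up instruction.

    ranked_devices: devices recognized in the image, ordered from highest
        to lowest priority (hypothetical output of the ranking algorithm).
    local_device_list: identifiers of devices in the local device list.
    Returns the highest-priority recognized device that is also in the
    local device list, or None if there is no such device.
    """
    for device in ranked_devices:
        if device in local_device_list:
            return device
    return None


# Scenario of FIG. 9A: the speaker outranks the television and both are listed.
local_devices = {"phone_102", "router_105", "speaker_106", "tv_107"}
target = select_wake_target(["speaker_106", "tv_107"], local_devices)
```

With the inputs above, `target` is `"speaker_106"`, matching the scenario in which the smart glasses send the wake-up instruction to the speaker 106.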
In a possible implementation, an electronic device that has not detected the wake-up word but has received the wake-up instruction may directly enter the wake-up state. As shown in FIG. 9A, the mobile phone 102, the speaker 106, and the television 107 are all in the wake-up word monitoring state. On receiving the wake-up instruction from the smart glasses 101, the speaker 106 may enter the wake-up state.
After entering the wake-up state, the speaker 106 may use its voice command recognition module to recognize information in the ambient sound captured after the wake-up instruction was received. If no voice command is recognized in the ambient sound, the speaker 106 may reply by voice "I'm here" to tell the user that it has been woken up and can execute the user's voice commands. This embodiment of the present application does not limit the content of the voice reply that the speaker 106 gives after receiving the wake-up instruction. If a voice command such as "I want to listen to a song" is recognized in the ambient sound, the speaker 106 may reply by voice "No problem" and start playing music.
As shown in FIG. 9B, after hearing the speaker 106 reply "I'm here", the user may issue the voice command "I want to listen to a song" to the speaker 106. The speaker 106 may recognize the voice command through the voice command recognition module and perform the corresponding operation, that is, playing music, through the voice command execution module. For example, after recognizing the voice command "I want to listen to a song", the speaker 106 may reply by voice "No problem" and start playing music.
Optionally, after entering the wake-up state, the speaker 106 may use the voice command recognition module to recognize information in the ambient sound captured from a time A that precedes receipt of the wake-up instruction. For example, the user may say the voice command "I want to listen to a song" while looking at the speaker 106 and touching the preset position on the smart glasses 101. In other words, the user may have spoken the voice command before the speaker 106 received the wake-up instruction. By performing voice command recognition on ambient sound captured from a period before the wake-up instruction was received, the speaker 106 can reduce missed voice commands and improve the user experience.
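One way to make audio from before time A available is to keep a short rolling buffer of recent frames. The sketch below is an assumption for illustration only: the class name `AudioPreRollBuffer`, the timestamped-frame representation, and the fixed window length are not taken from the application.

```python
from collections import deque
from typing import Deque, List, Tuple


class AudioPreRollBuffer:
    """Keep only the most recent `pre_roll_seconds` of audio frames."""

    def __init__(self, pre_roll_seconds: float) -> None:
        self.pre_roll_seconds = pre_roll_seconds
        self._frames: Deque[Tuple[float, bytes]] = deque()

    def push(self, timestamp: float, frame: bytes) -> None:
        """Store a frame and drop frames older than the pre-roll window."""
        self._frames.append((timestamp, frame))
        while timestamp - self._frames[0][0] > self.pre_roll_seconds:
            self._frames.popleft()

    def frames_since(self, time_a: float) -> List[bytes]:
        """Return buffered frames captured at or after time A."""
        return [frame for ts, frame in self._frames if ts >= time_a]


# Frames arrive once per second; the wake-up instruction arrives at t = 5.
buffer = AudioPreRollBuffer(pre_roll_seconds=2.0)
for t in range(6):
    buffer.push(float(t), bytes([t]))
wake_instruction_time = 5.0
to_recognize = buffer.frames_since(wake_instruction_time - 2.0)
```

Here `to_recognize` holds the frames captured at t = 3, 4, and 5, so a command spoken shortly before the wake-up instruction arrived would still be passed to recognition.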
It should be noted that the smart glasses 101 may be connected to the mobile phone 102 or to the router 105. The image recognition described above and/or the prioritization of the electronic devices in the image may also be performed by the mobile phone 102 or by the router 105. This can effectively save computing resources and power on the smart glasses 101.
As the scenario shown in FIG. 9A and FIG. 9B illustrates, a user who wishes to control an electronic device by voice may wear the smart glasses, look at that electronic device, touch the preset position on the smart glasses, and speak a voice command. On determining that the user wishes to wake up another electronic device, the smart glasses may determine which electronic device in the local device list the user wishes to wake up, and wake up that device. In this scenario, the user can issue a voice command directly without speaking the wake-up word. This not only reduces false wake-ups but also simplifies the operations needed to control an electronic device by voice, improving the user experience.
In some embodiments, a user wearing the smart glasses 101 may touch the preset position on the smart glasses 101 (for example, a position on the temple) to trigger the smart glasses 101 to identify the electronic device that the user wishes to wake up. In addition, the user may speak the wake-up word before speaking the voice command. The electronic device that receives the wake-up instruction may then enter the wake-up state. If the wake-up word is detected, the electronic device that has entered the wake-up state may use the voice command recognition module to determine whether the sound following the wake-up word contains a voice command. When a voice command is recognized, the electronic device may perform the corresponding operation through the voice command execution module.
In some embodiments, when the wake-up word is detected, the smart glasses 101 may capture an image and determine from it the electronic device that the user wishes to wake up. In a scenario where user A and user B are both at home, user A is wearing the smart glasses, and user B speaks the wake-up word to wake up a device, the image captured by the smart glasses 101 may contain no electronic device. On detecting the wake-up word (that is, user B's voice input), the smart glasses 101 may capture an image and perform image recognition on it. On recognizing that the image contains no electronic device, the smart glasses 101 may send an indication message to the electronic devices in the local device list (such as the mobile phone 102, the speaker 106, and the television 107). The indication message indicates that the smart glasses 101 will not send a wake-up instruction. On detecting the same wake-up word (that is, user B's voice input), each electronic device whose voice interaction function is enabled (such as the mobile phone 102, the speaker 106, and the television 107) may check whether the local device list contains smart glasses and determine whether the smart glasses are being worn. If the local device list contains smart glasses and the smart glasses are being worn, the electronic device may wait for a wake-up instruction. On receiving the indication message from the smart glasses, the electronic devices whose voice interaction function is enabled may all enter the wake-up state, or one of them may enter the wake-up state. For example, these electronic devices may negotiate, based on the strength of the sound signal containing the wake-up word that each of them received, to select the device that received the strongest signal. That device may enter the wake-up state, and the other devices may return from the pre-wake-up state to the wake-up word monitoring state.
In other words, if a user who is not wearing smart glasses uses the wake-up word to wake up a device, and no electronic device with the voice interaction function enabled is within the field of view of the user who is wearing the smart glasses, at least one of the electronic devices with the voice interaction function enabled may enter the wake-up state in response to the wake-up word. This avoids the situation in which a user without smart glasses cannot wake up a device with the wake-up word merely because a user wearing smart glasses is in the same environment.
As the foregoing embodiments show, an electronic device that has a voice interaction function and has it enabled may continuously monitor whether the ambient sound contains the wake-up word. On detecting the wake-up word, the electronic device may enter the pre-wake-up state.
The following describes in detail how an electronic device monitors whether the ambient sound contains the wake-up word.
In some embodiments, the electronic device may capture ambient sound through a microphone. When the user speaks a wake-up utterance (such as "Xiao Yi Xiao Yi") near the electronic device, the ambient sound may contain the wake-up utterance. After capturing the ambient sound, the electronic device may separate the user's wake-up utterance from it. The electronic device may then use an acoustic model to decode a phoneme sequence from the wake-up utterance, and determine whether the decoded phoneme sequence matches a stored wake-up word phoneme sequence. If it does, the wake-up utterance contains the wake-up word. On determining that the ambient sound contains the wake-up word, the electronic device may enter the pre-wake-up state. This embodiment of the present application does not specifically limit the wake-up utterance.
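The phoneme comparison in the last step can be illustrated with a small sketch. The application does not state the matching criterion, so the exact contiguous match used here is an assumption, as are the function name `contains_wake_word` and the pinyin-like phoneme symbols; a practical system would more likely score an approximate match.

```python
from typing import Sequence


def contains_wake_word(decoded: Sequence[str],
                       wake_word: Sequence[str]) -> bool:
    """Return True if wake_word occurs as a contiguous run inside decoded."""
    m = len(wake_word)
    if m == 0:
        return False
    target = list(wake_word)
    return any(list(decoded[i:i + m]) == target
               for i in range(len(decoded) - m + 1))


# Hypothetical phoneme symbols for the stored wake-up word.
STORED_WAKE_WORD = ["x", "iao", "y", "i", "x", "iao", "y", "i"]

# Hypothetical decoder output: leading noise token, then the wake-up word.
decoded_phonemes = ["sil", "x", "iao", "y", "i", "x", "iao", "y", "i"]
detected = contains_wake_word(decoded_phonemes, STORED_WAKE_WORD)
```

If `detected` is true, the device would enter the pre-wake-up state as described above.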
In other embodiments, the electronic device may capture ambient sound through a microphone. When the user speaks a wake-up utterance (such as "Xiao Yi Xiao Yi") near the electronic device, the ambient sound may contain the wake-up utterance. After capturing the ambient sound, the electronic device may separate the user's wake-up utterance from it. The electronic device may then use an acoustic model to decode a phoneme sequence from the wake-up utterance. Next, using a speech model and its pronunciation dictionary, the electronic device may further decode text from the decoded phoneme sequence. After decoding the text, the electronic device may determine whether the text decoded from the wake-up utterance contains the stored wake-up word text. If it does, the wake-up utterance contains the wake-up word. On determining that the ambient sound contains the wake-up word, the electronic device may enter the pre-wake-up state.
In a possible implementation, the electronic device may extract the wake-up word and the user's voiceprint features from the user's wake-up utterance. When the wake-up word matches a stored wake-up word template and the user's voiceprint features match a stored voiceprint feature template, the electronic device may enter the pre-wake-up state. In this way, only a specific user can wake up the electronic device and control it by voice command, which improves the information security of the electronic device.
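The combined check of wake-up word and voiceprint can be sketched as below. The application does not say how voiceprint features are represented or compared. Treating them as numeric vectors compared by cosine similarity against a threshold is an assumption, and the names `cosine_similarity`, `should_pre_wake`, and the threshold value 0.8 are illustrative only.

```python
import math
from typing import Sequence


def cosine_similarity(a: Sequence[float], b: Sequence[float]) -> float:
    """Cosine similarity of two equal-length vectors (0.0 if either is zero)."""
    dot = sum(x * y for x, y in zip(a, b))
    norm = math.sqrt(sum(x * x for x in a)) * math.sqrt(sum(y * y for y in b))
    return dot / norm if norm else 0.0


def should_pre_wake(wake_word_matched: bool,
                    voiceprint: Sequence[float],
                    enrolled_voiceprint: Sequence[float],
                    threshold: float = 0.8) -> bool:
    """Enter pre-wake-up only if both the wake-up word and the voiceprint match."""
    if not wake_word_matched:
        return False
    return cosine_similarity(voiceprint, enrolled_voiceprint) >= threshold


enrolled = [1.0, 0.0]
same_speaker = should_pre_wake(True, [1.0, 0.0], enrolled)
other_speaker = should_pre_wake(True, [0.0, 1.0], enrolled)
```

In this toy example `same_speaker` is true and `other_speaker` is false, so only the enrolled speaker would bring the device into the pre-wake-up state.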
This embodiment of the present application does not limit the specific method by which the electronic device monitors whether the ambient sound contains the wake-up word.
In some embodiments, an electronic device in the wake-up word monitoring state or in the pre-wake-up state may enter the wake-up state after receiving a wake-up instruction. The wake-up instruction may be sent by the smart glasses or by another electronic device that has a communication connection with the smart glasses. The wake-up instruction instructs the electronic device to enter the wake-up state.
In the wake-up state, the electronic device may start a speech recognition application. Specifically, starting the speech recognition application may mean that the electronic device starts the voice command recognition module and the voice command execution module in its application processor. The electronic device may recognize the user's voice command in the ambient sound through the voice command recognition module and perform the corresponding operation through the voice command execution module. In the wake-up state, the electronic device may also monitor in real time whether the ambient sound contains the wake-up word. If an electronic device in the wake-up state detects the wake-up word, it may move from the wake-up state to the pre-wake-up state.
It can be understood that an electronic device in the wake-up word monitoring state and an electronic device in the wake-up state can both monitor in real time whether the ambient sound contains the wake-up word. However, an electronic device in the wake-up word monitoring state cannot recognize the user's voice commands or perform the operations corresponding to them. For example, in the wake-up word monitoring state, the application processor of the electronic device is in a sleep state.
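The state transitions described in the preceding paragraphs can be summarized as a small transition table. This is a simplified sketch: the enum and function names are hypothetical, and events not mentioned in these paragraphs (such as a waiting timeout) are deliberately left out.

```python
from enum import Enum, auto


class State(Enum):
    WAKE_WORD_MONITORING = auto()
    PRE_WAKE_UP = auto()
    WAKE_UP = auto()


class Event(Enum):
    WAKE_WORD_DETECTED = auto()
    WAKE_INSTRUCTION_RECEIVED = auto()


# Transitions taken from the description above.
TRANSITIONS = {
    (State.WAKE_WORD_MONITORING, Event.WAKE_WORD_DETECTED): State.PRE_WAKE_UP,
    (State.WAKE_WORD_MONITORING, Event.WAKE_INSTRUCTION_RECEIVED): State.WAKE_UP,
    (State.PRE_WAKE_UP, Event.WAKE_INSTRUCTION_RECEIVED): State.WAKE_UP,
    (State.WAKE_UP, Event.WAKE_WORD_DETECTED): State.PRE_WAKE_UP,
}


def next_state(state: State, event: Event) -> State:
    """Return the state after the event; unlisted pairs leave the state unchanged."""
    return TRANSITIONS.get((state, event), state)


# A device hears the wake-up word, then receives a wake-up instruction.
state = State.WAKE_WORD_MONITORING
state = next_state(state, Event.WAKE_WORD_DETECTED)
state = next_state(state, Event.WAKE_INSTRUCTION_RECEIVED)
```

After these two events `state` is `State.WAKE_UP`. Keeping the transitions in a table makes it easy to see that a wake-up instruction leads to the wake-up state from either of the other two states.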
In some embodiments, after detecting the wake-up word, an electronic device that has a voice interaction function and has it enabled may decide whether to enter the wake-up state immediately by checking whether smart glasses exist in the local device list and whether the smart glasses are being worn. It can be understood that when smart glasses exist in the local device list and are being worn, the user is more likely to wake up an electronic device by means of the smart glasses. If no smart glasses exist in the local device list (for example, the user has no smart glasses), or smart glasses exist in the local device list but are not being worn, the user is less likely to wake up an electronic device by means of the smart glasses. In that case, an electronic device that has detected the wake-up word and entered the pre-wake-up state may determine whether to enter the wake-up state according to the method, provided in the embodiments of the present application, for waking up an electronic device directly with the wake-up word.
The following describes, with reference to a flowchart, the method provided in an embodiment of the present application by which an electronic device determines whether to enter the wake-up state according to whether smart glasses are present.
As shown in FIG. 10, the method may include steps S101 to S106, as follows:
S101. The electronic device detects the wake-up word.
The electronic device may be any electronic device that exists in the local device list and has its voice interaction function enabled. For how the electronic device captures ambient sound and determines whether it contains the wake-up word, refer to the foregoing embodiments. Details are not repeated here.
On detecting the wake-up word, the electronic device may enter the pre-wake-up state.
S102. The electronic device queries whether smart glasses exist in the local device list.
In the pre-wake-up state, the electronic device may obtain the local device list. For how the local device list is updated and stored, refer to the foregoing embodiments. For example, electronic devices such as a mobile phone, a speaker, and a television are all connected to a router and access the same home Wi-Fi network. The mobile phone, the speaker, the television, and the router all exist in one local device list. If the smart glasses establish a Bluetooth connection with the mobile phone, the smart glasses may be added to the electronic devices in the local device list. In this way, an electronic device (such as the mobile phone, the speaker, or the television) can find that smart glasses exist in the local device list.
It can be understood that if the user has no smart glasses, or the smart glasses have not established a communication connection with any electronic device in the local device list (for example, the smart glasses are powered off), no smart glasses exist in the local device list.
S103. If smart glasses are found in the local device list, the electronic device may determine whether the smart glasses are being worn.
It can be understood that an image captured while the smart glasses are being worn can be treated as the image within the user's field of view. If the smart glasses are not being worn, an image they capture cannot be regarded as the image within the user's field of view. In other words, only smart glasses that are being worn can determine relatively accurately which electronic device the user wishes to wake up.
After finding that smart glasses exist in the local device list, the electronic device may further determine whether the smart glasses are being worn. In a possible implementation, the smart glasses have a Bluetooth connection with the mobile phone. Electronic devices such as the speaker and the television may obtain the wearing state of the smart glasses through the mobile phone. The mobile phone may send a message to the smart glasses to ask whether they are being worn.
The smart glasses may perform wearing detection. For example, the smart glasses may use a gyroscope sensor and an acceleration sensor to detect their own posture and thereby determine whether they are being worn. Alternatively, the smart glasses may use eye tracking to determine whether they are being worn. This embodiment of the present application does not limit the method by which the smart glasses perform wearing detection. For specific implementations of wearing detection, refer to the prior art. Details are not described in this embodiment.
In response to the message from the mobile phone asking whether the smart glasses are being worn, the smart glasses may send the wearing detection result to the mobile phone. In this way, electronic devices such as the mobile phone, the speaker, and the television can obtain the wearing state of the smart glasses.
In some embodiments, step S103 is optional. For example, after determining that smart glasses exist in the local device list, the electronic device may directly perform step S104 below.
S104. If the smart glasses are being worn, the electronic device may determine whether a wake-up instruction is received within a preset time period.
On determining that the smart glasses are being worn, the electronic device may wait for a wake-up instruction. After determining the electronic device that the user wishes to wake up, the smart glasses may send a wake-up instruction to that electronic device.
In some embodiments, the electronic device may wait for a preset time period after determining that the smart glasses are being worn. If a wake-up instruction is received within the preset time period, the electronic device may perform step S105 below. If no wake-up instruction is received within the preset time period, the electronic device may perform step S106 below. This embodiment of the present application does not limit the length of the preset time period.
S105. If a wake-up instruction is received within the preset time period, or if no smart glasses exist in the local device list, or if smart glasses exist in the local device list but are not being worn, the electronic device may enter the wake-up state.
If the electronic device receives a wake-up instruction within the preset time period, the electronic device is the target wake-up device (that is, the electronic device that the user wishes to wake up). In response to the wake-up instruction, the electronic device may enter the wake-up state.
If no smart glasses exist in the local device list, or smart glasses exist in the local device list but are not being worn, the user will not use the smart glasses to wake up another electronic device. In that case, the electronic device may enter the wake-up state in response to the detected wake-up word.
Optionally, if no smart glasses exist in the local device list, or smart glasses exist in the local device list but are not being worn, the multiple electronic devices that detected the wake-up word may communicate with one another and negotiate so that one of them enters the wake-up state while the others do not.
In a possible implementation, each of these electronic devices may determine the strength of the sound signal corresponding to the wake-up word that it detected. It can be understood that the stronger the sound signal an electronic device receives, the closer that electronic device is to the user. The electronic devices may send one another information containing the strength of the sound signal each received. The electronic devices may then negotiate to identify the device that received the strongest sound signal. That device may enter the wake-up state, and the other electronic devices may remain out of the wake-up state.
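The negotiation described above can be sketched as each device running the same deterministic rule over the exchanged strength reports. The function name `elect_wake_device`, the device identifiers, the strength unit, and the tie-break on identifier are assumptions; the application does not specify how ties are resolved.

```python
from typing import Dict


def elect_wake_device(reported_strength: Dict[str, float]) -> str:
    """Return the device that received the strongest wake-up word signal.

    reported_strength maps a device identifier to the signal strength it
    measured (must be non-empty). Ties are broken by the smallest identifier
    so that every device evaluating the same reports elects the same winner.
    """
    # max() keeps the first maximal item, so iterating in sorted order
    # resolves ties in favor of the smallest identifier.
    return max(sorted(reported_strength), key=lambda d: reported_strength[d])


# Hypothetical strength reports (for example in dB relative to full scale).
reports = {"phone_102": -20.0, "speaker_106": -12.5, "tv_107": -30.0}
winner = elect_wake_device(reports)
```

Here `winner` is `"speaker_106"`. Each device would compare `winner` with its own identifier to decide whether to enter the wake-up state.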
This embodiment of the present application does not limit how the electronic device responds to the detected wake-up word once it has determined that the user will not wake up an electronic device through the smart glasses (for example, when no smart glasses exist in the local device list, or when smart glasses exist in the local device list but are not being worn). For a specific implementation, refer to the method, provided in the embodiments of the present application, by which multiple electronic devices with the voice interaction function enabled in one environment respond to a detected wake-up word. Details are not described in this embodiment.
S106. If no wake-up instruction is received within the preset time period, the electronic device may enter the wake-up word monitoring state.
If the electronic device determines that the smart glasses are being worn but receives no wake-up instruction within the preset time period, the electronic device is not the target wake-up device. The electronic device may return from the pre-wake-up state to the wake-up word monitoring state.
As the method shown in FIG. 10 illustrates, after detecting the wake-up word, an electronic device with the voice interaction function enabled may first determine whether the user will wake up a device by means of smart glasses. If it determines that the user will, the electronic device may wait for a wake-up instruction. If it determines that the user will not, the electronic device may respond to the detected wake-up word. In this way, a user wearing smart glasses can use them to wake up the electronic device of their choice, and a user not wearing smart glasses can wake up an electronic device directly with the wake-up word.
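The decision flow of steps S102 to S106 can be condensed into a single function. This is a sketch under assumptions: the three inputs are taken as already-known booleans, so the waiting during the preset time period and the optional negotiation among devices are not modeled, and the names are hypothetical.

```python
from enum import Enum


class Outcome(Enum):
    WAKE_UP = "wake_up"
    WAKE_WORD_MONITORING = "wake_word_monitoring"


def decide_after_wake_word(glasses_in_list: bool,
                           glasses_worn: bool,
                           instruction_received_in_window: bool) -> Outcome:
    """Outcome for a device in the pre-wake-up state, following FIG. 10."""
    if not glasses_in_list:                 # S102 -> S105
        return Outcome.WAKE_UP
    if not glasses_worn:                    # S103 -> S105
        return Outcome.WAKE_UP
    if instruction_received_in_window:      # S104 -> S105
        return Outcome.WAKE_UP
    return Outcome.WAKE_WORD_MONITORING     # S104 -> S106


# Glasses are worn, but this device received no wake-up instruction in time.
result = decide_after_wake_word(True, True, False)
```

In this example `result` is `Outcome.WAKE_WORD_MONITORING`, which corresponds to a device that is not the target wake-up device.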
In some embodiments, an electronic device with a voice interaction function may have a smart wake-up switch. Alternatively, a smart home app used to control the electronic device with the voice interaction function may have a smart wake-up control. The smart wake-up control may be used to turn the smart wake-up switch off or on. When the smart wake-up switch is on, the electronic device may perform the method shown in FIG. 10 after detecting the wake-up word. In this way, the user can conveniently wake up the electronic device whether or not the user is wearing smart glasses. With smart glasses, the user can more accurately wake up the electronic device of their choice.
With reference to the foregoing scenarios of waking up a device by means of smart glasses, the following describes a schematic structural diagram of smart glasses provided in an embodiment of the present application.
As shown in FIG. 11, the smart glasses may include a user behavior recognition module 1101, an image capture module 1102, an image recognition module 1103, a device priority determination module 1104, and a device wake-up module 1105. These modules may be coupled to one another through a bus, where:
The user behavior recognition module 1101 may be used to detect whether the user needs to wake up another electronic device.
The user behavior recognition module 1101 may include, but is not limited to, a pressure sensor, a voice recognition sensor, and a tilt sensor.
在一种可能的实现方式中,用户行为识别模块1101可以通过语音识别传感器识别采集到的环境声音中是否包含预设的唤醒词。当监听到环境声音中包含预设的唤醒词,用户行为识 别模块1101可以确定用户需要唤醒其它电子设备。那么,智能眼镜可以通过图像采集模块1102进行图像采集。In a possible implementation manner, the user behavior recognition module 1101 may use a voice recognition sensor to recognize whether the collected ambient sound contains a preset wake-up word. When listening to the preset wake-up word in the ambient sound, the user behavior recognition module 1101 can determine that the user needs to wake up other electronic devices. Then, the smart glasses can collect images through the image collection module 1102 .
其中,智能眼镜中可存储有上述预设的唤醒词。若用于唤醒与智能眼镜在同一个本地设备列表中语音交互功能开启的电子设备的唤醒词被更新(例如用户重新设置唤醒词),智能眼镜中存储的唤醒词也可以同步更新。示例性的,智能眼镜与手机建立有通信连接。手机中安装有控制音箱、电视等电子设备的智能家居APP。响应于作用在该智能家居APP中用于重新设置音箱的唤醒词的用户操作,用于唤醒音箱的唤醒词可以被修改。手机中可存储有修改后用于唤醒音箱的唤醒词。智能眼镜可以从手机获取上述修改后用于唤醒音箱的唤醒词。本申请实施例对智能眼镜获取用于唤醒电子设备的唤醒词的方法不作限定。Wherein, the aforementioned preset wake-up words may be stored in the smart glasses. If the wake-up word used to wake up the electronic device whose voice interaction function is enabled in the same local device list as the smart glasses is updated (for example, the user resets the wake-up word), the wake-up word stored in the smart glasses can also be updated synchronously. Exemplarily, the smart glasses establish a communication connection with the mobile phone. A smart home APP that controls electronic devices such as speakers and TVs is installed in the mobile phone. The wake-up word for waking up the speaker may be modified in response to a user operation acting on the smart home APP for resetting the wake-up word for the speaker. A modified wake-up word for waking up the speaker can be stored in the mobile phone. The smart glasses can obtain the above-mentioned modified wake-up word for waking up the speaker from the mobile phone. The embodiment of the present application does not limit the method for the smart glasses to acquire the wake-up word for waking up the electronic device.
In a possible implementation, the user behavior recognition module 1101 may use the pressure sensor to detect whether the user touches a preset position on the smart glasses. For example, when it detects that a position on the temple has been touched twice, the user behavior recognition module 1101 may determine that the user needs to wake up another electronic device. The smart glasses may then capture an image through the image acquisition module 1102.
The embodiments of this application do not limit how the user behavior recognition module 1101 detects whether the user needs to wake up another electronic device. For example, the user behavior recognition module 1101 may also detect this by determining whether the user blinks in a preset manner.
It should be noted that the smart glasses may also include a wearing detection module (not shown in FIG. 11). Before the user behavior recognition module 1101 performs detection, the wearing detection module may detect whether the smart glasses are being worn. If the smart glasses are being worn, the user behavior recognition module 1101 may perform detection. If the smart glasses are not being worn, the smart glasses may be in a dormant state, meaning that all components of the smart glasses other than the wearing detection module are dormant. This reduces the power consumption of the smart glasses. For how the wearing detection module detects whether the smart glasses are being worn, refer to the foregoing embodiments. Details are not repeated here.
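The gating described above can be illustrated with a minimal sketch. It assumes the wearing state, the recognized ambient speech, and a touch count are already available as plain values; the function name, its parameters, and the constant are hypothetical and do not come from the application.

```python
REQUIRED_TAPS = 2  # mirrors the example of one temple position touched twice


def should_capture_image(is_worn, heard_text, wake_words, temple_taps):
    """Return True if the glasses should start image capture.

    All names here are hypothetical; the application only describes the behavior.
    """
    # Wearing detection gates everything else: unworn glasses stay dormant.
    if not is_worn:
        return False
    heard = heard_text.lower()
    # Trigger 1: a stored wake-up word appears in the recognized ambient speech.
    if any(word and word.lower() in heard for word in wake_words):
        return True
    # Trigger 2: the pressure sensor reports the preset double touch.
    return temple_taps == REQUIRED_TAPS
```

Other triggers mentioned in the text, such as a preset blink pattern, could be added as further checks after the wearing-state check.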
The image acquisition module 1102 may be used to capture images.
The image acquisition module 1102 may include, but is not limited to, a camera.
When the smart glasses are being worn, an image captured through the image acquisition module 1102 may correspond to the image within the user's field of view (for example, the images shown in FIG. 6 and FIG. 8 in the foregoing embodiments). The embodiments of this application do not limit where the camera of the image acquisition module 1102 is mounted on the smart glasses.
The image recognition module 1103 may be used to perform image recognition on the image and determine the electronic devices contained in it.
The image recognition module 1103 may include a device recognition model. The device recognition model may be a neural network model and may be obtained through offline training. The input of the device recognition model may include an image, which may contain one or more electronic devices. The output of the device recognition model may include, but is not limited to, the following features: the type of the electronic device, the recognition accuracy, and the viewing angle deviation.
A trained device recognition model may be stored in the smart glasses before they leave the factory. Optionally, the device recognition model may be updated: the smart glasses may obtain an updated device recognition model from the server used to train it.
The image recognition module 1103 may use the device recognition model to determine features such as the type, recognition accuracy, and viewing angle deviation of each electronic device in the image. The type of an electronic device may include its category and its specific model. For example, the image recognition module 1103 determines that the type of the speaker 106 shown in FIG. 6 is speaker Sound X and that the type of the TV 107 is smart screen S Pro 65. Here, speaker is the category of the speaker 106, and Sound X is its specific model. The recognition accuracy may represent how accurately the type of an electronic device in the image is recognized. The higher the recognition accuracy of an electronic device, the more likely its actual type is the type determined by the image recognition module 1103. For example, the higher the recognition accuracy with which the image recognition module 1103 determines that the speaker 106 shown in FIG. 6 is a speaker Sound X, the more likely the speaker 106 is a speaker Sound X. The viewing angle deviation may represent the distance between the electronic device and the center of the user's field of view. The smaller the viewing angle deviation of an electronic device, the closer it is to the center of the user's field of view. The larger the viewing angle deviation, the closer it is to the edge of the user's field of view. It can be understood that the viewing angle deviation of an electronic device can be determined from its position in the image captured by the image acquisition module 1102. The closer the electronic device is to the center of the image, the smaller its viewing angle deviation. The farther it is from the center of the image, the larger its viewing angle deviation.
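One way to derive a viewing angle deviation from a device's position in the image is sketched below. The application does not specify the computation, so this is only an assumption: it takes a hypothetical bounding box in pixels and returns the distance between the box center and the image center, normalized so that 0.0 is the image center and 1.0 is an image corner.

```python
import math


def viewing_angle_deviation(box, image_width, image_height):
    """Normalized distance of a detected device from the image center.

    box is a hypothetical (x_min, y_min, x_max, y_max) tuple in pixels.
    Returns 0.0 at the image center and 1.0 at an image corner.
    """
    if image_width <= 0 or image_height <= 0:
        raise ValueError("image dimensions must be positive")
    x_min, y_min, x_max, y_max = box
    # Center of the detected device in the image.
    center_x = (x_min + x_max) / 2
    center_y = (y_min + y_max) / 2
    # Offset from the image center, which stands in for the center of view.
    dx = center_x - image_width / 2
    dy = center_y - image_height / 2
    # Normalize by the center-to-corner distance so the result lies in [0, 1].
    half_diagonal = math.hypot(image_width / 2, image_height / 2)
    return math.hypot(dx, dy) / half_diagonal
```

Normalizing by the half diagonal keeps the value independent of image resolution, which makes it easier to combine with other features on a common scale.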
The embodiments of this application do not limit the specific training method of the device recognition model.
In some embodiments, the output of the device recognition model may instead include, but is not limited to, the following features: the category of the electronic device, the recognition accuracy, and the viewing angle deviation. That is, when recognizing the electronic devices contained in the image, the image recognition module 1103 may recognize only the category of each electronic device (for example, an electronic device whose category is speaker) without identifying its specific model. In the subsequent process, the device priority determination module 1104 may then determine the priority of each electronic device according to its category, recognition accuracy, and viewing angle deviation.
After obtaining features such as the type, recognition accuracy, and viewing angle deviation of the electronic devices in the image, the image recognition module 1103 may pass these features to the device priority determination module 1104.
The device priority determination module 1104 may be used to prioritize the electronic devices that the image recognition module 1103 identified in the image.
The device priority determination module 1104 may use a ranking algorithm to prioritize the electronic devices contained in the image.
In a possible implementation, the ranking algorithm may be $Y = \beta_1 \cdot \mathrm{type} + \beta_2 \cdot \alpha + \beta_3 \cdot \theta$, where $Y$ may represent the priority of the electronic device. The larger the value of $Y$, the higher the priority of the electronic device. $\mathrm{type}$ may represent a type priority value determined according to the type of the electronic device, $\alpha$ may represent the recognition accuracy of the electronic device, and $\theta$ may represent the viewing angle deviation of the electronic device. $\beta_1$, $\beta_2$, and $\beta_3$ may represent the weights of the type priority value, the recognition accuracy, and the viewing angle deviation, respectively. $\beta_1$, $\beta_2$, and $\beta_3$ are all positive numbers less than 1, and their sum may be 1. Their values may be set empirically. Optionally, the values of $\beta_1$, $\beta_2$, and $\beta_3$ may be updated by an optimization algorithm so that the electronic device with the highest priority is the one the user wishes to wake up. The embodiments of this application do not specifically limit the values of $\beta_1$, $\beta_2$, and $\beta_3$.
It can be understood that, in response to a detected wake-up word, multiple electronic devices with voice interaction functions may be woken up in a priority order based on category, for example: speaker > TV > tablet > mobile phone. That is, the values of the feature type in the ranking algorithm may be ordered as follows: type of a speaker > type of a TV > type of a tablet > type of a mobile phone. The embodiments of this application do not limit this wake-up priority order determined according to the type of the electronic device.
The higher the recognition accuracy of an electronic device, the more likely it is to match an electronic device in the local device list, and therefore the more likely it is to be woken up successfully.
The smaller the viewing angle deviation of an electronic device, the closer it is to the center of the user's field of view, and the more likely it is the electronic device that the user wishes to wake up.
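The weighted ranking above can be sketched as follows. The type priority values and the weights are hypothetical placeholders, since the application leaves them open. The application also does not say how the viewing angle term is scaled. Because a smaller deviation should raise the priority while a larger Y means a higher priority, this sketch assumes the θ term is supplied as 1 minus a deviation normalized to [0, 1]. That scaling is an assumption of the sketch, not something the application states.

```python
from dataclasses import dataclass

# Hypothetical type priority values following speaker > TV > tablet > phone.
TYPE_PRIORITY = {"speaker": 1.0, "tv": 0.75, "tablet": 0.5, "phone": 0.25}
# Hypothetical weights (beta_1, beta_2, beta_3): positive, below 1, summing to 1.
WEIGHTS = (0.3, 0.3, 0.4)


@dataclass
class Detection:
    name: str         # label used in the priority list, e.g. "Speaker 106"
    category: str     # key into TYPE_PRIORITY
    accuracy: float   # recognition accuracy in [0, 1]
    deviation: float  # viewing angle deviation, 0 = center, 1 = edge (assumed)


def priority(detection, weights=WEIGHTS):
    """Y = beta_1 * type + beta_2 * alpha + beta_3 * theta."""
    beta_1, beta_2, beta_3 = weights
    # Assumption: theta enters as a closeness score so that larger is better.
    theta_term = 1.0 - detection.deviation
    return (beta_1 * TYPE_PRIORITY[detection.category]
            + beta_2 * detection.accuracy
            + beta_3 * theta_term)


def priority_list(detections):
    """Return the detections sorted from highest to lowest priority."""
    return sorted(detections, key=priority, reverse=True)
```

With these placeholder values, a well-centered speaker outranks an off-center TV, which matches the ordering the text expects for the scene in FIG. 6.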
The features are not limited to the type, recognition accuracy, and viewing angle deviation described above; the image recognition module 1103 may also extract additional features of the electronic devices from the image. The device priority determination module 1104 may determine the priority of each electronic device according to more or fewer features. For example, the device priority determination module 1104 may determine the priority of an electronic device according to one or more of three features: its category, recognition accuracy, and viewing angle deviation.
In some embodiments, the wake-up words that the user uses to wake up different electronic devices may differ. When the smart glasses detect that the user needs to wake up another electronic device and a wake-up word has been detected, the device priority determination module 1104 may first screen the electronic devices contained in the image. If the wake-up word used to wake up an electronic device does not match the wake-up word detected by the smart glasses, the device priority determination module 1104 may exclude that electronic device. The device priority determination module 1104 may then prioritize the electronic devices in the image that were not excluded. Optionally, if the wake-up word used to wake up an electronic device does not match the wake-up word detected by the smart glasses, the device priority determination module 1104 may instead assign that electronic device the lowest priority. This avoids failing to wake up the electronic device the user intends when the smart glasses make an error in recognizing devices from the captured image.
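The two screening variants described above, exclusion and demotion to the lowest priority, can be sketched as one small function. The device labels, the wake-up word table, and the `demote` flag are hypothetical. A device with no known wake-up word is treated as a mismatch, which is also an assumption.

```python
def screen_by_wake_word(candidates, heard_wake_word, wake_words_by_device,
                        demote=False):
    """Screen candidate devices against the wake-up word that was heard.

    candidates: device labels, already ordered by priority when demote=True.
    wake_words_by_device: hypothetical mapping from device label to wake-up word.
    demote=False drops mismatched devices; demote=True moves them to the end.
    """
    matching, mismatched = [], []
    for device in candidates:
        # Assumption: a device without a known wake-up word counts as a mismatch.
        if wake_words_by_device.get(device) == heard_wake_word:
            matching.append(device)
        else:
            mismatched.append(device)
    return matching + mismatched if demote else matching
```

Demotion keeps mismatched devices as a fallback, so a later matching step against the local device list can still reach them if nothing better is found.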
Optionally, the screening of the electronic devices contained in the image may also be performed by the image recognition module 1103.
For example, the image recognition module 1103 performs image recognition on the image shown in FIG. 6 and passes the type, recognition accuracy, and viewing angle deviation of the speaker 106 and the TV 107 in the image to the device priority determination module 1104. The device priority determination module 1104 prioritizes these two electronic devices and may obtain the priority list shown in Table 1 below:
Table 1. Priority list (in descending order of priority)
Speaker 106
TV 107
As Table 1 shows, the priority of the speaker 106 is higher than that of the TV 107. In the priority list, an electronic device may be represented by its type (for example, speaker Sound X). The embodiments of this application do not limit how an electronic device is represented in the priority list.
After obtaining the priority list, the device priority determination module 1104 may pass it to the device wake-up module 1105.
The device wake-up module 1105 may be used to determine the target wake-up device (that is, the electronic device the user wishes to wake up) according to the local device list and the priorities of the electronic devices contained in the image, and to send a wake-up instruction to the target wake-up device.
After receiving the priority list, the device wake-up module 1105 may match the electronic devices in the priority list against the electronic devices in the local device list, in descending order of priority. Based on the priority list and the local device list, the device wake-up module 1105 may determine the electronic device that has the highest priority in the priority list and also exists in the local device list as the target wake-up device. The device wake-up module 1105 may then send a wake-up instruction to the target wake-up device. The wake-up instruction may be sent by the smart glasses directly to the target wake-up device. Alternatively, the wake-up instruction may be forwarded to the target wake-up device by an electronic device connected to the smart glasses, such as a mobile phone or a router. The embodiments of this application do not limit this.
For example, in the scenario shown in FIG. 5, the device wake-up module 1105 of the smart glasses may obtain the local device list shown in Table 2 below:
Table 2. Local device list
Mobile phone 102
Speaker 106
TV 107
Smart glasses 101
...
Based on Table 1 and Table 2, the device wake-up module 1105 may determine the speaker 106 as the target wake-up device and send a wake-up instruction to the speaker 106.
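The matching step can be sketched as a single pass over the priority list. The function name and the use of plain string labels are assumptions made for illustration; the application does not prescribe how devices are represented.

```python
def select_target_wake_up_device(priority_list, local_device_list):
    """Return the highest-priority device that is also in the local device list.

    priority_list is ordered from highest to lowest priority.
    Returns None when no recognized device is present in the local device list.
    """
    local_devices = set(local_device_list)
    # Walk the candidates in descending priority and stop at the first match.
    for device in priority_list:
        if device in local_devices:
            return device
    return None
```

With the contents of Table 1 and Table 2 as inputs, the first entry of the priority list is already in the local device list, so the speaker 106 is selected. If a recognized device were missing from the local device list, the loop would simply fall through to the next candidate.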
In some embodiments, the local device list contains multiple electronic devices of the same type. For example, the local device list contains the speaker 106, which is a speaker Sound X. In addition to the speaker 106, the local device list contains another speaker Sound X. That is, the local device list contains two speakers Sound X. If the target wake-up device determined by the device wake-up module 1105 is a speaker Sound X, the device wake-up module 1105 may send an indication message to both speakers Sound X in the local device list. The indication message may instruct the two speakers Sound X to negotiate which one of them enters the wake-up state. In a possible implementation, after receiving the indication message, the two speakers Sound X may decide which one enters the wake-up state according to the strength of the sound signal containing the wake-up word that each of them received. It can be understood that the stronger the received sound signal containing the wake-up word, the closer the electronic device is to the user, and the more likely it is the electronic device that the user wishes to wake up. The speaker Sound X that received the strongest sound signal containing the wake-up word may therefore enter the wake-up state.
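The outcome of that negotiation can be sketched as picking the device with the strongest received signal. How the devices exchange their measurements is not described in the application, so this sketch assumes the strengths have already been collected into one mapping. The device identifiers and the decibel values in the usage note are hypothetical.

```python
def elect_device_to_wake(signal_strength_by_device):
    """Pick the device that received the wake-up word with the strongest signal.

    signal_strength_by_device: hypothetical mapping from device identifier to
    received signal strength, where a larger value means a stronger signal.
    On a tie, the device listed first wins; an empty mapping raises ValueError.
    """
    if not signal_strength_by_device:
        raise ValueError("no candidate devices reported a signal strength")
    return max(signal_strength_by_device, key=signal_strength_by_device.get)
```

For example, calling the function with `{"sound_x_living_room": -20.0, "sound_x_bedroom": -35.0}` selects the living room speaker, since -20.0 is the larger value.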
As can be seen from the schematic structural diagram of the smart glasses shown in FIG. 11, the smart glasses can determine which electronic device the user wishes to wake up by capturing an image within the user's field of view. In this way, the user can use the smart glasses to wake up the intended electronic device, which reduces false wake-ups.
In some embodiments, the smart glasses may include only the user behavior recognition module 1101, the image acquisition module 1102, the image recognition module 1103, and the device priority determination module 1104. The device wake-up module 1105 may be included in an electronic device connected to the smart glasses, such as a mobile phone or a router. The following description uses the case where the device wake-up module 1105 is included in a mobile phone as an example. The smart glasses may establish a Bluetooth communication connection with the mobile phone. After the smart glasses obtain the priority list through the device priority determination module 1104, they may send the priority list to the mobile phone. The device wake-up module 1105 in the mobile phone may determine the target wake-up device and send a wake-up instruction to it. For how the device wake-up module 1105 determines the target wake-up device, refer to the foregoing embodiments.
In the foregoing embodiment, the smart glasses need neither determine the target wake-up device nor send it a wake-up instruction. This lowers the requirements on the computing and storage capabilities of the smart glasses and reduces their power consumption.
In some embodiments, the smart glasses may include only the user behavior recognition module 1101, the image acquisition module 1102, and the image recognition module 1103. The device priority determination module 1104 and the device wake-up module 1105 may be included in an electronic device connected to the smart glasses, such as a mobile phone or a router. The following description uses the case where the device priority determination module 1104 and the device wake-up module 1105 are included in a mobile phone as an example. The smart glasses may establish a Bluetooth communication connection with the mobile phone. After the smart glasses determine features such as the type, recognition accuracy, and viewing angle deviation of the electronic devices in the image through the image recognition module 1103, they may send these features to the mobile phone. The mobile phone may determine the priorities of the electronic devices in the image through the device priority determination module 1104 and obtain a priority list. Then, based on the priority list and the local device list, the device wake-up module 1105 in the mobile phone may determine the target wake-up device and send a wake-up instruction to it.
In the foregoing embodiment, the smart glasses need not determine the priorities of the electronic devices, determine the target wake-up device, or send it a wake-up instruction. This lowers the requirements on the computing and storage capabilities of the smart glasses and reduces their power consumption.
In some embodiments, the smart glasses may include only the user behavior recognition module 1101 and the image acquisition module 1102. The image recognition module 1103, the device priority determination module 1104, and the device wake-up module 1105 may be included in an electronic device connected to the smart glasses, such as a mobile phone or a router. The following description uses the case where these three modules are included in a mobile phone as an example. The smart glasses may establish a Bluetooth communication connection with the mobile phone. After the smart glasses capture an image through the image acquisition module 1102, they may send the image to the mobile phone. After receiving the image, the mobile phone may determine the target wake-up device through the image recognition module 1103, the device priority determination module 1104, and the device wake-up module 1105, and send a wake-up instruction to the target wake-up device. For how the mobile phone determines the target wake-up device, refer to the foregoing embodiments. Details are not repeated here.
In the foregoing embodiment, the smart glasses need not store the device recognition model or perform image recognition on the image, and need not determine the priorities of the electronic devices, determine the target wake-up device, or send it a wake-up instruction. This lowers the requirements on the computing and storage capabilities of the smart glasses and reduces their power consumption.
In some embodiments, the operation of determining whether the user needs to wake up another electronic device may also be performed by another electronic device connected to the smart glasses (for example, a mobile phone). For example, when the mobile phone detects the wake-up word, it may send an image capture instruction to the smart glasses. After capturing an image, the smart glasses may send the image to the mobile phone, and the mobile phone may determine the target wake-up device according to the image. Optionally, after capturing the image, the smart glasses may themselves determine the target wake-up device according to the image.
FIG. 12 shows an example flowchart of a device wake-up method provided by an embodiment of this application.
As shown in FIG. 12, the method may include steps S201 to S207:
S201. The smart glasses detect that the user needs to wake up another electronic device.
When the smart glasses are being worn, they may detect whether the user needs to wake up another electronic device (for example, a mobile phone, a speaker, or a TV). The smart glasses may include the user behavior recognition module 1101 of the foregoing embodiments and may use it to detect whether the user needs to wake up another electronic device. For the specific implementation, refer to the foregoing embodiments. Details are not repeated here.
S202. The smart glasses capture an image and determine the type, recognition accuracy, and viewing angle deviation of the electronic devices contained in the image.
When the smart glasses detect that the user needs to wake up another electronic device, they may capture an image. The smart glasses may include the image acquisition module 1102 and the image recognition module 1103 of the foregoing embodiments. The smart glasses may capture the image through the image acquisition module 1102. This image is the image within the user's field of view. The smart glasses may determine the type, recognition accuracy, and viewing angle deviation of the electronic devices contained in the image through the image recognition module 1103. For the specific implementation, refer to the foregoing embodiments. Details are not repeated here.
S203. The smart glasses may send the type, recognition accuracy, and viewing angle deviation of the electronic devices to the mobile phone.
S204. The mobile phone may prioritize the electronic devices contained in the image. The priority of an electronic device may be determined according to one or more of the following: the type of the electronic device, the recognition accuracy, and the viewing angle deviation.
The mobile phone may include the device priority determination module 1104 of the foregoing embodiments. The mobile phone may prioritize the electronic devices contained in the image through the device priority determination module 1104 and obtain a prioritization result. The prioritization result may be the priority list of the foregoing embodiments.
S205. The mobile phone obtains the local device list and determines the electronic device that has the highest priority in the prioritization result and also exists in the local device list as the target wake-up device.
The mobile phone may include the device wake-up module 1105 of the foregoing embodiments. After obtaining the prioritization result, the mobile phone may determine the target wake-up device through the device wake-up module 1105 and perform step S206 below.
S206. The mobile phone sends a wake-up instruction to the target wake-up device.
In some embodiments, the mobile phone itself is the target wake-up device. In that case, after determining that it is the target wake-up device, the mobile phone may enter the wake-up state.
In some embodiments, the target wake-up device is an electronic device other than the mobile phone, and the mobile phone may send the wake-up instruction directly to the target wake-up device. Optionally, both the mobile phone and the target wake-up device are connected to a router. The mobile phone may send the wake-up instruction to the router, and the router may forward it to the target wake-up device.
S207. After receiving the wake-up instruction, the target wake-up device enters the wake-up state, recognizes a voice instruction, and performs the operation corresponding to the voice instruction.
The recipient is not limited to the mobile phone. The smart glasses may also send the prioritization result of the electronic devices to another electronic device connected to them (for example, a router), and that electronic device may determine the target wake-up device and send a wake-up instruction to it.
In some embodiments, after determining the type, recognition accuracy, and viewing angle deviation of the electronic devices contained in the image, the smart glasses may also prioritize those electronic devices, determine the target wake-up device, and send a wake-up instruction to the target wake-up device. That is, steps S204, S205, and S206 may all be performed by the smart glasses.
在一些实施例中,智能眼镜在进行图像采集后,可以将采集得到的图像发送给手机。手机可以识别图像中包含的电子设备。即上述步骤S202中识别图像中包含的电子设备可以是由手机完成的。这可以降低对智能眼镜计算能力和存储能力的要求,节省智能眼镜的功耗。In some embodiments, after the smart glasses collect the images, they can send the collected images to the mobile phone. Cell phones can identify electronic devices contained in images. That is, the identification of the electronic devices contained in the image in the above step S202 may be performed by the mobile phone. This can reduce the requirements on the computing power and storage capacity of the smart glasses, and save the power consumption of the smart glasses.
As the method shown in FIG. 12 illustrates, a user can use smart glasses to wake up the electronic device that the user wants to wake up. The method can effectively reduce false wake-ups and gives the user a better experience with the voice interaction function of electronic devices.
In some embodiments, the user is not limited to the smart glasses described above and can also use other types of image acquisition devices to assist in device wake-up. For example, the image acquisition device may be a surveillance camera.
The image acquisition device can detect a first user input and, when the first user input is detected, capture a first image. By detecting the first user input, the image acquisition device can determine whether the user needs to wake up a device. When it determines that the user needs to wake up a device, the image acquisition device can capture an image to obtain the first image. It can be understood that, when the user needs to wake up a device, the first image is more likely to contain the electronic device that the user wants to wake up.
The image acquisition device can select, from a plurality of electronic devices, the target electronic device contained in the first image. To do so, the image acquisition device can first identify the electronic devices contained in the first image and obtain information about them. This information may include, but is not limited to, the type, the recognition accuracy, and the viewing angle deviation. Based on this information, the image acquisition device can prioritize the electronic devices contained in the first image and obtain the priority of each of them. Further, the image acquisition device can determine whether an electronic device contained in the first image is present in the device wake-up system.
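The prioritization described above can be sketched as follows. This is a minimal illustration, not the applicant's implementation: the lexicographic combination of the three factors, the `TYPE_RANK` table, and all names (`DetectedDevice`, `view_deviation`, `rank_devices`) are assumptions. The only points taken from the text are that a preferred type and a higher recognition accuracy raise the priority, and that a larger viewing angle deviation (distance from the image center) lowers it.

```python
import math
from dataclasses import dataclass

# Hypothetical type-based wake-up order (lower rank = preferred). The source
# only says such an order exists, not which types it contains.
TYPE_RANK = {"tv": 0, "speaker": 1, "phone": 2}


@dataclass
class DetectedDevice:
    device_id: str
    device_type: str
    accuracy: float  # recognition accuracy in [0, 1]
    x: float         # device position in the image, in pixels
    y: float


def view_deviation(dev, width, height):
    """Distance between the device's position and the image center."""
    return math.hypot(dev.x - width / 2, dev.y - height / 2)


def rank_devices(devices, width, height):
    """Return the devices sorted from highest to lowest priority."""
    def key(dev):
        # Unknown types sort after every known type.
        type_rank = TYPE_RANK.get(dev.device_type, len(TYPE_RANK))
        # Assumed tie-breaking: type first, then accuracy (higher is better),
        # then viewing angle deviation (smaller is better).
        return (type_rank, -dev.accuracy, view_deviation(dev, width, height))
    return sorted(devices, key=key)
```

A weighted score over the three factors would satisfy the same correlations; the lexicographic key is chosen here only because it needs no weights.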
The electronic devices in the device wake-up system can be recorded in a local device list. The local device list can be stored in one or more electronic devices in the device wake-up system. Optionally, the local device list can also be stored on a cloud server. Every electronic device in the local device list can obtain and update the list. An electronic device can be added to or removed from the local device list by an electronic device that is already in the list. For example, an electronic device establishes a communication connection with another electronic device that is in the local device list and completes the trusted identity authentication indicated by that other electronic device. The other electronic device can then update the local device list and add the first electronic device to it. In this way, the first electronic device joins the device wake-up system. After the local device list is updated, every electronic device in the list can obtain the updated list. For the communication connections between the electronic devices in the device wake-up system, refer to the description of the communication systems shown in FIG. 3 and FIG. 4. Details are not repeated here.
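The membership rule for the local device list can be sketched as below. This is an illustrative model under stated assumptions: the class `LocalDeviceList`, its method names, and the boolean `authenticated` flag standing in for the trusted identity authentication are all hypothetical, and synchronization of the list between devices or a cloud server is not modeled.

```python
class LocalDeviceList:
    """In-memory model of the local device list (hypothetical API)."""

    def __init__(self, members):
        self._members = set(members)

    def members(self):
        """Return a read-only snapshot of the current list."""
        return frozenset(self._members)

    def _require_member(self, device_id):
        # Only a device already in the list may change the list.
        if device_id not in self._members:
            raise PermissionError(f"{device_id} is not in the local device list")

    def add(self, sponsor_id, new_id, authenticated):
        """Add new_id on behalf of sponsor_id once authentication succeeded."""
        self._require_member(sponsor_id)
        if not authenticated:
            return False
        self._members.add(new_id)
        return True

    def remove(self, sponsor_id, target_id):
        """Remove target_id on behalf of sponsor_id."""
        self._require_member(sponsor_id)
        self._members.discard(target_id)
```

For example, `LocalDeviceList({"phone"}).add("phone", "glasses", authenticated=True)` returns `True`, after which the glasses can themselves add or remove devices.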
The image acquisition device can therefore obtain the local device list and determine whether each electronic device contained in the first image is in the local device list. The image acquisition device can determine, as the target electronic device, the electronic device that is in the local device list, is contained in the first image, and has the highest priority, and can instruct the target electronic device to enter the wake-up state.
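The selection step can be sketched as a filter over an already prioritized list. The function name `select_target` and the use of plain string identifiers are assumptions made for illustration; the input `ranked_ids` is assumed to be ordered from highest to lowest priority.

```python
def select_target(ranked_ids, local_device_list):
    """Return the highest-priority device that is both in the first image
    (ranked_ids) and in the local device list, or None if there is none."""
    for device_id in ranked_ids:
        if device_id in local_device_list:
            return device_id
    return None
```

A device that appears in the image but is not in the local device list (for example, a neighbor's television seen through a window) is skipped, so the wake-up instruction only ever goes to a device in the device wake-up system.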
The foregoing embodiments are intended only to illustrate the technical solutions of this application, not to limit them. Although this application has been described in detail with reference to the foregoing embodiments, a person of ordinary skill in the art should understand that the technical solutions described in the foregoing embodiments can still be modified, or some of their technical features can be replaced with equivalents, and that such modifications or replacements do not cause the essence of the corresponding technical solutions to depart from the scope of the technical solutions of the embodiments of this application.

Claims (23)

  1. A device wake-up system, characterized in that the device wake-up system comprises an image acquisition device and a plurality of electronic devices, wherein:
    the image acquisition device is configured to detect a first user input and, when the first user input is detected, capture a first image;
    the image acquisition device is further configured to select, from the plurality of electronic devices, a target electronic device contained in the first image, and to send a wake-up instruction to the target electronic device, the wake-up instruction being used to trigger the target electronic device to enter a wake-up state; and
    the target electronic device is configured to enter the wake-up state in response to the received wake-up instruction.
  2. The device wake-up system according to claim 1, characterized in that the first user input is a voice input containing a wake-up word; or the first user input is a user operation acting on a first position of the image acquisition device.
  3. The device wake-up system according to claim 1 or 2, characterized in that the image acquisition device is smart glasses;
    the plurality of electronic devices are further configured to listen for a wake-up word;
    the plurality of electronic devices are further configured to, when the wake-up word is detected, detect whether the image acquisition device is present in the device wake-up system and whether the image acquisition device is in a worn state; and
    the plurality of electronic devices are further configured to, when it is determined that the image acquisition device is present in the device wake-up system and is in the worn state, wait to receive the wake-up instruction, and enter the wake-up state in response to the received wake-up instruction.
  4. The device wake-up system according to any one of claims 1-3, characterized in that the image acquisition device is specifically configured to:
    determine at least one of a type, a recognition accuracy, and a viewing angle deviation of an electronic device contained in the first image, wherein the recognition accuracy indicates the accuracy of the recognition result for the type of the electronic device contained in the first image, and the viewing angle deviation indicates the distance between the position of the electronic device in the first image and the center of the first image; and
    determine, as the target electronic device, the electronic device that, among the plurality of electronic devices, is contained in the first image and has the highest priority, wherein the priority is determined according to one or more of the type, the recognition accuracy, and the viewing angle deviation; the position of the type of the electronic device in a type-based wake-up order is positively correlated with the priority of the electronic device, the recognition accuracy of the electronic device is positively correlated with the priority of the electronic device, and the viewing angle deviation of the electronic device is negatively correlated with the priority of the electronic device.
  5. The device wake-up system according to claim 1, 2, or 4, characterized in that the image acquisition device is smart glasses.
  6. A device wake-up system, characterized in that the device wake-up system comprises an image acquisition device and a processing device, wherein:
    the image acquisition device is configured to detect a first user input and, when the first user input is detected, capture a first image;
    the image acquisition device is further configured to send a first instruction to the processing device, wherein the first instruction includes the first image and is used to instruct the processing device to select, from a plurality of electronic devices, a target electronic device contained in the first image; and
    the processing device is configured to, in response to the first instruction, select from the plurality of electronic devices the target electronic device contained in the first image, and send a wake-up instruction to the target electronic device, the wake-up instruction being used to trigger the target electronic device to enter a wake-up state.
  7. The device wake-up system according to claim 6, characterized in that the first user input is a voice input containing a wake-up word; or the first user input is a user operation acting on a first position of the image acquisition device.
  8. The device wake-up system according to claim 6 or 7, characterized in that the device wake-up system further comprises the plurality of electronic devices, wherein:
    the plurality of electronic devices are configured to enter the wake-up state in response to the wake-up instruction.
  9. The device wake-up system according to claim 8, characterized in that the image acquisition device is smart glasses, and the plurality of electronic devices are further configured to:
    listen for a wake-up word;
    when the wake-up word is detected, detect whether the image acquisition device is present in the device wake-up system and whether the image acquisition device is in a worn state; and
    when the image acquisition device is present in the device wake-up system and is in the worn state, wait to receive the wake-up instruction.
  10. The device wake-up system according to any one of claims 6-9, characterized in that the processing device is specifically configured to:
    determine at least one of a type, a recognition accuracy, and a viewing angle deviation of an electronic device contained in the first image, wherein the recognition accuracy indicates the accuracy of the recognition result for the type of the electronic device contained in the first image, and the viewing angle deviation indicates the distance between the position of the electronic device in the first image and the center of the first image; and
    determine, as the target electronic device, the electronic device that, among the plurality of electronic devices, is contained in the first image and has the highest priority, wherein the priority is determined according to one or more of the type, the recognition accuracy, and the viewing angle deviation; the position of the type of the electronic device in a type-based wake-up order is positively correlated with the priority of the electronic device, the recognition accuracy of the electronic device is positively correlated with the priority of the electronic device, and the viewing angle deviation of the electronic device is negatively correlated with the priority of the electronic device.
  11. The device wake-up system according to any one of claims 6-8 or 10, characterized in that the image acquisition device is smart glasses.
  12. A device wake-up method, characterized in that the method comprises:
    obtaining a first image;
    selecting, from a plurality of electronic devices, a target electronic device contained in the first image; and
    sending a wake-up instruction to the target electronic device, the wake-up instruction being used to trigger the target electronic device to enter a wake-up state.
  13. The method according to claim 12, characterized in that the method is performed by an image acquisition device;
    the obtaining a first image specifically comprises:
    capturing the first image when a first user input is detected.
  14. The method according to claim 13, characterized in that the detecting of the first user input specifically comprises:
    detecting a wake-up word; or
    detecting a user operation acting on a first position of the image acquisition device.
  15. The method according to claim 12, characterized in that the method is performed by a processing device;
    the obtaining a first image specifically comprises:
    receiving a first instruction from an image acquisition device, wherein the first instruction includes the first image captured by the image acquisition device and is used to instruct the processing device to select, from the plurality of electronic devices, the target electronic device contained in the first image.
  16. The method according to any one of claims 13-15, characterized in that the image acquisition device is smart glasses.
  17. The method according to any one of claims 12-16, characterized in that the selecting, from a plurality of electronic devices, a target electronic device contained in the first image specifically comprises:
    determining at least one of a type, a recognition accuracy, and a viewing angle deviation of an electronic device contained in the first image, wherein the recognition accuracy indicates the accuracy of the recognition result for the type of the electronic device contained in the first image, and the viewing angle deviation indicates the distance between the position of the electronic device in the first image and the center of the first image; and
    determining, as the target electronic device, the electronic device that, among the plurality of electronic devices, is contained in the first image and has the highest priority, wherein the priority is determined according to one or more of the type, the recognition accuracy, and the viewing angle deviation; the position of the type of the electronic device in a type-based wake-up order is positively correlated with the priority of the electronic device, the recognition accuracy of the electronic device is positively correlated with the priority of the electronic device, and the viewing angle deviation of the electronic device is negatively correlated with the priority of the electronic device.
  18. A device wake-up method, characterized in that the method comprises:
    capturing, by an image acquisition device, a first image when a first user input is detected; and
    sending, by the image acquisition device, a first instruction to a processing device, wherein the first instruction includes the first image and is used to instruct the processing device to select, from a plurality of electronic devices, a target electronic device contained in the first image, the target electronic device is the device to which the processing device sends a wake-up instruction, and the wake-up instruction is used to trigger the target electronic device to enter a wake-up state.
  19. The method according to claim 18, characterized in that the image acquisition device is smart glasses.
  20. The method according to claim 18 or 19, characterized in that the detecting of the first user input specifically comprises:
    detecting a wake-up word; or
    detecting a user operation acting on a first position of the image acquisition device.
  21. A device wake-up method, characterized in that the method comprises:
    detecting, by a first electronic device, a wake-up word;
    in response to the wake-up word, detecting, by the first electronic device, whether smart glasses are present in a device wake-up system and whether the smart glasses are in a worn state;
    if the smart glasses are present in the device wake-up system and are in the worn state, waiting, by the first electronic device, to receive a wake-up instruction, the wake-up instruction being used to trigger the first electronic device to enter a wake-up state; and
    receiving, by the first electronic device, the wake-up instruction, and entering the wake-up state.
  22. An electronic device, characterized in that the electronic device comprises a memory and a processor, wherein the memory is configured to store a computer program, and the processor is configured to invoke the computer program so that the electronic device performs the method according to any one of claims 12-17, 18-20, or 21.
  23. A computer-readable storage medium comprising instructions, characterized in that, when the instructions are run on an electronic device, the electronic device is caused to perform the method according to any one of claims 12-17, 18-20, or 21.
PCT/CN2022/107411 2021-07-26 2022-07-22 Device wake-up method, related apparatus, and communication system WO2023005844A1 (en)

Applications Claiming Priority (2)

Application Number Priority Date Filing Date Title
CN202110844001.4 2021-07-26
CN202110844001.4A CN115691485A (en) 2021-07-26 2021-07-26 Equipment awakening method, related device and communication system

Publications (1)

Publication Number Publication Date
WO2023005844A1 true WO2023005844A1 (en) 2023-02-02

Family

ID=85044703

Family Applications (1)

Application Number Title Priority Date Filing Date
PCT/CN2022/107411 WO2023005844A1 (en) 2021-07-26 2022-07-22 Device wake-up method, related apparatus, and communication system

Country Status (2)

Country Link
CN (1) CN115691485A (en)
WO (1) WO2023005844A1 (en)

Citations (7)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN105204628A (en) * 2015-09-01 2015-12-30 涂悦 Voice control method based on visual awakening
KR20180084392A (en) * 2017-01-17 2018-07-25 삼성전자주식회사 Electronic device and operating method thereof
CN109817211A (en) * 2019-02-14 2019-05-28 珠海格力电器股份有限公司 A kind of electric control method, device, storage medium and electric appliance
CN109992237A (en) * 2018-01-03 2019-07-09 腾讯科技(深圳)有限公司 Intelligent sound apparatus control method, device, computer equipment and storage medium
CN111007732A (en) * 2019-11-12 2020-04-14 珠海格力电器股份有限公司 Air conditioner vision wake-up-free identification method and system based on scale change and intelligent home
CN111128157A (en) * 2019-12-12 2020-05-08 珠海格力电器股份有限公司 Wake-up-free voice recognition control method for intelligent household appliance, computer readable storage medium and air conditioner
CN111145739A (en) * 2019-12-12 2020-05-12 珠海格力电器股份有限公司 Vision-based awakening-free voice recognition method, computer-readable storage medium and air conditioner

Also Published As

Publication number Publication date
CN115691485A (en) 2023-02-03

Legal Events

Date Code Title Description
121 Ep: the epo has been informed by wipo that ep was designated in this application

Ref document number: 22848449

Country of ref document: EP

Kind code of ref document: A1

NENP Non-entry into the national phase

Ref country code: DE