CN111599361A - Awakening method and device, computer storage medium and air conditioner - Google Patents

Awakening method and device, computer storage medium and air conditioner Download PDF

Info

Publication number
CN111599361A
CN111599361A CN202010406233.7A CN202010406233A CN111599361A CN 111599361 A CN111599361 A CN 111599361A CN 202010406233 A CN202010406233 A CN 202010406233A CN 111599361 A CN111599361 A CN 111599361A
Authority
CN
China
Prior art keywords
user
mouth shape
wake
voice
awakening
Prior art date
Legal status (The legal status is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the status listed.)
Pending
Application number
CN202010406233.7A
Other languages
Chinese (zh)
Inventor
贾鸿本
Current Assignee (The listed assignees may be inaccurate. Google has not performed a legal analysis and makes no representation or warranty as to the accuracy of the list.)
Aux Air Conditioning Co Ltd
Ningbo Aux Electric Co Ltd
Original Assignee
Aux Air Conditioning Co Ltd
Ningbo Aux Electric Co Ltd
Priority date (The priority date is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the date listed.)
Filing date
Publication date
Application filed by Aux Air Conditioning Co Ltd, Ningbo Aux Electric Co Ltd filed Critical Aux Air Conditioning Co Ltd
Priority to CN202010406233.7A priority Critical patent/CN111599361A/en
Publication of CN111599361A publication Critical patent/CN111599361A/en
Pending legal-status Critical Current

Links

Images

Classifications

    • GPHYSICS
    • G10MUSICAL INSTRUMENTS; ACOUSTICS
    • G10LSPEECH ANALYSIS TECHNIQUES OR SPEECH SYNTHESIS; SPEECH RECOGNITION; SPEECH OR VOICE PROCESSING TECHNIQUES; SPEECH OR AUDIO CODING OR DECODING
    • G10L15/00Speech recognition
    • G10L15/22Procedures used during a speech recognition process, e.g. man-machine dialogue
    • GPHYSICS
    • G10MUSICAL INSTRUMENTS; ACOUSTICS
    • G10LSPEECH ANALYSIS TECHNIQUES OR SPEECH SYNTHESIS; SPEECH RECOGNITION; SPEECH OR VOICE PROCESSING TECHNIQUES; SPEECH OR AUDIO CODING OR DECODING
    • G10L15/00Speech recognition
    • G10L15/22Procedures used during a speech recognition process, e.g. man-machine dialogue
    • G10L2015/223Execution procedure of a spoken command

Landscapes

  • Engineering & Computer Science (AREA)
  • Computational Linguistics (AREA)
  • Health & Medical Sciences (AREA)
  • Audiology, Speech & Language Pathology (AREA)
  • Human Computer Interaction (AREA)
  • Physics & Mathematics (AREA)
  • Acoustics & Sound (AREA)
  • Multimedia (AREA)
  • Air Conditioning Control Device (AREA)

Abstract

The invention provides a wake-up method, which comprises the following steps: acquiring a voice awakening word, wherein the voice awakening word corresponds to a preset mouth shape; comparing the position of a sound source which sends out the voice awakening word with the position of a user, and comparing the mouth shape of the user with a preset mouth shape; when the position of the sound source matches the position of the user, and the user's mouth shape matches a predetermined mouth shape, a wake-up operation is performed. The method can preliminarily judge whether the awakening word is sent by the user or not by comparing the sound source position with the user position, and further confirm whether the awakening word is sent by the user or not by combining the mouth shape, so that the problem that the air conditioner is awakened by mistake due to mistake identification of the awakening word can be effectively avoided.

Description

Awakening method and device, computer storage medium and air conditioner
Technical Field
The invention relates to the technical field of air conditioners, in particular to a wake-up method, a wake-up device, a computer storage medium and an air conditioner.
Background
The intelligent air conditioner generally has a voice control function, and a user can control the air conditioner to be turned on or off, adjust the operation mode, set the temperature, set the wind speed and the like by speaking. At present, air conditioners with voice control functions are all provided with a wake-up mechanism, voice can be recognized only after the air conditioners are waken up, and the design can prevent the situation of misoperation of users in the process of daily use of the air conditioners, such as sending out voice commands unintentionally when the air conditioners are used, and changing the mode of the air conditioners by mistake, turning on and off the air conditioners by mistake and the like. The awakening mechanism has the advantages that when the air conditioner is not required to be controlled by voice, the voice module of the air conditioner can be in a standby state, and standby power consumption is reduced.
In real life, there are situations where a speech module is awoken by mistake due to environmental disturbances, such as television sound disturbances. At present, the reduction of the awakening sensitivity is a solution, but the method cannot fundamentally solve the reason, so that not only can the condition of mistaken awakening still exist, but also the normal use of the user can be influenced, and the instruction sent by the user cannot be identified. The other solution is that the human body sensor detects the position of the human body to judge whether the awakening command is sent by the user, but when the position of the user is closer to the sound source interference position, the method cannot eliminate the interference sound source.
Disclosure of Invention
The invention mainly aims to provide a wake-up method and an air conditioner so as to reduce the situation that the voice control air conditioner is mistakenly awakened.
One aspect of the present invention provides a wake-up method, including: acquiring a voice awakening word, wherein the voice awakening word corresponds to a preset awakening mouth shape; comparing the position of a sound source which sends the voice awakening word with the position of a user, and comparing the mouth shape of the user with the preset awakening mouth shape; and when the position of the sound source is matched with the position of the user and the mouth shape of the user is matched with the preset awakening mouth shape, executing awakening operation.
Therefore, whether the awakening words are sent out from the direction of the user can be confirmed by comparing the sound source position with the user position, and whether the awakening words are sent out by the user is further confirmed by identifying the mouth shape of the user, so that mistaken awakening of the air conditioner is avoided.
Optionally, the obtaining a voice wakeup word includes: receiving a voice signal of a user, the voice signal being received by a microphone array; and detecting voice awakening words in the voice signals.
Therefore, the air conditioner can receive indoor sound signals in real time and obtain awakening words in time.
Optionally, the comparing the position of the sound source which utters the voice wakeup word with the position of the user includes: calculating the position of the sound source according to the time difference of the voice signal reaching each microphone in the microphone array; acquiring a user image; acquiring the position of the user according to the user image; and comparing the position of the sound source with the position of the user, and judging whether the position of the sound source is matched with the position of the user.
Therefore, the sound source position and the user position can be obtained, and whether the awakening word is possibly sent by the user or not can be judged.
Optionally, the obtaining the position of the user according to the user image includes: dividing a real space represented by the user image into a plurality of regions; identifying a region in which the user is located in the user image; and taking the position of the area where the user is located as the position of the user.
Therefore, the current position of the user can be obtained according to the user image.
Optionally, the comparing the position of the sound source which sends the voice awakening word with the position of the user, and comparing the mouth shape of the user with the preset awakening mouth shape comprises: comparing the position of the sound source which sends the voice awakening word with the position of a user; if the position of the sound source is not matched with the position of the user, not comparing the mouth shape of the user with the preset awakening mouth shape; and if the position of the sound source is matched with the position of the user, comparing the mouth shape of the user with the preset awakening mouth shape.
Therefore, when the sound source position is not matched with the user position, the awakening word can be judged not to be sent by the user, the mouth shape of the user is stopped being further identified, the calculation amount can be reduced, the waste of calculation resources and storage resources is prevented, when the sound source position is matched with the user position, whether the user sends the awakening word or not is confirmed by identifying the mouth shape of the user, and mistaken awakening of the air conditioner is prevented.
Optionally, the comparing the position of the sound source that sends the voice wake-up word with the position of the user, and comparing the mouth shape of the user with the preset wake-up mouth shape further includes: acquiring a user image; according to the user image, recognizing the mouth shape of the user in the user image; and comparing the mouth shape of the user with the preset awakening mouth shape, and judging whether the mouth shape of the user is matched with the preset awakening mouth shape.
Therefore, whether the awakening word is sent by the user or not can be confirmed according to the mouth shape.
Optionally, the wake action comprises sounding a wake feedback tone to the user.
Therefore, the air conditioner can be awakened to start working, and a user is reminded that the air conditioner is awakened.
In another aspect, the present invention further provides a wake-up apparatus, which is applied to the wake-up method according to any one of the first aspect, and includes: the device comprises a wake-up word acquisition module, a voice wake-up word acquisition module and a voice recognition module, wherein the wake-up word acquisition module is used for acquiring a voice wake-up word which corresponds to a preset wake-up mouth shape; the comparison module is used for comparing the position of a sound source sending the voice awakening word with the position of a user and comparing the mouth shape of the user with the preset awakening mouth shape; and the awakening module is used for executing awakening operation when the position of the sound source is matched with the position of the user and the mouth shape of the user is matched with the preset awakening mouth shape.
In another aspect, the present invention further provides a computer readable storage medium, on which a computer program is stored, which, when being executed by a processor, implements the steps of the wake-up method according to any one of the first aspect.
Another aspect of the present invention provides an air conditioner, comprising: a memory, a processor and a computer program stored on the memory and executable on the processor, the processor implementing the wake-up method according to any of the first aspect when executing the computer program.
The advantages of the air conditioner, the wake-up device and the computer readable storage medium are the same as those of the wake-up method, and are not described herein again.
Drawings
Fig. 1 schematically shows a flowchart of a wake-up method according to an embodiment of the present invention;
FIG. 2 is a schematic diagram illustrating an embodiment of the present invention for obtaining a location of a user;
FIG. 3 is a schematic diagram of a microphone array receiving a speech signal according to an embodiment of the invention;
FIG. 4 is a flow chart schematically illustrating another wake-up method provided by an embodiment of the present invention;
fig. 5 is a schematic structural diagram of a wake-up apparatus according to another embodiment of the present invention;
fig. 6 schematically shows a structural diagram of each module of an air conditioner according to another embodiment of the present invention.
Detailed Description
In order to make the purpose, features and advantages of the present invention more obvious and understandable, the technical solutions in the embodiments of the present invention will be clearly and completely described below with reference to the accompanying drawings in the embodiments of the present invention, and it is obvious that the described embodiments are only a part of the embodiments of the present invention, and not all the embodiments. All other embodiments, which can be derived by a person skilled in the art from the embodiments given herein without making any creative effort, shall fall within the protection scope of the present invention.
Fig. 1 schematically shows a flowchart of a wake-up method according to an embodiment of the present invention. In the present embodiment, an air conditioner is taken as an application scenario, and the wakeup method in the present embodiment is described in detail with reference to fig. 1 and fig. 2 to 6.
Referring to fig. 1, a wake-up method according to an embodiment of the present invention includes S110 to S130.
S110, acquiring a voice awakening word, wherein the voice awakening word corresponds to a preset awakening mouth shape.
In this embodiment, when the air conditioner is in a standby state, the sound in the environment where the air conditioner is located is monitored in real time, the user sends an instruction, the user needs to send a wakeup word to wake up the air conditioner first, and after the air conditioner is woken up, the air conditioner can receive other instructions for controlling the operation of the air conditioner by the user.
In this embodiment, after the air conditioner is powered on and initialized, the air conditioner enters a standby mode, and monitors a voice signal sent by a user in real time, before detecting a voice wakeup word, the user voice should be recognized first, and the voice wakeup word is detected from the user voice, including steps S101 to S102.
S101, receiving a voice signal of a user, wherein the voice signal is received by a microphone array.
In this embodiment, the air conditioner monitors all indoor sounds including video sounds, footstep sounds, conversation sounds among multiple users, indoor voices transmitted outdoors and the like, and after receiving the sounds, the air conditioner needs to extract the voices from the sounds as effective voice signals, so that external interference is reduced, and the probability of misrecognition is reduced.
It can be understood that the number of the microphones of the microphone array is more than 2, and the microphones can be arranged in a plurality of rows according to a word or not arranged according to a word.
S102, voice awakening words in the voice signals are detected.
In this embodiment, the voice wake-up word may exist in a plurality of sentence-type voice signals sent by the user, and the voice wake-up word may be extracted from the voice signals by using technical means such as natural language processing and voiceprint comparison, so as to accurately obtain the voice wake-up word.
It can be understood that the voice wake-up word may include various forms, for example, the voice wake-up word may include voice commands such as "hello", "power on", "start up", and the like, and may also be customized by the user.
S120, comparing the position of the sound source which sends the voice awakening word with the position of the user, and comparing the mouth shape of the user with the preset awakening mouth shape.
In this embodiment, if the user has sent the wake-up word, the position of the sound source is the same as the position of the user, and the mouth shape of the user should be the preset wake-up mouth shape, i.e., the position of the sound source and the position of the user are compared, and the mouth shape of the user and the preset wake-up mouth shape are compared, so that the mistaken recognition of the air conditioner caused by the mistaken recognition of other sounds as the wake-up word sent by the user can be prevented through the two comparisons.
Optionally, the position of the sound source is compared with the position of the user, and the mouth shape of the user is compared with a preset awakening mouth shape, and the two steps may be executed without being separated from each other.
In this embodiment, the position of the sound source is compared with the position of the user, and then the mouth shape of the user is compared with a preset awakening mouth shape, including S121 to S123.
And S121, comparing the position of the sound source which sends the voice awakening word with the position of the user.
In this embodiment, both the sound source position and the position of the user are sent to the controller of the air conditioner, so that the air conditioner determines whether the two positions are the same position to determine whether the voice wakeup word is sent by the user, and if the sound source position is the same as the position of the user, the wakeup word is sent from the direction in which the user is located, and the user may send the wakeup word. Specifically, S121 includes S1211 to S1214.
S1211, calculating a position of the sound source according to a time difference between arrival of the voice signal at each microphone of the microphone array.
Referring to fig. 3, in the present embodiment, a voice signal is received by a microphone array, and the sound source position is calculated according to a time difference of the voice signal reaching each microphone in the microphone array. As shown in fig. 3, the time and angle of arrival of the voice signal (straight arrow shown on the left side of the figure) at each microphone may be different due to different positions of each microphone, the waveform (wavy line shown on the right side of the figure) formed by the voice signal received by each microphone is different, and the sound source direction or the position of the sound source of the voice signal can be determined according to the combination of the time differences of the voice signal received by each microphone.
Optionally, the air conditioner may further obtain a specific position of the sound source by combining the time difference of the speech signal obtained by the microphone array according to the intensity of the speech signal.
And S1212, acquiring the user image.
In this embodiment, the user image is obtained through the camera, the user image refers to an image in a space where the air conditioner is currently located, and is used for recording user activities, the camera may be a dedicated camera configured for the air conditioner itself, or may be another camera used by the user daily, and the camera may perform data transmission with the air conditioner through a network.
S1213, according to the user image, the position of the user is obtained.
Referring to fig. 2, in this embodiment, the actual space represented by the user image is divided into a plurality of areas, the area where the user is located in the user image is identified, and the position of the area where the user is located is used as the position of the user. For example, as shown in fig. 2, the user image is divided into 8 regions, and through the algorithm analysis, the position of the user between the region 2 and the region 3 in the space can be determined, and further, according to the outline and the display size of the user in the image, the specific position of the user can be determined.
Optionally, after the camera acquires the user image, the camera uploads the user image to the server, the server executes a corresponding recognition algorithm to determine the user position, and then feeds the user position back to the camera, or the camera can directly perform local operation to obtain the user position. The server has large storage space and strong computing capability, and can quickly acquire the position of the user.
It can be understood that the image taken by the camera may be a video or a picture.
S1214, comparing the position of the sound source with the position of the user, and judging whether the position of the sound source is matched with the position of the user.
In the present embodiment, when the position of the sound source matches the position of the user, S123 is performed again, and when the position of the sound source does not match the position of the user, S122 is performed.
And S122, if the position of the sound source is not matched with the position of the user, not comparing the mouth shape of the user with the preset awakening mouth shape.
In this embodiment, if the position of the sound source is not matched with the position of the user, it is indicated that the wake-up word is not sent by the user, and actions such as identifying the mouth shape of the user, comparing the mouth shape of the user with a preset wake-up mouth shape, and the like are not performed, so that computing resources and storage resources of a controller and a camera of the air conditioner can be saved.
S123, if the position of the sound source is matched with the position of the user, comparing the mouth shape of the user with the preset awakening mouth shape.
In this embodiment, before comparing the user' S mouth shape with the awakening mouth shape, the method further includes S1231-1233.
And S1231, acquiring the user image.
In this embodiment, the user image and the image for identifying the user position are the same image.
S1231, according to the user image, recognizing the mouth shape of the user in the user image.
In this embodiment, after the camera captures the user image, the camera may transmit the image to the server, so that the server processes the user image to obtain data related to the mouth shape of the user in the image, such as the shape, size, and picture of the mouth shape. The server has large storage space and strong computing capability, and can quickly obtain the mouth shape related data of the user.
Optionally, after the camera captures the user image, the mouth shape of the user in the user image is locally recognized, and the related data of the mouth shape of the user is acquired.
S1231, comparing the mouth shape of the user with the preset awakening mouth shape, and judging whether the mouth shape of the user is matched with the preset awakening mouth shape.
In this embodiment, after acquiring the mouth shape data of the user, the data related to the mouth shape of the user is compared with the preset data of the waking mouth shape, for example, the similarity of the mouth shape data is calculated, the similarity of the mouth shape image is compared, and whether the mouth shape of the user is the waking mouth shape is determined.
It is understood that the preset mouth shape data is trained by a plurality of mouth shape data, the number of which is more than one.
Optionally, the preset die data is stored locally at the camera, on a server, or other storable medium.
S130, when the position of the sound source is matched with the position of the user, and the mouth shape of the user is matched with the preset awakening mouth shape, awakening operation is executed.
In this embodiment, when the position of the sound source is matched with the position of the user and the mouth shape of the user is matched with the preset awakening mouth shape, it is determined that the voice awakening word is sent by the user, and then an awakening operation is executed, so that the air conditioner corresponds to the next voice instruction of the user.
In this embodiment, the wake-up operation includes sending a wake-up feedback sound to the user to prompt the user that the air conditioner is woken up, and can receive other voice commands from the user.
In this embodiment, if the sound source position is the same as the position of the user, the mouth shape is not the wake-up mouth shape, which indicates that the wake-up word is sent from the direction where the user is located, but is not the instruction sent by the user to wake up the air conditioner, the wake-up operation is not executed, and false wake-up of the air conditioner is avoided.
Referring to fig. 4, fig. 4 schematically illustrates another implementation manner of the wake-up method provided by the embodiment of the present disclosure, including steps S410 to S450.
S410, acquiring a voice awakening word, wherein the voice awakening word corresponds to a preset awakening mouth shape;
and S420, acquiring the mouth shape of the user.
And S430, comparing the mouth shape of the user with the preset awakening mouth shape.
S440, if the mouth shape of the user is matched with the preset awakening mouth shape, acquiring the position of the sound source of the voice awakening word and the position of the user.
S450, comparing the position of the sound source of the voice awakening word with the position of the user.
S460, when the position of the sound source is matched with the position of the user, performing awakening operation.
In this embodiment, it is determined whether the mouth shape of the user is the mouth shape of the wake-up word, and it is determined whether the position of the user is the same as the position of the sound source that emits the wake-up word.
Referring to fig. 5, another embodiment of the invention provides a wake-up apparatus 500, which includes a wake-up word obtaining module 510, a comparing module 520, and a wake-up module 530, which are described in detail below. The device may perform the wake-up method as shown in fig. 1 and 4.
A wake-up word obtaining module 510, configured to obtain a voice wake-up word, where the voice wake-up word corresponds to a preset wake-up mouth shape;
a comparison module 520, configured to compare the position of the sound source that emits the voice wake-up word with the position of the user, and compare the mouth shape of the user with the preset wake-up mouth shape;
a wake-up module 530 configured to perform a wake-up operation when the position of the sound source matches the position of the user and the mouth shape of the user matches the preset wake-up mouth shape.
In this embodiment, when the wakeup word acquisition module 510 detects a voice wakeup word, the wakeup comparison module 520 acquires a position of a sound source that sends the voice wakeup word according to a time difference between arrival of a voice signal including the voice wakeup word at a microphone array, acquires a position of a user according to a user image captured by a camera, and compares the position of the sound source with the position of the user, when the position of the sound source matches the position of the user, the comparison module 520 acquires related data of a mouth shape of the user according to the user image, compares the data with preset wakeup word mouth shape data, and when the mouth shape data matches with the mouth shape data of a threshold, proves that the mouth shape data is a wakeup word mouth shape, the comparison module 520 sends a wakeup instruction to the wakeup module 530; the wake-up module 530 wakes up other operating modules of the air conditioner and sends a feedback tone to the user to prompt the user that the user can continue to issue instructions.
It is understood that the wakeup word acquiring module 510, the comparing module 520, and the wakeup module 530 may be combined and implemented in one module, or any one of the modules may be split into multiple modules. Alternatively, at least part of the functionality of one or more of these modules may be combined with at least part of the functionality of the other modules and implemented in one module. According to an embodiment of the present invention, at least one of the wake word obtaining module 510, the comparing module 520, and the wake module 530 may be implemented at least in part as a hardware circuit, such as a Field Programmable Gate Array (FPGA), a Programmable Logic Array (PLA), a system on a chip, a system on a substrate, a system on a package, an Application Specific Integrated Circuit (ASIC), or may be implemented in hardware or firmware in any other reasonable manner of integrating or packaging a circuit, or in a suitable combination of three implementations of software, hardware, and firmware. Alternatively, at least one of the wake word obtaining module 510, the comparing module 520, and the wake module 530 may be at least partially implemented as a computer program module, which may perform the functions of the respective modules when the program is executed by a computer.
Another embodiment of the disclosure also provides a computer-readable storage medium having stored thereon a computer program which, when executed by a processor, performs the steps of the method of any of fig. 1.
Another embodiment of the present invention provides an air conditioner, a memory, a processor and a computer program stored in the memory and capable of running on the processor, wherein when the processor executes the computer program, the wake-up method as shown in any one of fig. 1 is implemented, so as to prevent a false wake-up situation occurring when the air conditioner is controlled by voice.
Referring to fig. 6, in the present embodiment, the air conditioner includes a microphone array 601, a voice module 602, a camera module 603, a control module 604, a broadcast module 605, and a load module 606. The microphone array 601 is used for receiving sound signals of the room where the air conditioner is located; the voice module 602 is configured to receive a voice signal transmitted by the microphone array 601, identify a voice signal in the voice signal, detect whether the voice signal includes a voice wakeup word, and acquire a position of a sound source; a camera module 603 for taking an image of the user and acquiring a position and a mouth shape of the user; the control module 604 is configured to compare whether the position of the user and the position of the sound source are the same, identify whether the mouth shape of the user is a wakeup word mouth shape, identify that the position of the user and the position of the sound source are the same, and control the responsible module 606 to operate and control the broadcast module 605 to broadcast a wakeup feedback tone when the mouth shape of the user is the wakeup word mouth shape; the broadcasting module 605 is configured to broadcast a wake-up feedback sound after it is confirmed that the user sends the wake-up word; and a load module 606 for heating or cooling.
Optionally, the voice module 602, the camera module 603, and the controller 604 may be divided into an offline operating mode and an online operating mode, for example, after the camera module 603 captures an image of a user, the image of the user may be locally analyzed to obtain a position and a mouth shape of the user, where the offline operating mode is an offline operating mode, and requires a chip with an arithmetic capability and a memory for storing the image of the user, and the method is faster in speed for obtaining the position and the mouth shape of the user, and is not limited by a network, but is still limited by a memory size (the larger the memory is, the more image templates and data can be stored for comparison, the higher the accuracy for identifying the position and the mouth shape of the user is), and the identification capability is limited; also can pass through the network with the user image and send the server to, make the server acquire user's position and mouth type according to the user image, this mode is online mode, need with camera and network connection, it possesses the communication ability to need the camera, because the storage space of server is bigger, the computing power is stronger, user position and mouth type that can be more accurate discernment, the chip computing power that makes the camera itself need not too strong, the memory need not too big, the hardware cost of having practiced thrift the air conditioner, but its operating efficiency can receive the network restriction.
It can be understood that the functions of acquiring the position of the user, the position of the sound source, the mouth shape of the user, comparing the positions of the user and the sound source, comparing the mouth shape of the user and the preset waking word mouth shape, and the like, which can be realized by the air conditioner, can be selectively divided into online and offline working modes according to the actual situation, that is, partial functions of the modules such as the voice module 602, the camera module 603, the controller 604, and the like, can be realized in the online mode, and partial functions are realized in the offline mode.
The advantages of the air conditioner provided in this embodiment are the same as the wake-up method provided in the above embodiment, and are not described herein again.
Although the present invention is disclosed above, the present invention is not limited thereto. Various changes and modifications may be effected therein by one skilled in the art without departing from the spirit and scope of the invention as defined in the appended claims.

Claims (10)

1. A method of waking up, comprising:
acquiring a voice awakening word, wherein the voice awakening word corresponds to a preset awakening mouth shape;
comparing the position of a sound source which sends the voice awakening word with the position of a user, and comparing the mouth shape of the user with the preset awakening mouth shape;
and when the position of the sound source is matched with the position of the user and the mouth shape of the user is matched with the preset awakening mouth shape, executing awakening operation.
2. The method according to claim 1, wherein the obtaining the voice wake-up word comprises:
receiving a voice signal of a user, the voice signal being received by a microphone array;
and detecting voice awakening words in the voice signals.
3. The method according to claim 2, wherein comparing the position of the sound source that utters the voice wake-up word with the position of the user comprises:
calculating the position of the sound source according to the time difference of the voice signal reaching each microphone in the microphone array;
acquiring a user image;
acquiring the position of the user according to the user image;
and comparing the position of the sound source with the position of the user, and judging whether the position of the sound source is matched with the position of the user.
4. The wake-up method according to claim 3, wherein the obtaining the location of the user from the user image comprises:
dividing a real space represented by the user image into a plurality of regions;
identifying a region in which the user is located in the user image;
and taking the position of the area where the user is located as the position of the user.
5. The method according to claim 1, wherein comparing the position of the sound source that utters the voice wake-up word with the position of the user, and comparing the mouth shape of the user with the preset wake-up mouth shape comprises:
comparing the position of the sound source which sends the voice awakening word with the position of a user;
if the position of the sound source is not matched with the position of the user, not comparing the mouth shape of the user with the preset awakening mouth shape;
and if the position of the sound source is matched with the position of the user, comparing the mouth shape of the user with the preset awakening mouth shape.
6. The method for waking up as claimed in claim 1, wherein the comparing the position of the sound source emitting the voice wake-up word with the position of the user and the comparing the mouth shape of the user with the preset wake-up mouth shape further comprises:
acquiring a user image;
according to the user image, recognizing the mouth shape of the user in the user image;
and comparing the mouth shape of the user with the preset awakening mouth shape, and judging whether the mouth shape of the user is matched with the preset awakening mouth shape.
7. Wake-up method according to claim 1, characterized in that the wake-up operation comprises issuing a wake-up feedback tone to the user.
8. A wake-up device, for use in a wake-up method according to any one of claims 1 to 7, comprising:
the device comprises a wake-up word acquisition module, a voice wake-up word acquisition module and a voice recognition module, wherein the wake-up word acquisition module is used for acquiring a voice wake-up word which corresponds to a preset wake-up mouth shape;
the comparison module is used for comparing the position of a sound source sending the voice awakening word with the position of a user and comparing the mouth shape of the user with the preset awakening mouth shape;
and the awakening module is used for executing awakening operation when the position of the sound source is matched with the position of the user and the mouth shape of the user is matched with the preset awakening mouth shape.
9. A computer-readable storage medium, on which a computer program is stored which, when being executed by a processor, carries out the steps of the wake-up method of any one of claims 1 to 7.
10. An air conditioner comprising: memory, processor and computer program stored on the memory and executable on the processor, characterized in that the processor implements the wake-up method according to any of claims 1 to 7 when executing the computer program.
CN202010406233.7A 2020-05-14 2020-05-14 Awakening method and device, computer storage medium and air conditioner Pending CN111599361A (en)

Priority Applications (1)

Application Number Priority Date Filing Date Title
CN202010406233.7A CN111599361A (en) 2020-05-14 2020-05-14 Awakening method and device, computer storage medium and air conditioner

Applications Claiming Priority (1)

Application Number Priority Date Filing Date Title
CN202010406233.7A CN111599361A (en) 2020-05-14 2020-05-14 Awakening method and device, computer storage medium and air conditioner

Publications (1)

Publication Number Publication Date
CN111599361A true CN111599361A (en) 2020-08-28

Family

ID=72192227

Family Applications (1)

Application Number Title Priority Date Filing Date
CN202010406233.7A Pending CN111599361A (en) 2020-05-14 2020-05-14 Awakening method and device, computer storage medium and air conditioner

Country Status (1)

Country Link
CN (1) CN111599361A (en)

Cited By (9)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN112188341A (en) * 2020-09-24 2021-01-05 江苏紫米电子技术有限公司 Earphone awakening method and device, earphone and medium
CN112433770A (en) * 2020-11-19 2021-03-02 北京华捷艾米科技有限公司 Wake-up method and device for equipment, electronic equipment and computer storage medium
CN112634911A (en) * 2020-12-21 2021-04-09 苏州思必驰信息科技有限公司 Man-machine conversation method, electronic device and computer readable storage medium
CN112669837A (en) * 2020-12-15 2021-04-16 北京百度网讯科技有限公司 Awakening method and device of intelligent terminal and electronic equipment
CN113066488A (en) * 2021-03-26 2021-07-02 深圳市欧瑞博科技股份有限公司 Voice wake-up intelligent control method and device, electronic equipment and storage medium
CN113096656A (en) * 2021-03-30 2021-07-09 深圳创维-Rgb电子有限公司 Terminal device awakening method and device and computer device
CN113257251A (en) * 2021-05-11 2021-08-13 深圳优地科技有限公司 Robot user identification method, apparatus and storage medium
CN115223548A (en) * 2021-06-29 2022-10-21 达闼机器人股份有限公司 Voice interaction method, voice interaction device and storage medium
WO2024034980A1 (en) * 2022-08-09 2024-02-15 Samsung Electronics Co., Ltd. Context-aware false trigger mitigation for automatic speech recognition (asr) systems or other systems

Citations (6)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN104199545A (en) * 2014-08-28 2014-12-10 青岛海信移动通信技术股份有限公司 Method and device for executing preset operations based on mouth shapes
CN105096935A (en) * 2014-05-06 2015-11-25 阿里巴巴集团控股有限公司 Voice input method, device, and system
CN207440970U (en) * 2017-08-29 2018-06-01 来邦科技股份公司 A kind of alarm terminal based on Mouth-Shape Recognition
CN108154140A (en) * 2018-01-22 2018-06-12 北京百度网讯科技有限公司 Voice awakening method, device, equipment and computer-readable medium based on lip reading
CN110910878A (en) * 2019-11-27 2020-03-24 珠海格力电器股份有限公司 Voice wake-up control method and device, storage medium and household appliance
CN111145739A (en) * 2019-12-12 2020-05-12 珠海格力电器股份有限公司 Vision-based awakening-free voice recognition method, computer-readable storage medium and air conditioner

Patent Citations (6)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN105096935A (en) * 2014-05-06 2015-11-25 阿里巴巴集团控股有限公司 Voice input method, device, and system
CN104199545A (en) * 2014-08-28 2014-12-10 青岛海信移动通信技术股份有限公司 Method and device for executing preset operations based on mouth shapes
CN207440970U (en) * 2017-08-29 2018-06-01 来邦科技股份公司 A kind of alarm terminal based on Mouth-Shape Recognition
CN108154140A (en) * 2018-01-22 2018-06-12 北京百度网讯科技有限公司 Voice awakening method, device, equipment and computer-readable medium based on lip reading
CN110910878A (en) * 2019-11-27 2020-03-24 珠海格力电器股份有限公司 Voice wake-up control method and device, storage medium and household appliance
CN111145739A (en) * 2019-12-12 2020-05-12 珠海格力电器股份有限公司 Vision-based awakening-free voice recognition method, computer-readable storage medium and air conditioner

Cited By (13)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN112188341A (en) * 2020-09-24 2021-01-05 江苏紫米电子技术有限公司 Earphone awakening method and device, earphone and medium
CN112188341B (en) * 2020-09-24 2024-03-12 江苏紫米电子技术有限公司 Earphone awakening method and device, earphone and medium
CN112433770A (en) * 2020-11-19 2021-03-02 北京华捷艾米科技有限公司 Wake-up method and device for equipment, electronic equipment and computer storage medium
CN112669837A (en) * 2020-12-15 2021-04-16 北京百度网讯科技有限公司 Awakening method and device of intelligent terminal and electronic equipment
CN112634911A (en) * 2020-12-21 2021-04-09 苏州思必驰信息科技有限公司 Man-machine conversation method, electronic device and computer readable storage medium
CN113066488B (en) * 2021-03-26 2023-10-27 深圳市欧瑞博科技股份有限公司 Voice wakeup intelligent control method and device, electronic equipment and storage medium
CN113066488A (en) * 2021-03-26 2021-07-02 深圳市欧瑞博科技股份有限公司 Voice wake-up intelligent control method and device, electronic equipment and storage medium
CN113096656A (en) * 2021-03-30 2021-07-09 深圳创维-Rgb电子有限公司 Terminal device awakening method and device and computer device
CN113257251A (en) * 2021-05-11 2021-08-13 深圳优地科技有限公司 Robot user identification method, apparatus and storage medium
CN113257251B (en) * 2021-05-11 2024-05-24 深圳优地科技有限公司 Robot user identification method, apparatus and storage medium
CN115223548A (en) * 2021-06-29 2022-10-21 达闼机器人股份有限公司 Voice interaction method, voice interaction device and storage medium
CN115223548B (en) * 2021-06-29 2023-03-14 达闼机器人股份有限公司 Voice interaction method, voice interaction device and storage medium
WO2024034980A1 (en) * 2022-08-09 2024-02-15 Samsung Electronics Co., Ltd. Context-aware false trigger mitigation for automatic speech recognition (asr) systems or other systems

Similar Documents

Publication Publication Date Title
CN111599361A (en) Awakening method and device, computer storage medium and air conditioner
CN108231079B (en) Method, apparatus, device and computer-readable storage medium for controlling electronic device
KR102335717B1 (en) Voice control system and wake-up method thereof, wake-up device and home appliance, coprocessor
CN111223497B (en) Nearby wake-up method and device for terminal, computing equipment and storage medium
CN107403621B (en) Voice wake-up device and method
JP4675840B2 (en) Remote controller and home appliance
US10991372B2 (en) Method and apparatus for activating device in response to detecting change in user head feature, and computer readable storage medium
CN107103906B (en) Method for waking up intelligent device for voice recognition, intelligent device and medium
WO2018169568A1 (en) Query endpointing based on lip detection
CN105575395A (en) Voice wake-up method and apparatus, terminal, and processing method thereof
CN111161714B (en) Voice information processing method, electronic equipment and storage medium
CN108806673B (en) Intelligent device control method and device and intelligent device
TW201403590A (en) Signal processing apparatus and signal processing method
CN109166575A (en) Exchange method, device, smart machine and the storage medium of smart machine
CN112130918A (en) Intelligent device awakening method, device and system and intelligent device
KR20190001067A (en) Method and apparatus for speech recognition
CN112233676A (en) Intelligent device awakening method and device, electronic device and storage medium
CN112420044A (en) Voice recognition method, voice recognition device and electronic equipment
CN113506568A (en) Central control and intelligent equipment control method
CN113160815A (en) Intelligent control method, device and equipment for voice awakening and storage medium
CN112272332B (en) Awakening method and device of intelligent set top box, electronic equipment and storage medium
CN116226701A (en) Identification module, identification method and intelligent door lock
CN112269322A (en) Awakening method and device of intelligent device, electronic device and medium
CN113905264A (en) Voice control system based on voice remote controller
CN112786044A (en) Voice control method, device, main controller, robot and storage medium

Legal Events

Date Code Title Description
PB01 Publication
PB01 Publication
SE01 Entry into force of request for substantive examination
SE01 Entry into force of request for substantive examination
RJ01 Rejection of invention patent application after publication

Application publication date: 20200828

RJ01 Rejection of invention patent application after publication