WO2020215741A1

WO2020215741A1 - Speech recognition device and wake-up response method therefor, and computer storage medium

Info

Publication number: WO2020215741A1
Application number: PCT/CN2019/124117
Authority: WO
Inventors: 何瑞澄
Original assignee: 广东美的白色家电技术创新中心有限公司; 美的集团股份有限公司
Priority date: 2019-04-26
Filing date: 2019-12-09
Publication date: 2020-10-29
Also published as: CN111862964A; CN111862964B

Abstract

A speech recognition device and a wake-up response method therefor, wherein a plurality of speech recognition devices form a regional network, and the plurality of speech recognition devices are divided into a central device and at least one non-central device. The wake-up response method comprises: the speech recognition devices analyzing a collected speech signal so as to obtain distance information (S201); the central device receiving the distance information of the non-central device (S202); the central device comparing the distance information of the central device to the distance information of the non-central device; determining a speech recognition device to send a response (S203), and the central device sending to the non-central device a notification for whether to respond to the speech signal (S204); and the speech recognition device to send a response responding to the speech signal (S205). One speech recognition device from a plurality of speech recognition devices that may respond to a speech signal may be determined to respond to the speech signal.

Description

Speech recognition equipment, wake-up response method thereof, and computer storage medium

This application claims priority to the Chinese patent application No. 2019103430447, filed on April 26, 2019 and entitled "Voice Recognition Device and its Wake-up Response Method, Computer Storage Medium", which is incorporated into this application by reference.

【Technical Field】

This application relates to the field of voice wake-up, in particular to a voice recognition device and its wake-up response method, and computer storage media.

【Background】

Technologies such as voice recognition and voice interaction have been applied in many fields. Devices equipped with voice recognition systems generally respond to voice signals after being awakened when they receive voice signals.

Multiple voice recognition devices in the same area may all be awakened by the same voice signal and respond at the same time. In typical scenarios the user intends to wake only one voice recognition device, so several devices waking up and responding simultaneously causes mutual interference. For example, the sound that one voice recognition device broadcasts in response to the voice signal may be picked up and responded to by another voice recognition device, and vice versa.

【Summary of the invention】

The present application provides a wake-up response method for a voice recognition device, a voice recognition device, and a computer storage medium, so as to solve the mutual interference problem caused by multiple voice recognition devices responding to the wake-up voice at the same time in the prior art.

To solve the above technical problem, this application provides a wake-up response method for a voice recognition device. Multiple voice recognition devices form a regional network and are divided into a central device and at least one non-central device. The wake-up response method includes: the central device analyzing the collected voice signal to obtain distance information of the central device, which indicates the distance between the central device and the signal source of the voice signal; receiving distance information of the non-central device, which is obtained by the non-central device analyzing the voice signal it collected and indicates the distance between the non-central device and the signal source; and comparing the distance information of the central device with the distance information of the non-central device to determine the voice recognition device to respond, that is, the voice recognition device in the regional network that responds to the voice signal.

To solve the above technical problem, this application also provides a wake-up response method for a voice recognition device as seen from the non-central device. Multiple voice recognition devices form a regional network and are divided into a central device and at least one non-central device. The wake-up response method includes: the non-central device analyzing the collected voice signal to obtain distance information of the non-central device, which indicates the distance between the non-central device and the signal source of the voice signal; and sending the distance information of the non-central device to the central device, so that the central device compares it with the distance information of the central device to determine the voice recognition device to respond. The distance information of the central device indicates the distance between the central device and the signal source of the voice signal, and the voice recognition device to respond is the voice recognition device in the regional network that responds to the voice signal.

In order to solve the above technical problems, the present application provides a voice recognition device, which includes a processor and a memory, a computer program is stored in the memory, and the processor is used to execute the computer program to implement the steps of the wake-up response method.

In order to solve the above technical problems, this application provides a computer storage medium in which a computer program is stored, and when the computer program is executed, the steps of the above wake-up response method are realized.

In the wake-up response method of the present application, multiple voice recognition devices form a network, and each voice recognition device determines its distance information relative to the signal source of the voice signal by analyzing the voice signal it collected. The voice recognition devices are divided into a central device and at least one non-central device. The central device obtains its own distance information and receives the distance information of the non-central device; it then compares the two to determine the voice recognition device to respond, that is, the voice recognition device in the regional network that responds to the voice signal. In the present application, the voice recognition devices forming the network withhold their response after being awakened by the voice signal. The central device first determines which one should respond, which avoids the mutual interference that arises when multiple voice recognition devices respond.

【Brief description of the drawings】

Figure 1 is a schematic diagram of the structure of a network formed by interconnecting voice recognition devices of the present application;

Figure 2 is a schematic flow diagram of the application of the wake-up response method of the voice recognition device of the present application in a single area network;

Figure 3 is a schematic diagram of the positional relationship between three microphones of a linear array and a signal source;

FIG. 4 is a schematic flow diagram of the application of the wake-up response method of the voice recognition device of the present application in a multi-area network;

FIG. 5 is a schematic diagram of the work flow of the hub device side of the wake-up response method of the voice recognition device of this application;

FIG. 6 is a schematic diagram of the non-central device side work flow of the wake-up response method of the voice recognition device of this application;

FIG. 7 is a schematic structural diagram of an embodiment of a speech recognition device according to the present application;

Fig. 8 is a schematic structural diagram of an embodiment of a computer storage medium of the present application.

【Detailed description】

To enable those skilled in the art to better understand the technical solutions of the present invention, the wake-up response method for a voice recognition device, the voice recognition device, and the computer storage medium provided in this application are described in further detail below in conjunction with the accompanying drawings and specific implementations.

The wake-up response method of the present application applies to situations in which multiple voice recognition devices can respond to the same voice signal. Taking household appliances as an example, there may be multiple household appliances in the same area or in several adjacent areas, all of which have a voice recognition function and therefore act as voice recognition devices. For example, the living room area contains voice recognition devices such as televisions, air conditioners, and refrigerators; the kitchen area contains voice recognition devices such as refrigerators, microwave ovens, kettles, and rice cookers. When the user sends out a voice signal in the living room area, the way sound propagates means that several household appliances in that area may receive the voice signal and respond to it. When several appliances respond, the response sound of household appliance A may be received and responded to by household appliance B, which causes mutual interference between the appliances and prevents them from responding properly to the user's request. Similarly, when a user sends out a voice signal between the living room area and the kitchen area, devices in both areas can receive and respond to the voice signal, and mutual interference may again occur.

The voice recognition device of the present application works in a wake-first, respond-later mode: it is first awakened by a voice signal sent by the user and then responds to that voice signal. This application introduces a selection mechanism between wake-up and response: after being awakened by a voice signal, the device withholds its response and responds only once it has been determined that it should.
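The wake-first, respond-later behaviour can be pictured as a small state machine. The following is a minimal sketch only; the `State` and `Device` names and the `on_wake_word` / `on_decision` methods are hypothetical and do not come from the application.

```python
from enum import Enum, auto


class State(Enum):
    IDLE = auto()        # listening for a wake-up voice signal
    AWAKE = auto()       # awakened, response withheld pending a decision
    RESPONDING = auto()  # selected to respond to the voice signal


class Device:
    """Hypothetical device model; all names are illustrative."""

    def __init__(self, name: str) -> None:
        self.name = name
        self.state = State.IDLE

    def on_wake_word(self) -> None:
        # Being awakened does not by itself trigger a response.
        if self.state is State.IDLE:
            self.state = State.AWAKE

    def on_decision(self, should_respond: bool) -> None:
        # Only a device that is awake and selected goes on to respond.
        if self.state is State.AWAKE:
            self.state = State.RESPONDING if should_respond else State.IDLE
```

A device that was never awakened ignores a decision, which matches the idea that only awakened devices are candidates for responding.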

Specifically, for a single area, multiple voice recognition devices are connected to each other to form a regional network. One voice recognition device serves as the hub device of the regional network (the hub device is the central device referred to above). The hub device determines which voice recognition device in the regional network responds to the voice signal.

For multiple areas, the hub device of each regional network first determines the voice recognition device to respond within its own regional network. A first hub device among all the hub devices then determines which of these candidate voice recognition devices actually responds, thereby solving the problem of mutual interference caused by multiple voice recognition devices responding to the voice signal.

In household appliance applications, the hub device must be able to react to the user's voice signal at any time in order to determine which device responds. The hub device is therefore generally chosen from household appliances that stay connected to the power supply and are rarely powered off, preferably one with an interactive screen, which makes related settings easier. For example, a refrigerator can serve as the hub device.

Generally speaking, the household appliances in each area, such as the living room area or the kitchen area, can form a regional network. A regional network corresponds to the division into areas; at the network-connection level the appliances do not necessarily form a separate network per area, that is, the household appliances in all areas of a home may be interconnected to form a single whole-home appliance network.

The network formed in this application includes, but is not limited to, a local area network built on a Wi-Fi wireless network, a wired network, Bluetooth mesh, Zigbee, RS485, LoRa, IEEE 1394, CAN, and so on. The communication mechanism of the network includes, but is not limited to, UDP, TCP/IP, HTTP, MQTT, CoAP, etc., to ensure that the voice recognition devices on the same network can exchange information quickly and reliably.

With regard to the wake-up response method of the present application, the following describes the wake-up response method starting from the network formed by the voice recognition device.

Please refer to FIG. 1, which is a schematic diagram of the structure of a network formed by interconnecting voice recognition devices of this application. The space in Figure 1 is divided into living room area A, kitchen area B, and bedroom area C. In living room area A, the voice recognition devices are refrigerator A1, TV A2, and air purifier A3; in kitchen area B, they are range hood B1, rice cooker B2, and high-speed blender B3; in bedroom area C, they are air conditioner C1 and humidifier C2. All voice recognition devices are connected to form a network, and the voice recognition devices in each area also form a regional network.

The voice recognition devices in each regional network are divided into a hub device and at least one non-central device, and the hub device determines the voice recognition device that is to respond to the voice signal in its regional network. The hub devices of all regional networks are further divided into a first hub device and at least one second hub device. The first hub device determines which regional network's voice recognition device responds to the voice signal.

In some embodiments of this application, the voice recognition devices in a regional network are not only divided into hub devices and non-central devices but also have a wake-up priority. The wake-up priority can be set by the manufacturer when the voice recognition device leaves the factory; once the network is formed, the voice recognition device with the highest wake-up priority automatically serves as the hub device of the regional network. The wake-up priority can also be set when the network is built, either by the user or by the service provider who builds the network; the voice recognition device with the highest configured wake-up priority then serves as the hub device of the network.

In the network shown in Figure 1, the priority order in living room area A is A1>A2>A3, in kitchen area B it is B1>B2>B3, and in bedroom area C it is C1>C2; A1, B1, and C1 serve as the hub devices of their respective regional networks. There is also a priority order among the hub devices of the regional networks: A1>B1>C1. In this application, A1 is the first hub device, and B1 and C1 are the second hub devices.
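The role assignment in Figure 1 follows directly from the priority orderings. A minimal sketch, assuming each ordering is stored as a plain list with the highest wake-up priority first (the names `AREAS`, `HUB_ORDER`, `area_hubs`, and `first_hub` are hypothetical):

```python
# Wake-up priority per area as given for Figure 1, highest priority first.
AREAS = {
    "A": ["A1", "A2", "A3"],
    "B": ["B1", "B2", "B3"],
    "C": ["C1", "C2"],
}

# Priority order among the hub devices (A1 > B1 > C1), highest first.
HUB_ORDER = ["A1", "B1", "C1"]


def area_hubs(areas):
    """The highest-priority device of each area acts as that area's hub."""
    return {area: devices[0] for area, devices in areas.items()}


def first_hub(hubs, hub_order):
    """The highest-priority hub acts as the first hub device."""
    return next(dev for dev in hub_order if dev in hubs.values())
```

With the Figure 1 data, `area_hubs(AREAS)` maps areas A, B, and C to A1, B1, and C1, and `first_hub` selects A1; the remaining hubs are the second hub devices.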

The network shown in Figure 1 supports wake-up response both in a single area and across multiple areas; see Figures 2 and 4 for details. Figure 2 is a schematic flow diagram of the wake-up response method of the voice recognition device of this application applied to a single area network, and Figure 4 is the corresponding flow diagram for a multi-area network.

As shown in Figure 2, the implementation of the wake-up response method in a single area network includes the following steps.

S201: The voice recognition device analyzes the collected voice signal to obtain distance information.

In this step, the voice recognition device performs two main actions: collection and analysis. After the user, who is the signal source, sends out the voice signal, the voice recognition device can collect it. Because each voice recognition device is in a different position relative to the user, the voice signal each one collects also differs. A voice recognition device far from the user may fail to collect the voice signal even though it belongs to the regional network.

The voice recognition devices analyze the voice signals collected by each. In this embodiment, all voice recognition devices in each regional network have the same voice signal analysis mechanism to facilitate subsequent comparison calculations. The voice signal is analyzed and calculated to obtain distance information, which indicates the distance between the voice recognition device and the signal source of the voice signal.

Since it is necessary to determine the voice recognition device to respond to the voice signal based on the distance information, the distance information includes the identification of the voice recognition device and the distance value used for judgment.

In this embodiment, the distance value of the distance information may be determined according to the voice signals collected by at least three microphones. That is, at least three microphones are provided on the voice recognition device, and each microphone collects voice signals. Firstly, at least three microphones are used to separately collect voice signals, where the relative positions of the at least three microphones on the voice recognition device are fixed; then, the distance value of the distance information is calculated based on the relative positions of the at least three microphones and the voice signals respectively collected.

Specifically, the relative positions of the at least three microphones and the signal source are calculated from the voice signals collected by the microphones; the distance value of the distance information is then calculated from those relative positions together with the relative positions of the microphones with respect to one another.

For example, if the voice recognition device has three microphones arranged in a linear array, please refer to FIG. 3, which is a schematic diagram of the positional relationship between the three microphones of the linear array and the signal source.

For the specific calculation, a DOA (direction of arrival) algorithm is first used to calculate the relative position of each pair of adjacent microphones and the signal source: applying the DOA algorithm to the voice signals of mic1 and mic2 gives the relative azimuth angle θ₁, and applying it to the voice signals of mic2 and mic3 gives the relative azimuth angle θ₂.

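The distance l between mic2 and the signal source is calculated from the following equations.

$$\tan\theta_1 = \frac{h}{x + 1.5d}$$

$$\tan\theta_2 = \frac{h}{x + 0.5d}$$

$$l = \sqrt{h^2 + (x + d)^2}$$

Here d is the spacing between adjacent microphones. Consistent with these equations, h is the perpendicular distance from the signal source to the line of the array and x is the offset along the array from the foot of that perpendicular to mic3, so that x + 0.5d and x + 1.5d locate the midpoints of the mic2/mic3 and mic1/mic2 pairs and x + d locates mic2. The distance value l calculated above is the distance value of the distance information between the voice recognition device and the signal source.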
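The three equations can be solved in closed form by eliminating h and x. The following is a minimal sketch of that arithmetic, assuming the two azimuth angles are already available in radians from a DOA step that is not shown; the function name `distance_to_source` is hypothetical.

```python
import math


def distance_to_source(theta1: float, theta2: float, d: float) -> float:
    """Distance l from mic2 to the signal source.

    theta1, theta2: azimuth angles (radians) from the mic1/mic2 and
    mic2/mic3 pairs; d: spacing between adjacent microphones.
    """
    t1, t2 = math.tan(theta1), math.tan(theta2)
    if math.isclose(t1, t2):
        # Equal angles leave x undetermined, so no distance can be derived.
        raise ValueError("angles too similar to triangulate")
    # Equate h = t1 * (x + 1.5d) = t2 * (x + 0.5d) and solve for x.
    x = d * (0.5 * t2 - 1.5 * t1) / (t1 - t2)
    h = t2 * (x + 0.5 * d)
    # l = sqrt(h^2 + (x + d)^2)
    return math.hypot(h, x + d)
```

As a synthetic check, a source placed at h = 2 and x = 1 with d = 0.1 yields angles from which the function recovers l = sqrt(2² + 1.1²), about 2.28. In practice the two angles differ only slightly when the source is far away compared with d, so the result is sensitive to angle error.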

For area A in this embodiment, the distance value of the distance information obtained by the device A1 is recorded as LA1, the distance value of the distance information obtained by the device A2 is recorded as LA2, and the distance value of the distance information obtained by the device A3 is recorded as LA3.

In this step S201, the hub device analyzes the collected voice signal to obtain the distance information of the hub device, and the non-central device analyzes the collected voice signal to obtain the distance information of the non-central device.

S202: The hub device receives the distance information of the non-central device.

After the voice recognition device calculates and obtains the distance information, the non-central device sends the distance information obtained by itself to the central device. In this embodiment, the hub device A1 receives the distance information sent by the non-central device.
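The application does not specify a payload format for the distance information. As one possible illustration, the device identification and distance value could be carried as a small JSON message over any of the transports listed earlier; the field names `device_id` and `distance` and both function names are assumptions made for this sketch.

```python
import json
from typing import Tuple


def encode_distance_info(device_id: str, distance: float) -> bytes:
    """Serialize distance information on the non-central device side."""
    # Field names are assumptions for this sketch, not from the application.
    message = {"device_id": device_id, "distance": distance}
    return json.dumps(message).encode("utf-8")


def decode_distance_info(payload: bytes) -> Tuple[str, float]:
    """Parse distance information on the hub device side."""
    message = json.loads(payload.decode("utf-8"))
    return message["device_id"], float(message["distance"])
```

The hub device would call `decode_distance_info` on each payload it receives and collect the results for the comparison in step S203.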

S203: The hub device compares the distance information of the hub device with the distance information of the non-central device, and determines the voice recognition device to respond.

In this step, the hub device compares its own distance information with the distance information of the non-central device in order to determine the voice recognition device in the regional network that responds to the voice signal. Specifically, the hub device uses a sorting algorithm to order the distance values of all the distance information and thereby finds the distance information with the smallest distance value, which identifies the voice recognition device closest to the signal source of the voice signal. The closer a device is, the more likely it is that the user addressed the voice signal to that device. The voice recognition device corresponding to the distance information with the smallest distance value is the voice recognition device to respond.

Sorting algorithms include, but are not limited to, insertion sort, Shell sort, selection sort, heap sort, bubble sort, quick sort, merge sort, counting sort, bucket sort, radix sort, etc. In this embodiment, the order of the distance values of the distance information is LA2<LA1<LA3.

When the distance information is compared, two or more items of distance information may share the smallest distance value. In this case, the device that responds to the voice signal is determined by the wake-up priority of the voice recognition devices: among the voice recognition devices corresponding to the smallest distance value, the one with the highest priority is determined to be the voice recognition device to respond.
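The comparison in step S203, including the tie-break on wake-up priority, can be sketched as follows. The text describes sorting the distance values; a single minimum search is swapped in here because it yields the same winner. The function name `choose_responder` and the numeric distances in the usage note are made up for illustration.

```python
def choose_responder(distances, priority_order):
    """Pick the voice recognition device that should respond.

    distances: {device_id: distance value}
    priority_order: device ids, highest wake-up priority first
    """
    rank = {dev: i for i, dev in enumerate(priority_order)}
    # Smallest distance wins; equal distances fall back to wake-up priority.
    return min(distances, key=lambda dev: (distances[dev], rank[dev]))
```

For example, with distances `{"A1": 2.0, "A2": 1.2, "A3": 3.5}` (so LA2<LA1<LA3) and priority order `["A1", "A2", "A3"]`, the function selects A2. If A1 and A2 both reported 1.2, A1 would be selected because of its higher wake-up priority. Ties are detected by exact equality here; a real system might prefer a tolerance.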

S204: The hub device sends a notification whether to respond to the voice signal to the non-central device.

After the hub device determines the voice recognition device to respond, it can send a notification of whether to respond to the voice signal over the network to the non-central devices, that is, to all voice recognition devices that have been awakened but have not yet responded. The notification may be an explicit instruction to respond or not to respond, or it may be the device information of the voice recognition device that was determined to respond. Alternatively, the hub device may send a notification only to the voice recognition device to respond; that device responds upon receiving the notification, and the other voice recognition devices, which receive no notification, do not respond.
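The two notification strategies described for step S204 can be sketched as a function that returns, for each device to be notified, whether it should respond. The function name `build_notifications` and the `notify_all` flag are hypothetical.

```python
def build_notifications(awake_devices, responder, notify_all=True):
    """Map each device to notify onto a respond / do-not-respond flag.

    awake_devices: ids of devices awakened by the voice signal
    responder: id of the device determined to respond
    """
    if notify_all:
        # Every awakened device is told explicitly whether to respond.
        return {dev: dev == responder for dev in awake_devices}
    # Alternative: only the chosen device is notified; the others,
    # receiving nothing, do not respond.
    return {responder: True}
```

The second strategy sends fewer messages, but a device that receives nothing cannot tell a lost notification from a decision not to respond, so it would presumably need a timeout before going back to idle; that timeout is an assumption of this note, not something the application describes.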

S205: The voice recognition device determined to respond responds to the voice signal.

The identified voice recognition device can respond to the voice signal, while other voice recognition devices do not. It is ensured that only one voice recognition device responds to the voice signal without causing mutual interference.

The method shown in Figure 2 above applies to voice wake-up recognition in a single area network. After a voice recognition device in the single area network is awakened by the voice signal, it does not respond immediately; it responds only after the hub device of the single area network has determined the responding device.

The wake-up response method for a multi-area network builds on the determination of the voice recognition device to respond in a single area network, as shown in Figure 2. Specifically, a multi-area network consists of multiple interconnected regional networks. The hub devices of the regional networks are connected to each other and are divided into a first hub device and at least one second hub device. After each regional network has determined its voice recognition device to respond, the first hub device further confirms which voice recognition device responds to the voice signal.

The steps for implementing the wake-up response method for each regional network in the multi-regional network will not be repeated. Please also refer to FIG. 4. The wake-up response method of the multi-regional network further includes the following steps.

S401: The second hub device sends second distance information to the first hub device, and the first hub device receives the second distance information.

In a multi-area network, the first hub device needs to compare the distance information of the voice recognition devices to respond from all regional networks in order to determine the voice recognition device that actually responds to the voice signal. The voice recognition device to respond is the device that each single regional network has determined should respond. In a multi-area network, however, that device does not respond immediately; instead, the first hub device receives the candidates from the regional networks and confirms which one responds, that is, it determines the final voice recognition device that responds to the voice signal. Therefore, in step S401, the second hub device sends its second distance information to the first hub device. The second distance information is the distance information of the voice recognition device to respond in the regional network where the second hub device is located.

For example, in area A, A1 compares LA1, LA2, and LA3 and determines that the voice recognition device to respond is A2; in area B, B1 compares LB1, LB2, and LB3 and determines that the voice recognition device to respond is B3; in area C, C1 compares LC1 and LC2 and determines that the voice recognition device to respond is C1.

B1 sends A1 the distance information LB3 of B3, the voice recognition device to respond in its regional network, and C1 likewise sends the distance information LC1 to A1. The distance information of A2, the voice recognition device to respond that A1 itself determined, is LA2.

S402: The first hub device compares the second distance information with the first distance information, and determines a voice recognition device that responds to the voice signal.

The first hub device compares the distance information of each voice recognition device to respond, that is, the first distance information and the second distance information. The first distance information is the distance information of the voice recognition device to respond in the regional network where the first hub device is located.

The comparison process of this step S402 is similar to the comparison process of the foregoing step S203, and the details are not repeated here. That is, the distance value of the first distance information and the distance value of the second distance information are compared to obtain the distance information with the smallest distance value; the voice recognition device corresponding to the distance information with the smallest distance value is determined to respond to the voice signal.

In this embodiment, A1 compares LA2, LB3, and LC1 and thereby determines the voice recognition device that responds to the voice signal, for example, B3. As before, two or more items of distance information may share the smallest distance value. In this case, the device that responds to the voice signal is further determined by the wake-up priority of the voice recognition devices: among the voice recognition devices corresponding to the smallest distance value, the one with the highest priority is determined to be the voice recognition device that responds.
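The two-stage decision of Figure 4 can be sketched by applying the same nearest-device rule twice: once per regional network, then once across the per-area candidates. The application does not say how wake-up priorities of non-hub devices in different areas compare, so the single `global_priority` list below is an assumption, as are the function names and the sample distances in the usage note.

```python
def pick_nearest(candidates, priority_order):
    """candidates: {device_id: distance}; ties go to the higher priority."""
    rank = {dev: i for i, dev in enumerate(priority_order)}
    return min(candidates, key=lambda dev: (candidates[dev], rank[dev]))


def multi_area_responder(area_distances, area_priority, global_priority):
    """Return the single device that responds across all areas.

    area_distances: {area: {device_id: distance}}
    area_priority: {area: device ids, highest priority first}
    global_priority: assumed cross-area ordering, highest first
    """
    # Stage 1: each hub picks the candidate for its own regional network.
    finalists = {}
    for area, distances in area_distances.items():
        winner = pick_nearest(distances, area_priority[area])
        finalists[winner] = distances[winner]
    # Stage 2: the first hub compares the candidates' distance values.
    return pick_nearest(finalists, global_priority)
```

With made-up distances in which A2, B3, and C1 win their areas at 1.5, 0.9, and 4.0 respectively, the first hub compares those three values and selects B3.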

S403: The first hub device sends a notification whether to respond to the voice signal to other voice recognition devices in the multi-area network.

After the first hub device determines the voice recognition device that responds to the voice signal, it can send notifications directly to the entire network, that is, to all the regional networks, or it can first notify the hub device of each regional network, which then notifies the non-central devices in its network. As before, the notification may be sent only to the voice recognition device that responds to the voice signal, in which case the other devices, which receive no notification, do not respond.

S404: The determined voice recognition device responds to the voice signal.

This step S404 is similar to the above step S205, and will not be described again.

The method shown in Figure 4 applies to multi-area voice wake-up recognition. After each area has determined the voice recognition device that should respond within that area, the first hub device further determines which area's device responds, ensuring that only one voice recognition device responds to the voice signal.

In the networks of Figure 2 and Figure 4, the voice recognition devices have a wake-up priority order. When the voice recognition device with the highest priority fails, the device with the next highest wake-up priority can therefore be identified from that order and serve as the hub device or the first hub device.

A voice recognition device can check periodically, or whenever the regional network changes, whether it has the highest wake-up priority in the regional network. In response to detecting that it currently has the highest wake-up priority in the regional network, it operates as the hub device.
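The self-check described above can be sketched as a pure function that each device runs periodically or on a network change. How a device learns which peers are currently reachable is not described in the application; the `online` set and the function name `should_act_as_hub` are assumptions for illustration.

```python
def should_act_as_hub(self_id, priority_order, online):
    """Return True if this device has the highest wake-up priority
    among the devices currently online in its regional network.

    priority_order: device ids, highest wake-up priority first
    online: set of device ids currently reachable (including self)
    """
    for dev in priority_order:
        if dev in online:
            # The first online device in priority order is the hub.
            return dev == self_id
    return False
```

For example, with priority order A1>A2>A3, if A1 fails and only A2 and A3 remain online, A2 detects that it now has the highest wake-up priority and takes over as hub device, while A3 stays a non-central device.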

The wake-up response method implemented in the network of this embodiment is based on the fact that the voice recognition device in the network has a wake-up priority order, and the voice recognition device as a network hub device can compare distance information. Therefore, the voice recognition device newly added to the network also needs to comply with the wake-up mechanism of this embodiment, which can be set by the hub device.

The hub device can obtain the device information of a voice recognition device that joins the network and analyze that device information according to preset rules in order to re-order the wake-up priority of the voice recognition devices in the network.

Each voice recognition device is equipped with a voice recognition system, which determines the wake-up priority, voice recognition algorithm, wake-up template, etc. If the newly added voice recognition device has a different voice recognition system, that is, it has different wake-up priority settings, the network hub device can reorder according to its own wake-up priority settings. For example, in the network A1-A2-A3, the newly added voice recognition device A4, whose wake-up priority is set to be greater than A3, can reorder the wake-up priority as A1>A2>A4>A3.

If the newly added voice recognition device has the same voice recognition system, that is, it has the same wake-up priority setting, the wake-up priority of the voice recognition device that joins the network first will be higher. For example, the newly added voice recognition device A3 has the same voice recognition system as the previous A3, the previous A3 is used as A31, the newly added one is used as A32, and the wake-up priority is reordered as A1>A2>A31>A32.
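Both re-ordering cases can be sketched with a single sort key. The sketch assumes the hub device has already assigned each device a numeric rank from its own preset rules (lower rank means higher wake-up priority) and records a join sequence number; the tuple layout and the name `reorder_priority` are hypothetical.

```python
def reorder_priority(devices):
    """Return device ids ordered from highest to lowest wake-up priority.

    devices: list of (device_id, rank, join_seq) tuples, where a lower
    rank means a higher priority and join_seq grows with join time.
    Devices with equal rank (same voice recognition system) are
    ordered by join time, earlier first.
    """
    ordered = sorted(devices, key=lambda d: (d[1], d[2]))
    return [device_id for device_id, _, _ in ordered]
```

With ranks 0, 1, 3 for A1, A2, A3 and a newly joined A4 ranked 2, the result is A1>A2>A4>A3. With two devices of the same rank, A31 joined earlier and A32 joined later, the result is A1>A2>A31>A32.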

For the network of this embodiment, all the steps in which the wake-up response method is implemented can be completed inside the network, so the voice recognition device of this embodiment can run offline.

In the single-area network formed by the interconnection of the above voice recognition devices, a voice recognition device can play one of two roles: it can operate as a central device (the hub device), or it can operate as a non-central device. Each voice recognition device can therefore be configured either as a central device with more powerful functions or as a more lightweight non-central device.

In the field of household appliances, large household appliances such as refrigerators and televisions can be loaded with a more powerful voice recognition system so that they can act as a central device; small household appliances such as rice cookers and electric kettles can be loaded with a lightweight voice recognition system so that they act only as non-central devices.

For a voice recognition device that can be used as a network hub device, please refer to FIG. 5 for the steps of implementing the wake-up response method. FIG. 5 is a schematic diagram of the hub device side workflow of the wake-up response method of the voice recognition device of the present application. As a network hub device, its wake-up response method includes the following steps.

S501: Analyze the collected voice signal to obtain distance information of the central device.

For the hub device in each area network, this step S501 is completed in the above step S201, and the details will not be repeated.

S502: Receive distance information of a non-central device.

This step S502 corresponds to the above step S202, and the details are not repeated here.

S503: Compare the distance information of the central device with the distance information of the non-central device, and determine the voice recognition device to be responded in the regional network.

This step S503 is similar to the above step S203, and the details will not be described in detail.

The above steps take the voice recognition device in the role of the central device to illustrate the steps of the single-area wake-up response method. The specific details of each step and of the operation of the central device have been described above, so they are not repeated here. The voice recognition device of this embodiment can determine, from among multiple voice recognition devices, one voice recognition device that responds to the voice signal, thereby avoiding the mutual interference that arises when all devices respond.
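The hub-side comparison of steps S501 to S503, together with the notification of step S204, can be sketched as follows. This is an illustrative sketch only: the dictionary of reported distances, the `priority_order` list, and the function names are assumptions. The selection rule follows the claims (smallest distance value, with ties resolved by the highest wake-up priority); exact equality is used to detect ties, whereas a real device would probably need a tolerance.

```python
def choose_responder(reports, priority_order):
    """Pick the device to respond within one regional network.

    reports: dict mapping device name to its distance value, including
             the hub device's own distance (S501) and the values received
             from non-central devices (S502).
    priority_order: device names from highest to lowest wake-up priority.
    """
    if not reports:
        raise ValueError("no distance information was collected")
    rank = {name: i for i, name in enumerate(priority_order)}
    # S503: smallest distance wins; equal distances fall back to priority.
    return min(reports, key=lambda name: (reports[name], rank[name]))


def build_notifications(reports, responder):
    """S204: tell every reporting device whether it should respond."""
    return {name: name == responder for name in reports}


reports = {"A1": 3.2, "A2": 1.5, "A3": 1.5}
responder = choose_responder(reports, ["A1", "A2", "A3"])
print(responder)                                # A2
print(build_notifications(reports, responder))  # {'A1': False, 'A2': True, 'A3': False}
```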

Further, for a multi-area network, the hub device is further divided into a first hub device and a second hub device. For the first hub device, it further performs the following steps.

S504: The first hub device receives the second distance information.

This step S504 is completed in the above step S401, and the details are not repeated here.

S506: Compare the first distance information with the second distance information, and determine a voice recognition device that responds to the voice signal.

This step S506 is similar to the above step S402, and the details are not repeated here.

For the second hub device, it performs the following steps.

S505: The second hub device sends second distance information to the first hub device, so that the first hub device compares the first distance information with the second distance information, so as to determine a voice recognition device that responds to the voice signal.

This step S505 is completed in the above steps S401-S402, and the details are not repeated here.

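Further, in a multi-area network, each regional network first determines its own voice recognition device to be responded, and the first hub device then determines which regional network's voice recognition device to be responded actually responds to the voice signal.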
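The multi-area comparison performed by the first hub device (steps S504 to S506) can be sketched as follows. This is a sketch under assumptions: the `Candidate` tuple, its fields, and the area and device names are hypothetical, and a single network-wide `priority_rank` is assumed so that equal distances can be resolved by wake-up priority as the claims describe.

```python
from typing import NamedTuple


class Candidate(NamedTuple):
    """Distance information of one regional network's device to be responded."""
    area: str
    device: str
    distance: float
    priority_rank: int  # hypothetical: lower value means higher wake-up priority


def choose_global_responder(first, seconds):
    """Compare first distance information with all second distance information.

    first: the candidate from the first hub device's own regional network.
    seconds: candidates sent by the second hub devices (S505 / S504).
    """
    # S506: smallest distance wins; ties fall back to wake-up priority.
    return min([first, *seconds], key=lambda c: (c.distance, c.priority_rank))


first = Candidate("living room", "A2", 1.5, 1)
seconds = [Candidate("kitchen", "B1", 0.8, 3), Candidate("bedroom", "C1", 2.4, 4)]
winner = choose_global_responder(first, seconds)
print(winner.area, winner.device)  # kitchen B1
```

Because each second hub device forwards only its own network's candidate, the first hub device compares one entry per regional network rather than one entry per device.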

From the perspective of a non-central device, please refer to FIG. 6 for the steps of implementing the wake-up response method. FIG. 6 is a schematic diagram of the non-central device side work flow of the voice recognition device wake-up response method of the present application. The voice recognition device is a non-central device, and the wake-up response method of this embodiment includes the following steps.

S601: Analyze the collected voice signal to obtain distance information of the non-central device.

This step S601 is similar to the above step S201, both of which are obtaining distance information, and the specific process will not be repeated.

S602: Send the distance information of the non-central device to the central device, so that the central device compares the distance information of the non-central device with the distance information of the central device to determine the voice recognition device to be responded to.

As a non-central device, after collecting the voice signal it does not respond immediately. Instead, it performs calculation and analysis to obtain distance information and transmits that distance information to the central device for analysis and comparison; the central device then determines the voice recognition device that responds to the voice signal.

In this embodiment, the voice recognition device in the role of a non-central device is used to illustrate the steps of the wake-up response method. The specific details of each step and of the operation of the non-central device have been described above and are not repeated here. The voice recognition device of this embodiment does not respond immediately after receiving the voice signal, but decides whether to respond after receiving the notification, which avoids the mutual interference caused by responding at the same time as other voice recognition devices.
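The non-central device workflow (S601, S602, then waiting for the notification) can be sketched as a single function. This is only a sketch: the four callables stand in for device-specific measurement, transport, and response code, and their names and signatures are assumptions, not an interface defined by this application.

```python
from typing import Callable


def non_central_wake_up(
    measure_distance: Callable[[], float],
    send_to_hub: Callable[[float], None],
    wait_for_notification: Callable[[], bool],
    respond: Callable[[], None],
) -> bool:
    """Run one wake-up cycle on a non-central device.

    Returns True if this device was told to respond to the voice signal.
    """
    distance = measure_distance()             # S601: analyze the collected signal
    send_to_hub(distance)                     # S602: report to the central device
    should_respond = wait_for_notification()  # the central device decides
    if should_respond:
        respond()                             # respond only when notified to do so
    return should_respond


# Usage with stand-in callables that record what happened.
log = []
non_central_wake_up(
    measure_distance=lambda: 2.0,
    send_to_hub=lambda d: log.append(("sent", d)),
    wait_for_notification=lambda: True,
    respond=lambda: log.append(("responded",)),
)
print(log)  # [('sent', 2.0), ('responded',)]
```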

The above wake-up response method is implemented by a voice recognition device, so this application also proposes a voice recognition device. Please refer to FIG. 7, which is a schematic structural diagram of an embodiment of the voice recognition device of this application. The voice recognition device 100 in this embodiment may be a household appliance and includes at least three microphones 11, a processor 12, and a memory 13 connected to each other. The voice recognition device 100 of this embodiment can implement the above-mentioned wake-up response method embodiment. The at least three microphones 11 have fixed relative positions and are used to collect voice signals, a computer program is stored in the memory 13, and the processor 12 is used to execute the computer program to implement the above wake-up response method.

Specifically, the at least three microphones 11 are used to collect voice signals. The processor 12 is used to calculate the distance information between the voice recognition device and the signal source of the voice signal according to the relative positions of the at least three microphones and the voice signals they respectively collect, to compare all the distance information to determine the voice recognition device that responds to the voice signal, and to send other voice recognition devices a notification of whether they should respond to the voice signal.

Alternatively, the at least three microphones 11 are used to collect voice signals. The processor 12 is used to calculate the distance information between the voice recognition device and the signal source of the voice signal based on the relative positions of the at least three microphones and the voice signals they respectively collect, to send the distance information to the central device, and to determine whether to respond according to the notification received from the central device indicating whether the device should respond to the voice signal.
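One way the processor could turn microphone geometry into a distance value is by triangulating two bearings, sketched below. This is an assumption-laden illustration, not the calculation defined by this application: it assumes a linear array with microphones at x = -spacing, 0, and +spacing, assumes a DOA step has already produced one bearing per adjacent microphone pair (measured in radians from the positive array axis at the pair's midpoint), and uses a hypothetical function name. The DOA estimation itself is not shown.

```python
import math


def source_distance(theta_left, theta_right, spacing):
    """Distance from the middle microphone to the signal source.

    theta_left:  bearing at the midpoint of the left pair  (x = -spacing / 2).
    theta_right: bearing at the midpoint of the right pair (x = +spacing / 2).
    spacing:     distance between adjacent microphones, in metres.
    """
    for theta in (theta_left, theta_right):
        if not 0.0 < theta < math.pi:
            raise ValueError("bearings must lie strictly between 0 and pi")
    cot_left = math.cos(theta_left) / math.sin(theta_left)
    cot_right = math.cos(theta_right) / math.sin(theta_right)
    # The two bearing rays start `spacing` apart on the array axis and
    # meet at the source: spacing = y * (cot_left - cot_right).
    denom = cot_left - cot_right
    if denom <= 1e-12:
        raise ValueError("bearings do not intersect in front of the array")
    y = spacing / denom
    x = y * cot_left - spacing / 2.0
    return math.hypot(x, y)


# Check against a known source position (0.3 m along the axis, 1.0 m in front).
spacing = 0.1
left = math.atan2(1.0, 0.3 + spacing / 2.0)
right = math.atan2(1.0, 0.3 - spacing / 2.0)
print(round(source_distance(left, right, spacing), 4))  # 1.044
```

With closely spaced microphones the two bearings differ only slightly, so small angle errors produce large distance errors; a practical device would likely smooth or average the estimate before reporting it.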

The processor 12 may be an integrated circuit chip with signal processing capability. It may also be a general-purpose processor, a digital signal processor (DSP), an application-specific integrated circuit (ASIC), a field-programmable gate array (FPGA) or other programmable logic device, a discrete gate or transistor logic device, or a discrete hardware component. The general-purpose processor may be a microprocessor or any conventional processor.

For the method of the foregoing embodiment, it may exist in the form of a computer program. Therefore, this application proposes a computer storage medium. Please refer to FIG. 8. FIG. 8 is a schematic structural diagram of an embodiment of the computer storage medium of the present application. The computer storage medium 200 of this embodiment stores a computer program 21, which can be executed to implement the method in the foregoing embodiment.

The computer storage medium 200 of this embodiment may be a U disk, a mobile hard disk, a read-only memory (ROM), a random access memory (RAM), a magnetic disk, an optical disk, or another medium that can store program instructions. It may also be a server storing the program instructions; the server may send the stored program instructions to other devices to run, or it may run the stored program instructions itself.

In the several embodiments provided in this application, it should be understood that the disclosed method and device can be implemented in other ways. The device implementation described above is merely illustrative. For example, the division of modules or units is only a logical function division, and there may be other divisions in actual implementation: multiple units or components can be combined or integrated into another system, or some features can be ignored or not implemented. In addition, the displayed or discussed mutual coupling, direct coupling, or communication connection may be an indirect coupling or communication connection through some interfaces, devices, or units, and may be in electrical, mechanical, or other forms.

The units described as separate components may or may not be physically separate, and the components displayed as units may or may not be physical units, that is, they may be located in one place, or they may be distributed on multiple network units. Some or all of the units may be selected according to actual needs to achieve the objectives of the solutions of this embodiment.

In addition, the functional units in each embodiment of the present application may be integrated into one processing unit, or each unit may exist alone physically, or two or more units may be integrated into one unit. The above-mentioned integrated unit can be implemented in the form of hardware or software functional unit.

If the integrated unit is implemented in the form of a software functional unit and sold or used as an independent product, it can be stored in a computer-readable storage medium. Based on this understanding, the technical solution of this application, in essence, or the part that contributes to the existing technology, or all or part of the technical solution, can be embodied in the form of a software product. The computer software product is stored in a storage medium and includes several instructions that cause a computer device (which can be a personal computer, a server, a network device, etc.) or a processor to execute all or part of the steps of the methods in the various embodiments of the present application. The aforementioned storage media include a U disk, a mobile hard disk, a read-only memory (ROM), a random access memory (RAM), a magnetic disk, an optical disk, and other media that can store program code.

The above are only implementations of this application and do not limit its scope. Any equivalent structure or equivalent process transformation made using the description and drawings of this application, or any direct or indirect application in other related technical fields, is likewise included in the scope of patent protection of this application.

Claims

A wake-up response method for a voice recognition device, wherein multiple voice recognition devices form a regional network, and the multiple voice recognition devices are divided into a central device and at least one non-central device; the wake-up response method includes:

The hub device analyzes the collected voice signals to obtain distance information of the hub device; the distance information of the hub device indicates the distance between the hub device and the signal source of the voice signal;

Receiving distance information of the non-central device, where the distance information of the non-central device is obtained by analyzing the collected voice signal by the non-central device, and represents the distance between the non-central device and the signal source;

Comparing the distance information of the central device with the distance information of the non-central device;

A voice recognition device to be responded to is determined, and the voice recognition device to be responded is a voice recognition device that responds to the voice signal in the local area network.
The wake-up response method according to claim 1, wherein the comparing the distance information of the central device with the distance information of the non-central device to determine the voice recognition device to be responded to comprises:

Comparing the distance value of the distance information of the hub device with the distance value of the distance information of the non-central device to obtain the distance information with the smallest distance value;

It is determined that the voice recognition device corresponding to the distance information with the smallest distance value is the voice recognition device to be responded.
The wake-up response method according to claim 2, wherein the multiple voice recognition devices have a wake-up priority; the determining that the voice recognition device corresponding to the distance information with the smallest distance value is the voice recognition device to be responded includes:

Among the voice recognition devices corresponding to the distance information with the smallest distance value, the voice recognition device with the highest wake-up priority is determined as the voice recognition device to be responded.
The wake-up response method according to claim 1, wherein the wake-up response method further comprises:

The hub device sends a notification whether to respond to the voice signal to the non-central device.
The wake-up response method according to claim 1, wherein a plurality of the regional networks are connected to each other, and the multiple central devices in the regional networks are divided into a first central device and at least one second central device; the wake-up response method further includes:

The second hub device sends second distance information to the first hub device, so that the first hub device compares the second distance information with the first distance information, thereby determining the voice recognition device that responds to the voice signal;

The first distance information is the distance information of the voice recognition device to be responded in the local area network where the first hub device is located, and the second distance information is the distance information of the voice recognition device to be responded in the area network where the second hub device is located.
The wake-up response method according to claim 1, wherein a plurality of the regional networks are connected to each other, and the multiple central devices in the regional networks are divided into a first central device and at least one second central device; the wake-up response method further includes:

Receiving, by the first hub device, second distance information, where the second distance information is the distance information of the voice recognition device to be responded to in the regional network where the second hub device is located;

The second distance information and the first distance information are compared to determine a voice recognition device that responds to the voice signal, and the first distance information is the distance information of the voice recognition device to be responded to in the area network where the first hub device is located.
The wake-up response method according to claim 5 or 6, wherein the comparing the second distance information with the first distance information to determine a voice recognition device that responds to the voice signal comprises:

Comparing the distance value of the first distance information with the distance value of the second distance information to obtain the distance information with the smallest distance value;

It is determined that the voice recognition device corresponding to the distance information with the smallest distance value responds to the voice signal.
The wake-up response method according to claim 7, wherein the multiple voice recognition devices have a wake-up priority; the determining that the voice recognition device corresponding to the distance information with the smallest distance value responds to the voice signal comprises:

Among the voice recognition devices corresponding to the distance information with the smallest distance value, it is determined that the voice recognition device with the highest wake-up priority responds to the voice signal.
The wake-up response method of claim 5 or 6, wherein the wake-up response method further comprises:

The first hub device sends a notification whether to respond to the voice signal to other voice recognition devices in the multiple area networks.
The wake-up response method according to any one of claims 1-6, wherein the distance information of the central device and the distance information of the non-central device are collectively referred to as distance information; the analyzing of the collected voice signal to obtain the distance information includes:

Collecting the voice signals through at least three microphones respectively, and the relative positions of the at least three microphones on the voice recognition device are fixed;

The distance value of the distance information is calculated according to the relative positions of the at least three microphones and the voice signals collected respectively.
The wake-up response method according to claim 10, wherein the calculating the distance value of the distance information according to the relative positions of the at least three microphones and the voice signals collected respectively includes:

Calculating the relative positions of the at least three microphones and the signal source according to the voice signals collected by the at least three microphones;

The distance value of the distance information is calculated according to the relative position of the at least three microphones and the signal source, and the relative position between the at least three microphones.
The wake-up response method according to claim 11, wherein the calculating the relative positions of the at least three microphones and the signal source according to the voice signals collected by the at least three microphones respectively comprises:

The DOA algorithm is used to calculate the voice signals respectively collected by the three microphones of the linear array, and the relative positions of every two adjacent microphones and the signal source are obtained.
A wake-up response method for a voice recognition device, wherein multiple voice recognition devices form a regional network, and the multiple voice recognition devices are divided into a central device and at least one non-central device; the wake-up response method includes:

The non-central device analyzes the collected voice signal to obtain the distance information of the non-central device; the distance information of the non-central device indicates the distance between the non-central device and the signal source of the voice signal;

Sending the distance information of the non-central device to the central device, so that the central device compares the distance information of the non-central device with the distance information of the central device to determine the voice recognition device to be responded;

The distance information of the hub device indicates the distance between the hub device and the signal source of the voice signal, and the voice recognition device to be responded is a voice recognition device in the local area network that responds to the voice signal.
The wake-up response method of claim 13, wherein the central device compares the distance information of the central device with the distance information of the non-central device to determine the voice recognition device to be responded to, comprising:

The hub device compares the distance value of the distance information of the hub device with the distance value of the distance information of the non-central device to obtain the distance information with the smallest distance value;

It is determined that the voice recognition device corresponding to the distance information with the smallest distance value is the voice recognition device to be responded.
The wake-up response method according to claim 14, wherein the plurality of voice recognition devices have a wake-up priority; the determining that the voice recognition device corresponding to the distance information with the smallest distance value is the voice recognition device to be responded includes:

Among the voice recognition devices corresponding to the distance information with the smallest distance value, the voice recognition device with the highest wake-up priority is determined as the voice recognition device to be responded.
The wake-up response method according to claim 13, wherein the wake-up response method further comprises:

Receiving a notification sent by the hub device whether to respond to the voice signal.
The wake-up response method according to any one of claims 13-16, wherein the distance information of the central device and the distance information of the non-central device are collectively referred to as distance information; the analyzing of the collected voice signal to obtain the distance information includes:

Collecting the voice signals through at least three microphones respectively, and the relative positions of the at least three microphones on the voice recognition device are fixed;

The distance value of the distance information is calculated according to the relative positions of the at least three microphones and the voice signals collected respectively.
The wake-up response method according to claim 17, wherein the calculating the distance value of the distance information according to the relative positions of the at least three microphones and the voice signals collected respectively comprises:

Calculating the relative positions of the at least three microphones and the signal source according to the voice signals collected by the at least three microphones;

The distance value of the distance information is calculated according to the relative position of the at least three microphones and the signal source, and the relative position between the at least three microphones.
The wake-up response method according to claim 18, wherein the calculating the relative positions of the at least three microphones and the signal source according to the voice signals collected by the at least three microphones respectively comprises:

The DOA algorithm is used to calculate the voice signals respectively collected by the three microphones of the linear array, and the relative positions of every two adjacent microphones and the signal source are obtained.
A voice recognition device, characterized in that the voice recognition device includes a processor and a memory; a computer program is stored in the memory, and the processor is used to execute the computer program to implement the steps of the method according to any one of claims 1-19.
The voice recognition device according to claim 20, wherein the voice recognition device comprises at least three microphones with fixed relative positions.
A computer storage medium, wherein the computer storage medium stores a computer program, and the computer program is executed to implement the steps of the method according to any one of claims 1-19.