WO2020215741A1 - Speech recognition device and wake-up response method therefor, and computer storage medium - Google Patents

Speech recognition device and wake-up response method therefor, and computer storage medium Download PDF

Info

Publication number
WO2020215741A1
WO2020215741A1 PCT/CN2019/124117 CN2019124117W WO2020215741A1 WO 2020215741 A1 WO2020215741 A1 WO 2020215741A1 CN 2019124117 W CN2019124117 W CN 2019124117W WO 2020215741 A1 WO2020215741 A1 WO 2020215741A1
Authority
WO
WIPO (PCT)
Prior art keywords
distance information
voice recognition
wake
voice
recognition device
Prior art date
Application number
PCT/CN2019/124117
Other languages
French (fr)
Chinese (zh)
Inventor
何瑞澄
Original Assignee
广东美的白色家电技术创新中心有限公司
美的集团股份有限公司
Priority date (The priority date is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the date listed.)
Filing date
Publication date
Application filed by 广东美的白色家电技术创新中心有限公司, 美的集团股份有限公司 filed Critical 广东美的白色家电技术创新中心有限公司
Publication of WO2020215741A1 publication Critical patent/WO2020215741A1/en

Links

Images

Classifications

    • GPHYSICS
    • G10MUSICAL INSTRUMENTS; ACOUSTICS
    • G10LSPEECH ANALYSIS OR SYNTHESIS; SPEECH RECOGNITION; SPEECH OR VOICE PROCESSING; SPEECH OR AUDIO CODING OR DECODING
    • G10L15/00Speech recognition
    • G10L15/22Procedures used during a speech recognition process, e.g. man-machine dialogue
    • GPHYSICS
    • G10MUSICAL INSTRUMENTS; ACOUSTICS
    • G10LSPEECH ANALYSIS OR SYNTHESIS; SPEECH RECOGNITION; SPEECH OR VOICE PROCESSING; SPEECH OR AUDIO CODING OR DECODING
    • G10L21/00Processing of the speech or voice signal to produce another audible or non-audible signal, e.g. visual or tactile, in order to modify its quality or its intelligibility
    • G10L21/02Speech enhancement, e.g. noise reduction or echo cancellation
    • G10L21/0208Noise filtering
    • G10L21/0216Noise filtering characterised by the method used for estimating noise
    • HELECTRICITY
    • H04ELECTRIC COMMUNICATION TECHNIQUE
    • H04LTRANSMISSION OF DIGITAL INFORMATION, e.g. TELEGRAPHIC COMMUNICATION
    • H04L12/00Data switching networks
    • H04L12/28Data switching networks characterised by path configuration, e.g. LAN [Local Area Networks] or WAN [Wide Area Networks]
    • GPHYSICS
    • G10MUSICAL INSTRUMENTS; ACOUSTICS
    • G10LSPEECH ANALYSIS OR SYNTHESIS; SPEECH RECOGNITION; SPEECH OR VOICE PROCESSING; SPEECH OR AUDIO CODING OR DECODING
    • G10L15/00Speech recognition
    • G10L15/22Procedures used during a speech recognition process, e.g. man-machine dialogue
    • G10L2015/223Execution procedure of a spoken command
    • GPHYSICS
    • G10MUSICAL INSTRUMENTS; ACOUSTICS
    • G10LSPEECH ANALYSIS OR SYNTHESIS; SPEECH RECOGNITION; SPEECH OR VOICE PROCESSING; SPEECH OR AUDIO CODING OR DECODING
    • G10L21/00Processing of the speech or voice signal to produce another audible or non-audible signal, e.g. visual or tactile, in order to modify its quality or its intelligibility
    • G10L21/02Speech enhancement, e.g. noise reduction or echo cancellation
    • G10L21/0208Noise filtering
    • G10L21/0216Noise filtering characterised by the method used for estimating noise
    • G10L2021/02161Number of inputs available containing the signal or the noise to be suppressed
    • G10L2021/02166Microphone arrays; Beamforming

Definitions

  • This application relates to the field of voice wake-up, in particular to a voice recognition device and its wake-up response method, and computer storage media.
  • the present application provides a wake-up response method for a voice recognition device, a voice recognition device, and a computer storage medium, so as to solve the mutual interference problem caused by multiple voice recognition devices responding to the wake-up voice at the same time in the prior art.
  • this application provides a wake-up response method for voice recognition devices.
  • Multiple voice recognition devices form a network.
  • the multiple voice recognition devices are divided into a central device and at least one non-central device;
  • the wake-up response method includes: central The device analyzes the collected voice signals to obtain the distance information of the central device; the distance information of the central device indicates the distance between the central device and the signal source of the voice signal; the distance information of the non-central device is received, and the distance information of the non-central device is determined by the non-central device Obtained by analyzing the collected voice signals, indicating the distance between the non-central device and the signal source; comparing the distance information of the central device and the distance information of the non-central device; determining the voice recognition device to be responded, and the voice recognition device to be responded to the voice response in the regional network Signal voice recognition equipment.
  • this application provides a wake-up response method for a voice recognition device.
  • Multiple voice recognition devices form a regional network.
  • the multiple voice recognition devices are divided into a central device and at least one non-central device;
  • the wake-up response method includes:
  • the non-central device analyzes the collected voice signals to obtain the distance information of the non-central device;
  • the distance information of the non-central device indicates the distance between the non-central device and the signal source of the voice signal; and sends the distance information of the non-central device to the central device
  • the hub device compares the distance information of the non-central device with the distance information of the hub device to determine the voice recognition device to be responded;
  • the distance information of the hub device indicates the distance between the hub device and the source of the voice signal, and the voice recognition device to be responded is the area Voice recognition equipment that responds to voice signals in the network.
  • the present application provides a voice recognition device, which includes a processor and a memory, a computer program is stored in the memory, and the processor is used to execute the computer program to implement the steps of the wake-up response method.
  • this application provides a computer storage medium in which a computer program is stored, and when the computer program is executed, the steps of the above wake-up response method are realized.
  • multiple voice recognition devices form a network, where the voice recognition device determines the distance information from the signal source of the voice signal by analyzing the collected voice signal.
  • the multiple voice recognition devices are divided into a central device and at least one non-central device.
  • the hub device obtains its own distance information, and accepts the distance information of the non-central device; then compares its own distance information with the distance information of the non-central device to determine the voice recognition device to be responded to, which is the local area network Voice recognition equipment in response to voice signals.
  • the voice recognition devices forming the network do not respond temporarily after being awakened by the voice signal.
  • the central device first determines which one should respond, so as to avoid the problem of mutual interference after multiple voice recognition devices respond.
  • Figure 1 is a schematic diagram of the structure of a network formed by interconnecting voice recognition devices of the present application
  • Figure 2 is a schematic flow diagram of the application of the wake-up response method of the voice recognition device of the present application in a single area network;
  • Figure 3 is a schematic diagram of the positional relationship between three microphones of a linear array and a signal source
  • FIG. 4 is a schematic flow diagram of the application of the wake-up response method of the voice recognition device of the present application in a multi-area network;
  • FIG. 5 is a schematic diagram of the work flow of the hub device side of the wake-up response method of the voice recognition device of this application;
  • FIG. 6 is a schematic diagram of the non-central device side work flow of the wake-up response method of the voice recognition device of this application;
  • FIG. 7 is a schematic structural diagram of an embodiment of a speech recognition device according to the present application.
  • Fig. 8 is a schematic structural diagram of an embodiment of a computer storage medium of the present application.
  • the wake-up response method of the present application is applied to the situation where multiple voice recognition devices can respond to the same voice signal.
  • voice recognition devices such as televisions, air conditioners, and refrigerators in the living room area; voice recognition devices such as refrigerators, microwave ovens, kettles, and rice cookers exist in the kitchen area.
  • voice recognition devices such as televisions, air conditioners, and refrigerators in the living room area
  • voice recognition devices such as refrigerators, microwave ovens, kettles, and rice cookers exist in the kitchen area.
  • the response sound of the household appliance A may be received and responded by the household appliance B, which may cause mutual interference between the household appliances and fail to respond to the user's needs normally.
  • the household appliance B may cause mutual interference between the household appliances and fail to respond to the user's needs normally.
  • both areas can receive the voice signal and respond to the voice signal, and the problem of mutual interference may also occur.
  • the speech recognition device of the present application it is a mode of waking up first and then responding, that is, being awakened by a voice signal sent by the user first, and then responding to the voice signal.
  • this application introduces a selection determination mechanism between wake-up and response, that is, after being awakened by a voice signal, it does not respond temporarily, and then responds when it is determined that a response is needed.
  • multiple voice recognition devices are connected to each other to form a regional network.
  • One voice recognition device is used as the hub device in the regional network.
  • the hub device determines which voice recognition device in the regional network responds to the regional network. voice signal.
  • the hub device of each area network first determines the voice recognition device to be responded to the voice signal in the area network. After that, a first hub device among all the hub devices determines the waiting voice recognition device in which area network. Respond to the voice recognition device to respond, thereby solving the problem of mutual interference caused by multiple voice recognition devices responding to voice signals.
  • the central device In the application of household appliances, since the central device needs to be able to respond to the user's voice signal at any time to determine the device that responds to the voice signal, it is generally selected to connect to the power source for a long time and basically not power off the household appliance; and the interactive screen is preferred.
  • the network hub device which facilitates related settings through the interactive screen.
  • the refrigerator serves as a central device.
  • each area such as the living room area and the home appliance in the kitchen area, can form an area network.
  • the area network corresponds to the division of areas. On the network connection, it does not necessarily form a separate area network, that is, it may be Home appliances in all areas of a family can be connected to each other to form a whole home appliance network.
  • the network constituted in this application includes, but is not limited to, a local area network composed of WIFI wireless network, a local area network composed of a wired network, a LAN composed of Bluetooth mesh, a local area network composed of zigbee, a local area network composed of RS485, a local area network composed of LoRa, a local area network composed of 1394, LAN composed of CAN and so on.
  • the communication mechanism of the formed network includes but is not limited to UDP, TCP/IP, HTTP, MQTT, CoAP, etc., to ensure that each voice recognition device on the same network can quickly and reliably exchange information.
  • the following describes the wake-up response method starting from the network formed by the voice recognition device.
  • FIG. 1 is a schematic diagram of the structure of a network formed by interconnecting voice recognition devices of this application.
  • the area in Figure 1 is divided into living room area A, kitchen area B, and bedroom area C; in living room area A, voice recognition equipment includes: refrigerator A1, TV A2, air purifier A3; in kitchen area B, voice recognition equipment includes: Range hood B1, rice cooker B2, wall breaker B3; in bedroom area C, voice recognition equipment includes: air conditioner C1, humidifier C2. All voice recognition devices are connected to form a network, and the voice recognition devices in each area also form a regional network.
  • the voice devices in each regional network are divided into a central device and at least one non-central device, and the central device determines the voice recognition device to respond to the voice signal in the local network.
  • the hub devices of all regional networks are further divided into a first hub device and at least one second hub device. The first hub device determines which voice recognition device in the regional network will respond to the voice signal.
  • voice devices in the local area network are not only divided into hub devices and non-central devices, but also have a wake-up priority.
  • the wake-up priority can be set by the manufacturer when the voice recognition device is shipped from the factory. After the network, the voice recognition device with the highest wake-up priority automatically serves as the central device of the regional network; the wake-up priority can also be set when the network is constructed, set by the user, or set by the service provider who builds the network; according to the set wake-up priority The voice recognition device with the highest wake-up priority is the central device of the network.
  • the priority of living room area A is A1>A2>A3
  • the priority of kitchen area B is B1>B2>B3
  • the priority of bedroom area C is C1>C2; where A1 , B1 and C1 respectively serve as the central equipment of their respective local area networks.
  • A1 , B1 and C1 respectively serve as the central equipment of their respective local area networks.
  • A1 is the first hub device
  • B1 and C1 are the second hub devices.
  • Figure 1 can realize wake-up response in a single area and wake-up response in multiple areas.
  • Figure 2 is a schematic flow diagram of the application of the wake-up response method of the voice recognition device of this application on a single area network
  • Figure 4 is a schematic flow diagram of the application of the wake-up response method of the voice recognition device of this application on a multi-area network .
  • the implementation of the wake-up response method in a single area network includes the following steps.
  • S201 The voice recognition device analyzes the collected voice signal to obtain distance information.
  • the voice recognition device mainly performs two actions, collection and analysis. After the user, the signal source, sends out the voice signal, the voice recognition device can collect the voice signal. Because each voice recognition device has a different relative position with the user, the voice signal it collects is also different. Among them, the voice recognition equipment far away from the user may not be able to collect voice signals even in the local area network.
  • the voice recognition devices analyze the voice signals collected by each.
  • all voice recognition devices in each regional network have the same voice signal analysis mechanism to facilitate subsequent comparison calculations.
  • the voice signal is analyzed and calculated to obtain distance information, which indicates the distance between the voice recognition device and the signal source of the voice signal.
  • the distance information includes the identification of the voice recognition device and the distance value used for judgment.
  • the distance value of the distance information may be determined according to the voice signals collected by at least three microphones. That is, at least three microphones are provided on the voice recognition device, and each microphone collects voice signals. Firstly, at least three microphones are used to separately collect voice signals, where the relative positions of the at least three microphones on the voice recognition device are fixed; then, the distance value of the distance information is calculated based on the relative positions of the at least three microphones and the voice signals respectively collected.
  • the relative positions of the at least three microphones and the signal source are calculated; according to the relative positions of the at least three microphones and the signal source, and the relative positions between the at least three microphones, Calculate the distance value of the distance information.
  • FIG. 3 is a schematic diagram of the positional relationship between the three microphones of the linear array and the signal source.
  • d is the relative distance between the microphones mic
  • the distance value l calculated above is the distance value of the distance information between the voice recognition device and the signal source.
  • the distance value of the distance information obtained by the device A1 is recorded as LA1
  • the distance value of the distance information obtained by the device A2 is recorded as LA2
  • the distance value of the distance information obtained by the device A3 is recorded as LA3.
  • the central hub device analyzes the collected voice signal to obtain the sound distance information of the central device; the non-central device analyzes the collected voice signal to obtain the distance information of the non-central device.
  • S202 The hub device receives the distance information of the non-central device.
  • the non-central device After the voice recognition device calculates and obtains the distance information, the non-central device sends the distance information obtained by itself to the central device.
  • the hub device A1 receives the distance information sent by the non-central device.
  • the hub device compares the distance information of the hub device with the distance information of the non-central device, and determines the voice recognition device to be responded.
  • the hub device compares the distance information of the hub device with the distance information of the non-central device, so as to determine the voice recognition device in the area network that responds to the voice signal.
  • the hub device uses a sorting algorithm to compare the distance values of the distance information, and obtains the sorting of the distance values of all the distance information, so as to obtain the distance information with the smallest distance value, that is, the voice recognition device that is closest to the signal source of the voice signal. The closer the distance, the larger the user may be the voice signal sent by the voice recognition device.
  • the voice recognition device corresponding to the distance information with the smallest distance value is the voice recognition device to be responded.
  • Sorting algorithms include, but are not limited to, insertion sort, Hill sort, selection sort, heap sort, bubble sort, quick sort, merge sort, computational sort, bucket sort, radix sort, etc.
  • order of the distance value of the distance information is LA2 ⁇ LA1 ⁇ LA3.
  • the device that responds to the voice signal is determined based on the wake-up priority of the voice recognition device.
  • the voice recognition devices corresponding to the distance information with the smallest distance value the one with the highest priority is determined as the voice recognition device to be responded.
  • the hub device sends a notification whether to respond to the voice signal to the non-central device.
  • the hub device After the hub device determines the voice recognition device to respond to the voice signal, it can send a notification of whether to respond to the voice signal to the non-central device, that is, to all voice recognition devices that have been awakened but have not responded to the voice signal through the network.
  • the notification may be a specific response or no response, and may also be device information of the determined voice recognition device that responds to the voice signal. It is also possible to only send a notification to the voice recognition device to be responded, and other voice recognition devices that have not received the notification do not respond, but those that receive the notification respond.
  • S205 The voice recognition device to be responded responds to the voice signal.
  • the identified voice recognition device can respond to the voice signal, while other voice recognition devices do not. It is ensured that only one voice recognition device responds to the voice signal without causing mutual interference.
  • the method shown in Figure 2 above is applied to the voice wake-up recognition of a single area network. After the voice recognition device in the single area network is awakened by voice information, it does not respond immediately, but after the central device of the single area network determines the responding device, Respond again.
  • a multi-area network is a plurality of interconnected area networks.
  • the hub devices of each area network are connected to each other. They are divided into a first hub device and at least one second hub device. Each area network determines its response After the voice recognition device, the first hub device further confirms the voice recognition device that responds to the voice signal.
  • the steps for implementing the wake-up response method for each regional network in the multi-regional network will not be repeated. Please also refer to FIG. 4.
  • the wake-up response method of the multi-regional network further includes the following steps.
  • S401 The second hub device sends second distance information to the first hub device, and the first hub device receives the second distance information.
  • the first hub device needs to compare the distance information of the voice recognition device to be responded to in all regional networks to determine the voice recognition device that responds to the voice signal.
  • the voice recognition device to be responded to is determined in a single regional network A voice recognition device that responds to voice signals; in the application of a multi-area network, the voice recognition device to be responded determined by a single regional network does not respond immediately; instead, the first central device receives multiple voice recognition
  • the recognition device confirms which one responds to the voice signal, that is, the final voice recognition device that responds to the voice signal is determined. Therefore, in this step S401, the second central device sends its second distance information to the first central device.
  • the second distance information is the distance information of the voice recognition device to be responded in the area where the second central device is located.
  • A1 compares LA1, LA2, and LA3 to determine that the voice recognition device to be responded is A2; in area B, B1 compares LB1, LB2, and LB3 to determine that the voice recognition device to respond is B3; in area C, C1 compares LC1 and LC2 to determine that the responding device is C1.
  • B1 sends the distance information LB3 of the voice recognition device B3 to be responded in its local area network to A1, and C1 also sends the distance information LC1 to A1, and the distance information of the voice recognition device A2 to be responded determined by A1 itself is LA2.
  • the first hub device compares the second distance information with the first distance information, and determines a voice recognition device that responds to the voice signal.
  • the first hub device compares the distance information of each voice recognition device to be responded, that is, the first distance information and the second distance information.
  • the first distance information is the distance information of the voice recognition device to be responded in the local network where the first hub device is located.
  • the comparison process of this step S402 is similar to the comparison process of the foregoing step S203, and the details are not repeated here. That is, the distance value of the first distance information and the distance value of the second distance information are compared to obtain the distance information with the smallest distance value; the voice recognition device corresponding to the distance information with the smallest distance value is determined to respond to the voice signal.
  • A1 compares LA2, LB3, and LC1; thereby determining the voice recognition device that responds to the voice signal, for example, B2.
  • the obtained distance information with the smallest distance value may have two or more.
  • the device that responds to the voice signal is further determined according to the wake-up priority of the voice recognition device, that is, the distance information with the smallest distance value corresponds to Among the voice recognition devices, the one with the highest priority is determined as the voice recognition device to be responded.
  • the first hub device sends a notification whether to respond to the voice signal to other voice recognition devices in the multi-area network.
  • the first hub device After the first hub device determines the voice recognition device that responds to the voice signal, it can directly send notifications to the entire network, that is, multiple regional networks, or it can first send notifications to hub devices in each regional network, and then each hub device can send notifications to non- The hub device sends a notification. Similarly, it can only be sent to the voice recognition device that responds to the voice signal, and other devices that have not received the notification will not respond.
  • S404 The determined voice recognition device responds to the voice signal.
  • This step S404 is similar to the above step S205, and will not be described again.
  • the method shown in Figure 4 is applied to multi-region voice wake-up recognition. After each region determines the voice device that should respond to the region, the first central device further determines which region’s voice device responds, so as to ensure that only A voice recognition device responds to voice signals.
  • the voice recognition device has a wake-up priority sequence, so when the highest priority voice recognition device fails, the next wake-up priority can be determined according to the wake-up priority sequence.
  • the voice recognition device serves as the hub device or the first hub device.
  • the voice recognition equipment For voice recognition equipment, it can periodically detect whether it has the highest wake-up priority in the local area network, or detect whether it has the highest wake-up priority when the local network changes; if it detects that it is the current local network The highest wake-up priority in, that is, in response to detecting that it is the highest wake-up priority in the local area network, it operates as a hub device.
  • the wake-up response method implemented in the network of this embodiment is based on the fact that the voice recognition device in the network has a wake-up priority order, and the voice recognition device as a network hub device can compare distance information. Therefore, the voice recognition device newly added to the network also needs to comply with the wake-up mechanism of this embodiment, which can be set by the hub device.
  • the hub device can obtain the device information of the voice recognition device joining the network. Analyze device information according to preset rules to re-order the voice recognition devices in the network to wake up priority.
  • Each voice recognition device is equipped with a voice recognition system, which determines the wake-up priority, voice recognition algorithm, wake-up template, etc. If the newly added voice recognition device has a different voice recognition system, that is, it has different wake-up priority settings, the network hub device can reorder according to its own wake-up priority settings. For example, in the network A1-A2-A3, the newly added voice recognition device A4, whose wake-up priority is set to be greater than A3, can reorder the wake-up priority as A1>A2>A4>A3.
  • the wake-up priority of the voice recognition device that joins the network first will be higher.
  • the newly added voice recognition device A3 has the same voice recognition system as the previous A3, the previous A3 is used as A31, the newly added one is used as A32, and the wake-up priority is reordered as A1>A2>A31>A32.
  • the voice recognition device can play two roles, one is to operate as a central device, and the other is to operate as a non-central device.
  • the voice recognition device can be used as a central device with more powerful functions; it can also be used as a non-central device with lighter weight.
  • a voice recognition system with more powerful functions can be loaded into it, so that it can be used as a central device; for small household appliances, such as rice cookers, electric kettles, etc.,
  • the voice recognition system with lightweight functions makes it only a non-central device.
  • FIG. 5 is a schematic diagram of the hub device side workflow of the wake-up response method of the voice recognition device of the present application.
  • its wake-up response method includes the following steps.
  • S501 Analyze the collected voice signal to obtain distance information of the central device.
  • this step S501 is completed in the above step S201, and the details will not be repeated.
  • S502 Receive distance information of a non-central device that is a non-central device.
  • This step S502 corresponds to the above step S202, and the details are not repeated here.
  • S503 Compare the distance information of the central device with the distance information of the non-central device, and determine the voice recognition device to be responded in the regional network.
  • This step S503 is similar to the above step S203, and the details will not be described in detail.
  • the above steps use the voice recognition device as the role of the central device to illustrate the steps in implementing the single-area wake-up response method.
  • the specific details of each step and the specific details of the operation of the central device have also been described above, so they will not be Repeat.
  • the voice recognition device of this embodiment can determine a voice recognition device that responds to the voice signal from multiple voice recognition devices, thereby avoiding the problem of mutual interference due to all responses.
  • the hub device is further divided into a first hub device and a second hub device.
  • the first hub device it further performs the following steps.
  • S504 The first hub device receives the second distance information.
  • This step S504 is completed in the above step S401, and the details are not repeated here.
  • S506 Compare the first distance information with the second distance information, and determine a voice recognition device that responds to the voice signal.
  • This step S506 is similar to the above step S402, and the details are not repeated here.
  • the second hub device For the second hub device, it performs the following steps.
  • the second hub device sends second distance information to the first hub device, so that the first hub device compares the first distance information with the second distance information, so as to determine a voice recognition device that responds to the voice signal.
  • This step S505 is completed in the above steps S401-S402, and the details are not repeated here.
  • the first hub device further determines which area network's to-be-responsive voice recognition device responds to the voice signal.
  • FIG. 6 is a schematic diagram of the non-central device side work flow of the voice recognition device wake-up response method of the present application.
  • the voice recognition device is a non-central device, and the wake-up response method of this embodiment includes the following steps.
  • S601 Analyze the collected voice signal to obtain distance information of the non-central device.
  • This step S601 is similar to the above step S201, both of which are obtaining distance information, and the specific process will not be repeated.
  • S602 Send the distance information of the non-central device to the central device, so that the central device compares the distance information of the non-central device with the distance information of the central device to determine the voice recognition device to be responded to.
  • a non-central device after collecting the voice signal, it does not respond to the voice signal immediately, but performs calculation and analysis to obtain distance information, and then transmits the distance information to the central device for analysis and comparison, and the central device confirms the response Voice recognition equipment for voice signals.
  • the role of the voice recognition device as a non-central device is used to illustrate the steps in implementing the wake-up response method.
  • the specific details of each step and the specific details of the operation of the non-central device have also been described above. Repeat it again.
  • the voice recognition device of this embodiment does not respond immediately after receiving the voice signal, but decides whether to respond after receiving the notification, which avoids the problem of mutual interference caused by simultaneous response with other voice recognition devices.
  • FIG. 7 is a schematic structural diagram of an embodiment of the voice recognition device of this application.
  • the voice recognition device 100 in this embodiment may be a household appliance. , which includes at least three microphones 11, a processor 12, and a memory 13 connected to each other.
  • the voice recognition device 100 of this embodiment can implement the above-mentioned wake-up response method embodiment.
  • at least three microphones 11 have a fixed relative position and are used to collect voice signals
  • a computer program is stored in the memory 13, and the processor 12 is used to execute the computer program to implement the above wake-up response method.
  • At least three microphones 11 are used to collect voice signals; the processor 12 is used to calculate the distance information between the voice recognition device and the signal source of the voice signal according to the relative positions of the at least three microphones and the voice signals collected respectively, And compare all distance information to determine the voice recognition device that responds to the voice signal; send notifications to other voice recognition devices whether they respond to the voice signal.
  • At least three microphones 11 are used to collect voice signals; the processor 12 is used to calculate the distance information between the voice recognition device and the signal source of the voice signal based on the relative positions of the at least three microphones and the voice signals collected separately, and calculate the distance
  • the information is sent to the central device, and it is determined whether to respond according to the received notification sent by the central device whether it responds to the voice signal.
  • the processor 12 may be an integrated circuit chip with signal processing capability.
  • the processor 12 may also be a general-purpose processor, a digital signal processor (DSP), an application specific integrated circuit (ASIC), an off-the-shelf programmable gate array (FPGA) or other programmable logic device, a discrete gate or transistor logic device, a discrete hardware component .
  • DSP digital signal processor
  • ASIC application specific integrated circuit
  • FPGA off-the-shelf programmable gate array
  • the general-purpose processor may be a microprocessor or the processor may also be any conventional processor or the like.
  • FIG. 8 is a schematic structural diagram of an embodiment of the computer storage medium of the present application.
  • the computer storage medium 200 of this embodiment stores a computer program 21, which can be executed to implement the method in the foregoing embodiment.
  • the computer storage medium 200 of this embodiment may be a U disk, a mobile hard disk, a read-only memory (ROM, Read-Only Memory), a random access memory (RAM, Random Access Memory), a magnetic disk or an optical disk, etc., which can store program instructions. Or it may also be a server storing the program instructions, and the server may send the stored program instructions to other devices to run, or it may run the stored program instructions by itself.
  • the disclosed method and device can be implemented in other ways.
  • the device implementation described above is merely illustrative, for example, the division of modules or units is only a logical function division, and there may be other divisions in actual implementation, for example, multiple units or components can be combined or It can be integrated into another system, or some features can be ignored or not implemented.
  • the displayed or discussed mutual coupling or direct coupling or communication connection may be indirect coupling or communication connection through some interfaces, devices or units, and may be in electrical, mechanical or other forms.
  • the units described as separate components may or may not be physically separate, and the components displayed as units may or may not be physical units, that is, they may be located in one place, or they may be distributed on multiple network units. Some or all of the units may be selected according to actual needs to achieve the objectives of the solutions of this embodiment.
  • each unit in each embodiment of the present application may be integrated into one processing unit, or each unit may exist alone physically, or two or more units may be integrated into one unit.
  • the above-mentioned integrated unit can be implemented in the form of hardware or software functional unit.
  • the integrated unit is implemented in the form of a software functional unit and sold or used as an independent product, it can be stored in a computer readable storage medium.
  • the technical solution of this application essentially or the part that contributes to the existing technology or all or part of the technical solution can be embodied in the form of a software product, and the computer software product is stored in a storage medium , Including several instructions to make a computer device (which can be a personal computer, a server, or a network device, etc.) or a processor execute all or part of the steps of the methods in the various embodiments of the present application.
  • the aforementioned storage media include: U disk, mobile hard disk, read-only memory (ROM, Read-Only Memory), random access memory (RAM, Random Access Memory), magnetic disk or optical disk and other media that can store program code .

Landscapes

  • Engineering & Computer Science (AREA)
  • Computational Linguistics (AREA)
  • Health & Medical Sciences (AREA)
  • Audiology, Speech & Language Pathology (AREA)
  • Human Computer Interaction (AREA)
  • Physics & Mathematics (AREA)
  • Acoustics & Sound (AREA)
  • Multimedia (AREA)
  • Signal Processing (AREA)
  • Quality & Reliability (AREA)
  • Computer Networks & Wireless Communication (AREA)
  • Telephonic Communication Services (AREA)

Abstract

A speech recognition device and a wake-up response method therefor, wherein a plurality of speech recognition devices form a regional network, and the plurality of speech recognition devices are divided into a central device and at least one non-central device. The wake-up response method comprises: the speech recognition devices analyzing a collected speech signal so as to obtain distance information (S201); the central device receiving the distance information of the non-central device (S202); the central device comparing the distance information of the central device to the distance information of the non-central device; determining a speech recognition device to send a response (S203), and the central device sending to the non-central device a notification for whether to respond to the speech signal (S204); and the speech recognition device to send a response responding to the speech signal (S205). One speech recognition device from a plurality of speech recognition devices that may respond to a speech signal may be determined to respond to the speech signal.

Description

语音识别设备及其唤醒响应方法、计算机存储介质Speech recognition equipment, wake-up response method thereof, and computer storage medium
本申请要求于2019年04月26日提交的申请号为2019103430447,发明名称为“语音识别设备及其唤醒响应方法、计算机存储介质”的中国专利申请的优先权,其通过引用方式全部并入本申请。This application claims the priority of the Chinese patent application filed on April 26, 2019 with the application number 2019103430447 and the invention title of "Voice Recognition Device and its Wake-up Response Method, Computer Storage Medium", which is incorporated into this by reference. Application.
【技术领域】【Technical Field】
本申请涉及语音唤醒领域,特别是涉及一种语音识别设备及其唤醒响应方法、计算机存储介质。This application relates to the field of voice wake-up, in particular to a voice recognition device and its wake-up response method, and computer storage media.
【背景技术】【Background technique】
语音识别,语音交互等技术已应用在多个领域,对于搭载了语音识别系统的设备一般在收到语音信号时会被唤醒后对语音信号进行响应。Technologies such as voice recognition and voice interaction have been applied in many fields. Devices equipped with voice recognition systems generally respond to voice signals after being awakened when they receive voice signals.
对于同一区域内的多个语音识别设备,可能出现同时被语音信号唤醒并响应的情况,而在一般的应用场景中,用户显然只会对一个语音识别设备进行唤醒,并且多个语音识别设备的同时唤醒并响应会导致多个语音识别设备之间相互干扰的问题,例如一个语音识别设备响应所述语音信号而播报的声音会被另一个语音识别设备接收并响应,反之亦然,即产生相互干扰的问题。For multiple voice recognition devices in the same area, it may be awakened by voice signals and respond at the same time. In general application scenarios, the user obviously only wakes up one voice recognition device, and multiple voice recognition devices are Waking up and responding at the same time will cause the problem of mutual interference between multiple voice recognition devices. For example, the sound broadcast by one voice recognition device in response to the voice signal will be received and responded by another voice recognition device, and vice versa, that is, mutual interference occurs. The problem of interference.
【发明内容】[Content of the invention]
本申请提供一种语音识别设备的唤醒响应方法、语音识别设备及计算机存储介质,以解决现有技术中多个语音识别设备同时响应唤醒语音,而造成的相互干扰问题。The present application provides a wake-up response method for a voice recognition device, a voice recognition device, and a computer storage medium, so as to solve the mutual interference problem caused by multiple voice recognition devices responding to the wake-up voice at the same time in the prior art.
为解决上述技术问题,本申请提供一种语音识别设备的唤醒响应方法,多个语音识别设备构成网络,多个语音识别设备分为一个中枢设备和至少一个非中枢设备;唤醒响应方法包括:中枢设备分析采集的语音信号,以获得中枢设备的距离信息;中枢设备的距离信息表示中枢设备与语音信号的信号源的距离;接收非中枢设备的距离信息,非中枢设备的距离信息由非中枢设备分析采集的语音信号而获得,表示非中枢设备与信号源的距离;比较中枢设备的距离信息和非中枢设备的距离信息;确定待响应语音识别设备,待响应语音识别设备为 区域网络中响应语音信号的语音识别设备。In order to solve the above technical problems, this application provides a wake-up response method for voice recognition devices. Multiple voice recognition devices form a network. The multiple voice recognition devices are divided into a central device and at least one non-central device; the wake-up response method includes: central The device analyzes the collected voice signals to obtain the distance information of the central device; the distance information of the central device indicates the distance between the central device and the signal source of the voice signal; the distance information of the non-central device is received, and the distance information of the non-central device is determined by the non-central device Obtained by analyzing the collected voice signals, indicating the distance between the non-central device and the signal source; comparing the distance information of the central device and the distance information of the non-central device; determining the voice recognition device to be responded, and the voice recognition device to be responded to the voice response in the regional network Signal voice recognition equipment.
为解决上述技术问题,本申请提供一种语音识别设备的唤醒响应方法,多个语音识别设备构成区域网络,多个语音识别设备分为一个中枢设备和至少一个非中枢设备;唤醒响应方法包括:非中枢设备分析采集的语音信号,以获得非中枢设备的距离信息;非中枢设备的距离信息表示非中枢设备与所述语音信号的信号源的距离;向中枢设备发送非中枢设备的距离信息,以由中枢设备比较非中枢设备的距离信息和中枢设备的距离信息,来确定待响应语音识别设备;中枢设备的距离信息表示中枢设备与语音信号的信号源的距离,待响应语音识别设备为区域网络中响应语音信号的语音识别设备。In order to solve the above technical problems, this application provides a wake-up response method for a voice recognition device. Multiple voice recognition devices form a regional network. The multiple voice recognition devices are divided into a central device and at least one non-central device; the wake-up response method includes: The non-central device analyzes the collected voice signals to obtain the distance information of the non-central device; the distance information of the non-central device indicates the distance between the non-central device and the signal source of the voice signal; and sends the distance information of the non-central device to the central device, The hub device compares the distance information of the non-central device with the distance information of the hub device to determine the voice recognition device to be responded; the distance information of the hub device indicates the distance between the hub device and the source of the voice signal, and the voice recognition device to be responded is the area Voice recognition equipment that responds to voice signals in the network.
为解决上述技术问题,本申请提供一种语音识别设备,其包括处理器和存储器,存储器中存储有计算机程序,处理器用于执行计算机程序以实现唤醒响应方法的步骤。In order to solve the above technical problems, the present application provides a voice recognition device, which includes a processor and a memory, a computer program is stored in the memory, and the processor is used to execute the computer program to implement the steps of the wake-up response method.
为解决上述技术问题,本申请提供一种计算机存储介质,其中存储有计算机程序,计算机程序被执行时实现上述唤醒响应方法的步骤。In order to solve the above technical problems, this application provides a computer storage medium in which a computer program is stored, and when the computer program is executed, the steps of the above wake-up response method are realized.
本申请唤醒响应方法中多个语音识别设备构成网络,其中,语音识别设备通过分析采集的语音信号,来确定其与语音信号的信号源的距离信息。多个语音识别设备分为一个中枢设备和至少一个非中枢设备。中枢设备获取其自身的距离信息,并接受非中枢设备的距离信息;然后比较自身的距离信息和非中枢设备的距离信息,从而确定待响应语音识别设备,该待响应语音识别设备即本区域网络中响应语音信号的语音识别设备。本申请中对于构成网络的语音识别设备,在被语音信号唤醒后,暂时不响应,先由中枢设备来确定该由哪个进行响应,从而避免多个语音识别设备均响应后相互干扰的问题。In the wake-up response method of the present application, multiple voice recognition devices form a network, where the voice recognition device determines the distance information from the signal source of the voice signal by analyzing the collected voice signal. The multiple voice recognition devices are divided into a central device and at least one non-central device. The hub device obtains its own distance information, and accepts the distance information of the non-central device; then compares its own distance information with the distance information of the non-central device to determine the voice recognition device to be responded to, which is the local area network Voice recognition equipment in response to voice signals. In the present application, the voice recognition devices forming the network do not respond temporarily after being awakened by the voice signal. The central device first determines which one should respond, so as to avoid the problem of mutual interference after multiple voice recognition devices respond.
【附图说明】【Explanation of drawings】
图1是本申请语音识别设备相互连接所构成网络的结构示意图;Figure 1 is a schematic diagram of the structure of a network formed by interconnecting voice recognition devices of the present application;
图2是本申请语音识别设备的唤醒响应方法应用在单区域网络的流程示意图;Figure 2 is a schematic flow diagram of the application of the wake-up response method of the voice recognition device of the present application in a single area network;
图3是线性阵列的三个麦克风与信号源的位置关系示意图;Figure 3 is a schematic diagram of the positional relationship between three microphones of a linear array and a signal source;
图4是本申请语音识别设备的唤醒响应方法应用在多区域网络的流程示意图;FIG. 4 is a schematic flow diagram of the application of the wake-up response method of the voice recognition device of the present application in a multi-area network;
图5是本申请语音识别设备的唤醒响应方法的中枢设备端工作流程示意图;FIG. 5 is a schematic diagram of the work flow of the hub device side of the wake-up response method of the voice recognition device of this application;
图6是本申请语音识别设备的唤醒响应方法的非中枢设备端工作流程示意图;FIG. 6 is a schematic diagram of the non-central device side work flow of the wake-up response method of the voice recognition device of this application;
图7是本申请语音识别设备一实施例的结构示意图;FIG. 7 is a schematic structural diagram of an embodiment of a speech recognition device according to the present application;
图8是本申请计算机存储介质一实施例的结构示意图。Fig. 8 is a schematic structural diagram of an embodiment of a computer storage medium of the present application.
【具体实施方式】【Detailed ways】
为使本领域的技术人员更好地理解本发明的技术方案,下面结合附图和具体实施方式对本申请所提供的一种语音识别设备的唤醒响应方法、语音识别设备及计算机存储介质做进一步详细描述。In order to enable those skilled in the art to better understand the technical solutions of the present invention, a wake-up response method for a voice recognition device, voice recognition device and computer storage medium provided in this application will be further detailed below in conjunction with the accompanying drawings and specific implementations. description.
本申请唤醒响应方法应用于多个语音识别设备均可对同一语音信号进行响应的情况,对于这种情况,以家电领域为例,在同一区域或多个相邻区域存在多个家电设备,其中家电设备均具有语音识别功能,即作为语音识别设备。例如客厅区域存在电视机、空调、冰箱等语音识别设备;厨房区域存在冰箱、微波炉、热水壶、电饭煲等语音识别设备。当用户在客厅区域发出语音信号时,由于声音传播特性,在客厅区域内的多个家电设备均可能接收到该语音信号,并对该语音信号进行响应,此时则会出现多个家电设备均进行回应的情况,在该情况下,A家电设备回应的声音可能又被B家电设备接收并响应,继而导致家电设备之间相互干扰,而无法正常回应用户的需求。还例如当用户在客厅区域和厨房区域之间发出语音信号时,两个区域均可接收到语音信号,并对该语音信号进行响应,也会出现相互干扰的问题。The wake-up response method of the present application is applied to the situation where multiple voice recognition devices can respond to the same voice signal. In this case, taking the field of home appliances as an example, there are multiple home appliances in the same area or in multiple adjacent areas, where All household appliances have a voice recognition function, that is, as a voice recognition device. For example, there are voice recognition devices such as televisions, air conditioners, and refrigerators in the living room area; voice recognition devices such as refrigerators, microwave ovens, kettles, and rice cookers exist in the kitchen area. When the user sends out a voice signal in the living room area, due to the sound propagation characteristics, multiple household appliances in the living room area may receive the voice signal and respond to the voice signal. At this time, multiple household appliances In the case of responding, in this case, the response sound of the household appliance A may be received and responded by the household appliance B, which may cause mutual interference between the household appliances and fail to respond to the user's needs normally. For example, when a user sends a voice signal between the living room area and the kitchen area, both areas can receive the voice signal and respond to the voice signal, and the problem of mutual interference may also occur.
对于本申请语音识别设备来说,为先唤醒后响应的模式,即先被用户发出的语音信号唤醒,然后再对该语音信号进行响应回复。对此,本申请在唤醒和响应之间引入选择确定机制,即在被语音信号唤醒后,暂时不响应,在确定需要响应时再回复。For the speech recognition device of the present application, it is a mode of waking up first and then responding, that is, being awakened by a voice signal sent by the user first, and then responding to the voice signal. In this regard, this application introduces a selection determination mechanism between wake-up and response, that is, after being awakened by a voice signal, it does not respond temporarily, and then responds when it is determined that a response is needed.
具体来说对于单个区域,将多个语音识别设备相互连接构成区域网络,其中一个语音识别设备作为该区域网络中的中枢设备,由中枢设备来确定本区域网络中由哪个语音识别设备来响应该语音信号。Specifically, for a single area, multiple voice recognition devices are connected to each other to form a regional network. One voice recognition device is used as the hub device in the regional network. The hub device determines which voice recognition device in the regional network responds to the regional network. voice signal.
对于多个区域,首先每个区域网络的中枢设备确定本区域网络中响应语音信号的待响应语音识别设备,此后,再由所有中枢设备中一个第一中枢设备来 确定由哪个区域网络中的待响应语音识别设备来响应,从而解决多个语音识别设备均响应语音信号而造成相互干扰的问题。For multiple areas, the hub device of each area network first determines the voice recognition device to be responded to the voice signal in the area network. After that, a first hub device among all the hub devices determines the waiting voice recognition device in which area network. Respond to the voice recognition device to respond, thereby solving the problem of mutual interference caused by multiple voice recognition devices responding to voice signals.
在家电领域的应用中,由于中枢设备需要随时能够应对用户的语音信号,以确定响应语音信号的设备,因此一般选择长时间连接电源,基本不会断电的家电设备;且优先选择具有交互屏幕的家电设备来作为网络中枢设备,方便通过交互屏幕进行相关设置。例如,冰箱作为中枢设备。In the application of household appliances, since the central device needs to be able to respond to the user's voice signal at any time to determine the device that responds to the voice signal, it is generally selected to connect to the power source for a long time and basically not power off the household appliance; and the interactive screen is preferred. Of home appliances as the network hub device, which facilitates related settings through the interactive screen. For example, the refrigerator serves as a central device.
一般来说,每个区域,例如客厅区域、厨房区域中的家电设备均可分别构成区域网络,该区域网络对应于区域的划分,在网络连接上,不一定构成单独的区域网络,即可能在一个家庭中所有区域的家电设备可相互连接构成整体的家电设备网络。Generally speaking, each area, such as the living room area and the home appliance in the kitchen area, can form an area network. The area network corresponds to the division of areas. On the network connection, it does not necessarily form a separate area network, that is, it may be Home appliances in all areas of a family can be connected to each other to form a whole home appliance network.
本申请中所构成的网络包括并不仅限于WIFI无线网络组成的局域网、有线网络组成的局域网、蓝牙mesh组成的局域网、zigbee组成的局域网、RS485组成的局域网、LoRa组成的局域网、1394组成的局域网、CAN组成的局域网等等。所构成网络的通讯机制包括并不仅限于UDP、TCP/IP、HTTP、MQTT、CoAP等等,确保同一网络的每个语音识别设备能够快速和可靠地进行信息交互。The network constituted in this application includes, but is not limited to, a local area network composed of WIFI wireless network, a local area network composed of a wired network, a LAN composed of Bluetooth mesh, a local area network composed of zigbee, a local area network composed of RS485, a local area network composed of LoRa, a local area network composed of 1394, LAN composed of CAN and so on. The communication mechanism of the formed network includes but is not limited to UDP, TCP/IP, HTTP, MQTT, CoAP, etc., to ensure that each voice recognition device on the same network can quickly and reliably exchange information.
对于本申请的唤醒响应方法,下面从语音识别设备所构成的网络出发,对唤醒响应方法进行说明。With regard to the wake-up response method of the present application, the following describes the wake-up response method starting from the network formed by the voice recognition device.
请参阅图1,图1是本申请语音识别设备相互连接所构成网络的结构示意图。图1中区域划分为客厅区域A、厨房区域B、卧室区域C;在客厅区域A,语音识别设备包括:冰箱A1、电视机A2、空气净化器A3;在厨房区域B,语音识别设备包括:抽油烟机B1、电饭煲B2、破壁机B3;在卧室区域C,语音识别设备包括:空调C1、加湿器C2。所有的语音识别设备连接构成网络,每个区域中的语音识别设备也构成区域网络。Please refer to FIG. 1, which is a schematic diagram of the structure of a network formed by interconnecting voice recognition devices of this application. The area in Figure 1 is divided into living room area A, kitchen area B, and bedroom area C; in living room area A, voice recognition equipment includes: refrigerator A1, TV A2, air purifier A3; in kitchen area B, voice recognition equipment includes: Range hood B1, rice cooker B2, wall breaker B3; in bedroom area C, voice recognition equipment includes: air conditioner C1, humidifier C2. All voice recognition devices are connected to form a network, and the voice recognition devices in each area also form a regional network.
每个区域网络中的语音设备分为一个中枢设备和至少一个非中枢设备,由中枢设备确定本区域网络中响应语音信号的待响应语音识别设备。而所有区域网络的中枢设备又分为一个第一中枢设备和至少一个第二中枢设备,由第一中枢设备来确定具体由哪个区域网络中的待响应语音识别设备来响应语音信号。The voice devices in each regional network are divided into a central device and at least one non-central device, and the central device determines the voice recognition device to respond to the voice signal in the local network. The hub devices of all regional networks are further divided into a first hub device and at least one second hub device. The first hub device determines which voice recognition device in the regional network will respond to the voice signal.
在本申请一些实施例中,区域网络中的语音设备不仅仅分为中枢设备和非中枢设备,其还进一步具有唤醒优先级,唤醒优先级可由厂商在出厂语音识别设备时进行设置,在连接构成网络后,最高唤醒优先级的语音识别设备自动作为区域网络的中枢设备;唤醒优先级也可以在构建网络时设置,由用户自主设 置,或由搭建网络的服务商设置;根据所设置的唤醒优先级,最高唤醒优先级的语音识别设备作为网络的中枢设备。In some embodiments of this application, voice devices in the local area network are not only divided into hub devices and non-central devices, but also have a wake-up priority. The wake-up priority can be set by the manufacturer when the voice recognition device is shipped from the factory. After the network, the voice recognition device with the highest wake-up priority automatically serves as the central device of the regional network; the wake-up priority can also be set when the network is constructed, set by the user, or set by the service provider who builds the network; according to the set wake-up priority The voice recognition device with the highest wake-up priority is the central device of the network.
在图1所示网络中,客厅区域A的优先级排序为A1>A2>A3,厨房区域B的优先级排序为B1>B2>B3,卧室区域C的优先级排序为C1>C2;其中A1、B1、C1分别作为各自所在区域网络的中枢设备。各个区域网络的中枢设备之间也有优先级排序A1>B1>C1,本申请中,A1作为第一中枢设备,B1和C1作为第二中枢设备。In the network shown in Figure 1, the priority of living room area A is A1>A2>A3, the priority of kitchen area B is B1>B2>B3, and the priority of bedroom area C is C1>C2; where A1 , B1 and C1 respectively serve as the central equipment of their respective local area networks. There is also a priority ordering between the hub devices of each area network A1>B1>C1. In this application, A1 is the first hub device, and B1 and C1 are the second hub devices.
图1所示网络可实现在单区域内的唤醒响应,以及在多区域的唤醒响应。具体请参阅图2和图4,图2是本申请语音识别设备的唤醒响应方法应用在单区域网络的流程示意图,图4是本申请语音识别设备的唤醒响应方法应用在多区域网络的流程示意图。The network shown in Figure 1 can realize wake-up response in a single area and wake-up response in multiple areas. For details, please refer to Figures 2 and 4. Figure 2 is a schematic flow diagram of the application of the wake-up response method of the voice recognition device of this application on a single area network, and Figure 4 is a schematic flow diagram of the application of the wake-up response method of the voice recognition device of this application on a multi-area network .
如图2,对于单区域网络中唤醒响应方法的实现,包括以下步骤。As shown in Figure 2, the implementation of the wake-up response method in a single area network includes the following steps.
S201:语音识别设备分析采集的语音信号,获得距离信息。S201: The voice recognition device analyzes the collected voice signal to obtain distance information.
本步骤中语音识别设备主要进行两个动作,采集和分析。在用户即信号源发出语音信号后,语音识别设备均可对语音信号进行采集,每个语音识别设备由于与用户的相对位置不同,其所采集到的语音信号也不同。其中距离用户比较远的语音识别设备,虽然在区域网络中,也可能并不能采集到语音信号。In this step, the voice recognition device mainly performs two actions, collection and analysis. After the user, the signal source, sends out the voice signal, the voice recognition device can collect the voice signal. Because each voice recognition device has a different relative position with the user, the voice signal it collects is also different. Among them, the voice recognition equipment far away from the user may not be able to collect voice signals even in the local area network.
语音识别设备对各自所采集到的语音信号进行分析,本实施例每个区域网络中所有语音识别设备对语音信号的分析机制均是相同的,以便于后续的比较计算。对语音信号进行分析计算获得距离信息,距离信息表示了语音识别设备与该语音信号的信号源的距离。The voice recognition devices analyze the voice signals collected by each. In this embodiment, all voice recognition devices in each regional network have the same voice signal analysis mechanism to facilitate subsequent comparison calculations. The voice signal is analyzed and calculated to obtain distance information, which indicates the distance between the voice recognition device and the signal source of the voice signal.
由于需要根据距离信息来确定响应语音信号的待响应语音识别设备,因而距离信息中包括语音识别设备的标识,以及用于判断的距离值。Since it is necessary to determine the voice recognition device to respond to the voice signal based on the distance information, the distance information includes the identification of the voice recognition device and the distance value used for judgment.
本实施例中,距离信息的距离值可根据至少三个麦克风所采集的语音信号来确定。即在语音识别设备上设置有至少三个麦克风,每个麦克风均采集语音信号。首先通过至少三个麦克风分别采集语音信号,其中,至少三个麦克风在语音识别设备上的相对位置固定;然后根据至少三个麦克风的相对位置及分别采集的语音信号,计算距离信息的距离值。In this embodiment, the distance value of the distance information may be determined according to the voice signals collected by at least three microphones. That is, at least three microphones are provided on the voice recognition device, and each microphone collects voice signals. Firstly, at least three microphones are used to separately collect voice signals, where the relative positions of the at least three microphones on the voice recognition device are fixed; then, the distance value of the distance information is calculated based on the relative positions of the at least three microphones and the voice signals respectively collected.
具体来说,根据至少三个麦克风分别采集的语音信号,计算至少三个麦克风与信号源的相对方位;根据至少三个麦克风与信号源的相对方位,以及至少三个麦克风之间的相对位置,计算距离信息的距离值。Specifically, according to the voice signals collected by the at least three microphones, the relative positions of the at least three microphones and the signal source are calculated; according to the relative positions of the at least three microphones and the signal source, and the relative positions between the at least three microphones, Calculate the distance value of the distance information.
例如,若语音识别设备上具有线性阵列排布的三个麦克风,请参阅图3所示,图3是线性阵列的三个麦克风与信号源的位置关系示意图。For example, if the voice recognition device has three microphones arranged in a linear array, please refer to FIG. 3, which is a schematic diagram of the positional relationship between the three microphones of the linear array and the signal source.
具体计算,首先采用DOA算法,计算每两相邻麦克风与信号源的相对方位;利用DOA算法计算mic1和mic2的语音信号,获得相对方位角θ 1;利用DOA算法计算mic2和mic3的语音信号,获得相对方位角θ 2For specific calculations, first use the DOA algorithm to calculate the relative position of every two adjacent microphones and the signal source; use the DOA algorithm to calculate the voice signals of mic1 and mic2 to obtain the relative azimuth angle θ 1 ; use the DOA algorithm to calculate the voice signals of mic2 and mic3, Obtain the relative azimuth angle θ 2 .
根据以下方程组计算得到mic2与信号源的距离值l。Calculate the distance l between mic2 and the signal source according to the following equations.
tanθ 1=h/(x+1.5d) tanθ 1 = h/(x+1.5d)
tanθ 2=h/(x+0.5d) tanθ 2 =h/(x+0.5d)
l=(h 2+(x+d) 2) 1/2 l=(h 2 +(x+d) 2 ) 1/2
其中,d即麦克风mic之间的相对距离,以上所算得的距离值l即为语音识别设备与信号源的距离信息的距离值。Among them, d is the relative distance between the microphones mic, and the distance value l calculated above is the distance value of the distance information between the voice recognition device and the signal source.
对于本实施例区域A中,设备A1所获得距离信息的距离值记为LA1,设备A2所获得距离信息的距离值记为LA2,设备A3所获得距离信息的距离值记为LA3。For area A in this embodiment, the distance value of the distance information obtained by the device A1 is recorded as LA1, the distance value of the distance information obtained by the device A2 is recorded as LA2, and the distance value of the distance information obtained by the device A3 is recorded as LA3.
本步骤S201中中枢设备分析采集的语音信号,获得中枢设备的响距离信息;而非中枢设备分析采集的语音信号,获得非中枢设备的距离信息。In this step S201, the central hub device analyzes the collected voice signal to obtain the sound distance information of the central device; the non-central device analyzes the collected voice signal to obtain the distance information of the non-central device.
S202:中枢设备接收非中枢设备的距离信息。S202: The hub device receives the distance information of the non-central device.
语音识别设备计算获得距离信息后,其中,非中枢设备将自身获得的距离信息发送至中枢设备。本实施例中,中枢设备A1接收到非中枢设备发送的距离信息。After the voice recognition device calculates and obtains the distance information, the non-central device sends the distance information obtained by itself to the central device. In this embodiment, the hub device A1 receives the distance information sent by the non-central device.
S203:中枢设备比较中枢设备的距离信息和非中枢设备的距离信息,确定待响应语音识别设备。S203: The hub device compares the distance information of the hub device with the distance information of the non-central device, and determines the voice recognition device to be responded.
本步骤中,中枢设备比较中枢设备的距离信息和非中枢设备的距离信息,从而确定区域网络中响应语音信号的待语音识别设备。具体来说,中枢设备采用排序算法来比较距离信息的距离值,获得所有距离信息的距离值的排序,从而得到距离值最小的距离信息,即表示距离语音信号的信号源最近的语音识别设备,距离越近表示用户越大可能是对该语音识别设备发出的语音信号。距离值最小的距离信息所对应的语音识别设备即为待响应语音识别设备。In this step, the hub device compares the distance information of the hub device with the distance information of the non-central device, so as to determine the voice recognition device in the area network that responds to the voice signal. Specifically, the hub device uses a sorting algorithm to compare the distance values of the distance information, and obtains the sorting of the distance values of all the distance information, so as to obtain the distance information with the smallest distance value, that is, the voice recognition device that is closest to the signal source of the voice signal. The closer the distance, the larger the user may be the voice signal sent by the voice recognition device. The voice recognition device corresponding to the distance information with the smallest distance value is the voice recognition device to be responded.
排序算法包括且不限于插入排序、希尔排序、选择排序、堆排序、冒泡排序、快速排序、归并排序、计算排序、桶排序、基数排序等等。本实施例对距离信息距离值的排序为LA2<LA1<LA3。Sorting algorithms include, but are not limited to, insertion sort, Hill sort, selection sort, heap sort, bubble sort, quick sort, merge sort, computational sort, bucket sort, radix sort, etc. In this embodiment, the order of the distance value of the distance information is LA2<LA1<LA3.
在对距离信息进行比较分析时,所得到的距离值最小的距离信息可能有两个甚至多个,此时,则进一步依据语音识别设备的唤醒优先级排序来确定响应语音信号的设备,即在距离值最小的距离信息对应的语音识别设备中,确定优先级最高的作为待响应语音识别设备。When the distance information is compared and analyzed, there may be two or more distance information with the smallest distance value. In this case, the device that responds to the voice signal is determined based on the wake-up priority of the voice recognition device. Among the voice recognition devices corresponding to the distance information with the smallest distance value, the one with the highest priority is determined as the voice recognition device to be responded.
S204:中枢设备向非中枢设备发送是否响应语音信号的通知。S204: The hub device sends a notification whether to respond to the voice signal to the non-central device.
中枢设备在确定响应语音信号的待响应语音识别设备后,则可通过网络向非中枢设备,即向区域网络中所有被唤醒但还未响应的语音识别设备发送是否响应该语音信号的通知,该通知可为具体的是响应或无需响应,也可为所确定的响应该语音信号的语音识别设备的设备信息。也可仅向待响应语音识别设备发送通知,其他未接到通知的语音识别设备不做响应,而接收到通知的则做响应。After the hub device determines the voice recognition device to respond to the voice signal, it can send a notification of whether to respond to the voice signal to the non-central device, that is, to all voice recognition devices that have been awakened but have not responded to the voice signal through the network. The notification may be a specific response or no response, and may also be device information of the determined voice recognition device that responds to the voice signal. It is also possible to only send a notification to the voice recognition device to be responded, and other voice recognition devices that have not received the notification do not respond, but those that receive the notification respond.
S205:待响应语音识别设备响应语音信号。S205: The voice recognition device to be responded responds to the voice signal.
所确定的语音识别设备即可响应语音信号,而其他的语音识别设备则不响应。保证了只有一个语音识别设备来响应该语音信号,而不会造成相互干扰的问题。The identified voice recognition device can respond to the voice signal, while other voice recognition devices do not. It is ensured that only one voice recognition device responds to the voice signal without causing mutual interference.
以上图2所示的方法应用于单区域网络的语音唤醒识别,单区域网络中语音识别设备被语音信息唤醒后,并不立即响应,而是由单区域网络的中枢设备确定响应的设备后,再做响应。The method shown in Figure 2 above is applied to the voice wake-up recognition of a single area network. After the voice recognition device in the single area network is awakened by voice information, it does not respond immediately, but after the central device of the single area network determines the responding device, Respond again.
多区域网络的唤醒响应方法的实现,基于图2所示单区域网络中待响应语音识别设备的确认。具体来说,多区域网络即多个相互连接的区域网络,每个区域网络的中枢设备相互连接,区分为一个第一中枢设备和至少一个第二中枢设备,在每个区域网络确定其待响应语音识别设备后,再由第一中枢设备进一步确认响应语音信号的语音识别设备。The realization of the wake-up response method of the multi-area network is based on the confirmation of the voice recognition device to be responded in the single-area network shown in Figure 2. Specifically, a multi-area network is a plurality of interconnected area networks. The hub devices of each area network are connected to each other. They are divided into a first hub device and at least one second hub device. Each area network determines its response After the voice recognition device, the first hub device further confirms the voice recognition device that responds to the voice signal.
多区域网络中每个区域网络实现唤醒响应方法的步骤不再赘述,另请参阅图4,多区域网络的唤醒响应方法还包括以下步骤。The steps for implementing the wake-up response method for each regional network in the multi-regional network will not be repeated. Please also refer to FIG. 4. The wake-up response method of the multi-regional network further includes the following steps.
S401:第二中枢设备向第一中枢设备发送第二距离信息,第一中枢设备接收第二距离信息。S401: The second hub device sends second distance information to the first hub device, and the first hub device receives the second distance information.
在多区域网络中,第一中枢设备需比较所有区域网络中待响应语音识别设备的距离信息,从而确定响应语音信号的语音识别设备,待响应语音识别设备为在单个区域网络中所判断出的响应语音信号的语音识别设备;而在多区域网络的应用中,单个区域网络所确定出的待响应语音识别设备,并不立刻进行响 应;而是由第一中枢设备再从多个待响应语音识别设备中确认由哪个来响应语音信号,即确定最终的响应语音信号的语音识别设备。因而本步骤S401中第二中枢设备将其第二距离信息发送给第一中枢设备,第二距离信息即第二中枢设备所在区域的待响应语音识别设备的距离信息。In a multi-area network, the first hub device needs to compare the distance information of the voice recognition device to be responded to in all regional networks to determine the voice recognition device that responds to the voice signal. The voice recognition device to be responded to is determined in a single regional network A voice recognition device that responds to voice signals; in the application of a multi-area network, the voice recognition device to be responded determined by a single regional network does not respond immediately; instead, the first central device receives multiple voice recognition The recognition device confirms which one responds to the voice signal, that is, the final voice recognition device that responds to the voice signal is determined. Therefore, in this step S401, the second central device sends its second distance information to the first central device. The second distance information is the distance information of the voice recognition device to be responded in the area where the second central device is located.
例如,区域A中,由A1比较LA1、LA2、LA3,确定待响应语音识别设备为A2;区域B中,由B1比较LB1、LB2、LB3,确定待响应语音识别设备为B3;区域C中,由C1比较LC1、LC2,确定待响应设备为C1。For example, in area A, A1 compares LA1, LA2, and LA3 to determine that the voice recognition device to be responded is A2; in area B, B1 compares LB1, LB2, and LB3 to determine that the voice recognition device to respond is B3; in area C, C1 compares LC1 and LC2 to determine that the responding device is C1.
B1将其所在区域网络的待响应语音识别设备B3的距离信息LB3发送给A1,C1也将距离信息LC1发送给A1,而A1自身所确定的待响应语音识别设备A2的距离信息为LA2。B1 sends the distance information LB3 of the voice recognition device B3 to be responded in its local area network to A1, and C1 also sends the distance information LC1 to A1, and the distance information of the voice recognition device A2 to be responded determined by A1 itself is LA2.
S402:第一中枢设备比较第二距离信息和第一距离信息,确定响应语音信号的语音识别设备。S402: The first hub device compares the second distance information with the first distance information, and determines a voice recognition device that responds to the voice signal.
第一中枢设备比较每个待响应语音识别设备的距离信息,即第一距离信息和第二距离信息,第一距离信息为第一中枢设备所在区域网络中的待响应语音识别设备的距离信息。The first hub device compares the distance information of each voice recognition device to be responded, that is, the first distance information and the second distance information. The first distance information is the distance information of the voice recognition device to be responded in the local network where the first hub device is located.
本步骤S402的比较过程与上述步骤S203的比较过程类似,具体不再赘述。即比较第一距离信息的距离值和第二距离信息的距离值,得到距离值最小的距离信息;确定距离值最小的距离信息对应的语音识别设备响应语音信号。The comparison process of this step S402 is similar to the comparison process of the foregoing step S203, and the details are not repeated here. That is, the distance value of the first distance information and the distance value of the second distance information are compared to obtain the distance information with the smallest distance value; the voice recognition device corresponding to the distance information with the smallest distance value is determined to respond to the voice signal.
本实施例中A1比较LA2、LB3、LC1;从而确定响应语音信号的语音识别设备,例如为B2。同样,所得到的距离值最小的距离信息可能有两个甚至多个,此时,则进一步依据语音识别设备的唤醒优先级排序来确定响应语音信号的设备,即在距离值最小的距离信息对应的语音识别设备中,确定优先级最高的作为待响应语音识别设备。In this embodiment, A1 compares LA2, LB3, and LC1; thereby determining the voice recognition device that responds to the voice signal, for example, B2. Similarly, the obtained distance information with the smallest distance value may have two or more. In this case, the device that responds to the voice signal is further determined according to the wake-up priority of the voice recognition device, that is, the distance information with the smallest distance value corresponds to Among the voice recognition devices, the one with the highest priority is determined as the voice recognition device to be responded.
S403:第一中枢设备向多区域网络中的其他语音识别设备发送是否响应语音信号的通知。S403: The first hub device sends a notification whether to respond to the voice signal to other voice recognition devices in the multi-area network.
第一中枢设备在确定响应语音信号的语音识别设备后,可直接向全网,即多个区域网络发送通知,或者也可首先向各个区域网络的中枢设备发送通知,再由各个中枢设备向非中枢设备发送通知。同样,也可仅发送给响应语音信号的语音识别设备,其他未接收到通知的不作响应。After the first hub device determines the voice recognition device that responds to the voice signal, it can directly send notifications to the entire network, that is, multiple regional networks, or it can first send notifications to hub devices in each regional network, and then each hub device can send notifications to non- The hub device sends a notification. Similarly, it can only be sent to the voice recognition device that responds to the voice signal, and other devices that have not received the notification will not respond.
S404:所确定的语音识别设备响应语音信号。S404: The determined voice recognition device responds to the voice signal.
本步骤S404与上述步骤S205类似,不再赘述。This step S404 is similar to the above step S205, and will not be described again.
图4所示的方法应用于多区域的语音唤醒识别,在每个区域确定本区域应响应的语音设备后,再由第一中枢设备来进一步确定由哪个区域的语音设备响应,从而保证仅有一个语音识别设备来响应语音信号。The method shown in Figure 4 is applied to multi-region voice wake-up recognition. After each region determines the voice device that should respond to the region, the first central device further determines which region’s voice device responds, so as to ensure that only A voice recognition device responds to voice signals.
在图2和图4所应用的网络中,语音识别设备具有唤醒优先级的排序,因而在最高优先级的语音识别设备出现故障时,可根据唤醒优先级的排序来确定下一唤醒优先级的语音识别设备作为中枢设备或第一中枢设备。In the network applied in Figure 2 and Figure 4, the voice recognition device has a wake-up priority sequence, so when the highest priority voice recognition device fails, the next wake-up priority can be determined according to the wake-up priority sequence. The voice recognition device serves as the hub device or the first hub device.
对于语音识别设备来说,可周期性的检测其自身在区域网络中是否为最高唤醒优先级,也可在区域网络发生变化时检测自身是否为最高唤醒优先级;若检测到自身为当前区域网络中的最高唤醒优先级,即响应于检测到在区域网络中为最高唤醒优先级,则作为中枢设备运行。For voice recognition equipment, it can periodically detect whether it has the highest wake-up priority in the local area network, or detect whether it has the highest wake-up priority when the local network changes; if it detects that it is the current local network The highest wake-up priority in, that is, in response to detecting that it is the highest wake-up priority in the local area network, it operates as a hub device.
本实施例网络中实现唤醒响应方法,所基于的是网络中语音识别设备具有唤醒优先级排序,且语音识别设备作为网络中枢设备可进行距离信息的比较。因而对于新加入到网络中的语音识别设备,也需要符合本实施例的唤醒机制,可由中枢设备来进行相关设置。The wake-up response method implemented in the network of this embodiment is based on the fact that the voice recognition device in the network has a wake-up priority order, and the voice recognition device as a network hub device can compare distance information. Therefore, the voice recognition device newly added to the network also needs to comply with the wake-up mechanism of this embodiment, which can be set by the hub device.
中枢设备可获取加入网络的语音识别设备的设备信息。根据预设规则分析设备信息,以重新对网络中的语音识别设备进行唤醒优先级的排序。The hub device can obtain the device information of the voice recognition device joining the network. Analyze device information according to preset rules to re-order the voice recognition devices in the network to wake up priority.
每个语音识别设备均搭载有语音识别系统,语音识别系统决定了唤醒优先级,语音识别算法,唤醒模板等。若新加入的语音识别设备具有不同语音识别系统,即其具有不同的唤醒优先级设置,网络中枢设备则可根据其本身的唤醒优先级设置来重新排序。例如网络A1-A2-A3,新加入的语音识别设备A4,其唤醒优先级的设置为大于A3,则可对将唤醒优先级重新排序为A1>A2>A4>A3。Each voice recognition device is equipped with a voice recognition system, which determines the wake-up priority, voice recognition algorithm, wake-up template, etc. If the newly added voice recognition device has a different voice recognition system, that is, it has different wake-up priority settings, the network hub device can reorder according to its own wake-up priority settings. For example, in the network A1-A2-A3, the newly added voice recognition device A4, whose wake-up priority is set to be greater than A3, can reorder the wake-up priority as A1>A2>A4>A3.
若新加入的语音识别设备具有相同的语音识别系统,即其具有相同的唤醒优先级设置,则将以先加入网络的语音识别设备的唤醒优先级为更高。例如,新加入的语音识别设备A3,与之前的A3具有相同的语音识别系统,则之前的A3作为A31,新加入的作为A32,唤醒优先级的重新排序为A1>A2>A31>A32。If the newly added voice recognition device has the same voice recognition system, that is, it has the same wake-up priority setting, the wake-up priority of the voice recognition device that joins the network first will be higher. For example, the newly added voice recognition device A3 has the same voice recognition system as the previous A3, the previous A3 is used as A31, the newly added one is used as A32, and the wake-up priority is reordered as A1>A2>A31>A32.
对于本实施例网络来说,其中实现唤醒响应方法的所有步骤均可在网络内部完成,因而本实施例的语音识别设备可离线运行。For the network of this embodiment, all the steps in which the wake-up response method is implemented can be completed inside the network, so the voice recognition device of this embodiment can run offline.
在以上语音识别设备相互连接所构成的单区域网络中,语音识别设备可作为两种角色,一是作为中枢设备运作,另一是作为非中枢设备运作。对于每一 语音识别设备,其可作为中枢设备,具有较强较多的功能;也可仅作为非中枢设备,具有轻量化的功能。In the single area network formed by the interconnection of the above voice recognition devices, the voice recognition device can play two roles, one is to operate as a central device, and the other is to operate as a non-central device. For each speech recognition device, it can be used as a central device with more powerful functions; it can also be used as a non-central device with lighter weight.
在家电领域,对于大型家电,例如冰箱、电视机等,可在其中加载功能较强较多的语音识别系统,使其能够作为中枢设备;而对于小型家电,如电饭煲,电水壶等,可在其中加载轻量级功能的语音识别系统,使其仅作为非中枢设备。In the field of household appliances, for large household appliances, such as refrigerators, televisions, etc., a voice recognition system with more powerful functions can be loaded into it, so that it can be used as a central device; for small household appliances, such as rice cookers, electric kettles, etc., The voice recognition system with lightweight functions makes it only a non-central device.
对于能够作为网络中枢设备的语音识别装置,其实现唤醒响应方法的步骤请参阅图5,图5是本申请语音识别设备的唤醒响应方法的中枢设备端工作流程示意图。作为网络中枢设备,其实现唤醒响应方法包括以下步骤。For a voice recognition device that can be used as a network hub device, please refer to FIG. 5 for the steps of implementing the wake-up response method. FIG. 5 is a schematic diagram of the hub device side workflow of the wake-up response method of the voice recognition device of the present application. As a network hub device, its wake-up response method includes the following steps.
S501:分析采集的语音信号,以获得中枢设备的距离信息。S501: Analyze the collected voice signal to obtain distance information of the central device.
对于每个区域网络中的中枢设备时,本步骤S501在上述步骤S201中完成,具体不再赘述。For the hub device in each area network, this step S501 is completed in the above step S201, and the details will not be repeated.
S502:接收非中枢设备的非中枢设备的距离信息。S502: Receive distance information of a non-central device that is a non-central device.
本步骤S502与上述步骤S202对应,具体不再赘述。This step S502 corresponds to the above step S202, and the details are not repeated here.
S503:比较中枢设备的距离信息和非中枢设备的距离信息,确定区域网络中的待响应语音识别设备。S503: Compare the distance information of the central device with the distance information of the non-central device, and determine the voice recognition device to be responded in the regional network.
本步骤S503与上述步骤S203类似,具体不再赘述。This step S503 is similar to the above step S203, and the details will not be described in detail.
上述步骤以语音识别设备作为中枢设备的角色,来说明其在实现单区域唤醒响应方法时的步骤,其中每个步骤的具体细节,中枢设备运行的具体细节也已在上文中描述,因此不再赘述。本实施例语音识别设备可从多个语音识别设备中确定响应该语音信号的一个语音识别设备,从而避免了均响应而相互干扰的问题。The above steps use the voice recognition device as the role of the central device to illustrate the steps in implementing the single-area wake-up response method. The specific details of each step and the specific details of the operation of the central device have also been described above, so they will not be Repeat. The voice recognition device of this embodiment can determine a voice recognition device that responds to the voice signal from multiple voice recognition devices, thereby avoiding the problem of mutual interference due to all responses.
进一步的,对于多区域网络,中枢设备还分为第一中枢设备和第二中枢设备,对于第一中枢设备来说,其进一步执行以下步骤。Further, for a multi-area network, the hub device is further divided into a first hub device and a second hub device. For the first hub device, it further performs the following steps.
S504:第一中枢设备接收第二距离信息。S504: The first hub device receives the second distance information.
本步骤S504在上述步骤S401中完成,具体不再赘述。This step S504 is completed in the above step S401, and the details are not repeated here.
S506:比较第一距离信息和第二距离信息,确定响应语音信号的语音识别设备。S506: Compare the first distance information with the second distance information, and determine a voice recognition device that responds to the voice signal.
本步骤S506与上述步骤S402类似,具体不再赘述。This step S506 is similar to the above step S402, and the details are not repeated here.
对于第二中枢设备来说,其则执行以下步骤。For the second hub device, it performs the following steps.
S505:第二中枢设备向第一中枢设备发送第二距离信息,以由第一中枢设备比较第一距离信息和第二距离信息,从而确定响应语音信号的语音识别设备。S505: The second hub device sends second distance information to the first hub device, so that the first hub device compares the first distance information with the second distance information, so as to determine a voice recognition device that responds to the voice signal.
本步骤S505在上述步骤S401-S402中完成,具体不再赘述。This step S505 is completed in the above steps S401-S402, and the details are not repeated here.
进一步的,在多区域网络中,由第一中枢设备进一步确定由哪个区域网络中的待响应语音识别设备来响应语音信号。Further, in a multi-area network, the first hub device further determines which area network's to-be-responsive voice recognition device responds to the voice signal.
从非中枢设备的角度来看,其实现唤醒响应方法的步骤请参阅图6,图6是本申请语音识别设备唤醒响应方法的非中枢设备端工作流程示意图。该语音识别设备作为非中枢设备,本实施例唤醒响应方法包括以下步骤。From the perspective of a non-central device, please refer to FIG. 6 for the steps of implementing the wake-up response method. FIG. 6 is a schematic diagram of the non-central device side work flow of the voice recognition device wake-up response method of the present application. The voice recognition device is a non-central device, and the wake-up response method of this embodiment includes the following steps.
S601:分析采集的语音信号,以获得非中枢设备的距离信息。S601: Analyze the collected voice signal to obtain distance information of the non-central device.
本步骤S601与上述步骤S201类似,均为获取距离信息,具体过程不再赘述。This step S601 is similar to the above step S201, both of which are obtaining distance information, and the specific process will not be repeated.
S602:向中枢设备发送非中枢设备的距离信息,以由中枢设备比较非中枢设备的距离信息和中枢设备的距离信息,来确定待响应语音识别设备。S602: Send the distance information of the non-central device to the central device, so that the central device compares the distance information of the non-central device with the distance information of the central device to determine the voice recognition device to be responded to.
作为非中枢设备,其在采集到语音信号后,并不立刻响应该语音信号,而是进行计算分析获得距离信息,然后再将该距离信息传送给中枢设备进行分析比较,由中枢设备来确认响应语音信号的语音识别设备。As a non-central device, after collecting the voice signal, it does not respond to the voice signal immediately, but performs calculation and analysis to obtain distance information, and then transmits the distance information to the central device for analysis and comparison, and the central device confirms the response Voice recognition equipment for voice signals.
本实施例以语音识别设备作为非中枢设备的角色,来说明其在实现唤醒响应方法时的步骤,其中每个步骤的具体细节,非中枢设备运行的具体细节也已在上文中描述,因此不再赘述。本实施例语音识别设备在接收到语音信号后不会立即响应,而是在收到通知后再决定是否响应,避免了与其他语音识别设备同时响应,造成的相互干扰的问题。In this embodiment, the role of the voice recognition device as a non-central device is used to illustrate the steps in implementing the wake-up response method. The specific details of each step and the specific details of the operation of the non-central device have also been described above. Repeat it again. The voice recognition device of this embodiment does not respond immediately after receiving the voice signal, but decides whether to respond after receiving the notification, which avoids the problem of mutual interference caused by simultaneous response with other voice recognition devices.
上述唤醒响应方法由语音识别设备实现,因而本申请还提出语音识别设备,请参阅图7,图7是本申请语音识别设备一实施例的结构示意图,本实施例语音识别设备100可以是家用电器,其包括相互连接的至少三个麦克风11,处理器12和存储器13,本实施例语音识别设备100可实现上述唤醒响应方法的实施例。其中,至少三个麦克风11相对位置固定,用于采集语音信号,存储器13中存储有计算机程序,处理器12用于执行计算机程序以实现上述唤醒响应方法。The above wake-up response method is implemented by a voice recognition device. Therefore, this application also proposes a voice recognition device. Please refer to Figure 7. Figure 7 is a schematic structural diagram of an embodiment of the voice recognition device of this application. The voice recognition device 100 in this embodiment may be a household appliance. , Which includes at least three microphones 11, a processor 12, and a memory 13 connected to each other. The voice recognition device 100 of this embodiment can implement the above-mentioned wake-up response method embodiment. Among them, at least three microphones 11 have a fixed relative position and are used to collect voice signals, a computer program is stored in the memory 13, and the processor 12 is used to execute the computer program to implement the above wake-up response method.
具体来说,至少三个麦克风11用于采集语音信号;处理器12用于根据至少三个麦克风的相对位置及分别采集的语音信号,计算获得语音识别设备与语音信号的信号源的距离信息,并比较所有的距离信息,以确定响应语音信号的语音识别设备;向其他语音识别设备发送是否响应语音信号的通知。Specifically, at least three microphones 11 are used to collect voice signals; the processor 12 is used to calculate the distance information between the voice recognition device and the signal source of the voice signal according to the relative positions of the at least three microphones and the voice signals collected respectively, And compare all distance information to determine the voice recognition device that responds to the voice signal; send notifications to other voice recognition devices whether they respond to the voice signal.
或者,至少三个麦克风11用于采集语音信号;处理器12用于根据至少三个麦克风的相对位置及分别采集的语音信号,计算获得语音识别设备与语音信 号的信号源的距离信息,将距离信息发送至中枢设备,根据所接收到的中枢设备发送的是否响应语音信号的通知,来确定是否响应。Alternatively, at least three microphones 11 are used to collect voice signals; the processor 12 is used to calculate the distance information between the voice recognition device and the signal source of the voice signal based on the relative positions of the at least three microphones and the voice signals collected separately, and calculate the distance The information is sent to the central device, and it is determined whether to respond according to the received notification sent by the central device whether it responds to the voice signal.
其中,处理器12可以是一种集成电路芯片,具有信号的处理能力。处理器12还可以是通用处理器、数字信号处理器(DSP)、专用集成电路(ASIC)、现成可编程门阵列(FPGA)或者其他可编程逻辑器件、分立门或者晶体管逻辑器件、分立硬件组件。通用处理器可以是微处理器或者该处理器也可以是任何常规的处理器等。Among them, the processor 12 may be an integrated circuit chip with signal processing capability. The processor 12 may also be a general-purpose processor, a digital signal processor (DSP), an application specific integrated circuit (ASIC), an off-the-shelf programmable gate array (FPGA) or other programmable logic device, a discrete gate or transistor logic device, a discrete hardware component . The general-purpose processor may be a microprocessor or the processor may also be any conventional processor or the like.
对于上述实施例的方法,其可以计算机程序的形式存在,因而本申请提出一种计算机存储介质,请参阅图8,图8是本申请计算机存储介质一实施例的结构示意图。本实施例计算机存储介质200中存储有计算机程序21,其可被执行以实现上述实施例中的方法。For the method of the foregoing embodiment, it may exist in the form of a computer program. Therefore, this application proposes a computer storage medium. Please refer to FIG. 8. FIG. 8 is a schematic structural diagram of an embodiment of the computer storage medium of the present application. The computer storage medium 200 of this embodiment stores a computer program 21, which can be executed to implement the method in the foregoing embodiment.
本实施例计算机存储介质200可以是U盘、移动硬盘、只读存储器(ROM,Read-Only Memory)、随机存取存储器(RAM,Random Access Memory)、磁碟或者光盘等可以存储程序指令的介质,或者也可以为存储有该程序指令的服务器,该服务器可将存储的程序指令发送给其他设备运行,或者也可以自运行该存储的程序指令。The computer storage medium 200 of this embodiment may be a U disk, a mobile hard disk, a read-only memory (ROM, Read-Only Memory), a random access memory (RAM, Random Access Memory), a magnetic disk or an optical disk, etc., which can store program instructions. Or it may also be a server storing the program instructions, and the server may send the stored program instructions to other devices to run, or it may run the stored program instructions by itself.
在本申请所提供的几个实施例中,应该理解到,所揭露的方法和装置,可以通过其它的方式实现。例如,以上所描述的装置实施方式仅仅是示意性的,例如,模块或单元的划分,仅仅为一种逻辑功能划分,实际实现时可以有另外的划分方式,例如多个单元或组件可以结合或者可以集成到另一个系统,或一些特征可以忽略,或不执行。另一点,所显示或讨论的相互之间的耦合或直接耦合或通信连接可以是通过一些接口,装置或单元的间接耦合或通信连接,可以是电性,机械或其它的形式。In the several embodiments provided in this application, it should be understood that the disclosed method and device can be implemented in other ways. For example, the device implementation described above is merely illustrative, for example, the division of modules or units is only a logical function division, and there may be other divisions in actual implementation, for example, multiple units or components can be combined or It can be integrated into another system, or some features can be ignored or not implemented. In addition, the displayed or discussed mutual coupling or direct coupling or communication connection may be indirect coupling or communication connection through some interfaces, devices or units, and may be in electrical, mechanical or other forms.
作为分离部件说明的单元可以是或者也可以不是物理上分开的,作为单元显示的部件可以是或者也可以不是物理单元,即可以位于一个地方,或者也可以分布到多个网络单元上。可以根据实际的需要选择其中的部分或者全部单元来实现本实施方式方案的目的。The units described as separate components may or may not be physically separate, and the components displayed as units may or may not be physical units, that is, they may be located in one place, or they may be distributed on multiple network units. Some or all of the units may be selected according to actual needs to achieve the objectives of the solutions of this embodiment.
另外,在本申请各个实施例中的各功能单元可以集成在一个处理单元中,也可以是各个单元单独物理存在,也可以两个或两个以上单元集成在一个单元中。上述集成的单元既可以采用硬件的形式实现,也可以采用软件功能单元的形式实现。In addition, the functional units in each embodiment of the present application may be integrated into one processing unit, or each unit may exist alone physically, or two or more units may be integrated into one unit. The above-mentioned integrated unit can be implemented in the form of hardware or software functional unit.
集成的单元如果以软件功能单元的形式实现并作为独立的产品销售或使用时,可以存储在一个计算机可读取存储介质中。基于这样的理解,本申请的技术方案本质上或者说对现有技术做出贡献的部分或者该技术方案的全部或部分可以以软件产品的形式体现出来,该计算机软件产品存储在一个存储介质中,包括若干指令用以使得一台计算机设备(可以是个人计算机,服务器,或者网络设备等)或处理器(processor)执行本申请各个实施方式方法的全部或部分步骤。而前述的存储介质包括:U盘、移动硬盘、只读存储器(ROM,Read-Only Memory)、随机存取存储器(RAM,Random Access Memory)、磁碟或者光盘等各种可以存储程序代码的介质。If the integrated unit is implemented in the form of a software functional unit and sold or used as an independent product, it can be stored in a computer readable storage medium. Based on this understanding, the technical solution of this application essentially or the part that contributes to the existing technology or all or part of the technical solution can be embodied in the form of a software product, and the computer software product is stored in a storage medium , Including several instructions to make a computer device (which can be a personal computer, a server, or a network device, etc.) or a processor execute all or part of the steps of the methods in the various embodiments of the present application. The aforementioned storage media include: U disk, mobile hard disk, read-only memory (ROM, Read-Only Memory), random access memory (RAM, Random Access Memory), magnetic disk or optical disk and other media that can store program code .
以上所述仅为本申请的实施方式,并非因此限制本申请的专利范围,凡是利用本申请说明书及附图内容所作的等效结构或等效流程变换,或直接或间接运用在其他相关的技术领域,均同理包括在本申请的专利保护范围内。The above are only implementations of this application, and do not limit the scope of this application. Any equivalent structure or equivalent process transformation made by using the description and drawings of this application, or directly or indirectly applied to other related technologies In the same way, all fields are included in the scope of patent protection of this application.

Claims (22)

  1. 一种语音识别设备的唤醒响应方法,其特征在于,所述多个语音识别设备构成区域网络,所述多个语音识别设备分为一个中枢设备和至少一个非中枢设备;所述唤醒响应方法包括:A wake-up response method for a voice recognition device, wherein the multiple voice recognition devices form a regional network, and the multiple voice recognition devices are divided into a central device and at least one non-central device; the wake-up response method includes :
    所述中枢设备分析采集的语音信号,以获得所述中枢设备的距离信息;所述中枢设备的距离信息表示所述中枢设备与所述语音信号的信号源的距离;The hub device analyzes the collected voice signals to obtain distance information of the hub device; the distance information of the hub device indicates the distance between the hub device and the signal source of the voice signal;
    接收所述非中枢设备的距离信息,所述非中枢设备的距离信息由所述非中枢设备分析采集的所述语音信号而获得,表示所述非中枢设备与所述信号源的距离;Receiving distance information of the non-central device, where the distance information of the non-central device is obtained by analyzing the collected voice signal by the non-central device, and represents the distance between the non-central device and the signal source;
    比较所述中枢设备的距离信息和所述非中枢设备的距离信息;Comparing the distance information of the central device with the distance information of the non-central device;
    确定待响应语音识别设备,所述待响应语音识别设备为所述区域网络中响应所述语音信号的语音识别设备。A voice recognition device to be responded to is determined, and the voice recognition device to be responded is a voice recognition device that responds to the voice signal in the local area network.
  2. 根据权利要求1所述的唤醒响应方法,其特征在于,所述比较所述中枢设备的距离信息和所述非中枢设备的距离信息,确定待响应语音识别设备,包括:The wake-up response method according to claim 1, wherein the comparing the distance information of the central device with the distance information of the non-central device to determine the voice recognition device to be responded to comprises:
    比较所述中枢设备的距离信息的距离值和所述非中枢设备的距离信息的距离值,得到距离值最小的距离信息;Comparing the distance value of the distance information of the hub device with the distance value of the distance information of the non-central device to obtain the distance information with the smallest distance value;
    确定所述距离值最小的距离信息对应的语音识别设备为所述待响应语音识别设备。It is determined that the voice recognition device corresponding to the distance information with the smallest distance value is the voice recognition device to be responded.
  3. 根据权利要求2所述的唤醒响应方法,其特征在于,所述多个语音识别设备具有唤醒优先级;所述确定所述距离值最小的距离信息对应的语音识别设备为所述待响应语音识别设备,包括:The wake-up response method according to claim 2, wherein the multiple voice recognition devices have a wake-up priority; the voice recognition device corresponding to the distance information with the smallest distance value is determined to be the voice recognition device to be responded Equipment, including:
    在所述距离值最小的距离信息对应的语音识别设备中,确定唤醒优先级最高的作为所述待响应语音识别设备。Among the voice recognition devices corresponding to the distance information with the smallest distance value, the voice recognition device with the highest wake-up priority is determined as the voice recognition device to be responded.
  4. 根据权利要求1所述的唤醒响应方法,其特征在于,所述唤醒响应方法进一步包括:The wake-up response method according to claim 1, wherein the wake-up response method further comprises:
    所述中枢设备向所述非中枢设备发送是否响应所述语音信号的通知。The hub device sends a notification whether to respond to the voice signal to the non-central device.
  5. 根据权利要求1所述的唤醒响应方法,其特征在于,多个所述区域网络相互连接,所述区域网络中的多个中枢设备分为一个第一中枢设备和至少一个第二中枢设备;所述唤醒响应方法进一步包括:The wake-up response method according to claim 1, wherein a plurality of the regional networks are connected to each other, and the multiple central devices in the regional network are divided into a first central device and at least one second central device; The wake-up response method further includes:
    所述第二中枢设备向所述第一中枢设备发送第二距离信息,以由所述第一 中枢设备比较所述第二距离信息和第一距离信息,从而确定响应所述语音信号的语音识别设备;The second hub device sends second distance information to the first hub device, so that the first hub device compares the second distance information with the first distance information, thereby determining the voice recognition in response to the voice signal equipment;
    所述第一距离信息为所述第一中枢设备所在区域网络的待响应语音识别设备的距离信息,所述第二距离信息为所述第二中枢设备所在区域网络的待响应语音识别设备的距离信息。The first distance information is the distance information of the voice recognition device to be responded in the local area network where the first hub device is located, and the second distance information is the distance of the voice recognition device to be responded to in the area network where the second hub device is located information.
  6. 根据权利要求1所述的唤醒响应方法,其特征在于,多个所述区域网络相互连接,所述区域网络中的多个中枢设备分为一个第一中枢设备和至少一个第二中枢设备;所述唤醒响应方法进一步包括:The wake-up response method according to claim 1, wherein a plurality of the regional networks are connected to each other, and the multiple central devices in the regional network are divided into a first central device and at least one second central device; The wake-up response method further includes:
    所述第一中枢设备接收第二距离信息,所述第二距离信息为所述第二中枢设备所在区域网络的待响应语音识别设备的距离信息;Receiving, by the first hub device, second distance information, where the second distance information is the distance information of the voice recognition device to be responded to in the regional network where the second hub device is located;
    比较所述第二距离信息和第一距离信息,以确定响应所述语音信号的语音识别设备,所述第一距离信息为所述第一中枢设备所在区域网络的待响应语音识别设备的距离信息。The second distance information and the first distance information are compared to determine a voice recognition device that responds to the voice signal, and the first distance information is the distance information of the voice recognition device to be responded to in the area network where the first hub device is located .
  7. 根据权利要求5或6所述的唤醒响应方法,其特征在于,所述比较所述第二距离信息和第一距离信息,以确定响应所述语音信号的语音识别设备,包括:The wake-up response method according to claim 5 or 6, wherein the comparing the second distance information with the first distance information to determine a voice recognition device that responds to the voice signal comprises:
    比较所述第一距离信息的距离值和所述第二距离信息的距离值,得到距离值最小的距离信息;Comparing the distance value of the first distance information with the distance value of the second distance information to obtain the distance information with the smallest distance value;
    确定所述距离值最小的距离信息对应的语音识别设备响应所述语音信号。It is determined that the voice recognition device corresponding to the distance information with the smallest distance value responds to the voice signal.
  8. 根据权利要求7所述的唤醒响应方法,其特征在于,所述多个语音识别设备具有唤醒优先级;所述确定所述距离值最小的距离信息对应的语音识别设备响应所述语音信号,包括:The wake-up response method according to claim 7, wherein the multiple voice recognition devices have a wake-up priority; the determining that the voice recognition device corresponding to the distance information with the smallest distance value responds to the voice signal, comprising :
    在所述距离值最小的距离信息对应的语音识别设备中,确定唤醒优先级最高的语音识别设备响应所述语音信号。Among the voice recognition devices corresponding to the distance information with the smallest distance value, it is determined that the voice recognition device with the highest wake-up priority responds to the voice signal.
  9. 根据权利要求5或6所述的唤醒响应方法,其特征在于,所述唤醒响应方法进一步包括:The wake-up response method of claim 5 or 6, wherein the wake-up response method further comprises:
    所述第一中枢设备向所述多个区域网络中的其他语音识别设备发送是否响应所述语音信号的通知。The first hub device sends a notification whether to respond to the voice signal to other voice recognition devices in the multiple area networks.
  10. 根据权利要求1-6中任一项所述的唤醒响应方法,其特征在于,所述中枢设备的距离信息和所述非中枢设备的距离信息统称为距离信息;分析采集的语音信号获得距离信息,包括:The wake-up response method according to any one of claims 1-6, wherein the distance information of the central device and the distance information of the non-central device are collectively referred to as distance information; the collected voice signal is analyzed to obtain the distance information ,include:
    通过至少三个麦克风分别采集所述语音信号,所述至少三个麦克风在所述语音识别设备上的相对位置固定;Collecting the voice signals through at least three microphones respectively, and the relative positions of the at least three microphones on the voice recognition device are fixed;
    根据所述至少三个麦克风的相对位置及分别采集的语音信号,计算所述距离信息的距离值。The distance value of the distance information is calculated according to the relative positions of the at least three microphones and the voice signals collected respectively.
  11. 根据权利要求10所述的唤醒响应方法,其特征在于,所述根据所述至少三个麦克风的相对位置及分别采集的语音信号,计算所述距离信息的距离值,包括:The wake-up response method according to claim 10, wherein the calculating the distance value of the distance information according to the relative positions of the at least three microphones and the voice signals collected respectively includes:
    根据所述至少三个麦克风分别采集的语音信号,计算所述至少三个麦克风与所述信号源的相对方位;Calculating the relative positions of the at least three microphones and the signal source according to the voice signals collected by the at least three microphones;
    根据所述至少三个麦克风与所述信号源的相对方位,以及所述至少三个麦克风之间的相对位置,计算所述距离信息的距离值。The distance value of the distance information is calculated according to the relative position of the at least three microphones and the signal source, and the relative position between the at least three microphones.
  12. 根据权利要求11所述的唤醒响应方法,其特征在于,所述根据所述至少三个麦克风分别采集的语音信号,计算所述至少三个麦克风与所述信号源的相对方位,包括:The wake-up response method according to claim 11, wherein the calculating the relative positions of the at least three microphones and the signal source according to the voice signals collected by the at least three microphones respectively comprises:
    利用DOA算法计算线性阵列的三个麦克风分别采集到的语音信号,获得每两相邻所述麦克风与所述信号源的相对方位。The DOA algorithm is used to calculate the voice signals respectively collected by the three microphones of the linear array, and the relative positions of every two adjacent microphones and the signal source are obtained.
  13. 一种语音识别设备的唤醒响应方法,其特征在于,所述多个语音识别设备构成区域网络,所述多个语音识别设备分为一个中枢设备和至少一个非中枢设备;所述唤醒响应方法包括:A wake-up response method for a voice recognition device, wherein the multiple voice recognition devices form a regional network, and the multiple voice recognition devices are divided into a central device and at least one non-central device; the wake-up response method includes :
    所述非中枢设备分析采集的语音信号,以获得所述非中枢设备的距离信息;所述非中枢设备的距离信息表示所述非中枢设备与所述语音信号的信号源的距离;The non-central device analyzes the collected voice signal to obtain the distance information of the non-central device; the distance information of the non-central device indicates the distance between the non-central device and the signal source of the voice signal;
    向所述中枢设备发送非中枢设备的距离信息,以由所述中枢设备比较所述非中枢设备的距离信息和所述中枢设备的距离信息,来确定待响应语音识别设备;Sending the distance information of the non-central device to the central device, so that the central device compares the distance information of the non-central device with the distance information of the central device to determine the voice recognition device to be responded;
    所述中枢设备的距离信息表示所述中枢设备与所述语音信号的信号源的距离,所述待响应语音识别设备为所述区域网络中响应所述语音信号的语音识别设备。The distance information of the hub device indicates the distance between the hub device and the signal source of the voice signal, and the voice recognition device to be responded is a voice recognition device in the local area network that responds to the voice signal.
  14. 根据权利要求13所述的唤醒响应方法,其特征在于,所述中枢设备比较所述中枢设备的距离信息和所述非中枢设备的距离信息,确定待响应语音识别设备,包括:The wake-up response method of claim 13, wherein the central device compares the distance information of the central device with the distance information of the non-central device to determine the voice recognition device to be responded to, comprising:
    所述中枢设备比较所述中枢设备的距离信息的距离值和所述非中枢设备的距离信息的距离值,得到距离值最小的距离信息;The hub device compares the distance value of the distance information of the hub device with the distance value of the distance information of the non-central device to obtain the distance information with the smallest distance value;
    确定所述距离值最小的距离信息对应的语音识别设备为所述待响应语音识别设备。It is determined that the voice recognition device corresponding to the distance information with the smallest distance value is the voice recognition device to be responded.
  15. 根据权利要求14所述的唤醒响应方法,其特征在于,所述多个语音识别设备具有唤醒优先级;所述确定所述距离值最小的距离信息对应的语音识别设备为所述待响应语音识别设备,包括:The wake-up response method according to claim 14, wherein the plurality of voice recognition devices have a wake-up priority; the voice recognition device corresponding to the distance information with the smallest distance value is determined to be the voice recognition device to be responded Equipment, including:
    在所述距离值最小的距离信息对应的语音识别设备中,确定唤醒优先级最高的作为所述待响应语音识别设备。Among the voice recognition devices corresponding to the distance information with the smallest distance value, the voice recognition device with the highest wake-up priority is determined as the voice recognition device to be responded.
  16. 根据权利要求13所述的唤醒响应方法,其特征在于,所述唤醒响应方法进一步包括:The wake-up response method according to claim 13, wherein the wake-up response method further comprises:
    接收所述中枢设备发送的是否响应所述语音信号的通知。Receiving a notification sent by the hub device whether to respond to the voice signal.
  17. 根据权利要求13-16中任一项所述的唤醒响应方法,其特征在于,所述中枢设备的距离信息和所述非中枢设备的距离信息统称为距离信息;分析采集的语音信号获得距离信息,包括:The wake-up response method according to any one of claims 13-16, wherein the distance information of the central device and the distance information of the non-central device are collectively referred to as distance information; the collected voice signals are analyzed to obtain the distance information ,include:
    通过至少三个麦克风分别采集所述语音信号,所述至少三个麦克风在所述语音识别设备上的相对位置固定;Collecting the voice signals through at least three microphones respectively, and the relative positions of the at least three microphones on the voice recognition device are fixed;
    根据所述至少三个麦克风的相对位置及分别采集的语音信号,计算所述距离信息的距离值。The distance value of the distance information is calculated according to the relative positions of the at least three microphones and the voice signals collected respectively.
  18. 根据权利要求17所述的唤醒响应方法,其特征在于,所述根据所述至少三个麦克风的相对位置及分别采集的语音信号,计算所述距离信息的距离值,包括:The wake-up response method according to claim 17, wherein the calculating the distance value of the distance information according to the relative positions of the at least three microphones and the voice signals collected respectively comprises:
    根据所述至少三个麦克风分别采集的语音信号,计算所述至少三个麦克风与所述信号源的相对方位;Calculating the relative positions of the at least three microphones and the signal source according to the voice signals collected by the at least three microphones;
    根据所述至少三个麦克风与所述信号源的相对方位,以及所述至少三个麦克风之间的相对位置,计算所述距离信息的距离值。The distance value of the distance information is calculated according to the relative position of the at least three microphones and the signal source, and the relative position between the at least three microphones.
  19. 根据权利要求18所述的唤醒响应方法,其特征在于,所述根据所述至少三个麦克风分别采集的语音信号,计算所述至少三个麦克风与所述信号源的相对方位,包括:The wake-up response method according to claim 18, wherein the calculating the relative positions of the at least three microphones and the signal source according to the voice signals collected by the at least three microphones respectively comprises:
    利用DOA算法计算线性阵列的三个麦克风分别采集到的语音信号,获得每两相邻所述麦克风与所述信号源的相对方位。The DOA algorithm is used to calculate the voice signals respectively collected by the three microphones of the linear array, and the relative positions of every two adjacent microphones and the signal source are obtained.
  20. 一种语音识别设备,其特征在于,所述语音识别设备包括处理器和存储器;所述存储器中存储有计算机程序,所述处理器用于执行所述计算机程序以实现如权利要求1-19中任一项所述方法的步骤。A voice recognition device, characterized in that the voice recognition device includes a processor and a memory; a computer program is stored in the memory, and the processor is used to execute the computer program to implement any of claims 1-19 One of the steps of the method.
  21. 根据权利要求20所述的语音识别设备,其特征在于,所述语音识别设备包括相对位置固定的至少三个麦克风。The voice recognition device according to claim 20, wherein the voice recognition device comprises at least three microphones with fixed relative positions.
  22. 一种计算机存储介质,其特征在于,所述计算机存储介质存储有计算机程序,所述计算机程序被执行以实现如权利要求1-19中任一项所述方法的步骤。A computer storage medium, wherein the computer storage medium stores a computer program, and the computer program is executed to implement the steps of the method according to any one of claims 1-19.
PCT/CN2019/124117 2019-04-26 2019-12-09 Speech recognition device and wake-up response method therefor, and computer storage medium WO2020215741A1 (en)

Applications Claiming Priority (2)

Application Number Priority Date Filing Date Title
CN201910343044.7 2019-04-26
CN201910343044.7A CN111862964B (en) 2019-04-26 2019-04-26 Voice recognition equipment and wake-up response method thereof as well as computer storage medium

Publications (1)

Publication Number Publication Date
WO2020215741A1 true WO2020215741A1 (en) 2020-10-29

Family

ID=72940705

Family Applications (1)

Application Number Title Priority Date Filing Date
PCT/CN2019/124117 WO2020215741A1 (en) 2019-04-26 2019-12-09 Speech recognition device and wake-up response method therefor, and computer storage medium

Country Status (2)

Country Link
CN (1) CN111862964B (en)
WO (1) WO2020215741A1 (en)

Cited By (1)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN113450791A (en) * 2021-04-28 2021-09-28 珠海格力电器股份有限公司 Voice equipment control method and device, storage medium and voice equipment

Citations (5)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
WO2017135531A1 (en) * 2016-02-05 2017-08-10 삼성전자(주) Voice recognition apparatus and method, and voice recognition system
CN108337601A (en) * 2018-01-30 2018-07-27 出门问问信息科技有限公司 The control method and device of speaker
CN109377987A (en) * 2018-08-31 2019-02-22 百度在线网络技术(北京)有限公司 Exchange method, device, equipment and the storage medium of intelligent sound equipment room
CN109509468A (en) * 2018-11-01 2019-03-22 珠海格力电器股份有限公司 A kind of equipment executes the method and device of voice broadcast task
CN109658927A (en) * 2018-11-30 2019-04-19 北京小米移动软件有限公司 Wake-up processing method, device and the management equipment of smart machine

Patent Citations (5)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
WO2017135531A1 (en) * 2016-02-05 2017-08-10 삼성전자(주) Voice recognition apparatus and method, and voice recognition system
CN108337601A (en) * 2018-01-30 2018-07-27 出门问问信息科技有限公司 The control method and device of speaker
CN109377987A (en) * 2018-08-31 2019-02-22 百度在线网络技术(北京)有限公司 Exchange method, device, equipment and the storage medium of intelligent sound equipment room
CN109509468A (en) * 2018-11-01 2019-03-22 珠海格力电器股份有限公司 A kind of equipment executes the method and device of voice broadcast task
CN109658927A (en) * 2018-11-30 2019-04-19 北京小米移动软件有限公司 Wake-up processing method, device and the management equipment of smart machine

Cited By (2)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN113450791A (en) * 2021-04-28 2021-09-28 珠海格力电器股份有限公司 Voice equipment control method and device, storage medium and voice equipment
CN113450791B (en) * 2021-04-28 2023-08-04 珠海格力电器股份有限公司 Voice equipment control method and device, storage medium and voice equipment

Also Published As

Publication number Publication date
CN111862964A (en) 2020-10-30
CN111862964B (en) 2024-03-22

Similar Documents

Publication Publication Date Title
US9431014B2 (en) Intelligent placement of appliance response to voice command
US11297470B2 (en) User location aware smart event handling
US9438440B2 (en) Proximity detection of internet of things (IoT) devices using sound chirps
US20220044685A1 (en) Voice Recognition Device, Waking-Up and Responding Method of the Same, and Computer Storage Medium
CN105957519B (en) Method and system for simultaneously performing voice control on multiple regions, server and microphone
JP2017524285A (en) Generating location profiles for Internet of Things devices based on extended location information associated with one or more nearby Internet of Things devices
CN110568771B (en) System and method for intelligently and cooperatively controlling intelligent household equipment
WO2021012581A1 (en) Voice recognition device and wake-up response method therefor, and computer storage medium
CN104967617B (en) A kind of data processing method and device
WO2020224265A1 (en) Voice control method and apparatus
CN105049922A (en) Proximity detection of candidate companion display device in same room as primary display using upnp
WO2020215741A1 (en) Speech recognition device and wake-up response method therefor, and computer storage medium
CN109214497B (en) People counting method and device, intelligent household equipment control method and device, and air conditioner
US11546688B2 (en) Loudspeaker device, method, apparatus and device for adjusting sound effect thereof, and medium
WO2023193411A1 (en) Network distribution method and apparatus for devices, computer device, and storage medium
CN107979641A (en) A kind of intelligent domestic system based on cloud computing
CN112086097A (en) Instruction response method of voice terminal, electronic device and computer storage medium
CN113496701A (en) Voice interaction system, method, equipment and conference system
US20160127460A1 (en) Multi-hop wireless peer-to-peer discovery protocol
US9801218B2 (en) Establishing method for self-organization network of wireless nodes
CN110160219B (en) Air conditioning system and control method thereof, air conditioner and control method of intelligent household appliance system
CN115312048A (en) Equipment awakening method and device, storage medium and electronic device
WO2023033731A1 (en) Sensing mesh network
CN117666371A (en) Control method and device of intelligent equipment, storage medium and electronic device
CN115731928A (en) Response device determination method and device

Legal Events

Date Code Title Description
121 Ep: the epo has been informed by wipo that ep was designated in this application

Ref document number: 19926047

Country of ref document: EP

Kind code of ref document: A1

NENP Non-entry into the national phase

Ref country code: DE

122 Ep: pct application non-entry in european phase

Ref document number: 19926047

Country of ref document: EP

Kind code of ref document: A1

32PN Ep: public notification in the ep bulletin as address of the adressee cannot be established

Free format text: NOTING OF LOSS OF RIGHTS PURSUANT TO RULE 112(1) EPC (EPO FORM 1205A DATED 28.03.2022)

122 Ep: pct application non-entry in european phase

Ref document number: 19926047

Country of ref document: EP

Kind code of ref document: A1