CN111415657A

CN111415657A - Decentralized device, multi-device system and voice control method thereof

Info

Publication number: CN111415657A
Application number: CN201910011444.8A
Authority: CN
Inventors: 何云鹏; 高君效
Original assignee: Chipintelli Technology Co Ltd
Current assignee: Chipintelli Technology Co Ltd
Priority date: 2019-01-07
Filing date: 2019-01-07
Publication date: 2020-07-14

Abstract

A decentralized device comprises a voice recognition module, an energy value calculation module, an energy value screening and sorting module, a transceiver module and an execution module, wherein the energy value calculation module, the transceiver module and the execution module are in signal connection with the energy value screening and sorting module, the energy value calculation module is also in signal connection with the voice recognition module and the transceiver module, and the transceiver modules of different devices can be in mutual communication. A decentralized multi-device system comprising a plurality of devices as described above. According to the invention, the multi-device identification adopts a decentralized processing mode, so that centralized processing equipment is removed, the identification is converted into self-processing of the equipment, and complex calculation is decomposed into a plurality of devices to be carried out in parallel, the calculation speed is improved, the network bandwidth requirement is reduced, the response speed of the system is improved, and the experience of a user is improved.

Description

Decentralized device, multi-device system and voice control method thereof

Technical Field

The invention belongs to the field of artificial intelligence, relates to a voice recognition technology, and particularly relates to a decentralized device, a multi-device system and a voice control method thereof.

Background

With the advent of speech recognition functionality, there may be many intelligent speech devices in the same scene, such as a lobby, that need to be awakened with an awakening word when in use. However, if the wake-up words of these devices are the same, then it may happen that multiple devices respond to the wake-up word of the user, interfering with the real needs of the user. For example, in one particular scenario shown in fig. 1, a user utters a voice command, and the wake-up words of devices a1, a2, A3 are identical, and the command may be recognized and executed.

In order to solve the above problems, a common method in the prior art is to uniformly determine which device a user is close to by a processing center, such as a cloud server, and which device responds to a wake-up instruction of the user, but this method needs to use an additional server for determination, which increases the cost; in the prior art, calculation and sequencing can be performed by directly receiving voice information and then calculating, and a device for executing a command is selected by calculating the distance between each device.

Disclosure of Invention

In order to overcome the defects in the prior art, the invention discloses a decentralized device, a multi-device system and a voice control method thereof.

The decentralized equipment comprises a voice recognition module, an energy value calculation module, an energy value screening and sorting module, a transceiver module and an execution module, wherein the energy value calculation module, the transceiver module and the execution module are in signal connection with the energy value screening and sorting module, the energy value calculation module is also in signal connection with the voice recognition module and the transceiver module, and the transceiver modules of different equipment can be in mutual communication.

The decentralized multi-device system comprises a plurality of devices, wherein the awakening words of at least two devices in the plurality of devices are the same.

The invention also discloses a decentralized multi-equipment system voice control method, which comprises the following steps:

s1 the device recognizes the awakening word WS in the voice;

s2, the device calculates the energy value PA of the voice signal received by itself, and packs the calculation result and the identified awakening word WS to send to all devices in the system, and meanwhile receives the data packet containing the awakening word and the energy value sent by other devices in the system;

s3, after the awakening words in all the data packets are identified, the awakening words matched with the equipment are screened out;

the screening range comprises a self-identified awakening word WS and a received awakening word WS;

s4, sorting the screened awakening words according to the energy value, executing the command if the voice signal energy value PA corresponding to the awakening word identified by the equipment with the highest ranking is obtained, and otherwise, terminating the process.

Preferably, the energy value is an audio signal energy value or a signal-to-noise ratio of the audio signal energy to the environmental noise.

Preferably, after the device identifies the wake word WS in the voice in S1, the device first compares whether the wake word WS matches itself, if yes, the subsequent steps are continued, otherwise, the subsequent steps are terminated.

According to the invention, the multi-device identification adopts a decentralized processing mode, so that centralized processing equipment is removed, the identification is converted into self-processing of the equipment, and complex calculation is decomposed into a plurality of devices to be carried out in parallel, the calculation speed is improved, the network bandwidth requirement is reduced, the response speed of the system is improved, and the experience of a user is improved.

Drawings

FIG. 1 is a schematic diagram of an exemplary embodiment of the present invention; the letters a1, a2, A3, B in fig. 1 denote different devices.

Fig. 2 shows a schematic diagram of the internal modules of an embodiment of the apparatus according to the present invention.

Fig. 3 is a flow chart illustrating a speech control method according to an embodiment of the present invention.

Detailed Description

The following provides a more detailed description of the present invention.

The decentralized device, the multi-device system and the voice control method thereof are mainly used for solving the problem of selecting the device for executing the command from the devices with the same awakening words selectively.

For example, in the scenario shown in fig. 1, devices a1, a2, A3, and B exist, where the wakeup words of devices a1, a2, and A3 are all a0, and the wakeup word of device B is B0, and after the user utters the wakeup word a0, all devices a1, a2, and A3 recognize the wakeup word, but generally, the true purpose of the user is to want the device a1 closest to the user to act.

The wake-up word is a special word for waking up the device in voice recognition, each device has a unique wake-up word, and in a voice recognition environment, the device firstly recognizes the wake-up word and verifies that the wake-up word is matched with the device, and then the device can continue to perform subsequent command recognition or command execution and other operations.

In order to solve the problems, the invention discloses decentralized equipment which comprises a voice recognition module, an energy value calculation module, an energy value screening and sorting module, a transmitting and receiving module and an execution module, wherein the energy value calculation module, the transmitting and receiving module and the execution module are all in signal connection with the energy value screening and sorting module, the energy value calculation module is also in signal connection with the voice recognition module and the transmitting and receiving module, and the transmitting and receiving modules of different equipment can be communicated with each other.

The system is applied to a system which is composed of a plurality of devices and has at least two devices with the same awakening words, such as a plurality of ceiling lamps with different positions in an auditorium, air conditioners in different rooms in the same house and the like.

The voice control method of the decentralized multi-device system comprises the following steps

S1 the device recognizes the awakening word WS in the voice;

The speech recognition process applying the present invention is illustrated by taking the application scenario of fig. 1 as an example as follows:

a user sends out a wake-up word A0, after the wake-up word is recognized by each device A1, A2, A3 and B in the system, wherein after the A1, A2 and A3 receive the wake-up word, the energy values of the voice signals received by the devices are calculated to be PA1, PA2 and PA3 respectively, and the recognized wake-up word and the energy values are packaged and sent through a transceiving module respectively.

The energy value is typically an audio signal energy value or a signal-to-noise ratio of the audio signal energy to the ambient noise, the energy value being proportional to the square of the amplitude of the waveform.

The transceiver module directly transmits the information after identification to the transceiver module of other equipment, the information is transmitted only by few bytes generally, an indoor short-distance wireless transmission mode such as Bluetooth and the like is adopted, and generally, no extra wire or larger bandwidth is adopted.

The communication between the transceiver modules of each device can be carried out in a wired or wireless mode, and for wireless short-distance application, the communication between the transceiver modules can be carried out in modes such as Bluetooth, WIFI, 2.4G wireless modules and the like.

The voice recognition method and the voice recognition device are based on the premise that the voice device is locally recognized, voice information does not need to be uploaded to a cloud end or other central servers for recognition and turnover, the voice recognition module of the device can utilize a CI1006 series voice recognition chip of the applicant's Qingtai Lung company to perform local recognition, and functions of energy value calculation, energy value screening and sorting and the like can also be realized in the CI1006 series voice recognition chip of the applicant's QingTai Lung company by utilizing writing software.

For the device B of which the self-awakening word is not A0, the device B can be calculated and sent as A1, A2 and A3, and whether the device B is matched with the device B or not can be firstly compared, if yes, the subsequent steps are continued, and if not, the subsequent steps are terminated, so that unnecessary power consumption is saved.

After the device a1 calculates the self-received wake word energy value PA1, it sorts the received wake word energy values PA2 and PA3 transmitted by the devices a2 and A3 in the energy value screening and sorting module, if the PA1 energy value is found to be the highest, the device a1 executes the command, otherwise, the process is not executed and terminated.

Among the devices a1, a2, A3, the device whose energy value recognized by the self is the highest executes the command.

For the condition of simultaneously receiving a plurality of different awakening words, after receiving various awakening words, the energy value screening and sorting module firstly screens the awakening words matched with the equipment, wherein the screening range comprises the awakening words WS identified by the energy value screening and sorting module and then sorts the awakening words.

For a user, when sending a command, the user generally wants to control a device closest to the user, the closer the device is, the less the audio signal is attenuated on a propagation path, the higher the received energy value is, and the device needing to execute the command is defined through energy value sequencing.

The foregoing is a description of preferred embodiments of the present invention, and the preferred embodiments in the preferred embodiments may be combined and combined in any combination, if not obviously contradictory or prerequisite to a certain preferred embodiment, and the specific parameters in the examples and the embodiments are only for the purpose of clearly illustrating the inventor's invention verification process and are not intended to limit the patent protection scope of the present invention, which is defined by the claims and the equivalent structural changes made by the content of the description of the present invention are also included in the protection scope of the present invention.

Claims

1. A decentralized device is characterized by comprising a voice recognition module, an energy value calculation module, an energy value screening and sorting module, a transceiver module and an execution module, wherein the energy value calculation module, the transceiver module and the execution module are in signal connection with the energy value screening and sorting module, the energy value calculation module is also in signal connection with the voice recognition module and the transceiver module, and the transceiver modules of different devices can be in mutual communication.

2. A decentralized multi-device system comprising a plurality of devices according to claim 1, wherein the wake-up words of at least two of said devices are the same.

3. A voice control method for a decentralized multi-device system is characterized by comprising the following steps:

s1 the device recognizes the awakening word WS in the voice;

4. The decentralized multi-device speech control method according to claim 3, wherein the energy value is an audio signal energy value or a signal-to-noise ratio of audio signal energy to ambient noise.

5. The decentralized multi-device voice control method according to claim 3, wherein the device in S1 recognizes the wake word WS in the voice and then first compares whether it matches itself, if so, continues the subsequent steps, otherwise, terminates the subsequent steps.