CN109473110B

CN109473110B - Method, device and system for preventing voice interaction equipment from being awoken by mistake and using method

Info

Publication number: CN109473110B
Application number: CN201811642265.6A
Authority: CN
Inventors: 曾勋; 雷雄国; 雷玉雄; 刘寒英; 黄海艳; 程栋梁; 陈庆安
Original assignee: Sipic Technology Co Ltd
Current assignee: AI Speech Ltd
Priority date: 2018-12-29
Filing date: 2018-12-29
Publication date: 2022-01-21
Anticipated expiration: 2038-12-29
Also published as: CN109473110A

Abstract

The invention discloses a method for preventing a voice interaction device from being awoken by mistake. The method comprises: setting a master control device for the voice interaction device; when the voice interaction device acquires a voice wake-up instruction, sending an authorization request to the master control device and performing voice wake-up processing according to the authorization instruction returned by the master control device; wherein the master control device and the voice interaction device communicate over a one-to-one link. The invention also discloses a device and a system for preventing a voice interaction device from being awoken by mistake. The method, device and system address the prior-art problem that anyone within range can wake any smart device configured with the same wake-up word, which confuses interaction with the devices and degrades the user experience. Voiceprint recognition makes wake-up one-to-one between device and user and reduces the false wake-up rate.

Description

Method, device and system for preventing voice interaction equipment from being awoken by mistake and using method

Technical Field

The invention relates to the technical field of voice interaction, in particular to a method, a device, a system and a using method for preventing voice interaction equipment from being awoken by mistake.

Background

With the popularization of intelligent electronic products and the development of voice technology, more and more intelligent electronic products support voice-controlled interaction. Take the smart speaker as an example: it is currently controlled by voice, and when it is dormant it can be woken by speaking a preset wake-up word, after which it receives the user's voice instructions and interacts by voice. The problem is that products from the same vendor, or from the same product series, share the same wake-up word, so anyone who speaks that wake-up word within the wake-up range of a smart speaker can wake it. During development and testing, engineers often encounter the "one call, a hundred responses" situation, in which a single utterance wakes many devices at once. This causes considerable trouble and degrades the user experience.

Disclosure of Invention

To solve the above problems, the inventors propose applying voiceprint recognition to voice interaction devices such as smart speakers and smart story machines, so that a device is locked by voiceprint. Only a person whose voiceprint information has been registered for the device can then wake it and perform the corresponding voice operations. In households with children, the device can be locked with an adult's voiceprint, which avoids problems such as device failure caused by children waking it at random.

According to a first aspect of the present invention, there is provided a method for preventing a voice interaction device from being awoken by mistake, comprising the following steps:

setting a master control device for the voice interaction device;

when the voice interaction equipment acquires the voice awakening instruction, the voice interaction equipment carries out an authorization request to the main control equipment and carries out voice awakening processing according to the authorization instruction returned by the main control equipment;

the main control device and the voice interaction device are communicated through one-to-one link.

According to a second aspect of the present invention, there is provided an apparatus for preventing a voice interaction device from being mistakenly woken, comprising:

a voiceprint registration module, used for registering, binding and storing voiceprints for the voice interaction device;

the communication module is used for establishing one-to-one bidirectional communication connection with the voice interaction equipment;

and the authorization authentication module is used for generating an authorization instruction according to the received authorization request of the voice interaction equipment and the voiceprint stored by the voiceprint registration module and outputting the authorization instruction to the voice interaction equipment.

According to a third aspect of the present invention, there is provided a system for preventing a voice interaction device from being awoken by mistake, comprising at least two voice interaction devices and a main control device, wherein the main control device is the above control apparatus for preventing a voice interaction device from being awoken by mistake, and is in a one-to-one connection state with one of the voice interaction devices;

the voice interaction device comprises

a voice wake-up module, used for wake-up monitoring, for outputting the voice wake-up instruction to the authorization request module when a voice wake-up instruction is detected, and for performing voice wake-up processing;

and the authorization request module is used for outputting an authorization request to the main control equipment according to the voice awakening instruction and calling the voice awakening module to perform voice awakening processing according to the authorization instruction returned by the main control equipment.

According to a fourth aspect of the present invention, there is provided a method for implementing false wake-up prevention by using the above system, including the following steps:

performing voiceprint registration of voice interaction equipment on the main control equipment;

setting currently controlled voice interaction equipment on the main control equipment;

and when receiving the authorization request, the main control equipment acquires the voiceprint information, matches it against the registered voiceprint, and outputs an authorization instruction to the currently controlled voice interaction equipment according to the matching result.

The method, device and system provided by the invention address the prior-art problem that anyone within range can wake any smart device configured with the same wake-up word, which confuses interaction with the devices and degrades the user experience. The master control device performs voiceprint-based authorization control over the voice interaction device, which is equivalent to placing a lock on the voice interaction device: the device can be woken only when it receives the correct wake-up word while unlocked. Wake-up is thus one-to-one between device and user, the false wake-up rate is reduced, and the smart device is protected from being woken by other people.

Drawings

FIG. 1 is a flowchart illustrating a method for preventing a voice interaction device from being mistakenly awakened according to an embodiment of the present invention;

FIG. 2 is a schematic block diagram of an apparatus for preventing a voice interaction device from being awoken by mistake according to another embodiment of the present invention;

FIG. 3 is a block diagram of a system for preventing a voice interaction device from being awoken by mistake according to an embodiment of the present invention;

FIG. 4 is a flowchart of a method for preventing false wake-up using the system for preventing a voice interaction device from being awoken by mistake, according to an embodiment of the present invention.

Detailed Description

In order to make the objects, technical solutions and advantages of the embodiments of the present invention clearer, the technical solutions in the embodiments of the present invention will be clearly and completely described below with reference to the drawings in the embodiments of the present invention, and it is obvious that the described embodiments are some, but not all, embodiments of the present invention. All other embodiments, which can be derived by a person skilled in the art from the embodiments given herein without making any creative effort, shall fall within the protection scope of the present invention.

It should be noted that the embodiments and features of the embodiments in the present application may be combined with each other without conflict.

The invention may be described in the general context of computer-executable instructions, such as program modules, being executed by a computer. Generally, program modules include routines, programs, objects, components, data structures, etc. that perform particular tasks or implement particular abstract data types. The invention may also be practiced in distributed computing environments where tasks are performed by remote processing devices that are linked through a communications network. In a distributed computing environment, program modules may be located in both local and remote computer storage media including memory storage devices.

As used in this disclosure, "module," "device," "system," and the like are intended to refer to a computer-related entity, either hardware, a combination of hardware and software, or software in execution. In particular, for example, an element may be, but is not limited to being, a process running on a processor, an object, an executable, a thread of execution, a program, and/or a computer. Also, an application or script running on a server, or a server, may be an element. One or more elements may be in a process and/or thread of execution and an element may be localized on one computer and/or distributed between two or more computers and may be operated by various computer-readable media. The elements may also communicate by way of local and/or remote processes based on a signal having one or more data packets, e.g., from a data packet interacting with another element in a local system, distributed system, and/or across a network in the internet with other systems by way of the signal.

Finally, it should also be noted that, herein, relational terms such as first and second, and the like may be used solely to distinguish one entity or action from another entity or action without necessarily requiring or implying any actual such relationship or order between such entities or actions. Also, the terms "comprises," "comprising," or any other variation thereof, are intended to cover a non-exclusive inclusion, such that a process, method, article, or apparatus that comprises a list of elements does not include only those elements but may include other elements not expressly listed or inherent to such process, method, article, or apparatus. Without further limitation, an element defined by the phrase "comprising … …" does not exclude the presence of other identical elements in a process, method, article, or apparatus that comprises the element.

The method for preventing the voice interaction device from being awoken by mistake can be applied to any terminal device with a voice function, such as terminal devices of smart phones, tablet computers, smart homes and the like. The present invention will be described in further detail with reference to the accompanying drawings.

Fig. 1 schematically shows a flowchart of a method for preventing a voice interaction device from being mistakenly awakened according to an embodiment of the invention. As shown in fig. 1, the present embodiment includes the following steps:

Step S101: setting a master control device for the voice interaction device. The voice interaction device includes smart devices such as smart speakers, story machines and televisions. A voice interaction application, for example a voice assistant provided by a company, is installed on the smart device so that it can perform voice interaction. The master control device is implemented as an intelligent terminal on which an APP for controlling the voice interaction device is installed, for example a smartphone, and it can control any voice interaction device paired with it. The master control device and the voice interaction device communicate over a one-to-one link, for example a Bluetooth pairing connection, which ensures that one master control device can authorize only one voice interaction device at a time, that is, that the controlled voice interaction device is unique.

Step S102: when the voice interaction device acquires a voice wake-up instruction, the voice interaction device sends an authorization request to the master control device and performs voice wake-up processing according to the authorization instruction returned by the master control device.

Illustratively, the voice interaction device picks up the user's voice wake-up instruction through an audio acquisition component such as a microphone array; for example, it captures a user utterance such as "hello, little delay" or "i want to start". On acquiring the voice wake-up instruction, the voice interaction device sends either the instruction itself or the voiceprint information of the instruction to the master control device as an authorization request. The voiceprint information is derived from the user's voice wake-up instruction and includes characteristics such as its tone and volume; extracting voiceprint information from a voice instruction can be implemented with reference to the prior art. The voice interaction device then performs wake-up response processing according to the authorization instruction returned by the master control device.

The authorization request sent by the voice interaction device may be directly an acquired voice wake-up instruction to be processed and identified by the main control device, or may be generated according to voiceprint information after the voice interaction device performs voiceprint extraction. The authorization request may further include information for identifying the voice interaction device, such as a device name of the voice interaction device.
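The two request forms described above can be modeled as a small data structure. The following is a minimal sketch, not the patented implementation: the class name `AuthorizationRequest`, its field names, and the use of a float sequence to represent a voiceprint are all assumptions made for illustration.

```python
from dataclasses import dataclass
from typing import Optional, Sequence


@dataclass
class AuthorizationRequest:
    """Hypothetical request sent from the voice interaction device."""

    device_name: str                              # identifies the requesting device
    wake_audio: Optional[bytes] = None            # raw voice wake-up instruction
    voiceprint: Optional[Sequence[float]] = None  # features extracted on the device

    def __post_init__(self):
        # The request must carry at least one of the two payload forms.
        if self.wake_audio is None and self.voiceprint is None:
            raise ValueError("request needs wake audio or a voiceprint")

    @property
    def needs_extraction(self) -> bool:
        # True when the master control device must extract the voiceprint itself.
        return self.voiceprint is None


raw_request = AuthorizationRequest("speaker", wake_audio=b"\x00\x01")
extracted_request = AuthorizationRequest("speaker", voiceprint=[0.1, 0.2])
```

In this sketch the master control device would inspect `needs_extraction` to decide whether to run voiceprint extraction before matching.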

The master control device is provided with a voiceprint database, which stores the personal information and voiceprint information recorded when a user registers a voiceprint; voiceprint registration can be implemented with reference to the prior art. When the master control device receives the authorization request, it obtains the voiceprint information from the request, or extracts it from the voice wake-up instruction contained in the request, and compares it with the voiceprint database. If the voiceprint information exists in the voiceprint database, that is, the comparison succeeds, the user's voice wake-up instruction is authorized and an authorization-success instruction, for example "success", is returned to the voice interaction device. To return the authorization instruction, the master control device identifies the voice interaction device with which it currently has a one-to-one connection, such as a Bluetooth connection, and sends the instruction over that connection. After the voice interaction device obtains the authorization instruction, its response function is triggered accordingly. Illustratively, if the authorization instruction is "success", the voice interaction device responds with a response phrase such as "hello, owner" or "hello, and yoda". The user then issues voice instructions to carry out voice interaction.

If the voiceprint information does not exist in the voiceprint database, that is, the comparison fails, the master control device does not authorize the user's voice wake-up instruction and returns an authorization-failure instruction to the voice interaction device. On receiving the authorization-failure instruction, the voice interaction device triggers its sleep instruction and remains dormant without making any response.
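The success and failure branches of the comparison can be sketched as follows. This is an illustrative sketch only: the description leaves voiceprint comparison to the prior art, so the cosine-similarity test, the 0.8 threshold, the fixed-length vector representation, the "fail" label, and all function names are assumptions. Only the "success" label comes from the description.

```python
import math


def cosine_similarity(a, b):
    """Cosine similarity of two equal-length feature vectors."""
    dot = sum(x * y for x, y in zip(a, b))
    norm_a = math.sqrt(sum(x * x for x in a))
    norm_b = math.sqrt(sum(y * y for y in b))
    if norm_a == 0 or norm_b == 0:
        return 0.0
    return dot / (norm_a * norm_b)


def authorize(voiceprint, database, threshold=0.8):
    """Master side: return "success" if any enrolled voiceprint matches."""
    for enrolled in database.values():
        if cosine_similarity(voiceprint, enrolled) >= threshold:
            return "success"
    return "fail"  # assumed label for the authorization-failure instruction


def handle_authorization(instruction):
    """Device side: respond on success, otherwise stay dormant."""
    return "respond" if instruction == "success" else "sleep"


# Hypothetical voiceprint database with one enrolled user.
database = {"owner": [1.0, 0.0, 0.0]}
owner_action = handle_authorization(authorize([0.9, 0.1, 0.0], database))
stranger_action = handle_authorization(authorize([0.0, 1.0, 0.0], database))
```

A real system would replace the similarity test with whatever speaker-verification method it uses; the control flow around it is what the sketch is meant to show.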

According to the method provided by the embodiment of the invention, the authorized control of the voice interaction device based on the voiceprint is realized by setting the main control device and registering the voiceprint in the main control device, so that the authorized voice interaction device can be awakened by the voice awakening instruction sent by the current user, other voice interaction devices can only be in a dormant state, and the problems that any intelligent voice interaction device provided with the awakening word can be awakened by anyone in the same range in the prior art, the interaction of the device is disordered, the user experience is influenced and the like can be solved. According to the technical scheme of the embodiment of the invention, one-to-one awakening of the equipment and the user is really realized through the voiceprint recognition technology, the false awakening rate is reduced, and the intelligent voice interaction equipment can be prevented from being awakened by other people.

Fig. 2 schematically shows a functional block diagram of a control device for preventing a voice interaction device from being awoken by mistake according to an embodiment of the present invention. As shown in fig. 2, the control device comprises a voiceprint registration module 201, a communication module 202 and an authorization authentication module 203.

The voiceprint registration module 201 is configured to register a bound voiceprint for the voice interaction device, and store the registered voiceprint information. Illustratively, the voice interaction device may be a smart device such as a smart speaker, story machine, and television. The way in which the voiceprint is registered can be implemented with reference to prior art voiceprint registration methods.

The communication module 202 is used for establishing a one-to-one bi-directional communication connection with a voice interaction device, illustratively implemented as a bluetooth-enabled communication module.

The authorization authentication module 203 is configured to generate an authorization instruction according to the received authorization request of the voice interaction device and the voiceprint stored in the voiceprint registration module 201, and output the authorization instruction to the voice interaction device.

Illustratively, in the master control device, that is, the control terminal on which the apparatus of this embodiment is installed, a voiceprint library is registered in advance for the voice interaction device, and the voiceprint library stores the voiceprint information registered by the user. When the master control device receives an authorization request from the voice interaction device, it outputs an authorization instruction to the currently connected voice interaction device according to the result of matching the voiceprint information against the voiceprint library. The authorization request contains either voiceprint information or voice information. When it contains voiceprint information, that information is matched directly against the voiceprint information stored by the voiceprint registration module 201; when it contains voice information, the voiceprint information is first extracted and then matched. The authorization instruction is output according to the matching result and is either an authorization-success instruction or an authorization-failure instruction. If authorization succeeds, the authorization-success instruction is returned to the voice interaction device and triggers its response function: the voice interaction device responds with a response phrase such as "hello, owner" or "hello, and sita", and the user then issues voice instructions to carry out voice interaction.

If authorization fails, an authorization-failure instruction is returned to the voice interaction device. This instruction triggers the sleep instruction of the voice interaction device, which remains dormant and makes no response. For the specific implementation of this part, refer to the method section above; it is not repeated here.

The device of this embodiment addresses the prior-art problem that anyone within range can wake any smart device configured with the same wake-up word, which confuses interaction with the devices and degrades the user experience. Voiceprint recognition makes wake-up one-to-one between device and user, reduces the false wake-up rate, and prevents the user's smart device from being woken by other people. In addition, the device can be installed on any intelligent terminal and supports multiple kinds of voice interaction devices, which makes it highly practical.

Fig. 3 schematically shows a block diagram of a system for preventing a voice interaction device from being awoken by mistake according to an embodiment of the invention. As shown in fig. 3, the system comprises at least two voice interaction devices 3 and a main control device 4, wherein the main control device 4 is the above control device for preventing a voice interaction device from being awoken by mistake, and the main control device 4 is in a one-to-one connection state with one of the voice interaction devices 3 through Bluetooth.
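The one-to-one connection state between the main control device and one of several voice interaction devices can be illustrated with a small sketch. The class `MasterDevice`, its method names, and the policy that a new connection replaces the previous one are assumptions for illustration; the description only requires that the currently controlled device be unique.

```python
class MasterDevice:
    """Hypothetical main control device that keeps at most one active link."""

    def __init__(self):
        self.linked_device = None  # name of the single linked device, if any

    def connect(self, device_name):
        # Assumed policy: a new link replaces the previous one, so the
        # controlled device is always unique.
        self.linked_device = device_name

    def send_authorization(self, instruction):
        # The instruction is delivered only to the currently linked device.
        if self.linked_device is None:
            return None
        return (self.linked_device, instruction)


master = MasterDevice()
master.connect("speaker")
master.connect("story-machine")  # replaces the link to "speaker"
delivery = master.send_authorization("success")
```

Because only one link exists at a time, any other device in range never receives an authorization instruction and therefore stays dormant.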

The voice interaction device 3 comprises a voice wake-up module 301 and an authorization request module 302. Wherein,

the voice wake-up module 301 is configured to perform wake-up monitoring and to output the voice wake-up instruction to the authorization request module 302 when a voice wake-up instruction is detected; it may be implemented as an audio acquisition device with a sound pickup function. The voice wake-up module 301 can also perform voice wake-up processing, that is, control the voice interaction device 3 to perform the corresponding voice interaction according to the voice wake-up word.

The authorization request module 302 is configured to output an authorization request to the main control device 4 according to the voice wake-up instruction, and call the voice wake-up module 301 to perform voice wake-up processing according to the authorization instruction returned by the main control device 4, where the processing of the authorization request may refer to the above-mentioned method part.

In specific implementations, the authorization request may contain voiceprint information or may contain voice information (i.e., the voice wake-up instruction). When the authorization request contains voiceprint information, voiceprint extraction must be performed on the voice interaction device. In this case, the authorization request module 302 includes a voiceprint recognition unit 3021, which performs voiceprint recognition on the voice wake-up instruction and outputs the voiceprint information; it may be implemented with reference to existing voiceprint recognition technology.
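The device-side behavior, with and without the voiceprint recognition unit, can be sketched as follows. This is a sketch under stated assumptions: the class `VoiceInteractionDevice`, the dictionary request format, the `send_to_master` callable standing in for the one-to-one link, and the toy extractor are all hypothetical.

```python
class VoiceInteractionDevice:
    """Hypothetical device combining modules 301 and 302 of the embodiment."""

    def __init__(self, name, send_to_master, extract_voiceprint=None):
        self.name = name
        self.send_to_master = send_to_master          # stands in for the one-to-one link
        self.extract_voiceprint = extract_voiceprint  # optional recognition unit (3021)
        self.awake = False

    def on_wake_instruction(self, audio):
        # Authorization request module: build the request in one of two forms.
        if self.extract_voiceprint is not None:
            request = {"device": self.name,
                       "voiceprint": self.extract_voiceprint(audio)}
        else:
            request = {"device": self.name, "audio": audio}
        instruction = self.send_to_master(request)
        # Wake only when the returned instruction grants authorization.
        self.awake = instruction == "success"
        return self.awake


received = []


def stub_master(request):
    # Stand-in for the main control device: records the request and authorizes it.
    received.append(request)
    return "success"


speaker = VoiceInteractionDevice(
    "speaker", stub_master,
    extract_voiceprint=lambda audio: [float(len(audio))],  # toy extractor
)
woke = speaker.on_wake_instruction(b"abc")
```

Leaving `extract_voiceprint` unset corresponds to the other form of the request, in which the raw voice wake-up instruction is forwarded and extraction happens on the main control device.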

According to the system of the embodiment, the problems that any intelligent device provided with the awakening word can be awakened by anyone within the same range in the prior art, interaction confusion among devices is caused, user experience is influenced and the like can be solved. The method really achieves one-to-one awakening of the equipment and the user through the voiceprint recognition technology, reduces the false awakening rate, and prevents the intelligent equipment of the user from being awakened by other people.

Fig. 4 schematically shows a flowchart of a method for preventing false wake-up using the above system for preventing a voice interaction device from being awoken by mistake, according to an embodiment of the present invention. As shown in fig. 4, this embodiment includes the following steps:

Step S401: performing voiceprint registration for the voice interaction device on the master control device. The user speaks into an audio acquisition component of the master control device, such as a microphone, to register a voiceprint; voiceprint registration can be implemented with reference to the prior art.

Step S402: setting the currently controlled voice interaction device on the master control device. Illustratively, the Bluetooth function is enabled on the master control device and used to pair and connect with the voice interaction device to be controlled; the voice interaction device that connects successfully to the master control device becomes the currently controlled voice interaction device.

Step S403: when receiving the authorization request, the master control device acquires the voiceprint information, matches it against the registered voiceprint, and outputs an authorization instruction to the currently controlled voice interaction device according to the matching result. The user issues a voice wake-up instruction; after receiving it, the master control device matches it against the registered voiceprint using voiceprint recognition and performs authorization according to the method of fig. 1, so as either to start the voice interaction function of the voice interaction device or to keep the voice interaction device dormant.
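Steps S401 to S403 can be traced end to end in a short sketch. Everything here is a simplified assumption: the class `Master`, its method names, and the use of a plain speaker ID in place of real voiceprint matching are illustrative only, and returning `None` models a device that has no link to the master control device and so receives no instruction.

```python
class Master:
    """Hypothetical master control device for the three-step flow."""

    def __init__(self):
        self.registered = set()  # S401: enrolled voiceprints (speaker IDs here)
        self.current = None      # S402: currently controlled device

    def register_voiceprint(self, speaker_id):
        self.registered.add(speaker_id)

    def set_current_device(self, device_name):
        self.current = device_name

    def on_request(self, device_name, speaker_id):
        # S403: only the currently controlled device gets an instruction.
        if device_name != self.current:
            return None
        return "success" if speaker_id in self.registered else "fail"


master = Master()
master.register_voiceprint("alice")   # S401
master.set_current_device("speaker")  # S402

# Both devices hear the same wake-up word spoken by the registered user.
owner_results = {name: master.on_request(name, "alice")
                 for name in ("speaker", "tv")}
# An unregistered person speaks the wake-up word near the controlled device.
stranger_result = master.on_request("speaker", "bob")
```

Only the controlled device is authorized for the registered user, and nobody else can wake it, which is the combination the embodiment relies on to avoid false wake-ups.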

With the method of this embodiment, the user can interact by voice freely and easily, and only the voice interaction device bound to the user's voiceprint is woken, which protects the user's privacy and exclusive use of the device. In addition, with the one-to-one setting the "one call, a hundred responses" kind of false wake-up almost never occurs, which greatly improves the user experience.

Through the above description of the embodiments, those skilled in the art will clearly understand that each embodiment can be implemented by software plus a general hardware platform, and certainly can also be implemented by hardware. Based on such understanding, the above technical solutions substantially or contributing to the related art may be embodied in the form of a software product, which may be stored in a computer-readable storage medium, such as ROM/RAM, magnetic disk, optical disk, etc., and includes instructions for causing a computer device (which may be a personal computer, a server, or a network device, etc.) to execute the method according to the embodiments or some parts of the embodiments.

Finally, it should be noted that: the above embodiments are only used to illustrate the technical solutions of the present application, and not to limit the same; although the present application has been described in detail with reference to the foregoing embodiments, it should be understood by those of ordinary skill in the art that: the technical solutions described in the foregoing embodiments may still be modified, or some technical features may be equivalently replaced; and such modifications or substitutions do not depart from the spirit and scope of the corresponding technical solutions in the embodiments of the present application.

Claims

1. The method for preventing the voice interaction equipment from being awoken by mistake is characterized by comprising

Setting a master control device for the voice interaction device, wherein the master control device is an intelligent terminal device provided with an app for controlling the voice interaction device;

the master control device communicates with the voice interaction device through a one-to-one link, and the main control device returns the authorization instruction to the voice interaction device, namely returns the authorization instruction to the voice interaction device which currently establishes the one-to-one link, so that the voice interaction device currently controlled by one master control device is ensured to be unique through the one-to-one link.

2. The method according to claim 1, wherein the voice interaction device performs an authorization request to the main control device when acquiring the voice wake-up instruction, and performing the voice wake-up processing according to the authorization instruction comprises

When the voice interaction device obtains a voice awakening instruction, outputting the voice awakening instruction or voiceprint information of the voice awakening instruction to a main control device for authorization request;

and the voice interaction equipment performs awakening response processing according to the received authorization instruction returned by the main control equipment.

3. The method of claim 2, wherein the voice interaction device comprises a smart speaker, a story machine, and a television.

4. The system for preventing the voice interaction equipment from being awoken by mistake is characterized by comprising at least two voice interaction equipment and a main control equipment, wherein the main control equipment comprises

the voice print registration module is used for registering and storing voice prints for the voice interaction equipment;

the authorization authentication module is used for generating an authorization instruction according to the received authorization request of the voice interaction equipment and the voiceprint stored by the voiceprint registration module and outputting the authorization instruction to the voice interaction equipment;

the voice interaction equipment comprises

The voice awakening module is used for awakening and monitoring, outputting the voice awakening instruction to the authorization request module when the voice awakening instruction is monitored, and performing voice awakening processing;

the authorization request module is used for outputting an authorization request to the main control equipment according to the voice awakening instruction and calling the voice awakening module to perform voice awakening processing according to the authorization instruction returned by the main control equipment;

the master control device and one of the voice interaction devices are in a one-to-one connection state, and the step of returning the authorization instruction to the voice interaction device by the master control device is to return the authorization instruction to the voice interaction device which currently establishes the one-to-one link, so that the voice interaction device currently controlled by one master control device is ensured to be unique through the one-to-one link.

5. The system of claim 4, wherein the authorization request includes voiceprint information,

the authorization request module comprises

And the voiceprint recognition unit is used for carrying out voiceprint recognition on the voice awakening instruction and outputting voiceprint information.

6. The system of claim 4, wherein the one-to-one connection is a Bluetooth connection.

7. Method for preventing false wake-up by using system of any one of claims 4 to 6, comprising

and when receiving the authorization request, the main control equipment acquires the voiceprint information to be matched with the registered voiceprint, and outputs an authorization instruction to the currently controlled voice interaction equipment according to a matching result.

8. The method according to claim 7, wherein the setting of the currently controlled voice interaction device on the master device is implemented by establishing a one-to-one connection with one of the voice interaction devices at the master device, and setting the currently connected voice interaction device as the currently controlled voice interaction device.