CN115294986B - Method for reducing false awakening of intelligent voice interaction - Google Patents


Info

Publication number: CN115294986B
Application number: CN202211228563.7A
Authority: CN (China)
Prior art keywords: data, sound, awakening, voice interaction, time
Legal status: Active
Inventor: 徐绍杰
Current assignee: Shenzhen Xinyi Network Technology Co., Ltd.
Original assignee: Shenzhen Xinyi Network Technology Co., Ltd.
Other languages: Chinese (zh)
Other versions: CN115294986A
Application filed by Shenzhen Xinyi Network Technology Co., Ltd.; priority to CN202211228563.7A; publication of application CN115294986A; application granted; publication of grant CN115294986B

Classifications

    • G PHYSICS
    • G10 MUSICAL INSTRUMENTS; ACOUSTICS
    • G10L SPEECH ANALYSIS TECHNIQUES OR SPEECH SYNTHESIS; SPEECH RECOGNITION; SPEECH OR VOICE PROCESSING TECHNIQUES; SPEECH OR AUDIO CODING OR DECODING
    • G10L15/00 Speech recognition
    • G10L15/22 Procedures used during a speech recognition process, e.g. man-machine dialogue
    • G PHYSICS
    • G06 COMPUTING; CALCULATING OR COUNTING
    • G06F ELECTRIC DIGITAL DATA PROCESSING
    • G06F3/00 Input arrangements for transferring data to be processed into a form capable of being handled by the computer; Output arrangements for transferring data from processing unit to output unit, e.g. interface arrangements
    • G06F3/01 Input arrangements or combined input and output arrangements for interaction between user and computer
    • G06F3/048 Interaction techniques based on graphical user interfaces [GUI]
    • G06F3/0484 Interaction techniques based on graphical user interfaces [GUI] for the control of specific functions or operations, e.g. selecting or manipulating an object, an image or a displayed text element, setting a parameter value or selecting a range
    • G PHYSICS
    • G06 COMPUTING; CALCULATING OR COUNTING
    • G06F ELECTRIC DIGITAL DATA PROCESSING
    • G06F9/00 Arrangements for program control, e.g. control units
    • G06F9/06 Arrangements for program control, e.g. control units using stored programs, i.e. using an internal store of processing equipment to receive or retain programs
    • G06F9/44 Arrangements for executing specific programs
    • G06F9/4401 Bootstrapping
    • G06F9/4418 Suspend and resume; Hibernate and awake
    • G PHYSICS
    • G10 MUSICAL INSTRUMENTS; ACOUSTICS
    • G10L SPEECH ANALYSIS TECHNIQUES OR SPEECH SYNTHESIS; SPEECH RECOGNITION; SPEECH OR VOICE PROCESSING TECHNIQUES; SPEECH OR AUDIO CODING OR DECODING
    • G10L25/00 Speech or voice analysis techniques not restricted to a single one of groups G10L15/00 - G10L21/00
    • G10L25/48 Speech or voice analysis techniques not restricted to a single one of groups G10L15/00 - G10L21/00 specially adapted for particular use
    • G10L25/51 Speech or voice analysis techniques not restricted to a single one of groups G10L15/00 - G10L21/00 specially adapted for particular use for comparison or discrimination
    • G PHYSICS
    • G10 MUSICAL INSTRUMENTS; ACOUSTICS
    • G10L SPEECH ANALYSIS TECHNIQUES OR SPEECH SYNTHESIS; SPEECH RECOGNITION; SPEECH OR VOICE PROCESSING TECHNIQUES; SPEECH OR AUDIO CODING OR DECODING
    • G10L15/00 Speech recognition
    • G10L15/22 Procedures used during a speech recognition process, e.g. man-machine dialogue
    • G10L2015/223 Execution procedure of a spoken command

Landscapes

  • Engineering & Computer Science (AREA)
  • Physics & Mathematics (AREA)
  • Theoretical Computer Science (AREA)
  • General Engineering & Computer Science (AREA)
  • Software Systems (AREA)
  • Human Computer Interaction (AREA)
  • Health & Medical Sciences (AREA)
  • Acoustics & Sound (AREA)
  • Multimedia (AREA)
  • Audiology, Speech & Language Pathology (AREA)
  • General Physics & Mathematics (AREA)
  • Computational Linguistics (AREA)
  • Computer Security & Cryptography (AREA)
  • Signal Processing (AREA)
  • Telephonic Communication Services (AREA)

Abstract

The invention belongs to the technical field of voice interaction and discloses a method for reducing false wake-up of intelligent voice interaction, comprising the following steps. A data analysis device receives x pieces of sound data y1 at the same moment, the sound data having been collected by intelligent voice interaction devices. Each device compares its n recorded voice fingerprint data with the collected sound data y, where each voice fingerprint datum comprises a pre-stored sound frequency and a pre-stored wake-up word; if they match, the sound data obtained are marked as y1 and sent to the data analysis device. A data set is established for the x pieces of sound data y1 and analyzed to determine the current wake-up target; a wake-up instruction is then generated and sent to the intelligent voice interaction device corresponding to that target, and that device wakes up according to the instruction. The intelligent voice interaction device nearest to the user thus interacts with the user, giving a better interaction effect, preventing several voice interaction devices in the same area from being awakened simultaneously, and effectively improving the interaction experience.

Description

Method for reducing false awakening of intelligent voice interaction
Technical Field
The invention belongs to the technical field of voice interaction, and particularly relates to a method for reducing false awakening of intelligent voice interaction.
Background
With the popularization of intelligent electronic products and the development of voice technology, more and more electronic products support voice control for interactive operation. For example, a smart speaker in a dormant state can be awakened by a preset wake-up word and then receive the user's voice commands for voice interaction. The current problem is that a product line from the same vendor usually shares the same wake-up word, so in a scene with several such products, anyone who says the wake-up word wakes every device within range, producing a "one call, a hundred answers" phenomenon that degrades the user experience.
The prior art addresses false wake-up by assigning different wake-up words to different smart speakers, but this undoubtedly increases the user's memory burden and turns the smart speaker into a burden rather than a convenience.
In view of this, the inventor of the present application devised a method for reducing false wake-up of intelligent voice interaction.
Disclosure of Invention
The invention aims to solve the above technical problem by providing a method for reducing false wake-up of intelligent voice interaction.
The technical scheme of the invention is as follows. A method for reducing false wake-up of intelligent voice interaction, applied to a data analysis device, comprises the following steps:
the data analysis device receives x pieces of sound data y1 at the same moment; the sound data are collected by intelligent voice interaction devices,
and each piece of sound data y comprises the sound frequency and the time at which the sound was collected. The n recorded voice fingerprint data are compared with the collected sound data y, where n is an integer greater than 1 and each voice fingerprint datum comprises a pre-stored sound frequency and a pre-stored wake-up word. If they match, the sound data obtained are marked as y1 and sent to the data analysis device; if not, nothing is sent. Here x is an integer greater than 0 and less than or equal to m, m being the number of intelligent voice interaction devices. A data set is established for the x pieces of sound data y1 and analyzed to determine the current wake-up target. After the target is determined, a wake-up instruction is generated and sent to the intelligent voice interaction device corresponding to it, and that device wakes up according to the instruction.
Preferably, the comparison between the collected sound data and the voice fingerprint data proceeds as follows:
words are extracted from the collected sound data; the sound frequency is compared with the pre-stored sound frequency, and the words are compared with the pre-stored wake-up word. If the sound frequency matches the pre-stored sound frequency and the words match the pre-stored wake-up word, the resulting sound data y1 are sent to the data analysis device; otherwise, nothing is sent.
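A minimal sketch of this per-device check is given below; the field names, data shapes, and the frequency tolerance are illustrative assumptions, not details taken from the patent:

```python
# Sketch of the device-side comparison between collected sound data and the
# recorded voice fingerprint data. Field names and the frequency tolerance
# are illustrative assumptions.

def matches_fingerprint(sound, fingerprints, freq_tolerance=20.0):
    """Return True when the collected sound matches any recorded fingerprint.

    sound        -- dict with 'frequency' (Hz) and 'words' (recognized text)
    fingerprints -- list of dicts with 'frequency' and 'wake_word'
    """
    for fp in fingerprints:
        freq_ok = abs(sound["frequency"] - fp["frequency"]) <= freq_tolerance
        word_ok = fp["wake_word"] in sound["words"]
        if freq_ok and word_ok:
            return True
    return False

fingerprints = [{"frequency": 210.0, "wake_word": "Classmate Xiao Wang"}]
sound = {"frequency": 205.0, "words": "Classmate Xiao Wang, play some music"}
print(matches_fingerprint(sound, fingerprints))  # True: forwarded as y1
```

Only sounds for which this check succeeds would be forwarded to the data analysis device as y1; everything else is dropped on the device itself.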
Preferably, a data set is established for the x pieces of sound data y1 and analyzed to determine the current wake-up target, the determination being as follows: time t1 is obtained from each of the x pieces of sound data y1, and the intelligent voice interaction device corresponding to the smallest time t1 in the data set is taken as the current wake-up target.
Preferably, the sound data further include a volume value z. If several pieces share the smallest time t1, the volume values z in the x pieces of sound data y1 are obtained, the sum of each such time t1 and its corresponding volume value z is recorded as an expression coefficient k, and the intelligent voice interaction device corresponding to the largest expression coefficient k1 in the data set is taken as the current wake-up target. If several pieces share the largest expression coefficient k1, one of them is chosen arbitrarily, and its intelligent voice interaction device is taken as the current wake-up target.
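Read literally, this selection rule picks the earliest collection time t1, breaks ties with the largest expression coefficient k = t1 + z, and breaks any remaining tie arbitrarily. A minimal sketch under that reading (the record fields are assumed names):

```python
import random

def choose_wake_target(records):
    """records: list of dicts with 'device', 't1' (collection time) and
    'z' (volume value). Returns the device chosen as this round's target."""
    t_min = min(r["t1"] for r in records)
    earliest = [r for r in records if r["t1"] == t_min]
    if len(earliest) == 1:
        return earliest[0]["device"]
    # Tie on t1: expression coefficient k = t1 + z, largest k wins.
    k_max = max(r["t1"] + r["z"] for r in earliest)
    best = [r for r in earliest if r["t1"] + r["z"] == k_max]
    return random.choice(best)["device"]  # remaining ties broken arbitrarily

reports = [
    {"device": "speaker-A", "t1": 10.02, "z": 62.0},
    {"device": "speaker-B", "t1": 10.02, "z": 55.0},
    {"device": "speaker-C", "t1": 10.05, "z": 70.0},
]
print(choose_wake_target(reports))  # speaker-A: same t1 as B, larger k
```

Note that summing a time with a volume mixes units; the sketch simply follows the patent's stated formula.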
Preferably, before the collected sound data are compared with the voice fingerprint data, the portion of the words matching the pre-stored wake-up word is marked as p1 with appearance time s1, and the content following that portion is marked as p2 with appearance time s2. The absolute value of s1 minus s2 is recorded as s, and whether to continue comparing the collected sound data with the voice fingerprint data is decided from the magnitude of s.
The absolute value s is compared with a time threshold. If s is less than or equal to the time threshold, the comparison between the collected sound data and the voice fingerprint data is not continued; if s is greater than the time threshold, the comparison continues.
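The timing gate can be sketched as follows; it assumes the recognizer supplies the appearance times s1 and s2, and the 0.5-second threshold is an illustrative assumption, not a value from the patent:

```python
def should_continue_comparison(s1, s2, time_threshold=0.5):
    """Return True when the fingerprint comparison should continue.

    s1 -- appearance time of the wake-word portion p1 (seconds)
    s2 -- appearance time of the following content p2 (seconds)
    A small gap (s <= threshold) suggests the wake word is part of a longer
    sentence about the device, so comparison stops and the device stays asleep.
    """
    s = abs(s1 - s2)
    return s > time_threshold

# Wake word immediately followed by more speech: likely a mention, not a command.
print(should_continue_comparison(3.0, 3.2))  # False
# A pause follows the wake word: treated as a genuine wake-up attempt.
print(should_continue_comparison(3.0, 4.1))  # True
```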
A method for reducing false wake-up of intelligent voice interaction, applied to an intelligent voice interaction device, comprises the following steps:
the intelligent voice interaction device collects sound data y, which comprise the sound frequency and the time at which the sound was collected. The n recorded voice fingerprint data are compared with the collected sound data y, where n is an integer greater than 1 and each voice fingerprint datum comprises a pre-stored sound frequency and a pre-stored wake-up word. If they match, the sound data obtained are marked as y1 and sent to the data analysis device; if not, nothing is sent. The data analysis device receives x pieces of sound data y1 at the same moment, x being an integer greater than 0 and less than or equal to m. A data set is established for the x pieces of sound data y1 and analyzed to determine the current wake-up target. After the target is determined, a wake-up instruction is generated and sent to the intelligent voice interaction device corresponding to it, and that device wakes up according to the instruction.
Preferably, the comparison between the collected sound data and the voice fingerprint data proceeds as follows:
words are extracted from the collected sound data; the sound frequency is compared with the pre-stored sound frequency, and the words are compared with the pre-stored wake-up word. If the sound frequency matches the pre-stored sound frequency and the words match the pre-stored wake-up word, the resulting sound data y1 are sent to the data analysis device; otherwise, nothing is sent.
Preferably, a data set is established for the x pieces of sound data y1 and analyzed to determine the current wake-up target, the determination being as follows: time t1 is obtained from each of the x pieces of sound data y1, and the intelligent voice interaction device corresponding to the smallest time t1 in the data set is taken as the current wake-up target.
Preferably, the sound data further include a volume value z. If several pieces share the smallest time t1, the volume values z in the x pieces of sound data y1 are obtained, the sum of each such time t1 and its corresponding volume value z is recorded as an expression coefficient k, and the intelligent voice interaction device corresponding to the largest expression coefficient k1 in the data set is taken as the current wake-up target. If several pieces share the largest expression coefficient k1, one of them is chosen arbitrarily, and its intelligent voice interaction device is taken as the current wake-up target.
Preferably, before the collected sound data are compared with the voice fingerprint data, the portion of the words matching the pre-stored wake-up word is marked as p1 with appearance time s1, and the content following that portion is marked as p2 with appearance time s2. The absolute value of s1 minus s2 is recorded as s, and whether to continue comparing the collected sound data with the voice fingerprint data is decided from the magnitude of s.
The absolute value s is compared with a time threshold. If s is less than or equal to the time threshold, the comparison between the collected sound data and the voice fingerprint data is not continued; if s is greater than the time threshold, the comparison continues.
Due to the adoption of the above technical scheme, the invention has the following beneficial effects:
(1) From the matching sound data received by several intelligent voice interaction devices, the collection time and volume value of the sound data are used to determine the voice interaction device nearest to the sound source, and that nearest device interacts with the user. The interaction effect is better, several voice interaction devices in the same area are prevented from being awakened simultaneously, and the interaction experience is effectively improved.
(2) When a user merely talks about the voice interaction device using its preset wake-up word, whether the device is awakened is decided from the appearance time of the pre-stored wake-up word and the appearance time of the content immediately following it, by computing the difference between the two times. This adds a wake-up condition to the voice interaction device, prevents it from being falsely awakened in such situations, increases its degree of intelligence, and improves the user experience.
Drawings
In order to more clearly illustrate the technical solutions of the embodiments of the present invention, the drawings required by the embodiments are briefly described below. It should be understood that the following drawings illustrate only some embodiments of the present invention and therefore should not be considered limiting of its scope; those skilled in the art can obtain other related drawings from these drawings without inventive effort.
Fig. 1 is a schematic diagram illustrating a method for reducing false wake-up of smart voice interaction according to an embodiment of the present invention.
Detailed Description
The technical solutions in the embodiments of the present invention will be described clearly and completely below, and it is obvious that the described embodiments are only a part of the embodiments of the present invention, and not all of the embodiments. All other embodiments, which can be derived by a person skilled in the art from the embodiments given herein without making any creative effort, shall fall within the protection scope of the present invention.
Example one
As shown in fig. 1, the method for reducing false wake-up of intelligent voice interaction according to this embodiment involves m intelligent voice interaction devices and a data analysis device, the m intelligent voice interaction devices being in signal connection with the data analysis device.
Each intelligent voice interaction device collects sound data y, which comprise the sound frequency and the time at which the sound was collected. The device records n voice fingerprint data, n being an integer greater than 1, entered in advance by users. Each voice fingerprint datum comprises a pre-stored sound frequency and a pre-stored wake-up word, the wake-up word being the phrase used to wake the smart device, such as "Classmate Xiao Wang".
The intelligent voice interaction device compares the collected sound data y with the voice fingerprint data; if they match, the sound data obtained this time are marked as y1 and sent to the data analysis device. If they do not match, nothing is sent.
The data analysis device receives x pieces of sound data y1 at the same moment, x being an integer greater than 0 and less than or equal to m. A data set is established for the x pieces of sound data y1 and analyzed to determine the current wake-up target; after the target is determined, a wake-up instruction is generated and sent to the intelligent voice interaction device corresponding to it.
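Putting the data-analysis side together, a toy sketch of receiving the simultaneous reports and waking exactly one device might look like this; the VoiceDevice class, field names, and tuple-key tie-break encoding are assumptions for illustration, not the patent's implementation:

```python
class VoiceDevice:
    """Toy stand-in for an intelligent voice interaction device (assumed API)."""
    def __init__(self, name):
        self.name = name
        self.awake = False
    def wake(self):
        self.awake = True

def dispatch_wake(devices, reports):
    """Pick one wake-up target from the x reports y1 received at the same
    moment and wake only that device; every other device stays dormant.
    Selection: smallest t1, ties broken by the largest k = t1 + z."""
    target = min(reports, key=lambda r: (r["t1"], -(r["t1"] + r["z"])))["device"]
    for d in devices:
        if d.name == target:
            d.wake()  # the wake-up instruction goes to this device only
    return target

a, b = VoiceDevice("A"), VoiceDevice("B")
winner = dispatch_wake([a, b], [
    {"device": "A", "t1": 9.98, "z": 60.0},
    {"device": "B", "t1": 9.99, "z": 70.0},
])
print(winner, a.awake, b.awake)  # A True False
```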
The comparison between the collected sound data and the voice fingerprint data proceeds as follows:
words are extracted from the collected sound data; the sound frequency is compared with the pre-stored sound frequency, and the words are compared with the pre-stored wake-up word. If the sound frequency matches the pre-stored sound frequency and the words match the pre-stored wake-up word, the resulting sound data y1 are sent to the data analysis device; otherwise, nothing is sent.
A data set is established for the x pieces of sound data y1 and analyzed to determine the current wake-up target, as follows: time t1 is obtained from each of the x pieces of sound data y1, and the intelligent voice interaction device corresponding to the smallest time t1 in the data set is taken as the current wake-up target. If several pieces share the smallest time t1, the volume values z in the x pieces of sound data y1 are obtained, the sum of each such time t1 and its corresponding volume value z is recorded as an expression coefficient k, and the intelligent voice interaction device corresponding to the largest expression coefficient k1 in the data set is taken as the current wake-up target. If several pieces share the largest expression coefficient k1, one of them is chosen arbitrarily, and its intelligent voice interaction device is taken as the current wake-up target.
From the matching sound data received by several intelligent voice interaction devices, the collection time and volume value of the sound data are used to determine the voice interaction device nearest to the sound source, and that nearest device interacts with the user. The interaction effect is better, several voice interaction devices in the same area are prevented from being awakened simultaneously, and the interaction experience is effectively improved.
In actual use, users often refer to the voice interaction device itself by its preset wake-up word when discussing their experience with family or friends; that is, "Classmate Xiao Wang" is used in place of the device. While such a conversation is going on, "Classmate Xiao Wang" appears in the sound data even though nobody intends to use the device, so the voice interaction device is easily awakened by mistake, which hurts the user experience.
To handle this situation, the design is further improved. Before the collected sound data are compared with the voice fingerprint data, the portion of the words matching the pre-stored wake-up word is marked as p1 with appearance time s1, the content following that portion is marked as p2 with appearance time s2, the absolute value of s1 minus s2 is recorded as s, and whether to continue comparing the collected sound data with the voice fingerprint data is decided from the magnitude of s.
The absolute value s is compared with a time threshold, which may be set by a user or a developer. If s is less than or equal to the time threshold, the comparison between the collected sound data and the voice fingerprint data is not continued; if s is greater than the time threshold, the comparison continues.
For example, in the sentence "The user experience of Classmate Xiao Wang is good", "Classmate Xiao Wang" is p1, and the remainder of the sentence is p2.
When a user merely talks about the voice interaction device using its preset wake-up word, whether the device is awakened is decided from the appearance time of the pre-stored wake-up word and the appearance time of the content immediately following it, by computing the difference between the two times. This adds a wake-up condition to the voice interaction device, prevents it from being falsely awakened in such situations, increases its degree of intelligence, and improves the user experience.
Example two
A method for reducing false wake-up of intelligent voice interaction involves m intelligent voice interaction devices and a data analysis device, the m intelligent voice interaction devices being in signal connection with the data analysis device.
The intelligent voice interaction device comprises a data acquisition module, a voice identity recording module and a verification module.
The data acquisition module collects sound data y, which comprise the sound frequency and the time at which the sound was collected.
The voice identity recording module records n voice fingerprint data, n being an integer greater than 1, entered in advance by users. Each voice fingerprint datum comprises a pre-stored sound frequency and a pre-stored wake-up word, the wake-up word being the phrase used to wake the smart device, such as "Classmate Xiao Wang".
The verification module compares the collected sound data y with the voice fingerprint data; if they match, the sound data obtained are marked as y1 and sent to the data analysis device. If they do not match, nothing is sent.
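The three modules of this embodiment could be arranged roughly as in the sketch below; the class, its method names, and the frequency tolerance are illustrative assumptions rather than the patent's specification:

```python
class SmartVoiceDevice:
    """Sketch of the second embodiment's device with its three modules:
    data acquisition, voice identity recording, and verification."""

    def __init__(self):
        self.fingerprints = []  # store used by the voice identity recording module

    # Voice identity recording module: n fingerprint entries recorded by users.
    def record_fingerprint(self, frequency, wake_word):
        self.fingerprints.append({"frequency": frequency, "wake_word": wake_word})

    # Data acquisition module: package a captured sound as sound data y.
    def acquire(self, frequency, words, timestamp):
        return {"frequency": frequency, "words": words, "time": timestamp}

    # Verification module: forward y as y1 only when it matches a fingerprint.
    def verify(self, sound, freq_tolerance=20.0):
        for fp in self.fingerprints:
            if (abs(sound["frequency"] - fp["frequency"]) <= freq_tolerance
                    and fp["wake_word"] in sound["words"]):
                return dict(sound, label="y1")  # sent to the data analysis device
        return None  # no match: nothing is sent

dev = SmartVoiceDevice()
dev.record_fingerprint(210.0, "Classmate Xiao Wang")
y = dev.acquire(208.0, "Classmate Xiao Wang, what time is it", 10.02)
print(dev.verify(y) is not None)  # True
```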
The data analysis device receives x pieces of sound data y1 at the same moment, x being an integer greater than 0 and less than or equal to m. A data set is established for the x pieces of sound data y1 and analyzed to determine the current wake-up target; after the target is determined, a wake-up instruction is generated and sent to the intelligent voice interaction device corresponding to it.
The comparison between the collected sound data and the voice fingerprint data proceeds as follows:
words are extracted from the collected sound data; the sound frequency is compared with the pre-stored sound frequency, and the words are compared with the pre-stored wake-up word. If the sound frequency matches the pre-stored sound frequency and the words match the pre-stored wake-up word, the resulting sound data y1 are sent to the data analysis device; otherwise, nothing is sent.
A data set is established for the x pieces of sound data y1 and analyzed to determine the current wake-up target, as follows: time t1 is obtained from each of the x pieces of sound data y1, and the intelligent voice interaction device corresponding to the smallest time t1 in the data set is taken as the current wake-up target. If several pieces share the smallest time t1, the volume values z in the x pieces of sound data y1 are obtained, the sum of each such time t1 and its corresponding volume value z is recorded as an expression coefficient k, and the intelligent voice interaction device corresponding to the largest expression coefficient k1 in the data set is taken as the current wake-up target. If several pieces share the largest expression coefficient k1, one of them is chosen arbitrarily, and its intelligent voice interaction device is taken as the current wake-up target.
Before the collected sound data are compared with the voice fingerprint data, the portion of the words matching the pre-stored wake-up word is marked as p1 with appearance time s1, the content following that portion is marked as p2 with appearance time s2, the absolute value of s1 minus s2 is recorded as s, and whether to continue comparing the collected sound data with the voice fingerprint data is decided from the magnitude of s.
The absolute value s is compared with a time threshold, which may be set by a user or a developer. If s is less than or equal to the time threshold, the comparison between the collected sound data and the voice fingerprint data is not continued; if s is greater than the time threshold, the comparison continues.
In the description herein, references to the description of "one embodiment," "an example," "a specific example," etc., mean that a particular feature, structure, material, or characteristic described in connection with the embodiment or example is included in at least one embodiment or example of the invention. In this specification, the schematic representations of the terms used above do not necessarily refer to the same embodiment or example. Furthermore, the particular features, structures, materials, or characteristics described may be combined in any suitable manner in any one or more embodiments or examples.
The preferred embodiments of the invention disclosed above are intended to be illustrative only. The preferred embodiments are not intended to be exhaustive or to limit the invention to the precise forms disclosed. Obviously, many modifications and variations are possible in light of the above teaching. The embodiments were chosen and described in order to best explain the principles of the invention and the practical application, to thereby enable others skilled in the art to best utilize the invention. The invention is limited only by the claims and their full scope and equivalents.

Claims (2)

1. A method for reducing false wake-up of intelligent voice interaction, applied to a data analysis device, characterized by comprising the following steps:
the data analysis device receives x pieces of sound data y1 at the same moment, the sound data y being collected by intelligent voice interaction devices, each piece of sound data y comprising the sound frequency and the time at which the sound was collected; the n recorded voice fingerprint data are compared with the collected sound data y, n being an integer greater than 1 and each voice fingerprint datum comprising a pre-stored sound frequency and a pre-stored wake-up word; if they match, the sound data obtained are marked as y1 and sent to the data analysis device, and if not, nothing is sent; x is an integer greater than 0 and less than or equal to m; a data set is established for the x pieces of sound data y1 and analyzed to determine the current wake-up target; after the target is determined, a wake-up instruction is generated and sent to the intelligent voice interaction device corresponding to it, and that device wakes up according to the instruction;
the comparison between the collected sound data and the voice fingerprint data proceeds as follows:
words are extracted from the collected sound data; the sound frequency is compared with the pre-stored sound frequency, and the words are compared with the pre-stored wake-up word; if the sound frequency matches the pre-stored sound frequency and the words match the pre-stored wake-up word, the resulting sound data y1 are sent to the data analysis device, and otherwise nothing is sent;
establishing a data set for the x sound data y1, analyzing the data set, and determining the awakening object, wherein the determination process comprises the following steps: acquiring time t1 in x pieces of sound data y1, and taking intelligent voice interaction equipment corresponding to the minimum time t1 in a data set as a current awakening object;
if the number of the time t1 is multiple, obtaining the volume values z in the x sound data y1, marking the sum of the volume values z corresponding to the time t1 and the time t1 as an expression coefficient k, and taking the intelligent voice interaction device corresponding to the maximum expression coefficient k1 in the data set as a current awakening object; if the number of the expression coefficients k1 is multiple, one expression coefficient k1 is determined arbitrarily, and the intelligent voice interaction device corresponding to the expression coefficient k1 is taken as the awakening object of the time;
before the collected sound data and the sound fingerprint data are compared and analyzed, marking the content matched with the pre-stored awakening word in the word as p1 and time s1, marking the part of the content matched with the pre-stored awakening word in the word as p2 and time s2, subtracting the absolute value of s2 from s1 as s, and judging whether to continue the comparison and analysis of the collected sound data and the sound fingerprint data or not according to the magnitude of the absolute value s;
comparing and analyzing the absolute value s with a time threshold, and if the absolute value s is less than or equal to the time threshold, not continuing to compare and analyze the acquired sound data with the sound fingerprint data; and if the absolute value s is larger than the time threshold, continuously comparing and analyzing the acquired sound data and the sound fingerprint data.
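The selection rules above (earliest collection time t1 wins; a tie is broken by the expression coefficient k1 = t1 + z; a remaining tie is broken arbitrarily) can be sketched as follows. This is an illustrative sketch only, not part of the claims; the function name `select_wake_object` and the record layout are hypothetical.

```python
# Illustrative sketch of the wake-object selection in claim 1.
# Each record is (device_id, t1, z): collection time t1 and volume value z.

def select_wake_object(records):
    """Return the device_id to wake, given [(device_id, t1, z), ...]."""
    # Step 1: the device with the smallest collection time t1 wins.
    min_t1 = min(t1 for _, t1, _ in records)
    candidates = [r for r in records if r[1] == min_t1]
    if len(candidates) == 1:
        return candidates[0][0]
    # Step 2: tie on t1 -- compute the expression coefficient k1 = t1 + z
    # and wake the device with the largest k1.
    max_k1 = max(t1 + z for _, t1, z in candidates)
    finalists = [r for r in candidates if r[1] + r[2] == max_k1]
    # Step 3: if k1 also ties, any one finalist may be chosen arbitrarily.
    return finalists[0][0]

# Devices B and C tie on the earliest time; C has the larger k1.
print(select_wake_object([("A", 10.2, 55), ("B", 10.0, 40), ("C", 10.0, 62)]))  # -> C
```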
2. A method for reducing false awakening of intelligent voice interaction, applied to an intelligent voice interaction device, characterized by comprising the following steps:
the intelligent voice interaction device collects sound data y, each piece of which comprises a sound frequency and the time at which the sound was collected; the n recorded sound fingerprint data are compared and analyzed against the collected sound data y, where n is an integer greater than 1 and each piece of sound fingerprint data comprises a pre-stored sound frequency and a pre-stored wake word; if they match, the sound data is sent to a data analysis device and marked as y1; if they do not match, it is not sent; the data analysis device receives x pieces of sound data y1 at the same moment, where x is an integer greater than 0 and less than or equal to m; a data set is established for the x pieces of sound data y1, the data set is analyzed, and the current wake-up object is determined; after the wake-up object is determined, a wake-up instruction is generated and sent to the intelligent voice interaction device corresponding to the wake-up object, and that device is woken according to the wake-up instruction;
the comparison and analysis of the collected sound data against the sound fingerprint data proceeds as follows:
the words in the collected sound data are acquired, the sound frequency is compared with the pre-stored sound frequency, and the words are compared with the pre-stored wake word; if the sound frequency matches the pre-stored sound frequency and the words match the pre-stored wake word, the resulting sound data y1 is sent to the data analysis device; otherwise, it is not sent;
establishing the data set for the x pieces of sound data y1, analyzing the data set, and determining the wake-up object proceeds as follows: the time t1 in each of the x pieces of sound data y1 is acquired, and the intelligent voice interaction device corresponding to the smallest time t1 in the data set is taken as the current wake-up object;
if several pieces share the smallest time t1, the volume values z in the x pieces of sound data y1 are acquired, the sum of each such time t1 and its corresponding volume value z is marked as an expression coefficient k1, and the intelligent voice interaction device corresponding to the largest expression coefficient k1 in the data set is taken as the current wake-up object; if several expression coefficients k1 are equal, one expression coefficient k1 is chosen arbitrarily, and the intelligent voice interaction device corresponding to it is taken as the current wake-up object;
before the collected sound data and the sound fingerprint data are compared and analyzed, the content in the words that fully matches the pre-stored wake word is marked as p1 with its time s1, and the content that partially matches the pre-stored wake word is marked as p2 with its time s2; the absolute value of s1 minus s2 is taken as s, and whether to continue the comparison and analysis of the collected sound data with the sound fingerprint data is judged according to the magnitude of s;
the absolute value s is compared with a time threshold; if s is less than or equal to the time threshold, the comparison and analysis of the collected sound data with the sound fingerprint data is not continued; if s is greater than the time threshold, the comparison and analysis of the collected sound data with the sound fingerprint data is continued.
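The device-side gating of claim 2 — the pre-check on s = |s1 − s2| followed by the fingerprint comparison — can be sketched as below. This is an illustrative sketch only, not part of the claims: the time threshold and frequency tolerance are assumed values, and all names (`should_send_y1`, `matches_fingerprint`) are hypothetical, since the patent does not define an API.

```python
# Illustrative device-side sketch of claim 2. Assumed constants:
TIME_THRESHOLD = 0.5   # assumed threshold on s = |s1 - s2|, in seconds
FREQ_TOLERANCE = 15.0  # assumed tolerance when matching the sound frequency, in Hz

def matches_fingerprint(freq, word, fingerprints):
    """True if both the sound frequency and the recognized word match one
    of the n recorded fingerprints (pre-stored frequency, wake word)."""
    return any(abs(freq - f) <= FREQ_TOLERANCE and word == w
               for f, w in fingerprints)

def should_send_y1(freq, word, s1, s2, fingerprints):
    """Gate the collected sound data: run the pre-check on the interval
    between the full match p1 (time s1) and the partial match p2 (time s2);
    only if it passes is the fingerprint comparison performed, and only
    matching data is sent on to the data analysis device as y1."""
    s = abs(s1 - s2)
    if s <= TIME_THRESHOLD:   # matches too close together: do not continue
        return False
    return matches_fingerprint(freq, word, fingerprints)
```

For example, with a single fingerprint `(220.0, "hello robot")`, a recognized "hello robot" at 225 Hz is forwarded when s exceeds the threshold, and suppressed when the two marked times fall within it.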
CN202211228563.7A 2022-10-09 2022-10-09 Method for reducing false awakening of intelligent voice interaction Active CN115294986B (en)

Priority Applications (1)

Application Number Priority Date Filing Date Title
CN202211228563.7A CN115294986B (en) 2022-10-09 2022-10-09 Method for reducing false awakening of intelligent voice interaction

Applications Claiming Priority (1)

Application Number Priority Date Filing Date Title
CN202211228563.7A CN115294986B (en) 2022-10-09 2022-10-09 Method for reducing false awakening of intelligent voice interaction

Publications (2)

Publication Number Publication Date
CN115294986A CN115294986A (en) 2022-11-04
CN115294986B true CN115294986B (en) 2023-01-10

Family

ID=83819266

Family Applications (1)

Application Number Title Priority Date Filing Date
CN202211228563.7A Active CN115294986B (en) 2022-10-09 2022-10-09 Method for reducing false awakening of intelligent voice interaction

Country Status (1)

Country Link
CN (1) CN115294986B (en)

Family Cites Families (5)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN114860187A (en) * 2018-01-03 2022-08-05 腾讯科技(深圳)有限公司 Intelligent voice equipment control method and device, computer equipment and storage medium
CN111128150A (en) * 2019-11-27 2020-05-08 云知声智能科技股份有限公司 Method and device for awakening intelligent voice equipment
US20220215835A1 (en) * 2021-01-06 2022-07-07 Comcast Cable Communications, Llc Evaluating user device activations
CN113345433B (en) * 2021-05-30 2023-03-14 重庆长安汽车股份有限公司 Voice interaction system outside vehicle
CN113948081A (en) * 2021-10-09 2022-01-18 珠海格力电器股份有限公司 Equipment awakening method and device, electronic equipment and storage medium

Also Published As

Publication number Publication date
CN115294986A (en) 2022-11-04

Similar Documents

Publication Publication Date Title
CN109346061B (en) Audio detection method, device and storage medium
CN108039988B (en) Equipment control processing method and device
CN107147618A (en) A kind of user registering method, device and electronic equipment
CN111968644B (en) Intelligent device awakening method and device and electronic device
CN111814570B (en) Face recognition method, system and storage medium based on dynamic threshold
CN113076903A (en) Target behavior detection method and system, computer equipment and machine readable medium
CN110211609A (en) A method of promoting speech recognition accuracy
CN110766074B (en) Method and device for testing identification qualification of abnormal grains in biological identification method
CN111126147B (en) Image processing method, device and electronic system
CN111626229A (en) Object management method, device, machine readable medium and equipment
CN110111311B (en) Image quality evaluation method and device
CN115294986B (en) Method for reducing false awakening of intelligent voice interaction
CN106874852A (en) A kind of device-fingerprint based on acceleration transducer is extracted and recognition methods
CN107103269A (en) One kind expression feedback method and intelligent robot
CN113160815A (en) Intelligent control method, device and equipment for voice awakening and storage medium
CN107154996B (en) Incoming call interception method and device, storage medium and terminal
CN107832690B (en) Face recognition method and related product
CN115731620A (en) Method for detecting counter attack and method for training counter attack detection model
EP3828888B1 (en) Method for recognizing at least one naturally emitted sound produced by a real-life sound source in an environment comprising at least one artificial sound source, corresponding apparatus, computer program product and computer-readable carrier medium
CN114240926A (en) Board card defect type identification method, device and equipment and readable storage medium
CN113704544A (en) Video classification method and device, electronic equipment and storage medium
CN112633037A (en) Object monitoring method and device, storage medium and electronic equipment
CN113539284B (en) Voice noise reduction method and device, computer equipment and storage medium
Fox et al. Robust multi-modal person identification with tolerance of facial expression
CN113569770B (en) Video detection method and device, electronic equipment and storage medium

Legal Events

Date Code Title Description
PB01 Publication
SE01 Entry into force of request for substantive examination
GR01 Patent grant