CN110782904A

CN110782904A - User account switching method of intelligent voice equipment

Info

Publication number: CN110782904A
Application number: CN201911083446.4A
Authority: CN
Inventors: 张成亮; 徐庭锐; 刘洋廷; 郝放; 简红美; 高玉东; 毕可骅; 王飞
Original assignee: Sichuan Changhong Electric Co Ltd
Current assignee: Sichuan Changhong Electric Co Ltd
Priority date: 2019-11-07
Filing date: 2019-11-07
Publication date: 2020-02-11

Abstract

The invention relates to the field of intelligent voice equipment, and discloses a user account switching method of intelligent voice equipment, which is used for solving the problem that the intelligent voice equipment is not fast enough in user account switching. Firstly, acquiring awakening word audio signals spoken by different users, converting the awakening word audio signals into digital signals, inputting the digital signals into an RNN (neural network), and clustering feature vectors of awakening words output by the RNN neural network; when a user switches accounts and speaks a wake-up word, the equipment collects an audio signal of the wake-up word, converts the audio signal into a digital signal, inputs the digital signal into the RNN neural network, calculates the distance between a feature vector of the current wake-up word output by the RNN neural network and each cluster center vector, and if the distance between the feature vector of the current wake-up word and the nearest cluster center vector does not exceed a threshold value, takes the nearest cluster center vector as the account of the current user, thereby switching the accounts. The method and the device are suitable for switching the user account of the intelligent voice equipment.

Description

User account switching method of intelligent voice equipment

Technical Field

The invention relates to the field of intelligent voice equipment, in particular to a user account switching method of intelligent voice equipment.

Background

With the vigorous development of the internet of things and the artificial intelligence technology, especially the progress of the voice recognition technology, the equipment becomes more and more intelligent, and a user can operate and control the equipment only through voice. For a scene that one device such as a household television and a sound box corresponds to a plurality of users, the device needs to store preference settings and related contents of different users, so that a quick and effective user account switching method is needed.

At present, account management is mainly carried out through a matched APP by intelligent voice equipment, the account management comprises registration and login and the like, so that user role switching is completed, and the switching mode is not direct and fast enough in the voice interaction era and influences user experience.

In addition, the mainstream intelligent voice equipment is activated by the wake-up word, that is, the user needs to speak the wake-up word first to activate the voice interaction function of the equipment, so that the problem of switching the user is most natural when the user starts with the wake-up word.

Disclosure of Invention

The technical problem to be solved by the invention is as follows: the user account switching method of the intelligent voice equipment is used for solving the problem that the intelligent voice equipment is not fast enough in user account switching.

In order to solve the problems, the invention adopts the technical scheme that: the user account switching method of the intelligent voice equipment is characterized by comprising the following steps

Step 1: collecting awakening word audio signals spoken by different users, and converting the word audio signals into digital signals with fixed length;

step 2: inputting all collected voice digital signals into an RNN (neural network), outputting a feature vector of a wake word by the RNN, and clustering all the feature vectors by a clustering algorithm, wherein a clustering center vector is used as an account identifier;

and step 3: when a user switches accounts and speaks a wake-up word, the equipment collects a wake-up word audio signal and converts the word audio signal into a digital signal with a fixed length;

and 4, step 4: and (3) inputting the digital signals in the step (3) into the RNN neural network which is the same as that in the step (2), outputting the feature vector of the current awakening word by the RNN neural network, calculating the distance between the feature vector of the current awakening word and each cluster center vector, and if the distance between the feature vector of the current awakening word and the nearest cluster center vector does not exceed a threshold value, taking the nearest cluster center vector as the account of the current user, thereby switching the accounts.

Furthermore, in order to facilitate the increase and decrease management of the user account, the invention can also comprise the following steps:

if the distance between the feature vector of the word awakened currently and the nearest cluster center vector exceeds a threshold value, taking the feature vector as a new cluster center;

and if the sample size belonging to a certain clustering center is still less than the set sample size threshold value after the number of the continuously acquired awakening word audio signals exceeds the set sample increment, removing the clustering center and directly discarding the corresponding sample data.

Further, for reasonable clustering, the sample increment may be 100, and the sample size threshold may be 28.

Further, the digital signal may be a binary number of 128 bits.

Further, the RNN neural network is an LSTM neural network.

Further, for reasonable setting, the threshold value may be 0.6.

The invention has the beneficial effects that: in the invention, the user can quickly complete the switching of the user account only by speaking the corresponding awakening word, thereby improving the user experience.

Drawings

Fig. 1 is a flowchart of switching user accounts according to an embodiment.

Detailed Description

In order to overcome the disadvantage that the user account switching of the intelligent voice device is not fast enough, a user account switching method of the intelligent voice device is provided, as shown in fig. 1, the method comprises the following steps:

And 5: if the distance between the feature vector of the word awakened currently and the nearest cluster center vector exceeds a threshold value, taking the feature vector as a new cluster center;

The present invention will be specifically described below by way of examples.

The embodiment provides a user account switching method of intelligent voice equipment, which mainly comprises model training, model use and iterative training;

firstly, model training is carried out: firstly, acquiring awakening word audio signals spoken by different users, and converting the awakening word audio signals into 128-bit binary digital signals through sampling quantization coding; and then all the collected voice digital signals are input into an RNN neural network, the RNN neural network is a standard single-layer LSTM network, the number of hidden nodes is 128, an ADAM optimization algorithm is adopted, the momentum is 0.5, the initial learning rate is 0.0002, the attenuation is half of every iteration for 50 times, 32 voice digital signals are simultaneously input into every training as a batch, the loss function is the distance between the feature vector output by the LSTM network and the clustering center vector, and the convergence condition is that the iteration times reach 600 times or the error of the loss function is lower than 0.6.

All the characteristic vectors output by the RNN neural network are clustered through a K-means clustering algorithm, and the clustering center vector is used as an account identifier.

The initial value of the category number K of the K-means clustering algorithm is 2, the convergence condition is that the mean square error value is less than 0.8 or the category distribution result of any sample point is not changed, and the distance calculation formula of the characteristic vector and the clustering center vector is Euclidean distance.

Then entering a model using stage: after a user speaks a wakeup word, the equipment collects an audio signal of the wakeup word and converts the audio signal into a 128-bit binary digital signal through sampling quantization coding; and inputting the characteristic vectors into an RNN neural network, calculating the distance between the output characteristic vectors and each clustering center vector, wherein the distance between the output characteristic vectors and the nearest clustering center vector does not exceed a threshold value of 0.6, namely, the output characteristic vectors are used as the account of the current user, and therefore, the account is switched.

And finally, carrying out iterative training of the model: if the distance between the feature vector of the awakening word of a certain user and the nearest clustering center vector exceeds a threshold value of 0.6, taking the feature vector as a new clustering center; and if the number of the continuously acquired awakening word audio signals exceeds the set sample increment of 100, and the sample size belonging to a certain clustering center is still less than the set sample size threshold value of 28, removing the clustering center and directly discarding the corresponding sample data.

Claims

1. A user account switching method of intelligent voice equipment is characterized by comprising the following steps:

2. The method for switching the user account of the intelligent voice device according to claim 1, further comprising the step 5:

3. The method for switching the user account of the intelligent voice device according to claim 2, wherein the sample increment is 100, and the sample size threshold is 28.

4. The method for switching the user account of the intelligent voice device according to claim 1, wherein the digital signal is a 128-bit binary number.

5. The method of claim 1, wherein the RNN neural network is an LSTM neural network.

6. The method for switching the user account of the intelligent voice device according to claim 1, wherein the threshold is 0.6.