CN112233670A

CN112233670A - Voice interaction method and system based on Alexa cloud service

Info

Publication number: CN112233670A
Application number: CN202010885996.4A
Authority: CN
Inventors: 何志宏; 高裘生
Original assignee: Fuzhou Zhixiang Information Technology Co., Ltd.
Current assignee: Fuzhou Zhixiang Information Technology Co., Ltd.
Priority date: 2020-08-28
Filing date: 2020-08-28
Publication date: 2021-01-15

Abstract

The invention provides a voice interaction method and system based on the Alexa cloud service, in the technical field of smart speakers. The method comprises the following steps: step S10, setting a wake word for the speaker, the indicator-light state corresponding to each execution instruction, the interface displayed in response to each execution instruction, and an activation duration; step S20, the speaker receives sound within its pickup range in real time and activates the Alexa voice assistant after verifying the received sound against the wake word; step S30, the speaker continuously receives voice instructions issued by the user within the activation duration, converts each voice instruction into an execution instruction, and inputs the execution instructions into the Alexa voice assistant in sequence; and step S40, the Alexa voice assistant executes each received execution instruction, controls the display screen to show the corresponding interface, controls the indicator light to show the corresponding state, maintains a persistent (long-lived) connection through the WebSocket protocol, and monitors the execution status of the execution instruction. The advantages of the invention are that the smart speaker stays persistently connected and displays a corresponding interface in response to instructions, which improves the user experience.

Description

Voice interaction method and system based on Alexa cloud service

Technical Field

The invention relates to the technical field of smart speakers, and in particular to a voice interaction method and system based on the Alexa cloud service.

Background

With the continuing progress of science and technology, smart speakers have gradually come into public view. A smart speaker can not only play music but also interact with the user by voice. However, if a task is interrupted during execution, for example by a network disconnection, a conventional smart speaker cannot resume the task, and it cannot display a corresponding interface during voice interaction, which results in a poor user experience.

Therefore, how to provide a voice interaction method and system based on the Alexa cloud service that keeps the smart speaker persistently connected and displays a corresponding interface in response to instructions, so as to improve the user experience, has become a problem in urgent need of a solution.

Disclosure of Invention

The technical problem to be solved by the invention is to provide a voice interaction method and system based on the Alexa cloud service that keeps the smart speaker persistently connected and displays a corresponding interface in response to instructions, thereby improving the user experience.

In one aspect, the invention provides a voice interaction method based on the Alexa cloud service, comprising the following steps:

step S10, setting a wake word for the speaker, the indicator-light state corresponding to each execution instruction, the interface displayed in response to each execution instruction, and an activation duration;

step S20, the speaker receives sound within its pickup range in real time and activates the Alexa voice assistant after verifying the received sound against the wake word;

step S30, the speaker continuously receives voice instructions issued by the user within the activation duration, converts each voice instruction into an execution instruction, and inputs the execution instructions into the Alexa voice assistant in sequence;

step S40, the Alexa voice assistant executes each received execution instruction, controls the display screen to show the corresponding interface, controls the indicator light to show the corresponding state, maintains a persistent connection through the WebSocket protocol, and monitors the execution status of the execution instruction.
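The settings made in step S10 can be pictured as a small configuration object. The following is a minimal sketch only: the class names, the light-state and interface names, the 30-second activation duration, and the fallback response are all illustrative assumptions and do not come from the patent.

```python
from dataclasses import dataclass, field


@dataclass(frozen=True)
class Response:
    """What the speaker shows while one execution instruction runs."""
    light_state: str   # hypothetical state name, e.g. "blue_pulse"
    interface: str     # hypothetical screen name, e.g. "music_player"


@dataclass
class SpeakerConfig:
    """Settings fixed in step S10 (field names are assumptions)."""
    wake_word: str
    activation_seconds: float
    responses: dict[str, Response] = field(default_factory=dict)

    def response_for(self, instruction: str) -> Response:
        # Assumed behavior: unmapped instructions get a neutral response.
        return self.responses.get(instruction, Response("idle", "home"))


config = SpeakerConfig(
    wake_word="alexa",
    activation_seconds=30.0,
    responses={
        "play_music": Response("blue_pulse", "music_player"),
        "set_timer": Response("green_solid", "timer"),
    },
)
```

With such a table in place, step S40 would only need a lookup such as `config.response_for("play_music")` to decide which interface and light state to show.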

Further, step S20 is specifically:

the speaker uses a sound pickup to receive sound within its pickup range in real time, uses a speech engine to convert the received sound into text in real time, and compares the converted text with the wake word; if they match, the Alexa voice assistant is activated; if not, the speaker continues to receive and recognize sound within the pickup range.
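The text comparison in step S20 might look like the sketch below. It assumes the speech engine has already produced a transcript string for each captured sound; the normalization rules (lowercasing, stripping punctuation) and all function names are assumptions made for illustration.

```python
import string


def normalize(text: str) -> str:
    """Lowercase, strip ASCII punctuation, and collapse whitespace."""
    table = str.maketrans("", "", string.punctuation)
    return " ".join(text.lower().translate(table).split())


def matches_wake_word(transcript: str, wake_word: str) -> bool:
    """Return True when the transcribed text equals the wake word."""
    return normalize(transcript) == normalize(wake_word)


def listen_until_woken(transcripts, wake_word: str) -> int:
    """Consume transcripts until one matches; return how many were heard.

    Returns -1 if the stream ends without a match, which mirrors the
    "keep receiving and recognizing" branch of step S20.
    """
    for count, transcript in enumerate(transcripts, start=1):
        if matches_wake_word(transcript, wake_word):
            return count
    return -1
```

The comparison here is an exact match after normalization, so a transcript that contains the wake word plus other words does not wake the assistant; a real wake-word detector would likely be more tolerant.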

Further, step S30 is specifically:

within the activation duration, the speaker uses the sound pickup to continuously receive voice instructions issued by the user, classifies the voice instructions using voiceprint recognition, uses a neural network to recognize the underlying intent of each classified voice instruction, and then converts the voice instructions into execution instructions and inputs them into the Alexa voice assistant in sequence.

Further, in step S30, the execution instruction includes an execution duration.
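The order of operations in step S30 (classify by voiceprint, recognize intent, convert, queue in sequence) can be sketched as follows. The voiceprint classifier and the neural intent recognizer are represented by injected callables, and the two toy functions are stand-ins that operate on text purely so the sketch runs; none of these names appear in the patent.

```python
from collections import deque
from dataclasses import dataclass
from typing import Callable, Deque, Iterable


@dataclass(frozen=True)
class ExecutionInstruction:
    user: str      # label assigned by voiceprint classification
    intent: str    # label produced by the intent recognizer
    text: str      # transcribed command text


def build_queue(
    utterances: Iterable[str],
    identify_user: Callable[[str], str],
    recognize_intent: Callable[[str], str],
) -> Deque[ExecutionInstruction]:
    """Classify each utterance by user, recognize its intent, and queue
    the resulting instructions in the order they were received."""
    queue: Deque[ExecutionInstruction] = deque()
    for text in utterances:
        user = identify_user(text)
        intent = recognize_intent(text)
        queue.append(ExecutionInstruction(user, intent, text))
    return queue


# Toy stand-ins; real models would take audio features, not text.
def toy_identify_user(text: str) -> str:
    return "user_a" if text.startswith("[A]") else "user_b"


def toy_recognize_intent(text: str) -> str:
    return "play_music" if "music" in text else "unknown"
```

Using a FIFO queue keeps the "in sequence" property: instructions reach the assistant in the same order the user spoke them.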

Further, in step S40, maintaining the persistent connection of the Alexa voice assistant through the WebSocket protocol specifically includes:

step S41, setting a heartbeat cycle and monitoring whether the execution instruction is interrupted within the execution duration; if so, proceeding to step S42; if not, returning to step S20;

step S42, using the WebSocket protocol to check, once per heartbeat cycle, whether the interruption has been resolved; if so, continuing to execute the execution instruction; if not, continuing to check once per heartbeat cycle.
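The control flow of steps S41 and S42 can be sketched as two small functions. The WebSocket probe itself is abstracted behind an `is_connected` callable, and the `step` callable that raises `ConnectionError` on a dropped link is likewise an assumption; the patent does not specify how the interruption is detected or how the probe is sent.

```python
import time
from typing import Callable


def wait_for_recovery(
    is_connected: Callable[[], bool],
    heartbeat_seconds: float,
    sleep: Callable[[float], None] = time.sleep,
) -> int:
    """Step S42: probe once per heartbeat cycle until the link is back.

    Returns the number of probes that were needed.
    """
    probes = 0
    while True:
        probes += 1
        if is_connected():
            return probes
        sleep(heartbeat_seconds)


def run_instruction(
    step: Callable[[], bool],
    is_connected: Callable[[], bool],
    heartbeat_seconds: float,
    sleep: Callable[[float], None] = time.sleep,
) -> None:
    """Step S41: run until done; on interruption, wait, then continue.

    `step` performs one unit of work and returns True when finished.
    It is assumed to raise ConnectionError when the link drops.
    """
    while True:
        try:
            if step():
                return
        except ConnectionError:
            wait_for_recovery(is_connected, heartbeat_seconds, sleep)
```

Passing `sleep` as a parameter keeps the loop testable without real delays; in a real client the probe would more likely be a WebSocket ping with a timeout.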

In another aspect, the invention provides a voice interaction system based on the Alexa cloud service, comprising the following modules:

a speaker initialization module, used for setting a wake word for the speaker, the indicator-light state corresponding to each execution instruction, the interface displayed in response to each execution instruction, and an activation duration;

an Alexa voice assistant activation module, used for the speaker to receive sound within its pickup range in real time and to activate the Alexa voice assistant after verifying the received sound against the wake word;

an instruction receiving module, used for the speaker to continuously receive voice instructions issued by the user within the activation duration, convert each voice instruction into an execution instruction, and input the execution instructions into the Alexa voice assistant in sequence;

an instruction execution module, used for the Alexa voice assistant to execute each received execution instruction, control the display screen to show the corresponding interface, control the indicator light to show the corresponding state, maintain a persistent connection through the WebSocket protocol, and monitor the execution status of the execution instruction.

Further, the Alexa voice assistant activation module specifically includes:

Further, the instruction receiving module specifically includes:

Further, in the instruction receiving module, the execution instruction includes an execution duration.

Further, in the instruction execution module, maintaining the persistent connection of the Alexa voice assistant through the WebSocket protocol specifically includes:

an interruption monitoring unit, used for setting a heartbeat cycle and monitoring whether the execution instruction is interrupted within the execution duration; if so, control passes to the heartbeat testing unit; if not, control passes to the Alexa voice assistant activation module;

a heartbeat testing unit, used for checking through the WebSocket protocol, once per heartbeat cycle, whether the interruption has been resolved; if so, the execution instruction continues to be executed; if not, the unit continues to check once per heartbeat cycle.

The advantages of the invention are as follows:

1. A persistent connection of the Alexa voice assistant is maintained through the WebSocket protocol and the execution status of each execution instruction is monitored. When an execution instruction is interrupted within its execution duration, a heartbeat test is performed once per heartbeat cycle, and the instruction continues to be executed after the interruption is resolved, so the smart speaker stays persistently connected. Because an interface is set for each execution instruction, the Alexa voice assistant switches the display screen to the corresponding interface when it receives the instruction, so the smart speaker responds with an interface, which greatly improves the user experience.

2. Using the Alexa voice assistant greatly improves the accuracy of English speech recognition.

3. Because an activation duration is set, the user can issue voice instructions continuously within that duration after waking the Alexa voice assistant, without waking it again before every instruction. The user can therefore interact with the speaker continuously, which greatly improves the user experience.

4. Classifying voice instructions by voiceprint recognition lets the speaker distinguish between users and apply per-user preferences. Taking music playback as an example: user A prefers rock music and user B prefers film and television hits. When the speaker receives a voice instruction to play music and voiceprint recognition identifies user A as the one who issued it, the speaker plays rock music. This makes the speaker more intelligent and greatly improves the user experience.

5. Using a neural network to recognize the underlying intent of the classified voice instructions greatly improves the accuracy of voice instruction recognition.
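The per-user preference behavior described in advantage 4 amounts to a lookup keyed by the voiceprint label. A minimal sketch follows, assuming hypothetical user labels, a hypothetical `play_music` intent name, and a default genre that the patent does not mention.

```python
from typing import Optional

# Hypothetical per-user preferences keyed by the voiceprint label.
PREFERENCES = {
    "user_a": {"play_music": "rock"},
    "user_b": {"play_music": "film and television hits"},
}

# Assumed fallback when the user is unknown or has no preference.
DEFAULTS = {"play_music": "popular"}


def resolve(intent: str, user: Optional[str]) -> Optional[str]:
    """Pick the user's preferred option for an intent, else the default."""
    if user is not None:
        preferred = PREFERENCES.get(user, {}).get(intent)
        if preferred is not None:
            return preferred
    return DEFAULTS.get(intent)
```

For example, `resolve("play_music", "user_a")` selects rock, while an unrecognized voice falls back to the assumed default.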

Drawings

The invention is further described below through embodiments with reference to the accompanying drawings.

Fig. 1 is a flowchart of the voice interaction method based on the Alexa cloud service according to the invention.

Fig. 2 is a schematic structural diagram of the voice interaction system based on the Alexa cloud service according to the invention.

Detailed Description

The general idea of the technical solution in the embodiments of the application is as follows: a persistent connection of the Alexa voice assistant is maintained through the WebSocket protocol; when an execution instruction is interrupted within its execution duration, a heartbeat test is performed once per heartbeat cycle, and the instruction continues to be executed after the interruption is resolved; and because an interface is set for each execution instruction, the Alexa voice assistant switches the display screen to the corresponding interface when it receives the instruction. The smart speaker thus stays persistently connected and responds with an interface, which improves the user experience.

The smart speaker used by the invention is provided with a display screen, an indicator light, a sound pickup, and a wireless communication module. The display screen displays the interface corresponding to the execution instruction; the indicator light shows different states to inform the user of the speaker's current operating status; the sound pickup captures the sound made by the user; and the wireless communication module connects to and interacts with a server or other smart devices.

Referring to Figs. 1 and 2, a preferred embodiment of the voice interaction method based on the Alexa cloud service according to the invention includes the following steps:

Step S10, setting a wake word for the speaker, the indicator-light state corresponding to each execution instruction, the interface displayed in response to each execution instruction, and an activation duration. Because an interface is set for each execution instruction, the Alexa voice assistant switches the display screen to the corresponding interface when it receives the instruction, so the smart speaker responds with an interface, which greatly improves the user experience.

Step S20, the speaker receives sound within its pickup range in real time and activates the Alexa voice assistant after verifying the received sound against the wake word. Using the Alexa voice assistant greatly improves the accuracy of English speech recognition.

Step S30, the speaker continuously receives voice instructions issued by the user within the activation duration, converts each voice instruction into an execution instruction, and inputs the execution instructions into the Alexa voice assistant in sequence. Because an activation duration is set, the user can issue voice instructions continuously within that duration after waking the Alexa voice assistant, without waking it again before every instruction, so the user can interact with the speaker continuously, which greatly improves the user experience.

Step S40, the Alexa voice assistant executes each received execution instruction, controls the display screen to show the corresponding interface, controls the indicator light to show the corresponding state, maintains a persistent connection through the WebSocket protocol, and monitors the execution status of the execution instruction. When an execution instruction is interrupted within its execution duration, a heartbeat test is performed once per heartbeat cycle, and the instruction continues to be executed after the interruption is resolved, so the smart speaker stays persistently connected.
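The activation duration used in step S30 behaves like a time window that opens when the wake word is verified. The sketch below assumes a fixed window measured from the moment of waking; whether each new instruction extends the window is not stated in the patent, so that choice, along with the class and method names, is an assumption.

```python
import time
from typing import Callable, Optional


class ActivationWindow:
    """Tracks whether the assistant is still active after a wake word."""

    def __init__(self, duration_seconds: float,
                 clock: Callable[[], float] = time.monotonic) -> None:
        self.duration_seconds = duration_seconds
        self._clock = clock
        self._woken_at: Optional[float] = None

    def wake(self) -> None:
        """Call when the wake word has been verified (step S20)."""
        self._woken_at = self._clock()

    def is_active(self) -> bool:
        """True while commands may be issued without a new wake word."""
        if self._woken_at is None:
            return False
        return self._clock() - self._woken_at < self.duration_seconds
```

A caller would check `is_active()` before accepting each voice instruction and fall back to wake-word listening once it returns False.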

Step S20 specifically includes:

Step S30 specifically includes:

within the activation duration, the speaker uses the sound pickup to continuously receive voice instructions issued by the user, classifies the voice instructions using voiceprint recognition, uses a neural network to recognize the underlying intent of each classified voice instruction, and then uses the speech engine to convert the voice instructions into execution instructions and inputs them into the Alexa voice assistant in sequence; the execution instruction is a precise text instruction. Classifying voice instructions by voiceprint recognition lets the speaker distinguish between users and apply per-user preferences. Taking music playback as an example: user A prefers rock music and user B prefers film and television hits. When the speaker receives a voice instruction to play music and voiceprint recognition identifies user A as the one who issued it, the speaker plays rock music. This makes the speaker more intelligent and greatly improves the user experience. Using a neural network to recognize the underlying intent of the classified voice instructions greatly improves the accuracy of voice instruction recognition.

In step S30, the execution instruction includes an execution duration. For example, if the instruction is to play music for half an hour, the execution duration of the execution instruction is half an hour.
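An execution instruction that carries its own duration could be modeled as below. This is only a sketch: the class name, the `elapsed` bookkeeping, and the `advance` helper are assumptions added to show how progress might be tracked so that execution can pick up where it left off.

```python
from dataclasses import dataclass
from datetime import timedelta


@dataclass
class TimedInstruction:
    """An execution instruction with an execution duration."""
    action: str
    duration: timedelta
    elapsed: timedelta = timedelta(0)

    def advance(self, delta: timedelta) -> None:
        """Record progress, never exceeding the execution duration."""
        self.elapsed = min(self.duration, self.elapsed + delta)

    @property
    def remaining(self) -> timedelta:
        return self.duration - self.elapsed

    @property
    def done(self) -> bool:
        return self.elapsed >= self.duration
```

For the half-hour music example, `TimedInstruction("play_music", timedelta(minutes=30))` reports 20 minutes remaining after `advance(timedelta(minutes=10))`.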

In step S40, maintaining the persistent connection of the Alexa voice assistant through the WebSocket protocol specifically includes:

For example, suppose the execution instruction is to play music for one hour and the heartbeat cycle is one minute. If playback is interrupted by a network problem after half an hour, the speaker checks once every minute whether the network has recovered, and once it has, playback continues until a full hour of music has been played.
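The one-hour example can be checked with a small minute-by-minute simulation. The length of the outage is not given in the text, so the five-minute outage used in the usage note is an assumption, as are the function and parameter names and the choice to start probing one heartbeat after the interruption.

```python
def simulate_playback(total_minutes: int, heartbeat_minutes: int,
                      outage_start: int, outage_minutes: int) -> dict:
    """Simulate playback with one outage, minute by minute.

    outage_start is in wall-clock minutes from the start; the link is
    down for outage_minutes after that. Recovery is only noticed at
    heartbeat checks, the first of which happens one heartbeat cycle
    after the interruption.
    """
    played = 0          # minutes of music actually played
    clock = 0           # wall-clock minutes elapsed
    checks = 0          # heartbeat probes performed
    outage_end = outage_start + outage_minutes

    while played < total_minutes:
        if outage_start <= clock < outage_end:
            # Interrupted: probe once per heartbeat until the link is back.
            while True:
                clock += heartbeat_minutes
                checks += 1
                if clock >= outage_end:
                    break
        else:
            played += 1
            clock += 1
    return {"played": played, "wall_clock": clock, "checks": checks}
```

With `simulate_playback(60, 1, 30, 5)`, the full 60 minutes are played, five heartbeat checks are made, and the wall-clock time is 65 minutes. A longer heartbeat cycle reduces the number of checks but can delay the resumption past the moment the network actually recovers.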

A preferred embodiment of the voice interaction system based on the Alexa cloud service according to the invention includes the following modules:

The speaker initialization module is used for setting a wake word for the speaker, the indicator-light state corresponding to each execution instruction, the interface displayed in response to each execution instruction, and an activation duration. Because an interface is set for each execution instruction, the Alexa voice assistant switches the display screen to the corresponding interface when it receives the instruction, so the smart speaker responds with an interface, which greatly improves the user experience.

The Alexa voice assistant activation module is used for the speaker to receive sound within its pickup range in real time and to activate the Alexa voice assistant after verifying the received sound against the wake word. Using the Alexa voice assistant greatly improves the accuracy of English speech recognition.

The instruction receiving module is used for the speaker to continuously receive voice instructions issued by the user within the activation duration, convert each voice instruction into an execution instruction, and input the execution instructions into the Alexa voice assistant in sequence. Because an activation duration is set, the user can issue voice instructions continuously within that duration after waking the Alexa voice assistant, without waking it again before every instruction, so the user can interact with the speaker continuously, which greatly improves the user experience.

The instruction execution module is used for the Alexa voice assistant to execute each received execution instruction, control the display screen to show the corresponding interface, control the indicator light to show the corresponding state, maintain a persistent connection through the WebSocket protocol, and monitor the execution status of the execution instruction. When an execution instruction is interrupted within its execution duration, a heartbeat test is performed once per heartbeat cycle, and the instruction continues to be executed after the interruption is resolved, so the smart speaker stays persistently connected.

The Alexa voice assistant activation module specifically includes:

The instruction receiving module specifically includes:

In the instruction receiving module, the execution instruction includes an execution duration. For example, if the instruction is to play music for half an hour, the execution duration of the execution instruction is half an hour.

In the instruction execution module, maintaining the persistent connection of the Alexa voice assistant through the WebSocket protocol specifically includes:

In summary, the invention has the advantages that:

Although specific embodiments of the invention have been described above, it will be understood by those skilled in the art that the specific embodiments described are illustrative only and are not limiting upon the scope of the invention, and that equivalent modifications and variations can be made by those skilled in the art without departing from the spirit of the invention, which is to be limited only by the appended claims.

Claims

1. A voice interaction method based on Alexa cloud service, characterized in that the method comprises the following steps:

2. The voice interaction method based on Alexa cloud service as claimed in claim 1, characterized in that step S20 specifically includes:

3. The voice interaction method based on Alexa cloud service as claimed in claim 1, characterized in that step S30 specifically includes:

4. The voice interaction method based on Alexa cloud service as claimed in claim 1, characterized in that, in step S30, the execution instruction includes an execution duration.

5. The voice interaction method based on Alexa cloud service as claimed in claim 1, characterized in that, in step S40, maintaining the long connection of the Alexa voice assistant through the WebSocket protocol specifically includes:

6. A voice interaction system based on Alexa cloud service, characterized in that the system comprises the following modules:

7. The voice interaction system based on Alexa cloud service as claimed in claim 6, characterized in that the Alexa voice assistant activation module specifically includes:

8. The voice interaction system based on Alexa cloud service as claimed in claim 6, characterized in that the instruction receiving module specifically includes:

9. The voice interaction system based on Alexa cloud service as claimed in claim 6, characterized in that, in the instruction receiving module, the execution instruction includes an execution duration.

10. The voice interaction system based on Alexa cloud service as claimed in claim 6, characterized in that, in the instruction execution module, maintaining the long connection of the Alexa voice assistant through the WebSocket protocol specifically includes: