CN114283799A - Voice interaction method, device, equipment and storage medium - Google Patents

Info

Publication number
CN114283799A
Authority
CN
China
Prior art keywords
voice
rope skipping
user
module
text
Prior art date
Legal status (The legal status is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the status listed.)
Pending
Application number
CN202111394851.5A
Other languages
Chinese (zh)
Inventor
琚炜
Current Assignee (The listed assignees may be inaccurate. Google has not performed a legal analysis and makes no representation or warranty as to the accuracy of the list.)
Alibaba Damo Institute Hangzhou Technology Co Ltd
Original Assignee
Alibaba Damo Institute Hangzhou Technology Co Ltd
Priority date (The priority date is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the date listed.)
Filing date
Publication date
Application filed by Alibaba Damo Institute Hangzhou Technology Co Ltd filed Critical Alibaba Damo Institute Hangzhou Technology Co Ltd
Priority to CN202111394851.5A priority Critical patent/CN114283799A/en
Publication of CN114283799A publication Critical patent/CN114283799A/en
Pending legal-status Critical Current

Landscapes

  • User Interface Of Digital Computer (AREA)

Abstract

The disclosure relates to a voice interaction method, apparatus, device and storage medium. A rope skipping device obtains a first user voice collected by a voice acquisition module and performs voice recognition on it to obtain a corresponding first text; if the first text includes a target keyword, the rope skipping device is controlled to enter a working state. The rope skipping device then obtains a second user voice collected by the voice acquisition module, performs voice recognition on it to obtain a corresponding second text, executes the instruction corresponding to the second text, generates an instruction execution result, and can play that result through a voice playing module. The user can thus interact with the rope skipping device by voice, which improves the user experience and makes rope skipping considerably more engaging.

Description

Voice interaction method, device, equipment and storage medium
Technical Field
The present disclosure relates to the field of information technologies, and in particular, to a voice interaction method, apparatus, device, and storage medium.
Background
Rope skipping is a highly effective aerobic exercise, and a sport in which one or more people perform various jumping movements over a rope that is swung in a circle.
However, the inventor of the present application has found that existing skipping ropes offer little interaction with the user, which degrades the user experience.
Disclosure of Invention
In order to solve the above technical problem, or at least partially solve it, the present disclosure provides a voice interaction method, apparatus, device and storage medium that allow a user to interact with a rope skipping device by voice, thereby improving the user experience and making rope skipping considerably more engaging.
In a first aspect, the disclosed embodiments provide a voice interaction method, which is applied to a rope skipping apparatus, where the rope skipping apparatus includes a rope portion and a handle portion, the handle portion includes a voice acquisition module, a voice playing module, a rope skipping counting unit and a computing unit, and the computing unit is configured to execute the method, where the method includes:
acquiring first user voice acquired by the voice acquisition module, and performing voice recognition on the first user voice to obtain a first text corresponding to the first user voice;
if the first text comprises a target keyword, controlling the rope skipping equipment to enter a working state;
acquiring second user voice acquired by the voice acquisition module, and performing voice recognition on the second user voice to obtain a second text corresponding to the second user voice;
executing the instruction corresponding to the second text and generating an instruction execution result;
playing at least one of the following through the voice playing module: the instruction execution result, the rope skipping count and/or the rope skipping time counted by the rope skipping counting unit, and a corresponding first preset voice when the rope skipping count and/or the rope skipping time reach a preset target.
In a second aspect, the disclosed embodiments provide a voice interaction method, which is applied to an exercise device, where the exercise device includes a voice acquisition module, a voice playing module, an exercise amount counting unit, and a calculation unit, where the calculation unit is configured to execute the method, and the method includes:
acquiring first user voice acquired by the voice acquisition module, and performing voice recognition on the first user voice to obtain a first text corresponding to the first user voice;
if the first text comprises a target keyword, controlling the exercise device to enter a working state;
acquiring second user voice acquired by the voice acquisition module, and performing voice recognition on the second user voice to obtain a second text corresponding to the second user voice;
executing the instruction corresponding to the second text and generating an instruction execution result;
playing at least one of the following through the voice playing module: the instruction execution result, the exercise amount counted by the exercise amount counting unit, and a corresponding first preset voice when the exercise amount reaches a preset target.
In a third aspect, an embodiment of the present disclosure provides a voice interaction apparatus, where the apparatus is disposed on a rope skipping device, the rope skipping device includes a rope portion and a handle portion, the handle portion includes a voice collecting module, a voice playing module, a rope skipping counting unit, and the apparatus includes:
the acquisition module is used for acquiring the first user voice acquired by the voice acquisition module;
the recognition module is used for carrying out voice recognition on the first user voice to obtain a first text corresponding to the first user voice;
the control module is used for controlling the rope skipping equipment to enter a working state when the first text comprises a target keyword;
the acquisition module is further configured to: acquiring a second user voice acquired by the voice acquisition module;
the identification module is further configured to: performing voice recognition on the second user voice to obtain a second text corresponding to the second user voice;
the execution module is used for executing the instruction corresponding to the second text and generating an instruction execution result;
the control module is further configured to play at least one of the following through the voice playing module: the instruction execution result, the rope skipping count and/or the rope skipping time counted by the rope skipping counting unit, and a corresponding first preset voice when the rope skipping count and/or the rope skipping time reach a preset target.
In a fourth aspect, an embodiment of the present disclosure provides a voice interaction apparatus, where the apparatus is disposed in an exercise device, the exercise device includes a voice acquisition module, a voice playing module and an exercise amount counting unit, and the apparatus includes:
the acquisition module is used for acquiring the first user voice acquired by the voice acquisition module;
the recognition module is used for carrying out voice recognition on the first user voice to obtain a first text corresponding to the first user voice;
the control module is used for controlling the exercise device to enter a working state if the first text comprises the target keyword;
the acquisition module is further configured to: acquiring a second user voice acquired by the voice acquisition module;
the recognition module is further used for performing voice recognition on the second user voice to obtain a second text corresponding to the second user voice;
the execution module is used for executing the instruction corresponding to the second text and generating an instruction execution result;
the control module is further configured to play at least one of the following through the voice playing module: the instruction execution result, the exercise amount counted by the exercise amount counting unit, and a corresponding first preset voice when the exercise amount reaches a preset target.
In a fifth aspect, an embodiment of the present disclosure provides an electronic device, including:
a memory;
a processor; and
a computer program;
wherein the computer program is stored in the memory and configured to be executed by the processor to implement the method of the first or second aspect.
In a sixth aspect, the disclosed embodiments provide a computer-readable storage medium having a computer program stored thereon, the computer program being executed by a processor to implement the method of the first or second aspect.
According to the voice interaction method, apparatus, device and storage medium provided by the embodiments of the present disclosure, the rope skipping device obtains the first user voice collected by the voice acquisition module and performs voice recognition on it to obtain a corresponding first text; if the first text includes the target keyword, the rope skipping device is controlled to enter the working state. The rope skipping device then obtains the second user voice collected by the voice acquisition module, performs voice recognition on it to obtain a corresponding second text, executes the instruction corresponding to the second text, generates an instruction execution result, and can play that result through the voice playing module. The user can thus interact with the rope skipping device by voice, which improves the user experience and makes rope skipping considerably more engaging.
Drawings
The accompanying drawings, which are incorporated in and constitute a part of this specification, illustrate embodiments consistent with the present disclosure and together with the description, serve to explain the principles of the disclosure.
In order to more clearly illustrate the embodiments of the present disclosure or the technical solutions in the prior art, the drawings used in the description of the embodiments or the prior art are briefly described below. It is obvious to those skilled in the art that other drawings can be obtained from these drawings without inventive effort.
Fig. 1 is a flowchart of a voice interaction method provided by an embodiment of the present disclosure;
Fig. 2 is a schematic diagram of an application scenario provided by the embodiment of the present disclosure;
Fig. 3 is a schematic diagram of another application scenario provided by the embodiment of the present disclosure;
Fig. 4 is a flowchart of a voice interaction method according to another embodiment of the present disclosure;
Fig. 5 is a schematic diagram of another application scenario provided by the embodiment of the present disclosure;
Fig. 6 is a flowchart of a voice interaction method according to another embodiment of the present disclosure;
Fig. 7 is a flowchart of a voice interaction method according to another embodiment of the present disclosure;
Fig. 8 is a schematic structural diagram of a voice interaction apparatus according to an embodiment of the present disclosure;
Fig. 9 is a schematic structural diagram of a voice interaction apparatus according to another embodiment of the present disclosure;
Fig. 10 is a schematic structural diagram of an embodiment of an electronic device provided in the embodiment of the present disclosure.
Detailed Description
In order that the above objects, features and advantages of the present disclosure may be more clearly understood, aspects of the present disclosure will be further described below. It should be noted that the embodiments and features of the embodiments of the present disclosure may be combined with each other without conflict.
In the following description, numerous specific details are set forth in order to provide a thorough understanding of the present disclosure, but the present disclosure may be practiced in other ways than those described herein; it is to be understood that the embodiments disclosed in the specification are only a few embodiments of the present disclosure, and not all embodiments.
Rope skipping is a highly effective aerobic exercise, and a sport in which one or more people perform various jumping movements over a rope that is swung in a circle. However, existing skipping ropes offer little interaction with the user, which degrades the user experience. To solve this problem, embodiments of the present disclosure provide a voice interaction method, which is described below with reference to specific embodiments.
Fig. 1 is a flowchart of a voice interaction method provided in an embodiment of the present disclosure. The method is applied to a rope skipping device that comprises a rope portion and a handle portion. The handle portion comprises a voice acquisition module, a voice playing module, a rope skipping counting unit and a calculating unit, and the calculating unit is used for executing the method. Specifically, these modules and units may be arranged inside the handle portion or at the two ends of the handle portion. The rope skipping device shown in fig. 2 comprises a rope portion 21 and a handle portion 22, and the voice acquisition module, the voice playing module, the rope skipping counting unit and the calculating unit, which may be a processor, are disposed in the handle portion 22. As shown in fig. 3, the handle portion 22 includes a voice acquisition module 31, a voice playing module 32, a rope skipping counting unit 33 and a calculating unit 34, where the voice acquisition module 31, the voice playing module 32 and the rope skipping counting unit 33 are each connected to the calculating unit 34. In addition, the left and right handle portions 22 may each be provided with a voice acquisition module, a voice playing module, a rope skipping counting unit and a calculating unit. Alternatively, some of these modules or units may be disposed in the handle portion on one side and the rest in the handle portion on the other side, with the modules or units on the two sides connected to each other; for example, the rope portion may include a transmission line for transmitting signals, and the transmission line may connect the modules or units in the two handle portions.
In addition, the present embodiment does not limit the installation positions of the voice acquisition module, the voice playing module, the rope skipping counting unit and the calculating unit in the handle portion. For example, as shown in fig. 3, the voice acquisition module 31 is disposed at one end of the handle portion 22, the voice playing module 32 and the rope skipping counting unit 33 are disposed at the other end of the handle portion 22, and the calculating unit 34 is disposed at an intermediate position of the handle portion 22. The voice acquisition module may be a sound sensor, a sound collector, or a microphone, and the voice playing module may be a speaker. The voice interaction method according to the embodiment of the present disclosure may be specifically executed by a processor in the handle portion 22 of the rope skipping device. Specifically, the processor may perform the counting itself, or the processor may acquire the count from the rope skipping counting unit. Additionally, the processor may convert the rope skipping count into energy expenditure, such as the calories consumed by the user. The handle portion may further include a battery for powering the voice acquisition module, the voice playing module, the rope skipping counting unit, and the processor. Alternatively, the energy of the rope skipping motion itself may be harvested to power the modules or units. Further, the processor may perform its calculations offline, or, as shown in fig. 5, the handle portion may be provided with a communication interface through which the processor communicates with a remote server 41 or a user terminal 42; the specific communication protocol is not limited and may be, for example, a wireless communication protocol. The remote server 41 may be a cloud server. As shown in fig. 1, the method comprises the following specific steps:
S101, the first user voice collected by the voice acquisition module is obtained, voice recognition is carried out on the first user voice, and a first text corresponding to the first user voice is obtained.
For example, when the user does not use the rope skipping apparatus, the rope skipping apparatus may be in a standby state. To wake up the rope jump device, the user may speak a wake-up voice to the rope jump device, which may include a wake-up word, for example. Correspondingly, the voice acquisition module can acquire the awakening voice, and the awakening voice can be recorded as the first user voice. Further, the voice acquisition module can transmit the acquired awakening voice to the processor through a data bus or other modes, the processor can perform voice recognition on the awakening voice, the voice recognition process can include voice feature extraction, so that a text corresponding to the awakening voice is obtained, and the text is recorded as a first text.
S102, if the first text comprises the target keyword, controlling the rope skipping device to enter a working state.
For example, the processor may detect whether the target keyword is included in the first text, where the target keyword may specifically be the wake-up word described above. If the wake-up word is included in the first text, the processor may control the rope skipping device to enter the working state, i.e., the awake state. That is, the user may wake up the rope skipping device through voice wake-up technology, also known as keyword spotting (KWS), which recognizes a specified keyword in speech; here the keyword is the wake-up word described above.
In addition, when the rope skipping device enters the working state, the processor can play a prompt through the voice playing module, such as "The device is started", "What can I help you with?" or "Hi, would you like to set today's rope skipping target?".
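The wake-up logic of S101-S102 can be sketched as a small state transition on the recognized text. This is a minimal illustration only: the wake-up word `hello rope`, the `DeviceState` enum and the function name are assumptions, since the source does not name a wake-up word or any API, and speech recognition is assumed to have already produced the first text.

```python
from enum import Enum
from typing import Optional, Tuple


class DeviceState(Enum):
    STANDBY = "standby"
    WORKING = "working"


# Hypothetical wake-up word; the source does not specify one.
WAKE_WORD = "hello rope"


def handle_first_text(state: DeviceState, first_text: str) -> Tuple[DeviceState, Optional[str]]:
    """Return the new device state and an optional prompt to play (S102).

    The device leaves standby only when the recognized first text
    contains the wake-up word (the target keyword).
    """
    if state is DeviceState.STANDBY and WAKE_WORD in first_text.lower():
        return DeviceState.WORKING, "The device is started"
    return state, None
```

A substring test is the simplest possible keyword check; a real keyword spotting model would typically work on audio features rather than on recognized text.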
S103, second user voice collected by the voice collecting module is obtained, voice recognition is carried out on the second user voice, and a second text corresponding to the second user voice is obtained.
Specifically, after the rope skipping device enters the working state, the user may continue to send the instruction to the rope skipping device in a voice manner, for example, the user may speak a second user voice to the rope skipping device. Accordingly, the voice collection module can collect the second user voice. The voice acquisition module can transmit the acquired second user voice to the processor through a data bus or other modes, and the processor can perform voice recognition on the second user voice so as to obtain a second text corresponding to the second user voice.
S104, executing the instruction corresponding to the second text and generating an instruction execution result.
For example, the second user voice may be "increase volume", in which case the instruction corresponding to the second text is a volume control instruction that increases the volume. Specifically, the processor may execute the volume control instruction, that is, control the volume of the voice playing module to increase. After executing the volume control instruction, the processor may also generate an instruction execution result, for example "volume has been increased".
For another example, the second user voice may be "play music". In this case, the processor may control the voice playing module to play music specified by the user, or music that the user has stored in the cloud, and the music may serve as background music for the exercise.
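Step S104 amounts to mapping the recognized second text to an instruction and producing a result string that can later be spoken. The sketch below assumes exact-match commands, a hypothetical `Player` stand-in for the voice playing module and a 0-10 volume range; none of these details come from the source, and the "Playing music" and fallback replies are invented for illustration.

```python
from dataclasses import dataclass


@dataclass
class Player:
    """Stand-in for the voice playing module (volume range 0-10 is assumed)."""
    volume: int = 5
    playing_music: bool = False


def execute_instruction(second_text: str, player: Player) -> str:
    """Execute the instruction matching the second text and return the result text."""
    text = second_text.strip().lower()
    if text == "increase volume":
        # Volume control instruction: raise the volume, capped at the maximum.
        player.volume = min(player.volume + 1, 10)
        return "Volume has been increased"
    if text == "play music":
        player.playing_music = True
        return "Playing music"
    # Hypothetical fallback for text that matches no known instruction.
    return "Sorry, I did not understand"
```

The returned string plays the role of the instruction execution result, which S105 then hands to speech synthesis.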
S105, playing at least one of the following through the voice playing module: the instruction execution result, the rope skipping count and/or the rope skipping time counted by the rope skipping counting unit, and a corresponding first preset voice when the rope skipping count and/or the rope skipping time reach a preset target.
Specifically, the processor may play "volume has been increased" through the voice playing module. In some cases, the instruction execution result generated by the processor is text, and the processor may convert this text into speech through speech synthesis (text-to-speech, TTS) technology so that it can be played through the voice playing module.
Alternatively, in other cases, after the rope skipping device enters the working state, the user can simply start skipping. During rope skipping, the processor obtains the rope skipping count and/or the rope skipping time from the rope skipping counting unit and judges whether the rope skipping count and/or the rope skipping time has reached a preset target, where the preset target can be a preset numerical value. For example, when the rope skipping count reaches 100, or the user's rope skipping time reaches 10 minutes, the processor may control the voice playing module to play a first preset voice, which may be an encouraging phrase such as "100 completed" or "keep going".
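The preset-target check described above can be sketched as a pure function over the current count and elapsed time. The thresholds of 100 skips and 10 minutes are the example values from the text; the `already_played` flag and the exact wording of the phrases are assumptions added so that the prompt is announced only once.

```python
from typing import Optional

PRESET_COUNT = 100          # example value from the text
PRESET_SECONDS = 10 * 60    # example value from the text (10 minutes)


def first_preset_voice(count: int, elapsed_seconds: float, already_played: bool) -> Optional[str]:
    """Return the first preset voice to play, or None if nothing should be played."""
    if already_played:
        return None
    if count >= PRESET_COUNT:
        return "100 completed, keep going"
    if elapsed_seconds >= PRESET_SECONDS:
        return "10 minutes completed, keep going"
    return None
```

The caller would set `already_played` to true after the first non-None result.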
In addition, in some other embodiments, after the rope skipping device enters the working state, it can also proactively play prompts through the voice playing module, such as precautions for rope skipping and warm-up exercises to do beforehand.
In addition, the rope skipping device can be further provided with a display component, and the display component can display information such as time information, weather information, rope skipping counting, rope skipping time and rope skipping targets, so that a user can view the information through the display component.
It can be seen that voice interaction (speech interaction) between the user and the rope skipping device can be realized through S101-S105 as described above; that is, the user can interact with the rope skipping device by voice, and the voice interaction process can include voice recognition, voice synthesis, voice understanding, and the like. The rope skipping device can therefore become an Artificial Intelligence Internet of Things (AIoT) device with a voice interaction function, and can be used, for example, as rope skipping equipment for students' physical exercise.
In this embodiment, the rope skipping device obtains the first user voice collected by the voice acquisition module and performs voice recognition on it to obtain a corresponding first text; if the first text includes the target keyword, the rope skipping device is controlled to enter the working state. The rope skipping device then obtains the second user voice collected by the voice acquisition module, performs voice recognition on it to obtain a corresponding second text, executes the instruction corresponding to the second text, generates an instruction execution result, and can play that result through the voice playing module. The user can thus interact with the rope skipping device by voice, which improves the user experience and makes rope skipping considerably more engaging.
In addition, the voice interaction technology belongs to the artificial intelligence category, and has the characteristics of high technical maturity, high user acceptance degree, low requirement on hardware cost and the like, so that the voice interaction technology can be applied to many scenes.
In addition, the rope skipping device provided by the embodiment of the disclosure has a networking function. For example, a plurality of rope skipping devices can access the same network and issue a rope skipping instruction at the same time through the network, so that a plurality of users can hold a rope skipping competition. During the competition, the rope skipping devices count or time simultaneously, and the counts or times recorded by each device can be transmitted among the devices, so that each device can announce the current state of the competition by voice and encourage each user to compete actively. After the competition ends, each rope skipping device can also announce each user's result, ranking and the like by voice.
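Once the counts have been exchanged between devices, producing the ranking to announce is a simple sort. The sketch below is illustrative only: the function names, the tie-handling rule (equal counts share a rank) and the announcement wording are assumptions, and the network transport is left out entirely.

```python
from typing import Dict, List, Tuple


def rank_competitors(counts: Dict[str, int]) -> List[Tuple[int, str, int]]:
    """Return (rank, user, count) tuples, highest count first; ties share a rank."""
    ordered = sorted(counts.items(), key=lambda item: (-item[1], item[0]))
    ranking: List[Tuple[int, str, int]] = []
    previous_count = None
    rank = 0
    for position, (user, count) in enumerate(ordered, start=1):
        if count != previous_count:
            rank = position
            previous_count = count
        ranking.append((rank, user, count))
    return ranking


def announcement(ranking: List[Tuple[int, str, int]]) -> str:
    """Build the text that each device could pass to speech synthesis."""
    return "; ".join(f"rank {r}: {u} with {c} skips" for r, u, c in ranking)
```

Because every device receives the same counts, each one can compute the same ranking locally and announce it without a central coordinator.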
Fig. 4 is a flowchart of a voice interaction method according to another embodiment of the disclosure. The method comprises the following steps as shown in fig. 4:
S301, the first user voice collected by the voice acquisition module is obtained, voice recognition is carried out on the first user voice, and a first text corresponding to the first user voice is obtained.
Specifically, the implementation and principles of S301 are consistent with those of S101, and are not described herein again.
S302, performing voiceprint recognition on the first user voice to obtain a voiceprint corresponding to the first user voice.
For example, after the voice capture module transmits the wake-up voice captured by the voice capture module to the processor through a data bus or other means, the processor may also perform Voiceprint Recognition (VPR), which is a technology for recognizing the identity of the user through voice. The voiceprint corresponding to the awakening voice can be obtained through voiceprint recognition.
S303, if the first text comprises target keywords and the voiceprint corresponding to the first user voice is matched with a pre-stored target voiceprint, controlling the skipping rope equipment to enter a working state.
It will be appreciated that the skipping rope device may have a user voiceprint pre-stored therein, which may be noted as a target voiceprint. When the processor in the rope skipping device converts the awakening voice into a first text and recognizes the voiceprint corresponding to the awakening voice, whether the first text comprises the target keyword or not can be judged, and whether the voiceprint corresponding to the awakening voice is matched with the pre-stored target voiceprint or not can be judged. And if the first text comprises the target keyword and the voiceprint corresponding to the awakening voice is matched with the prestored target voiceprint, controlling the skipping rope equipment to enter a working state by the processor.
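The combined condition in S303 can be sketched as follows. The source does not say how voiceprints are represented or compared; this example assumes they are fixed-length embedding vectors compared by cosine similarity against a hypothetical threshold of 0.8, and reuses a hypothetical wake-up word.

```python
import math
from typing import Sequence

WAKE_WORD = "hello rope"   # hypothetical wake-up word
MATCH_THRESHOLD = 0.8      # hypothetical similarity threshold


def cosine_similarity(a: Sequence[float], b: Sequence[float]) -> float:
    """Cosine similarity of two equal-length vectors; 0.0 if either is all zeros."""
    dot = sum(x * y for x, y in zip(a, b))
    norm_a = math.sqrt(sum(x * x for x in a))
    norm_b = math.sqrt(sum(y * y for y in b))
    if norm_a == 0 or norm_b == 0:
        return 0.0
    return dot / (norm_a * norm_b)


def should_wake(first_text: str, voiceprint: Sequence[float],
                target_voiceprint: Sequence[float]) -> bool:
    """Wake only if the keyword is present AND the voiceprint matches (S303)."""
    keyword_ok = WAKE_WORD in first_text.lower()
    voiceprint_ok = cosine_similarity(voiceprint, target_voiceprint) >= MATCH_THRESHOLD
    return keyword_ok and voiceprint_ok
```

Requiring both conditions means that another user saying the correct wake-up word does not wake a rope that is registered to someone else.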
S304, second user voice collected by the voice collecting module is obtained, voice recognition is carried out on the second user voice, and a second text corresponding to the second user voice is obtained.
S305, executing the instruction corresponding to the second text, and generating an instruction execution result.
S306, playing at least one of the following through the voice playing module: the instruction execution result, the rope skipping count and/or the rope skipping time counted by the rope skipping counting unit, and a corresponding first preset voice when the rope skipping count and/or the rope skipping time reach a preset target.
Specifically, the implementation process and specific principles of S304-S306 are consistent with those of S103-S105, and are not described herein again.
In this embodiment, voiceprint recognition is performed on the first user voice to obtain the corresponding voiceprint, and the rope skipping device is controlled to enter the working state only when the first text includes the target keyword and the voiceprint corresponding to the first user voice matches the pre-stored target voiceprint. The user's identity can therefore be recognized from the voiceprint, which prevents users from taking or mixing up each other's skipping ropes by mistake when multiple users are skipping with multiple ropes.
In addition, it can be understood that whether the rope skipping device is in the working state or the standby state affects only its voice interaction function; for example, when the rope skipping device is in the working state, it can carry out normal voice interaction with the user. Neither state affects the basic function of the rope, namely being swung by the user so that the user can perform various jumping movements.
In a possible implementation manner, the instruction corresponding to the second text is used for setting a rope skipping target; after the instruction execution result is played through the voice playing module, the method further comprises: obtaining rope skipping counting and/or rope skipping time; and if the rope skipping count and/or the rope skipping time reach the rope skipping target, playing a second preset voice through the voice playing module.
In addition, after the rope skipping device enters the working state, the user can also set the rope skipping device in a voice interaction mode. For example, the second user voice may be "set target 500" or "set time 30 minutes". After the voice acquisition module acquires the voice of the second user, the processor obtains a second text corresponding to the voice of the second user through voice recognition, and the instruction corresponding to the second text is 'set the rope skipping target to be 500 or 30 minutes'. When the processor executes the instruction corresponding to the second text, the skipping target, for example, 500 or 30 minutes, can be stored. Further, the processor may generate an instruction execution result, and the instruction execution result may specifically be "set to complete", so that the instruction execution result is played through the voice playing module. Next, if the user swings the rope to jump, the rope skipping counting unit in the rope skipping device may count the rope skipping count and/or the rope skipping time, specifically, the rope skipping count may be counted in a physical manner. When the processor acquires the rope skipping count and/or the rope skipping time counted by the rope skipping counting unit, the processor can judge whether the rope skipping count and/or the rope skipping time reach the stored rope skipping target. For example, if the rope skipping count has reached 500, or the rope skipping time has reached 30 minutes, the processor may control the voice playing module to play a second preset voice, where the second preset voice may specifically be "target-done cheering", "very your real club", or the like. Therefore, the user does not need to count the rope skipping count and/or the rope skipping time of the user in real time in the rope skipping process, and the user does not need to check the rope skipping count and/or the rope skipping time counted by the rope skipping counting unit in real time. 
When the user's rope skipping count and/or rope skipping time reaches the rope skipping target, the rope skipping device can automatically prompt the user that the target has been completed.
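The target-setting flow above can be sketched roughly as follows. This is only an illustration under stated assumptions: the command patterns are modelled on the two example utterances, and the names `SkipTarget` and `parse_target` are hypothetical and do not come from the disclosure. The sketch also assumes the recognized text is already available as a string.

```python
import re
from dataclasses import dataclass
from typing import Optional


@dataclass
class SkipTarget:
    """A rope skipping target: a jump count, a duration in seconds, or both."""
    count: Optional[int] = None
    seconds: Optional[int] = None

    def reached(self, count: int, elapsed_seconds: float) -> bool:
        # The target is met as soon as either configured limit is reached.
        if self.count is not None and count >= self.count:
            return True
        if self.seconds is not None and elapsed_seconds >= self.seconds:
            return True
        return False


# Hypothetical command patterns modelled on the two example utterances.
_COUNT_RE = re.compile(r"set target (\d+)")
_TIME_RE = re.compile(r"set time (\d+) minutes?")


def parse_target(second_text: str) -> Optional[SkipTarget]:
    """Map recognized text to a target, or None if it is not a set command."""
    text = second_text.strip().lower()
    match = _COUNT_RE.fullmatch(text)
    if match:
        return SkipTarget(count=int(match.group(1)))
    match = _TIME_RE.fullmatch(text)
    if match:
        # Minutes are converted to seconds for comparison with elapsed time.
        return SkipTarget(seconds=int(match.group(1)) * 60)
    return None
```

The processor would store the returned target and call `reached` each time the counting unit reports a new count or elapsed time.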
Optionally, when the rope skipping count and/or the rope skipping time do not reach the rope skipping target and the rope skipping count and/or the rope skipping time reach a preset target, a first preset voice is played through the voice playing module.
For example, some users may not be able to skip 500 times continuously. In this case, when the user's rope skipping count reaches 100, or the user's rope skipping time reaches 10 minutes, the processor may control the voice playing module to play a first preset voice, which may be an encouraging phrase such as "100 completed" or "keep going".
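One way to picture the choice between the first and second preset voices is the sketch below. It is an assumption-laden illustration rather than the disclosed implementation: the class name `ProgressAnnouncer`, the placeholder prompt strings, and the rule that each prompt is played only once are all hypothetical.

```python
from typing import Optional

# Placeholder prompt texts (assumed wording, not taken from the disclosure).
FIRST_PRESET_VOICE = "first preset voice: keep going"
SECOND_PRESET_VOICE = "second preset voice: target completed"


class ProgressAnnouncer:
    """Decides which preset voice, if any, to play for a new jump count."""

    def __init__(self, preset_target: int, skip_target: int) -> None:
        self.preset_target = preset_target  # intermediate milestone, e.g. 100
        self.skip_target = skip_target      # target set by the user, e.g. 500
        self._first_played = False
        self._second_played = False

    def on_count(self, count: int) -> Optional[str]:
        # User target reached: play the second preset voice once (assumption).
        if count >= self.skip_target:
            if not self._second_played:
                self._second_played = True
                return SECOND_PRESET_VOICE
            return None
        # Below the user target but at the preset target: encourage once.
        if count >= self.preset_target and not self._first_played:
            self._first_played = True
            return FIRST_PRESET_VOICE
        return None
```

With `preset_target=100` and `skip_target=500`, the announcer stays silent until 100 jumps, returns the first prompt at 100, and returns the second prompt at 500.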
In another possible implementation manner, the instruction corresponding to the second text is used to acquire target information, and the instruction execution result includes the target information and a rope skipping target matched with the target information.
For example, the second user voice may also be a question for acquiring target information. The target information includes, but is not limited to, weather information, temperature information, environmental information, and the like. For example, the second user voice may be "what is the weather today". After the voice acquisition module collects the second user voice, the processor obtains a second text corresponding to the second user voice through voice recognition, and the instruction corresponding to the second text is "acquire weather information". When the processor executes the instruction corresponding to the second text, request information for acquiring the weather information and/or a rope skipping target matched with the weather information can be sent to the remote server 41 or the user terminal 42 shown in fig. 5 through the communication interface in the rope skipping device. Further, the remote server 41 or the user terminal 42 may transmit the weather information and/or the rope skipping target matched with the weather information to the rope skipping device, and the weather information and/or the matched rope skipping target is then played through the voice playing module.
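The request and response exchange described above might look like the following sketch. Everything in it is assumed for illustration: the `InfoResponse` shape, the `send` callable standing in for the communication interface, and the stub `fake_server` with invented values. The disclosure does not specify a message format or protocol.

```python
from dataclasses import dataclass
from typing import Callable, Optional


@dataclass
class InfoResponse:
    """What the remote server or user terminal sends back (assumed shape)."""
    info: str
    matched_target: Optional[int] = None


# A transport is any callable that takes a request name and returns a response.
Transport = Callable[[str], InfoResponse]


def execute_info_instruction(second_text: str, send: Transport) -> Optional[str]:
    """Return the sentence to be played, or None for unrelated text."""
    if "weather" not in second_text.lower():
        return None
    response = send("weather")
    result = response.info
    if response.matched_target is not None:
        result += f" Suggested rope skipping target: {response.matched_target}."
    return result


def fake_server(request: str) -> InfoResponse:
    # Stand-in for remote server 41 or user terminal 42; values are invented.
    return InfoResponse(info="Sunny, 22 degrees.", matched_target=300)
```

Passing the transport in as a parameter keeps the instruction logic independent of whether the reply comes from the remote server or the user terminal.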
When the rope skipping count and/or the rope skipping time reach the preset target or the rope skipping target set by the user, the voice playing module plays the first preset voice or the second preset voice, so the user does not need to keep track of the rope skipping count and/or the rope skipping time during rope skipping, or check in real time the rope skipping count and/or the rope skipping time counted by the rope skipping counting unit. When the user's rope skipping count and/or rope skipping time reach the preset target or the rope skipping target, the rope skipping device can automatically prompt the user by voice, which further improves the user experience. Compared with a traditional skipping rope, a skipping rope based on voice interaction is considerably more engaging and helps motivate students to use it.
In addition, the user can set parameters (such as a rope skipping target) for the rope skipping device by voice, which makes the setting operation simple and convenient. Because the rope skipping device is generally small, voice-based setting also avoids the need to fit a small keyboard or many buttons on the device.
In summary, the voice interaction process between the user and the rope skipping apparatus generally includes the following steps as shown in fig. 6:
S501, a microphone collects sound.
For example, a microphone collects a user's voice.
S502, voice feature extraction.
For example, the processor performs speech feature extraction on the user speech collected by the microphone.
S503, the rope skipping device wakes up.
The rope skipping device waking up process may include voice waking up, voiceprint recognition, voice synthesis, and the like. For example, the processor may determine whether a wake-up word is included in the user's voice to implement voice wake-up. In addition, the processor can also perform voiceprint recognition on the user voice so as to recognize the user identity. Further, if the rope skipping device is awakened, the processor can also perform voice synthesis to generate an awakened prompt statement. Further, the speaker plays the awakened prompt statement.
S504, the rope skipping device executes the instruction and synthesizes voice.
After the rope skipping device is awakened, the user can send an instruction to the rope skipping device in a voice interaction mode, so that the rope skipping device can execute the instruction and generate an instruction execution result. Further, the instruction execution result is converted into voice by voice synthesis. Further, the speaker plays the instruction execution result.
S505, a loudspeaker plays sound.
This embodiment enables the user to interact with the rope skipping device by voice, which makes rope skipping more interesting and improves the user experience.
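The wake-up decision in S503, which combines a wake-up word with voiceprint recognition, can be illustrated with the sketch below. The disclosure does not say how voiceprints are represented or compared, so the fixed-length feature vectors, the cosine similarity measure, the wake-up word, and the threshold are all assumptions made only for this example.

```python
import math
from typing import Sequence

WAKE_WORD = "hello rope"       # hypothetical wake-up word
VOICEPRINT_THRESHOLD = 0.8     # hypothetical similarity threshold


def cosine_similarity(a: Sequence[float], b: Sequence[float]) -> float:
    """Cosine similarity of two vectors; 0.0 if either has zero length."""
    dot = sum(x * y for x, y in zip(a, b))
    norm = math.sqrt(sum(x * x for x in a)) * math.sqrt(sum(y * y for y in b))
    return dot / norm if norm else 0.0


def should_wake(first_text: str,
                voiceprint: Sequence[float],
                target_voiceprint: Sequence[float]) -> bool:
    """Wake only if the text has the wake-up word and the voiceprint matches."""
    if WAKE_WORD not in first_text.lower():
        return False
    similarity = cosine_similarity(voiceprint, target_voiceprint)
    return similarity >= VOICEPRINT_THRESHOLD
```

Checking the wake-up word first means the voiceprint comparison runs only for utterances that could actually wake the device.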
Fig. 7 is a schematic flowchart of a voice interaction method according to an embodiment of the present disclosure. The method is applied to sports equipment. The sports equipment is not limited to the rope skipping device, and may also be, for example, a treadmill, a dumbbell, an elliptical trainer, a rowing machine, a barbell, or the like. Specifically, the sports equipment comprises a voice acquisition module, a voice playing module, a motion amount statistic unit and a computing unit, and the computing unit is used for executing the method. As shown in fig. 7, the method includes the following steps:
S701, acquiring the first user voice collected by the voice acquisition module, and performing voice recognition on the first user voice to obtain a first text corresponding to the first user voice.
S702, if the first text comprises the target keyword, controlling the sports equipment to enter a working state.
S703, acquiring the second user voice collected by the voice acquisition module, and performing voice recognition on the second user voice to obtain a second text corresponding to the second user voice.
S704, executing the instruction corresponding to the second text, and generating an instruction execution result.
S705, playing at least one of the following through the voice playing module: the instruction execution result, the motion amount counted by the motion amount counting unit and a first preset voice corresponding to the motion amount reaching a preset target.
It is understood that the implementation process and specific principles of S701-S705 may refer to S101-S105 described above, and are not described herein again.
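As a rough sketch of how S701 to S705 fit together, the controller below models the two states involved. It assumes speech recognition has already produced text, and it treats instruction execution and playback as injected callables; the class name `ExerciseVoiceController` and the `State` enum are hypothetical names introduced only for illustration.

```python
from enum import Enum
from typing import Callable


class State(Enum):
    STANDBY = "standby"
    WORKING = "working"


class ExerciseVoiceController:
    """Two-state controller: wake on the target keyword, then run commands."""

    def __init__(self, target_keyword: str,
                 execute: Callable[[str], str],
                 play: Callable[[str], None]) -> None:
        self.target_keyword = target_keyword
        self.execute = execute  # maps a second text to an execution result
        self.play = play        # stands in for the voice playing module
        self.state = State.STANDBY

    def on_text(self, text: str) -> None:
        """Handle text already produced by speech recognition."""
        if self.state is State.STANDBY:
            # S702: only the target keyword moves the device to working state.
            if self.target_keyword in text:
                self.state = State.WORKING
            return
        # S704: execute the instruction; S705: play the execution result.
        result = self.execute(text)
        self.play(result)
```

Because the controller depends only on text and two callables, the same logic could sit behind a rope skipping handle, a treadmill, or any other equipment with a microphone and a speaker.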
Fig. 8 is a schematic structural diagram of a voice interaction apparatus according to an embodiment of the present disclosure. The voice interaction apparatus provided by the embodiment of the disclosure can execute the processing flow provided by the voice interaction method embodiment. The apparatus is arranged on a rope skipping device; the rope skipping device comprises a rope part and a handle part, and the handle part comprises a voice acquisition module, a voice playing module, a rope skipping counting unit and the apparatus, where the apparatus may be the calculating unit described above. As shown in fig. 8, the voice interaction apparatus 80 includes:
an obtaining module 81, configured to obtain the first user voice collected by the voice acquisition module;
the recognition module 82 is configured to perform speech recognition on the first user speech to obtain a first text corresponding to the first user speech;
the control module 83 is configured to control the rope skipping device to enter a working state when the first text includes a target keyword;
the obtaining module 81 is further configured to: acquiring a second user voice acquired by the voice acquisition module;
the recognition module 82 is further configured to: performing voice recognition on the second user voice to obtain a second text corresponding to the second user voice;
the execution module 84 is configured to execute the instruction corresponding to the second text and generate an instruction execution result;
the control module 83 is further configured to play at least one of the following through the voice playing module: the instruction execution result, the rope skipping count and/or the rope skipping time counted by the rope skipping counting unit, and a corresponding first preset voice when the rope skipping count and/or the rope skipping time reach a preset target.
Optionally, the recognition module 82 is further configured to perform voiceprint recognition on the first user voice to obtain a voiceprint corresponding to the first user voice; the control module 83 is specifically configured to control the rope skipping device to enter a working state when the first text includes the target keyword and the voiceprint corresponding to the first user voice matches a pre-stored target voiceprint.
Optionally, the instruction corresponding to the second text is used for setting a rope skipping target; the obtaining module 81 is further configured to obtain a rope skipping count and/or rope skipping time after the instruction execution result is played by the voice playing module; the control module 83 is further configured to play a second preset voice through the voice playing module when the rope skipping count and/or the rope skipping time reach the rope skipping target.
Optionally, the control module 83 is further configured to play a first preset voice through the voice playing module when the rope skipping count and/or the rope skipping time do not reach the rope skipping target and the rope skipping count and/or the rope skipping time reach the preset target.
Optionally, the instruction corresponding to the second text is used to obtain target information, and the instruction execution result includes the target information and a rope skipping target matched with the target information.
The voice interaction apparatus in the embodiment shown in fig. 8 can be used to implement the technical solutions of the above method embodiments, and the implementation principles and technical effects are similar, which are not described herein again.
Fig. 9 is a schematic structural diagram of a voice interaction apparatus according to another embodiment of the present disclosure. The voice interaction device provided by the embodiment of the disclosure can execute the processing flow provided by the voice interaction method embodiment, the device is arranged on the sports equipment, and the sports equipment comprises a voice acquisition module, a voice playing module, a motion amount statistic unit and the device. As shown in fig. 9, the voice interaction apparatus 90 includes:
the obtaining module 91 is configured to obtain the first user voice collected by the voice acquisition module;
the recognition module 92 is configured to perform voice recognition on the first user voice to obtain a first text corresponding to the first user voice;
the control module 93 is configured to control the sports equipment to enter a working state if the first text includes a target keyword;
the obtaining module 91 is further configured to: acquiring a second user voice acquired by the voice acquisition module;
the recognition module 92 is further configured to: performing voice recognition on the second user voice to obtain a second text corresponding to the second user voice;
the execution module 94 is configured to execute the instruction corresponding to the second text and generate an instruction execution result;
the control module 93 is further configured to play at least one of the following through the voice playing module: the instruction execution result, the motion amount counted by the motion amount counting unit and a first preset voice corresponding to the motion amount reaching a preset target.
The voice interaction apparatus in the embodiment shown in fig. 9 can be used to implement the technical solutions of the above method embodiments, and the implementation principles and technical effects are similar, and are not described herein again.
The internal functions and structure of the voice interaction apparatus, which can be implemented as an electronic device, are described above. Fig. 10 is a schematic structural diagram of an embodiment of an electronic device provided in the embodiment of the present disclosure. As shown in fig. 10, the electronic device includes a memory 101 and a processor 102.
The memory 101 is used for storing programs. In addition to the above-described programs, the memory 101 may also be configured to store other various data to support operations on the electronic device. Examples of such data include instructions for any application or method operating on the electronic device, contact data, phonebook data, messages, pictures, videos, and so forth.
The memory 101 may be implemented by any type or combination of volatile or non-volatile memory devices, such as Static Random Access Memory (SRAM), electrically erasable programmable read-only memory (EEPROM), erasable programmable read-only memory (EPROM), programmable read-only memory (PROM), read-only memory (ROM), magnetic memory, flash memory, magnetic or optical disks.
The processor 102 is coupled to the memory 101 and executes the program stored in the memory 101 to:
acquiring first user voice acquired by the voice acquisition module, and performing voice recognition on the first user voice to obtain a first text corresponding to the first user voice;
if the first text comprises a target keyword, controlling the rope skipping equipment to enter a working state;
acquiring second user voice acquired by the voice acquisition module, and performing voice recognition on the second user voice to obtain a second text corresponding to the second user voice;
executing the instruction corresponding to the second text and generating an instruction execution result;
playing at least one of the following through the voice playing module: the instruction execution result, the rope skipping count and/or the rope skipping time counted by the rope skipping counting unit, and a corresponding first preset voice when the rope skipping count and/or the rope skipping time reach a preset target.
Further, as shown in fig. 10, the electronic device may further include: communication components 103, power components 104, audio components 105, display 106, and other components. Only some of the components are schematically shown in fig. 10, and the electronic device is not meant to include only the components shown in fig. 10.
The communication component 103 is configured to facilitate wired or wireless communication between the electronic device and other devices. The electronic device may access a wireless network based on a communication standard, such as WiFi, 2G or 3G, or a combination thereof. In an exemplary embodiment, the communication component 103 receives a broadcast signal or broadcast related information from an external broadcast management system via a broadcast channel. In an exemplary embodiment, the communication component 103 further comprises a Near Field Communication (NFC) module to facilitate short-range communication. For example, the NFC module may be implemented based on Radio Frequency Identification (RFID) technology, infrared data association (IrDA) technology, Ultra Wideband (UWB) technology, Bluetooth (BT) technology, and other technologies.
The power component 104 provides power to various components of the electronic device. The power component 104 may include a power management system, one or more power supplies, and other components associated with generating, managing, and distributing power for the electronic device.
The audio component 105 is configured to output and/or input audio signals. For example, the audio component 105 includes a Microphone (MIC) configured to receive external audio signals when the electronic device is in an operational mode, such as a call mode, a recording mode, and a voice recognition mode. The received audio signal may further be stored in the memory 101 or transmitted via the communication component 103. In some embodiments, audio component 105 also includes a speaker for outputting audio signals.
The display 106 includes a screen, which may include a Liquid Crystal Display (LCD) and a Touch Panel (TP). If the screen includes a touch panel, the screen may be implemented as a touch screen to receive an input signal from a user. The touch panel includes one or more touch sensors to sense touch, slide, and gestures on the touch panel. The touch sensor may not only sense the boundary of a touch or slide action, but also detect the duration and pressure associated with the touch or slide operation.
In addition, the embodiment of the present disclosure also provides a computer readable storage medium, on which a computer program is stored, where the computer program is executed by a processor to implement the voice interaction method described in the above embodiment.
It is noted that, in this document, relational terms such as "first" and "second," and the like, may be used solely to distinguish one entity or action from another entity or action without necessarily requiring or implying any actual such relationship or order between such entities or actions. Also, the terms "comprises," "comprising," or any other variation thereof, are intended to cover a non-exclusive inclusion, such that a process, method, article, or apparatus that comprises a list of elements does not include only those elements but may include other elements not expressly listed or inherent to such process, method, article, or apparatus. Without further limitation, an element defined by the phrase "comprising a ..." does not exclude the presence of other identical elements in a process, method, article, or apparatus that comprises the element.
The foregoing are merely exemplary embodiments of the present disclosure, which enable those skilled in the art to understand or practice the present disclosure. Various modifications to these embodiments will be readily apparent to those skilled in the art, and the generic principles defined herein may be applied to other embodiments without departing from the spirit or scope of the disclosure. Thus, the present disclosure is not intended to be limited to the embodiments shown herein but is to be accorded the widest scope consistent with the principles and novel features disclosed herein.

Claims (11)

1. A voice interaction method, wherein the method is applied to a rope skipping apparatus, the rope skipping apparatus comprises a rope portion and a handle portion, the handle portion comprises a voice acquisition module, a voice playing module, a rope skipping counting unit and a computing unit, the computing unit is used for executing the method, and the method comprises:
acquiring first user voice acquired by the voice acquisition module, and performing voice recognition on the first user voice to obtain a first text corresponding to the first user voice;
if the first text comprises a target keyword, controlling the rope skipping equipment to enter a working state;
acquiring second user voice acquired by the voice acquisition module, and performing voice recognition on the second user voice to obtain a second text corresponding to the second user voice;
executing the instruction corresponding to the second text and generating an instruction execution result;
playing at least one of the following through the voice playing module: the instruction execution result, the rope skipping count and/or the rope skipping time counted by the rope skipping counting unit, and a corresponding first preset voice when the rope skipping count and/or the rope skipping time reach a preset target.
2. The method of claim 1, wherein before controlling the rope skipping apparatus to enter the working state if the first text includes a target keyword, the method further comprises:
carrying out voiceprint recognition on the first user voice to obtain a voiceprint corresponding to the first user voice;
and the controlling the rope skipping apparatus to enter a working state if the first text comprises a target keyword comprises:
if the first text comprises the target keyword and the voiceprint corresponding to the first user voice matches a pre-stored target voiceprint, controlling the rope skipping apparatus to enter a working state.
3. The method of claim 1, wherein the instruction corresponding to the second text is used for setting a rope skipping target;
after the instruction execution result is played through the voice playing module, the method further comprises:
obtaining rope skipping counting and/or rope skipping time;
and if the rope skipping count and/or the rope skipping time reach the rope skipping target, playing a second preset voice through the voice playing module.
4. The method of claim 3, wherein the method further comprises:
and when the rope skipping count and/or the rope skipping time do not reach the rope skipping target and the rope skipping count and/or the rope skipping time reach the preset target, playing a first preset voice through the voice playing module.
5. The method of claim 1, wherein an instruction corresponding to the second text is used for acquiring target information, and the instruction execution result comprises the target information and a rope skipping target matched with the target information.
6. A voice interaction method, wherein the method is applied to sports equipment, the sports equipment comprises a voice acquisition module, a voice playing module, a motion amount statistic unit and a computing unit, the computing unit is used for executing the method, and the method comprises:
acquiring first user voice acquired by the voice acquisition module, and performing voice recognition on the first user voice to obtain a first text corresponding to the first user voice;
if the first text comprises the target keyword, controlling the sports equipment to enter a working state;
acquiring second user voice acquired by the voice acquisition module, and performing voice recognition on the second user voice to obtain a second text corresponding to the second user voice;
executing the instruction corresponding to the second text and generating an instruction execution result;
playing at least one of the following through the voice playing module: the instruction execution result, the motion amount counted by the motion amount counting unit and a first preset voice corresponding to the motion amount reaching a preset target.
7. A voice interaction device, wherein the device is arranged on a rope skipping device, the rope skipping device comprises a rope part and a handle part, the handle part comprises a voice acquisition module, a voice playing module, a rope skipping counting unit and the device, and the device comprises:
the acquisition module is used for acquiring the first user voice acquired by the voice acquisition module;
the recognition module is used for carrying out voice recognition on the first user voice to obtain a first text corresponding to the first user voice;
the control module is used for controlling the rope skipping equipment to enter a working state when the first text comprises a target keyword;
the acquisition module is further configured to: acquiring a second user voice acquired by the voice acquisition module;
the recognition module is further configured to: performing voice recognition on the second user voice to obtain a second text corresponding to the second user voice;
the execution module is used for executing the instruction corresponding to the second text and generating an instruction execution result;
the control module is further configured to play at least one of the following through the voice playing module: the instruction execution result, the rope skipping count and/or the rope skipping time counted by the rope skipping counting unit, and a corresponding first preset voice when the rope skipping count and/or the rope skipping time reach a preset target.
8. The apparatus of claim 7, wherein the recognition module is further configured to: carrying out voiceprint recognition on the first user voice to obtain a voiceprint corresponding to the first user voice;
the control module is specifically configured to:
when the first text comprises the target keyword and the voiceprint corresponding to the first user voice matches a pre-stored target voiceprint, controlling the rope skipping device to enter a working state.
9. A voice interaction device, wherein the device is arranged on sports equipment, the sports equipment comprises a voice acquisition module, a voice playing module, a motion amount statistic unit and the device, and the device comprises:
the acquisition module is used for acquiring the first user voice acquired by the voice acquisition module;
the recognition module is used for carrying out voice recognition on the first user voice to obtain a first text corresponding to the first user voice;
the control module is used for controlling the sports equipment to enter a working state if the first text comprises the target keyword;
the acquisition module is further configured to: acquiring a second user voice acquired by the voice acquisition module;
the recognition module is further used for performing voice recognition on the second user voice to obtain a second text corresponding to the second user voice;
the execution module is used for executing the instruction corresponding to the second text and generating an instruction execution result;
the control module is further configured to play at least one of the following through the voice playing module: the instruction execution result, the motion amount counted by the motion amount counting unit and a first preset voice corresponding to the motion amount reaching a preset target.
10. An electronic device, comprising:
a memory;
a processor; and
a computer program;
wherein the computer program is stored in the memory and configured to be executed by the processor to implement the method of any one of claims 1-6.
11. A computer-readable storage medium, on which a computer program is stored, wherein the computer program, when executed by a processor, implements the method of any one of claims 1-6.
CN202111394851.5A 2021-11-23 2021-11-23 Voice interaction method, device, equipment and storage medium Pending CN114283799A (en)

Priority Applications (1)

Application Number Priority Date Filing Date Title
CN202111394851.5A CN114283799A (en) 2021-11-23 2021-11-23 Voice interaction method, device, equipment and storage medium

Applications Claiming Priority (1)

Application Number Priority Date Filing Date Title
CN202111394851.5A CN114283799A (en) 2021-11-23 2021-11-23 Voice interaction method, device, equipment and storage medium

Publications (1)

Publication Number Publication Date
CN114283799A true CN114283799A (en) 2022-04-05

Family

ID=80869772

Family Applications (1)

Application Number Title Priority Date Filing Date
CN202111394851.5A Pending CN114283799A (en) 2021-11-23 2021-11-23 Voice interaction method, device, equipment and storage medium

Country Status (1)

Country Link
CN (1) CN114283799A (en)

Cited By (2)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN115120926A (en) * 2022-07-21 2022-09-30 九江学院 A intelligent rope skipping ware for physical training
CN115240668A (en) * 2022-07-06 2022-10-25 广东开放大学(广东理工职业学院) Voice interaction household control method and robot

Cited By (4)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN115240668A (en) * 2022-07-06 2022-10-25 广东开放大学(广东理工职业学院) Voice interaction household control method and robot
CN115240668B (en) * 2022-07-06 2023-06-02 广东开放大学(广东理工职业学院) Voice interaction home control method and robot
CN115120926A (en) * 2022-07-21 2022-09-30 九江学院 A intelligent rope skipping ware for physical training
CN115120926B (en) * 2022-07-21 2023-11-03 九江学院 A intelligent rope skipping ware for physical training

Legal Events

Date Code Title Description
PB01 Publication
SE01 Entry into force of request for substantive examination