CN108096841B

CN108096841B - Voice interaction method and device, electronic equipment and readable storage medium

Info

Publication number: CN108096841B
Application number: CN201711385724.2A
Authority: CN
Inventors: 于情情; 樊祥东; 周文波
Original assignee: Zhuhai Juntian Electronic Technology Co Ltd
Current assignee: Zhuhai Juntian Electronic Technology Co Ltd
Priority date: 2017-12-20
Filing date: 2017-12-20
Publication date: 2021-06-04
Anticipated expiration: 2037-12-20
Also published as: CN108096841A

Abstract

The embodiment of the invention provides a voice interaction method, a voice interaction device, electronic equipment and a readable storage medium. The method is applied to the electronic equipment and comprises the following steps: when detecting that a game runs on the electronic equipment, judging whether voice is monitored or not; if the voice is monitored, taking the monitored voice as a target voice, and inputting the target voice into a preset voice reply model to obtain a reply voice which is output by the voice reply model and aims at the target voice; wherein the voice reply model is to: acquiring a reply voice of the voice input to the voice reply model based on a mapping relation between the preset voice and the preset reply voice; presetting the voice comprises: the game voice is sent out during the running process of the game; and playing the reply voice of the target voice. By applying the voice interaction scheme provided by the embodiment of the invention, the response can be performed aiming at the voice sent by the game, the loneliness of the game player in the game playing process is reduced, and the game experience of the game player is improved.

Description

Voice interaction method and device, electronic equipment and readable storage medium

Technical Field

The present invention relates to the field of game technologies, and in particular, to a voice interaction method, apparatus, electronic device, and readable storage medium.

Background

At present, users often use electronic devices such as mobile phones and computers to play games. Also, during game play, games often emit game voices such as go, enemy, and 5 seconds to battle field and game over.

The inventor has found that since these game voices are voices uttered by the game itself for prompting the user to perform operations, those skilled in the art recognize that these voices are not necessary, nor are there any need for voice interaction.

Disclosure of Invention

Embodiments of the present invention provide a voice interaction method, apparatus, electronic device, and readable storage medium, so as to reply to a game voice sent by a game, reduce a sense of loneliness of a game player during a game playing process, and thereby improve a game experience of the game player. The specific technical scheme is as follows:

in a first aspect, an embodiment of the present invention provides a voice interaction method, which is applied to an electronic device, and the method may include:

when detecting that a game runs on the electronic equipment, judging whether voice is monitored or not;

if the voice is monitored, taking the monitored voice as a target voice, and inputting the target voice into a preset voice reply model to obtain a reply voice which is output by the voice reply model and aims at the target voice; wherein the voice reply model is to: acquiring a reply voice of the voice input to the voice reply model based on a mapping relation between the preset voice and the preset reply voice; presetting the voice comprises: game voice which is sent out by the game in the game running process;

and playing the reply voice of the target voice.

Optionally, the preset voice may include: game voice sent by the game in the game running process and preset user voice; the user speech includes: the voice of the user is made during the running of the game.

Optionally, the step of inputting the target speech into a preset speech reply model to obtain a reply speech for the target speech, which is output by the speech reply model, may include:

inputting the target voice into a preset voice reply model, and judging whether a preset voice matched with the target voice exists in the voice reply model or not;

if yes, determining the preset reply voice corresponding to the preset voice matched with the target voice as follows: a reply voice to the target voice.

Optionally, when it is determined that the preset speech matched with the target speech does not exist in the speech reply model, the method may further include:

newly adding a preset reply voice corresponding to the target voice;

and updating the voice reply model by using the target voice and the newly added preset reply voice.

Optionally, the preset reply voice corresponding to the preset voice matched with the target voice is determined as: the step of replying to the voice by the target voice may include:

searching the intonation corresponding to the target preset reply voice based on the intonation determination table; wherein, the target preset reply voice is: a preset reply voice corresponding to a preset voice matched with the target voice; the intonation determination table records: mapping relation between preset intonation and preset reply voice;

and adjusting the tone of the target preset reply voice by using the searched tone to obtain the reply voice of the target voice.

Optionally, the voice reply model is for: obtaining a reply voice of the voice input to the voice reply model based on a mapping relation between the preset voice, a mode identifier of a game mode to which the preset voice belongs and a preset reply voice corresponding to the preset voice in the game mode to which the preset voice belongs;

the step of inputting the target speech into a preset speech reply model to obtain a reply speech output by the speech reply model and specific to the target speech may include:

determining a mode identifier of a current game mode of the game as a target mode identifier;

and inputting the target voice and the target mode identifier into a preset voice reply model to obtain a reply voice which is output by the voice reply model and corresponds to the target voice in the current game mode.

In a second aspect, an embodiment of the present invention provides a voice interaction apparatus, which is applied to an electronic device, and the apparatus may include:

the judging unit is used for judging whether the voice is monitored or not when the game running on the electronic equipment is detected;

the reply voice obtaining unit is used for taking the monitored voice as a target voice when the judging unit judges the monitored voice, inputting the target voice into a preset voice reply model and obtaining reply voice which is output by the voice reply model and aims at the target voice; wherein the voice reply model is to: acquiring a reply voice of the voice input to the voice reply model based on a mapping relation between the preset voice and the preset reply voice; presetting the voice comprises: game voice which is sent out by the game in the game running process;

and the reply voice playing unit is used for playing the reply voice of the target voice.

Optionally, the preset voice includes: game voice sent by the game in the game running process and preset user voice; the user speech includes: the voice of the user is made during the running of the game.

Alternatively, the reply voice obtaining unit may include:

the judging subunit is used for taking the monitored voice as a target voice when the judging unit judges the monitored voice, inputting the target voice into a preset voice reply model, and judging whether a preset voice matched with the target voice exists in the voice reply model;

the first determining subunit is configured to, when the determining subunit determines that the preset reply voice corresponding to the preset voice matched with the target voice exists, determine that: a reply voice to the target voice.

Optionally, in an embodiment of the present invention, the apparatus may further include:

the adding unit is used for newly adding the preset reply voice corresponding to the target voice when the judging subunit judges that the preset voice matched with the target voice does not exist in the voice reply model;

and the updating unit is used for updating the voice reply model by utilizing the target voice and the preset reply voice newly added by the adding unit.

Optionally, the first determining subunit may specifically be configured to:

accordingly, the reply voice obtaining unit may include:

a second determining subunit, configured to determine a mode identifier of a current game mode of the game as a target mode identifier;

and the reply voice obtaining subunit is used for inputting the target voice and the target mode identifier determined by the second determining subunit into a preset voice reply model to obtain a reply voice which is output by the voice reply model and corresponds to the target voice in the current game mode.

In a third aspect, an embodiment of the present invention provides an electronic device, including a processor, a communication interface, a memory, and a communication bus, where the processor and the communication interface complete communication between the memory and the processor through the communication bus;

a memory for storing a computer program;

and the processor is used for realizing the method steps provided by any voice interaction method embodiment of the first aspect when executing the program stored in the memory.

In a fourth aspect, an embodiment of the present invention provides a readable storage medium, where the readable storage medium is a readable storage medium on an electronic device, and a computer program is stored in the readable storage medium, where the computer program, when executed by a processor, implements the method steps provided in any one of the voice interaction method embodiments of the first aspect. In the embodiment of the invention, when the electronic equipment detects that the electronic equipment runs with a game, the electronic equipment can judge whether the voice is monitored. If the voice is monitored, the monitored voice can be used as a target voice, and the target voice can be input into a preset voice reply model. Wherein, since the voice reply model is used for: and obtaining a model of the reply voice of the voice input to the voice reply model based on the mapping relation between the preset voice and the preset reply voice. And, since the preset voice includes: the game sounds during the game play. Therefore, when the monitored target voice is the game voice sent by the game, after the target voice is input into the voice reply model, the reply voice which is output by the voice reply model and is aimed at the target voice can be obtained. The resulting reply voice may then be played. Therefore, the game voice sent by the game can be replied, the loneliness of the game player in the game playing process is reduced, and the game experience of the game player is improved.

Drawings

In order to more clearly illustrate the embodiments of the present invention or the technical solutions in the prior art, the drawings used in the description of the embodiments or the prior art will be briefly described below, it is obvious that the drawings in the following description are only some embodiments of the present invention, and for those skilled in the art, other drawings can be obtained according to the drawings without creative efforts.

Fig. 1 is a flowchart of a voice interaction method according to an embodiment of the present invention;

fig. 2 is a schematic diagram of a voice interaction scenario according to an embodiment of the present invention;

fig. 3 is a schematic structural diagram of a voice interaction apparatus according to an embodiment of the present invention;

fig. 4 is a schematic structural diagram of an electronic device according to an embodiment of the present invention.

Detailed Description

The technical solutions in the embodiments of the present invention will be clearly and completely described below with reference to the drawings in the embodiments of the present invention, and it is obvious that the described embodiments are only a part of the embodiments of the present invention, and not all of the embodiments. All other embodiments, which can be derived by a person skilled in the art from the embodiments given herein without making any creative effort, shall fall within the protection scope of the present invention.

In order to solve the problems in the prior art, embodiments of the present invention provide a voice interaction method, apparatus, electronic device, and readable storage medium.

First, a voice interaction method provided by an embodiment of the present invention is described below.

It can be understood that the voice interaction method provided by the embodiment of the present invention is applied to electronic devices, including but not limited to desktop computers, tablet computers, and mobile phones.

Referring to fig. 1, a voice interaction method provided in an embodiment of the present invention may include the following steps:

s101: when detecting that a game runs on the electronic equipment, judging whether voice is monitored or not; if the voice is monitored, executing step S102;

s102: if the voice is monitored, taking the monitored voice as a target voice, and inputting the target voice into a preset voice reply model to obtain a reply voice which is output by the voice reply model and aims at the target voice; wherein the voice reply model is to: acquiring a reply voice of the voice input to the voice reply model based on a mapping relation between the preset voice and the preset reply voice; presetting the voice comprises: the game voice is sent out during the running process of the game;

s103: and playing the reply voice of the target voice.

In the embodiment of the invention, when the electronic equipment detects that the electronic equipment runs with a game, the electronic equipment can judge whether the voice is monitored. If the voice is monitored, the monitored voice can be used as a target voice, and the target voice can be input into a preset voice reply model. Wherein, since the voice reply model is used for: and obtaining a model of the reply voice of the voice input to the voice reply model based on the mapping relation between the preset voice and the preset reply voice. And, since the preset voice includes: the game sounds during the game play. Therefore, when the monitored target voice is the game voice sent by the game, after the target voice is input into the voice reply model, the reply voice which is output by the voice reply model and is aimed at the target voice can be obtained. The resulting reply voice may then be played. Therefore, the game voice sent by the game can be replied, the loneliness of the game player in the game playing process is reduced, and the game experience of the game player is improved.

As can be understood by those skilled in the art, in the field of voice interaction technology, the electronic device can reply to a user voice uttered by a user, so as to implement voice interaction. However, in the field of game technology, game voice is generated by the game itself, and is used for prompting the user to perform game operations or for bringing a good hearing experience to the user. Thus, those skilled in the art will recognize that there is no need to reply to such game speech uttered by the machine (i.e., electronic device), i.e., there is no need to voice-interact with the game speech.

In the embodiment of the invention, the inventor overcomes the bias that the game voice sent by the machine does not need to be replied, and adopts the technical means discarded by the skilled person due to the bias to obtain the voice interaction method provided by the embodiment of the invention.

The following describes the voice interaction method provided by the embodiment of the present invention in detail with reference to specific examples.

It is assumed that the electronic device a can run the game W, and the game W can send out game voices such as "welcome to the game W", "enemy to battle field, please make battle preparation", and game end in the running process, which are not illustrated herein.

Then, when the game W is executed on the electronic device a, the electronic device a may detect that the game W is executed in the memory, that is, may detect that the electronic device a itself is executed with the game W. At this time, the electronic apparatus a may determine whether a sound monitor (e.g., a microphone) built in the electronic apparatus a has heard the voice.

When the electronic device a monitors the voice in the process of running the game W: when the enemy arrives at the battlefield for 5 seconds and please make preparation for battle, the electronic device a may use the monitored voice as the target voice and input the target voice into a preset voice reply model.

Wherein, since the voice reply model is used for: and obtaining the reply voice of the voice input to the voice reply model based on the mapping relation between the preset voice and the preset reply voice, and outputting the obtained reply voice. Moreover, the preset voices recorded in the voice reply model include: the game W utters a game voice during the game execution. Thus, the speech reply model has recorded therein: the preset voice is 'enemy still has 5 seconds to arrive at a battlefield and please make battle preparation', and the preset reply voice corresponding to the preset voice. That is, the voice reply model may be used to recognize game speech uttered by the game.

Assuming that the preset voice recorded in the voice reply model is that the enemy still arrives at the battlefield for 5 seconds, the corresponding preset reply voice of "please make battle preparation" is: "refuel! ". In this case, after the electronic device a inputs the monitored target voice "enemy has 5 seconds to the battlefield and please make battle preparation" to the voice reply model, the electronic device a may determine that: the preset voice "enemy still has 5 seconds to arrive at the battlefield, please make battle preparation" matching the target voice. Therefore, the preset reply voice' refuel!corresponding to the preset voice can be obtained! ". At this point, electronic device A may "refuel! "reply voice as target voice and" refuel "to reply voice! "play. Therefore, the play partner of the game player is virtualized in a mode of replying the game voice sent by the game, the solitary feeling of the game player playing the game alone is reduced, and the game experience of the game player is improved.

It will be appreciated that the preset voice recorded in the voice reply model "enemy has 5 seconds to arrive at the battlefield, and the corresponding preset reply voice for" ready to fight "includes but is not limited to" refuel! "one skilled in the art can set the preset reply voice corresponding to each preset voice according to specific requirements, which is not described in detail herein.

In addition, the preset voices recorded in the voice reply model include: the game W utters a game voice during the game execution. It is reasonable that the game sound generated during the running of the game, such as the game K installed in the electronic device a, can be included.

Of course, in order to enhance the interactivity between the game player and the game and further improve the game experience of the player, the preset voice in the embodiment of the present invention may include, in addition to the game voice issued during the game, the user voice issued by the game player (i.e., the user) during the running of the game. That is, the voice reply model may also be used to recognize user speech uttered by the user during the game. For example, the voice reply model includes: the voice uttered by the user during the running of the game W (i.e. the target voice) "wastes" and "uses the preset reply voice corresponding to" wastes ". Therefore, the game player can carry out voice interaction with the game in the game playing process, and the interest of the game is enhanced.

Then, after receiving the voice "waste with you" uttered by the user, the electronic device a may input the voice "waste with you" into a preset voice reply model. Then, it can be judged that a preset voice matching "waste with you" exists in the voice reply model. Further, a preset reply voice corresponding to "waste you" may be used as: the user utters a "waste" reply voice and plays the reply voice.

In order to enable the reply voice played by the electronic device a to better conform to the context, the electronic device a may further search, after the preset reply voice "is found, a tone corresponding to the voice" by using the tone determination table. Wherein, the intonation confirms that records in the table has: "two" with the corresponding intonation "sadness intonation". Therefore, the found tone of "sad tone" can be used for adjusting the tone of "sad tone", and then a reply voice "with the sad tone can be obtained. Therefore, the phenomenon that the intonation of all the reply voices is uniform is avoided, and the game experience of game players is improved.

It is understood that the person skilled in the art can set the following according to the actual situation: the preset tones recorded in the tone determination table and corresponding to each preset reply voice are not illustrated herein.

In addition, when the electronic device a monitors the voice uttered by the user during the running of the game W: "begin combat," and no: when the preset voice is matched with the voice "start fighting", in one implementation, the electronic device a may perform keyword analysis on the voice "start fighting", and may analyze the keyword "fighting" in the voice "start fighting". Then, the keyword "battle" is used to match with the preset voice in the voice reply model. Therefore, the keyword ' battle ' and the preset voice ' enemy in the voice reply model can be judged to arrive at the battlefield in 5 seconds, and the match of the battle preparation is made. At this time, it can be obtained that the preset voice "enemy arrives at the battlefield for 5 seconds, please get ready for battle" corresponding to the preset reply voice "refuel! ", and will" refuel! "as the reply voice of" start battle ", and further" refuel "to the reply voice! "play.

In another implementation manner, a preset reply voice "refuel!corresponding to the voice" start fighting "can be added! ", and utilizes the voice" start battle "and its corresponding preset reply voice" refuel! "update the voice reply model, and then play the reply voice" refuel!when the voice "start fighting" is monitored next time! ", this is also reasonable. Therefore, the voice reply model can continuously perform machine learning, so that the electronic equipment can obtain more comprehensive and accurate voice reply.

Wherein, the newly added preset reply voice of the voice "start fighting" is "refuel! It is reasonable that "may be input to the voice reply model by those skilled in the art, or may be obtained by the electronic device a after performing keyword analysis and keyword matching on the voice" start battle "in the above-described manner.

In addition, it is understood that a plurality of game modes may exist for one game, for example, there are a single play mode (i.e., one-to-one fighting mode) and a group fighting mode (e.g., five-to-five fighting mode) for the game W. Suppose that when the game is started in both the single play mode and the group play mode, a game voice is given, "enemy still arrives at the battlefield for 5 seconds, please make a battle preparation". In this case, in order to perform more accurate reply for the game speech in each mode, the speech reply model in the embodiment of the present invention may specifically be used to: and acquiring the reply voice of the voice input to the voice reply model based on the mapping relation among the preset voice, the mode identifier of the game mode to which the preset voice belongs and the preset reply voice corresponding to the preset voice in the game mode to which the preset voice belongs.

For example, when the preset voice is "enemy arrives at the battlefield for 5 seconds and please make preparation for battle", the preset reply voice corresponding to the preset voice in the single-click mode may be set to "fuel-in! And the preset reply voice corresponding to the preset voice in the team fighting mode can be set as' home fueling! ".

Thus, when the electronic device a monitors the voice "enemy arrives at the battlefield for 5 seconds and please make preparation for battle", the current game mode of the game W can also be determined, for example, to be the group battle mode. Furthermore, the mode identification of the group fighting mode can be obtained and used as the target mode identification. Then, the monitored voice and the target mode identification are input into the voice reply model, so that the following steps can be obtained and played: the voice reply model outputs the reply voice corresponding to the monitored voice in the team fighting mode, namely' big people refuel! ". Like this, can reply with distinguishing to the same recreation pronunciation under the different game modes in same recreation for the pronunciation that electronic equipment A replied more press close to the game mode, reduced the solitary sense that game player played alone, promoted game player's gaming experience.

The voice interaction method provided by the embodiment of the invention is further described below with reference to a specific voice interaction scenario.

Referring to fig. 2, for a game W running on an electronic device, when a player (i.e., user) selects hero, the device emits a predetermined voice identifying the selected hero. After receiving the voice sent by the device itself, the electronic device may obtain a reply voice corresponding to the voice according to a preset voice reply model, where the obtained reply voice is, for example: and e, selecting a legal teacher and being suitable for remote control. At this time, the electronic device may reply to the voice: and e, selecting a legal teacher and being suitable for remote control. This allows game W to explain the selected hero in real time.

And, when the user replies: after using your wasted voice, the electronic device may receive the voice, obtain a preset reply voice "corresponding to" using your wasted voice ", and reply" with the damaged intonation.

In addition, for game play, when the user selects the single play mode and clicks to start the game, the electronic device may issue: the enemy also arrives at the battlefield for 5 seconds, please prepare for the battle. After receiving the voice, the electronic device can reply with: refuel! When the user selects the team mode and clicks to start the game, the electronic device may issue: the enemy also arrives at the battlefield for 5 seconds, please prepare for the battle. After receiving the voice, the electronic device can reply with: oiling everywhere!

In the game process, when the equipment sends out the voice prompt of hero death, the electronic equipment can reply the following steps after receiving the voice prompt: you die again, you bird. At the end of the game, when the device issues: when the game is finished and the voice prompt is received by the electronic equipment, the electronic equipment can reply the following steps: after the battle is finished, let I make a poem: east wind start … ….

Therefore, by the voice interaction mode, the game voice sent by the game can be replied, the loneliness of the game player in the game playing process is reduced, and the game experience of the game player can be improved.

It should be noted that this example is intended to describe a specific voice interaction scenario, and does not describe in detail the specific process of obtaining the reply voice. The specific process of obtaining the reply voice can be referred to the related description above, and is not described herein.

Corresponding to the above method embodiment, an embodiment of the present invention further provides a voice interaction apparatus, applied to an electronic device, and referring to fig. 3, the apparatus may include:

a judging unit 301, configured to judge whether a voice is monitored when it is detected that a game is running on the electronic device;

a reply voice obtaining unit 302, configured to, when the determining unit 301 determines that a voice is monitored, take the monitored voice as a target voice, and input the target voice into a preset voice reply model to obtain a reply voice output by the voice reply model and directed at the target voice; wherein the voice reply model is to: acquiring a reply voice of the voice input to the voice reply model based on a mapping relation between the preset voice and the preset reply voice; presetting the voice comprises: game voice which is sent out by the game in the game running process;

a reply voice playing unit 303, configured to play a reply voice of the target voice.

By applying the device provided by the embodiment of the invention, the electronic equipment can judge whether the voice is monitored or not when the electronic equipment detects that the electronic equipment runs a game. If the voice is monitored, the monitored voice can be used as a target voice, and the target voice can be input into a preset voice reply model. Wherein, since the voice reply model is used for: and obtaining a model of the reply voice of the voice input to the voice reply model based on the mapping relation between the preset voice and the preset reply voice. And, since the preset voice includes: the game sounds during the game play. Therefore, when the monitored target voice is the game voice sent by the game, after the target voice is input into the voice reply model, the reply voice which is output by the voice reply model and is aimed at the target voice can be obtained. The resulting reply voice may then be played. Therefore, the game voice sent by the game can be replied, the loneliness of the game player in the game playing process is reduced, and the game experience of the game player is improved.

Optionally, the reply voice obtaining unit 302 includes:

a judging subunit, configured to, when the judging unit 301 judges that the voice is monitored, take the monitored voice as a target voice, input the target voice into a preset voice reply model, and judge whether a preset voice matching the target voice exists in the voice reply model;

Optionally, the first determining subunit is specifically configured to:

accordingly, the reply voice obtaining unit 302 includes:

Corresponding to the above method embodiment, an embodiment of the present invention further provides an electronic device, referring to fig. 4, the electronic device includes a processor 401, a communication interface 402, a memory 403, and a communication bus 404, where the processor 401, the communication interface 402, and the memory 403 complete communication with each other through the communication bus 404;

a memory 403 for storing a computer program;

the processor 401 is configured to implement the method steps provided by any one of the above embodiments of the voice interaction method when executing the program stored in the memory.

The communication bus mentioned in the electronic device may be a Peripheral Component Interconnect (PCI) bus, an Extended Industry Standard Architecture (EISA) bus, or the like. The communication bus may be divided into an address bus, a data bus, a control bus, etc. For ease of illustration, only one thick line is shown, but this does not mean that there is only one bus or one type of bus.

The communication interface is used for communication between the electronic equipment and other equipment.

The Memory may include a Random Access Memory (RAM) or a Non-Volatile Memory (NVM), such as at least one disk Memory. Optionally, the memory may also be at least one memory device located remotely from the processor.

The Processor may be a general-purpose Processor, including a Central Processing Unit (CPU), a Network Processor (NP), and the like; but also Digital Signal Processors (DSPs), Application Specific Integrated Circuits (ASICs), Field Programmable Gate Arrays (FPGAs) or other Programmable logic devices, discrete Gate or transistor logic devices, discrete hardware components.

Corresponding to the above method embodiment, an embodiment of the present invention further provides a readable storage medium, where the readable storage medium is a readable storage medium in an electronic device, and a computer program is stored in the readable storage medium, and when executed by a processor, the computer program implements the method steps provided by any of the above voice interaction method embodiments.

After the computer program stored in the readable storage medium provided by the embodiment of the invention is executed by the processor of the electronic device, the electronic device can judge whether to monitor the voice when detecting that the electronic device runs a game. If the voice is monitored, the monitored voice can be used as a target voice, and the target voice can be input into a preset voice reply model. Wherein, since the voice reply model is used for: and obtaining a model of the reply voice of the voice input to the voice reply model based on the mapping relation between the preset voice and the preset reply voice. And, since the preset voice includes: the game sounds during the game play. Therefore, when the monitored target voice is the game voice sent by the game, after the target voice is input into the voice reply model, the reply voice which is output by the voice reply model and is aimed at the target voice can be obtained. The resulting reply voice may then be played. Therefore, the game voice sent by the game can be replied, the loneliness of the game player in the game playing process is reduced, and the game experience of the game player is improved.

It is noted that, herein, relational terms such as first and second, and the like may be used solely to distinguish one entity or action from another entity or action without necessarily requiring or implying any actual such relationship or order between such entities or actions. Also, the terms "comprises," "comprising," or any other variation thereof, are intended to cover a non-exclusive inclusion, such that a process, method, article, or apparatus that comprises a list of elements does not include only those elements but may include other elements not expressly listed or inherent to such process, method, article, or apparatus. Without further limitation, an element defined by the phrase "comprising an … …" does not exclude the presence of other identical elements in a process, method, article, or apparatus that comprises the element.

All the embodiments in the present specification are described in a related manner, and the same and similar parts among the embodiments may be referred to each other, and each embodiment focuses on the differences from the other embodiments. In particular, as for the apparatus, the electronic device and the readable storage medium embodiments, since they are substantially similar to the method embodiments, the description is relatively simple, and in relation to the description, reference may be made to some parts of the description of the method embodiments.

The above description is only for the preferred embodiment of the present invention, and is not intended to limit the scope of the present invention. Any modification, equivalent replacement, or improvement made within the spirit and principle of the present invention shall fall within the protection scope of the present invention.

Claims

1. A voice interaction method is applied to an electronic device, and comprises the following steps:

if the voice is monitored, taking the monitored voice as a target voice, and determining a mode identifier of the current game mode of the game as a target mode identifier; inputting the target voice and the target mode identification into a preset voice reply model to obtain reply voice which is output by the voice reply model and corresponds to the target voice in the current game mode; wherein the voice reply model is to: obtaining reply voice of the voice input to the voice reply model based on a mapping relation among preset voice, a mode identifier of a game mode to which the preset voice belongs and preset reply voice corresponding to the preset voice in the game mode to which the preset voice belongs; the preset voice comprises: the game voice is sent out by the game in the game running process;

searching the intonation corresponding to the target preset reply voice based on the intonation determination table; wherein the target preset reply voice is: a preset reply voice corresponding to a preset voice matched with the target voice; the intonation determination table records: mapping relation between preset intonation and preset reply voice;

performing tone adjustment on the target preset reply voice by using the searched tone to obtain the reply voice of the target voice;

and playing the reply voice of the target voice.

2. The method of claim 1, wherein the preset speech comprises: the game voice sent by the game in the game running process and the preset user voice; the user speech includes: the user utters voice during the running of the game.

3. The method of claim 1, wherein when it is determined that there is no preset speech in the speech reply model matching the target speech, the method further comprises:

newly adding a preset reply voice corresponding to the target voice;

4. A voice interaction device is applied to electronic equipment, and the device comprises:

a reply voice obtaining unit including:

the judging subunit is used for taking the monitored voice as a target voice when the judging unit judges the monitored voice, inputting the target voice into a preset voice reply model, and judging whether a preset voice matched with the target voice exists in the voice reply model or not; wherein the voice reply model is to: acquiring reply voice of the voice input to the voice reply model based on a mapping relation between preset voice and preset reply voice; the preset voice comprises: the game voice is sent out by the game in the game running process;

the first determining subunit is used for searching the intonation corresponding to the target preset reply voice based on the intonation determining table; wherein the target preset reply voice is: a preset reply voice corresponding to a preset voice matched with the target voice; the intonation determination table records: mapping relation between preset intonation and preset reply voice; performing tone adjustment on the target preset reply voice by using the searched tone to obtain the reply voice of the target voice;

the reply voice playing unit is used for playing the reply voice of the target voice;

the voice reply model is to: obtaining reply voice of the voice input to the voice reply model based on a mapping relation among preset voice, a mode identifier of a game mode to which the preset voice belongs and preset reply voice corresponding to the preset voice in the game mode to which the preset voice belongs;

the reply voice obtaining unit includes:

a second determining subunit, configured to determine a mode identifier of a current game mode of the game, as a target mode identifier;

and the reply voice obtaining subunit is configured to input the target voice and the target mode identifier determined by the second determining subunit into a preset voice reply model, so as to obtain a reply voice output by the voice reply model and corresponding to the target voice in the current game mode.

5. The apparatus of claim 4, wherein the preset voice comprises: the game voice sent by the game in the game running process and the preset user voice; the user speech includes: the user utters voice during the running of the game.

6. The apparatus of claim 4, further comprising:

7. An electronic device is characterized by comprising a processor, a communication interface, a memory and a communication bus, wherein the processor and the communication interface are used for realizing mutual communication by the memory through the communication bus;

a memory for storing a computer program;

a processor for implementing the method steps of any one of claims 1 to 3 when executing a program stored in the memory.

8. A readable storage medium, characterized in that the readable storage medium is a readable storage medium on an electronic device, in which a computer program is stored which, when being executed by a processor, realizes the method steps of any one of claims 1-3.