CN107452386B

CN107452386B - Voice data processing method and system

Info

Publication number: CN107452386B
Application number: CN201710703578.7A
Authority: CN
Inventors: 谢兵; 黎广斌; 张旭辉; 王东洋; 张天铖
Original assignee: Lenovo Beijing Ltd
Current assignee: Lenovo Beijing Ltd
Priority date: 2017-08-16
Filing date: 2017-08-16
Publication date: 2020-03-24
Anticipated expiration: 2037-08-16
Also published as: CN107452386A

Abstract

The invention provides a voice data processing method and a system, wherein the method comprises the following steps: acquiring voice data simultaneously sent by voice assistants of a plurality of electronic devices belonging to the same group; determining a target electronic device closest to a sender of voice data from the plurality of electronic devices; controlling a voice assistant of the target electronic device to respond to the voice data; therefore, in the invention, when the voice assistants of the electronic devices all receive the voice data sent by the user, only the voice assistant of the target electronic device closest to the user can respond to the voice data, so that manual operation of the user is not needed, and the user experience is improved.

Description

Voice data processing method and system

Technical Field

The invention relates to the technical field of data processing, in particular to a voice data processing method and system.

Background

At present, most electronic devices such as mobile phones, smart sound boxes, smart televisions and the like support voice assistants, and the voice assistants have the functions of realizing voice control, information query and the like in a voice interaction mode.

The voice assistant of the electronic equipment can perform voice interaction when the voice assistant is in a wake-up state.

As an application scenario, when a user wants to perform voice interaction with one electronic device, if voice assistants on a plurality of electronic devices around the user are all in an awake state, then all of the electronic devices respond to the voice information of the user, which is obviously contrary to the intention of the user. When the situation occurs, the user needs to control the voice assistant of other electronic equipment to exit the awakening state, the operation is complicated, and the user experience is reduced.

Disclosure of Invention

In view of this, the present invention provides a method and a system for processing voice data to simplify user operations and improve user experience.

In order to achieve the purpose, the invention provides the following technical scheme:

a method of speech data processing, comprising:

acquiring voice data simultaneously sent by voice assistants of a plurality of electronic devices belonging to the same group;

determining a target electronic device closest to a sender of voice data from the plurality of electronic devices;

a voice assistant that controls the target electronic device responds to the voice data.

Preferably, the method further comprises the following steps:

judging whether the plurality of electronic devices are located in the same network;

correspondingly, the determining a target electronic device closest to the sender of the voice data from the plurality of electronic devices includes:

a target electronic device closest to a sender of voice data is determined among a plurality of electronic devices belonging to the same network.

Preferably, the method further comprises the following steps:

judging whether the voice data sent by the voice assistants of the plurality of electronic devices at the same time belong to the same voiceprint;

a target electronic device closest to a sender of voice data is determined from among a plurality of electronic devices whose voice data belong to the same voiceprint.

Preferably, the method further comprises the following steps:

and if not, controlling the voice assistants of the electronic equipment to respectively respond to the voice data.

Preferably, determining a target electronic device closest to a sender of voice data from among the plurality of electronic devices includes:

receiving distance parameters respectively sent by the plurality of electronic devices; the distance parameter is the distance between the electronic equipment and the sender of the voice data, which is determined by the electronic equipment based on the attribute information of the voice data;

and determining the electronic equipment with the minimum distance parameter as target electronic equipment.

acquiring attribute information of the voice data;

and determining the electronic equipment of which the attribute information meets the preset condition as target electronic equipment.

A speech data processing system comprising:

the electronic equipment runs with a voice assistant respectively and is used for acquiring voice data through the voice assistant;

the server is used for acquiring voice data simultaneously sent by the voice assistants of a plurality of electronic devices belonging to the same group, determining a target electronic device which is closest to a sender of the voice data from the plurality of electronic devices, and controlling the voice assistant of the target electronic device to respond to the voice data.

Preferably, the server is further configured to determine whether the plurality of electronic devices are located in the same network, and is specifically configured to determine a target electronic device closest to a sender of the voice data from among the plurality of electronic devices belonging to the same network.

Preferably, the server is further configured to determine whether the voice data sent by the voice assistants of the plurality of electronic devices at the same time belong to the same voiceprint, and specifically, determine a target electronic device closest to the sender of the voice data from the plurality of electronic devices whose voice data belong to the same voiceprint.

Preferably, the electronic device is configured to determine a distance parameter between the electronic device and a sender of the voice data based on attribute information of the voice data;

the server is specifically configured to receive the distance parameters sent by the multiple electronic devices, and determine the electronic device with the smallest distance parameter as the target electronic device.

Preferably, the server is specifically configured to acquire attribute information of the voice data, and determine that an electronic device whose attribute information satisfies a preset condition is a target electronic device.

As can be seen from the above technical solutions, compared with the prior art, the present invention discloses a voice data processing method, including: receiving voice data simultaneously transmitted by voice assistants of a plurality of electronic devices belonging to the same group, determining a target electronic device closest to a sender of the voice data from the plurality of electronic devices, and controlling the voice assistants of the target electronic devices to respond to the voice data; therefore, in the invention, when the voice assistants of the electronic devices all receive the voice data sent by the user, only the voice assistant of the target electronic device closest to the user can respond to the voice data, so that manual operation of the user is not needed, and the user experience is improved.

Drawings

In order to more clearly illustrate the embodiments of the present invention or the technical solutions in the prior art, the drawings used in the description of the embodiments or the prior art will be briefly described below, it is obvious that the drawings in the following description are only embodiments of the present invention, and for those skilled in the art, other drawings can be obtained according to the provided drawings without creative efforts.

Fig. 1 is a schematic flow chart of a voice data processing method according to an embodiment of the present invention;

fig. 2 is a schematic structural diagram of an application scenario of a voice data processing method according to an embodiment of the present invention;

FIG. 3 is a flowchart illustrating a voice data processing method according to another embodiment of the present invention;

FIG. 4 is a flowchart illustrating a voice data processing method according to another embodiment of the present invention;

fig. 5 is a schematic structural diagram of a voice data system according to an embodiment of the present invention.

Detailed Description

The technical solutions in the embodiments of the present invention will be clearly and completely described below with reference to the drawings in the embodiments of the present invention, and it is obvious that the described embodiments are only a part of the embodiments of the present invention, and not all of the embodiments. All other embodiments, which can be derived by a person skilled in the art from the embodiments given herein without making any creative effort, shall fall within the protection scope of the present invention.

An embodiment of the present invention discloses a voice data processing method, as shown in fig. 1, the method includes the following steps:

step 101: acquiring voice data simultaneously sent by voice assistants of a plurality of electronic devices belonging to the same group;

the data processing method can be applied to a server, and the server is used for receiving voice data sent by voice assistants of electronic equipment, and determining the voice data sent by the voice assistants of a plurality of electronic equipment belonging to the same group at the same time when receiving the voice data sent by the voice assistants of the plurality of electronic equipment.

It is understood that "simultaneously" means the same time, but due to the network delay and other factors, the "simultaneously" in the present invention can refer to the time difference being within a preset time, and the preset time is particularly short, such as 0.5 second, 1 second and so on. For example, the time difference between the voice data sent by the voice assistants of the two electronic devices is 1 second, which can be considered that the two electronic devices simultaneously send the voice data.

Optionally, for the received voice data sent by the voice assistants of the multiple electronic devices simultaneously, the voice data sent by the voice assistants of the multiple electronic devices belonging to the same group simultaneously needs to be determined, where the group to which the electronic devices belong may be determined by the identity of the electronic devices, and the electronic devices having the same identity belong to the same group. The identity may be an identity set by a user to represent that a plurality of electronic devices all belong to the same holder, and a specific implementation form of the identity is not limited in the present invention.

It should be noted that the identification of the electronic device may refer to the identification of the electronic device itself, or may refer to the identification of the voice assistant of the electronic device.

The voice assistant of the electronic device may carry the identity when sending the voice data to the server, or the identity of each electronic device may be stored in the server in advance, so that the server performs the grouping in advance.

Step 102: determining a target electronic device closest to a sender of voice data from among a plurality of electronic devices;

in practical applications, when a user wants to perform voice interaction with an electronic device, the user is generally close to the electronic device, that is, if there are multiple electronic devices around the user, the user is closest to the electronic device that the user wants to perform voice interaction with. Therefore, in the present application, the server can determine the target electronic device closest to the sender of the voice data from the plurality of electronic devices.

And determining at least one electronic device from the plurality of electronic devices as a target electronic device closest to the sender of the voice data.

Optionally, in an embodiment, determining a target electronic device closest to a sender of voice data from among the plurality of electronic devices includes the following processes:

(1) receiving distance parameters respectively sent by a plurality of electronic devices;

the distance parameter is the distance between the electronic equipment and the sender of the voice data, which is determined by the electronic equipment based on the attribute information of the voice data.

The electronic equipment calculates the distance parameter between the electronic equipment and the sender of the voice data based on the attribute information of the voice data collected by the voice assistant, and sends the distance parameter to the server. It should be noted that the electronic device may send the distance parameter directly to the server by carrying the distance parameter in voice data, or send the distance parameter to the server separately.

The attribute information of the voice data includes one or more of the following parameters: signal-to-noise ratio parameters, energy intensity. The electronic equipment stores different corresponding relations between the attribute information and the distance parameter in advance, searches the distance parameter corresponding to the attribute information of the voice data from the corresponding relations stored in advance, and sends the distance parameter to the server.

(2) And determining the electronic equipment with the minimum distance parameter as target electronic equipment.

After receiving the distance parameters sent by the electronic equipment, the server determines the electronic equipment with the minimum distance parameter as the target electronic equipment from a plurality of electronic equipment belonging to the same group.

Optionally, in another embodiment, determining a target electronic device closest to a sender of voice data from among the plurality of electronic devices includes the following processes:

(1) acquiring attribute information of the voice data;

after receiving voice data sent by a voice assistant of the electronic equipment, determining attribute information of the voice data, wherein the attribute information of the voice data comprises one or more of the following parameters: signal-to-noise ratio parameters, energy intensity.

(2) And determining the electronic equipment of which the attribute information meets the preset condition as target electronic equipment.

In one implementation, attribute information of voice data sent by voice assistants of multiple electronic devices in the same group may be compared, and an electronic device whose attribute information satisfies a preset condition is determined as a target electronic device, where the preset condition is a condition for determining that a distance between the electronic device and a sender of the voice data is closest. In one embodiment, when the attribute information is a signal-to-noise ratio parameter, the preset condition is a condition that the signal-to-noise ratio is optimal among a plurality of voice data; when the attribute information is energy intensity, the preset condition is a condition with the strongest energy intensity in the plurality of voice data. Taking the attribute information as the signal-to-noise ratio parameter as an example, the electronic device with the best signal-to-noise ratio is determined as the target electronic device from the attribute information of the voice data sent by the voice assistants of the electronic devices in the same group.

In another implementation manner, attribute information of voice data sent by voice assistants of multiple electronic devices in the same group may be compared with a preset attribute information, and an electronic device that meets the preset attribute information is determined to be a target electronic device.

Step 103: the voice assistant of the control target electronic device responds to the voice data.

The target electronic device is the electronic device that is closest to the sender of the voice data, i.e., the electronic device with which the user wants to have voice interaction, and thus controls the target electronic device voice assistant to respond to the voice data.

The embodiment of the invention provides a data processing method, which comprises the following steps: receiving voice data simultaneously transmitted by voice assistants of a plurality of electronic devices belonging to the same group, determining a target electronic device closest to a sender of the voice data from the plurality of electronic devices, and controlling the voice assistants of the target electronic devices to respond to the voice data; therefore, in the invention, when the voice assistants of the electronic devices all receive the voice data sent by the user, only the voice assistant of the target electronic device closest to the user can respond to the voice data, so that manual operation of the user is not needed, and the user experience is improved.

In an application scenario, as shown in fig. 2, the server 100 receives voice data simultaneously transmitted by the voice assistant of the electronic device 201, the voice assistant of the electronic device 202, and the voice assistant of the electronic device 203, respectively, and determines that the electronic device 201 and the electronic device 202 belong to the same group, so that it is determined from the electronic device 201 and the electronic device 202 that the electronic device 201 is closest to the sender of the voice data, and then controls the voice assistant of the electronic device 201 to respond to the voice data transmitted by the voice assistant thereof, and prohibits the electronic device 202 from responding to the voice data transmitted by the voice assistant thereof. Since electronic device 203 is in a different group than electronic device 201 and electronic device 202, electronic device 203 is also controlled to respond to the voice data sent by its voice assistant.

That is, in the above scenario, the user has the electronic device 201 and the electronic device 202 around him and wants to perform voice interaction with the electronic device 201, and since the voice assistant of the electronic device 201 and the voice assistant of the electronic device 202 are both in the wake-up state, both the voice assistant of the electronic device 201 and the voice assistant of the electronic device 202 can receive the voice data and send it to the server.

Another embodiment of the present invention discloses a method for processing voice data, as shown in fig. 3, the method includes the following steps:

step 301: acquiring voice data simultaneously sent by voice assistants of a plurality of electronic devices belonging to the same group;

step 302: judging whether the electronic devices are located in the same network, if so, entering step 303; if not, go to step 305;

specifically, the electronic devices are electronic devices belonging to the same group and whose voice assistants send voice data to the server at the same time, and whether the electronic devices are located at the same position is determined by judging whether the electronic devices are located in the same network, if so, determining that the electronic devices are located at the same position, and if so, determining that the electronic devices are located at different positions.

Alternatively, whether the plurality of electronic devices are located in the same network may be determined based on network addresses or network identifications of the electronic devices, electronic devices having the same network address or the same network identification may be located in the same network, and electronic devices having different network addresses or different network identifications may be located in different networks.

Step 303: determining a target electronic device closest to a sender of voice data from a plurality of electronic devices belonging to the same network;

the electronic devices belonging to the same network are located at the same position, and the voice data collected by the voice assistant is sent by the same sender, so that the target electronic device closest to the sender of the voice data is determined from the electronic devices belonging to the same network.

Step 304: controlling a voice assistant of the target electronic equipment to respond to the voice data;

step 305: and respectively responding to the voice data by controlling the voice assistants of the electronic equipment belonging to different networks.

Since electronic devices belonging to different networks are located at different positions, even if the voice assistants of the plurality of electronic devices simultaneously transmit voice data to the server, the voice data received by the voice assistant of each electronic device belonging to the same group is not transmitted by the same sender.

As an application scenario, electronic device a and electronic device B belong to the same group, electronic device a is located at the home of user 1, and electronic device B is located at the company of user 1, and there is a moment that while family 2 of user 1 uses electronic device a to perform voice interaction, user 1 uses electronic device B to perform voice interaction. Therefore, in this case, even if the electronic apparatus a and the electronic apparatus B belong to the same group and the voice data is transmitted to the server by the voice assistant at the same time, since the two electronic apparatuses belong to different networks, the server controls the voice assistant of the electronic apparatus a and the voice assistant of the electronic apparatus B to respond to the received voice data, respectively.

The embodiment of the invention provides a data processing method, which comprises the following steps: receiving voice data simultaneously sent by voice assistants of a plurality of electronic devices belonging to the same group, judging whether the electronic devices are located in the same network, determining a target electronic device which is closest to a sender of the voice data from the electronic devices belonging to the same network, and controlling the voice assistant of the target electronic device to respond to the voice data; therefore, in this embodiment, when the voice assistants of the multiple electronic devices all receive the voice data sent by the user, the target electronic device closest to the user can be determined from the electronic devices belonging to the same network, and the voice assistants are controlled to respond to the voice data, so that manual operation by the user is not required, user experience is improved, and accuracy of data processing is improved.

Another embodiment of the present invention discloses a method for processing voice data, as shown in fig. 4, the method includes the following steps:

step 401: acquiring voice data simultaneously sent by voice assistants of a plurality of electronic devices belonging to the same group;

step 402: judging whether the voice data sent by the voice assistants of the plurality of electronic devices at the same time belong to the same voiceprint, if so, entering step 403; if not, go to step 405;

specifically, the electronic devices are electronic devices belonging to the same group and whose voice assistants send voice data to the server at the same time, and whether the voice data collected by the voice assistants of the electronic devices are sent by the same sender is determined by judging whether the electronic devices belong to the same voiceprint, if the electronic devices belong to the same voiceprint, the voice data collected by the voice assistants of the electronic devices are determined to be sent by the same sender, and if the electronic devices belong to different voiceprints, the voice data collected by the voice assistants of the electronic devices are determined to be sent by different senders.

Step 403: determining a target electronic device closest to a sender of voice data from a plurality of electronic devices for which the voice data belongs to the same voiceprint;

step 404: controlling a voice assistant of the target electronic equipment to respond to the voice data;

step 405: a voice assistant that controls a plurality of electronic devices and devices whose voice data belongs to different voiceprints respectively responds to the voice data.

Since the voice data belongs to different voiceprints, it can be determined that the voice data collected by different electronic device voice assistants is sent by different senders.

As an application scenario, the electronic device a and the electronic device B belong to the same group and are both located at the home of the user 1, and at this moment, the user 1 performs voice interaction using the electronic device a, and at the same time, the family 2 of the user 1 also performs voice interaction using the electronic device B. Therefore, in this case, even if the electronic device a and the electronic device B belong to the same group and the voice data is transmitted to the server by the voice assistant at the same time, since the voice data collected by the voice assistants of the two electronic devices is transmitted by different senders, the server controls the voice assistant of the electronic device a and the voice assistant of the electronic device B to respond to the received voice data, respectively.

The embodiment of the invention provides a data processing method, which comprises the following steps: receiving voice data sent by voice assistants of a plurality of electronic devices belonging to the same group at the same time, judging whether the voice data sent by the voice assistants of the plurality of electronic devices at the same time belong to the same voiceprint, determining a target electronic device which is closest to a sender of the voice data from the plurality of electronic devices of which the voice data belong to the same voiceprint, and controlling the voice assistants of the target electronic devices to respond to the voice data; therefore, in this embodiment, when the voice assistants of the multiple electronic devices all receive the voice data sent by the user, the target electronic device closest to the user can be determined from the electronic devices with the voice data belonging to the same voiceprint, and the voice assistants are controlled to respond to the voice data, so that manual operation by the user is not needed, user experience is improved, and accuracy of data processing is improved.

Corresponding to the above voice data processing method, an embodiment of the present invention further discloses a voice data processing system, which is described below by embodiments respectively:

an embodiment of the present invention discloses a voice data processing system, as shown in fig. 5, the system includes: a server 100, an electronic device 201, an electronic device 202, and an electronic device 203;

the electronic device 201 and the electronic device 202 belong to the same group, and the electronic device 203 does not belong to the group to which the electronic device 201 and the electronic device 202 belong, may belong to another group, or may not belong to any group.

The electronic device 201, the electronic device 202, and the electronic device 203 each run a voice assistant, and when voice data is collected by the voice assistant, the voice data is sent to the server 100.

In the present invention, the voice data processing system includes a plurality of electronic devices, and the present embodiment only takes three electronic devices as an example.

The server 100 is configured to acquire voice data simultaneously transmitted by voice assistants of a plurality of electronic devices belonging to the same group, determine a target electronic device closest to a sender of the voice data from the plurality of electronic devices, and control the voice assistants of the target electronic devices to respond to the voice data. In this embodiment, the server 100 is configured to obtain voice data simultaneously transmitted by the voice assistants of the electronic device 201 and the electronic device 202, determine a target electronic device closest to a sender of the voice data from the electronic device 201 and the electronic device 202, and control the voice assistant of the target electronic device to respond to the voice data.

It is understood that "simultaneously" means the same time, but due to the network delay and other factors, the "simultaneously" in the present invention can refer to the time difference being within a preset time, and the preset time is particularly short, such as 0.5 second, 1 second and so on.

Optionally, the server may determine the group to which the electronic device belongs by using the identity of the electronic device, and the electronic devices having the same identity belong to the same group. The identity may be a user-defined identity used to characterize multiple electronic devices belonging to the same holder, and the specific implementation form of the present invention is not limited.

Optionally, in an embodiment, the electronic device is configured to determine a distance parameter between the electronic device and a sender of the voice data based on attribute information of the voice data collected by the voice assistant of the electronic device; the server is used for receiving the distance parameters respectively sent by the electronic devices and determining the electronic device with the minimum distance parameter as the target electronic device.

It should be noted that the electronic device may send the distance parameter directly to the server by carrying the distance parameter in voice data, or send the distance parameter to the server separately.

The attribute information of the voice data includes one or more of the following parameters: signal-to-noise ratio parameters, energy intensity. The electronic equipment stores different corresponding relations between the attribute information and the distance parameter in advance, searches the distance parameter corresponding to the attribute information of the voice data from the corresponding relations stored in advance and sends the distance parameter to the server; after receiving the distance parameters sent by the electronic equipment, the server determines the electronic equipment with the minimum distance parameter as the target electronic equipment from a plurality of electronic equipment belonging to the same group.

Optionally, in another embodiment, the server is specifically configured to obtain attribute information of the voice data, and determine that the electronic device whose attribute information meets the preset condition is the target electronic device.

The attribute information of the voice data includes one or more of the following parameters: signal-to-noise ratio parameters, energy intensity.

In an implementation manner, when the server determines that the electronic device whose attribute information satisfies the preset condition is the target electronic device, the server may compare the attribute information of the voice data sent by the voice assistants of a plurality of electronic devices in the same group, and determine that the electronic device whose attribute information satisfies the preset condition is the target electronic device, where the preset condition is a condition for determining that a distance between the electronic device and a sender of the voice data is closest. In one embodiment, when the attribute information is a signal-to-noise ratio parameter, the preset condition is a condition that the signal-to-noise ratio is optimal among a plurality of voice data; when the attribute information is energy intensity, the preset condition is a condition with the strongest energy intensity in the plurality of voice data.

In another implementation manner, when the server determines that the electronic device whose attribute information meets the preset condition is the target electronic device, the server may compare the attribute information of the voice data sent by the voice assistants of multiple electronic devices in the same group with a preset attribute information, and determine that the electronic device that meets the preset attribute information is the target electronic device.

Therefore, in the embodiment, when the voice assistants of the plurality of electronic devices all receive the voice data sent by the user, only the voice assistant of the target electronic device closest to the user responds to the voice data, so that manual operation of the user is not needed, and user experience is improved.

In this embodiment, the server is further configured to determine whether the plurality of electronic devices are located in the same network; accordingly, the server determines a target electronic device closest to the sender of the voice data from among the plurality of electronic devices, specifically, determines a target electronic device closest to the sender of the voice data from among the plurality of electronic devices belonging to the same network.

The server determines whether the plurality of electronic devices are located at the same position by judging whether the plurality of electronic devices are located in the same network, determines that the plurality of electronic devices are located at the same position if the plurality of electronic devices are located in the same network, and determines that the plurality of electronic devices are located at different positions if the plurality of electronic devices are located in different networks.

Alternatively, the server may determine whether the plurality of electronic devices are located in the same network based on network addresses or network identifications of the electronic devices, electronic devices having the same network address or the same network identification are located in the same network, and electronic devices having different network addresses or different network identifications are located in different networks.

In another embodiment of the present invention, the server is further configured to control the electronic devices belonging to different networks to respectively respond to the voice data sent by their voice assistants.

Therefore, in the invention, when the voice assistants of a plurality of electronic devices all receive the voice data sent by the user, the target electronic device closest to the user can be determined from the electronic devices belonging to the same network, and the voice assistants are controlled to respond to the voice data, so that manual operation of the user is not needed, the user experience is improved, and the accuracy of data processing is improved.

In this embodiment, the server is further configured to determine whether the voice data sent by the voice assistants of the multiple electronic devices simultaneously belong to the same voiceprint, and accordingly, the server determines a target electronic device closest to the sender of the voice data from the multiple electronic devices, specifically, determines a target electronic device closest to the sender of the voice data from the multiple electronic devices where the voice data belong to the same voiceprint.

The server determines whether the voice data collected by the voice assistants of the electronic devices are sent by the same sender by judging whether the electronic devices belong to the same voiceprint, determines that the voice data collected by the voice assistants of the electronic devices are sent by the same sender if the electronic devices belong to the same voiceprint, and determines that the voice data collected by the voice assistants of the electronic devices are sent by different senders if the electronic devices belong to different voiceprints.

In another embodiment of the present invention, the server is further configured to control the electronic devices whose voice data belong to different voiceprints to respectively respond to the voice data sent by their voice assistants.

Therefore, in this embodiment, when the voice assistants of the multiple electronic devices all receive the voice data sent by the user, the target electronic device closest to the user can be determined from the electronic devices with the voice data belonging to the same voiceprint, and the voice assistants are controlled to respond to the voice data, so that manual operation by the user is not needed, user experience is improved, and accuracy of data processing is improved.

The embodiments in the present description are described in a progressive manner, each embodiment focuses on differences from other embodiments, and the same and similar parts among the embodiments are referred to each other. The device disclosed by the embodiment corresponds to the method disclosed by the embodiment, so that the description is simple, and the relevant points can be referred to the method part for description.

The previous description of the disclosed embodiments is provided to enable any person skilled in the art to make or use the present invention. Various modifications to these embodiments will be readily apparent to those skilled in the art, and the generic principles defined herein may be applied to other embodiments without departing from the spirit or scope of the invention. Thus, the present invention is not intended to be limited to the embodiments shown herein but is to be accorded the widest scope consistent with the principles and novel features disclosed herein.

Claims

1. A method for processing voice data, comprising:

acquiring voice data simultaneously sent by voice assistants of a plurality of electronic devices belonging to the same group, wherein the electronic devices with the same identity belong to the same group, and the identity is an identity which is set by a user and used for representing that the electronic devices all belong to the same holder;

controlling a voice assistant of the target electronic device to respond to the voice data;

wherein determining a target electronic device that is closest to a sender of voice data from the plurality of electronic devices comprises:

2. The method of claim 1, further comprising:

3. The method of claim 1, further comprising:

4. The method of claim 2 or 3, further comprising:

5. The method of claim 1, wherein determining a target electronic device from the plurality of electronic devices that is closest to a sender of voice data comprises:

acquiring attribute information of the voice data;

6. A speech data processing system, comprising:

the server is used for acquiring voice data sent by voice assistants of a plurality of electronic devices belonging to the same group at the same time, determining a target electronic device which is closest to a sender of the voice data from the plurality of electronic devices, and controlling the voice assistants of the target electronic devices to respond to the voice data, wherein the electronic devices with the same identity belong to the same group, and the identity is an identity which is set by a user and used for representing that the plurality of electronic devices all belong to the same holder;

7. The system according to claim 6, wherein the server is further configured to determine whether the plurality of electronic devices are located in the same network, and in particular to determine a target electronic device closest to the sender of the voice data from among the plurality of electronic devices belonging to the same network.

8. The system according to claim 6, wherein the server is further configured to determine whether the voice data sent by the voice assistants of the plurality of electronic devices simultaneously belong to the same voiceprint, and in particular to determine a target electronic device closest to the sender of the voice data from among the plurality of electronic devices whose voice data belong to the same voiceprint.