CN107452386B - Voice data processing method and system - Google Patents

Publication number
CN107452386B
Authority
CN
China
Prior art keywords
voice data
electronic devices
voice
electronic device
sender
Prior art date
Legal status (The legal status is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the status listed.)
Active
Application number
CN201710703578.7A
Other languages
Chinese (zh)
Other versions
CN107452386A (en
Inventor
谢兵
黎广斌
张旭辉
王东洋
张天铖
Current Assignee (The listed assignees may be inaccurate. Google has not performed a legal analysis and makes no representation or warranty as to the accuracy of the list.)
Lenovo Beijing Ltd
Original Assignee
Lenovo Beijing Ltd
Priority date (The priority date is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the date listed.)
Filing date
Publication date
Application filed by Lenovo Beijing Ltd
Priority to CN201710703578.7A
Publication of CN107452386A
Application granted
Publication of CN107452386B
Legal status: Active (current)
Anticipated expiration

Classifications

    • G: PHYSICS
    • G10: MUSICAL INSTRUMENTS; ACOUSTICS
    • G10L: SPEECH ANALYSIS TECHNIQUES OR SPEECH SYNTHESIS; SPEECH RECOGNITION; SPEECH OR VOICE PROCESSING TECHNIQUES; SPEECH OR AUDIO CODING OR DECODING
    • G10L17/00: Speaker identification or verification techniques
    • G10L17/22: Interactive procedures; Man-machine interfaces
    • G: PHYSICS
    • G10: MUSICAL INSTRUMENTS; ACOUSTICS
    • G10L: SPEECH ANALYSIS TECHNIQUES OR SPEECH SYNTHESIS; SPEECH RECOGNITION; SPEECH OR VOICE PROCESSING TECHNIQUES; SPEECH OR AUDIO CODING OR DECODING
    • G10L15/00: Speech recognition
    • G10L15/22: Procedures used during a speech recognition process, e.g. man-machine dialogue
    • G: PHYSICS
    • G10: MUSICAL INSTRUMENTS; ACOUSTICS
    • G10L: SPEECH ANALYSIS TECHNIQUES OR SPEECH SYNTHESIS; SPEECH RECOGNITION; SPEECH OR VOICE PROCESSING TECHNIQUES; SPEECH OR AUDIO CODING OR DECODING
    • G10L25/00: Speech or voice analysis techniques not restricted to a single one of groups G10L15/00 - G10L21/00
    • G10L25/48: Speech or voice analysis techniques not restricted to a single one of groups G10L15/00 - G10L21/00 specially adapted for particular use

Landscapes

  • Engineering & Computer Science (AREA)
  • Health & Medical Sciences (AREA)
  • Audiology, Speech & Language Pathology (AREA)
  • Human Computer Interaction (AREA)
  • Physics & Mathematics (AREA)
  • Acoustics & Sound (AREA)
  • Multimedia (AREA)
  • Computational Linguistics (AREA)
  • Signal Processing (AREA)
  • Telephonic Communication Services (AREA)

Abstract

The invention provides a voice data processing method and system. The method comprises: acquiring voice data simultaneously sent by voice assistants of a plurality of electronic devices belonging to the same group; determining, from the plurality of electronic devices, a target electronic device closest to the sender of the voice data; and controlling the voice assistant of the target electronic device to respond to the voice data. Thus, when the voice assistants of several electronic devices all receive voice data uttered by the user, only the voice assistant of the target electronic device closest to the user responds, so that no manual operation by the user is needed and the user experience is improved.

Description

Voice data processing method and system
Technical Field
The invention relates to the technical field of data processing, in particular to a voice data processing method and system.
Background
At present, most electronic devices, such as mobile phones, smart speakers and smart televisions, support voice assistants, which provide functions such as voice control and information query through voice interaction.
The voice assistant of an electronic device can perform voice interaction only when it is in a wake-up state.
As an application scenario, when a user wants to interact by voice with one electronic device and the voice assistants of several electronic devices around the user are all in the wake-up state, all of those devices respond to the user's voice information, which is clearly contrary to the user's intention. When this happens, the user has to make the voice assistants of the other electronic devices exit the wake-up state, which is cumbersome and degrades the user experience.
Disclosure of Invention
In view of this, the present invention provides a voice data processing method and system that simplify user operations and improve the user experience.
To achieve this purpose, the invention provides the following technical solutions:
A voice data processing method, comprising:
acquiring voice data simultaneously sent by voice assistants of a plurality of electronic devices belonging to the same group;
determining, from the plurality of electronic devices, a target electronic device closest to a sender of the voice data;
controlling the voice assistant of the target electronic device to respond to the voice data.
Preferably, the method further comprises:
judging whether the plurality of electronic devices are located in the same network;
correspondingly, the determining of a target electronic device closest to the sender of the voice data from the plurality of electronic devices comprises:
determining the target electronic device closest to the sender of the voice data from among the electronic devices belonging to the same network.
Preferably, the method further comprises:
judging whether the voice data simultaneously sent by the voice assistants of the plurality of electronic devices belong to the same voiceprint;
correspondingly, the determining of a target electronic device closest to the sender of the voice data from the plurality of electronic devices comprises:
determining the target electronic device closest to the sender of the voice data from among the electronic devices whose voice data belong to the same voiceprint.
Preferably, the method further comprises:
if the judgment result is negative, controlling the voice assistants of the plurality of electronic devices to respond to the voice data respectively.
Preferably, the determining of a target electronic device closest to the sender of the voice data from the plurality of electronic devices comprises:
receiving distance parameters respectively sent by the plurality of electronic devices, where a distance parameter is the distance between an electronic device and the sender of the voice data, determined by that electronic device based on attribute information of the voice data;
determining the electronic device with the smallest distance parameter as the target electronic device.
Preferably, the determining of a target electronic device closest to the sender of the voice data from the plurality of electronic devices comprises:
acquiring attribute information of the voice data;
determining the electronic device whose attribute information satisfies a preset condition as the target electronic device.
A voice data processing system, comprising:
a plurality of electronic devices, each running a voice assistant and configured to acquire voice data through the voice assistant;
a server configured to acquire voice data simultaneously sent by the voice assistants of a plurality of electronic devices belonging to the same group, determine, from the plurality of electronic devices, a target electronic device closest to the sender of the voice data, and control the voice assistant of the target electronic device to respond to the voice data.
Preferably, the server is further configured to judge whether the plurality of electronic devices are located in the same network, and is specifically configured to determine the target electronic device closest to the sender of the voice data from among the electronic devices belonging to the same network.
Preferably, the server is further configured to judge whether the voice data simultaneously sent by the voice assistants of the plurality of electronic devices belong to the same voiceprint, and is specifically configured to determine the target electronic device closest to the sender of the voice data from among the electronic devices whose voice data belong to the same voiceprint.
Preferably, the electronic device is configured to determine a distance parameter between itself and the sender of the voice data based on attribute information of the voice data;
the server is specifically configured to receive the distance parameters sent by the plurality of electronic devices and determine the electronic device with the smallest distance parameter as the target electronic device.
Preferably, the server is specifically configured to acquire attribute information of the voice data and determine the electronic device whose attribute information satisfies a preset condition as the target electronic device.
As can be seen from the above technical solutions, compared with the prior art, the present invention discloses a voice data processing method comprising: receiving voice data simultaneously sent by voice assistants of a plurality of electronic devices belonging to the same group, determining, from the plurality of electronic devices, a target electronic device closest to the sender of the voice data, and controlling the voice assistant of the target electronic device to respond to the voice data. Thus, when the voice assistants of several electronic devices all receive voice data uttered by the user, only the voice assistant of the target electronic device closest to the user responds, so that no manual operation by the user is needed and the user experience is improved.
Drawings
To illustrate the embodiments of the present invention or the technical solutions of the prior art more clearly, the drawings used in describing them are briefly introduced below. The drawings described below show only embodiments of the present invention; those skilled in the art can derive other drawings from them without creative effort.
Fig. 1 is a schematic flowchart of a voice data processing method according to an embodiment of the present invention;
Fig. 2 is a schematic structural diagram of an application scenario of a voice data processing method according to an embodiment of the present invention;
Fig. 3 is a schematic flowchart of a voice data processing method according to another embodiment of the present invention;
Fig. 4 is a schematic flowchart of a voice data processing method according to another embodiment of the present invention;
Fig. 5 is a schematic structural diagram of a voice data processing system according to an embodiment of the present invention.
Detailed Description
The technical solutions in the embodiments of the present invention will be clearly and completely described below with reference to the drawings in the embodiments of the present invention, and it is obvious that the described embodiments are only a part of the embodiments of the present invention, and not all of the embodiments. All other embodiments, which can be derived by a person skilled in the art from the embodiments given herein without making any creative effort, shall fall within the protection scope of the present invention.
An embodiment of the present invention discloses a voice data processing method, as shown in fig. 1, the method includes the following steps:
Step 101: acquiring voice data simultaneously sent by voice assistants of a plurality of electronic devices belonging to the same group;
The data processing method can be applied to a server. The server receives voice data sent by the voice assistants of electronic devices and, when it receives voice data from the voice assistants of several electronic devices, identifies the voice data that were sent at the same time by devices belonging to the same group.
It is understood that "simultaneously" literally means at the same moment, but because of network delay and other factors, "simultaneously" in the present invention can mean that the time difference lies within a preset time, where the preset time is very short, for example 0.5 second or 1 second. For example, if the voice data sent by the voice assistants of two electronic devices arrive 1 second apart, the two electronic devices can be considered to have sent the voice data simultaneously.
Optionally, among the voice data simultaneously sent by the voice assistants of multiple electronic devices, the server needs to identify the voice data sent by devices belonging to the same group. The group to which an electronic device belongs may be determined by the identity of the electronic device: electronic devices having the same identity belong to the same group. The identity may be one set by the user to indicate that several electronic devices belong to the same holder; the present invention does not limit its specific implementation form.
It should be noted that the identification of the electronic device may refer to the identification of the electronic device itself, or may refer to the identification of the voice assistant of the electronic device.
The voice assistant of the electronic device may carry the identity when sending the voice data to the server, or the identity of each electronic device may be stored in the server in advance, so that the server performs the grouping in advance.
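The grouping and "simultaneous" checks of step 101 can be sketched as follows. This is a minimal illustration, not the patented implementation: the `Upload` record, its field names, and the rule of measuring the window from the first upload of each batch are assumptions made for the example; only the idea of a shared identity and a short preset time (such as 1 second) comes from the text.

```python
from collections import defaultdict
from dataclasses import dataclass


@dataclass
class Upload:
    device_id: str      # hypothetical device identifier
    group_id: str       # identity shared by devices of the same holder
    received_at: float  # server receive time, in seconds


def simultaneous_groups(uploads, window=1.0):
    """Bucket uploads by group identity, then split each bucket into
    batches whose timestamps lie within `window` seconds of the first
    upload in the batch."""
    by_group = defaultdict(list)
    for u in uploads:
        by_group[u.group_id].append(u)

    batches = []
    for members in by_group.values():
        members.sort(key=lambda u: u.received_at)
        current = [members[0]]
        for u in members[1:]:
            if u.received_at - current[0].received_at <= window:
                current.append(u)
            else:
                batches.append(current)
                current = [u]
        batches.append(current)
    return batches


uploads = [
    Upload("phone", "home-A", 10.0),
    Upload("speaker", "home-A", 10.4),
    Upload("tv", "home-B", 10.2),
    Upload("phone", "home-A", 15.0),
]
batches = simultaneous_groups(uploads)
batch_ids = [[u.device_id for u in b] for b in batches]
```

Here the phone and the speaker of group "home-A" fall into one batch because their uploads are 0.4 second apart, while the later phone upload and the television of another group each form a batch of their own.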
Step 102: determining a target electronic device closest to a sender of voice data from among a plurality of electronic devices;
In practical applications, a user who wants to interact by voice with an electronic device is generally close to it; that is, if there are several electronic devices around the user, the user is closest to the one he or she wants to interact with. Therefore, in the present application, the server can determine the target electronic device closest to the sender of the voice data from the plurality of electronic devices.
At least one of the plurality of electronic devices is determined as the target electronic device closest to the sender of the voice data.
Optionally, in an embodiment, determining a target electronic device closest to a sender of voice data from among the plurality of electronic devices includes the following processes:
(1) receiving distance parameters respectively sent by a plurality of electronic devices;
The distance parameter is the distance between an electronic device and the sender of the voice data, determined by that electronic device based on the attribute information of the voice data.
The electronic device calculates the distance parameter between itself and the sender of the voice data based on the attribute information of the voice data collected by its voice assistant, and sends the distance parameter to the server. It should be noted that the electronic device may carry the distance parameter in the voice data sent to the server, or send the distance parameter to the server separately.
The attribute information of the voice data includes one or more of the following parameters: signal-to-noise ratio, energy intensity. The electronic device stores in advance the correspondence between different attribute information and distance parameters, looks up the distance parameter corresponding to the attribute information of the voice data in the stored correspondence, and sends that distance parameter to the server.
(2) determining the electronic device with the smallest distance parameter as the target electronic device.
After receiving the distance parameters sent by the electronic equipment, the server determines the electronic equipment with the minimum distance parameter as the target electronic equipment from a plurality of electronic equipment belonging to the same group.
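The server-side selection in process (2) amounts to taking a minimum over the reported distance parameters. A minimal sketch, assuming the reports of one group arrive as a mapping from a device identifier to a distance (the dictionary shape and the identifiers are illustrative only):

```python
def pick_target(distance_by_device):
    """Return the device id with the smallest reported distance parameter.

    If several devices report the same smallest value, the first one
    in the mapping is returned.
    """
    if not distance_by_device:
        raise ValueError("no distance parameters received")
    return min(distance_by_device, key=distance_by_device.get)


reports = {"device_201": 0.8, "device_202": 2.5}
target = pick_target(reports)
```

With these sample reports, `device_201` is selected because 0.8 is the smaller distance.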
Optionally, in another embodiment, determining a target electronic device closest to a sender of voice data from among the plurality of electronic devices includes the following processes:
(1) acquiring attribute information of the voice data;
After receiving the voice data sent by the voice assistant of an electronic device, the server determines the attribute information of the voice data, where the attribute information includes one or more of the following parameters: signal-to-noise ratio, energy intensity.
(2) determining the electronic device whose attribute information satisfies a preset condition as the target electronic device.
In one implementation, the attribute information of the voice data sent by the voice assistants of the electronic devices in the same group is compared, and the electronic device whose attribute information satisfies a preset condition is determined as the target electronic device, where the preset condition is the condition indicating that the electronic device is closest to the sender of the voice data. In one embodiment, when the attribute information is the signal-to-noise ratio, the preset condition is having the best signal-to-noise ratio among the plurality of voice data; when the attribute information is the energy intensity, the preset condition is having the strongest energy intensity among the plurality of voice data. Taking the signal-to-noise ratio as an example, the electronic device whose voice data has the best signal-to-noise ratio among the devices in the same group is determined as the target electronic device.
In another implementation, the attribute information of the voice data sent by the voice assistants of the electronic devices in the same group is compared with preset attribute information, and an electronic device whose attribute information matches the preset attribute information is determined as the target electronic device.
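The two implementations above can be contrasted in a short sketch. The attribute names `snr_db` and `energy`, the sample values, and the 20 dB threshold are assumptions chosen for illustration; the text only states that the best signal-to-noise ratio or the strongest energy intensity indicates the closest device, and that a comparison against preset attribute information is also possible.

```python
def closest_by_attribute(attributes, key="snr_db"):
    """Relative rule: the device whose voice data has the largest `key`."""
    return max(attributes, key=lambda device: attributes[device][key])


def meeting_preset(attributes, key="snr_db", threshold=20.0):
    """Absolute rule: every device whose `key` reaches a preset value."""
    return [d for d, a in attributes.items() if a[key] >= threshold]


attributes = {
    "device_201": {"snr_db": 24.0, "energy": 0.61},
    "device_202": {"snr_db": 15.5, "energy": 0.20},
}
best_snr = closest_by_attribute(attributes)
best_energy = closest_by_attribute(attributes, key="energy")
above_preset = meeting_preset(attributes)
```

The relative rule always yields exactly one device, whereas the absolute rule can yield none or several, which is one reading of the statement that at least one device is determined as the target.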
Step 103: controlling the voice assistant of the target electronic device to respond to the voice data.
The target electronic device is the electronic device closest to the sender of the voice data, i.e., the electronic device with which the user wants to interact by voice, and therefore the voice assistant of the target electronic device is controlled to respond to the voice data.
This embodiment of the invention provides a data processing method comprising: receiving voice data simultaneously sent by voice assistants of a plurality of electronic devices belonging to the same group, determining, from the plurality of electronic devices, a target electronic device closest to the sender of the voice data, and controlling the voice assistant of the target electronic device to respond to the voice data. Thus, when the voice assistants of several electronic devices all receive voice data uttered by the user, only the voice assistant of the target electronic device closest to the user responds, so that no manual operation by the user is needed and the user experience is improved.
In an application scenario, as shown in Fig. 2, the server 100 receives voice data simultaneously sent by the voice assistants of the electronic device 201, the electronic device 202 and the electronic device 203, and determines that the electronic device 201 and the electronic device 202 belong to the same group. It then determines that, of the electronic device 201 and the electronic device 202, the electronic device 201 is closest to the sender of the voice data, controls the voice assistant of the electronic device 201 to respond to the voice data it sent, and prohibits the electronic device 202 from responding to the voice data sent by its voice assistant. Since the electronic device 203 is in a different group from the electronic device 201 and the electronic device 202, the electronic device 203 is also controlled to respond to the voice data sent by its voice assistant.
That is, in the above scenario, the user has the electronic device 201 and the electronic device 202 around him and wants to perform voice interaction with the electronic device 201, and since the voice assistant of the electronic device 201 and the voice assistant of the electronic device 202 are both in the wake-up state, both the voice assistant of the electronic device 201 and the voice assistant of the electronic device 202 can receive the voice data and send it to the server.
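The Fig. 2 scenario can be traced with a small sketch of the server decision. The tuple layout `(device_id, group_id, distance)`, the group labels and the distances are made-up inputs for illustration; the sketch assumes all uploads passed in were already judged to be simultaneous.

```python
from collections import defaultdict


def decide_responses(uploads):
    """uploads: list of (device_id, group_id, distance) tuples that were
    received at the same time. Returns {device_id: bool}, where True
    means the device's voice assistant is told to respond."""
    by_group = defaultdict(list)
    for device_id, group_id, distance in uploads:
        by_group[group_id].append((distance, device_id))

    decisions = {}
    for members in by_group.values():
        _, target = min(members)  # smallest distance within the group
        for _, device_id in members:
            decisions[device_id] = device_id == target
    return decisions


uploads = [("201", "G1", 0.5), ("202", "G1", 3.0), ("203", "G2", 1.2)]
decisions = decide_responses(uploads)
```

Device 201 responds, device 202 is suppressed, and device 203 responds because it is alone in its group, matching the behavior described for the scenario.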
Another embodiment of the present invention discloses a method for processing voice data, as shown in fig. 3, the method includes the following steps:
Step 301: acquiring voice data simultaneously sent by voice assistants of a plurality of electronic devices belonging to the same group;
Step 302: judging whether the plurality of electronic devices are located in the same network; if so, proceeding to step 303; if not, proceeding to step 305;
Specifically, the plurality of electronic devices are devices that belong to the same group and whose voice assistants sent voice data to the server at the same time. Whether they are at the same location is determined by judging whether they are located in the same network: if they are, the electronic devices are determined to be at the same location; if they are not, the electronic devices are determined to be at different locations.
Optionally, whether the plurality of electronic devices are located in the same network may be determined from the network addresses or network identifiers of the electronic devices: electronic devices having the same network address or the same network identifier are treated as located in the same network, and electronic devices having different network addresses or different network identifiers are treated as located in different networks.
Step 303: determining a target electronic device closest to a sender of voice data from a plurality of electronic devices belonging to the same network;
Electronic devices belonging to the same network are at the same location, and the voice data collected by their voice assistants come from the same sender; therefore the target electronic device closest to the sender of the voice data is determined from among the electronic devices belonging to the same network.
Step 304: controlling the voice assistant of the target electronic device to respond to the voice data;
Step 305: controlling the voice assistants of the electronic devices belonging to different networks to respond to the voice data respectively.
Since electronic devices belonging to different networks are at different locations, even if their voice assistants send voice data to the server at the same time and the devices belong to the same group, the voice data they collected do not come from the same sender.
As an application scenario, electronic device A and electronic device B belong to the same group; electronic device A is located at the home of user 1, and electronic device B is located at the company of user 1. At some moment, family member 2 of user 1 interacts by voice with electronic device A while user 1 interacts by voice with electronic device B. In this case, even though electronic device A and electronic device B belong to the same group and their voice assistants send voice data to the server at the same time, the two electronic devices belong to different networks, so the server controls the voice assistant of electronic device A and the voice assistant of electronic device B to respond to the received voice data respectively.
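Steps 302 to 305 can be sketched as a partition by network followed by a per-network choice. The dictionary keys `id`, `network_id` and `distance` and the sample values are assumptions for illustration; treating each network as its own cohort is one way to generalize the two-device scenario to more devices and is not stated in this form in the text.

```python
from collections import defaultdict


def split_by_network(devices):
    """Partition devices of one group by their network identifier."""
    by_network = defaultdict(list)
    for d in devices:
        by_network[d["network_id"]].append(d)
    return list(by_network.values())


def respond_set(devices):
    """Within each network, only the device closest to the sender responds."""
    return [
        min(cohort, key=lambda d: d["distance"])["id"]
        for cohort in split_by_network(devices)
    ]


# Home/company scenario: different networks, so both devices respond.
different_networks = respond_set([
    {"id": "A", "network_id": "home-wifi", "distance": 1.0},
    {"id": "B", "network_id": "office-lan", "distance": 0.7},
])
# Same network: only the closer device responds.
same_network = respond_set([
    {"id": "A", "network_id": "home-wifi", "distance": 1.0},
    {"id": "B", "network_id": "home-wifi", "distance": 0.7},
])
```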
This embodiment of the invention provides a data processing method comprising: receiving voice data simultaneously sent by voice assistants of a plurality of electronic devices belonging to the same group, judging whether the plurality of electronic devices are located in the same network, determining the target electronic device closest to the sender of the voice data from among the electronic devices belonging to the same network, and controlling the voice assistant of the target electronic device to respond to the voice data. Thus, in this embodiment, when the voice assistants of several electronic devices all receive voice data uttered by the user, the target electronic device closest to the user is determined from among the devices in the same network and its voice assistant is controlled to respond, so that no manual operation by the user is needed, the user experience is improved, and the accuracy of data processing is improved.
Another embodiment of the present invention discloses a method for processing voice data, as shown in fig. 4, the method includes the following steps:
Step 401: acquiring voice data simultaneously sent by voice assistants of a plurality of electronic devices belonging to the same group;
Step 402: judging whether the voice data simultaneously sent by the voice assistants of the plurality of electronic devices belong to the same voiceprint; if so, proceeding to step 403; if not, proceeding to step 405;
Specifically, the plurality of electronic devices are devices that belong to the same group and whose voice assistants sent voice data to the server at the same time. Whether the voice data collected by their voice assistants come from the same sender is determined by judging whether the voice data belong to the same voiceprint: if the voice data belong to the same voiceprint, they are determined to come from the same sender; if they belong to different voiceprints, they are determined to come from different senders.
Step 403: determining the target electronic device closest to the sender of the voice data from among the electronic devices whose voice data belong to the same voiceprint;
Step 404: controlling the voice assistant of the target electronic device to respond to the voice data;
Step 405: controlling the voice assistants of the electronic devices whose voice data belong to different voiceprints to respond to the voice data respectively.
Since the voice data belong to different voiceprints, it can be determined that the voice data collected by the voice assistants of the different electronic devices come from different senders.
As an application scenario, electronic device A and electronic device B belong to the same group and are both located at the home of user 1. At some moment, user 1 interacts by voice with electronic device A while family member 2 of user 1 interacts by voice with electronic device B. In this case, even though electronic device A and electronic device B belong to the same group and their voice assistants send voice data to the server at the same time, the voice data collected by the two voice assistants come from different senders, so the server controls the voice assistant of electronic device A and the voice assistant of electronic device B to respond to the received voice data respectively.
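Steps 402 to 405 can be sketched as grouping uploads by speaker and choosing one device per speaker. The text does not say how voiceprints are compared; the sketch below assumes, purely for illustration, that each upload already carries a fixed-length voiceprint vector and that two vectors belong to the same voiceprint when their cosine similarity reaches a hypothetical threshold of 0.8.

```python
import math


def cosine(a, b):
    """Cosine similarity of two equal-length, non-zero vectors."""
    dot = sum(x * y for x, y in zip(a, b))
    norm_a = math.sqrt(sum(x * x for x in a))
    norm_b = math.sqrt(sum(y * y for y in b))
    return dot / (norm_a * norm_b)


def responders(uploads, threshold=0.8):
    """uploads: list of (device_id, voiceprint_vector, distance).
    Uploads are clustered by voiceprint; the closest device of each
    cluster is the one that responds."""
    clusters = []
    for upload in uploads:
        for cluster in clusters:
            # Compare against the first member of the cluster.
            if cosine(cluster[0][1], upload[1]) >= threshold:
                cluster.append(upload)
                break
        else:
            clusters.append([upload])
    return [min(c, key=lambda u: u[2])[0] for c in clusters]


same_speaker = responders([("A", [1.0, 0.0], 1.0), ("B", [0.9, 0.1], 2.0)])
two_speakers = responders([("A", [1.0, 0.0], 1.0), ("B", [0.0, 1.0], 2.0)])
```

When both devices hear the same speaker, only the closer device A responds; when the voiceprints differ, as in the scenario with user 1 and family member 2, both devices respond.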
This embodiment of the invention provides a data processing method comprising: receiving voice data simultaneously sent by voice assistants of a plurality of electronic devices belonging to the same group, judging whether the voice data simultaneously sent by the voice assistants of the plurality of electronic devices belong to the same voiceprint, determining the target electronic device closest to the sender of the voice data from among the electronic devices whose voice data belong to the same voiceprint, and controlling the voice assistant of the target electronic device to respond to the voice data. Thus, in this embodiment, when the voice assistants of several electronic devices all receive voice data uttered by the user, the target electronic device closest to the user is determined from among the devices whose voice data belong to the same voiceprint and its voice assistant is controlled to respond, so that no manual operation by the user is needed, the user experience is improved, and the accuracy of data processing is improved.
Corresponding to the above voice data processing method, an embodiment of the present invention further discloses a voice data processing system, which is described through the following embodiments:
An embodiment of the present invention discloses a voice data processing system. As shown in Fig. 5, the system includes: a server 100, an electronic device 201, an electronic device 202, and an electronic device 203;
The electronic device 201 and the electronic device 202 belong to the same group. The electronic device 203 does not belong to that group; it may belong to another group or to no group at all.
The electronic device 201, the electronic device 202, and the electronic device 203 each run a voice assistant, and when voice data is collected by the voice assistant, the voice data is sent to the server 100.
In the present invention, the voice data processing system includes a plurality of electronic devices, and the present embodiment only takes three electronic devices as an example.
The server 100 is configured to acquire voice data simultaneously sent by the voice assistants of a plurality of electronic devices belonging to the same group, determine, from the plurality of electronic devices, a target electronic device closest to the sender of the voice data, and control the voice assistant of the target electronic device to respond to the voice data. In this embodiment, the server 100 acquires the voice data simultaneously sent by the voice assistants of the electronic device 201 and the electronic device 202, determines which of the electronic device 201 and the electronic device 202 is the target electronic device closest to the sender of the voice data, and controls the voice assistant of that target electronic device to respond to the voice data.
It is understood that "simultaneously" means the same time, but due to the network delay and other factors, the "simultaneously" in the present invention can refer to the time difference being within a preset time, and the preset time is particularly short, such as 0.5 second, 1 second and so on.
Optionally, the server may determine the group to which an electronic device belongs from the identity of the electronic device: electronic devices having the same identity belong to the same group. The identity may be a user-defined identity indicating that multiple electronic devices belong to the same holder; the present invention does not limit its specific form.
It should be noted that the identity of the electronic device may be the identity of the electronic device itself or the identity of the voice assistant of the electronic device.
The voice assistant of the electronic device may carry the identity when sending the voice data to the server, or the identity of each electronic device may be stored in the server in advance, so that the server performs the grouping in advance.
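The two ways of supplying the identity described above (carried with the voice data, or stored on the server in advance) can be sketched as follows. The function name `group_by_identity`, the tuple layout, and the `registry` mapping are hypothetical names chosen for illustration.

```python
from collections import defaultdict


def group_by_identity(uploads, registry=None):
    """Group device ids by identity.

    uploads:  list of (device_id, identity or None) pairs; the identity is
              present when the voice assistant carried it with the voice data.
    registry: optional dict device_id -> identity stored on the server in advance.
    Returns (groups, ungrouped), where groups maps identity -> device ids.
    """
    registry = registry or {}
    groups = defaultdict(list)
    ungrouped = []
    for device_id, identity in uploads:
        if identity is None:
            identity = registry.get(device_id)  # fall back to stored identity
        if identity is None:
            ungrouped.append(device_id)         # device belongs to no group
        else:
            groups[identity].append(device_id)
    return dict(groups), ungrouped


groups, ungrouped = group_by_identity(
    [("201", "holder-A"), ("202", None), ("203", None)],
    registry={"202": "holder-A"},
)
```

In this example device 201 carries its identity, device 202 is resolved from the pre-stored registry, and device 203 has no identity and therefore belongs to no group.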
The server determines at least one electronic device from the plurality of electronic devices as the target electronic device closest to the sender of the voice data.
Optionally, in an embodiment, the electronic device is configured to determine a distance parameter between the electronic device and a sender of the voice data based on attribute information of the voice data collected by the voice assistant of the electronic device; the server is used for receiving the distance parameters respectively sent by the electronic devices and determining the electronic device with the minimum distance parameter as the target electronic device.
It should be noted that the electronic device may send the distance parameter directly to the server by carrying the distance parameter in voice data, or send the distance parameter to the server separately.
The attribute information of the voice data includes one or more of the following parameters: signal-to-noise ratio, energy intensity. The electronic device stores in advance the correspondence between attribute information and distance parameters, looks up the distance parameter corresponding to the attribute information of the collected voice data in that stored correspondence, and sends the distance parameter to the server. After receiving the distance parameters sent by the electronic devices, the server determines, from the plurality of electronic devices belonging to the same group, the electronic device with the minimum distance parameter as the target electronic device.
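The device-side lookup and server-side selection described above can be sketched as follows. The calibration table `ENERGY_TO_DISTANCE` and its values are invented for illustration, as is the assumption that stronger captured energy maps to a smaller distance parameter; the text does not specify the form of the stored correspondence or how ties are resolved.

```python
import bisect

# Hypothetical pre-stored correspondence: (minimum energy in dB, distance parameter).
# Assumption: louder captured speech corresponds to a smaller distance parameter.
ENERGY_TO_DISTANCE = [(-60.0, 5), (-40.0, 4), (-30.0, 3), (-20.0, 2), (-10.0, 1)]
_THRESHOLDS = [threshold for threshold, _ in ENERGY_TO_DISTANCE]


def distance_parameter(energy_db):
    """Device side: look up the distance parameter for a measured energy."""
    idx = bisect.bisect_right(_THRESHOLDS, energy_db) - 1
    idx = max(idx, 0)  # energies below the table fall into the farthest bucket
    return ENERGY_TO_DISTANCE[idx][1]


def pick_target(distance_by_device):
    """Server side: return the device id with the minimum distance parameter.

    On a tie, the first device in iteration order is returned (an assumption).
    """
    return min(distance_by_device, key=distance_by_device.get)


reports = {"201": distance_parameter(-12.0), "202": distance_parameter(-35.0)}
target = pick_target(reports)
```

Only the small integer distance parameter crosses the network in this variant, which keeps the server logic to a single minimum over the group.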
Optionally, in another embodiment, the server is specifically configured to obtain attribute information of the voice data, and determine that the electronic device whose attribute information meets the preset condition is the target electronic device.
The attribute information of the voice data includes one or more of the following parameters: signal-to-noise ratio parameters, energy intensity.
In one implementation manner, the server compares the attribute information of the voice data sent by the voice assistants of the plurality of electronic devices in the same group and determines the electronic device whose attribute information satisfies the preset condition as the target electronic device, where the preset condition identifies the electronic device closest to the sender of the voice data. For example, when the attribute information is the signal-to-noise ratio, the preset condition is having the best signal-to-noise ratio among the plurality of voice data; when the attribute information is the energy intensity, the preset condition is having the strongest energy intensity among the plurality of voice data.
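The server-side comparison in this implementation manner can be sketched as follows. The field names `snr_db` and `energy_db` are hypothetical, and "best signal-to-noise ratio" is interpreted here as the highest value, which is an assumption.

```python
def pick_target_by_attribute(attributes_by_device, key="snr_db"):
    """Return the device whose voice data has the largest value of `key`.

    attributes_by_device: dict device_id -> dict of attribute values measured
    by the server on the voice data each device sent.
    """
    return max(attributes_by_device, key=lambda d: attributes_by_device[d][key])


attrs = {
    "201": {"snr_db": 18.0, "energy_db": -22.0},
    "202": {"snr_db": 9.5, "energy_db": -15.0},
}
by_snr = pick_target_by_attribute(attrs, key="snr_db")
by_energy = pick_target_by_attribute(attrs, key="energy_db")
```

The example deliberately uses values where the two attributes disagree, to show that the chosen attribute determines the target; the text leaves open how one or more attributes are combined.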
In another implementation manner, when the server determines that the electronic device whose attribute information meets the preset condition is the target electronic device, the server may compare the attribute information of the voice data sent by the voice assistants of multiple electronic devices in the same group with a preset attribute information, and determine that the electronic device that meets the preset attribute information is the target electronic device.
Therefore, in the embodiment, when the voice assistants of the plurality of electronic devices all receive the voice data sent by the user, only the voice assistant of the target electronic device closest to the user responds to the voice data, so that manual operation of the user is not needed, and user experience is improved.
In this embodiment, the server is further configured to determine whether the plurality of electronic devices are located in the same network; accordingly, the server determines a target electronic device closest to the sender of the voice data from among the plurality of electronic devices, specifically, determines a target electronic device closest to the sender of the voice data from among the plurality of electronic devices belonging to the same network.
The server determines whether the plurality of electronic devices are located at the same position by judging whether the plurality of electronic devices are located in the same network, determines that the plurality of electronic devices are located at the same position if the plurality of electronic devices are located in the same network, and determines that the plurality of electronic devices are located at different positions if the plurality of electronic devices are located in different networks.
Optionally, the server may determine whether the plurality of electronic devices are located in the same network based on the network addresses or network identifications of the electronic devices: electronic devices having the same network address or the same network identification are located in the same network, and electronic devices having different network addresses or different network identifications are located in different networks.
In another embodiment of the present invention, the server is further configured to control the electronic devices belonging to different networks to respectively respond to the voice data sent by their voice assistants.
Therefore, in the present invention, when the voice assistants of a plurality of electronic devices all receive the voice data sent by the user, the target electronic device closest to the user can be determined from the electronic devices belonging to the same network, and the voice assistant of the target electronic device is controlled to respond to the voice data, so that no manual operation by the user is needed, user experience is improved, and the accuracy of data processing is improved.
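The network check described above can be sketched as follows: devices are partitioned by network identification, one target is chosen within each network, and devices that are alone in their network simply respond on their own. The device labels, network names, and the reuse of a distance parameter for the in-network choice are assumptions for illustration.

```python
from collections import defaultdict


def partition_by_network(network_by_device):
    """Map each network identification to the device ids located in it."""
    networks = defaultdict(list)
    for device_id, network_id in network_by_device.items():
        networks[network_id].append(device_id)
    return dict(networks)


def responders(network_by_device, distance_by_device):
    """Pick one responder per network: the device with the minimum distance parameter."""
    chosen = []
    for devices in partition_by_network(network_by_device).values():
        chosen.append(min(devices, key=lambda d: distance_by_device[d]))
    return sorted(chosen)


# Hypothetical group of three devices belonging to one holder.
networks = {"dev-a": "home-wifi", "dev-b": "home-wifi", "dev-c": "office-wifi"}
distances = {"dev-a": 2, "dev-b": 4, "dev-c": 3}
result = responders(networks, distances)
```

Devices `dev-a` and `dev-b` share a network, so only the closer one responds; `dev-c` is on a different network and responds to its own voice data.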
In this embodiment, the server is further configured to determine whether the voice data sent by the voice assistants of the multiple electronic devices simultaneously belong to the same voiceprint, and accordingly, the server determines a target electronic device closest to the sender of the voice data from the multiple electronic devices, specifically, determines a target electronic device closest to the sender of the voice data from the multiple electronic devices where the voice data belong to the same voiceprint.
The server determines whether the voice data collected by the voice assistants of the electronic devices were sent by the same sender by judging whether the voice data belong to the same voiceprint. If the voice data belong to the same voiceprint, the server determines that they were sent by the same sender; if the voice data belong to different voiceprints, the server determines that they were sent by different senders.
In another embodiment of the present invention, the server is further configured to control the electronic devices whose voice data belong to different voiceprints to respectively respond to the voice data sent by their voice assistants.
Therefore, in this embodiment, when the voice assistants of the multiple electronic devices all receive the voice data sent by the user, the target electronic device closest to the user can be determined from the electronic devices whose voice data belong to the same voiceprint, and the voice assistant of the target electronic device is controlled to respond to the voice data, so that no manual operation by the user is needed, user experience is improved, and the accuracy of data processing is improved.
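The voiceprint check described above can be sketched as follows. The text does not say how voiceprints are compared; this sketch assumes each upload has already been reduced to a non-zero speaker embedding vector and treats two uploads as the same voiceprint when their cosine similarity reaches a hypothetical threshold. The greedy clustering against the first member of each cluster is likewise an illustrative choice.

```python
import math

SAME_SPEAKER_THRESHOLD = 0.8  # hypothetical similarity threshold


def cosine(a, b):
    """Cosine similarity of two non-zero vectors of equal length."""
    dot = sum(x * y for x, y in zip(a, b))
    norm_a = math.sqrt(sum(x * x for x in a))
    norm_b = math.sqrt(sum(y * y for y in b))
    return dot / (norm_a * norm_b)


def cluster_by_voiceprint(embedding_by_device, threshold=SAME_SPEAKER_THRESHOLD):
    """Greedily cluster devices whose voice data appear to share a voiceprint."""
    clusters = []  # each cluster is a list of device ids; the first is its reference
    for device_id, embedding in embedding_by_device.items():
        for cluster in clusters:
            reference = embedding_by_device[cluster[0]]
            if cosine(embedding, reference) >= threshold:
                cluster.append(device_id)
                break
        else:
            clusters.append([device_id])  # no match: a different sender
    return clusters


embeddings = {"dev-a": [1.0, 0.0], "dev-b": [0.9, 0.1], "dev-c": [0.0, 1.0]}
clusters = cluster_by_voiceprint(embeddings)
```

Each resulting cluster corresponds to one sender: the closest device is then selected within a cluster, while devices in different clusters respond separately to their own voice data.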
The embodiments in this description are described in a progressive manner: each embodiment focuses on its differences from the other embodiments, and for the parts that are the same or similar the embodiments may be referred to one another. Because the system disclosed in the embodiments corresponds to the method disclosed in the embodiments, its description is brief; for relevant details, refer to the description of the method.
The previous description of the disclosed embodiments is provided to enable any person skilled in the art to make or use the present invention. Various modifications to these embodiments will be readily apparent to those skilled in the art, and the generic principles defined herein may be applied to other embodiments without departing from the spirit or scope of the invention. Thus, the present invention is not intended to be limited to the embodiments shown herein but is to be accorded the widest scope consistent with the principles and novel features disclosed herein.

Claims (8)

1. A method for processing voice data, comprising:
acquiring voice data simultaneously sent by voice assistants of a plurality of electronic devices belonging to the same group, wherein the electronic devices with the same identity belong to the same group, and the identity is an identity which is set by a user and used for representing that the electronic devices all belong to the same holder;
determining a target electronic device closest to a sender of voice data from the plurality of electronic devices;
controlling a voice assistant of the target electronic device to respond to the voice data;
wherein determining a target electronic device that is closest to a sender of voice data from the plurality of electronic devices comprises:
receiving distance parameters respectively sent by the plurality of electronic devices; the distance parameter is the distance between the electronic equipment and the sender of the voice data, which is determined by the electronic equipment based on the attribute information of the voice data;
and determining the electronic equipment with the minimum distance parameter as target electronic equipment.
2. The method of claim 1, further comprising:
judging whether the plurality of electronic devices are located in the same network;
correspondingly, the determining a target electronic device closest to the sender of the voice data from the plurality of electronic devices includes:
a target electronic device closest to a sender of voice data is determined among a plurality of electronic devices belonging to the same network.
3. The method of claim 1, further comprising:
judging whether the voice data sent by the voice assistants of the plurality of electronic devices at the same time belong to the same voiceprint;
correspondingly, the determining a target electronic device closest to the sender of the voice data from the plurality of electronic devices includes:
a target electronic device closest to a sender of voice data is determined from among a plurality of electronic devices whose voice data belong to the same voiceprint.
4. The method of claim 2 or 3, further comprising:
and if not, controlling the voice assistants of the electronic equipment to respectively respond to the voice data.
5. The method of claim 1, wherein determining a target electronic device from the plurality of electronic devices that is closest to a sender of voice data comprises:
acquiring attribute information of the voice data;
and determining the electronic equipment of which the attribute information meets the preset condition as target electronic equipment.
6. A voice data processing system, comprising:
a plurality of electronic devices, each running a voice assistant and configured to collect voice data through the voice assistant;
a server configured to acquire voice data simultaneously sent by the voice assistants of a plurality of electronic devices belonging to the same group, determine a target electronic device closest to the sender of the voice data from the plurality of electronic devices, and control the voice assistant of the target electronic device to respond to the voice data, wherein electronic devices with the same identity belong to the same group, and the identity is an identity set by a user to indicate that the plurality of electronic devices all belong to the same holder;
wherein determining a target electronic device that is closest to a sender of voice data from the plurality of electronic devices comprises:
receiving distance parameters respectively sent by the plurality of electronic devices; the distance parameter is the distance between the electronic equipment and the sender of the voice data, which is determined by the electronic equipment based on the attribute information of the voice data;
and determining the electronic equipment with the minimum distance parameter as target electronic equipment.
7. The system according to claim 6, wherein the server is further configured to determine whether the plurality of electronic devices are located in the same network, and in particular to determine a target electronic device closest to the sender of the voice data from among the plurality of electronic devices belonging to the same network.
8. The system according to claim 6, wherein the server is further configured to determine whether the voice data sent by the voice assistants of the plurality of electronic devices simultaneously belong to the same voiceprint, and in particular to determine a target electronic device closest to the sender of the voice data from among the plurality of electronic devices whose voice data belong to the same voiceprint.
CN201710703578.7A 2017-08-16 2017-08-16 Voice data processing method and system Active CN107452386B (en)

Priority Applications (1)

Application Number Priority Date Filing Date Title
CN201710703578.7A CN107452386B (en) 2017-08-16 2017-08-16 Voice data processing method and system

Publications (2)

Publication Number Publication Date
CN107452386A CN107452386A (en) 2017-12-08
CN107452386B true CN107452386B (en) 2020-03-24

Family

ID=60492616

Family Applications (1)

Application Number Title Priority Date Filing Date
CN201710703578.7A Active CN107452386B (en) 2017-08-16 2017-08-16 Voice data processing method and system

Country Status (1)

Country Link
CN (1) CN107452386B (en)

Families Citing this family (17)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN108597536A (en) * 2018-03-20 2018-09-28 成都星环科技有限公司 A kind of interactive system based on acoustic information positioning
CN108682414A (en) * 2018-04-20 2018-10-19 深圳小祺智能科技有限公司 Sound control method, voice system, equipment and storage medium
CN109243443B (en) * 2018-09-28 2022-05-31 联想(北京)有限公司 Voice control method and device and electronic equipment
CN109639961B (en) * 2018-11-08 2021-05-18 联想(北京)有限公司 Acquisition method and electronic equipment
CN111354361A (en) * 2018-12-21 2020-06-30 深圳市优必选科技有限公司 Emotion communication method and system and robot
CN111862988B (en) * 2019-04-26 2023-03-03 广东美的白色家电技术创新中心有限公司 Voice recognition device, wake-up response method thereof, and computer storage medium
CN112289313A (en) * 2019-07-01 2021-01-29 华为技术有限公司 Voice control method, electronic equipment and system
CN110415694A (en) * 2019-07-15 2019-11-05 深圳市易汇软件有限公司 A kind of method that more intelligent sound boxes cooperate
CN110474973B (en) 2019-08-08 2022-02-08 三星电子(中国)研发中心 Method, system and equipment for sharing intelligent engine by multiple equipment
CN110660389A (en) * 2019-09-11 2020-01-07 北京小米移动软件有限公司 Voice response method, device, system and equipment
CN111128150A (en) * 2019-11-27 2020-05-08 云知声智能科技股份有限公司 Method and device for awakening intelligent voice equipment
CN110910880B (en) * 2019-11-29 2022-05-10 广东美的厨房电器制造有限公司 Voice control method, system, device and storage medium
CN111614770B (en) * 2020-05-22 2022-06-17 云知声智能科技股份有限公司 Single awakening method, device and system
CN111614768B (en) * 2020-05-22 2022-10-18 云知声智能科技股份有限公司 Single awakening method, device and system
CN112750439B (en) * 2020-12-29 2023-10-03 恒玄科技(上海)股份有限公司 Speech recognition method, electronic device and storage medium
CN114968166A (en) * 2021-02-26 2022-08-30 华为技术有限公司 Voice interaction method and electronic equipment
CN114143133A (en) * 2021-11-26 2022-03-04 深圳康佳电子科技有限公司 Decentralized intelligent household appliance and voice management system thereof

Citations (2)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN105788599A (en) * 2016-04-14 2016-07-20 北京小米移动软件有限公司 Speech processing method, router and intelligent speech control system
CN106951209A (en) * 2017-03-29 2017-07-14 联想(北京)有限公司 A kind of control method, device and electronic equipment

Also Published As

Publication number Publication date
CN107452386A (en) 2017-12-08

Similar Documents

Publication Publication Date Title
CN107452386B (en) Voice data processing method and system
EP3340243B1 (en) Method for performing voice control on device with microphone array, and device thereof
CN103941686B (en) Sound control method and system
KR101698094B1 (en) Apparatus and method for providing service corresponding to a service zone
CN106714081B (en) Network connection method and intelligent device
US20160132029A1 (en) Method for configuring and controlling smart home products
CN107148072B (en) Method and system for acquiring target resource parameters of intelligent terminal application
CN105307166B (en) Terminal switching method, access device, terminal and system
CN103944983A (en) Error correction method and system for voice control instruction
CN104575503A (en) Speech recognition method and device
CN105554283A (en) Information processing method and electronic devices
WO2018095235A1 (en) Call forwarding method and system
CN105282326A (en) Control method, electronic equipment and electronic device
CN104486761A (en) Terminal communication method and device
TW200837716A (en) Method of recognizing voice commands cooperatively and system thereof
CN109994112A (en) Control method, server, speech recognition apparatus and the medium of speech recognition apparatus
CN102868819B (en) A kind of process the method and device of communication service under common account
CN105684475A (en) Proximity-based services
WO2018054061A1 (en) Short message recognition method, device, system and computer storage medium
CN104010269A (en) Method and device for sending file in communication terminal
EP3236681B1 (en) Automatic calling synchronization system and method
CN105578427B (en) Communications feedback method, response mobile terminal and communication system
CN105187679B (en) A kind of personalization electric control method and device
CN104219729B (en) connection processing method and electronic equipment
CN105911577A (en) Network positioning information processing method, network positioning information processing device and intelligent terminal

Legal Events

Date Code Title Description
PB01 Publication
SE01 Entry into force of request for substantive examination
GR01 Patent grant