CN110322878A

CN110322878A - Voice control method, electronic device and system

Info

Publication number: CN110322878A
Application number: CN201910586437.0A
Authority: CN
Inventors: 孙渊; 伍晓晖; 屈伸
Original assignee: Huawei Technologies Co Ltd
Current assignee: Huawei Technologies Co Ltd
Priority date: 2019-07-01
Filing date: 2019-07-01
Publication date: 2019-10-11
Also published as: CN112289313A; WO2021000876A1

Abstract

The application provides a voice control method, an electronic device and a system, and relates to the field of voice control technology. It addresses the problem that, in a multi-device scenario, only the voice assistant of the device nearest to the user is woken up and only that device can respond to the user's voice command, which may cause the response to fail. The specific scheme is as follows: in a multi-device scenario, after the user speaks the wake-up word, multi-device wake-up arbitration selects one of the devices to perform the wake-up response, and that device collects the voice command spoken by the user. Based on the collected voice command, multi-device capability arbitration selects a device that has the function of executing the event corresponding to the voice command, and that device executes the event, completing the response to the voice command.

Description

Voice control method, electronic device and system

Technical field

This application relates to the field of voice control technology, and in particular to a voice control method, an electronic device and a system.

Background

A voice assistant is an important application of artificial intelligence on mobile phones. Through a voice assistant, a mobile phone can interact intelligently with the user by way of conversation and instant question answering. The voice assistant can also recognize a voice command input by the user and trigger the mobile phone to automatically execute the event corresponding to that command. Normally the voice assistant is in a dormant state, and the user wakes it by voice before using it. Only after being woken up can the voice assistant receive and recognize voice commands input by the user. The voice data used for waking up may be called the wake-up word. Take the wake-up word "Xiao E, Xiao E" as an example. If the user wants to trigger the mobile phone to play music through the voice assistant, the user can first say "Xiao E, Xiao E" to wake up the voice assistant. After the voice assistant is woken up, the user then says "play music". The mobile phone receives and recognizes this voice command through the voice assistant and automatically plays music.

With the development of technology, voice control is used more and more widely. For example, many home devices now support a voice control function, which can be implemented by installing a voice assistant on the home device. As a result, the user's environment (for example, the user's home) may contain multiple devices that support voice control, that is, a multi-device scenario. In a multi-device scenario, if several of the devices share the same wake-up word, then after the user speaks the wake-up word the voice assistants of all devices with that wake-up word are woken up, and all of them recognize and respond to the voice command the user speaks next. For example, as shown in Fig. 1, the user's living room contains three devices: a speaker 101, a television set 102 and a mobile phone 103. All three have a voice assistant installed, and the wake-up word of each is "Xiao E, Xiao E". After the user says the wake-up word "Xiao E, Xiao E", the voice assistants of the speaker 101, the television set 102 and the mobile phone 103 are all woken up. When the user then says "play music", the speaker 101, the television set 102 and the mobile phone 103 all receive and recognize the voice command, and each automatically plays music.

In the prior art, a server or a local device (the local device may be any one of the above devices with a voice control function) can perform multi-device wake-up arbitration based on speech energy. One device is selected from the multiple devices that share the same wake-up word and its voice assistant is woken up, so that this device recognizes and responds to the user's voice command. Here the speech energy serves as an indication of the distance between a device and the user. Take multi-device wake-up arbitration performed by a server as an example. Referring again to Fig. 1, the server can use the speech energy to select, from the speaker 101, the television set 102 and the mobile phone 103, the device nearest to the user, for example the speaker 101, and wake up its voice assistant. The other devices do not respond to the wake-up word, that is, their voice assistants are not woken up. Thus, when the user goes on to speak a voice command, only the speaker 101 recognizes and responds to it.

The prior art has at least the following problem. In the above multi-device wake-up arbitration scheme, after the user speaks the wake-up word, the device nearest to the user wakes up its voice assistant and responds to the voice command the user speaks next. However, if that device cannot complete the event corresponding to the voice command, the response fails. For example, if the voice command is "navigate to a certain place" but the device nearest to the user, such as the speaker 101, has no navigation function, the response fails. In that case navigation by voice control cannot be completed unless the user moves near a device that has a navigation function, such as the mobile phone 103, and speaks the wake-up word and the voice command again.

Summary of the invention

The embodiments of the present application provide a voice control method, an electronic device and a system. They address the problem that, in a multi-device scenario, only the voice assistant of the device nearest to the user is woken up and only that device can respond to the user's voice command, which may cause the response to fail.

To achieve the above objective, the embodiments of the present application adopt the following technical solutions:

In a first aspect, an embodiment of the present application provides a voice control method. The method can be applied to a voice control system that may include a group of devices and a server, where the group of devices includes at least a first electronic device and a second electronic device that both have a voice control function. The method may include the following. When the user wants to use the voice control function of a device, the user speaks the corresponding wake-up word, referred to here as first voice data. The first electronic device and the second electronic device each receive the user's first voice data. When the first electronic device determines that the first voice data is the same as the wake-up word registered in the first electronic device, it sends to the server the energy information of the first voice data that it detected. When the second electronic device determines that the first voice data is the same as the wake-up word registered in the second electronic device, it sends to the server the energy information of the first voice data that it detected. Based on the energy information reported by the first electronic device and by the second electronic device, the server performs multi-device wake-up arbitration, that is, it decides which device performs the wake-up response. For example, if the energy of the first voice data detected by the first electronic device is greater than that detected by the second electronic device, the server determines that the first electronic device performs the wake-up response and sends a first wake-up indication to the first electronic device. In response to receiving the first wake-up indication, the first electronic device wakes up its voice control function. Then, after the user speaks a voice command, referred to here as second voice data, the first electronic device, whose voice control function is now awake, receives the user's second voice data and sends it to the server. The server performs multi-device capability arbitration according to the second voice data, that is, it decides which device executes the event corresponding to the second voice data. For example, the server determines a target electronic device from the group of devices, the target electronic device being one that has the function of executing the event corresponding to the second voice data. The server sends a content indication to the target electronic device, where the content indication is either the instruction corresponding to the second voice data or the data required to execute the event corresponding to the second voice data. The target electronic device then executes the event corresponding to the second voice data according to the content indication.
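The wake-up arbitration step of the first aspect can be illustrated with a minimal sketch. It assumes that each device reports a single numeric energy value for the detected wake-up word; the names `WakeReport`, `device_id` and `arbitrate_wakeup` are hypothetical and do not come from the application, which does not specify the format of the energy information.

```python
from dataclasses import dataclass


@dataclass(frozen=True)
class WakeReport:
    device_id: str   # hypothetical identifier for the reporting device
    energy: float    # assumed scalar energy of the detected wake-up word


def arbitrate_wakeup(reports):
    """Return the id of the device that should perform the wake-up response.

    The device that detected the wake-up word with the greatest energy wins;
    returns None when no device reported.
    """
    if not reports:
        return None
    return max(reports, key=lambda r: r.energy).device_id


reports = [
    WakeReport("speaker", 0.82),
    WakeReport("tv", 0.41),
    WakeReport("phone", 0.37),
]
winner = arbitrate_wakeup(reports)  # the server would send this device the first wake-up indication
```

In this sketch the server would send the first wake-up indication only to `winner`; how ties between equal energy values are resolved is not addressed by the text above and is left to `max`, which keeps the first such report.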

With the above technical solution, in a multi-device scenario, after the user speaks the wake-up word and the voice command, the server uses multi-device wake-up arbitration and multi-device capability arbitration. First, only one device, for example the device nearest to the user, is woken up to perform the wake-up response. Second, when the device performing the wake-up response does not have the function of executing the event corresponding to the voice command, the user neither has to move nor has to speak the wake-up word and the voice command again: a device that does have that function executes the event and completes the response to the voice command. This makes the electronic devices more intelligent, enables efficient interaction between the electronic devices and the user, and improves the user experience.

In one possible implementation, the group of devices may further include a third electronic device, where the third electronic device does not have a voice control function, or the third electronic device has a voice control function but the distance between it and the user is greater than its sound pickup distance. In this way, the coverage of voice control can extend beyond the pickup range of an electronic device. For example, the pickup distance of a television set equipped with 6 microphones is usually within 5 meters. With the method of the embodiments of the present application, even if the user is more than 5 meters from the television set, the television set can still be controlled by voice to automatically execute events such as playing a video. In addition, the user does not need to state explicitly that the video should be played on the television set, that is, the user does not need to specify the television set as the device that plays the video. The user only needs to say "play a certain video", and the method of this embodiment can trigger the television set to play the video automatically.

In another possible implementation, when the first voice data is received, the voice control functions of the first electronic device and the second electronic device have not been woken up.

In another possible implementation, the method may further include: the server sends a command response indication to the first electronic device, where the command response indication instructs the first electronic device to prompt the user that the event corresponding to the second voice data will be executed by the target electronic device; and the first electronic device, according to the command response indication, prompts the user that the event corresponding to the second voice data will be executed by the target electronic device. In this way, the device performing the wake-up response, that is, the first electronic device, uses a prompt such as a voice prompt to tell the user which device will respond to the voice command, which improves the user experience.

In another possible implementation, the server determining the target electronic device from the group of devices according to the second voice data may specifically include: the server selects, from the group of devices, the devices that have the function of executing the event corresponding to the second voice data, based on the capability information of each device in the group and on the second voice data. If only one device in the group has that function, the server determines that this device is the target electronic device. If multiple devices in the group have that function, the server determines one of those devices to be the target electronic device. In some embodiments, the target electronic device is any one of the multiple devices. In some other embodiments, the target electronic device satisfies at least one of the following conditions: the target electronic device is the device, among the multiple devices, at the shortest distance from the user; the target electronic device is powered on; the target electronic device has not been selected, within a preset time, to execute the event corresponding to other voice data; or the target electronic device is the device, among the multiple devices, that the user uses most frequently. In this way, not only is a device that has the function of executing the event corresponding to the voice command selected to respond to the voice command, but the device that best matches the user's intention can be selected to execute the event, which makes voice control more intelligent and improves the user experience.
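One way to picture this capability arbitration is the sketch below. The application only requires that the target satisfy at least one of the listed conditions; the particular priority order used here (powered on, then not recently selected, then nearest, then most used) is an assumption made for illustration, as are the `Device` fields and the `select_target` name.

```python
from dataclasses import dataclass


@dataclass
class Device:
    device_id: str                    # hypothetical identifier
    capabilities: frozenset           # events the device can execute, e.g. {"navigation"}
    powered_on: bool = True
    recently_selected: bool = False   # already chosen for other voice data within the preset time
    distance_m: float = float("inf")  # assumed estimate of the distance to the user
    usage_count: int = 0              # assumed measure of how often the user uses the device


def select_target(devices, required_capability):
    """Pick the device that should execute the event, or None if no device can."""
    capable = [d for d in devices if required_capability in d.capabilities]
    if not capable:
        return None
    # Assumed tie-break order: powered on, not recently selected, nearest, most used.
    return min(
        capable,
        key=lambda d: (not d.powered_on, d.recently_selected, d.distance_m, -d.usage_count),
    )


devices = [
    Device("speaker", frozenset({"music"}), distance_m=1.0),
    Device("tv", frozenset({"music", "video"}), distance_m=4.0),
    Device("phone", frozenset({"music", "video", "navigation"}), distance_m=6.0),
]
target = select_target(devices, "navigation")
```

With these sample devices, a navigation command is routed to the phone even though the speaker is nearest, which is the failure case described in the background.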

In another possible implementation, the method may further include: each device in the group of devices reports its own capability information to the server, and the server stores the capability information of each device in the group. Using the stored capability information, the server can determine which devices have the function of executing the event corresponding to the voice command.
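The reporting and storing of capability information could look like the following sketch. The application does not define how capability information is encoded, so a plain set of capability names is assumed, and `CapabilityRegistry`, `report` and `devices_supporting` are hypothetical names.

```python
class CapabilityRegistry:
    """In-memory store of the capability information reported by each device."""

    def __init__(self):
        self._capabilities = {}

    def report(self, device_id, capabilities):
        # A later report from the same device replaces the earlier one.
        self._capabilities[device_id] = frozenset(capabilities)

    def devices_supporting(self, capability):
        # Sorted so the result does not depend on reporting order.
        return sorted(
            device_id
            for device_id, caps in self._capabilities.items()
            if capability in caps
        )


registry = CapabilityRegistry()
registry.report("speaker", {"music"})
registry.report("tv", {"music", "video"})
registry.report("phone", {"music", "video", "navigation"})
video_devices = registry.devices_supporting("video")
```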

In another possible implementation, the method may further include: the server sends a second wake-up indication to the second electronic device, and the second electronic device, according to the second wake-up indication, determines not to wake up its voice control function; or the second electronic device determines that it has not received the first wake-up indication within a preset time and therefore determines not to wake up its voice control function. After detecting the wake-up word, the second electronic device can thus decide that no wake-up response is needed, based either on feedback from the server or on the absence of feedback within the preset time.
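The two ways in which a device decides not to wake up can be sketched with a queue that stands in for the channel from the server. The indication constants, the `should_wake` name and the use of `queue.Queue` as a transport are assumptions for illustration only.

```python
import queue

# Hypothetical message values; the application does not define a wire format.
FIRST_WAKE_INDICATION = "first_wake_indication"
SECOND_WAKE_INDICATION = "second_wake_indication"


def should_wake(inbox, preset_time_s):
    """Decide whether this device wakes up its voice control function.

    Wakes only if the first wake-up indication arrives within the preset time.
    A second wake-up indication, or no indication at all, means "do not wake".
    """
    try:
        indication = inbox.get(timeout=preset_time_s)
    except queue.Empty:
        return False  # nothing received within the preset time
    return indication == FIRST_WAKE_INDICATION


inbox = queue.Queue()
inbox.put(SECOND_WAKE_INDICATION)
wakes = should_wake(inbox, preset_time_s=0.05)
```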

In a second aspect, an embodiment of the present application provides a voice control method. The method can be applied to a group of devices that includes at least a first electronic device and a second electronic device, both having a voice control function. The method may include the following. When the user wants to use the voice control function of a device, the user speaks the corresponding wake-up word, referred to here as first voice data. The first electronic device and the second electronic device each receive the user's first voice data. When the first electronic device determines that the first voice data is the same as the wake-up word registered in the first electronic device, it obtains the energy information of the first voice data that it detected. When the second electronic device determines that the first voice data is the same as the wake-up word registered in the second electronic device, it sends the energy information of the first voice data that it detected to the first electronic device, which acts as the master device. The first electronic device, as the master device, performs multi-device wake-up arbitration, that is, it decides which device performs the wake-up response. For example, the first electronic device determines the device that performs the wake-up response, from among the first electronic device and the second electronic device, according to the energy information of the first voice data detected by each of them. If the energy of the first voice data detected by the first electronic device is greater than that detected by the second electronic device, the first electronic device is determined to perform the wake-up response and wakes up its own voice control function. Then, after the user speaks a voice command, referred to here as second voice data, the first electronic device, whose voice control function is now awake, receives the user's second voice data. If the energy of the first voice data detected by the second electronic device is greater than that detected by the first electronic device, the second electronic device is determined to perform the wake-up response. The first electronic device then sends a first wake-up indication to the second electronic device, and the second electronic device wakes up its voice control function in response to the first wake-up indication. Then, after the user speaks the voice command, that is, the second voice data, the second electronic device, whose voice control function is now awake, receives the user's second voice data and sends it to the first electronic device. The first electronic device performs multi-device capability arbitration according to the second voice data, that is, it decides which device executes the event corresponding to the second voice data. For example, the first electronic device determines a target electronic device from the group of devices, the target electronic device being one that has the function of executing the event corresponding to the second voice data. If the target electronic device is the first electronic device, the first electronic device analyzes the second voice data to obtain the corresponding instruction and executes the event corresponding to the second voice data according to that instruction; alternatively, the first electronic device obtains from a server the data required to execute the event corresponding to the second voice data and executes the event according to that data. If the target electronic device is not the first electronic device, the first electronic device sends a content indication to the target electronic device, where the content indication is either the instruction corresponding to the second voice data or the data required to execute the event corresponding to the second voice data, and the target electronic device executes the event corresponding to the second voice data according to the content indication.
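The final branch of the second aspect, in which the master device either executes the event itself or forwards a content indication, can be sketched as follows. The function name `route_command` and its two callbacks are hypothetical; the sketch covers only the case where the content indication carries an instruction, not the variant where data is fetched from a server.

```python
def route_command(master_id, target_id, instruction, execute_local, send_content_indication):
    """Run on the master device once the target electronic device is known."""
    if target_id == master_id:
        # The master is itself the target: execute the event locally.
        execute_local(instruction)
        return "executed_on_master"
    # Otherwise forward a content indication carrying the instruction.
    send_content_indication(target_id, instruction)
    return "forwarded"


executed = []   # instructions the master executed itself
sent = []       # (target, instruction) pairs forwarded to other devices


def forward(target, instruction):
    sent.append((target, instruction))


result_local = route_command("phone", "phone", "navigate", executed.append, forward)
result_remote = route_command("phone", "tv", "play_video", executed.append, forward)
```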

With the above technical solution, in a multi-device scenario, after the user speaks the wake-up word and the voice command, the electronic device acting as the master device uses multi-device wake-up arbitration and multi-device capability arbitration. First, only one device, for example the device nearest to the user, is woken up to perform the wake-up response. Second, when the device performing the wake-up response does not have the function of executing the event corresponding to the voice command, the user neither has to move nor has to speak the wake-up word and the voice command again: a device that does have that function executes the event and completes the response to the voice command. This makes the electronic devices more intelligent, enables efficient interaction between the electronic devices and the user, and improves the user experience.

In one possible implementation, the group of devices may further include a third electronic device, where the third electronic device does not have a voice control function, or the third electronic device has a voice control function but the distance between it and the user is greater than its sound pickup distance. In this way, the coverage of voice control can extend beyond the pickup range of an electronic device: even if the user is farther from an electronic device than its pickup range, that device can still be controlled by voice to automatically execute the corresponding event. In addition, the user does not need to state explicitly which electronic device should execute the event, that is, the user does not need to specify that electronic device as the one to execute it. The user only needs to say "do something", and the method of this embodiment can trigger the electronic device to execute the corresponding event automatically.

In another possible implementation, if the second electronic device is the device performing the wake-up response, the method may further include: the first electronic device sends a command response indication to the second electronic device, where the command response indication instructs the second electronic device to prompt the user that the event corresponding to the second voice data will be executed by the target electronic device; and the second electronic device, according to the command response indication, prompts the user that the event corresponding to the second voice data will be executed by the target electronic device. Alternatively, if the first electronic device is the device performing the wake-up response, the method further includes: the first electronic device prompts the user that the event corresponding to the second voice data will be executed by the target electronic device. In this way, the device performing the wake-up response uses a prompt such as a voice prompt to tell the user which device will respond to the voice command, which improves the user experience.

In another possible implementation, the first electronic device determining the target electronic device from the group of devices according to the second voice data may specifically include: the first electronic device selects, from the group of devices, the devices that have the function of executing the event corresponding to the second voice data, based on the capability information of each device in the group and on the second voice data. If only one device in the group has that function, the first electronic device determines that this device is the target electronic device. If multiple devices in the group have that function, the first electronic device determines one of those devices to be the target electronic device. In some embodiments, the target electronic device is any one of the multiple devices. In some other embodiments, the target electronic device satisfies at least one of the following conditions: the target electronic device is the device, among the multiple devices, at the shortest distance from the user; the target electronic device is powered on; the target electronic device has not been selected, within a preset time, to execute the event corresponding to other voice data; or the target electronic device is the device, among the multiple devices, that the user uses most frequently. In this way, not only is a device that has the function of executing the event corresponding to the voice command selected to respond to the voice command, but the device that best matches the user's intention can be selected to execute the event, which makes voice control more intelligent and improves the user experience.

In another possible implementation, the method may further include: each device in the group of devices other than the first electronic device reports its own capability information to the first electronic device, and the first electronic device stores the capability information of each device in the group. Using the stored capability information, the electronic device acting as the master device can determine which devices have the function of executing the event corresponding to the voice command.

In another possible implementation, if the first electronic device is the device performing the wake-up response, the method may further include: the first electronic device sends a second wake-up indication to the second electronic device, and the second electronic device, according to the second wake-up indication, determines not to wake up its voice control function; or the second electronic device determines that it has not received the first wake-up indication within a preset time and therefore determines not to wake up its voice control function. After detecting the wake-up word, the electronic device acting as a slave device can thus decide that no wake-up response is needed, based either on feedback from the master device or on the absence of feedback within the preset time.

In a third aspect, an embodiment of the present application provides a voice control method. The method can be applied to a first electronic device that has a voice control function. The first electronic device belongs to a group of devices that further includes a second electronic device having a voice control function. The method may include the following. When the user wants to use the voice control function of a device, the user speaks the corresponding wake-up word, referred to here as first voice data. The first electronic device receives the user's first voice data. When the first electronic device determines that the first voice data is the same as the wake-up word registered in the first electronic device, it sends to the server the energy information of the first voice data that it detected. The first electronic device receives a wake-up indication sent by the server. The server sends the wake-up indication after determining, from the energy information of the first voice data detected by the first electronic device and by the second electronic device, that the first electronic device performs the wake-up response; the energy of the first voice data detected by the first electronic device is greater than that detected by the second electronic device. In response to the wake-up indication, the first electronic device wakes up its voice control function. Then, after the user speaks a voice command, referred to here as second voice data, the first electronic device, whose voice control function is now awake, receives the user's second voice data and sends it to the server. The first electronic device receives a command response indication sent by the server, where the command response indication instructs the first electronic device to prompt the user that the event corresponding to the second voice data will be executed by a target electronic device. The target electronic device is a device that the server determines from the group of devices according to the second voice data and that has the function of executing the event corresponding to the second voice data. According to the command response indication, the first electronic device prompts the user that the event corresponding to the second voice data will be executed by the target electronic device.

With the above technical solution, in a multi-device scenario, after the user speaks the wake-up word, the devices in the group, including the first electronic device, each send the energy of the voice data they detected to the server so that the server can perform multi-device wake-up arbitration. If the first electronic device is the device performing the wake-up response, it sends the collected voice command to the server so that the server can perform multi-device capability arbitration. In this way, first, only one device, for example the device nearest to the user, is woken up to perform the wake-up response. Second, when the device performing the wake-up response does not have the function of executing the event corresponding to the voice command, the user neither has to move nor has to speak the wake-up word and the voice command again: a device that does have that function executes the event and completes the response to the voice command. This makes the electronic devices more intelligent, enables efficient interaction between the electronic devices and the user, and improves the user experience.

In one possible implementation, the group of devices may further include a third electronic device, where the third electronic device does not have a voice control function, or the third electronic device has a voice control function but the distance between it and the user is greater than its sound pickup distance.

In another possible implementation, when the first voice data is received, the voice control function of the first electronic device has not been woken up.

In another possible implementation, if the target electronic device is the first electronic device, the method may further include: the first electronic device receives a content indication sent by the server, where the content indication is either the instruction corresponding to the second voice data or the data required to execute the event corresponding to the second voice data; and the first electronic device executes the event corresponding to the second voice data according to the content indication.

In a fourth aspect, an embodiment of the present application provides a voice control method. The method can be applied to a second electronic device. The second electronic device belongs to a group of devices that further includes a first electronic device having a voice control function. The first electronic device is used to receive the user's first voice data and second voice data, where the first voice data is the wake-up word and the second voice data is a voice command. The method may include: the second electronic device receives a content indication, where the content indication is either the instruction corresponding to the second voice data or the data required to execute the event corresponding to the second voice data; and the second electronic device executes the event corresponding to the second voice data according to the content indication.
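The two forms of content indication described in this aspect can be sketched from the receiving device's side. The application does not specify a message format, so the `ContentIndication` fields and the returned strings below are purely illustrative assumptions.

```python
from dataclasses import dataclass
from typing import Optional


@dataclass
class ContentIndication:
    # Exactly one of the two fields is expected to be set (an assumption of this sketch).
    instruction: Optional[str] = None   # instruction corresponding to the voice command
    data: Optional[bytes] = None        # data required to execute the event, e.g. media content


def handle_content_indication(indication):
    """Describe how the receiving device would execute the event."""
    if indication.instruction is not None:
        return f"execute instruction: {indication.instruction}"
    if indication.data is not None:
        return f"execute with {len(indication.data)} bytes of data"
    raise ValueError("content indication carries neither an instruction nor data")


by_instruction = handle_content_indication(ContentIndication(instruction="play_video"))
by_data = handle_content_indication(ContentIndication(data=b"\x00\x01\x02"))
```

Note that the receiving device needs no voice control function of its own in this sketch, which matches the case where the second electronic device cannot pick up the user's voice.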

With the above technical solution, in a multi-device scenario, an electronic device that was not woken up can still execute the event, by means of the server's multi-device capability arbitration. When the device that performs the wake-up response lacks the function needed to execute the event corresponding to the voice command, the user does not need to move or to repeat the wake-up word and the voice command: a device that does have the function, such as the second electronic device, executes the event, completing the response to the voice command. This makes the electronic devices more intelligent, enables efficient interaction between the electronic devices and the user, and improves the user experience.

In one possible implementation, the second electronic device does not have a voice control function; or the second electronic device has a voice control function, but the distance between the second electronic device and the user is greater than the pickup distance of the second electronic device.

In another possible implementation, the second electronic device has a voice control function, and its distance from the user is less than or equal to the pickup distance of the second electronic device. The method may further include: the second electronic device receives the first voice data; and when the second electronic device determines that the first voice data is the same as the wake-up word registered in the second electronic device, it sends the energy information of the first voice data that it detected. When the first voice data is received, the voice control function of the second electronic device has not been woken up.

In another possible implementation, the method may further include: the second electronic device receives a second wake-up indication and, according to the second wake-up indication, determines not to wake up its voice control function; or the second electronic device determines that it has not received a first wake-up indication within a preset time, and therefore determines not to wake up its voice control function.
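The decision rule in this implementation can be sketched as follows. This is a minimal illustration only: the names `WakeIndication` and `should_wake` are hypothetical, and representing "no indication received within the preset time" as `None` is an assumption made for the example.

```python
from enum import Enum
from typing import Optional


class WakeIndication(Enum):
    """Hypothetical encoding of the two wake-up indications."""
    FIRST = "first"    # tells the device to perform the wake-up response
    SECOND = "second"  # tells the device not to perform the wake-up response


def should_wake(indication: Optional[WakeIndication]) -> bool:
    """Decide whether the device wakes its voice control function.

    `indication` is None when nothing arrived within the preset time
    (an assumption of this sketch); in that case the device stays asleep.
    """
    return indication is WakeIndication.FIRST
```

Under this sketch, both an explicit second wake-up indication and a timeout lead to the same outcome: the voice control function is not woken up.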

In a fifth aspect, an embodiment of this application provides a voice control method. The method can be applied to a first electronic device that has a voice control function. The first electronic device is included in a group of devices, and the group further includes a second electronic device that has a voice control function. The method may include: the first electronic device receives first voice data of a user; when the first electronic device determines that the first voice data is the same as the wake-up word registered in the first electronic device, it obtains the energy information of the first voice data that it detected; the first electronic device receives, from the second electronic device, the energy information of the first voice data detected by the second electronic device; according to the energy information of the first voice data detected by each of the two devices, the first electronic device determines which of the first electronic device and the second electronic device performs the wake-up response; if the energy of the first voice data detected by the first electronic device is greater than that detected by the second electronic device, the first electronic device determines that it performs the wake-up response itself, wakes up its voice control function, and, after the voice control function is woken up, receives second voice data of the user; if the energy of the first voice data detected by the second electronic device is greater than that detected by the first electronic device, the first electronic device determines that the second electronic device performs the wake-up response, sends a first wake-up indication to the second electronic device, and receives the second voice data sent by the second electronic device, where the second voice data is collected by the second electronic device after it wakes up its voice control function in response to the first wake-up indication and the user speaks the second voice data; the first electronic device determines a target electronic device from the group of devices according to the second voice data, where the target electronic device has the function needed to execute the event corresponding to the second voice data; if the target electronic device is the first electronic device, the first electronic device analyzes the second voice data to obtain the corresponding instruction and executes the event corresponding to the second voice data according to the instruction, or the first electronic device obtains from a server the data required to execute the event corresponding to the second voice data and executes the event according to the data; if the target electronic device is not the first electronic device, the first electronic device sends a content indication to the target electronic device, where the content indication is the instruction corresponding to the second voice data, or is the data required to execute the event corresponding to the second voice data, so that the target electronic device executes the event corresponding to the second voice data.
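The energy comparison performed by the first electronic device in this aspect can be sketched as below. The function name and the returned action strings are hypothetical. The text does not say what happens when the two energies are equal, so the tie-breaking rule here (the arbitrating device responds itself) is an assumption.

```python
from typing import Tuple


def arbitrate_on_first_device(own_energy: float, peer_energy: float) -> Tuple[str, str]:
    """Return (responder, action) from the first device's point of view.

    own_energy  -- energy of the wake-up word as detected by the first device
    peer_energy -- energy reported by the second device
    """
    if peer_energy > own_energy:
        # The second device heard the wake-up word more strongly, so it
        # should respond; the first device tells it to wake up.
        return ("second", "send_first_wake_indication")
    # Assumption: on a tie, the arbitrating (first) device responds itself.
    return ("first", "wake_own_voice_control")
```

For example, `arbitrate_on_first_device(0.2, 0.6)` selects the second device, after which the first device would wait for the second voice data forwarded by it.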

In one possible implementation, the group of devices may further include a third electronic device, where the third electronic device does not have a voice control function; or the third electronic device has a voice control function, but the distance between the third electronic device and the user is greater than the pickup distance of the third electronic device.

In another possible implementation, when the first voice data is received, the voice control function of the first electronic device has not been woken up.

In another possible implementation, if the second electronic device is the device that performs the wake-up response, the method may further include: the first electronic device sends a command response indication to the second electronic device, where the command response indication instructs the second electronic device to prompt the user that the event corresponding to the second voice data will be executed by the target electronic device. Alternatively, if the first electronic device is the device that performs the wake-up response, the method may further include: the first electronic device prompts the user that the event corresponding to the second voice data will be executed by the target electronic device.

In another possible implementation, the first electronic device determining the target electronic device from the group of devices according to the second voice data may specifically include: the first electronic device selects, from the group of devices, the devices that have the function needed to execute the event corresponding to the second voice data, according to the capability information of each device in the group and the second voice data. If exactly one device in the group has the function, the first electronic device determines that device to be the target electronic device. If multiple devices in the group have the function, the first electronic device determines one of them to be the target electronic device. In some embodiments, the target electronic device is any one of the multiple devices. In some other embodiments, the target electronic device satisfies at least one of the following conditions: the target electronic device is the device, among the multiple devices, that is nearest to the user; the target electronic device is powered on; the target electronic device has not been selected within a preset time to execute the event corresponding to other voice data; or the target electronic device is the device, among the multiple devices, that the user uses most frequently.
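A minimal sketch of this capability arbitration is given below. The `Device` record, its field names, and the function `select_target` are hypothetical. The text only requires that the target satisfy at least one of the listed conditions, so the particular priority order used in the sort key (powered on, then not busy, then nearest, then most used) is an assumption made for illustration.

```python
from dataclasses import dataclass
from typing import List, Optional, Set


@dataclass
class Device:
    name: str
    capabilities: Set[str]   # reported capability information
    distance_m: float        # distance to the user
    powered_on: bool = True
    busy: bool = False       # already chosen for another voice command recently
    use_count: int = 0       # proxy for how frequently the user uses the device


def select_target(devices: List[Device], required: str) -> Optional[Device]:
    """Pick the device that executes the event needing capability `required`."""
    candidates = [d for d in devices if required in d.capabilities]
    if not candidates:
        return None
    if len(candidates) == 1:
        # Exactly one capable device: it is the target.
        return candidates[0]
    # Several capable devices. Assumed priority order: powered on,
    # not busy, nearest to the user, most frequently used.
    return min(
        candidates,
        key=lambda d: (not d.powered_on, d.busy, d.distance_m, -d.use_count),
    )
```

With a phone 3 m away and a powered-off car unit 6 m away that both support navigation, this sketch selects the phone. A different deployment could weight the conditions differently or simply pick any capable device, as the first embodiment allows.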

In another possible implementation, the method may further include: the first electronic device receives the capability information reported by each device in the group other than the first electronic device; and the first electronic device stores the capability information of each device in the group.

In another possible implementation, if the first electronic device is the device that performs the wake-up response, the method may further include: the first electronic device sends a second wake-up indication to the second electronic device, where the second wake-up indication instructs the second electronic device not to perform the wake-up response.

In a sixth aspect, an embodiment of this application provides a voice control method applied to a server. The server is included in a voice control system, and the voice control system further includes a group of devices that includes at least a first electronic device and a second electronic device, each having a voice control function. The method may include: the server receives the energy information of the first voice data detected by the first electronic device, sent by the first electronic device, and the energy information of the first voice data detected by the second electronic device, sent by the second electronic device; according to the two pieces of energy information, the server determines that the first electronic device performs the wake-up response and sends a first wake-up indication to the first electronic device, where the energy of the first voice data detected by the first electronic device is greater than that detected by the second electronic device; the server receives second voice data sent by the first electronic device; the server determines a target electronic device from the group of devices according to the second voice data, where the target electronic device has the function needed to execute the event corresponding to the second voice data; and the server sends a content indication to the target electronic device, where the content indication is the instruction corresponding to the second voice data, or is the data required to execute the event corresponding to the second voice data, and instructs the target electronic device to execute the event corresponding to the second voice data.
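The content indication in this aspect carries one of two things: the instruction corresponding to the voice command, or the data needed to execute the event. One way to model that is sketched below. The `ContentIndication` type and `build_content_indication` helper are hypothetical, and requiring exactly one of the two payloads is an assumption of this sketch rather than something the text states.

```python
from dataclasses import dataclass
from typing import Optional


@dataclass(frozen=True)
class ContentIndication:
    target: str   # identifier of the target electronic device
    kind: str     # "instruction" or "data"
    payload: str  # the instruction itself, or the data needed for the event


def build_content_indication(
    target: str,
    instruction: Optional[str] = None,
    data: Optional[str] = None,
) -> ContentIndication:
    """Build the message the server sends to the target device."""
    # Assumption: a single indication carries either an instruction or data.
    if (instruction is None) == (data is None):
        raise ValueError("provide exactly one of instruction or data")
    if instruction is not None:
        return ContentIndication(target, "instruction", instruction)
    return ContentIndication(target, "data", data)
```

For a command such as "play music", the server might send an instruction to a device that can fetch the music itself, or send the audio data to a device that can only play what it receives.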

With the above technical solution, in a multi-device scenario, after the user says the wake-up word and the voice command, the server can perform multi-device wake-up arbitration and multi-device capability arbitration. As a result, only one of the devices, such as the device nearest to the user, is woken up to perform the wake-up response. Moreover, when the device that performs the wake-up response lacks the function needed to execute the event corresponding to the voice command, the user does not need to move or to repeat the wake-up word and the voice command: a device that does have the function executes the event, completing the response to the voice command. This makes the electronic devices more intelligent, enables efficient interaction between the electronic devices and the user, and improves the user experience.

In another possible implementation, the method may further include: the server sends a command response indication to the first electronic device, where the command response indication instructs the first electronic device to prompt the user that the event corresponding to the second voice data will be executed by the target electronic device.

In another possible implementation, the server determining the target electronic device from the group of devices according to the second voice data may specifically include: the server selects, from the group of devices, the devices that have the function needed to execute the event corresponding to the second voice data, according to the capability information of each device in the group and the second voice data. If exactly one device in the group has the function, the server determines that device to be the target electronic device. If multiple devices in the group have the function, the server determines one of them to be the target electronic device. In some embodiments, the target electronic device is any one of the multiple devices. In some other embodiments, the target electronic device satisfies at least one of the following conditions: the target electronic device is the device, among the multiple devices, that is nearest to the user; the target electronic device is powered on; the target electronic device has not been selected within a preset time to execute the event corresponding to other voice data; or the target electronic device is the device, among the multiple devices, that the user uses most frequently.

In another possible implementation, the method may further include: the server receives the capability information reported by each device in the group of devices; and the server stores the capability information of each device in the group.
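The reporting and storing of capability information could look like the following in-memory sketch. `CapabilityRegistry` and its methods are hypothetical names, and modeling capability information as a set of capability strings is an assumption; the text does not specify the format of the reported information.

```python
from typing import Dict, List, Set


class CapabilityRegistry:
    """Stores the capability information reported by each device."""

    def __init__(self) -> None:
        self._capabilities: Dict[str, Set[str]] = {}

    def report(self, device_id: str, capabilities: Set[str]) -> None:
        # A later report from the same device replaces the earlier one.
        self._capabilities[device_id] = set(capabilities)

    def devices_with(self, capability: str) -> List[str]:
        # Sorted so that the result is deterministic.
        return sorted(
            device_id
            for device_id, caps in self._capabilities.items()
            if capability in caps
        )
```

The server would consult such a registry during capability arbitration to find the devices able to execute the event for a given voice command.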

In another possible implementation, the method may further include: the server sends a second wake-up indication to the second electronic device, where the second wake-up indication instructs the second electronic device not to perform the wake-up response.

In a seventh aspect, an embodiment of this application provides an electronic device, including one or more processors and a memory. The memory is coupled to the one or more processors and is configured to store computer program code, and the computer program code includes computer instructions. When the one or more processors execute the computer instructions, the electronic device performs the voice control method according to the third aspect or any possible implementation of the third aspect; or the electronic device performs the voice control method according to the fourth aspect or any possible implementation of the fourth aspect; or the electronic device performs the voice control method according to the fifth aspect or any possible implementation of the fifth aspect.

In an eighth aspect, an embodiment of this application provides a server, including one or more processors and a memory. The memory is coupled to the one or more processors and is configured to store computer program code, and the computer program code includes computer instructions. When the one or more processors execute the computer instructions, the server performs the voice control method according to the sixth aspect or any possible implementation of the sixth aspect.

In a ninth aspect, an embodiment of this application provides a computer storage medium including computer instructions. When the computer instructions are run on an electronic device, the electronic device is caused to perform the voice control method according to the third aspect or any possible implementation of the third aspect; or the voice control method according to the fourth aspect or any possible implementation of the fourth aspect; or the voice control method according to the fifth aspect or any possible implementation of the fifth aspect.

In a tenth aspect, an embodiment of this application provides a computer storage medium including computer instructions. When the computer instructions are run on a server, the server is caused to perform the voice control method according to the sixth aspect or any possible implementation of the sixth aspect.

In an eleventh aspect, an embodiment of this application provides a computer program product. When the computer program product is run on a computer, the computer is caused to perform the voice control method according to the third aspect or any possible implementation of the third aspect; or the voice control method according to the fourth aspect or any possible implementation of the fourth aspect; or the voice control method according to the fifth aspect or any possible implementation of the fifth aspect.

In a twelfth aspect, an embodiment of this application provides a computer program product. When the computer program product is run on a computer, the computer is caused to perform the voice control method according to the sixth aspect or any possible implementation of the sixth aspect.

In a thirteenth aspect, an embodiment of this application provides an apparatus that has the function of implementing the behavior of an electronic device, such as the first electronic device, the second electronic device, or the third electronic device, in the methods of the above aspects. The function may be implemented by hardware, or by hardware executing corresponding software. The hardware or software includes one or more modules corresponding to the function, for example, a receiving unit or module, a sending unit or module, and a wake-up unit or module.

In a fourteenth aspect, an embodiment of this application provides an apparatus that has the function of implementing the behavior of the server in the methods of the above aspects. The function may be implemented by hardware, or by hardware executing corresponding software. The hardware or software includes one or more modules corresponding to the function, for example, a sending unit or module, a receiving unit or module, and a determining unit or module.

In a fifteenth aspect, an embodiment of this application provides a voice control system. The voice control system may include a group of devices and a server, where the group of devices includes at least a first electronic device and a second electronic device, each having a voice control function. The first electronic device and the second electronic device each receive first voice data of a user. The first electronic device determines that the first voice data is the same as the wake-up word registered in the first electronic device, and sends to the server the energy information of the first voice data that it detected. The second electronic device determines that the first voice data is the same as the wake-up word registered in the second electronic device, and sends to the server the energy information of the first voice data that it detected. According to the two pieces of energy information, the server determines that the first electronic device performs the wake-up response and sends a first wake-up indication to the first electronic device, where the energy of the first voice data detected by the first electronic device is greater than that detected by the second electronic device. In response to the first wake-up indication, the first electronic device wakes up its voice control function. After the voice control function is woken up, the first electronic device receives second voice data of the user and sends the second voice data to the server. The server determines a target electronic device from the group of devices according to the second voice data, where the target electronic device has the function needed to execute the event corresponding to the second voice data. The server sends a content indication to the target electronic device, where the content indication is the instruction corresponding to the second voice data, or is the data required to execute the event corresponding to the second voice data. The target electronic device executes the event corresponding to the second voice data according to the content indication.

In one possible implementation, the group of devices may further include a third electronic device, where the third electronic device does not have a voice control function; or the third electronic device has a voice control function, but the distance between the third electronic device and the user is greater than the pickup distance of the third electronic device.

In a sixteenth aspect, an embodiment of this application provides a voice control system. The voice control system may include a group of devices that includes at least a first electronic device and a second electronic device, each having a voice control function. The first electronic device and the second electronic device each receive first voice data of a user. The first electronic device determines that the first voice data is the same as the wake-up word registered in the first electronic device, and obtains the energy information of the first voice data that it detected. The second electronic device determines that the first voice data is the same as the wake-up word registered in the second electronic device, and sends to the first electronic device the energy information of the first voice data that it detected. According to the two pieces of energy information, the first electronic device determines which of the first electronic device and the second electronic device performs the wake-up response. If the energy of the first voice data detected by the first electronic device is greater than that detected by the second electronic device, the first electronic device determines that it performs the wake-up response itself, wakes up its voice control function, and, after the voice control function is woken up, receives second voice data of the user. If the energy of the first voice data detected by the second electronic device is greater than that detected by the first electronic device, the first electronic device determines that the second electronic device performs the wake-up response and sends a first wake-up indication to the second electronic device; in response to the first wake-up indication, the second electronic device wakes up its voice control function and, after the voice control function is woken up, receives the second voice data of the user and sends it to the first electronic device. The first electronic device determines a target electronic device from the group of devices according to the second voice data, where the target electronic device has the function needed to execute the event corresponding to the second voice data. If the target electronic device is the first electronic device, the first electronic device analyzes the second voice data to obtain the corresponding instruction and executes the event corresponding to the second voice data according to the instruction; alternatively, the first electronic device obtains from a server the data required to execute the event corresponding to the second voice data and executes the event according to the data. If the target electronic device is not the first electronic device, the first electronic device sends a content indication to the target electronic device, where the content indication is the instruction corresponding to the second voice data, or is the data required to execute the event corresponding to the second voice data; and the target electronic device executes the event corresponding to the second voice data according to the content indication.

It should be understood that the descriptions of technical features, technical solutions, beneficial effects, or similar language in this application do not imply that all of the features and advantages can be realized in any single embodiment. Rather, a description of a feature or beneficial effect means that at least one embodiment includes the specific technical feature, technical solution, or beneficial effect. Therefore, descriptions of technical features, technical solutions, or beneficial effects in this specification do not necessarily refer to the same embodiment. Furthermore, the technical features, technical solutions, and beneficial effects described in the embodiments may be combined in any appropriate manner. Those skilled in the art will understand that an embodiment can be implemented without one or more of the specific technical features, technical solutions, or beneficial effects of a particular embodiment. In other embodiments, additional technical features and beneficial effects may also be identified in particular embodiments that do not appear in all embodiments.

Brief description of the drawings

Fig. 1 is a schematic diagram of a multi-device voice control scenario provided by an embodiment of this application;

Fig. 2 is a simplified schematic diagram of a voice control system provided by an embodiment of this application;

Fig. 3 is a schematic structural diagram of an electronic device provided by an embodiment of this application;

Fig. 4 is a schematic flowchart of a voice control method provided by an embodiment of this application;

Fig. 5 is a schematic diagram of another multi-device voice control scenario provided by an embodiment of this application;

Fig. 6 is a schematic diagram of yet another multi-device voice control scenario provided by an embodiment of this application;

Fig. 7 is a schematic flowchart of another voice control method provided by an embodiment of this application.

Detailed description of the embodiments

Hereinafter, the terms "first" and "second" are used for descriptive purposes only and shall not be understood as indicating or implying relative importance, or as implicitly indicating the number of the technical features referred to. Thus, a feature qualified by "first" or "second" may explicitly or implicitly include one or more such features. In the description of the embodiments, unless otherwise stated, "a plurality of" means two or more.

The voice control method provided by the embodiments of this application can be applied to a group of devices. The group may include multiple devices, of which at least two have a voice control function and share the same wake-up word. In the embodiments of this application, this application scenario is referred to as a multi-device scenario. In a multi-device scenario, after the user says the wake-up word and the voice command, with the method of the embodiments, even if the device that has the function needed to execute the event corresponding to the voice command is not the device nearest to the user, that device can still execute the event, completing the response to the voice command. This makes the electronic devices more intelligent, enables efficient interaction between the electronic devices and the user, and improves the user experience.

In some embodiments, an electronic device can implement the voice control function by having a voice assistant installed in it. The voice assistant is normally in a dormant state. Before using the voice control function of the electronic device, the user can wake up the voice assistant by voice. The voice data used to wake up the voice assistant may be referred to as a wake-up word (or wake-up voice). The wake-up word can be registered in the electronic device in advance. In the embodiments, waking up the voice assistant means that the electronic device starts the voice assistant in response to the wake-up word spoken by the user. The voice control function means that, after the voice assistant of the electronic device is started, the user can trigger the electronic device to automatically execute the event corresponding to a voice command by speaking the voice command (for example, a segment of voice data).

In addition, the voice assistant may be an embedded application in the electronic device (that is, a system application of the electronic device), or may be a downloadable application. An embedded application is an application provided as part of the implementation of the electronic device (such as a mobile phone). A downloadable application is an application that can provide its own Internet Protocol Multimedia Subsystem (IMS) connection. A downloadable application may be pre-installed in the electronic device, or may be a third-party application that the user downloads and installs in the electronic device.

The implementations of the embodiments of this application are described in detail below with reference to the accompanying drawings.

Fig. 2 is a schematic diagram of the composition of a voice control system provided by an embodiment of this application. The voice control system can be applied to the group of devices described above. The devices in the group satisfy one or more of the following conditions: they are connected to the same wireless access point (such as a WiFi access point), they are logged in to the same account, or they have been placed in the same group by the user.
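The grouping conditions can be illustrated with a short check. The `DeviceInfo` record, its field names, and `in_same_group` are hypothetical. The text says the devices satisfy one or more of the conditions without saying which combination a given deployment requires, so treating any single shared attribute as sufficient is an assumption of this sketch.

```python
from dataclasses import dataclass
from typing import Optional


@dataclass
class DeviceInfo:
    access_point: Optional[str] = None  # wireless access point the device uses
    account: Optional[str] = None       # account logged in on the device
    user_group: Optional[str] = None    # group assigned by the user, if any


def in_same_group(a: DeviceInfo, b: DeviceInfo) -> bool:
    """Return True if the two devices share at least one grouping attribute."""

    def same(x: Optional[str], y: Optional[str]) -> bool:
        # A missing attribute never counts as a match.
        return x is not None and x == y

    return (
        same(a.access_point, b.access_point)
        or same(a.account, b.account)
        or same(a.user_group, b.user_group)
    )
```

For instance, a phone and a speaker on different networks would still be grouped under this sketch if both are logged in to the same account.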

As an example, the group of devices may include at least two electronic devices, for example, a first electronic device 201 and a second electronic device 202. Both the first electronic device 201 and the second electronic device 202 have a voice control function; for example, a voice assistant is installed on each of them. The wake-up word used to wake up the voice assistant is the same on both devices, for example, "Xiao E Xiao E".

Generally, when the distance between an electronic device (such as the first electronic device 201 or the second electronic device 202) and the user is less than or equal to a preset distance, such as 5 meters, the electronic device can detect the wake-up word after the user speaks it and determine whether the voice assistant in the device needs to be woken up. In this embodiment, the distances between the user and the first electronic device 201 and between the user and the second electronic device 202 are both less than or equal to the preset distance. That is, after the user speaks the wake-up word "Xiao E, Xiao E", both the first electronic device 201 and the second electronic device 202 can detect the wake-up word.

In this embodiment, multi-device wake-up arbitration may be performed, so that only one of the first electronic device 201 and the second electronic device 202 responds to the wake-up word. That is, only one device wakes up its voice assistant. After the user goes on to speak a voice command, that device recognizes the voice command.
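A minimal sketch of such an arbitration follows. The selection rule is an assumption: the sketch picks the device that reports the highest energy for the detected wake-up word, whereas this paragraph only requires that exactly one device responds. The function name `arbitrate_wakeup` and the device identifiers are hypothetical.

```python
from typing import Dict


def arbitrate_wakeup(energy_by_device: Dict[str, float]) -> str:
    """Pick the single device that should respond to the wake-up word.

    Assumption: the device with the highest reported energy wins.
    Ties are broken by device ID so that the result is deterministic.
    """
    if not energy_by_device:
        raise ValueError("no device detected the wake-up word")
    # max() returns the first maximal element, so iterating over the
    # sorted IDs makes the smallest ID win a tie.
    return max(sorted(energy_by_device), key=lambda d: energy_by_device[d])


winner = arbitrate_wakeup({"device_201": 0.42, "device_202": 0.77})
```

Only the returned device would wake up its voice assistant; the other devices would stay dormant.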

In addition, multi-device capability arbitration may also be performed, that is, it is determined whether the device that woke up its voice assistant has the function needed to execute the event corresponding to the voice command. If that device does not have the function, the event may be handed over to a device that does have the function for execution.

For example, after the user speaks the wake-up word "Xiao E, Xiao E", the second electronic device 202 responds to the wake-up word, that is, the second electronic device 202 wakes up its voice assistant. It then receives and recognizes the voice command "navigate to a certain place" spoken by the user. However, the second electronic device 202 has no navigation function while the first electronic device 201 does, so the first electronic device 201 may execute the event corresponding to the voice command "navigate to a certain place". Alternatively, the group of devices may also include other electronic devices, such as a third electronic device 204. If the third electronic device 204 has a navigation function, the third electronic device 204 may execute the event corresponding to the voice command "navigate to a certain place". The distance between the third electronic device 204 and the user may be less than or equal to the preset distance, or may be greater than the preset distance. In addition, the third electronic device 204 may or may not have a voice control function.
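The navigation example can be sketched as follows. This is an illustrative sketch only: the capability names, the device identifiers, and the rule of picking the first capable device in sorted order are assumptions, since the text does not say how to choose among several capable devices.

```python
from typing import Dict, Optional, Set


def choose_executor(
    woken_device: str,
    required_capability: str,
    capabilities: Dict[str, Set[str]],
) -> Optional[str]:
    """Return the device that should execute the event, or None."""
    # Prefer the device that woke up, if it can execute the event itself.
    if required_capability in capabilities.get(woken_device, set()):
        return woken_device
    # Otherwise hand over to another device that has the function
    # (assumption: the first capable device in sorted order).
    for device in sorted(capabilities):
        if required_capability in capabilities[device]:
            return device
    return None


capabilities = {
    "device_201": {"navigation", "play_music"},
    "device_202": {"play_music"},
    "device_204": {"navigation", "play_video"},
}
executor = choose_executor("device_202", "navigation", capabilities)
```

Here the second electronic device woke up but cannot navigate, so the event is handed to a device with the navigation function.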

In some embodiments, the device that performs the multi-device wake-up arbitration and the multi-device capability arbitration may be either of the first electronic device 201 and the second electronic device 202. In this embodiment, the device that performs the multi-device wake-up arbitration and the multi-device capability arbitration may be referred to as the master device. The capability information of multiple devices is stored in the master device in advance. The multiple devices include the first electronic device 201 and the second electronic device 202, and may also include other electronic devices, such as the third electronic device 204.

In other embodiments, the device that performs the multi-device wake-up arbitration and the multi-device capability arbitration may also be a server. As shown in Fig. 2, the voice control system may further include a server 203. The server 203 can provide an intelligent voice service and stores the capability information of multiple devices in advance. For example, the first electronic device 201, the second electronic device 202, and other electronic devices (such as the third electronic device 204) may report their own capability information to the server 203 when powered on or restarted, so that the server stores it. As another example, an electronic device (such as the first electronic device 201, the second electronic device 202, or another electronic device) may report its own capability information to the server 203 periodically, so that the server stores it. An electronic device may also report its changed capability information to the server when it determines that its own capability information has changed, so that the server updates the stored capability information of that device.
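The server-side storage of capability information can be sketched as a small in-memory store. The class `CapabilityStore` and its methods are hypothetical; the sketch assumes that every report (on power-on, on restart, periodic, or on change) simply replaces the stored entry for that device.

```python
from typing import Dict, FrozenSet, Iterable


class CapabilityStore:
    """Hypothetical in-memory store of per-device capability information."""

    def __init__(self) -> None:
        self._store: Dict[str, FrozenSet[str]] = {}

    def report(self, device_id: str, capabilities: Iterable[str]) -> bool:
        """Store the latest report; return True if the stored entry changed."""
        new = frozenset(capabilities)
        changed = self._store.get(device_id) != new
        self._store[device_id] = new
        return changed

    def capabilities_of(self, device_id: str) -> FrozenSet[str]:
        """Return the stored capabilities, or an empty set if none were reported."""
        return self._store.get(device_id, frozenset())


store = CapabilityStore()
first_report = store.report("device_201", ["navigation", "play_music"])
periodic_report = store.report("device_201", ["play_music", "navigation"])
change_report = store.report("device_201", ["play_music"])
```

A periodic report that carries unchanged information leaves the store as it was, while a report after a capability change overwrites the old entry.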

For example, the electronic devices described in the embodiments of this application, such as the first electronic device 201, the second electronic device 202, and the third electronic device 204, may each be a mobile phone, a tablet computer, a desktop, laptop, or handheld computer, a notebook computer, an ultra-mobile personal computer (UMPC), a netbook, a cellular phone, a personal digital assistant (PDA), an augmented reality (AR) or virtual reality (VR) device, a media player, a television, a smart speaker, a smart watch, a smart headset, or a similar device. The embodiments of this application place no particular limitation on the specific form of the electronic device. For the specific structure of the electronic device, refer to the description of the embodiment corresponding to Fig. 3.

In addition, in some embodiments, the first electronic device 201, the second electronic device 202, and the third electronic device 204 may be electronic devices of the same type; for example, all three are mobile phones. In other embodiments, the first electronic device 201, the second electronic device 202, and the third electronic device 204 may be electronic devices of different types; for example, the first electronic device 201 is a mobile phone, the second electronic device 202 is a smart speaker, and the third electronic device 204 is a television (as shown in Fig. 2).

Refer to Fig. 3, which is a schematic structural diagram of an electronic device provided by an embodiment of this application.

As shown in Fig. 3, the electronic device may include a processor 110, an external memory interface 120, an internal memory 121, a universal serial bus (USB) interface 130, a charging management module 140, a power management module 141, a battery 142, an antenna 1, an antenna 2, a mobile communication module 150, a wireless communication module 160, an audio module 170, a speaker 170A, a receiver 170B, a microphone 170C, an earphone interface 170D, a sensor module 180, a key 190, a motor 191, an indicator 192, a camera 193, a display screen 194, a subscriber identification module (SIM) card interface 195, and so on. The sensor module 180 may include a pressure sensor 180A, a gyroscope sensor 180B, a barometric pressure sensor 180C, a magnetic sensor 180D, an acceleration sensor 180E, a distance sensor 180F, a proximity light sensor 180G, a fingerprint sensor 180H, a temperature sensor 180J, a touch sensor 180K, an ambient light sensor 180L, a bone conduction sensor 180M, and so on.

It can be understood that the structure illustrated in this embodiment does not constitute a specific limitation on the electronic device. In other embodiments, the electronic device may include more or fewer components than shown, combine some components, split some components, or arrange the components differently. The illustrated components may be implemented in hardware, software, or a combination of software and hardware.

The processor 110 may include one or more processing units. For example, the processor 110 may include an application processor (AP), a modem processor, a graphics processing unit (GPU), an image signal processor (ISP), a controller, a memory, a video codec, a digital signal processor (DSP), a baseband processor, and/or a neural-network processing unit (NPU). The different processing units may be independent devices or may be integrated in one or more processors.

The controller may be the nerve center and command center of the electronic device. The controller may generate operation control signals according to the instruction operation code and timing signals, and thereby control instruction fetching and instruction execution.

In the embodiments of this application, a wake-up word (such as "Xiao E, Xiao E") may be set in the electronic device. The DSP may monitor voice data in real time through the microphone 170C of the electronic device. When the DSP detects voice data, it may check the detected voice data to determine whether it is suspected to be the wake-up word set in the electronic device. If the check passes and the AP of the electronic device is in a dormant state, the DSP may wake up the AP and notify the AP to check the received voice data again. When this second check also passes, the AP may determine that the voice data matches the wake-up word set in the electronic device.
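The two-stage check described above can be sketched as follows. The function `verify_wake_word` and the two toy checkers are hypothetical stand-ins: a real DSP check and AP check would run acoustic models on audio, not compare byte strings.

```python
from typing import Callable, List


def verify_wake_word(
    voice_data: bytes,
    dsp_check: Callable[[bytes], bool],
    ap_check: Callable[[bytes], bool],
) -> bool:
    """Run the first (DSP) check, then the second (AP) check only if needed."""
    if not dsp_check(voice_data):
        # The data is not even suspected to be the wake-up word,
        # so the AP is never woken up.
        return False
    # The DSP wakes the AP, which checks the same voice data again.
    return ap_check(voice_data)


calls: List[str] = []


def dsp_check(data: bytes) -> bool:
    calls.append("dsp")
    return data.startswith(b"xiao")  # toy stand-in for a coarse check


def ap_check(data: bytes) -> bool:
    calls.append("ap")
    return data == b"xiao e xiao e"  # toy stand-in for a precise check


rejected_early = verify_wake_word(b"hello", dsp_check, ap_check)
calls_after_reject = list(calls)
accepted = verify_wake_word(b"xiao e xiao e", dsp_check, ap_check)
```

The point of the design is that the second check runs only on data that has already passed the first check, so the AP can stay dormant most of the time.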

A memory may also be provided in the processor 110 to store instructions and data. In some embodiments, the memory in the processor 110 is a cache. The memory may hold instructions or data that the processor 110 has just used or uses cyclically. If the processor 110 needs to use the instructions or data again, it can call them directly from this memory. This avoids repeated access and reduces the waiting time of the processor 110, thereby improving system efficiency.

In some embodiments, the processor 110 may include one or more interfaces. The interfaces may include an inter-integrated circuit (I2C) interface, an inter-integrated circuit sound (I2S) interface, a pulse code modulation (PCM) interface, a universal asynchronous receiver/transmitter (UART) interface, a mobile industry processor interface (MIPI), a general-purpose input/output (GPIO) interface, a subscriber identity module (SIM) interface, and/or a universal serial bus (USB) interface.

The charging management module 140 is used to receive charging input from a charger. The charger may be a wireless charger or a wired charger. In some wired charging embodiments, the charging management module 140 may receive the charging input of a wired charger through the USB interface 130. In some wireless charging embodiments, the charging management module 140 may receive wireless charging input through the wireless charging coil of the electronic device. While charging the battery 142, the charging management module 140 may also supply power to the electronic device through the power management module 141.

The power management module 141 is used to connect the battery 142, the charging management module 140, and the processor 110. The power management module 141 receives input from the battery 142 and/or the charging management module 140 and supplies power to the processor 110, the internal memory 121, the external memory, the display screen 194, the camera 193, the wireless communication module 160, and so on. The power management module 141 may also be used to monitor parameters such as battery capacity, battery cycle count, and battery health (leakage, impedance). In other embodiments, the power management module 141 may also be provided in the processor 110. In still other embodiments, the power management module 141 and the charging management module 140 may be provided in the same device.

The wireless communication function of the electronic device may be implemented by the antenna 1, the antenna 2, the mobile communication module 150, the wireless communication module 160, the modem processor, the baseband processor, and so on.

The antenna 1 and the antenna 2 are used to transmit and receive electromagnetic wave signals. Each antenna in the electronic device may be used to cover one or more communication bands. Different antennas may also be multiplexed to improve antenna utilization. For example, the antenna 1 may be multiplexed as a diversity antenna of a wireless local area network. In other embodiments, an antenna may be used in combination with a tuning switch.

The mobile communication module 150 may provide wireless communication solutions applied to the electronic device, including 2G/3G/4G/5G. The mobile communication module 150 may include at least one filter, a switch, a power amplifier, a low noise amplifier (LNA), and so on. The mobile communication module 150 may receive electromagnetic waves through the antenna 1, filter and amplify the received electromagnetic waves, and send them to the modem processor for demodulation. The mobile communication module 150 may also amplify the signal modulated by the modem processor and convert it into electromagnetic waves radiated through the antenna 1. In some embodiments, at least some functional modules of the mobile communication module 150 may be provided in the processor 110. In some embodiments, at least some functional modules of the mobile communication module 150 may be provided in the same device as at least some modules of the processor 110. For example, in some embodiments of this application, the mobile communication module 150 may interact with the server: after voice data matching the wake-up word is detected, it sends the energy information of the detected voice data to the server and receives a wake-up indication returned by the server, so that the electronic device can determine from the wake-up indication whether to respond to the wake-up. As another example, it receives a content indication sent by the server, so that the electronic device executes the event corresponding to the user's voice command according to the content indication.

The wireless communication module 160 may provide wireless communication solutions applied to the electronic device, including wireless local area network (WLAN) (such as a wireless fidelity (Wi-Fi) network), Bluetooth (BT), global navigation satellite system (GNSS), frequency modulation (FM), near field communication (NFC), and infrared (IR) technology. The wireless communication module 160 may be one or more devices integrating at least one communication processing module. The wireless communication module 160 receives electromagnetic waves via the antenna 2, frequency-modulates and filters the electromagnetic wave signal, and sends the processed signal to the processor 110. The wireless communication module 160 may also receive a signal to be sent from the processor 110, frequency-modulate and amplify it, and convert it into electromagnetic waves radiated through the antenna 2. For example, in some embodiments of this application, the wireless communication module 160 may interact with other electronic devices: after voice data matching the wake-up word is detected, it sends the energy information of the detected voice data to another electronic device and receives a wake-up indication returned by that electronic device, so that the electronic device can determine from the wake-up indication whether to respond to the wake-up. As another example, it receives a content indication sent by another electronic device, so that the electronic device executes the event corresponding to the user's voice command according to the content indication.

In some embodiments, the antenna 1 of the electronic device is coupled to the mobile communication module 150 and the antenna 2 is coupled to the wireless communication module 160, so that the electronic device can communicate with networks and other devices through wireless communication technologies. The wireless communication technologies may include global system for mobile communications (GSM), general packet radio service (GPRS), code division multiple access (CDMA), wideband code division multiple access (WCDMA), time-division code division multiple access (TD-SCDMA), long term evolution (LTE), BT, GNSS, WLAN, NFC, FM, and/or IR technology. The GNSS may include the global positioning system (GPS), the global navigation satellite system (GLONASS), the Beidou navigation satellite system (BDS), the quasi-zenith satellite system (QZSS), and/or satellite based augmentation systems (SBAS).

The electronic device implements the display function through the GPU, the display screen 194, the application processor, and so on. The GPU is a microprocessor for image processing and connects the display screen 194 and the application processor. The GPU performs mathematical and geometric calculations for graphics rendering. The processor 110 may include one or more GPUs, which execute program instructions to generate or change display information.

The display screen 194 is used to display images, videos, and so on. The display screen 194 includes a display panel. The display panel may be a liquid crystal display (LCD), an organic light-emitting diode (OLED), an active-matrix organic light-emitting diode (AMOLED), a flex light-emitting diode (FLED), a Mini LED, a Micro LED, a Micro OLED, quantum dot light-emitting diodes (QLED), and so on. In some embodiments, the electronic device may include 1 or N display screens 194, where N is a positive integer greater than 1.

The electronic device may implement the shooting function through the ISP, the camera 193, the video codec, the GPU, the display screen 194, the application processor, and so on.

The ISP is used to process the data fed back by the camera 193. For example, when a photo is taken, the shutter opens, light passes through the lens to the photosensitive element of the camera, the optical signal is converted into an electrical signal, and the photosensitive element passes the electrical signal to the ISP for processing, which converts it into an image visible to the naked eye. The ISP may also apply algorithmic optimization to the noise, brightness, and skin tone of the image. The ISP may also optimize parameters of the shooting scene such as exposure and color temperature. In some embodiments, the ISP may be provided in the camera 193.

The camera 193 is used to capture still images or video. An object generates an optical image through the lens, and the image is projected onto the photosensitive element. The photosensitive element may be a charge coupled device (CCD) or a complementary metal-oxide-semiconductor (CMOS) phototransistor. The photosensitive element converts the optical signal into an electrical signal, which is then passed to the ISP and converted into a digital image signal. The ISP outputs the digital image signal to the DSP for processing. The DSP converts the digital image signal into an image signal in a standard format such as RGB or YUV. In some embodiments, the electronic device may include 1 or N cameras 193, where N is a positive integer greater than 1.

The digital signal processor is used to process digital signals. In addition to digital image signals, it can process other digital signals. For example, when the electronic device selects a frequency point, the digital signal processor is used to perform a Fourier transform on the frequency point energy.

The video codec is used to compress or decompress digital video. The electronic device may support one or more video codecs. In this way, the electronic device can play or record video in multiple coding formats, such as moving picture experts group (MPEG) 1, MPEG2, MPEG3, and MPEG4.

The NPU is a neural-network (NN) computing processor. By drawing on the structure of biological neural networks, for example the transfer mode between neurons in the human brain, it processes input information quickly and can also learn continuously by itself. Applications such as intelligent cognition of the electronic device can be implemented through the NPU, for example image recognition, face recognition, speech recognition, and text understanding.

The external memory interface 120 may be used to connect an external memory card, such as a Micro SD card, to extend the storage capacity of the electronic device. The external memory card communicates with the processor 110 through the external memory interface 120 to implement a data storage function, for example saving files such as music and video in the external memory card.

The internal memory 121 may be used to store computer-executable program code, which includes instructions. The processor 110 executes the various functional applications and data processing of the electronic device by running the instructions stored in the internal memory 121. The internal memory 121 may include a program storage area and a data storage area. The program storage area may store the operating system and the applications required by at least one function (such as a sound playing function or an image playing function). The data storage area may store data created during use of the electronic device (such as audio data and a phone book). In addition, the internal memory 121 may include high-speed random access memory and may also include non-volatile memory, such as at least one magnetic disk storage device, a flash memory device, or universal flash storage (UFS).

The electronic device may implement audio functions, such as music playback and recording, through the audio module 170, the speaker 170A, the receiver 170B, the microphone 170C, the earphone interface 170D, the application processor, and so on.

The audio module 170 is used to convert digital audio information into an analog audio signal for output, and also to convert analog audio input into a digital audio signal. The audio module 170 may also be used to encode and decode audio signals. In some embodiments, the audio module 170 may be provided in the processor 110, or some functional modules of the audio module 170 may be provided in the processor 110.

The speaker 170A, also called a "loudspeaker", is used to convert an audio electrical signal into a sound signal. The user can listen to music or to a hands-free call through the speaker 170A of the electronic device.

The receiver 170B, also called an "earpiece", is used to convert an audio electrical signal into a sound signal. When the electronic device answers a call or plays a voice message, the user can hear the voice by holding the receiver 170B close to the ear.

The microphone 170C, also called a "mic" or "sound transducer", is used to convert a sound signal into an electrical signal. When making a call, sending a voice message, or triggering the electronic device to execute certain events through the voice assistant, the user can speak with the mouth close to the microphone 170C to input the sound signal into the microphone 170C. The electronic device may be provided with at least one microphone 170C. In other embodiments, the electronic device may be provided with two microphones 170C, which can implement a noise reduction function in addition to collecting sound signals. In still other embodiments, the electronic device may be provided with three, four, or more microphones 170C to collect sound signals, reduce noise, identify the sound source, implement a directional recording function, and so on.

The earphone interface 170D is used to connect wired earphones. The earphone interface 170D may be the USB interface 130, or may be a 3.5 mm open mobile terminal platform (OMTP) standard interface or a cellular telecommunications industry association of the USA (CTIA) standard interface.

The pressure sensor 180A is used to sense a pressure signal and can convert the pressure signal into an electrical signal. In some embodiments, the pressure sensor 180A may be provided in the display screen 194. There are many types of pressure sensor 180A, such as resistive pressure sensors, inductive pressure sensors, and capacitive pressure sensors. A capacitive pressure sensor may include at least two parallel plates made of conductive material. When a force acts on the pressure sensor 180A, the capacitance between the electrodes changes. The electronic device determines the intensity of the pressure from the change in capacitance. When a touch operation acts on the display screen 194, the electronic device detects the intensity of the touch operation by means of the pressure sensor 180A. The electronic device may also calculate the touch position from the detection signal of the pressure sensor 180A. In some embodiments, touch operations that act on the same touch position but have different touch intensities may correspond to different operation instructions. For example, when a touch operation whose intensity is less than a first pressure threshold acts on the short message application icon, an instruction to view short messages is executed. When a touch operation whose intensity is greater than or equal to the first pressure threshold acts on the short message application icon, an instruction to create a new short message is executed.
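The short message example can be sketched as a simple threshold comparison. The threshold value, the normalized intensity scale, and the instruction names are hypothetical; only the "less than" versus "greater than or equal to" split comes from the text.

```python
# Hypothetical first pressure threshold on a normalized 0..1 intensity scale.
FIRST_PRESSURE_THRESHOLD = 0.5


def sms_icon_instruction(
    intensity: float, threshold: float = FIRST_PRESSURE_THRESHOLD
) -> str:
    """Map the intensity of a touch on the short message icon to an instruction."""
    if intensity < threshold:
        return "view_sms"    # light press: view short messages
    return "create_sms"      # press at or above the threshold: new short message


light = sms_icon_instruction(0.2)
firm = sms_icon_instruction(0.9)
```

A touch exactly at the threshold falls into the second branch, matching the "greater than or equal to" wording.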

The gyroscope sensor 180B may be used to determine the motion posture of the electronic device. In some embodiments, the angular velocity of the electronic device around three axes (that is, the x, y, and z axes) may be determined through the gyroscope sensor 180B. The gyroscope sensor 180B may be used for image stabilization during shooting. For example, when the shutter is pressed, the gyroscope sensor 180B detects the angle by which the electronic device shakes, calculates from that angle the distance the lens module needs to compensate, and lets the lens counteract the shake of the electronic device through reverse motion, thereby achieving stabilization. The gyroscope sensor 180B may also be used in navigation and motion-sensing game scenarios.

The barometric pressure sensor 180C is used to measure air pressure. In some embodiments, the electronic device calculates altitude from the air pressure value measured by the barometric pressure sensor 180C to assist positioning and navigation.
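The text does not state how altitude is derived from air pressure. As one possible illustration, the sketch below uses the widely cited international barometric formula for a standard atmosphere; the constants 44330 and 5.255, the sea-level reference of 1013.25 hPa, and the function name `altitude_m` are assumptions and are not taken from this application.

```python
SEA_LEVEL_PRESSURE_HPA = 1013.25  # assumed standard sea-level pressure


def altitude_m(
    pressure_hpa: float, sea_level_hpa: float = SEA_LEVEL_PRESSURE_HPA
) -> float:
    """Estimate altitude in meters from air pressure in hPa.

    Uses the standard-atmosphere approximation
    h = 44330 * (1 - (p / p0) ** (1 / 5.255)).
    """
    if pressure_hpa <= 0 or sea_level_hpa <= 0:
        raise ValueError("pressures must be positive")
    return 44330.0 * (1.0 - (pressure_hpa / sea_level_hpa) ** (1.0 / 5.255))


at_sea_level = altitude_m(1013.25)
on_a_hill = altitude_m(950.0)
```

Lower measured pressure yields a higher estimated altitude, which is the property the positioning and navigation assistance relies on.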

The magnetic sensor 180D includes a Hall sensor. The electronic device may use the magnetic sensor 180D to detect the opening and closing of a flip leather case. In some embodiments, when the electronic device is a flip phone, the electronic device may detect the opening and closing of the flip cover by means of the magnetic sensor 180D. Features such as automatic unlocking on flip-open can then be set according to the detected open or closed state of the leather case or of the flip cover.

The acceleration sensor 180E can detect the magnitude of the acceleration of the electronic device in all directions (generally three axes). When the electronic device is stationary, it can detect the magnitude and direction of gravity. It can also be used to identify the posture of the electronic device and is applied in landscape/portrait switching, pedometers, and similar applications.

The distance sensor 180F is used to measure distance. The electronic device may measure distance by infrared or laser. In some embodiments, in a shooting scene, the electronic device may use the distance sensor 180F to measure distance in order to achieve fast focusing.

The proximity light sensor 180G may include, for example, a light-emitting diode (LED) and a light detector such as a photodiode. The light-emitting diode may be an infrared light-emitting diode. The electronic device emits infrared light outward through the light-emitting diode and uses the photodiode to detect infrared light reflected from nearby objects. When sufficient reflected light is detected, the electronic device can determine that there is an object nearby. When insufficient reflected light is detected, the electronic device can determine that there is no object nearby. The electronic device may use the proximity light sensor 180G to detect that the user is holding the electronic device close to the ear during a call, so that it can automatically turn off the screen to save power. The proximity light sensor 180G may also be used to automatically unlock and lock the screen in leather case mode and pocket mode.

The ambient light sensor 180L is used to sense ambient light brightness. The electronic device can adaptively adjust the brightness of the display screen 194 according to the sensed ambient light brightness. The ambient light sensor 180L may also be used to automatically adjust white balance when taking photos. The ambient light sensor 180L may also cooperate with the proximity light sensor 180G to detect whether the electronic device is in a pocket, in order to prevent accidental touches.

The fingerprint sensor 180H is used to collect fingerprints. The electronic device may use the collected fingerprint features to implement fingerprint unlocking, access to an application lock, fingerprint-triggered photographing, fingerprint-triggered call answering, and so on.

The temperature sensor 180J is used to detect temperature. In some embodiments, the electronic device executes a temperature handling policy using the temperature detected by the temperature sensor 180J. For example, when the temperature reported by the temperature sensor 180J exceeds a threshold, the electronic device reduces the performance of a processor located near the temperature sensor 180J, in order to reduce power consumption and implement thermal protection. In other embodiments, when the temperature is lower than another threshold, the electronic device heats the battery 142 to prevent low temperature from causing the electronic device to shut down abnormally. In still other embodiments, when the temperature is lower than yet another threshold, the electronic device boosts the output voltage of the battery 142 to avoid an abnormal shutdown caused by low temperature.
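The temperature handling policy can be sketched as a set of threshold checks. The text describes the three measures in separate embodiments and gives no threshold values; combining them in one function, the default thresholds, and the action names are all assumptions made for illustration.

```python
from typing import List


def thermal_actions(
    temp_c: float,
    high_c: float = 45.0,    # hypothetical over-temperature threshold
    low_c: float = 0.0,      # hypothetical battery-heating threshold
    boost_c: float = -10.0,  # hypothetical voltage-boost threshold
) -> List[str]:
    """Return the measures to apply for a reported temperature in Celsius."""
    actions: List[str] = []
    if temp_c > high_c:
        # Thermal protection: throttle the nearby processor to cut power.
        actions.append("reduce_processor_performance")
    if temp_c < low_c:
        # Keep the battery warm enough to avoid an abnormal shutdown.
        actions.append("heat_battery")
    if temp_c < boost_c:
        # Boost the battery output voltage at very low temperature.
        actions.append("boost_battery_output_voltage")
    return actions


hot = thermal_actions(50.0)
normal = thermal_actions(20.0)
very_cold = thermal_actions(-20.0)
```

With these assumed thresholds, a normal temperature triggers no measure, and a very low temperature triggers both low-temperature measures.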

The touch sensor 180K is also called a "touch panel". The touch sensor 180K may be provided in the display screen 194; the touch sensor 180K and the display screen 194 together form a touchscreen, also called a "touch-control screen". The touch sensor 180K is used to detect touch operations acting on or near it. The touch sensor may pass the detected touch operation to the application processor to determine the type of touch event. Visual output related to the touch operation may be provided through the display screen 194. In other embodiments, the touch sensor 180K may also be provided on the surface of the electronic device at a position different from that of the display screen 194.

The bone conduction sensor 180M can acquire vibration signals. In some embodiments, the bone conduction sensor 180M can acquire the vibration signal of the bone that vibrates when a person speaks. The bone conduction sensor 180M can also contact the human pulse and receive the blood pressure pulsation signal. In some embodiments, the bone conduction sensor 180M may also be provided in an earphone to form a bone conduction earphone. The audio module 170 can parse out a voice signal from the vibration signal of the vibrating bone acquired by the bone conduction sensor 180M, thereby implementing a voice function. The application processor can parse heart rate information from the blood pressure pulsation signal acquired by the bone conduction sensor 180M, thereby implementing a heart rate detection function.

The keys 190 include a power key, a volume key, and the like. A key 190 can be a mechanical key or a touch key. The electronic device can receive key input and generate key signal input related to user settings and function control of the electronic device.

The motor 191 can generate vibration alerts. The motor 191 can be used for incoming-call vibration alerts and for touch vibration feedback. For example, touch operations on different applications (such as photographing and audio playback) can correspond to different vibration feedback effects. Touch operations on different areas of the display screen 194 can also correspond to different vibration feedback effects of the motor 191. Different application scenarios (for example, time reminders, receiving messages, alarm clocks, and games) can also correspond to different vibration feedback effects. The touch vibration feedback effect can also be customized.

The indicator 192 can be an indicator light. It can be used to indicate the charging state and changes in battery level, and can also be used to indicate messages, missed calls, notifications, and the like.

The SIM card interface 195 is used to connect a SIM card. A SIM card can be inserted into or removed from the SIM card interface 195 to bring it into contact with or separate it from the electronic device. The electronic device can support 1 or N SIM card interfaces, where N is a positive integer greater than 1. The SIM card interface 195 can support a Nano SIM card, a Micro SIM card, a SIM card, and the like. Multiple cards can be inserted into the same SIM card interface 195 at the same time, and these cards can be of the same type or of different types. The SIM card interface 195 can also be compatible with different types of SIM cards and with external memory cards. The electronic device interacts with the network through the SIM card to implement functions such as calls and data communication. In some embodiments, the electronic device uses an eSIM, that is, an embedded SIM card. The eSIM card can be embedded in the electronic device and cannot be separated from it.

The methods in the following embodiments can all be implemented in an electronic device having the above hardware structure.

In the embodiments of this application, in the above multi-device scenario, after the user says the wake-up word and a voice command, multi-device wake-up arbitration selects one of the devices to give the wake-up response. In addition, through multi-device capability arbitration, when the device giving the wake-up response does not have the function needed to execute the event corresponding to the voice command, a device among the multiple devices that does have this function executes the event, completing the response to the voice command.

The multi-device wake-up arbitration and the multi-device capability arbitration can be implemented by one of the multiple devices, or by a server. The voice control method provided by the embodiments of this application is described in detail below according to which device implements the two arbitrations. In the following embodiments, with reference to Fig. 1, the multi-device scenario is taken to be as follows: three devices, speaker 101, television set 102 and mobile phone 103, are in the living room of the user's home, each of them has a voice assistant installed, and the wake-up word of each is "Xiao E Xiao E".

Fig. 4 is a schematic flowchart of a voice control method provided by an embodiment of this application. In this embodiment, the multi-device wake-up arbitration and the multi-device capability arbitration are implemented by a server. As shown in Fig. 4, the method may include the following S401-S409.

S401. Speaker 101, television set 102 and mobile phone 103 each receive first voice data input by the user.

For example, the first voice data can be the above wake-up word "Xiao E Xiao E".

For an electronic device with a voice assistant installed, when no other software or hardware in the device is using the microphone to collect voice data, the DSP of the device can monitor in real time, through the microphone, whether the user inputs voice data. In general, a user who wants to use the voice control function of an electronic device speaks within the pickup distance of the device, so that the speech is input to the microphone. At this point, if no other software or hardware in the device is using the microphone to collect voice data, the DSP of the device can detect the corresponding voice data, such as the first voice data, through the microphone and cache it.

For example, with reference to Fig. 5, a user sitting on the sofa in the living room who wants to use the voice control function can say the wake-up word "Xiao E Xiao E". Suppose the pickup distance of speaker 101, television set 102 and mobile phone 103 is 4 meters each, and no other software or hardware in any of them is using the microphone to collect voice data. The DSPs of speaker 101, television set 102 and mobile phone 103 can then each detect, through their respective microphones, the first voice data corresponding to the wake-up word "Xiao E Xiao E".

S402. Speaker 101, television set 102 and mobile phone 103 each verify the received first voice data and determine that it is a registered wake-up word.

After receiving the first voice data, an electronic device can verify it, that is, determine whether the received first voice data is a wake-up word registered in the device. If the verification passes, the received first voice data is a wake-up word, and the following S403 can be performed. If the verification fails, the received first voice data is not a wake-up word, and the device can delete the cached first voice data.

Illustratively, verification of the first voice data by an electronic device may specifically include the following. The DSP of the device performs a lower-precision match between the text of the first voice data and the text of the wake-up word registered in the device. If the DSP match passes and the AP of the device is dormant, the DSP wakes up the AP, and the AP performs a higher-precision match between the text of the first voice data and the text of the registered wake-up word. If the AP match also passes, the device determines that the first voice data is a registered wake-up word. If the DSP match fails or the AP match fails, the device determines that the first voice data is not a registered wake-up word.
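The two-stage check described above can be sketched as follows. This is a minimal illustration only: the text does not specify how either match is computed, so the `difflib` similarity score, the two thresholds, the romanized wake-up word text and the action names are all assumptions introduced here.

```python
from difflib import SequenceMatcher

# Hypothetical registered wake-up word text (a romanized stand-in).
REGISTERED_WAKE_WORD = "xiao e xiao e"

# Assumed thresholds: the text only says "lower" and "higher" precision.
DSP_THRESHOLD = 0.6
AP_THRESHOLD = 0.9


def similarity(text: str, reference: str) -> float:
    """Stand-in text similarity in [0, 1]; a real device would use its own matcher."""
    return SequenceMatcher(None, text, reference).ratio()


def verify_wake_word(text: str, ap_dormant: bool = True):
    """Return (is_wake_word, actions), following the DSP-then-AP order."""
    actions = []
    # Stage 1: lower-precision match, performed on the DSP.
    if similarity(text, REGISTERED_WAKE_WORD) < DSP_THRESHOLD:
        actions.append("delete_cached_voice_data")
        return False, actions
    # The DSP wakes the AP only if the AP is dormant.
    if ap_dormant:
        actions.append("wake_ap")
    # Stage 2: higher-precision match, performed on the AP.
    if similarity(text, REGISTERED_WAKE_WORD) < AP_THRESHOLD:
        actions.append("delete_cached_voice_data")
        return False, actions
    return True, actions
```

Splitting the check this way means that input which fails the cheap first stage never wakes the AP, which is presumably why the coarse match is placed on the DSP.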

For example, continuing the example in S401, after the DSPs of speaker 101, television set 102 and mobile phone 103 detect the first voice data corresponding to the wake-up word "Xiao E Xiao E", each device verifies the first voice data using its own DSP and AP. In this embodiment, the first voice data passes verification on speaker 101, television set 102 and mobile phone 103; that is, all three determine that the detected first voice data is a registered wake-up word.

S403. Speaker 101, television set 102 and mobile phone 103 each report energy information of the detected first voice data to the server.

The energy information indicates the distance between a device and the user. In some embodiments, the energy information can be represented by one or more of signal-to-noise ratio, sound pressure, and the like. Take energy information represented by sound pressure as an example. Continuing the example in S402, after speaker 101, television set 102 and mobile phone 103 determine that the detected first voice data is a registered wake-up word, each of them measures the sound pressure of the first voice data it detected and reports the measured sound pressure to the server. The greater the sound pressure, the closer the device is to the user.

S404. The server determines, according to the energy information of the first voice data reported by speaker 101, television set 102 and mobile phone 103, that speaker 101 gives the wake-up response.

After receiving the energy information of the first voice data reported by multiple electronic devices, the server can perform multi-device wake-up arbitration; that is, the server selects one of these electronic devices to give the wake-up response.

For example, continuing the example in S403, after receiving the sound pressure of the first voice data sent by speaker 101, television set 102 and mobile phone 103, the server can select, according to the sound pressure values, the device with the largest sound pressure, that is, the device nearest to the user, to give the wake-up response. As shown in Fig. 5, the distances from speaker 101, television set 102 and mobile phone 103 to the user are 2 meters, 3 meters and 2.5 meters respectively. Accordingly, the sound pressure of the first voice data measured by speaker 101 is the largest, that measured by mobile phone 103 is the second largest, and that measured by television set 102 is the smallest. The server can therefore select speaker 101 to give the wake-up response. For instance, the server can send a first wake-up indication to speaker 101, which instructs the device to give the wake-up response. The server can also send a second wake-up indication to each of television set 102 and mobile phone 103, which instructs the device not to give a wake-up response. Alternatively, the server can send no indication to television set 102 and mobile phone 103; in that case, when television set 102 and mobile phone 103 determine that they have received no wake-up indication, such as the first wake-up indication, within a preset time, they determine not to give a wake-up response.
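The selection rule in this example can be sketched as follows. The sketch assumes that energy information is a single sound pressure number per device; the numeric values, the device labels used as dictionary keys and the function names are illustrative assumptions, since the text gives distances but no measured pressures.

```python
FIRST_WAKEUP = "first wake-up indication"    # give the wake-up response
SECOND_WAKEUP = "second wake-up indication"  # do not give a wake-up response


def arbitrate_wakeup(sound_pressure: dict) -> dict:
    """Map each reporting device to the indication the server would send it."""
    # The largest sound pressure is taken to mean the device nearest the user.
    winner = max(sound_pressure, key=sound_pressure.get)
    return {
        device: FIRST_WAKEUP if device == winner else SECOND_WAKEUP
        for device in sound_pressure
    }


def should_respond(indication=None) -> bool:
    """Device side: None models 'no indication received within the preset time'."""
    return indication == FIRST_WAKEUP


# Illustrative values only, ordered like the 2 m / 3 m / 2.5 m distances in Fig. 5.
reports = {"speaker 101": 0.50, "television set 102": 0.33, "mobile phone 103": 0.40}
indications = arbitrate_wakeup(reports)
```

Treating a missing indication the same as the second wake-up indication covers the variant in which the server stays silent towards the devices that were not selected.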

S405. Speaker 101 wakes up its voice assistant and receives second voice data input by the user.

S406. Speaker 101 reports the second voice data to the server.

For example, as shown in Fig. 5, after receiving the first wake-up indication, speaker 101 can wake up its voice control function, for example by waking up its voice assistant. Speaker 101 can also play a wake-up answering tone, such as "I'm here". Television set 102 and mobile phone 103 do not respond, in accordance with the second wake-up indication they received. The user can then go on to say a voice command. The AP of speaker 101 can detect, through the microphone, the voice data corresponding to the voice command, that is, the second voice data. Speaker 101 can then report the second voice data to the server.

S407. The server determines which of speaker 101, television set 102 and mobile phone 103 has the function needed to execute the event corresponding to the second voice data.

After receiving the second voice data reported by speaker 101, the server can perform multi-device capability arbitration; that is, based on the second voice data, the server determines which of the multiple electronic devices has the function needed to execute the event corresponding to the second voice data. In some embodiments, an electronic device can automatically report its capability information to the server when it powers on or restarts, so that the server can store it. In other embodiments, an electronic device can also report its capability information to the server automatically at regular intervals, or report it automatically when it detects that its capability information has changed. Thus, after receiving the second voice data, the server can analyze it using automatic speech recognition (ASR) technology to learn what function an electronic device needs in order to execute the corresponding event. Then, according to that result and the stored capability information of the multiple electronic devices, the server determines which of these devices has the function needed to execute the event corresponding to the second voice data.

For example, continuing the example in Fig. 5 and S401-S406, assume that speaker 101, television set 102 and mobile phone 103 each reported their capability information at power-on. The capability information reported by speaker 101 includes: music playback function, weather broadcast function. The capability information reported by television set 102 includes: video playback capability. The capability information reported by mobile phone 103 includes: navigation function. The server can then store the capability information reported by each electronic device together with the identifier of that device (such as the device's media access control (MAC) address). For example, the correspondence between capability information and device identifiers stored by the server is shown in Table 1.

Table 1

Identifier of electronic device	Capability information of device
MAC address 1	Music playback function, weather broadcast function
MAC address 2	Video playback capability
MAC address 3	Navigation function

In Table 1, MAC address 1 is the identifier of speaker 101, MAC address 2 is the identifier of television set 102, and MAC address 3 is the identifier of mobile phone 103. It should also be noted that speaker 101, television set 102 and mobile phone 103 can each report their capability information to the server once at every power-on, so that when a device's capability information is updated, the server side can also be updated in time.
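A minimal sketch of how the server side might keep Table 1 current is given below. The class name, method names and capability strings are hypothetical; the only behavior taken from the text is that each report is stored against the device identifier and that a later report refreshes the stored entry.

```python
class CapabilityRegistry:
    """Hypothetical in-memory store of identifier -> capability information."""

    def __init__(self):
        self._by_device = {}

    def report(self, device_id: str, capabilities) -> None:
        # A new report (at power-on, periodically, or on change) replaces
        # the previous entry, so the stored table follows the device.
        self._by_device[device_id] = set(capabilities)

    def capabilities_of(self, device_id: str) -> set:
        # Unknown devices simply have no recorded capabilities.
        return set(self._by_device.get(device_id, ()))


registry = CapabilityRegistry()
registry.report("MAC address 1", ["music playback", "weather broadcast"])
registry.report("MAC address 2", ["video playback"])
registry.report("MAC address 3", ["navigation"])

# A later power-on report carrying an extra capability (made up for illustration).
registry.report("MAC address 2", ["video playback", "music playback"])
```

Replacing the whole entry on every report, rather than merging, also lets a device withdraw a capability it no longer has.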

For example, suppose the voice command said by the user, that is, the second voice data, is "play the film The Wandering Earth". After receiving the second voice data "play the film The Wandering Earth", the server can analyze it and determine that the device executing the corresponding event, that is, playing the film The Wandering Earth, needs to have the video playback capability. According to Table 1, the server can determine that the device identified by MAC address 2, that is, television set 102, has the video playback capability. In other words, the server determines that, among speaker 101, television set 102 and mobile phone 103, television set 102 is the device that has the function needed to execute the event corresponding to the second voice data "play the film The Wandering Earth".

As another example, suppose the voice command said by the user, that is, the second voice data, is "navigate to somewhere". After receiving the second voice data "navigate to somewhere", the server can analyze it and determine that the device executing the corresponding event, that is, navigating to that place, needs to have the navigation function. According to Table 1, the server can determine that the device identified by MAC address 3, that is, mobile phone 103, has the navigation function. In other words, the server determines that, among speaker 101, television set 102 and mobile phone 103, mobile phone 103 is the device that has the function needed to execute the event corresponding to the second voice data "navigate to somewhere".
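The two lookups above can be sketched as follows. The keyword rules stand in for the ASR-based analysis, which the text does not detail, and the command is assumed to be already available as text; `required_function`, `capable_devices` and the keyword list are hypothetical names introduced for illustration.

```python
# Table 1, keyed by device identifier.
TABLE_1 = {
    "MAC address 1": {"music playback", "weather broadcast"},
    "MAC address 2": {"video playback"},
    "MAC address 3": {"navigation"},
}

# Assumed keyword rules standing in for real command analysis.
KEYWORD_TO_FUNCTION = [
    ("film", "video playback"),
    ("navigate", "navigation"),
    ("music", "music playback"),
    ("weather", "weather broadcast"),
]


def required_function(command_text: str):
    """Return the function a device needs for this command, or None if unknown."""
    text = command_text.lower()
    for keyword, function in KEYWORD_TO_FUNCTION:
        if keyword in text:
            return function
    return None


def capable_devices(command_text: str, table=TABLE_1) -> list:
    """Return identifiers of all devices whose capability information fits."""
    function = required_function(command_text)
    if function is None:
        return []
    return [device for device, caps in table.items() if function in caps]
```

The function returns a list rather than a single identifier because more than one device may have the required function.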

S408. The server sends a content indication to the device that has the function needed to execute the event corresponding to the second voice data.

S409. The device that has the function needed to execute the event corresponding to the second voice data executes that event according to the content indication.

The content indication can be the data required to execute the event corresponding to the second voice data. For example, as shown in Fig. 6, suppose the voice command said by the user, that is, the second voice data, is "play the film The Wandering Earth". The content indication can then be the playback link of the film "The Wandering Earth". Continuing the example in S407, the server can send the playback link of the film "The Wandering Earth" to television set 102. After receiving the playback link, television set 102 can play the film "The Wandering Earth" according to it, as shown in Fig. 6. In Fig. 4, S408 and S409 are illustrated with television set 102 as the device that has the function needed to execute the event corresponding to the second voice data.

The content indication can also be the instruction corresponding to the second voice data. As another example, suppose the voice command said by the user, that is, the second voice data, is "navigate to somewhere". The content indication can then be the instruction corresponding to the second voice data "navigate to somewhere". Continuing the example in S407, the server can send this instruction to mobile phone 103. According to the received instruction, mobile phone 103 can start the navigation application, display the route to that place, and give a voice broadcast. The content indication can also be the second voice data itself; in that case, after receiving the second voice data, mobile phone 103 can analyze it to obtain the corresponding instruction and execute that instruction.

In addition, the server can also send a command response indication to speaker 101, which instructs speaker 101 to respond to the voice command. In some embodiments, if the server determines that another electronic device has the function needed to execute the event corresponding to the second voice data and speaker 101 does not, the server can send a command response indication to speaker 101 that instructs speaker 101 to tell the user that the event corresponding to the voice command will be executed on the other electronic device.

For example, continuing the example in S407, the server determines that television set 102 has the function needed to execute the event corresponding to the second voice data "play the film The Wandering Earth", and that speaker 101 does not. The server can send a command response indication to speaker 101 that instructs speaker 101 to tell the user that the film "The Wandering Earth" will be played on television set 102. As shown in Fig. 6, speaker 101 can, according to the command response indication, give the voice broadcast "The film The Wandering Earth will be played on the television set". As another example, continuing the example in S407, the server determines that mobile phone 103 has the function needed to execute the event corresponding to the second voice data "navigate to somewhere", and that speaker 101 does not. The server can send a command response indication to speaker 101 that instructs speaker 101 to tell the user that navigation will be carried out on mobile phone 103. Speaker 101 can, according to the command response indication, give the voice broadcast "Navigation will be carried out on the mobile phone".

In other embodiments, if the server determines that speaker 101 itself has the function needed to execute the event corresponding to the second voice data, the server can send both a voice command response and a content indication to speaker 101. Speaker 101 can then give a voice broadcast according to the voice command response, for example "A certain event will be executed", and execute the event corresponding to the second voice data according to the content indication.
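The two cases above, in which the executing device either is or is not the device that gave the wake-up response, can be sketched as a small planning function. The message labels, the tuple layout and the prompt wording are assumptions made for illustration; the text does not define a message format.

```python
def plan_server_messages(wake_device: str, target_device: str,
                         content: str, event_text: str) -> list:
    """Return (recipient, message type, payload) tuples the server would send."""
    # The content indication always goes to the device that executes the event.
    messages = [(target_device, "content indication", content)]
    if target_device == wake_device:
        # The responding device executes the event itself.
        prompt = f"will {event_text}"
    else:
        # The responding device tells the user where the event will run.
        prompt = f"will {event_text} on {target_device}"
    messages.append((wake_device, "command response indication", prompt))
    return messages


messages = plan_server_messages(
    wake_device="speaker 101",
    target_device="television set 102",
    content="<playback link>",  # placeholder, not a real link
    event_text="play the film The Wandering Earth",
)
```

Keeping the user-facing prompt on the device that answered the wake-up word means the user hears the reply from the device that is nearest, regardless of which device executes the event.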

It should be noted that, in the embodiments of this application, the user can say the wake-up word (that is, the first voice data) and the voice command (that is, the second voice data) either continuously or separately. For example, the user can say the wake-up word and the voice command in one go: "Xiao E Xiao E, play the film The Wandering Earth". The user can also first say the wake-up word "Xiao E Xiao E" and then, after hearing a device play the wake-up answering tone, such as "I'm here", say the voice command "play the film The Wandering Earth". If the user says the wake-up word and the voice command continuously, then after the device that gives the wake-up response has been determined, that device can skip the wake-up answering tone and instead, after receiving the command response indication sent by the server, directly play a prompt according to that indication, such as "The film The Wandering Earth will be played on the television set".

The above S407-S409 are described only for the case in which the multi-device scenario includes the three devices speaker 101, television set 102 and mobile phone 103. In other embodiments, the multi-device scenario can also include other electronic devices. Such an electronic device may or may not have a voice control function. If it has a voice control function, its wake-up word may differ from the above wake-up word "Xiao E Xiao E"; or its wake-up word may be the same as "Xiao E Xiao E", but the distance between the device and the user may exceed its pickup distance. In such a scenario, if the server has stored the capability information of this electronic device and determines that it is the device that has the function needed to execute the event corresponding to the second voice data, the server can also send the content indication to this device, so that it executes the event corresponding to the second voice data according to the content indication. In this way, the coverage of voice control can extend beyond the pickup range of an electronic device. For example, the pickup distance of a television set equipped with 6 microphones is usually within 5 meters. With the method of the embodiments of this application, even if the distance between the user and the television set exceeds 5 meters, the television set can still be controlled by voice to perform events such as automatically playing a video. In addition, the user does not need to state explicitly that the video is to be played on the television set, that is, the user does not need to specify that the device to play the video is the television set. The user only needs to say "play a certain video", and with the method of this embodiment the television set can be triggered to play the video automatically.

In addition, as smart homes become widespread, more and more electronic devices have a voice control function, and electronic devices have more and more functions. If, as in the example in S407, the device that finally executes the event corresponding to the voice command is determined only from the capability information of the electronic devices, several electronic devices may all have the function needed to execute that event. In some embodiments, the server can arbitrarily select one of these electronic devices to execute the event corresponding to the voice command. In other embodiments, the server can also take into account the distance between the user and each of these electronic devices and select the one nearest to the user to execute the event. The server can also select one of these electronic devices according to the state of each device, such as whether it is powered on and whether it has been assigned, within a preset time, to execute the event corresponding to another voice command. For example, suppose the server determines that two electronic devices (electronic device 1 and electronic device 2) both have the function needed to execute the event corresponding to the voice command, but electronic device 1 was assigned a few minutes ago to execute the event corresponding to another voice command. The server can then select electronic device 2 to execute the event corresponding to the current voice command. The server can also record the usage habits of different users (who can be distinguished by voiceprint) and, based on these habits, select from the candidate devices the one the user commonly uses. For example, the server has recorded that user 1 usually watches videos on television set 1. After receiving a voice command from user 1 to play a video, if the server determines that television set 1 and television set 2 both have the video playback function, the server can select television set 1 to play the video according to the user's usage habits. Of course, the server can also combine one or more of the distance between each electronic device and the user, the state of each electronic device, and the user's usage habits to determine the one electronic device that executes the event corresponding to the voice command; this embodiment places no specific limitation on this. In this way, the device that best matches the user's intention can be selected to execute the event corresponding to the voice command, which makes the voice control system more intelligent and improves the user experience.

Fig. 7 is a schematic flowchart of another voice control method provided by an embodiment of this application. In this embodiment, the multi-device wake-up arbitration and the multi-device capability arbitration are implemented by a main device. The main device can be any one of speaker 101, television set 102 and mobile phone 103; in this embodiment, mobile phone 103 is taken as the main device. As shown in Fig. 7, the method may include the following S701-S709.

S701. Speaker 101, television set 102 and mobile phone 103 each receive first voice data input by the user.

S702. Speaker 101, television set 102 and mobile phone 103 each verify the received first voice data and determine that it is a registered wake-up word.

The specific descriptions of S701 and S702 are the same as those of S401 and S402 in the embodiment shown in Fig. 4 and are not repeated here.

S703. Speaker 101 and television set 102 each report energy information of the detected first voice data to mobile phone 103.

S704. Mobile phone 103 determines, according to the energy information of the first voice data reported by speaker 101 and television set 102 and the energy information of the first voice data that it measured itself, that speaker 101 gives the wake-up response.

The specific descriptions of S703 and S704 are similar to those of S403 and S404 in the embodiment shown in Fig. 4. The difference is that, in this embodiment, the multi-device wake-up arbitration is performed by mobile phone 103 as the main device, so speaker 101 and television set 102 report the energy information of the first voice data to mobile phone 103.

S705. Speaker 101 wakes up its voice assistant and receives second voice data input by the user.

S706. Speaker 101 reports the second voice data to mobile phone 103.

S707. Mobile phone 103 determines which of speaker 101, television set 102 and mobile phone 103 has the function needed to execute the event corresponding to the second voice data.

The specific descriptions of S705-S707 are similar to those of S405-S407 in the embodiment shown in Fig. 4, with the following differences. 1. In this embodiment, the multi-device capability arbitration is performed by mobile phone 103 as the main device, so after receiving the second voice data, speaker 101 reports it to mobile phone 103. Of course, in this embodiment, mobile phone 103, as the main device, can also collect the voice data input by the user itself. 2. Mobile phone 103 stores its own capability information and that of the other electronic devices. For example, mobile phone 103 can store the correspondence between capability information and device identifiers shown in Table 1 of the embodiment shown in Fig. 4, so that it can determine from this correspondence which device has the function needed to execute the event corresponding to the second voice data.

In this embodiment, mobile phone 103 may determine that the device that has the function needed to execute the event corresponding to the second voice data is mobile phone 103 itself. In that case, if no interaction with the server is needed to obtain a content indication, mobile phone 103 can directly analyze the second voice data to obtain the corresponding instruction and then execute the event corresponding to the second voice data according to that instruction. If interaction with the server is needed to obtain a content indication, mobile phone 103 can send a request message to the server, requesting the server to deliver the content indication to mobile phone 103.

If mobile phone 103 determines that the device that has the function needed to execute the event corresponding to the second voice data is another device, such as speaker 101 or television set 102, the following S708-S709 can be performed.

S708. Mobile phone 103 sends a content indication to the device that has the function needed to execute the event corresponding to the second voice data.

S709. The device that has the function needed to execute the event corresponding to the second voice data executes that event according to the content indication.

Mobile phone 103 can send a request message to the server to obtain the content indication, and then deliver the content indication to the device that has the function needed to execute the event corresponding to the second voice data, so that this device executes the event according to the content indication. In Fig. 7, S708 and S709 are illustrated with television set 102 as the device that has the function needed to execute the event corresponding to the second voice data.

Of course, in some other embodiments, if it is determined that the device having the function of executing the event corresponding to the second voice data is another device and is not the device that performs the wake-up response, that is, not the speaker 101 but the television set 102, then, as an alternative to S708, the mobile phone 103 can send the second voice data to the television set 102. The television set 102 can interact with the server according to the second voice data to obtain the content instruction.

In still other embodiments, if it is determined that the device having the function of executing the event corresponding to the second voice data is another device and is the device that performs the wake-up response, namely the speaker 101, then, as an alternative to S708, the mobile phone 103 can send indication information to the speaker 101. The indication information indicates that the speaker 101 is to respond to the voice command. The speaker 101 can then interact with the server according to the second voice data it has received, to obtain the content instruction.

It should be noted that S708 and S709 above are described for the case in which the response to the voice command can be achieved only by interacting with the server to obtain a content instruction. If no interaction with the server is needed to obtain a content instruction, then, when it is determined that the device having the function of executing the event corresponding to the second voice data is not the speaker 101 but the television set 102, the mobile phone 103 can send the second voice data to the television set 102, and the television set 102 can analyze the second voice data to obtain the corresponding instruction and then execute the event corresponding to the second voice data according to that instruction. Alternatively, the mobile phone 103 can analyze the second voice data to obtain the corresponding instruction and then send the instruction to the television set 102, so that the television set 102 executes the event corresponding to the second voice data according to the instruction. When it is determined that the device having the function of executing the event corresponding to the second voice data is the speaker 101, the mobile phone 103 can send indication information to the speaker 101, and the speaker 101 can, according to the indication information, directly analyze the second voice data to obtain the corresponding instruction and then execute the event corresponding to the second voice data according to that instruction.
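The routing choices described for S708 and its alternatives can be summarized in a short sketch. The message labels, the `Dispatch` type, and the function `plan_dispatch` are hypothetical names introduced for illustration; where the description permits more than one option, the sketch picks one, as noted in the comments.

```python
from dataclasses import dataclass
from typing import Optional


@dataclass(frozen=True)
class Dispatch:
    recipient: Optional[str]  # None means the master device handles the event itself
    message: str              # hypothetical label for what the master sends or does


def plan_dispatch(master: str, responder: str, target: str, needs_server: bool) -> Dispatch:
    """Decide what the master device does once the target device is known."""
    if target == master:
        # The master executes the event itself, with or without the server.
        action = "request_content_from_server" if needs_server else "analyze_locally"
        return Dispatch(None, action)
    if target == responder:
        # The responder already holds the second voice data, so indication
        # information is enough for it to proceed on its own.
        return Dispatch(target, "indication_info")
    if needs_server:
        # S708 as described; the alternative is to forward the second voice data.
        return Dispatch(target, "content_instruction")
    # No server needed; the alternative is to send an already parsed instruction.
    return Dispatch(target, "second_voice_data")
```

For example, with the mobile phone 103 as master, the speaker 101 as responder, and the television set 102 as target, the sketch yields a content instruction when server interaction is needed and the raw second voice data otherwise.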

In addition, the mobile phone 103 can also send a command response instruction to the speaker 101, which instructs the speaker 101 to perform a voice command response. For details of the voice command response, refer to the description of the corresponding content in the embodiment shown in Fig. 4. For other descriptions of S707-S709, refer to the descriptions of the corresponding content of S407-S409 in the embodiment shown in Fig. 4. Details are not repeated here.

It should be noted that, in the embodiments of this application, the interaction between electronic devices (for example, between the mobile phone 103 and the speaker 101, or between the mobile phone 103 and the television set 102) can be implemented over a Bluetooth connection established between the two electronic devices using the Bluetooth protocol, or over a Wi-Fi connection established between the two electronic devices using the Wi-Fi protocol. Of course, it can also be implemented over a connection established using another short-range communication protocol, which is not specifically limited in this embodiment.

With the method shown in Fig. 4 or Fig. 7, in a multi-device scenario, after the user says the wake-up word and the voice command, multi-device wake-up arbitration and multi-device capability arbitration ensure that only one of the devices, for example the device nearest to the user, is woken up to perform the wake-up response. Moreover, when the device performing the wake-up response does not have the function of executing the event corresponding to the voice command, the user does not need to move and does not need to repeat the wake-up word and the voice command; a device that does have the function of executing the event corresponding to the voice command executes that event, completing the response to the voice command. This makes the electronic devices more intelligent, achieves efficient interaction between the electronic devices and the user, and improves the user experience.

Other embodiments of this application further provide a computer storage medium. The computer storage medium may include computer instructions that, when run on an electronic device (such as the speaker 101, the television set 102, or the mobile phone 103 described above), cause the electronic device to perform the steps performed by the electronic device in the embodiment corresponding to Fig. 7.

Other embodiments of this application further provide a computer program product that, when run on a computer, causes the computer to perform the steps performed by the electronic device (such as the speaker 101, the television set 102, or the mobile phone 103 described above) in the embodiment corresponding to Fig. 7.

Other embodiments of this application further provide an apparatus that has the function of implementing the behavior of the electronic device (such as the speaker 101, the television set 102, or the mobile phone 103 described above) in the embodiment corresponding to Fig. 7. The function may be implemented by hardware, or by hardware executing corresponding software. The hardware or software includes one or more modules corresponding to the function, for example, a receiving unit or module, a determining unit or module, and a sending unit or module.

Based on the foregoing description of the embodiments, a person skilled in the art can clearly understand that, for convenience and brevity of description, the division into the functional modules above is used only as an example. In practical applications, the functions above can be assigned to different functional modules as needed; that is, the internal structure of the apparatus can be divided into different functional modules to complete all or part of the functions described above.

In the several embodiments provided in this application, it should be understood that the disclosed apparatus and method may be implemented in other ways. For example, the apparatus embodiments described above are merely illustrative. The division into modules or units is merely a division by logical function, and other divisions are possible in actual implementation; for example, multiple units or components may be combined or integrated into another apparatus, or some features may be ignored or not performed. In addition, the mutual couplings, direct couplings, or communication connections shown or discussed may be indirect couplings or communication connections through some interfaces, apparatuses, or units, and may be electrical, mechanical, or in other forms.

The units described as separate parts may or may not be physically separate, and the parts shown as units may be one physical unit or multiple physical units; that is, they may be located in one place or distributed across multiple different places. Some or all of the units may be selected according to actual needs to achieve the purpose of the solution of this embodiment.

In addition, the functional units in the embodiments of this application may be integrated into one processing unit, or each unit may exist physically on its own, or two or more units may be integrated into one unit. The integrated unit may be implemented in the form of hardware or in the form of a software functional unit.

If the integrated unit is implemented in the form of a software functional unit and sold or used as an independent product, it may be stored in a readable storage medium. Based on this understanding, the technical solutions of the embodiments of this application in essence, or the part that contributes to the prior art, or all or part of the technical solutions, may be embodied in the form of a software product. The software product is stored in a storage medium and includes several instructions that cause a device (which may be a single-chip microcomputer, a chip, or the like) or a processor to perform all or part of the steps of the methods in the embodiments of this application. The storage medium includes any medium that can store program code, such as a USB flash drive, a removable hard disk, a read-only memory (Read-Only Memory, ROM), a random access memory (Random Access Memory, RAM), a magnetic disk, or an optical disc.

The above is merely a specific implementation of this application, but the protection scope of this application is not limited thereto. Any change or replacement within the technical scope disclosed in this application shall fall within the protection scope of this application. Therefore, the protection scope of this application shall be subject to the protection scope of the claims.

Claims

1. A voice control method, characterized in that the method is applied to a voice control system, the voice control system comprises a group of devices and a server, the group of devices comprises at least a first electronic device and a second electronic device that have a voice control function, and the method comprises:

the first electronic device and the second electronic device each receive first voice data of a user;

the first electronic device determines that the first voice data is the same as the wake-up word registered in the first electronic device, and sends, to the server, energy information of the first voice data detected by the first electronic device;

the second electronic device determines that the first voice data is the same as the wake-up word registered in the second electronic device, and sends, to the server, energy information of the first voice data detected by the second electronic device;

the server determines, according to the energy information of the first voice data detected by the first electronic device and the energy information of the first voice data detected by the second electronic device, that the first electronic device is to perform the wake-up response, and sends a first wake-up instruction to the first electronic device; wherein the energy of the first voice data detected by the first electronic device is greater than the energy of the first voice data detected by the second electronic device;

the first electronic device wakes up the voice control function of the first electronic device in response to the first wake-up instruction;

the first electronic device, after its voice control function is woken up, receives second voice data of the user;

the first electronic device sends the second voice data to the server;

the server determines a target electronic device from the group of devices according to the second voice data, the target electronic device having the function of executing the event corresponding to the second voice data;

the server sends a content instruction to the target electronic device, wherein the content instruction is the instruction corresponding to the second voice data, or the content instruction is the data required to execute the event corresponding to the second voice data;

the target electronic device executes the event corresponding to the second voice data according to the content instruction.
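For illustration only, the energy-based wake-up arbitration performed by the server can be sketched as below. The function name `arbitrate_wakeup` and the representation of the energy information as a single number per device are assumptions of this sketch; the claim does not prescribe a data format, nor how equal energies are handled.

```python
from typing import Dict, Optional


def arbitrate_wakeup(energy_reports: Dict[str, float]) -> Optional[str]:
    """Pick the device whose detected first voice data has the highest energy.

    energy_reports maps a device identifier to the energy it reported for the
    wake-up word. Returns None when no device has reported.
    """
    if not energy_reports:
        return None
    # On equal energies, max() keeps the first device encountered; this
    # tie-break is an assumption of the sketch.
    return max(energy_reports, key=lambda dev: energy_reports[dev])
```

The server would then send the first wake-up instruction to the returned device.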

2. The method according to claim 1, characterized in that the group of devices further comprises a third electronic device;

wherein the third electronic device does not have a voice control function; or

the third electronic device has a voice control function, but the distance between the third electronic device and the user is greater than the sound pickup distance of the third electronic device.

3. The method according to claim 1 or 2, characterized in that, when the first voice data is received, the voice control functions of the first electronic device and the second electronic device have not been woken up.

4. The method according to any one of claims 1-3, characterized in that the method further comprises:

the server sends a command response instruction to the first electronic device, the command response instruction instructing the first electronic device to prompt the user that the event corresponding to the second voice data will be executed by the target electronic device;

the first electronic device prompts the user, according to the command response instruction, that the event corresponding to the second voice data will be executed by the target electronic device.

5. The method according to any one of claims 1-4, characterized in that the server determining a target electronic device from the group of devices according to the second voice data comprises:

the server selects, from the group of devices, the devices that have the function of executing the event corresponding to the second voice data, according to the capability information of each device in the group of devices and the second voice data;

if only one device in the group of devices has the function of executing the event corresponding to the second voice data, the server determines that this device is the target electronic device;

if multiple devices in the group of devices have the function of executing the event corresponding to the second voice data, the server determines one device among the multiple devices as the target electronic device;

wherein the target electronic device is any one of the multiple devices; or

the target electronic device meets at least one of the following conditions:

the target electronic device is the device, among the multiple devices, that is nearest to the user;

the target electronic device is powered on;

the target electronic device has not, within a preset time, been determined to execute an event corresponding to other voice data; or

the target electronic device is the device, among the multiple devices, that the user uses most frequently.
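One possible way to combine the listed conditions is sketched below. The claim only requires that at least one condition be met and does not rank them, so the priority order used here (powered on, then idle, then nearest, then most used) is an assumption, as are the `Candidate` fields and the function name `pick_target`.

```python
from dataclasses import dataclass
from typing import List, Optional


@dataclass
class Candidate:
    name: str
    distance_m: float   # estimated distance to the user
    powered_on: bool
    busy: bool          # already chosen for another voice command recently
    use_count: int      # how often the user has used this device


def pick_target(candidates: List[Candidate]) -> Optional[str]:
    """Choose one target among devices that can all execute the event."""
    if not candidates:
        return None
    if len(candidates) == 1:
        return candidates[0].name
    # Assumed priority: powered on, then idle, then nearest, then most used.
    best = min(
        candidates,
        key=lambda c: (not c.powered_on, c.busy, c.distance_m, -c.use_count),
    )
    return best.name
```

A different ordering, or a weighted score, would satisfy the same claim language equally well.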

6. The method according to claim 5, characterized in that the method further comprises:

each device in the group of devices reports its own capability information to the server;

the server stores the capability information of each device in the group of devices.

7. The method according to any one of claims 1-6, characterized in that the method further comprises:

the server sends a second wake-up instruction to the second electronic device, and the second electronic device determines, according to the second wake-up instruction, not to wake up the voice control function of the second electronic device; or

the second electronic device determines that the first wake-up instruction has not been received within a preset time, and determines not to wake up the voice control function of the second electronic device.
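The two alternatives above can be read as a small decision rule on a device that has reported its energy information. The sketch below is illustrative only: the message labels, the return values, and the function name `wake_decision` are hypothetical, and elapsed time is passed in as a parameter rather than read from a clock.

```python
from typing import Optional


def wake_decision(message: Optional[str], elapsed_s: float, timeout_s: float) -> str:
    """Decide what a device does after reporting its energy information.

    message is the latest instruction received, or None if nothing has arrived.
    """
    if message == "first_wake_instruction":
        return "wake"
    if message == "second_wake_instruction":
        # Explicitly told not to wake the voice control function.
        return "stay_asleep"
    if elapsed_s >= timeout_s:
        # No first wake-up instruction arrived within the preset time.
        return "stay_asleep"
    return "keep_waiting"
```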

8. A voice control method, characterized in that the method is applied to a group of devices, the group of devices comprises at least a first electronic device and a second electronic device that have a voice control function, and the method comprises:

the first electronic device determines that the first voice data is the same as the wake-up word registered in the first electronic device, and obtains energy information of the first voice data detected by the first electronic device;

the second electronic device determines that the first voice data is the same as the wake-up word registered in the second electronic device, and sends, to the first electronic device, energy information of the first voice data detected by the second electronic device;

the first electronic device determines, according to the energy information of the first voice data detected by the first electronic device and the energy information of the first voice data detected by the second electronic device, the device that is to perform the wake-up response from among the first electronic device and the second electronic device;

if the energy of the first voice data detected by the first electronic device is greater than the energy of the first voice data detected by the second electronic device, it is determined that the first electronic device is to perform the wake-up response, the first electronic device wakes up the voice control function of the first electronic device, and the first electronic device, after its voice control function is woken up, receives second voice data of the user;

if the energy of the first voice data detected by the second electronic device is greater than the energy of the first voice data detected by the first electronic device, it is determined that the second electronic device is to perform the wake-up response, the first electronic device sends a first wake-up instruction to the second electronic device, the second electronic device wakes up the voice control function of the second electronic device in response to the first wake-up instruction, and the second electronic device, after its voice control function is woken up, receives the second voice data of the user and sends it to the first electronic device;

the first electronic device determines a target electronic device from the group of devices according to the second voice data, the target electronic device having the function of executing the event corresponding to the second voice data;

if the target electronic device is the first electronic device, the first electronic device analyzes the second voice data to obtain the instruction corresponding to the second voice data, and executes the event corresponding to the second voice data according to the instruction; or the first electronic device obtains, from the server, the data required to execute the event corresponding to the second voice data, and executes the event corresponding to the second voice data according to the data;

if the target electronic device is not the first electronic device, the first electronic device sends a content instruction to the target electronic device, wherein the content instruction is the instruction corresponding to the second voice data, or the content instruction is the data required to execute the event corresponding to the second voice data; and the target electronic device executes the event corresponding to the second voice data according to the content instruction.

9. The method according to claim 8, characterized in that the group of devices further comprises a third electronic device;

10. The method according to claim 8 or 9, characterized in that, when the first voice data is received, the voice control functions of the first electronic device and the second electronic device have not been woken up.

11. The method according to any one of claims 8-10, characterized in that:

if the second electronic device is the device that performs the wake-up response, the method further comprises: the first electronic device sends a command response instruction to the second electronic device, the command response instruction instructing the second electronic device to prompt the user that the event corresponding to the second voice data will be executed by the target electronic device; and the second electronic device prompts the user, according to the command response instruction, that the event corresponding to the second voice data will be executed by the target electronic device; or

if the first electronic device is the device that performs the wake-up response, the method further comprises: the first electronic device prompts the user that the event corresponding to the second voice data will be executed by the target electronic device.

12. The method according to any one of claims 8-11, characterized in that the first electronic device determining a target electronic device from the group of devices according to the second voice data comprises:

the first electronic device selects, from the group of devices, the devices that have the function of executing the event corresponding to the second voice data, according to the capability information of each device in the group of devices and the second voice data;

if only one device in the group of devices has the function of executing the event corresponding to the second voice data, the first electronic device determines that this device is the target electronic device;

if multiple devices in the group of devices have the function of executing the event corresponding to the second voice data, the first electronic device determines one device among the multiple devices as the target electronic device;

wherein the target electronic device is any one of the multiple devices; or

the target electronic device meets at least one of the following conditions:

the target electronic device is powered on;

13. The method according to claim 12, characterized in that the method further comprises:

each device in the group of devices other than the first electronic device reports its own capability information to the first electronic device;

the first electronic device stores the capability information of each device in the group of devices.

14. The method according to any one of claims 8-13, characterized in that, if the first electronic device is the device that performs the wake-up response, the method further comprises:

the first electronic device sends a second wake-up instruction to the second electronic device, and the second electronic device determines, according to the second wake-up instruction, not to wake up the voice control function of the second electronic device; or,

15. A voice control method, characterized in that the method is applied to a first electronic device that has a voice control function, the first electronic device is included in a group of devices, the group of devices further comprises a second electronic device that has a voice control function, and the method comprises:

the first electronic device receives first voice data of a user;

the first electronic device determines that the first voice data is the same as the wake-up word registered in the first electronic device, and sends, to a server, energy information of the first voice data detected by the first electronic device;

the first electronic device receives a wake-up instruction sent by the server, wherein the wake-up instruction is sent by the server after the server determines, according to the energy information of the first voice data detected by the first electronic device and the energy information of the first voice data detected by the second electronic device, that the first electronic device is to perform the wake-up response, and the energy of the first voice data detected by the first electronic device is greater than the energy of the first voice data detected by the second electronic device;

the first electronic device wakes up the voice control function of the first electronic device in response to the wake-up instruction;

the first electronic device sends the second voice data to the server;

the first electronic device receives a command response instruction sent by the server, the command response instruction instructing the first electronic device to prompt the user that the event corresponding to the second voice data will be executed by a target electronic device, wherein the target electronic device is the device that the server determines from the group of devices, according to the second voice data, as having the function of executing the event corresponding to the second voice data;

16. The method according to claim 15, characterized in that the group of devices further comprises a third electronic device;

17. The method according to claim 15 or 16, characterized in that, when the first voice data is received, the voice control function of the first electronic device has not been woken up.

18. The method according to any one of claims 15-17, characterized in that the target electronic device is the first electronic device, and the method further comprises:

the first electronic device receives a content instruction sent by the server, wherein the content instruction is the instruction corresponding to the second voice data, or the content instruction is the data required to execute the event corresponding to the second voice data;

the first electronic device executes the event corresponding to the second voice data according to the content instruction.

19. An electronic device, characterized by comprising one or more processors and a memory;

the memory is coupled to the one or more processors and is configured to store computer program code, the computer program code comprises computer instructions, and when the one or more processors execute the computer instructions, the electronic device performs the voice control method according to any one of claims 15-18.

20. A computer storage medium, characterized by comprising computer instructions that, when run on an electronic device, cause the electronic device to perform the voice control method according to any one of claims 15-18.

21. A computer program product, characterized in that, when the computer program product is run on a computer, the computer is caused to perform the voice control method according to any one of claims 15-18.

22. A voice control system, characterized by comprising a group of devices and a server, the group of devices comprising at least a first electronic device and a second electronic device that have a voice control function;

the first electronic device sends the second voice data to the server;

23. A voice control system, characterized in that the voice control system comprises a group of devices, the group of devices comprising at least a first electronic device and a second electronic device that have a voice control function;

if the energy of the first voice data detected by the first electronic device is greater than the energy of the first voice data detected by the second electronic device, the first electronic device determines that the first electronic device is to perform the wake-up response, the first electronic device wakes up the voice control function of the first electronic device, and the first electronic device, after its voice control function is woken up, receives second voice data of the user;

if the energy of the first voice data detected by the second electronic device is greater than the energy of the first voice data detected by the first electronic device, the first electronic device determines that the second electronic device is to perform the wake-up response, the first electronic device sends a first wake-up instruction to the second electronic device, the second electronic device wakes up the voice control function of the second electronic device in response to the first wake-up instruction, and the second electronic device, after its voice control function is woken up, receives the second voice data of the user and sends it to the first electronic device;