CN108648754A - Voice control method and device

Info

Publication number: CN108648754A
Application number: CN201810386446.0A
Authority: CN
Inventors: 王旭; 张建春; 郭峰; 刘广鑫
Original assignee: Beijing Xiaomi Mobile Software Co Ltd
Current assignee: Beijing Xiaomi Mobile Software Co Ltd
Priority date: 2018-04-26
Filing date: 2018-04-26
Publication date: 2018-10-12
Anticipated expiration: 2038-04-26
Also published as: CN108648754B

Abstract

The present disclosure relates to a voice control method and device. The method includes: receiving a voice instruction recognition result sent by a first terminal, the first terminal being associated with a first user account; performing semantic processing on the voice instruction recognition result to obtain operation information, the operation information including a second user account and operation content; when the second user account and the first user account are in a friend relationship, searching for a second terminal associated with the second user account; and sending an operation instruction carrying the operation content to the second terminal, instructing the second terminal to execute the operation content through a target application, the target application being an application program for voice control. The disclosure can meet a user's need to interact with a friend's voice assistant through the user's own voice assistant, improving the user experience.

Description

Voice control method and device

Technical field

The present disclosure relates to the field of communication technology, and more particularly to a voice control method and device.

Background

A voice assistant is an intelligent application that can be installed on smart devices such as mobile phones, televisions, computers, and smart speakers. It receives the user's audio signal through the microphone of the smart device, performs semantic analysis, and then responds promptly in the foreground, for example by chatting with the user by voice or by helping the user control the smart device according to instructions. Behind the process by which a voice assistant is woken up, understands, and speaks lie machine learning and data mining algorithms together with speech recognition, semantic understanding, and speech synthesis technologies, and cloud support from a voice knowledge database is required.

In the related art, after a voice assistant receives a user's voice command, it controls the user's device to perform the operation corresponding to the voice command.

Summary of the invention

To overcome the problems in the related art, embodiments of the present disclosure provide a voice control method and device. The technical solution is as follows:

According to a first aspect of the embodiments of the present disclosure, a voice control method is provided, applied to a cloud server, the voice control method including:

receiving a voice instruction recognition result sent by a first terminal, the first terminal being associated with a first user account;

performing semantic processing on the voice instruction recognition result to obtain operation information, the operation information including a second user account and operation content;

when the second user account and the first user account are in a friend relationship, searching for a second terminal associated with the second user account; and

sending an operation instruction carrying the operation content to the second terminal, the operation instruction being used to instruct the second terminal to execute the operation content through a target application, the target application being an application program for voice control.

In one embodiment, sending the operation instruction to the second terminal includes:

determining whether the first user account has permission to control the second terminal to execute the operation content through the target application; and

when it is determined that the first user account has permission to control the second terminal to execute the operation content through the target application, sending the operation instruction to the second terminal.

In one embodiment, the method further includes:

receiving a friend verification request sent by the first terminal, the friend verification request carrying the first user account and the second user account; and

forwarding the friend verification request to the second terminal associated with the second user account, the friend verification request being used to request establishment of a friend relationship between the second user account and the first user account.

In one embodiment, the operation information further includes at least one or a combination of the following: a sending time of the operation content or an execution time of the operation content.

In one embodiment, the type of the operation content includes at least one or a combination of the following: message content, voice message content, content of a to-do item to be added to a calendar, or to-do reminder content.

According to a second aspect of the embodiments of the present disclosure, a voice control method is provided, applied to a first terminal, the voice control method including:

receiving a voice instruction through a target application, the target application being an application program for voice control, and the first terminal being associated with a first user account;

analyzing the voice instruction to obtain a voice instruction recognition result; and

sending the voice instruction recognition result to a cloud server.

In one embodiment, the method further includes:

obtaining a second user account; and

sending to the cloud server a friend verification request carrying the first user account and the second user account, the friend verification request being used to request establishment of a friend relationship between the second user account and the first user account.

According to a third aspect of the embodiments of the present disclosure, a voice control method is provided, applied to a second terminal, the voice control method including:

receiving an operation instruction sent by a cloud server, the operation instruction including operation content; and

executing the operation content through a target application, the target application being an application program for voice control.

In one embodiment, the method further includes:

receiving a friend verification request sent by the cloud server, the friend verification request carrying the first user account and the second user account, the second user account being associated with the second terminal; and

establishing a friend relationship between the first user account and the second user account according to the friend verification request.

According to a fourth aspect of the embodiments of the present disclosure, a voice control device is provided, including:

a first receiving module, configured to receive a voice instruction recognition result sent by a first terminal, the first terminal being associated with a first user account;

a processing module, configured to perform semantic processing on the voice instruction recognition result to obtain operation information, the operation information including a second user account and operation content;

a searching module, configured to search for a second terminal associated with the second user account when the second user account and the first user account are in a friend relationship; and

a first sending module, configured to send an operation instruction carrying the operation content to the second terminal, the operation instruction being used to instruct the second terminal to execute the operation content through a target application, the target application being an application program for voice control.

In one embodiment, the first sending module determines whether the first user account has permission to control the second terminal to execute the operation content through the target application, and, when it is determined that the first user account has that permission, sends the operation instruction to the second terminal.

In one embodiment, the device further includes:

a second receiving module, configured to receive a friend verification request sent by the first terminal, the friend verification request carrying the first user account and the second user account; and

a forwarding module, configured to forward the friend verification request to the second terminal associated with the second user account, the friend verification request being used to request establishment of a friend relationship between the second user account and the first user account.

According to a fifth aspect of the embodiments of the present disclosure, a voice control device is provided, including:

a third receiving module, configured to receive a voice instruction through a target application, the target application being an application program for voice control, and the first terminal being associated with a first user account;

an analysis module, configured to analyze the voice instruction to obtain a voice instruction recognition result; and

a second sending module, configured to send the voice instruction recognition result to a cloud server.

According to a sixth aspect of the embodiments of the present disclosure, a voice control device is provided, including:

a fourth receiving module, configured to receive an operation instruction sent by a cloud server, the operation instruction including operation content; and

an execution module, configured to execute the operation content through a target application, the target application being an application program for voice control.

According to a seventh aspect of the embodiments of the present disclosure, a voice control device is provided, including:

a processor; and

a memory for storing processor-executable instructions;

wherein the processor is configured to:

According to an eighth aspect of the embodiments of the present disclosure, a voice control device is provided, including:

a processor; and

a memory for storing processor-executable instructions;

wherein the processor is configured to:

send the voice instruction recognition result to a cloud server.

According to a ninth aspect of the embodiments of the present disclosure, a voice control device is provided, including:

a processor; and

a memory for storing processor-executable instructions;

wherein the processor is configured to:

According to a tenth aspect of the embodiments of the present disclosure, a computer-readable storage medium is provided, on which computer instructions are stored, the instructions implementing the steps of the method of the first aspect when executed by a processor.

According to an eleventh aspect of the embodiments of the present disclosure, a computer-readable storage medium is provided, on which computer instructions are stored, the instructions implementing the steps of the method of the second aspect when executed by a processor.

According to a twelfth aspect of the embodiments of the present disclosure, a computer-readable storage medium is provided, on which computer instructions are stored, the instructions implementing the steps of the method of the third aspect when executed by a processor.

The technical solution provided by the embodiments of the present disclosure can have the following beneficial effects. The first terminal is associated with the first user account, the second terminal is associated with the second user account, and a friend relationship is established between the first user account and the second user account. When the cloud server obtains the second user account and the operation content by performing semantic processing on the voice instruction recognition result sent by the first terminal, it sends the operation content to the second terminal associated with the second user account that is in a friend relationship with the first user account, and instructs the second terminal to execute the operation content through the target application. The user can thus interact, through the target application of the first terminal, with the target application of the friend's second terminal, which meets the user's need to interact with a friend's voice assistant through the user's own voice assistant and improves the user experience.

It should be understood that the above general description and the following detailed description are only exemplary and explanatory and do not limit the present disclosure.

Brief description of the drawings

The accompanying drawings, which are incorporated in and constitute a part of this specification, illustrate embodiments consistent with the present disclosure and, together with the specification, serve to explain the principles of the present disclosure.

Fig. 1 is an application scenario diagram of a voice control method according to an exemplary embodiment.

Fig. 2 is a flowchart of a voice control method according to an exemplary embodiment.

Fig. 3 is a flowchart of a voice control method according to an exemplary embodiment.

Fig. 4 is a flowchart of a voice control method according to an exemplary embodiment.

Fig. 5 is a flowchart of a voice control method according to an exemplary embodiment.

Fig. 6 is a block diagram of a voice control device according to an exemplary embodiment.

Fig. 7 is a block diagram of a voice control device according to an exemplary embodiment.

Fig. 8 is a block diagram of a voice control device according to an exemplary embodiment.

Fig. 9 is a block diagram of a voice control device according to an exemplary embodiment.

Fig. 10 is a block diagram of a voice control device according to an exemplary embodiment.

Fig. 11 is a block diagram of a voice control device according to an exemplary embodiment.

Fig. 12 is a block diagram of a voice control device according to an exemplary embodiment.

Fig. 13 is a block diagram of a voice control device according to an exemplary embodiment.

Fig. 14 is a block diagram of a voice control device according to an exemplary embodiment.

Detailed description

Exemplary embodiments are described in detail here, examples of which are illustrated in the accompanying drawings. When the following description refers to the drawings, the same numerals in different drawings indicate the same or similar elements unless otherwise indicated. The implementations described in the following exemplary embodiments do not represent all implementations consistent with the present disclosure. Rather, they are merely examples of devices and methods consistent with some aspects of the present disclosure as detailed in the appended claims.

In the related art, after a voice assistant receives a user's voice command, it controls the user's device to perform the operation corresponding to the voice command. However, a voice assistant in the related art can only control the user's own devices and cannot meet the user's need to interact with a friend's voice assistant through the user's own voice assistant, which degrades the user experience.

To solve the above problem, an embodiment of the present disclosure provides a voice control method applied to a cloud server. The method includes: receiving a voice instruction recognition result sent by a first terminal, the first terminal being associated with a first user account; performing semantic processing on the voice instruction recognition result to obtain operation information, the operation information including a second user account and operation content; when the second user account and the first user account are in a friend relationship, searching for a second terminal associated with the second user account; and sending an operation instruction carrying the operation content to the second terminal, instructing the second terminal to execute the operation content through a target application, the target application being an application program for voice control.

Fig. 1 shows an optional application scenario of the voice control method in the embodiments of the present disclosure. The application scenario shown in Fig. 1 includes a terminal 11, a terminal 12, a network 13, and a cloud server 14, where the terminal 11 and the terminal 12 communicate through the cloud server 14. A terminal is, for example, a smartphone, smart speaker, smart television, tablet computer, laptop, wearable device (such as a wristband or smart glasses), or other electronic device capable of running an application program for voice control, such as a voice assistant. The cloud server 14 may be a single server or a server cluster composed of multiple servers. The network 13 may be, for example, a mobile communication network such as 2G/3G/4G/5G, or a wired network. It should be noted that the application scenario shown in Fig. 1 is only one possible example scenario for the voice control method in the embodiments of the present disclosure; other application scenarios may include devices not shown in Fig. 1.

The voice control method provided by the embodiments of the present disclosure can be applied in the above scenario. The first terminal is associated with the first user account, the second terminal is associated with the second user account, and a friend relationship is established between the first user account and the second user account. When the cloud server obtains the second user account and the operation content by performing semantic processing on the voice instruction recognition result sent by the first terminal, it sends the operation content to the second terminal associated with the second user account that is in a friend relationship with the first user account, and instructs the second terminal to execute the operation content through the target application. The user can thus interact, through the target application of the first terminal, with the target application of the friend's second terminal, which meets the user's need to interact with a friend's voice assistant through the user's own voice assistant and improves the user experience.

Based on the above analysis, the following specific embodiments are proposed.

Fig. 2 is a flowchart of a voice control method according to an exemplary embodiment. The method may be executed by a cloud server. As shown in Fig. 2, the method includes the following steps 201-204:

In step 201, a voice instruction recognition result sent by a first terminal is received, the first terminal being associated with a first user account.

For example, a user logs in to the target application of the first terminal with the first user account, so that the first terminal is associated with the first user account, and then issues a voice instruction to the target application. The first terminal analyzes the voice instruction to obtain a voice instruction recognition result and sends the voice instruction recognition result to the cloud server. The target application is an application program for voice control, such as a voice assistant.

In step 202, semantic processing is performed on the voice instruction recognition result to obtain operation information, the operation information including a second user account and operation content.

For example, after receiving the voice instruction recognition result sent by the first terminal, the cloud server performs semantic processing on the voice instruction recognition result to obtain the operation information, which includes the second user account and the operation content.
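The extraction of a second user account and operation content in step 202 can be illustrated with a minimal Python sketch. The disclosure does not specify the semantic processing technique; here a regular expression stands in for it, and the phrase pattern "tell <contact> ...", the `CONTACTS` table, and all function and field names are assumptions made only for illustration.

```python
import re
from dataclasses import dataclass
from typing import Optional

# Hypothetical nickname -> user account table; not defined by the disclosure.
CONTACTS = {"alice": "account_alice", "bob": "account_bob"}


@dataclass
class OperationInfo:
    second_user_account: str
    operation_content: str


# Toy pattern standing in for real semantic processing (assumption).
_PATTERN = re.compile(
    r"^\s*tell\s+(\w+)\s+(?:that\s+|to\s+)?(.+?)\s*$", re.IGNORECASE
)


def semantic_process(recognition_result: str) -> Optional[OperationInfo]:
    """Return operation information, or None if no target account is found."""
    match = _PATTERN.match(recognition_result)
    if match is None:
        return None
    account = CONTACTS.get(match.group(1).lower())
    if account is None:
        return None
    return OperationInfo(account, match.group(2))
```

For instance, `semantic_process("Tell Alice that dinner is at seven")` yields the account mapped to "alice" together with the remaining text as operation content, while text that names no known contact yields `None`.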

In step 203, when the second user account and the first user account are in a friend relationship, a second terminal associated with the second user account is searched for.

For example, the cloud server knows in advance the friend relationships between user accounts and the associations between user accounts and terminals. The cloud server determines whether the second user account and the first user account are in a friend relationship. When they are, the cloud server searches for the second terminal associated with the second user account; when they are not, the flow ends.
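The lookup in step 203 can be sketched as follows. This is a minimal illustration that assumes the friend relationships and account-to-terminal associations are held in in-memory tables; the table names, account names, and terminal identifiers are hypothetical, and a real cloud server would more likely query a database.

```python
from typing import Optional

# Hypothetical stores known to the cloud server in advance (assumption).
FRIENDSHIPS = {frozenset({"account_a", "account_b"})}
ACCOUNT_TERMINALS = {
    "account_a": "terminal_1",
    "account_b": "terminal_2",
    "account_c": "terminal_3",
}


def are_friends(first_account: str, second_account: str) -> bool:
    """Friend relationships are symmetric, so an unordered pair is stored."""
    return frozenset({first_account, second_account}) in FRIENDSHIPS


def find_second_terminal(first_account: str, second_account: str) -> Optional[str]:
    """Return the terminal of the second account, or None when the flow ends."""
    if not are_friends(first_account, second_account):
        return None  # not friends: the flow ends
    return ACCOUNT_TERMINALS.get(second_account)
```

Storing each pair as a `frozenset` is one simple way to make the friendship check independent of which account issued the request.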

For example, the first user logs in to the target application of the first terminal with the first user account, and the second user logs in to the target application of the second terminal with the second user account. The first terminal may request the second terminal, through the cloud server, to establish a friend relationship between the second user account and the first user account; alternatively, the second terminal may request the first terminal, through the cloud server, to establish that friend relationship.

For example, the first terminal obtains the second user account and sends to the cloud server a friend verification request carrying the first user account and the second user account, the friend verification request being used to request establishment of a friend relationship between the second user account and the first user account. The cloud server receives the friend verification request sent by the first terminal and forwards it to the second terminal associated with the second user account. The second terminal, with which the second user account is associated, receives the friend verification request sent by the cloud server and establishes the friend relationship between the first user account and the second user account according to the request.
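The friend verification exchange described above can be sketched as a small in-process simulation. The class and method names are hypothetical; the `accept` flag (modeling the second user's confirmation) and the step in which the cloud server records the established relationship are assumptions added for illustration, not details stated in the disclosure.

```python
from dataclasses import dataclass, field


@dataclass(frozen=True)
class FriendVerificationRequest:
    first_user_account: str
    second_user_account: str


@dataclass
class SecondTerminal:
    account: str
    friends: set = field(default_factory=set)

    def on_friend_verification(self, request, accept: bool) -> bool:
        """Establish the friend relationship if the request targets this account."""
        if request.second_user_account != self.account or not accept:
            return False
        self.friends.add(request.first_user_account)
        return True


class CloudServer:
    def __init__(self):
        self.terminals = {}       # user account -> associated terminal
        self.friendships = set()  # unordered account pairs (assumption)

    def register(self, terminal: SecondTerminal) -> None:
        self.terminals[terminal.account] = terminal

    def forward_friend_verification(self, request, accept: bool = True) -> bool:
        """Forward the request to the terminal associated with the second account."""
        terminal = self.terminals.get(request.second_user_account)
        if terminal is None:
            return False
        if terminal.on_friend_verification(request, accept):
            self.friendships.add(
                frozenset({request.first_user_account, request.second_user_account})
            )
            return True
        return False
```

In this sketch the first terminal's role is reduced to constructing a `FriendVerificationRequest` and handing it to the server.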

In step 204, an operation instruction carrying the operation content is sent to the second terminal, the operation instruction being used to instruct the second terminal to execute the operation content through the target application, the target application being an application program for voice control.

For example, the operation information further includes at least one or a combination of the following: a sending time of the operation content or an execution time of the operation content.

For example, the type of the operation content includes at least one or a combination of the following: message content, voice message content, content of a to-do item to be added to a calendar, or to-do reminder content.
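The fields of the operation information listed above can be gathered into a single record, as in the following sketch. The field names, the enum values, and the `is_due` helper (which treats a missing sending time as "send immediately") are assumptions for illustration; the disclosure only names the kinds of information involved.

```python
from dataclasses import dataclass
from datetime import datetime
from enum import Enum
from typing import Optional


class OperationType(Enum):
    MESSAGE = "message"
    VOICE_MESSAGE = "voice_message"
    ADD_CALENDAR_TODO = "add_calendar_todo"
    TODO_REMINDER = "todo_reminder"


@dataclass
class OperationInfo:
    second_user_account: str
    operation_type: OperationType
    operation_content: str
    send_time: Optional[datetime] = None       # when to send the content
    execution_time: Optional[datetime] = None  # when the terminal should act

    def is_due(self, now: datetime) -> bool:
        """Assumption: no sending time means the content may be sent at once."""
        return self.send_time is None or now >= self.send_time
```

A server following this sketch could hold back an operation whose `send_time` lies in the future and dispatch it once `is_due` returns true.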

For example, the cloud server sends the operation instruction carrying the operation content to the second terminal, instructing the second terminal to execute the operation content through the target application. After receiving the operation instruction sent by the cloud server, the second terminal executes the operation content through the target application.

For example, sending the operation instruction to the second terminal may be implemented as follows. The cloud server determines whether the first user account has permission to control the second terminal to execute the operation content through the target application. When it is determined that the first user account has that permission, the operation instruction is sent to the second terminal; when it is determined that the first user account does not have that permission, the flow ends.
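The permission check that gates the sending of the operation instruction can be sketched as below. The disclosure does not say how permissions are stored or at what granularity; the per-operation-type `PERMISSIONS` table, the dictionary shape of the instruction, and the `outbox` list standing in for the network are all assumptions.

```python
# Hypothetical permission table: (controlling account, controlled account)
# -> operation types the controlling account may trigger (assumption).
PERMISSIONS = {("account_a", "account_b"): {"message", "todo_reminder"}}


def has_permission(first_account: str, second_account: str, op_type: str) -> bool:
    return op_type in PERMISSIONS.get((first_account, second_account), set())


def send_operation_instruction(first_account, second_account, second_terminal,
                               op_type, content, outbox) -> bool:
    """Append the instruction to outbox (a stand-in for the network) if permitted."""
    if not has_permission(first_account, second_account, op_type):
        return False  # no permission: the flow ends
    outbox.append({"terminal": second_terminal, "type": op_type, "content": content})
    return True
```

Keying the table on an ordered pair reflects that permission to control a friend's terminal need not be mutual, which is a design choice of this sketch rather than a requirement of the disclosure.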

In the technical solution provided by the embodiments of the present disclosure, the first terminal is associated with the first user account, the second terminal is associated with the second user account, and a friend relationship is established between the first user account and the second user account. When the cloud server obtains the second user account and the operation content by performing semantic processing on the voice instruction recognition result sent by the first terminal, it sends the operation content to the second terminal associated with the second user account that is in a friend relationship with the first user account, and instructs the second terminal to execute the operation content through the target application. The user can thus interact, through the target application of the first terminal, with the target application of the friend's second terminal, which meets the user's need to interact with a friend's voice assistant through the user's own voice assistant and improves the user experience.

Fig. 3 is a flowchart of a voice control method according to an exemplary embodiment. The method may be executed by a first terminal. As shown in Fig. 3, the method includes the following steps 301-303:

In step 301, a voice instruction is received through a target application, the target application being an application program for voice control, and the first terminal being associated with a first user account.

In step 302, the voice instruction is analyzed to obtain a voice instruction recognition result.

In step 303, the voice instruction recognition result is sent to a cloud server.

In the technical solution provided by the embodiments of the present disclosure, the first terminal is associated with the first user account, analyzes the voice instruction, and sends the voice instruction recognition result to the cloud server, so that, by means of the cloud server, the user's need to interact with a friend's target application through the user's own target application is met.
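Steps 301-303 on the first terminal can be sketched as a single function. The recognizer and the transport to the cloud server are passed in as callables because the disclosure does not tie them to a particular engine or protocol; the function name and the payload keys are hypothetical.

```python
from typing import Callable


def handle_voice_instruction(
    audio: bytes,
    first_user_account: str,
    recognize: Callable[[bytes], str],
    send_to_cloud: Callable[[dict], None],
) -> dict:
    """Analyze a voice instruction and send the recognition result to the cloud."""
    recognition_result = recognize(audio)  # step 302
    payload = {
        "first_user_account": first_user_account,
        "recognition_result": recognition_result,
    }
    send_to_cloud(payload)                 # step 303
    return payload
```

Including the first user account in the payload is one way for the cloud server to learn which account the first terminal is associated with; the disclosure leaves the mechanism open.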

Fig. 4 is a flowchart of a voice control method according to an exemplary embodiment. The method may be executed by a second terminal. As shown in Fig. 4, the method includes the following steps 401-402:

In step 401, an operation instruction sent by a cloud server is received, the operation instruction including operation content.

In step 402, the operation content is executed through a target application, the target application being an application program for voice control.

In the technical solution provided by the embodiments of the present disclosure, the cloud server sends an operation instruction including operation content to the second terminal according to the voice instruction recognition result sent by the first terminal, and the second terminal, after receiving the operation instruction sent by the cloud server, executes the operation content through the target application, so that the user's need to interact with a friend's target application through the user's own target application is met.
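How the target application on the second terminal might execute operation content of different types can be sketched with a simple dispatch table. The class, the handler names, and the dictionary shape of the operation instruction are hypothetical; only two of the content types named in the disclosure are handled, to keep the sketch short.

```python
class TargetApplication:
    """Toy voice-control application on the second terminal (assumption)."""

    def __init__(self):
        self.inbox = []     # received message content
        self.calendar = []  # to-do items added to the calendar
        self._handlers = {
            "message": self._show_message,
            "add_calendar_todo": self._add_todo,
        }

    def _show_message(self, content: str) -> None:
        self.inbox.append(content)

    def _add_todo(self, content: str) -> None:
        self.calendar.append(content)

    def execute(self, operation_instruction: dict) -> bool:
        """Run the handler for the instruction's type; False if unsupported."""
        handler = self._handlers.get(operation_instruction.get("type"))
        if handler is None:
            return False
        handler(operation_instruction["content"])
        return True
```

Unsupported types are rejected rather than raising, so a terminal running an older application version could ignore content it does not understand.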

Fig. 5 is a flowchart of a voice control method according to an exemplary embodiment. The method is implemented through cooperation of the first terminal, the cloud server, and the second terminal. As shown in Fig. 5, on the basis of the above embodiments, the voice control method of the present disclosure may include the following steps 501-507:

In step 501, the first terminal receives a voice instruction through a target application, the target application being an application program for voice control, and the first terminal being associated with a first user account.

For example, an offline standby wake-up mechanism is needed so that the first terminal can detect the wake word (defined by the developer) through continuous listening. On the development side, a baseline model is trained on everyday conversational speech, and a command-word model is trained on recordings of the wake word. Waking up amounts to computing how well the recorded audio matches each of the two models; if the match between the spoken wake-up command and the trained command-word model reaches a threshold, the first terminal is woken up. After being woken up, the first terminal enters listening mode. At the voice input end, because the sound source is the most important input, distortion should be reduced as far as possible: echo cancellation, noise reduction, sound source enhancement, sound source filtering, and similar means can be used to ensure the quality of the sound source, and the most commonly used hardware solution is a microphone array.
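The threshold comparison between the command-word model and the baseline model can be illustrated with a toy log-likelihood ratio test. This is a sketch under strong assumptions: each model is reduced to a one-dimensional Gaussian over a single per-frame feature, and the function names, model parameters, and threshold value are hypothetical. Real wake-word detectors use far richer acoustic models.

```python
import math
from typing import Sequence, Tuple


def gaussian_log_likelihood(frames: Sequence[float], mean: float, std: float) -> float:
    """Average per-frame log-likelihood under a 1-D Gaussian (toy model)."""
    if not frames:
        raise ValueError("frames must not be empty")
    total = 0.0
    for x in frames:
        total += -0.5 * math.log(2 * math.pi * std ** 2) - (x - mean) ** 2 / (2 * std ** 2)
    return total / len(frames)


def should_wake(
    frames: Sequence[float],
    command_model: Tuple[float, float],
    baseline_model: Tuple[float, float],
    threshold: float = 1.0,
) -> bool:
    """Wake when the command-word model beats the baseline by the threshold."""
    score = gaussian_log_likelihood(frames, *command_model) - gaussian_log_likelihood(
        frames, *baseline_model
    )
    return score >= threshold
```

Comparing against a baseline rather than using the command-word score alone keeps ordinary conversation, which the baseline explains well, from triggering a wake-up.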

In step 502, the first terminal analyzes the voice instruction to obtain a voice instruction recognition result.

For example, after the voice command is received and processed, speech recognition (Voice Recognition) follows. The first stage must implement automatic speech recognition (ASR). The principle of speech recognition rests mainly on three elements: frames, states, and phonemes. First, the sound is cut into frames; several frames of speech form a state, every three states correspond to a phoneme, and several phonemes combine into a word. This yields the speech recognition result, which is output as a piece of text.

To understand a large amount of speech accurately, a hidden Markov model (HMM) is used to build a state network, and the path that best matches the sound is then searched for in that network. Restricting results to the state network makes recognition feasible; to recognize arbitrary text, the network must be built large enough to contain a path for any text. This involves extensive training and the processing of massive amounts of data; the more data, the higher the accuracy.
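Searching a state network for the best-matching path is commonly done with the Viterbi algorithm; the disclosure does not name the search method, so the following is a sketch of that standard technique rather than the method actually used. The two states, three observation symbols, and all probabilities are made-up toy values, and probabilities must be strictly positive because the sketch works in log space.

```python
import math


def viterbi(observations, states, start_p, trans_p, emit_p):
    """Return the most likely state path for a sequence of observations."""
    # best[t][s] = (log-probability of the best path ending in s at time t,
    #               predecessor state on that path)
    best = [{
        s: (math.log(start_p[s]) + math.log(emit_p[s][observations[0]]), None)
        for s in states
    }]
    for obs in observations[1:]:
        column = {}
        for s in states:
            prev, score = max(
                ((p, best[-1][p][0] + math.log(trans_p[p][s])) for p in states),
                key=lambda item: item[1],
            )
            column[s] = (score + math.log(emit_p[s][obs]), prev)
        best.append(column)
    # Backtrack from the best final state.
    path = [max(states, key=lambda s: best[-1][s][0])]
    for column in reversed(best[1:]):
        path.append(column[path[-1]][1])
    path.reverse()
    return path


# Toy state network (all values are illustrative assumptions).
STATES = ("s1", "s2")
START_P = {"s1": 0.6, "s2": 0.4}
TRANS_P = {"s1": {"s1": 0.7, "s2": 0.3}, "s2": {"s1": 0.4, "s2": 0.6}}
EMIT_P = {
    "s1": {"f1": 0.5, "f2": 0.4, "f3": 0.1},
    "s2": {"f1": 0.1, "f2": 0.3, "f3": 0.6},
}
```

With these toy values, `viterbi(["f1", "f2", "f3"], STATES, START_P, TRANS_P, EMIT_P)` returns `["s1", "s1", "s2"]`. In a recognizer the observations would be acoustic frames and the states would belong to phoneme models, so the recovered path determines the recognized text.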

In step 503, the first terminal sends the voice instruction recognition result to the cloud server.

In step 504, the cloud server receives the voice instruction recognition result sent by the first terminal, the first terminal being associated with the first user account, and performs semantic processing on the voice instruction recognition result to obtain operation information, the operation information including a second user account and operation content.

For example, the words or Chinese characters recognized by ASR need to undergo natural language understanding (NLU). However, current technology is still far from the level of NLU and can only be said to have reached the stage of natural language processing (NLP). Existing NLP mainly builds a huge corpus and, through continuous training and analysis of grammar, syntax, semantics, and so on, uses statistical principles and deep learning to achieve a simple form of natural semantic processing and understanding. For a machine to understand semantics as a human does, it would need a great deal of learning, possibly together with various sensors, before it could truly form human-like thought and experience how a person feels about a thing or an utterance, and thereby achieve real emotional communication.

After semantic processing, dialogue management and language synthesis must be carried out in combination with the context, together with contextual understanding and contextual self-correction, to produce relatively accurate feedback. Combined with different scenarios and products, different feedback can be formed, realizing voice interaction between human and machine. This is the complete flow of voice interaction: whether the goal is to control household appliances by voice, to converse with a machine by voice, or to perform a voice search with a search engine, this whole flow must be carried out. Every link in it is key to competition in today's voice market, because accurate and timely voice feedback is extremely important to users.

For example, when the first terminal receives a voice instruction through the target application, it may send the voice instruction directly to the cloud server. The cloud server then analyzes the voice instruction to obtain the voice instruction recognition result and performs semantic processing on the recognition result to obtain the operation information.

For example, when the first terminal receives a voice instruction through the target application, it may analyze the voice instruction to obtain the voice instruction recognition result and then perform semantic processing on the recognition result to obtain the operation information. The first terminal then sends the operation information to the cloud server.

In step 505, when the second user account and the first user account are in a friend relation, the cloud server searches for the second terminal associated with the second user account.

In step 506, the cloud server sends an operation instruction carrying the operation content to the second terminal; the operation instruction is used to instruct the second terminal to execute the operation content through the intended application, the intended application being an application program for voice control.

For example, sending the operation instruction to the second terminal includes: judging whether the first user account has permission to control the second terminal to execute the operation content through the intended application; and, when it is judged that the first user account has that permission, sending the operation instruction to the second terminal.
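
The permission check described above can be sketched as a gate in front of the send step. This is an illustration only: the disclosure does not say how permissions are granted or stored, so the in-memory `PermissionStore`, the directional grant, and the `send` callback are all assumptions.

```python
class PermissionStore:
    """Hypothetical in-memory stand-in for wherever grants are actually kept."""

    def __init__(self):
        self._grants = set()   # (controlling account, controlled account) pairs

    def grant(self, controller, controlled):
        self._grants.add((controller, controlled))

    def allows(self, controller, controlled):
        return (controller, controlled) in self._grants


def send_if_permitted(store, first_account, second_account, instruction, send):
    """Forward the operation instruction only if first_account may control second_account."""
    if not store.allows(first_account, second_account):
        return False           # no permission: nothing is sent
    send(second_account, instruction)
    return True
```

Treating the grant as directional is a design choice of this sketch: being allowed to control a friend's terminal does not imply the reverse.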

In step 507, the second terminal receives the operation instruction sent by the cloud server, the operation instruction including the operation content, and executes the operation content through the intended application, the intended application being an application program for voice control.

In the technical solution provided by this embodiment of the disclosure, the first user logs in to the intended application of the first terminal with the first user account, and the second user logs in to the intended application of the second terminal with the second user account. The cloud server performs semantic processing on the phonetic order recognition result sent by the first terminal to obtain the second user account and the operation content, and sends the operation content to the second terminal associated with the second user account that is in a friend relation with the first user account, instructing the second terminal to execute the operation content through the intended application. The user can thus interact, through the intended application of the first terminal, with the intended application of a friend's second terminal, which meets the user's need to interact with a friend's voice assistant through the user's own voice assistant and improves the user experience.
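
The server-side flow summarized above (friend check, terminal lookup, dispatch) can be sketched as follows. The class, its in-memory tables, and the use of a plain list as a terminal's inbox are assumptions made for illustration; semantic processing is taken as already done, so `handle` receives parsed operation information.

```python
class CloudServer:
    """Hypothetical in-memory model of the cloud server's routing role."""

    def __init__(self):
        self.friends = set()     # frozenset({account_a, account_b}) pairs
        self.terminals = {}      # account -> inbox (list) of its terminal

    def register(self, account):
        self.terminals[account] = []
        return self.terminals[account]

    def add_friend(self, a, b):
        self.friends.add(frozenset((a, b)))

    def handle(self, first_account, operation_info):
        """operation_info: dict with 'second_account' and 'content'."""
        second = operation_info["second_account"]
        if frozenset((first_account, second)) not in self.friends:
            return False                        # not friends: nothing is sent
        inbox = self.terminals.get(second)      # look up the second terminal
        if inbox is None:
            return False                        # no terminal for that account
        inbox.append({"from": first_account,
                      "content": operation_info["content"]})
        return True
```

A short usage example:

```python
server = CloudServer()
inbox = server.register("lao_li")
server.add_friend("wang", "lao_li")
server.handle("wang", {"second_account": "lao_li", "content": "meeting at 10 pm"})
```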

In one exemplary embodiment, the sound control method of this disclosure may comprise the following steps:

Step 1) The first user logs in to the intended application of the first terminal with the first user account, inputs the second user account on the operation interface of the intended application, and triggers the good friend's checking request flow; the intended application is an application program for voice control. The good friend's checking request flow is as follows: the first terminal sends to the cloud server a good friend's checking request carrying the first user account and the second user account, the request being used to ask that a friend relation be established between the second user account and the first user account; the cloud server receives the good friend's checking request sent by the first terminal and forwards it to the second terminal associated with the second user account; the second terminal receives the good friend's checking request sent by the cloud server and, according to it, establishes the friend relation between the first user account and the second user account; the second terminal then sends a good friend's verification response to the first terminal through the cloud server.
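
The request, forward, and response sequence of step 1) can be sketched with the cloud server acting as a relay. Two points are assumptions of this sketch rather than statements of the disclosure: the accept/reject choice on the second terminal, and storing the resulting friend relation on the server side.

```python
class FriendRelay:
    """Hypothetical cloud-server role in the good friend's checking request flow."""

    def __init__(self):
        self.pending = {}      # second account -> list of requesting accounts
        self.friends = set()   # frozenset({account_a, account_b}) pairs

    def request(self, first, second):
        """First terminal -> server: queue the request for the second terminal."""
        self.pending.setdefault(second, []).append(first)

    def respond(self, second, first, accept):
        """Second terminal -> server: returns the response relayed to the first terminal."""
        if first not in self.pending.get(second, []):
            raise KeyError("no pending request from %r to %r" % (first, second))
        self.pending[second].remove(first)
        if accept:
            self.friends.add(frozenset((first, second)))
        return {"to": first, "from": second, "accepted": accept}

    def are_friends(self, a, b):
        return frozenset((a, b)) in self.friends
```

Storing each pair as a `frozenset` makes the relation symmetric, which matches the later check that the two accounts "are in a friend relation" regardless of who sent the request.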

After the first user has added the second user's second user account as a friend through the first user account, the first user can, through the first user's own voice assistant, leave a message or voice mail with the second user's voice assistant, or add message information to the second user's calendar or backlog.

Step 2) The first user sends a phonetic order to the intended application of the first terminal; the first terminal analyzes the phonetic order and obtains a phonetic order recognition result. For example, assume the phonetic order recognition result is "Tell Lao Li, there is a meeting in the VIP meeting room at 10 pm", or assume it is "Xiao Ai Tongxue, say happy birthday to Mom at midnight!".

Step 3) The first terminal sends the phonetic order recognition result to the cloud server.

Step 4) The cloud server performs semantic processing on the phonetic order recognition result and obtains the second user account and the operation content.

For example, after performing semantic processing on the phonetic order recognition result "Tell Lao Li, there is a meeting in the VIP meeting room at 10 pm", the cloud server determines that the second user account is "Lao Li" and that the operation content is: write the text "there is a meeting in the VIP meeting room at 10 pm" into the calendar or backlog of the second terminal.

For another example, after performing semantic processing on the phonetic order recognition result "Xiao Ai Tongxue, say happy birthday to Mom at midnight!", the cloud server determines that the second user account is "Mom" and that the operation content is: at midnight, send the text "Happy birthday!" to the terminal associated with the second user account, and have that terminal convert the text "Happy birthday!" into voice and play it.
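
The mapping from a recognition result to a second user account and operation content, as in the two examples above, can be illustrated with a toy rule-based parser. This is only a sketch: the disclosure does not specify how semantic processing is implemented, the patterns work on simplified English renderings without the wake word, and the names `ParsedInstruction`, `add_backlog` and `speak_text` are hypothetical.

```python
import re
from dataclasses import dataclass
from typing import Optional


@dataclass
class ParsedInstruction:
    second_account: str
    action: str                        # "add_backlog" or "speak_text"
    text: str
    execute_at: Optional[str] = None   # "HH:MM", or None for "right away"


# Toy English patterns; real semantic processing would not be regex-based.
TELL = re.compile(r"^tell (?P<who>[^,]+),\s*(?P<text>.+)$", re.IGNORECASE)
SAY = re.compile(
    r"^at (?P<when>midnight|\d{1,2}:\d{2}),? say (?P<text>.+?) to (?P<who>.+?)[.!]?$",
    re.IGNORECASE,
)


def parse(recognition_result: str) -> Optional[ParsedInstruction]:
    """Return the parsed instruction, or None if no pattern applies."""
    s = recognition_result.strip()
    m = TELL.match(s)
    if m:
        return ParsedInstruction(m["who"].strip(), "add_backlog", m["text"].strip())
    m = SAY.match(s)
    if m:
        when = "00:00" if m["when"].lower() == "midnight" else m["when"]
        return ParsedInstruction(m["who"], "speak_text", m["text"], when)
    return None
```

The optional `execute_at` field reflects the second example, where the operation content carries a time at which it should take effect.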

Step 5) When the second user account and the first user account are in a friend relation, the cloud server sends an operation instruction carrying the operation content to the second terminal associated with the second user account.

Step 6) The second terminal judges whether the first user account has permission to control the second terminal to execute the operation content through the intended application. When it is judged that the first user account has that permission, the second terminal executes the operation content through the intended application; when it is judged that the first user account does not have that permission, the flow ends.

For example, the process by which the second terminal executes the operation content through the intended application includes: analyzing the operation content and identifying its type; when the type of the operation content is message content or voice mail content, converting the operation content into voice by text-to-speech (TTS) technology and playing it; and when the type of the operation content is backlog content added to the calendar or backlog reminder content, calling another application in the second terminal, such as the calendar, through the intended application to execute the operation content.
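
The type-based handling above amounts to a small dispatch on the second terminal. In the sketch below the type strings, the dictionary layout of an operation, and the `tts` and `calendar` callables are all hypothetical placeholders for the terminal's real speech and calendar facilities.

```python
def execute_operation(op, tts, calendar):
    """Dispatch one operation by type.

    op: dict with 'type', 'text' and an optional 'time'.
    tts: callable(text) that speaks the text.
    calendar: callable(text, time) that stores a calendar or backlog entry.
    """
    kind = op["type"]
    if kind in ("message", "voice_mail"):
        tts(op["text"])                          # convert text to voice and play it
        return "spoken"
    if kind in ("calendar_backlog", "backlog_reminder"):
        calendar(op["text"], op.get("time"))     # hand over to another application
        return "scheduled"
    raise ValueError("unsupported operation type: %r" % (kind,))
```

Passing the two facilities in as callables keeps the dispatch independent of any particular TTS engine or calendar application.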

In the embodiments of the disclosure, a user logs in to an application program for voice control, such as a voice assistant, with a user account and can add other user accounts as friends. By sending a phonetic order to the voice assistant on the user's own terminal, the user can control the voice assistant on a friend's terminal to perform operations such as leaving a message, leaving voice mail, sending a timed message, or adding a calendar or backlog item. The voice assistant is thus no longer limited to controlling its own single device: after friends are added, the voice assistants of different terminals are linked together, realizing multi-device voice interaction, making the voice assistant more powerful, meeting more "personal assistant" needs, and bringing users a better experience.

The following are device embodiments of the present disclosure, which can be used to execute the method embodiments of the present disclosure.

Fig. 6 is a block diagram of a phonetic controller according to an exemplary embodiment. The device may be implemented in various ways, for example by implementing all of its components in the cloud server, or by implementing its components on the cloud server side in a coupled manner. The device can implement the above method of this disclosure through software, hardware, or a combination of both. As shown in Fig. 6, the phonetic controller includes a first receiving module 601, a processing module 602, a searching module 603 and a first sending module 604, wherein:

The first receiving module 601 is configured to receive the phonetic order recognition result sent by the first terminal, the first terminal being associated with the first user account;

The processing module 602 is configured to perform semantic processing on the phonetic order recognition result to obtain operation information, the operation information including the second user account and the operation content;

The searching module 603 is configured to search for the second terminal associated with the second user account when the second user account and the first user account are in a friend relation;

The first sending module 604 is configured to send an operation instruction carrying the operation content to the second terminal, the operation instruction being used to instruct the second terminal to execute the operation content through the intended application, the intended application being an application program for voice control.

The device provided by this embodiment of the disclosure can be used to execute the technical solution of the embodiment shown in Fig. 2; its manner of execution and beneficial effects are similar and are not repeated here.

In one possible embodiment, the first sending module 604 judges whether the first user account has permission to control the second terminal to execute the operation content through the intended application and, when it is judged that the first user account has that permission, sends the operation instruction to the second terminal.

In one possible embodiment, as shown in Fig. 7, the phonetic controller shown in Fig. 6 may further include:

A second receiving module 701, configured to receive the good friend's checking request, sent by the first terminal, carrying the first user account and the second user account;

A forwarding module 702, configured to forward the good friend's checking request to the second terminal associated with the second user account, the good friend's checking request being used to request establishment of the friend relation between the second user account and the first user account.

Fig. 8 is a block diagram of a phonetic controller according to an exemplary embodiment. The device may be implemented in various ways, for example by implementing all of its components in the terminal, or by implementing its components on the terminal side in a coupled manner. The device can implement the above method of this disclosure through software, hardware, or a combination of both. As shown in Fig. 8, the phonetic controller includes a third receiving module 801, an analysis module 802 and a second sending module 803, wherein:

The third receiving module 801 is configured to receive a phonetic order through the intended application, the intended application being an application program for voice control and the first terminal being associated with the first user account;

The analysis module 802 is configured to analyze the phonetic order and obtain a phonetic order recognition result;

The second sending module 803 is configured to send the phonetic order recognition result to the cloud server.

The device provided by this embodiment of the disclosure can be used to execute the technical solution of the embodiment shown in Fig. 3; its manner of execution and beneficial effects are similar and are not repeated here.

Fig. 9 is a block diagram of a phonetic controller according to an exemplary embodiment. The device may be implemented in various ways, for example by implementing all of its components in the terminal, or by implementing its components on the terminal side in a coupled manner. The device can implement the above method of this disclosure through software, hardware, or a combination of both. As shown in Fig. 9, the phonetic controller includes a fourth receiving module 901 and an execution module 902, wherein:

The fourth receiving module 901 is configured to receive the operation instruction sent by the cloud server, the operation instruction including the operation content;

The execution module 902 is configured to execute the operation content through the intended application, the intended application being an application program for voice control.

The device provided by this embodiment of the disclosure can be used to execute the technical solution of the embodiment shown in Fig. 4; its manner of execution and beneficial effects are similar and are not repeated here.

Figure 10 is a block diagram of a phonetic controller 1000 according to an exemplary embodiment. The phonetic controller 1000 may be implemented in various ways, for example by implementing all of its components in the cloud server, or by implementing its components on the cloud server side in a coupled manner. The phonetic controller 1000 includes:

Processor 1001；

Memory 1002 for storing processor-executable instruction；

Wherein, processor 1001 is configured as：

Receive the phonetic order recognition result sent by the first terminal, the first terminal being associated with the first user account;

Perform semantic processing on the phonetic order recognition result to obtain operation information, the operation information including the second user account and the operation content;

When the second user account and the first user account are in a friend relation, search for the second terminal associated with the second user account;

Send an operation instruction carrying the operation content to the second terminal, the operation instruction being used to instruct the second terminal to execute the operation content through the intended application, the intended application being an application program for voice control.

In one embodiment, the above processor 1001 may also be configured to:

Judge whether the first user account has permission to control the second terminal to execute the operation content through the intended application;

When it is judged that the first user account has permission to control the second terminal to execute the operation content through the intended application, send the operation instruction to the second terminal.

In one embodiment, the above processor 1001 may also be configured to:

Receive the good friend's checking request, sent by the first terminal, carrying the first user account and the second user account;

Forward the good friend's checking request to the second terminal associated with the second user account, the good friend's checking request being used to request establishment of the friend relation between the second user account and the first user account.

In one embodiment, the operation information further includes at least one of the following items or a combination thereof: the sending time of the operation content, or the execution time of the operation content.

In one embodiment, the type of the operation content includes at least one of the following or a combination thereof: message content, voice mail content, backlog content added to the calendar, or backlog reminder content.
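
The operation information described in the last two embodiments (content type plus optional sending and execution times) can be modeled as a small record. The field names, the enum values, and the `ready_to_send` helper are assumptions of this sketch; the disclosure lists the items but not a data layout.

```python
from dataclasses import dataclass
from datetime import datetime
from enum import Enum
from typing import Optional


class ContentType(Enum):
    MESSAGE = "message"
    VOICE_MAIL = "voice_mail"
    CALENDAR_BACKLOG = "calendar_backlog"
    BACKLOG_REMINDER = "backlog_reminder"


@dataclass
class OperationInfo:
    second_account: str
    content_type: ContentType
    content: str
    send_time: Optional[datetime] = None      # when the server should send it
    execute_time: Optional[datetime] = None   # when the terminal should execute it


def ready_to_send(info: OperationInfo, now: datetime) -> bool:
    """An operation with no sending time is sent at once; otherwise wait until it is due."""
    return info.send_time is None or now >= info.send_time
```

Keeping the two times separate allows, for instance, an instruction to be delivered immediately but executed later on the second terminal.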

Figure 11 is a block diagram of a phonetic controller 1100 according to an exemplary embodiment. The phonetic controller 1100 may be implemented in various ways, for example by implementing all of its components in the terminal, or by implementing its components on the terminal side in a coupled manner. The phonetic controller 1100 includes:

Processor 1101；

Memory 1102 for storing processor-executable instruction；

Wherein, processor 1101 is configured as：

Receive a phonetic order through the intended application, the intended application being an application program for voice control and the first terminal being associated with the first user account;

Analyze the phonetic order and obtain a phonetic order recognition result;

Send the phonetic order recognition result to the cloud server.

In one embodiment, the above processor 1101 may also be configured to:

Obtain the second user account;

Send to the cloud server a good friend's checking request carrying the first user account and the second user account, the good friend's checking request being used to request establishment of the friend relation between the second user account and the first user account.

Figure 12 is a block diagram of a phonetic controller 1200 according to an exemplary embodiment. The phonetic controller 1200 may be implemented in various ways, for example by implementing all of its components in the terminal, or by implementing its components on the terminal side in a coupled manner. The phonetic controller 1200 includes:

Processor 1201；

Memory 1202 for storing processor-executable instruction；

Wherein, processor 1201 is configured as：

Receive the operation instruction sent by the cloud server, the operation instruction including the operation content;

Execute the operation content through the intended application, the intended application being an application program for voice control.

In one embodiment, the above processor 1201 may also be configured to:

Receive the good friend's checking request, sent by the cloud server, carrying the first user account and the second user account, the second user account being associated with the second terminal;

According to the good friend's checking request, establish the friend relation between the first user account and the second user account.

With regard to the devices in the above embodiments, the specific manner in which each module performs its operations has been described in detail in the embodiments of the related method and will not be elaborated here.

Figure 13 is a block diagram of a phonetic controller according to an exemplary embodiment. For example, the phonetic controller 1300 may be a smartphone, smart speaker, smart television, tablet computer, laptop, wearable device (such as a wristband or smart glasses), or other electronic device capable of running an application program for voice control. Referring to Figure 13, the phonetic controller 1300 may include one or more of the following components: processing component 1302, memory 1304, power supply module 1306, multimedia component 1308, audio component 1310, input/output (I/O) interface 1312, sensor module 1314 and communication component 1316.

The processing component 1302 usually controls the overall operation of the phonetic controller 1300, such as operations associated with display, telephone calls, data communication, camera operation and recording. The processing component 1302 may include one or more processors 1320 to execute instructions so as to perform all or part of the steps of the above method. In addition, the processing component 1302 may include one or more modules that facilitate interaction between the processing component 1302 and other components. For example, the processing component 1302 may include a multimedia module to facilitate interaction between the multimedia component 1308 and the processing component 1302.

The memory 1304 is configured to store various types of data to support operation of the phonetic controller 1300. Examples of such data include instructions for any application program or method operated on the phonetic controller 1300, contact data, telephone book data, messages, pictures, videos and so on. The memory 1304 may be implemented by any type of volatile or non-volatile storage device or a combination thereof, such as static random access memory (SRAM), electrically erasable programmable read-only memory (EEPROM), erasable programmable read-only memory (EPROM), programmable read-only memory (PROM), read-only memory (ROM), magnetic memory, flash memory, magnetic disk or optical disc.

The power supply module 1306 provides power to the various components of the phonetic controller 1300. The power supply module 1306 may include a power management system, one or more power supplies, and other components associated with generating, managing and distributing power for the phonetic controller 1300.

The multimedia component 1308 includes a screen that provides an output interface between the phonetic controller 1300 and the user. In some embodiments, the screen may include a liquid crystal display (LCD) and a touch panel (TP). If the screen includes a touch panel, the screen may be implemented as a touch screen to receive input signals from the user. The touch panel includes one or more touch sensors to sense touches, slides and gestures on the touch panel. A touch sensor may sense not only the boundary of a touch or slide action but also the duration and pressure associated with the touch or slide operation. In some embodiments, the multimedia component 1308 includes a front camera and/or a rear camera. When the phonetic controller 1300 is in an operation mode, such as a shooting mode or a video mode, the front camera and/or rear camera can receive external multimedia data. Each front camera and rear camera may be a fixed optical lens system or have focusing and optical zoom capability.

Audio component 1310 is configured as output and/or input audio signal.For example, audio component 1310 includes a wheat Gram wind (MIC), when phonetic controller 1300 is in operation mode, when such as call model, logging mode and speech recognition mode, Microphone is configured as receiving external audio signal.The received audio signal can be further stored in memory 1304 or It is sent via communication component 1316.In some embodiments, audio component 1310 further includes a loud speaker, for exporting audio Signal.

I/O interfaces 1312 provide interface, above-mentioned peripheral interface module between processing component 1302 and peripheral interface module Can be keyboard, click wheel, button etc..These buttons may include but be not limited to：Home button, volume button, start button and Locking press button.

The sensor module 1314 includes one or more sensors for providing status assessments of various aspects of the phonetic controller 1300. For example, the sensor module 1314 can detect the open/closed state of the phonetic controller 1300 and the relative positioning of components, such as the display and keypad of the phonetic controller 1300. The sensor module 1314 can also detect a change in position of the phonetic controller 1300 or of one of its components, the presence or absence of user contact with the phonetic controller 1300, the orientation or acceleration/deceleration of the phonetic controller 1300, and temperature changes of the phonetic controller 1300. The sensor module 1314 may include a proximity sensor configured to detect the presence of nearby objects without any physical contact. The sensor module 1314 may also include an optical sensor, such as a CMOS or CCD image sensor, for use in imaging applications. In some embodiments, the sensor module 1314 may also include an acceleration sensor, gyroscope sensor, magnetic sensor, pressure sensor or temperature sensor.

The communication component 1316 is configured to facilitate wired or wireless communication between the phonetic controller 1300 and other devices. The phonetic controller 1300 can access a wireless network based on a communication standard, such as WiFi, 2G or 3G, or a combination thereof. In one exemplary embodiment, the communication component 1316 receives a broadcast signal or broadcast-related information from an external broadcast management system via a broadcast channel. In one exemplary embodiment, the communication component 1316 further includes a near-field communication (NFC) module to facilitate short-range communication. For example, the NFC module may be implemented based on radio frequency identification (RFID) technology, Infrared Data Association (IrDA) technology, ultra-wideband (UWB) technology, Bluetooth (BT) technology and other technologies.

In an exemplary embodiment, the phonetic controller 1300 may be implemented by one or more application-specific integrated circuits (ASICs), digital signal processors (DSPs), digital signal processing devices (DSPDs), programmable logic devices (PLDs), field-programmable gate arrays (FPGAs), controllers, microcontrollers, microprocessors or other electronic components, for executing the above method.

In an exemplary embodiment, a non-transitory computer-readable storage medium including instructions is also provided, for example the memory 1304 including instructions, which can be executed by the processor 1320 of the phonetic controller 1300 to complete the above method. For example, the non-transitory computer-readable storage medium may be a ROM, random access memory (RAM), CD-ROM, magnetic tape, floppy disk, optical data storage device and so on.

Figure 14 is a block diagram of a phonetic controller according to an exemplary embodiment. For example, the phonetic controller 1400 may be provided as a server. The phonetic controller 1400 includes a processing component 1402, which further comprises one or more processors, and memory resources represented by a memory 1403 for storing instructions executable by the processing component 1402, such as application programs. The application programs stored in the memory 1403 may include one or more modules, each corresponding to a set of instructions. In addition, the processing component 1402 is configured to execute the instructions so as to perform the above method.

The phonetic controller 1400 may also include a power supply module 1406 configured to perform power management of the phonetic controller 1400, a wired or wireless network interface 1405 configured to connect the phonetic controller 1400 to a network, and an input/output (I/O) interface 1408. The phonetic controller 1400 can operate based on an operating system stored in the memory 1403, such as Windows Server™, Mac OS X™, Unix™, Linux™, FreeBSD™ or the like.

A non-transitory computer-readable storage medium: when the instructions in the storage medium are executed by the processor of the phonetic controller 1300 or the phonetic controller 1400, the phonetic controller 1300 or the phonetic controller 1400 is enabled to execute the following sound control method, the method including:

Phonetic order is analyzed, phonetic order recognition result is obtained；

Phonetic order recognition result is sent to cloud server.

In one embodiment, method further includes：

Obtain second user account；

In embodiment of the disclosure, a kind of computer readable storage medium is provided, computer instruction is stored thereon with, this refers to Following method is realized when order is executed by processor：

In one embodiment, operation instruction is sent to second terminal, including：

In one embodiment, method further includes：

Those skilled in the art will readily conceive of other embodiments of the disclosure after considering the specification and practicing the disclosure herein. This application is intended to cover any variations, uses or adaptations of the disclosure that follow its general principles and include common knowledge or conventional techniques in the art not disclosed herein. The description and examples are to be regarded as illustrative only, with the true scope and spirit of the disclosure indicated by the following claims.

It should be understood that the present disclosure is not limited to the precise structures that have been described above and shown in the drawings, and And various modifications and changes may be made without departing from the scope thereof.The scope of the present disclosure is only limited by the accompanying claims.

Claims

1. A sound control method, applied to a cloud server, characterized by comprising:

Receiving a phonetic order recognition result sent by a first terminal, the first terminal being associated with a first user account;

Performing semantic processing on the phonetic order recognition result to obtain operation information, the operation information including a second user account and operation content;

When the second user account and the first user account are in a friend relation, searching for a second terminal associated with the second user account;

Sending an operation instruction carrying the operation content to the second terminal, the operation instruction being used to instruct the second terminal to execute the operation content through an intended application, the intended application being an application program for voice control.

2. The method according to claim 1, characterized in that sending the operation instruction to the second terminal comprises:

Judging whether the first user account has permission to control the second terminal to execute the operation content through the intended application;

When it is judged that the first user account has permission to control the second terminal to execute the operation content through the intended application, sending the operation instruction to the second terminal.

3. The method according to claim 1, characterized in that the method further comprises:

Receiving a good friend's checking request, sent by the first terminal, carrying the first user account and the second user account;

Forwarding the good friend's checking request to the second terminal associated with the second user account, the good friend's checking request being used to request establishment of the friend relation between the second user account and the first user account.

4. The method according to claim 1, characterized in that the operation information further includes at least one of the following items or a combination thereof: the sending time of the operation content, or the execution time of the operation content.

5. The method according to claim 1, characterized in that the type of the operation content includes at least one of the following or a combination thereof: message content, voice mail content, backlog content added to the calendar, or backlog reminder content.

6. A sound control method, applied to a first terminal, characterized by comprising:

Receiving a phonetic order through an intended application, the intended application being an application program for voice control and the first terminal being associated with a first user account;

Analyzing the phonetic order to obtain a phonetic order recognition result;

Sending the phonetic order recognition result to a cloud server.

7. The method according to claim 6, characterized in that the method further comprises:

Obtaining a second user account;

Sending to the cloud server a good friend's checking request carrying the first user account and the second user account, the good friend's checking request being used to request establishment of the friend relation between the second user account and the first user account.

8. A sound control method, applied to a second terminal, characterized by comprising:

9. The method according to claim 8, characterized in that the method further comprises:

Receiving a good friend's checking request, sent by the cloud server, carrying the first user account and the second user account, the second user account being associated with the second terminal;

10. a kind of phonetic controller, which is characterized in that including：

First receiving module, the phonetic order recognition result for receiving first terminal transmission, the first terminal and first are used Family account relating；

Processing module obtains operation information, the operation information for carrying out semantic processes to the phonetic order recognition result Include second user account and operation content；

Searching module, for when the second user account and first user account are friend relation, search with it is described The second terminal of second user account relating；

First sending module, for sending the operation instruction for carrying the operation content to the second terminal, the operation refers to Show that be used to indicate the second terminal executes the operation content by intended application, the intended application is for voice control Application program.

11. device according to claim 10, which is characterized in that first sending module judges the first user account Number whether have and to control the permission that the second terminal executes the operation content by the intended application；When judging described the One user account has when controlling the second terminal and executing the permission of the operation content by the intended application, to described Second terminal sends operation instruction.

12. The device according to claim 10, characterized in that the device further comprises:

A second receiving module, configured to receive a friend verification request that is sent by the first terminal and carries the first user account and the second user account;

A forwarding module, configured to forward the friend verification request to the second terminal associated with the second user account, the friend verification request being used to request establishment of a friend relationship between the second user account and the first user account.

13. A voice control device, characterized by comprising:

A third receiving module, configured to receive a voice command through a target application, the target application being an application program for voice control, the first terminal being associated with a first user account;

14. A voice control device, characterized by comprising:

A fourth receiving module, configured to receive an operation instruction sent by a cloud server, the operation instruction comprising operation content;

An execution module, configured to execute the operation content through a target application, the target application being an application program for voice control.

15. A voice control device, characterized by comprising:

A processor;

A memory for storing processor-executable instructions;

Wherein the processor is configured to:

16. A voice control device, characterized by comprising:

A processor;

A memory for storing processor-executable instructions;

Wherein the processor is configured to:

Send the voice command recognition result to a cloud server.

17. A voice control device, characterized by comprising:

A processor;

A memory for storing processor-executable instructions;

Wherein the processor is configured to:

18. A computer-readable storage medium having computer instructions stored thereon, characterized in that the instructions, when executed by a processor, implement the steps of the method according to any one of claims 1-5.

19. A computer-readable storage medium having computer instructions stored thereon, characterized in that the instructions, when executed by a processor, implement the steps of the method according to claim 6 or 7.

20. A computer-readable storage medium having computer instructions stored thereon, characterized in that the instructions, when executed by a processor, implement the steps of the method according to claim 8 or 9.
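
For illustration only, the cloud-server flow recited in claims 10 and 11 (receive a recognition result, perform semantic processing, check the friend relationship, look up the second terminal, check permission, send the operation instruction) might be sketched as follows. This is a minimal sketch under stated assumptions, not the implementation of the disclosure: the names `CloudServer`, `OperationInfo`, `parse`, `handle` and `outbox`, the in-memory stores, and the toy "account: operation" parsing rule are all hypothetical and do not come from the claims.

```python
from dataclasses import dataclass, field


@dataclass
class OperationInfo:
    """Result of semantic processing: target account plus what to do."""
    second_user_account: str
    operation_content: str


@dataclass
class CloudServer:
    # Hypothetical in-memory stand-ins for the server's account data.
    friends: set = field(default_factory=set)      # frozenset({a, b}) pairs
    permissions: set = field(default_factory=set)  # (controller, owner) pairs
    terminals: dict = field(default_factory=dict)  # account -> terminal id
    outbox: list = field(default_factory=list)     # (terminal id, operation content)

    def parse(self, recognition_result: str) -> OperationInfo:
        # Toy stand-in for semantic processing (assumption):
        # expects text of the form "<account>: <operation>".
        account, _, content = recognition_result.partition(":")
        return OperationInfo(account.strip(), content.strip())

    def handle(self, first_user_account: str, recognition_result: str) -> bool:
        """Return True if an operation instruction was sent."""
        info = self.parse(recognition_result)

        # Searching module: proceed only when the accounts are friends.
        pair = frozenset({first_user_account, info.second_user_account})
        if pair not in self.friends:
            return False
        terminal = self.terminals.get(info.second_user_account)
        if terminal is None:
            return False

        # First sending module (claim 11): check permission before sending.
        if (first_user_account, info.second_user_account) not in self.permissions:
            return False

        # "Sending" is modeled here as appending to an outbox list.
        self.outbox.append((terminal, info.operation_content))
        return True
```

In this sketch, a caller would register a friend pair, a permission pair and a terminal for the second account, then call `handle` with the first user account and the recognized text. The call returns `False` and sends nothing if any of the three checks fails.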