CN109830233A

CN109830233A - Exchange method, device, storage medium and the terminal of voice assistant

Info

Publication number: CN109830233A
Application number: CN201910058048.0A
Authority: CN
Inventors: 郭子亮
Original assignee: Guangdong Oppo Mobile Telecommunications Corp Ltd
Current assignee: Guangdong Oppo Mobile Telecommunications Corp Ltd
Priority date: 2019-01-22
Filing date: 2019-01-22
Publication date: 2019-05-31

Abstract

The embodiment of the present application discloses exchange method, device, storage medium and the terminal of voice assistant.This method comprises: detecting that voice assistant function is triggered in the state that terminal shows the first interface；The display at first interface is kept, and enters voice messaging and obtains state；The first voice messaging is received, and first voice messaging is responded.The embodiment of the present application is by using above-mentioned technical proposal, after voice assistant function is triggered, in the state of keeping triggering front interface display, state is obtained into voice messaging, namely triggering front interface will not be covered, the influence to original application interface is avoided, voice assistant function is made preferably to combine together with terminal.

Description

Exchange method, device, storage medium and the terminal of voice assistant

Technical field

The invention relates to the exchange method of field of terminal technology more particularly to voice assistant, device, storage mediums And terminal.

Background technique

Speech recognition technology is that one kind allows machine that voice signal is changed into corresponding text by identification and understanding process Or the technology of order.In recent years, with the fast development of speech recognition technology, applied field is more and more extensive.Currently, Speech recognition technology has been successfully applied in various intelligent terminals, keeps the function of intelligent terminal more abundant.

Speech recognition technology is generally present in intelligent terminal in the form of voice assistant, and user can use voice assistant It is issued and is ordered to terminal by the way of natural language, and terminal can identify and understand to the natural language of user, in turn Corresponding operation is executed, is brought great convenience for user.In the related technology, the interaction schemes of voice assistant are still not complete enough It is kind, it needs to improve.

Summary of the invention

The embodiment of the present application provides exchange method, device, storage medium and the terminal of a kind of voice assistant, can optimize language The interaction schemes of sound assistant.

In a first aspect, the embodiment of the present application provides a kind of exchange method of voice assistant, comprising:

In the state that terminal shows the first interface, detect that voice assistant function is triggered；

The display at first interface is kept, and enters voice messaging and obtains state；

The first voice messaging is received, and first voice messaging is responded.

Second aspect, the embodiment of the present application provide a kind of interactive device of voice assistant, comprising:

Detection trigger module, in the state that terminal shows the first interface, whether detection voice assistant function to be touched Hair；

Status control module, for keeping the display at first interface when detecting that voice assistant function is triggered, And enters voice messaging and obtain state；

Voice messaging respond module for receiving the first voice messaging, and responds first voice messaging.

The third aspect, the embodiment of the present application provide a kind of computer readable storage medium, are stored thereon with computer journey Sequence realizes the exchange method of the voice assistant as described in the embodiment of the present application when the program is executed by processor.

Fourth aspect, the embodiment of the present application provide a kind of terminal, including memory, and processor and storage are on a memory And the computer program that can be run in processor, the processor realize such as the embodiment of the present application when executing the computer program The exchange method of the voice assistant.

The interaction schemes of the voice assistant provided in the embodiment of the present application, in the state that terminal shows the first interface, inspection It measures voice assistant function to be triggered, keeps the display at the first interface, and enter voice messaging and obtain state, receive the first voice Information, and first voice messaging is responded.By using above-mentioned technical proposal, after voice assistant function is triggered, In the state of keeping triggering front interface display, state is obtained into voice messaging, namely will not cover to triggering front interface Lid, avoids the influence to original application interface, voice assistant function is made preferably to combine together with terminal.

Detailed description of the invention

Fig. 1 is a kind of flow diagram of the exchange method of voice assistant provided by the embodiments of the present application；

Fig. 2 is the flow diagram of the exchange method of another voice assistant provided by the embodiments of the present application；

Fig. 3 is a kind of schematic diagram at the first interface provided by the embodiments of the present application；

Fig. 4 is a kind of first schematic diagram of voice Interaction Interface provided by the embodiments of the present application；

Fig. 5 is a kind of second schematic diagram of voice Interaction Interface provided by the embodiments of the present application；

Fig. 6 is a kind of third schematic diagram of voice Interaction Interface provided by the embodiments of the present application；

Fig. 7 is a kind of 4th schematic diagram of voice Interaction Interface provided by the embodiments of the present application；

Fig. 8 is the flow diagram of the exchange method of another voice assistant provided by the embodiments of the present application；

Fig. 9 is a kind of structural block diagram of the interactive device of voice assistant provided by the embodiments of the present application；

Figure 10 is a kind of structural schematic diagram of terminal provided by the embodiments of the present application；

Figure 11 is the structural schematic diagram of another terminal provided by the embodiments of the present application.

Specific embodiment

Further illustrate the technical solution of the application below with reference to the accompanying drawings and specific embodiments.It is understood that It is that specific embodiment described herein is used only for explaining the application, rather than the restriction to the application.It further needs exist for illustrating , part relevant to the application is illustrated only for ease of description, in attached drawing rather than entire infrastructure.

It should be mentioned that some exemplary embodiments are described as before exemplary embodiment is discussed in greater detail The processing or method described as flow chart.Although each step is described as the processing of sequence by flow chart, many of these Step can be implemented concurrently, concomitantly or simultaneously.In addition, the sequence of each step can be rearranged.When its operation The processing can be terminated when completion, it is also possible to have the additional step being not included in attached drawing.The processing can be with Corresponding to method, function, regulation, subroutine, subprogram etc..

Currently, being both provided with the sound collections component such as microphone in many terminals, sound collection component is recorded in addition to realizing Outside function, additionally it is possible to combined with speech recognition technology to realize voice assistant function.After terminal enters voice assistant function, The problem of user can be interacted with terminal using natural language, and terminal can answer user or phonetic order according to user Corresponding operation is executed, the human-computer interaction function of terminal is enriched, also brings great convenience for the use of user.Related skill In art, voice assistant is present in terminal as an application program (Application, APP), when voice assistant APP is beaten After opening, then it can enter the application interface of voice assistant APP, the application interface of voice assistant APP can cover user and use before The application interface of application program, the dominant perception affected for checking and operating to former application program, for user is new An APP and the corresponding function of the APP are increased, important system composition portion of the voice assistant as terminal can not be embodied well Point, it even more limits the imagination to more emerging interactive forms and explores space.And in the embodiment of the present application, to voice assistant function Interactive mode after capable of triggering is improved, and the influence to original application interface can be reduced.

Fig. 1 is a kind of flow diagram of the exchange method of voice assistant provided by the embodiments of the present application, and this method can be with It is executed by the interactive device of voice assistant, wherein the device can be implemented by software and/or hardware, and can generally integrate in the terminal. As shown in Figure 1, this method comprises:

Step 101, in the state that terminal shows the first interface, detect that voice assistant function is triggered.

Illustratively, the terminal in the embodiment of the present application may include the equipment such as mobile phone, tablet computer and computer.

Illustratively, the first interface can be any one interface in terminal, such as may include desktop, application program Interface, system set interface and status bar display interface etc..

Illustratively, user can be waken up using key wakeup, icon or the modes such as voice wakes up trigger voice assistant function Can, the embodiment of the present application is without limitation.Wherein, key wakeup for example may include the wake-up of long-pressing power key or the virtual homepage of long-pressing (home) key wake-up etc.；It may include that the icon for triggering voice assistant clicked on screen is waken up that icon, which wakes up for example, It may also include the icon (or option) clicked and shown in such as control centre, calendar or clock interface to wake up etc.；Voice is called out Waking up for example may include saying the trigger word comprising triggering voice assistant function to be waken up, such as " hi, small O " or " the small small O of O " etc..

Step 102, the display for keeping first interface, and enter voice messaging and obtain state.

Illustratively, the display for keeping the first interface may include keeping the first interface when voice assistant function is triggered Show that content is constant, i.e. the first interface is in static display state, will not change；Keeping the display at the first interface can also wrap It includes and keeps display strategy of first interface when voice assistant function is triggered constant.For example, the first interface is social category application The chat window interface of program, according to first way, then the conversation content shown in chat window will not change, when When receiving the new message content of other side, display not will do it；And according to the second way, then when being shown in chat window Conversation content be can real-time change can show according to original display strategy newly when receiving the new message content of other side Message content, such user can check new information in time.For another example, the first interface is video playing circle of video player Face, according to first way, then the broadcasting content shown in video playing interface will not change, and be equivalent to pause and play State；And according to the second way, then the broadcasting content shown in video playing interface can constantly change, and be equivalent to and hold Continuous broadcast state.In the embodiment of the present application, any one of the above mode can be used, can be specifically arranged automatically by system or by user According to personal habits self-setting, and before both of which can allow for user to continue to check that voice assistant function is triggered Interface can reduce the influence to interface display before for the relevant technologies, provide a kind of completely new interactive voice side Formula.

In the embodiment of the present application, while keeping the display at the first interface, state, Ye Jiyu are obtained into voice messaging Sound assistant is in audition state, such as opens microphone sound collection component and acquire ambient sound data, at this point, user can adopt Interactive voice is carried out with the mode and terminal for saying natural language.

Optionally, state is obtained in order to remind user terminal to be currently at voice messaging, it can be on the first interface basis The mark such as Overlapping display such as suspension ball；User can also be reminded by voice mode, such as playing voice, " it is a little assorted that you need me to do ? "；Change can also be made to the display mode at the first interface, such as shows color border in the first interface boundary position.

Step 103 receives the first voice messaging, and responds to first voice messaging.

Illustratively, for terminal after entering voice messaging acquisition state, the sound collections component such as microphone acquires ambient sound Then sound data extract voice messaging from ambient sound data, as the first voice messaging.

Illustratively, the event that voice assistant function can be completed is more and more abundant, and user can pass through voice mode control Terminal processed helps oneself to complete various operations, to reduce the manual operation of oneself, or even liberates the both hands of oneself.For example, language Sound assistant can help user to complete various defaults, such as adjust screen intensity, setting alarm clock and addition memorandum Deng；It may also help in user to be automatically brought into operation system application or third-party application etc., such as make a phone call, send short messages, in social activity Message is sent out in or gives bonus and open application program listens to song etc.；It may also help in the various information of user query, such as Inquire Weather information, enquiry navigation route and search pictures etc.；It can also chat with user, as answered user's proposition Various problems.In the embodiment of the present application, semantics recognition can be carried out to the first voice messaging, identify that the operation of user is intended to, and It is responded accordingly according to the operation intention identified, such as help user completes relevant to be automatically brought into operation or answer asking for user Topic etc. can also question closely user by modes such as voice or texts, when operation is intended to not clear enough further according to user's Answer proceeds to respond to.

The interaction schemes of the voice assistant provided in the embodiment of the present application, by voice assistant originally with movable (acticity) The mode of realization is changed to service (service) and realizes, becomes the function in operating system, rather than APP, shows the in terminal It in the state of one interface, detects that voice assistant function is triggered, keeps the display at the first interface, and enter voice messaging and obtain State receives the first voice messaging, and responds to first voice messaging.By using above-mentioned technical proposal, voice After assistant's function is triggered, in the state of keeping triggering front interface display, state is obtained into voice messaging, namely will not be right Triggering front interface is covered, and avoids the influence to original application interface, melting voice assistant function preferably with terminal is one Body.

In some embodiments, the display for keeping first interface, and enter voice messaging and obtain state, packet It includes: keeping the display at first interface, and suspend at first interface and show first identifier；When the first identifier is touched When hair, state is obtained into voice messaging.The advantages of this arrangement are as follows providing after entering voice assistant function into language Sound obtains the triggering mark of state, i.e. first identifier, that is to say, that user can be touched by way of triggering first identifier at any time It sends out voice messaging and obtains state, avoid voice assistant function by false triggering, terminal is also avoided to execute excessive voice collecting and knowledge Does not operate and increase power consumption.In addition, first identifier is shown on the first interface in a floating manner, to the display shadow at the first interface Ring very little.Optionally, first identifier is in removable state, for example, can pass through long-pressing first identifier and change by way of dragging The display position of first identifier, the content for avoiding first identifier from being concerned about the user in the first interface cause to block.Optionally, One mark can be shown that suspension ball can be translucent in the form of suspension ball, and triggering mode for example can be a little It hits.Optionally, ending sound detection can be carried out, the time that user pipes down is judged automatically out, namely identifies that the first voice is believed Breath receives, and can then stop sound collection, when user triggers first identifier again, is again introduced into voice messaging acquisition State saves power consumption.

In some embodiments, described in the display for keeping first interface, and enter voice messaging obtain state it Afterwards, comprising: when receiving the first operation for first interface, first operation is responded.It is arranged in this way It is advantageous in that, obtains state in voice messaging, operation of the user to former interface can be supported, including supporting to former application program Operation, can such as jump to the second contact surface in former application program.For example, the first interface is the chat window of social category application program Interface, user can continue to input chat content in the interface, chat with other side；For another example, the first interface is video player Video playing interface, user can control video playing in the interface, such as pause, F.F., retrogressing and adjusting Volume etc..

In some embodiments, the first voice messaging of the reception, and first voice messaging is responded, it wraps It includes: receiving the first voice messaging, and show the corresponding text information of first voice messaging in the form of suspended frame；With voice The form of casting and/or suspended frame responds first voice messaging.The advantages of this arrangement are as follows user is allowed to understand Whether the speech recognition of terminal is correct, is shown in the form of suspended frame to text information and response results, can be avoided To excessively blocking for the first interface, and a part that voice assistant is terminal system can be more embodied, rather than in the form of APP In the presence of.Optionally, it when being responded in the form of suspended frame to the first voice messaging, can be determined according to the first voice messaging Response mode.For example, when indicating the problem of user proposes terminal in the first voice messaging, it can be with display terminal in suspended frame The text information of answer；When the operation in the first voice messaging is intended to not clear enough, can be questioned closely in suspended frame with display terminal The problem of text information；When indicating that certain default operates in the first voice messaging, alarm clock is such as set, it can be in suspended frame Display indicates that the card of alarm clock has been arranged, such as may include time and the opening identification of alarm clock of alarm clock in card.

In some embodiments, described in the form of voice broadcast and/or suspended frame to first voice messaging into After row response, further includes: the suspended frame of current time of keeping at a distance nearest preset quantity be in display state, hangs to other Floating frame is hidden processing.The advantages of this arrangement are as follows being hidden processing to outmoded suspended frame in time, avoid to first Interface generates and excessively blocks.Preset quantity can be a fixed value, and such as 3；It is also possible to according to current interaction scenario dynamic Determining value such as talks with the quantity of corresponding suspended frame according to epicycle to determine, that is to say, that can be in time to last round of dialogue Do disappearance processing；It is also possible to determine that screen display mode includes vertical screen display and transverse screen according to the screen display mode of terminal It has been shown that, the corresponding preset quantity of vertical screen display can be greater than the corresponding preset quantity of transverse screen display, as vertical screen display is corresponding pre- If quantity is 3, and the corresponding preset quantity of transverse screen display is 1.

In some embodiments, while being responded in the form of voice broadcast to first voice messaging, also It include: display second identifier；When the second identifier is triggered, stop voice broadcast, and enters voice messaging and obtain state； The second voice messaging is received, and second voice messaging is responded.The advantages of this arrangement are as follows triggering can be passed through The mode of second identifier interrupts current voice broadcast, and the voice messaging for entering next round inputs, and promotes interactive efficiency.Work as end When the expection of the response results and user of holding feedback is not inconsistent or user thinks to continue to listen to down, voice can be terminated Casting, if still there is the content that do not feed back at this time, can also abandon together, rapidly enter next round interaction.

In some embodiments, it when meeting any one following situation, exits the voice assistant function: detecting screen The first predeterminable area on curtain is touched；And it after entering voice messaging acquisition state, is not received in the first preset time Voice messaging；And after responding to first voice messaging, the behaviour of user is not received in the second preset time Make.The advantages of this arrangement are as follows can carry out exiting processing in time when voice assistant function is not needed temporarily by user, It avoids generating excessive interference to former application program, while saving power consumption.Wherein, the first predeterminable area for example can be on screen Fixed area, such as upper left corner area；The white space etc. being also possible in suspended frame.First preset time and the second preset time It can be configured, can be the same or different according to the actual situation, such as can be 3 seconds.

In some embodiments, the corresponding text information of first voice messaging is shown in the form of suspended frame described While, further includes: third mark is shown in the second predeterminable area corresponding with the text information；When the third identifies quilt When triggering, into the editing mode of the text information；The text information is updated according to the edit operation received. Correspondingly, described respond first voice messaging in the form of voice broadcast and/or suspended frame, comprising: with voice The form of casting and/or suspended frame responds updated text information.The advantages of this arrangement are as follows receiving first It after voice messaging, will not directly be responded, but show the corresponding text information of the first voice messaging in the form of suspended frame, And editing machine meeting is provided for user, the maloperation for avoiding the speech recognition of mistake from generating, also the voice again of avoidable user is defeated Enter, improves interactive efficiency.When user has found that text information is not consistent with oneself word, or be found that while to be consistent but there may be Ambiguity or ambiguous situation, to the text information with regard to edlin, can be convenient for terminal by way of triggering third mark The true operation for accurately identifying user is intended to.Second predeterminable area for example can be the end region of text information.

Fig. 2 is the flow diagram of the exchange method of another voice assistant provided by the embodiments of the present application, this method packet Include following steps:

Step 201, in the state that terminal shows the first interface, detect that voice assistant function is triggered.

Illustratively, it is illustrated so that the first interface is the chat interface in social category application program as an example.Fig. 3 is this Shen Please embodiment provide first interface of one kind schematic diagram, as shown in figure 3, user use social category application program and good friend it is small Red chat, in chat process, good friend makes an appointment 7 points of tomorrow morning with user and park is gone to run, user in order to remind oneself in time It gets up, it is desirable to a fixed tomorrow morning of 6 points of 30 minutes alarm clocks, at this point it is possible to trigger voice assistant function.For example, user passes through The mode that voice wakes up, says " the small small O of O ", then terminal can detect that voice assistant function is triggered.

Illustratively, when voice assistant function is triggered, corresponding prompt tone can be played, to prompt user speech to help Hand function has been triggered.

Step 202, the display for keeping the first interface, and enter voice messaging and obtain state.

Illustratively, the display for keeping the first interface may include keeping the first interface when voice assistant function is triggered Display strategy is constant.That is, the conversation content shown in the chat interface be can real-time change, when receive good friend send it is new When message, new message content can be shown according to original display strategy, such user can check new information in time.For example, small It is red after distributing message " " at 7 points, and send new information " in park entrance set " and at this moment can also be shown on the first interface Show this new information.While keeping chat interface normally to show, state is obtained into voice messaging.For example, in order to remind User terminal has currently entered voice messaging and has obtained state, and voice assistant can also prompt user, such as plays voice " you What need I does? ".

Optionally, this step may be: keep the display at the first interface, and the first mark of display that suspends on the first interface Know；When first identifier is triggered, state is obtained into voice messaging.Fig. 4 is that a kind of voice provided by the embodiments of the present application is handed over First schematic diagram at mutual interface, illustratively, first identifier is suspension ball 401, as shown in figure 4, showing in bottom of screen outstanding Floating ball 401, user can trigger suspension ball 401 in a manner of click.

Step 203, the first voice messaging for receiving user's input, and the first voice messaging pair is shown in the form of suspended frame The text information answered.

Illustratively, user says " creating an alarm clock ", after terminal receives the voice messaging, carries out to voice messaging Identification, and will identify that the text information come is shown in the form of suspended frame, such as the first suspended frame 402 in Fig. 4.It is optional , editor's mark 403 can be shown at text information end, it, can be to text information when user triggers editor mark 403 It is edited.

Optionally, the corresponding text information of the first voice messaging is shown in the form of suspended frame, it may include: to the first interface In display content identified, determine target area, show the first voice messaging pair in the form of suspended frame in target area The text information answered.Wherein, target area can be the white space in the first interface, be also possible to assess using preset algorithm Out to the first the smallest region of interface coverage extent.

Step 204 responds first voice messaging in the form of voice broadcast and/or suspended frame.

Illustratively, terminal can carry out semantics recognition to the first voice messaging, identify that the operation of user is intended to.Work as behaviour When work is intended to not clear enough, user can be questioned closely.Voice assistant is known that user wants creation by semantics recognition One alarm clock, but specific alarm time is not known, therefore can be questioned closely to user.Fig. 5 mentions for the embodiment of the present application Second schematic diagram of a kind of voice Interaction Interface supplied, as shown in figure 5, being questioned closely " what to user in the form of suspended frame Time? ", such as the second suspended frame 501, at the same time it can also voice broadcast mode play voice " when ".In addition, Social category application program receives small red new information " in park entrance set " at this time, can also show that this is new in the first interface Message.Fig. 6 is a kind of third schematic diagram of voice Interaction Interface provided by the embodiments of the present application, and user questions closely voice assistant It is answered, says " at 4 points in afternoon ", then will appear new suspended frame on screen, such as third suspended frame 601.In voice assistant After getting complete information, the good alarm clock of user setting is helped, and the first voice is believed in the form of voice broadcast and suspended frame Breath is responded.Fig. 7 is a kind of 4th schematic diagram of voice Interaction Interface provided by the embodiments of the present application, as shown in fig. 7, voice With third suspended frame 701, to user feedback, " good, alarm clock has been arranged to 16:00 " assistant, and intuitively with the 4th suspended frame 702 Feed back the setting result of alarm clock.

The suspended frame of step 205, current time of keeping at a distance nearest preset quantity is in display state, to other suspensions Frame is hidden processing.

As shown in fig. 7, the interaction due to early period has been completed, necessity of display is not continued, it is possible to retain nearest 3 suspended frames, and processing is hidden to suspended frame before, to save display space, avoids generating the first interface More blocks.

Step 206 is receiving when operating for the first of the first interface of user's input, rings to the first operation It answers.

Illustratively, as shown in figure 5, after user sees the small red new information sent, which can be replied, Namely in chat interface, return information " good " can be inputted.Specifically, user can click Text Entry, key is recalled Disk, and input " good ", then " good " appears in Text Entry, and after then user confirms transmission, which will be sent out It gives small red.

Step 207, detect meet voice assistant function exit criteria when, exit voice assistant function.

Wherein, voice assistant function exit criteria may include such as: detect that the first predeterminable area on screen is touched；? Into after voice messaging acquisition state, voice messaging is not received in the first preset time；Or to first voice messaging After being responded, the operation of user is not received in the second preset time.

The exchange method of voice assistant provided by the embodiments of the present application, detects in the state that terminal shows the first interface Voice assistant function is triggered, and keeps display strategy of first interface when voice assistant function is triggered constant, and enter language Sound acquisition of information state receives the first voice messaging of user's input, and corresponding text information is shown in the form of suspended frame, The first voice messaging respond and be in time hidden outmoded suspended frame in the form of voice broadcast and/or suspended frame Processing also supports the operation to the first interface in the process.By using above-mentioned technical proposal, voice assistant function is touched After hair, the interface before triggering will not be covered, avoid the influence to original application interface, while will not answer for original It is had an impact with the operation at interface, voice assistant function is made preferably to combine together with terminal, promote interactive efficiency.

Fig. 8 is the flow diagram of the exchange method of another voice assistant provided by the embodiments of the present application, this method packet It includes:

Step 801, in the state that terminal shows the first interface, receive suspension ball recalls instruction.

Illustratively, what user can input suspension ball in many ways recalls instruction, such as passes through voice wake-up mode tune Suspension ball out, intelligent panel can also be pulled out by long-pressing terminal bottom, and (can be regarded as configuring in terminal integrates various intelligent function Can control interface) mode recall suspension ball, suspension ball can also be recalled from control centre, pass through triggering calendar in microphone mark Microphone in knowledge or clock identifies to recall suspension ball etc..Wherein, suspension ball is presented with translucent.

Step 802 keeps the display strategy at the first interface constant, and pops up suspension ball.

Optionally, the display position of suspension ball can be fixed, can also be by showing the layout of content in the first interface Dynamic decision, can also be according to screen display mode (transverse screen or vertical screen).For example, suspension ball can appear under transverse screen mode On the right side of screen；Under vertical screen mode, suspension ball can appear in bottom of screen.

Step 803, when suspension ball is clicked, into audition state.

Step 804 receives the first voice messaging, and the corresponding text letter of the first voice messaging is shown in the form of suspended frame Breath.

Step 805, on the right side of text information, display editor is identified, when editor's mark is triggered, into text information Editing mode, and text information is updated according to the edit operation of user.

Step 806 responds text information in the form of voice broadcast and suspended frame.

Step 807, during voice broadcast, detect that microphone mark stops voice broadcast when being clicked, and again Into audition state, the second voice messaging is received, and the second voice messaging is responded.

Optionally, during being responded to text information or voice messaging, may relate to wheel interact, keep away from The suspended frame of the preset quantity nearest from current time is in display state, is hidden processing to other suspended frames.

Optionally, during executing step 803 to step 807, it may also include that and receiving being directed to for user's input When first operation at the first interface, the first operation is responded.That is, will not influence during executing these steps Operation of the user to the first interface.

If step 808, the operation for not receiving user within a preset time, exit voice assistant function, to suspension ball And suspended frame carries out disappearance processing.

The exchange method of voice assistant provided by the embodiments of the present application can lead in the state that terminal shows arbitrary interface It crosses the mode whether detection suspension ball is transferred out and determines whether voice assistant function is triggered, if being triggered, keep the first boundary The display strategy in face is constant, and pops up suspension ball, and enters audition state, receives the voice messaging of user's input, and to suspend The form of frame shows corresponding text information, is responded in the form of voice broadcast and suspended frame to the first voice messaging, During voice broadcast, user can interrupt voice broadcast at any time and reenter audition state, can promote interactive voice effect Rate during this, also supports the operation to the first interface, and voice assistant function is made to become a part of system function, rather than with The form of APP exists, and when terminal judges that user no longer needs voice assistant function, voice assistant function again can be in time It automatically exits from, keeps voice assistant more intelligent.

Fig. 9 is a kind of structural block diagram of the interactive device of voice assistant provided by the embodiments of the present application, which can be by soft Part and/or hardware realization, are typically integrated in terminal, can be helped by executing the exchange method of voice assistant come controlling terminal voice The human-computer interaction of hand.As shown in figure 9, the device includes:

Detection trigger module 901, in the state that terminal show the first interface, detect voice assistant function whether by Triggering；

Status control module 902, for when detecting that voice assistant function is triggered, keeping the aobvious of first interface Show, and enters voice messaging and obtain state；

Voice messaging respond module 903 for receiving the first voice messaging, and rings first voice messaging It answers.

The interactive device of the voice assistant provided in the embodiment of the present application, in the state that terminal shows the first interface, inspection It measures voice assistant function to be triggered, keeps the display at the first interface, and enter voice messaging and obtain state, receive the first voice Information, and first voice messaging is responded.By using above-mentioned technical proposal, after voice assistant function is triggered, In the state of keeping triggering front interface display, state is obtained into voice messaging, namely will not cover to triggering front interface Lid, avoids the influence to original application interface, voice assistant function is made preferably to combine together with terminal.

Optionally, the display for keeping first interface, and enter voice messaging and obtain state, comprising:

The display at first interface is kept, and the display first identifier that suspends on first interface；

When the first identifier is triggered, state is obtained into voice messaging.

Optionally, the device further include:

Respond module is operated, for the display at holding first interface, and is entered after voice messaging acquisition state, When receiving the first operation for first interface, first operation is responded.

Optionally, the first voice messaging of the reception, and first voice messaging is responded, comprising:

The first voice messaging is received, and shows the corresponding text information of first voice messaging in the form of suspended frame；

First voice messaging is responded in the form of voice broadcast and/or suspended frame.

Optionally, the device further include:

Hide processing module, for it is described in the form of voice broadcast and/or suspended frame to first voice messaging After being responded, the suspended frame of current time of keeping at a distance nearest preset quantity is in display state, to other suspended frames It is hidden processing.

Optionally, the device further include:

Control module is broadcasted, for while being responded in the form of voice broadcast to first voice messaging, Show second identifier；When the second identifier is triggered, stop voice broadcast, and enters voice messaging and obtain state；

The voice messaging respond module is also used to: being received the second voice messaging, and is carried out to second voice messaging Response.

Optionally, the device further include:

Voice assistant exits module, for exiting the voice assistant function when meeting any one following situation:

Detect that the first predeterminable area on screen is touched；And

After entering voice messaging acquisition state, voice messaging is not received in the first preset time；And

After responding to first voice messaging, the operation of user is not received in the second preset time.

The embodiment of the present application also provides a kind of storage medium comprising computer executable instructions, and the computer is executable It instructs when being executed by computer processor for executing the exchange method of voice assistant, this method comprises:

The first voice messaging is received, and first voice messaging is responded.

Storage medium --- any various types of memory devices or storage equipment.Term " storage medium " is intended to wrap It includes: install medium, such as CD-ROM, floppy disk or magnetic tape equipment；Computer system memory or random access memory, such as DRAM, DDRRAM, SRAM, EDORAM, Lan Basi (Rambus) RAM etc.；Nonvolatile memory, such as flash memory, magnetic medium (example Such as hard disk or optical storage)；Register or the memory component of other similar types etc..Storage medium can further include other types Memory or combinations thereof.In addition, storage medium can be located at program in the first computer system being wherein performed, or It can be located in different second computer systems, second computer system is connected to the first meter by network (such as internet) Calculation machine system.Second computer system can provide program instruction to the first computer for executing.Term " storage medium " can To include two or more that may reside in different location (such as in the different computer systems by network connection) Storage medium.Storage medium can store the program instruction that can be performed by one or more processors and (such as be implemented as counting Calculation machine program).

Certainly, a kind of storage medium comprising computer executable instructions, computer provided by the embodiment of the present application The interactive operation for the voice assistant that executable instruction is not limited to the described above can also be performed the application any embodiment and be provided Voice assistant exchange method in relevant operation.

The embodiment of the present application provides a kind of terminal, and voice assistant provided by the embodiments of the present application can be integrated in the terminal Interactive device.Figure 10 is a kind of structural schematic diagram of terminal provided by the embodiments of the present application.Terminal 1000 may include: memory 1001, processor 1002 and the computer program that is stored on memory 1001 and can be run in processor, the processor 1002 realize the exchange method of the voice assistant as described in the embodiment of the present application when executing the computer program.

Terminal provided by the embodiments of the present application, after voice assistant function is triggered, in the shape for keeping triggering front interface to show Under state, state is obtained into voice messaging, namely will not cover to triggering front interface, avoid the shadow to original application interface It rings, voice assistant function is made preferably to combine together with terminal.

Figure 11 is the structural schematic diagram of another terminal provided by the embodiments of the present application, which may include: shell (figure In be not shown), memory 1101, central processing unit (central processing unit, CPU) 1102 (also known as processor, Hereinafter referred to as CPU), circuit board (not shown) and power circuit (not shown).The circuit board is placed in the shell The space interior that body surrounds；The CPU1102 and the memory 1101 are arranged on the circuit board；The power circuit, For each circuit or the device power supply for the terminal；The memory 1101, for storing executable program code；It is described CPU1102 is run and the executable program code by reading the executable program code stored in the memory 1101 Corresponding computer program, to perform the steps of

The first voice messaging is received, and first voice messaging is responded.

The terminal further include: Peripheral Interface 1103, RF (Radio Frequency, radio frequency) circuit 1105, voicefrequency circuit 1106, loudspeaker 1111, power management chip 1108, input/output (I/O) subsystem 1109, other input/control devicess 1110, touch screen 1112, other input/control devicess 1110 and outside port 1104, these components pass through one or more Communication bus or signal wire 1107 communicate.

It should be understood that graphic terminal 1100 is only an example of terminal, and terminal 1100 can have ratio Shown in the drawings more or less component, can combine two or more components, or can have different Component configuration.Various parts shown in the drawings can include that one or more signal processings and/or specific integrated circuit exist It is realized in the combination of interior hardware, software or hardware and software.

Just the terminal provided in this embodiment for interactive voice is described in detail below, which is with mobile phone Example.

Memory 1101, the memory 1101 can be accessed by CPU1102, Peripheral Interface 1103 etc., the memory 1101 may include high-speed random access memory, can also include nonvolatile memory, such as one or more disks are deposited Memory device, flush memory device or other volatile solid-state parts.

The peripheral hardware that outputs and inputs of equipment can be connected to CPU1102 by Peripheral Interface 1103, the Peripheral Interface 1103 With memory 1101.

I/O subsystem 1109, the I/O subsystem 1109 can be by the input/output peripherals in equipment, such as touch screen 1112 and other input/control devicess 1110, it is connected to Peripheral Interface 1103.I/O subsystem 1109 may include display control Device 11091 and one or more input controllers 11092 for controlling other input/control devicess 1110.Wherein, one or Multiple input controllers 11092 from other input/control devicess 1110 receive electric signal or to other input/control devicess 1110 send electric signal, other input/control devicess 1110 may include physical button (push button, rocker buttons etc.), dial Dialer, control stick, clicks idler wheel at slide switch.It is worth noting that input controller 11092 can be with any one following company It connects: the indicating equipment of keyboard, infrared port, USB interface and such as mouse.

Touch screen 1112, the touch screen 1112 are the input interface and output interface between user terminal and user, will Visual output is shown to user, and visual output may include figure, text, icon, video etc..

Display controller 11091 in I/O subsystem 1109 from touch screen 1112 receives electric signal or to touch screen 1112 send electric signal.Touch screen 1112 detects the contact on touch screen, the contact conversion that display controller 11091 will test For the interaction with the user interface object being shown on touch screen 1112, i.e. realization human-computer interaction is shown on touch screen 1112 User interface object can be the icon of running game, the icon for being networked to corresponding network etc..It is worth noting that equipment is also It may include light mouse, light mouse is not show the touch sensitive surface visually exported, or sensitive by the touch that touch screen is formed The extension on surface.

RF circuit 1105 is mainly used for establishing the communication of mobile phone Yu wireless network (i.e. network side), realizes mobile phone and wireless The data receiver of network and transmission.Such as transmitting-receiving short message, Email etc..Specifically, RF circuit 1105 receives and sends RF Signal, RF signal are also referred to as electromagnetic signal, and RF circuit 1105 converts electrical signals to electromagnetic signal or is converted to electromagnetic signal Electric signal, and communicated by the electromagnetic signal with communication network and other equipment.RF circuit 1105 may include using In the known circuit for executing these functions comprising but be not limited to antenna system, RF transceiver, one or more amplifiers, adjust Humorous device, one or more oscillators, digital signal processor, CODEC (COder-DECoder, coder) chipset, user Mark module (Subscriber Identity Module, SIM) etc..

Voicefrequency circuit 1106 is mainly used for receiving audio data from Peripheral Interface 1103, which is converted to electricity Signal, and the electric signal is sent to loudspeaker 1111.

Loudspeaker 1111 is reduced to sound for mobile phone to be passed through RF circuit 1105 from the received voice signal of wireless network Sound simultaneously plays the sound to user.

Power management chip 1108, the hardware for being connected by CPU1102, I/O subsystem and Peripheral Interface are supplied Electricity and power management.

It is arbitrarily real that the application can be performed in interactive device, storage medium and the terminal of the voice assistant provided in above-described embodiment The exchange method for applying voice assistant provided by example has and executes the corresponding functional module of this method and beneficial effect.Not upper The technical detail of detailed description in embodiment is stated, reference can be made to the interaction side of voice assistant provided by the application any embodiment Method.

Note that above are only the preferred embodiment and institute's application technology principle of the application.It will be appreciated by those skilled in the art that The application is not limited to specific embodiment described here, be able to carry out for a person skilled in the art it is various it is apparent variation, The protection scope readjusted and substituted without departing from the application.Therefore, although being carried out by above embodiments to the application It is described in further detail, but the application is not limited only to above embodiments, in the case where not departing from the application design, also It may include more other equivalent embodiments, and scope of the present application is determined by the scope of the appended claims.

Claims

1. a kind of exchange method of voice assistant characterized by comprising

The first voice messaging is received, and first voice messaging is responded.

2. the method according to claim 1, wherein the display for keeping first interface, and entering language Sound acquisition of information state, comprising:

When the first identifier is triggered, state is obtained into voice messaging.

3. the method according to claim 1, wherein in the display for keeping first interface, and entering After voice messaging acquisition state, comprising:

When receiving the first operation for first interface, first operation is responded.

4. the method according to claim 1, wherein the first voice messaging of the reception, and to first language Message breath is responded, comprising:

5. according to the method described in claim 4, it is characterized in that, described in the form of voice broadcast and/or suspended frame pair After first voice messaging is responded, further includes:

The suspended frame for current time nearest preset quantity of keeping at a distance is in display state, is hidden place to other suspended frames Reason.

6. according to the method described in claim 4, it is characterized in that, in the form of voice broadcast to first voice messaging While response, further includes:

Show second identifier；

When the second identifier is triggered, stop voice broadcast, and enters voice messaging and obtain state；

The second voice messaging is received, and second voice messaging is responded.

7. -6 any method according to claim 1, which is characterized in that when meeting any one following situation, exit The voice assistant function:

Detect that the first predeterminable area on screen is touched；And

8. a kind of interactive device of voice assistant characterized by comprising

Detection trigger module, in the state that terminal shows the first interface, whether detection voice assistant function to be triggered；

Status control module, for keeping the display at first interface, going forward side by side when detecting that voice assistant function is triggered Enter voice messaging and obtains state；

9. a kind of computer readable storage medium, is stored thereon with computer program, which is characterized in that the program is held by processor The exchange method of the voice assistant as described in any in claim 1-7 is realized when row.

10. a kind of terminal, which is characterized in that including memory, processor and storage can be run on a memory and in processor Computer program, the processor realizes that voice as claimed in claim 1 helps when executing the computer program The exchange method of hand.