WO2021052263A1 - Voice assistant display method and device - Google Patents

Voice assistant display method and device

Info

Publication number
WO2021052263A1
Authority
WO
WIPO (PCT)
Prior art keywords
voice assistant
display
feedback
screen
instruction information
Prior art date
Application number
PCT/CN2020/114899
Other languages
English (en)
French (fr)
Inventor
宋平
杨之言
郑美洙
周煜啸
Original Assignee
华为技术有限公司
Priority date (The priority date is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the date listed.)
Filing date
Publication date
Application filed by 华为技术有限公司
Publication of WO2021052263A1

Classifications

    • G PHYSICS
    • G06 COMPUTING; CALCULATING OR COUNTING
    • G06F ELECTRIC DIGITAL DATA PROCESSING
    • G06F9/00 Arrangements for program control, e.g. control units
    • G06F9/06 Arrangements for program control, e.g. control units using stored programs, i.e. using an internal store of processing equipment to receive or retain programs
    • G06F9/44 Arrangements for executing specific programs
    • G06F9/451 Execution arrangements for user interfaces
    • G06F9/453 Help systems
    • G PHYSICS
    • G06 COMPUTING; CALCULATING OR COUNTING
    • G06F ELECTRIC DIGITAL DATA PROCESSING
    • G06F3/00 Input arrangements for transferring data to be processed into a form capable of being handled by the computer; Output arrangements for transferring data from processing unit to output unit, e.g. interface arrangements
    • G06F3/01 Input arrangements or combined input and output arrangements for interaction between user and computer
    • G06F3/048 Interaction techniques based on graphical user interfaces [GUI]
    • G06F3/0484 Interaction techniques based on graphical user interfaces [GUI] for the control of specific functions or operations, e.g. selecting or manipulating an object, an image or a displayed text element, setting a parameter value or selecting a range
    • G PHYSICS
    • G06 COMPUTING; CALCULATING OR COUNTING
    • G06F ELECTRIC DIGITAL DATA PROCESSING
    • G06F3/00 Input arrangements for transferring data to be processed into a form capable of being handled by the computer; Output arrangements for transferring data from processing unit to output unit, e.g. interface arrangements
    • G06F3/16 Sound input; Sound output
    • G06F3/167 Audio in a user interface, e.g. using voice commands for navigating, audio feedback
    • G PHYSICS
    • G10 MUSICAL INSTRUMENTS; ACOUSTICS
    • G10L SPEECH ANALYSIS TECHNIQUES OR SPEECH SYNTHESIS; SPEECH RECOGNITION; SPEECH OR VOICE PROCESSING TECHNIQUES; SPEECH OR AUDIO CODING OR DECODING
    • G10L15/00 Speech recognition
    • G10L15/22 Procedures used during a speech recognition process, e.g. man-machine dialogue
    • G PHYSICS
    • G10 MUSICAL INSTRUMENTS; ACOUSTICS
    • G10L SPEECH ANALYSIS TECHNIQUES OR SPEECH SYNTHESIS; SPEECH RECOGNITION; SPEECH OR VOICE PROCESSING TECHNIQUES; SPEECH OR AUDIO CODING OR DECODING
    • G10L15/00 Speech recognition
    • G10L15/22 Procedures used during a speech recognition process, e.g. man-machine dialogue
    • G10L2015/223 Execution procedure of a spoken command

Definitions

  • This application relates to the technical field of electronic equipment, and in particular to a voice assistant display method and device.
  • A voice assistant can interact intelligently with users through dialogue and instant question answering.
  • The voice assistant can also recognize the user's voice command and cause the electronic device to execute the event corresponding to that command. Taking a mobile phone as an example, if the voice assistant receives and recognizes the voice command "Call Mr. Li" entered by the user, the mobile phone can automatically call the contact Mr. Li.
  • the freeform multi-window technology is usually used to control the form of the voice assistant, so that the voice assistant is suspended in any position of the display interface to facilitate user operations.
  • the form of the voice assistant is relatively independent of the actual scene on the electronic device, which makes the user experience poor.
  • the present application provides a voice assistant display method and device, which define the display form of the voice assistant, so that the voice assistant can switch the corresponding form according to the change of the actual scene, so as to realize the system-level integration of the voice assistant and the electronic device and improve the user experience.
  • the present application provides a voice assistant display method, which is applied to an electronic device.
  • The display forms of the voice assistant include a half-screen state, a full-screen state, and a floating state.
  • The half-screen state means that the ratio of the voice assistant's display interface to the overall display interface of the electronic device is less than 1.
  • The full-screen state means that the ratio of the voice assistant's display interface to the overall display interface of the electronic device is 1.
  • The floating state means that the voice assistant is displayed floating on the current display interface of the electronic device.
  • the method includes: turning on the voice assistant, and displaying the voice assistant in the first display form.
  • the first display form is a default display form preset by the voice assistant. Determine the display form of the voice assistant according to the instruction information input to the voice assistant and the service indicated by the instruction information.
  • The present application provides a voice assistant display method. After the voice assistant is turned on, it is displayed in a preset default display form. The display form of the voice assistant is then determined according to the instruction information input to the voice assistant and the service indicated by the instruction information. In this way, the voice assistant can identify a change of the actual scene from the instruction information and switch to the corresponding form, so that the voice assistant and the system work together and system-level integration of the voice assistant and the mobile phone is realized.
  • the first display mode is a half-screen mode
  • the voice assistant is turned on
  • Determining the display form of the voice assistant specifically includes: if the instruction information lacks keywords, the display form of the voice assistant is the full-screen state. If the instruction information does not lack keywords, and the feedback form of the service indicated by the instruction information is text feedback or voice feedback, the display form of the voice assistant is the half-screen state. If the instruction information does not lack keywords, and the feedback form of the service indicated by the instruction information is card feedback, the display form of the voice assistant is the full-screen state. If the instruction information does not lack keywords, and the feedback form of the service indicated by the instruction information is split-screen feedback, the display form of the voice assistant is the floating state.
  • If the application involved in the service indicated by the instruction information is displayed in the form of a card on the display interface of the voice assistant, the feedback form of the service indicated by the instruction information is card feedback.
  • If the service indicated by the instruction information involves application interface switching, the feedback form of the service indicated by the instruction information is split-screen feedback.
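The four rules above amount to a small decision function. The following Python sketch is illustrative only: the names `Feedback`, `DisplayForm`, and `display_form_for` are hypothetical and do not appear in the application, and how "lacks keywords" is detected is left outside the sketch as a boolean input.

```python
from enum import Enum
from typing import Optional


class Feedback(Enum):
    """Feedback forms named in the text (identifiers are illustrative)."""
    TEXT = "text"
    VOICE = "voice"
    CARD = "card"
    SPLIT_SCREEN = "split-screen"


class DisplayForm(Enum):
    """Display forms of the voice assistant named in the text."""
    HALF_SCREEN = "half-screen"
    FULL_SCREEN = "full-screen"
    FLOATING = "floating"


def display_form_for(missing_keywords: bool,
                     feedback: Optional[Feedback]) -> DisplayForm:
    """Map one instruction to a display form using the four stated rules."""
    if missing_keywords:
        # Instruction lacks keywords: full-screen state.
        return DisplayForm.FULL_SCREEN
    if feedback in (Feedback.TEXT, Feedback.VOICE):
        return DisplayForm.HALF_SCREEN
    if feedback is Feedback.CARD:
        return DisplayForm.FULL_SCREEN
    if feedback is Feedback.SPLIT_SCREEN:
        return DisplayForm.FLOATING
    # Assumption: anything else is outside the rules and is rejected.
    raise ValueError("feedback form not covered by the described rules")
```

For example, `display_form_for(False, Feedback.CARD)` yields the full-screen state, while an instruction that lacks keywords yields the full-screen state regardless of feedback form.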
  • The half-screen state of the voice assistant further includes a small half-screen state and a large half-screen state.
  • The small half-screen state means that the ratio of the voice assistant's display interface to the overall display interface of the electronic device is less than 0.5, and the large half-screen state means that this ratio is greater than 0.5. The first display form is the small half-screen state.
  • Determining the display form of the voice assistant according to the instruction information input to the voice assistant and the service indicated by the instruction information includes: if the instruction information does not lack keywords, and the feedback form of the service indicated by the instruction information is text feedback or voice feedback, the display form of the voice assistant is the small half-screen state. If the instruction information does not lack keywords, and the feedback form of the service indicated by the instruction information is card feedback, the display form of the voice assistant is the large half-screen state.
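A minimal sketch of this refinement follows. It assumes that the card-feedback case refers to the large half-screen state (the form whose ratio exceeds 0.5) and that the instruction does not lack keywords; `HalfScreenForm` and `half_screen_variant` are hypothetical names introduced only for illustration.

```python
from enum import Enum


class HalfScreenForm(Enum):
    SMALL = "small half-screen"  # ratio to the overall display is below 0.5
    LARGE = "large half-screen"  # ratio to the overall display is above 0.5


def half_screen_variant(feedback: str) -> HalfScreenForm:
    """Pick the half-screen variant for an instruction with no missing keywords.

    `feedback` is one of "text", "voice", or "card" (labels are illustrative).
    """
    if feedback in ("text", "voice"):
        return HalfScreenForm.SMALL
    if feedback == "card":
        # Assumption: card feedback maps to the large half-screen state.
        return HalfScreenForm.LARGE
    raise ValueError(f"no half-screen rule stated for feedback {feedback!r}")
```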
  • The method further includes: after the voice assistant enters a sleep state, waking up the voice assistant and determining the display form of the voice assistant.
  • Waking up the voice assistant and determining its display form includes: if the display form of the voice assistant was the floating state when it entered the sleep state, the display form of the voice assistant after it is woken up is the first display form. If the display form was the half-screen state when the voice assistant entered the sleep state, the display form after waking up is the half-screen state. If the display form was the full-screen state when the voice assistant entered the sleep state, the display form after waking up is the full-screen state.
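The wake-up rule can be sketched as a function of the form the assistant had when it went to sleep. This is an illustrative sketch, not the claimed implementation: the function name, the string labels, and the default value of `first_display_form` are assumptions.

```python
def form_after_wake(form_at_sleep: str,
                    first_display_form: str = "half-screen") -> str:
    """Return the display form shown after the assistant is woken up.

    A floating assistant returns to the first (default) display form;
    half-screen and full-screen assistants resume in the same form.
    """
    if form_at_sleep == "floating":
        return first_display_form
    if form_at_sleep in ("half-screen", "full-screen"):
        return form_at_sleep
    raise ValueError(f"unknown display form {form_at_sleep!r}")
```

Passing `first_display_form` explicitly keeps the sketch usable for the variant in which the default form is the small half-screen state.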
  • The half-screen state of the voice assistant further includes a small half-screen state and a large half-screen state. The small half-screen state means that the ratio of the voice assistant's display interface to the overall display interface of the electronic device is less than 0.5; the large half-screen state means that this ratio is greater than 0.5; the first display form is the small half-screen state.
  • Waking up the voice assistant and determining its display form includes: if the display form was the small half-screen state when the voice assistant entered the sleep state, the display form after the voice assistant is woken up is the small half-screen state. If the display form was the large half-screen state when the voice assistant entered the sleep state, the display form after waking up is the large half-screen state.
  • After waking up the voice assistant and determining its display form, the method further includes: determining the new display form of the voice assistant according to new instruction information and the service indicated by the new instruction information.
  • Determining the new display form of the voice assistant specifically includes: if the display form of the voice assistant is the full-screen state and the feedback form of the service indicated by the new instruction information is text feedback, voice feedback, or card feedback, the new display form of the voice assistant is the full-screen state. If the display form of the voice assistant is the full-screen state and the feedback form of the service indicated by the new instruction information is split-screen feedback, the new display form of the voice assistant is the floating state. If the display form of the voice assistant is the half-screen state and the new instruction information lacks keywords, the new display form of the voice assistant is the full-screen state. If the display form of the voice assistant is the half-screen state, the new instruction information does not lack keywords, and the feedback form of the service indicated by the new instruction information is text feedback or voice feedback, the new display form of the voice assistant is the half-screen state. If the display form of the voice assistant is the half-screen state, the new instruction information does not lack keywords, and the feedback form of the service indicated by the new instruction information is card feedback, the new display form of the voice assistant is the full-screen state. If the display form of the voice assistant is the half-screen state, the new instruction information does not lack keywords, and the feedback form of the service indicated by the new instruction information is split-screen feedback, the new display form of the voice assistant is the floating state.
  • If the application involved in the service indicated by the new instruction information is displayed in the form of a card on the display interface of the voice assistant, the feedback form of the service indicated by the new instruction information is card feedback.
  • If the service indicated by the new instruction information involves application interface switching, the feedback form of the service indicated by the new instruction information is split-screen feedback.
  • The half-screen state of the voice assistant further includes a small half-screen state and a large half-screen state. The small half-screen state means that the ratio of the voice assistant's display interface to the overall display interface of the electronic device is less than 0.5, the large half-screen state means that this ratio is greater than 0.5, and the first display form is the small half-screen state.
  • Determining the new display form of the voice assistant includes: if the display form of the voice assistant is the small half-screen state, the new instruction information does not lack keywords, and the feedback form of the service indicated by the new instruction information is text feedback or voice feedback, the new display form of the voice assistant is the small half-screen state; if the display form of the voice assistant is the small half-screen state, the new instruction information does not lack keywords, and the feedback form of the service indicated by the new instruction information is card feedback, the new display form of the voice assistant is the large half-screen state.
  • an electronic device including: a processor, a memory, and a touch screen.
  • the memory and the touch screen are coupled to the processor.
  • the memory is used to store computer program codes.
  • the computer program codes include computer instructions.
  • The computer instructions cause the electronic device to perform the following operations: turning on the voice assistant and displaying the voice assistant in the first display form.
  • the first display form is a default display form preset by the voice assistant.
  • The display forms of the voice assistant include a half-screen state, a full-screen state, and a floating state.
  • The half-screen state means that the ratio of the voice assistant's display interface to the overall display interface of the electronic device is less than 1.
  • The full-screen state means that the ratio of the voice assistant's display interface to the overall display interface of the electronic device is 1.
  • The floating state means that the voice assistant is displayed floating on the current display interface of the electronic device.
  • The first display form is the half-screen state. When the processor reads the computer instructions from the memory, the electronic device further performs the following operations: after the voice assistant is turned on, the current task interface moves downward as a whole, and the voice assistant is displayed in the half-screen state in split screen with the current task interface.
  • When the processor reads the computer instructions from the memory, the electronic device further performs the following operations: if the instruction information lacks keywords, the display form of the voice assistant is the full-screen state. If the instruction information does not lack keywords, and the feedback form of the service indicated by the instruction information is text feedback or voice feedback, the display form of the voice assistant is the half-screen state. If the instruction information does not lack keywords, and the feedback form of the service indicated by the instruction information is card feedback, the display form of the voice assistant is the full-screen state. If the instruction information does not lack keywords, and the feedback form of the service indicated by the instruction information is split-screen feedback, the display form of the voice assistant is the floating state.
  • If the application involved in the service indicated by the instruction information is displayed in the form of a card on the display interface of the voice assistant, the feedback form of the service indicated by the instruction information is card feedback.
  • If the service indicated by the instruction information involves application interface switching, the feedback form of the service indicated by the instruction information is split-screen feedback.
  • The half-screen state of the voice assistant further includes a small half-screen state and a large half-screen state. The small half-screen state means that the ratio of the voice assistant's display interface to the overall display interface of the electronic device is less than 0.5; the large half-screen state means that this ratio is greater than 0.5; the first display form is the small half-screen state.
  • When the processor reads the computer instructions from the memory, the electronic device further performs the following operations: if the instruction information does not lack keywords, and the feedback form of the service indicated by the instruction information is text feedback or voice feedback, the display form of the voice assistant is the small half-screen state. If the instruction information does not lack keywords, and the feedback form of the service indicated by the instruction information is card feedback, the display form of the voice assistant is the large half-screen state.
  • When the processor reads the computer instructions from the memory, the electronic device further performs the following operations: after the voice assistant enters a sleep state, waking up the voice assistant and determining the display form of the voice assistant.
  • When the processor reads the computer instructions from the memory, the electronic device further performs the following operations: if the display form of the voice assistant was the floating state when it entered the sleep state, the display form of the voice assistant after it is woken up is the first display form. If the display form was the half-screen state when the voice assistant entered the sleep state, the display form after waking up is the half-screen state. If the display form was the full-screen state when the voice assistant entered the sleep state, the display form after waking up is the full-screen state.
  • The half-screen state of the voice assistant further includes a small half-screen state and a large half-screen state. The small half-screen state means that the ratio of the voice assistant's display interface to the overall display interface of the electronic device is less than 0.5; the large half-screen state means that this ratio is greater than 0.5; the first display form is the small half-screen state. When the processor reads the computer instructions from the memory, the electronic device further performs the following operations: if the display form was the small half-screen state when the voice assistant entered the sleep state, the display form after the voice assistant is woken up is the small half-screen state. If the display form was the large half-screen state when the voice assistant entered the sleep state, the display form after waking up is the large half-screen state.
  • When the processor reads the computer instructions from the memory, the electronic device further performs the following operations: determining the new display form of the voice assistant according to new instruction information and the service indicated by the new instruction information.
  • When the processor reads the computer instructions from the memory, the electronic device further performs the following operations: if the display form of the voice assistant is the full-screen state and the feedback form of the service indicated by the new instruction information is text feedback, voice feedback, or card feedback, the new display form of the voice assistant is the full-screen state. If the display form of the voice assistant is the full-screen state and the feedback form of the service indicated by the new instruction information is split-screen feedback, the new display form of the voice assistant is the floating state. If the display form of the voice assistant is the half-screen state and the new instruction information lacks keywords, the new display form of the voice assistant is the full-screen state. If the display form of the voice assistant is the half-screen state, the new instruction information does not lack keywords, and the feedback form of the service indicated by the new instruction information is text feedback or voice feedback, the new display form of the voice assistant is the half-screen state. If the display form of the voice assistant is the half-screen state, the new instruction information does not lack keywords, and the feedback form of the service indicated by the new instruction information is card feedback, the new display form of the voice assistant is the full-screen state. If the display form of the voice assistant is the half-screen state, the new instruction information does not lack keywords, and the feedback form of the service indicated by the new instruction information is split-screen feedback, the new display form of the voice assistant is the floating state.
  • If the application involved in the service indicated by the new instruction information is displayed in the form of a card on the display interface of the voice assistant, the feedback form of the service indicated by the new instruction information is card feedback.
  • If the service indicated by the new instruction information involves application interface switching, the feedback form of the service indicated by the new instruction information is split-screen feedback.
  • The half-screen state of the voice assistant further includes a small half-screen state and a large half-screen state. The small half-screen state means that the ratio of the voice assistant's display interface to the overall display interface of the electronic device is less than 0.5; the large half-screen state means that this ratio is greater than 0.5; the first display form is the small half-screen state. When the processor reads the computer instructions from the memory, the electronic device further performs the following operations: if the display form of the voice assistant is the small half-screen state, the new instruction information does not lack keywords, and the feedback form of the service indicated by the new instruction information is text feedback or voice feedback, the display form of the voice assistant is the small half-screen state; if the display form of the voice assistant is the small half-screen state, the new instruction information does not lack keywords, and the feedback form of the service indicated by the new instruction information is card feedback, the display form of the voice assistant is
  • a graphical user interface on an electronic device has a display screen, a camera, a memory, and one or more processors, and the one or more processors are configured to execute
  • the graphical user interface includes a graphical user interface displayed when the electronic device executes the method described in the above aspect and any one of its possible implementation manners.
  • In a fourth aspect, a device is provided. The device is included in an electronic device and has the function of realizing the behavior of the electronic device in any of the foregoing aspects and possible implementation manners.
  • This function can be realized by hardware, or by hardware executing corresponding software.
  • the hardware or software includes at least one module or unit corresponding to the above-mentioned functions. For example, a receiving module or unit, a display module or unit, and a sending module or unit, etc.
  • a computer storage medium including computer instructions, which when the computer instructions run on an electronic device, cause the electronic device to execute the voice assistant display method described in the above aspects and any one of its possible implementations.
  • a computer program product which when the computer program product runs on a computer, causes the computer to execute the voice assistant display method described in the above aspects and any one of the possible implementation manners.
  • a chip system including a processor, and when the processor executes an instruction, the processor executes the voice assistant display method described in the foregoing aspects and any one of the possible implementation manners.
  • FIG. 1 is a schematic structural diagram of an electronic device provided by an embodiment of this application.
  • FIG. 2 is a block diagram of the software structure of an electronic device provided by an embodiment of the application.
  • FIG. 3 is a schematic diagram of various forms of a voice assistant provided by an embodiment of this application.
  • FIG. 4 is a flowchart of a voice assistant display method provided by an embodiment of this application.
  • FIG. 5 is a first schematic diagram of the half-screen (H) display of a voice assistant provided by an embodiment of this application.
  • FIG. 6 is a second schematic diagram of the half-screen (H) display of a voice assistant provided by an embodiment of this application.
  • FIG. 7 is a third schematic diagram of the half-screen (H) display of a voice assistant provided by an embodiment of this application.
  • FIG. 8 is a fourth schematic diagram of the half-screen (L) display of a voice assistant provided by an embodiment of this application.
  • FIG. 9 is a schematic diagram of a mode switching rule of a voice assistant provided by an embodiment of this application.
  • FIG. 10 is a first schematic diagram of mode switching of a voice assistant provided by an embodiment of this application.
  • FIG. 11 is a second schematic diagram of mode switching of a voice assistant provided by an embodiment of this application.
  • FIG. 12 is a third schematic diagram of mode switching of a voice assistant provided by an embodiment of this application.
  • FIG. 13 is a fourth schematic diagram of mode switching of a voice assistant provided by an embodiment of this application.
  • FIG. 14 is a schematic structural diagram of a chip system provided by an embodiment of the application.
  • the embodiments of the present application provide a voice assistant display method and device, which can be applied to the voice assistant display on an electronic device.
  • The voice assistant may be an application (APP) installed in an electronic device.
  • The voice assistant may be an embedded application in the electronic device (that is, a system application of the electronic device) or a downloadable application.
  • An embedded application is an application provided as a part of the implementation of an electronic device (such as a mobile phone).
  • For example, the embedded application may be a "settings" application, a "short message" application, a "camera" application, and so on.
  • a downloadable application is an application that can provide its own Internet protocol multimedia subsystem (IMS) connection.
  • the downloadable application can be pre-installed in the electronic device or can be downloaded and installed by the user.
  • For example, a third-party application in the electronic device may be a "WeChat" application, an "Alipay" application, or a "mail" application.
  • The electronic devices in the embodiments of this application may be portable computers (such as mobile phones), notebook computers, personal computers (PC), tablet computers, wearable electronic devices (such as smart watches), smart home devices, artificial intelligence (AI) terminals (such as smart robots), augmented reality (AR)/virtual reality (VR) devices, on-board computers, etc.
  • the following embodiments do not specifically limit the specific form of the device.
  • FIG. 1 shows a schematic structural diagram of an electronic device 100 provided in this embodiment.
  • The electronic device 100 may include a processor 110, an external memory interface 120, an internal memory 121, a universal serial bus (USB) interface 130, a charging management module 140, a power management module 141, a battery 142, an antenna 1, an antenna 2, a mobile communication module 150, a wireless communication module 160, an audio module 170, a speaker 170A, a receiver 170B, a microphone 170C, an earphone jack 170D, a sensor module 180, a button 190, a motor 191, an indicator 192, a camera 193, a display 194, a subscriber identification module (SIM) card interface 195, etc.
  • The sensor module 180 may include a pressure sensor 180A, a gyroscope sensor 180B, an air pressure sensor 180C, a magnetic sensor 180D, an acceleration sensor 180E, a distance sensor 180F, a proximity sensor 180G, a fingerprint sensor 180H, a temperature sensor 180J, a touch sensor 180K, an ambient light sensor 180L, a bone conduction sensor 180M, etc.
  • the structure illustrated in this embodiment does not constitute a specific limitation on the electronic device 100.
  • the electronic device 100 may include more or fewer components than shown, or combine certain components, or split certain components, or arrange different components.
  • the illustrated components can be implemented in hardware, software, or a combination of software and hardware.
  • the processor 110 may include one or more processing units.
  • For example, the processor 110 may include an application processor (AP), a modem processor, a graphics processing unit (GPU), an image signal processor (ISP), a controller, a memory, a video codec, a digital signal processor (DSP), a baseband processor, and/or a neural-network processing unit (NPU), etc.
  • the different processing units may be independent devices or integrated in one or more processors.
  • the DSP can monitor the voice data in real time.
  • the voice data can be handed over to the AP.
  • the AP performs text verification and voiceprint verification on the voice data.
  • the electronic device can start the voice assistant.
  • after the voice assistant is awakened, it can be displayed on the interface of the electronic device in the form of a small half-screen (H1).
  • the small half-screen (H1) form of the voice assistant is shown in (b) of FIG. 3 and is described in detail below.
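  • the wake-up flow described above (the DSP monitors voice data, the AP performs text verification and voiceprint verification, and the voice assistant starts in the H1 form) could be sketched roughly as follows. This is a minimal sketch, not the method of this embodiment: the names `VoiceFrame`, `dsp_detects_candidate`, `ap_verify` and `handle_frame`, the wake word, and the similarity threshold are all hypothetical, and the text does not specify how either verification is implemented.

```python
from dataclasses import dataclass

WAKE_WORD = "hello assistant"   # hypothetical wake word
VOICEPRINT_THRESHOLD = 0.8      # hypothetical similarity threshold


@dataclass
class VoiceFrame:
    text: str                 # transcript of the monitored audio (assumed to be available)
    voiceprint_score: float   # similarity to the enrolled speaker, 0.0..1.0


def dsp_detects_candidate(frame: VoiceFrame) -> bool:
    """Low-power stage: cheap check that the frame may contain the wake word."""
    return WAKE_WORD.split()[0] in frame.text.lower()


def ap_verify(frame: VoiceFrame) -> bool:
    """AP stage: text verification followed by voiceprint verification."""
    text_ok = frame.text.lower().strip() == WAKE_WORD
    voice_ok = frame.voiceprint_score >= VOICEPRINT_THRESHOLD
    return text_ok and voice_ok


def handle_frame(frame: VoiceFrame) -> str:
    """Return the resulting voice assistant state for one monitored frame."""
    if not dsp_detects_candidate(frame):
        return "idle"   # the DSP keeps monitoring; the AP is not involved
    if not ap_verify(frame):
        return "idle"   # verification failed; the assistant is not started
    return "H1"         # the assistant starts in the small half-screen form
```

  • splitting the check into a cheap always-on stage and a stricter second stage mirrors the division of work between the DSP and the AP in the description.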
  • the controller may be the nerve center and command center of the electronic device 100.
  • the controller can generate operation control signals according to the instruction operation code and timing signals to complete the control of fetching and executing instructions.
  • a memory may also be provided in the processor 110 to store instructions and data.
  • the memory in the processor 110 is a cache memory.
  • the memory can store instructions or data that the processor 110 has just used or uses repeatedly. If the processor 110 needs the instructions or data again, it can call them directly from this memory. This avoids repeated accesses, reduces the waiting time of the processor 110, and improves system efficiency.
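  • the benefit of such a cache memory can be illustrated with a small software model. The sketch below is an assumption-laden illustration only: the class name `SmallCache`, the dictionary used as slower backing memory, and the least-recently-used eviction policy are not taken from this embodiment, which does not name a replacement policy.

```python
from collections import OrderedDict


class SmallCache:
    """Tiny LRU cache standing in for the memory inside the processor."""

    def __init__(self, capacity, backing_store):
        self.capacity = capacity
        self.backing_store = backing_store   # slower memory (a dict here)
        self.lines = OrderedDict()
        self.hits = 0
        self.misses = 0

    def read(self, address):
        if address in self.lines:
            self.hits += 1
            self.lines.move_to_end(address)  # mark as most recently used
            return self.lines[address]
        self.misses += 1                      # slow path: fetch from backing memory
        value = self.backing_store[address]
        self.lines[address] = value
        if len(self.lines) > self.capacity:
            self.lines.popitem(last=False)    # evict the least recently used entry
        return value
```

  • every hit is an access that did not have to go to the slower memory, which is the reduction in waiting time that the description refers to.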
  • the processor 110 may include one or more interfaces.
  • the interface can include an inter-integrated circuit (I2C) interface, an inter-integrated circuit sound (I2S) interface, a pulse code modulation (PCM) interface, a universal asynchronous receiver/transmitter (UART) interface, a mobile industry processor interface (MIPI), a general-purpose input/output (GPIO) interface, a subscriber identity module (SIM) interface, and/or a universal serial bus (USB) interface, etc.
  • the I2C interface is a bidirectional synchronous serial bus, which includes a serial data line (SDA) and a serial clock line (SCL).
  • the processor 110 may include multiple sets of I2C buses.
  • the processor 110 may couple the touch sensor 180K, the charger, the flash, the camera 193, etc., respectively through different I2C bus interfaces.
  • the processor 110 may couple the touch sensor 180K through an I2C interface, so that the processor 110 and the touch sensor 180K communicate through an I2C bus interface to implement the touch function of the electronic device 100.
  • the I2S interface can be used for audio communication.
  • the processor 110 may include multiple sets of I2S buses.
  • the processor 110 may be coupled with the audio module 170 through an I2S bus to implement communication between the processor 110 and the audio module 170.
  • the audio module 170 may transmit audio signals to the wireless communication module 160 through an I2S interface, so as to realize the function of answering calls through a Bluetooth headset.
  • the PCM interface can also be used for audio communication to sample, quantize and encode analog signals.
  • the audio module 170 and the wireless communication module 160 may be coupled through a PCM bus interface.
  • the audio module 170 may also transmit audio signals to the wireless communication module 160 through the PCM interface, so as to realize the function of answering calls through the Bluetooth headset. Both the I2S interface and the PCM interface can be used for audio communication.
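  • the three PCM steps named above (sampling, quantizing and encoding an analog signal) can be shown with a short example. This is a minimal sketch under stated assumptions: the function names `pcm_encode` and `tone`, the 8 kHz sample rate, the 1 kHz test tone, and the 8-bit signed uniform quantizer are illustrative choices, not parameters given in this embodiment.

```python
import math


def pcm_encode(signal, sample_rate, n_samples, bits=8):
    """Sample signal(t) (values in [-1, 1]), quantize, and encode as signed integers."""
    levels = 2 ** (bits - 1) - 1               # 127 for 8-bit signed PCM
    codes = []
    for n in range(n_samples):
        t = n / sample_rate                     # sampling instant in seconds
        x = max(-1.0, min(1.0, signal(t)))      # clip to the valid input range
        codes.append(round(x * levels))         # uniform quantization to an integer code
    return codes


def tone(t):
    """A 1 kHz sine wave used as the analog input."""
    return math.sin(2 * math.pi * 1000 * t)


# One full period of the tone at 8 kHz is eight samples.
samples = pcm_encode(tone, sample_rate=8000, n_samples=8)
```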
  • the UART interface is a universal serial data bus used for asynchronous communication.
  • the bus can be a two-way communication bus. It converts the data to be transmitted between serial communication and parallel communication.
  • the UART interface is generally used to connect the processor 110 and the wireless communication module 160.
  • the processor 110 communicates with the Bluetooth module in the wireless communication module 160 through the UART interface to realize the Bluetooth function.
  • the audio module 170 may transmit audio signals to the wireless communication module 160 through a UART interface, so as to realize the function of playing music through a Bluetooth headset.
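  • the conversion between parallel and serial data performed on a UART link can be sketched as follows. The frame layout used here (one start bit, eight data bits sent least significant bit first, no parity, one stop bit, commonly called 8N1) is an assumed configuration; this embodiment does not state one, and the function names are hypothetical.

```python
def uart_frame(byte):
    """Serialize one byte as an 8N1 frame: start bit, 8 data bits (LSB first), stop bit."""
    if not 0 <= byte <= 0xFF:
        raise ValueError("byte must be in 0..255")
    data_bits = [(byte >> i) & 1 for i in range(8)]   # parallel to serial, LSB first
    return [0] + data_bits + [1]                      # start bit is 0, stop bit is 1


def uart_unframe(bits):
    """Recover the byte from a 10-bit 8N1 frame (serial to parallel)."""
    if len(bits) != 10 or bits[0] != 0 or bits[9] != 1:
        raise ValueError("framing error")
    return sum(bit << i for i, bit in enumerate(bits[1:9]))
```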
  • the MIPI interface can be used to connect the processor 110 with the display screen 194, the camera 193 and other peripheral devices.
  • the MIPI interface includes a camera serial interface (camera serial interface, CSI), a display serial interface (display serial interface, DSI), and so on.
  • the processor 110 and the camera 193 communicate through a CSI interface to implement the shooting function of the electronic device 100.
  • the processor 110 and the display screen 194 communicate through a DSI interface to realize the display function of the electronic device 100.
  • the GPIO interface can be configured through software.
  • the GPIO interface can be configured as a control signal or as a data signal.
  • the GPIO interface can be used to connect the processor 110 with the camera 193, the display screen 194, the wireless communication module 160, the audio module 170, the sensor module 180, and so on.
  • the GPIO interface can also be configured as an I2C interface, I2S interface, UART interface, MIPI interface, etc.
  • the USB interface 130 is an interface that complies with the USB standard specification, and specifically may be a Mini USB interface, a Micro USB interface, a USB Type C interface, and so on.
  • the USB interface 130 can be used to connect a charger to charge the electronic device 100, and can also be used to transfer data between the electronic device 100 and peripheral devices. It can also be used to connect earphones and play audio through earphones. This interface can also be used to connect to other electronic devices, such as AR devices.
  • the interface connection relationship between the modules illustrated in this embodiment is merely a schematic description, and does not constitute a structural limitation of the electronic device 100.
  • the electronic device 100 may also adopt different interface connection modes in the above-mentioned embodiments, or a combination of multiple interface connection modes.
  • the charging management module 140 is used to receive charging input from the charger.
  • the charger can be a wireless charger or a wired charger.
  • the charging management module 140 may receive the charging input of the wired charger through the USB interface 130.
  • the charging management module 140 may receive the wireless charging input through the wireless charging coil of the electronic device 100. While the charging management module 140 charges the battery 142, it can also supply power to the electronic device through the power management module 141.
  • the power management module 141 is used to connect the battery 142, the charging management module 140 and the processor 110.
  • the power management module 141 receives input from the battery 142 and/or the charge management module 140, and supplies power to the processor 110, the internal memory 121, the external memory, the display screen 194, the camera 193, and the wireless communication module 160.
  • the power management module 141 can also be used to monitor parameters such as battery capacity, battery cycle times, and battery health status (leakage, impedance).
  • the power management module 141 may also be provided in the processor 110.
  • the power management module 141 and the charging management module 140 may also be provided in the same device.
  • the mobile communication module 150 can provide a wireless communication solution including 2G/3G/4G/5G and the like applied to the electronic device 100.
  • the mobile communication module 150 may include at least one filter, switch, power amplifier, low noise amplifier (LNA), etc.
  • the mobile communication module 150 can receive electromagnetic waves through the antenna 1, perform processing such as filtering and amplification on the received electromagnetic waves, and transmit them to the modem processor for demodulation.
  • the mobile communication module 150 can also amplify the signal modulated by the modem processor and convert it into electromagnetic waves that are radiated through the antenna 1.
  • at least part of the functional modules of the mobile communication module 150 may be provided in the processor 110.
  • at least part of the functional modules of the mobile communication module 150 and at least part of the modules of the processor 110 may be provided in the same device.
  • the modem processor may include a modulator and a demodulator.
  • the modulator is used to modulate the low-frequency baseband signal to be sent into a medium- or high-frequency signal.
  • the demodulator is used to demodulate the received electromagnetic wave signal into a low-frequency baseband signal. Then the demodulator transmits the demodulated low-frequency baseband signal to the baseband processor for processing. After the low-frequency baseband signal is processed by the baseband processor, it is passed to the application processor.
  • the application processor outputs a sound signal through an audio device (not limited to the speaker 170A, the receiver 170B, etc.), or displays an image or video through the display screen 194.
  • the modem processor may be an independent device. In other embodiments, the modem processor may be independent of the processor 110 and be provided in the same device as the mobile communication module 150 or other functional modules.
  • the wireless communication module 160 can provide wireless communication solutions applied to the electronic device 100, including wireless local area network (WLAN) (such as a wireless fidelity (Wi-Fi) network), Bluetooth (BT), global navigation satellite system (GNSS), frequency modulation (FM), near field communication (NFC), and infrared (IR) technology.
  • the wireless communication module 160 may be one or more devices integrating at least one communication processing module.
  • the wireless communication module 160 receives electromagnetic waves via the antenna 2, frequency modulates and filters the electromagnetic wave signals, and sends the processed signals to the processor 110.
  • the wireless communication module 160 may also receive the signal to be sent from the processor 110, perform frequency modulation, amplify it, and convert it into electromagnetic waves to radiate through the antenna 2.
  • the antenna 1 of the electronic device 100 is coupled with the mobile communication module 150, and the antenna 2 is coupled with the wireless communication module 160, so that the electronic device 100 can communicate with the network and other devices through wireless communication technology.
  • the wireless communication technology may include global system for mobile communications (GSM), general packet radio service (GPRS), code division multiple access (CDMA), wideband code division multiple access (WCDMA), time-division synchronous code division multiple access (TD-SCDMA), long term evolution (LTE), BT, GNSS, WLAN, NFC, FM, and/or IR technology, etc.
  • the GNSS may include the global positioning system (GPS), the global navigation satellite system (GLONASS), the Beidou navigation satellite system (BDS), the quasi-zenith satellite system (QZSS), and/or satellite-based augmentation systems (SBAS).
  • the electronic device 100 implements a display function through a GPU, a display screen 194, an application processor, and the like.
  • the GPU is a microprocessor for image processing, connected to the display 194 and the application processor.
  • the GPU is used to perform mathematical and geometric calculations for graphics rendering.
  • the processor 110 may include one or more GPUs, which execute program instructions to generate or change display information.
  • the display screen 194 is used to display images, videos, and the like.
  • the display screen 194 includes a display panel.
  • the display panel can adopt a liquid crystal display (LCD), an organic light-emitting diode (OLED), an active-matrix organic light-emitting diode (AMOLED), a flexible light-emitting diode (FLED), Mini-LED, Micro-LED, Micro-OLED, quantum dot light-emitting diodes (QLED), etc.
  • the electronic device 100 may include one or N display screens 194, and N is a positive integer greater than one.
  • the electronic device 100 can realize a shooting function through an ISP, a camera 193, a video codec, a GPU, a display screen 194, and an application processor.
  • the ISP is used to process the data fed back by the camera 193. For example, when a picture is taken, the shutter opens and light is transmitted through the lens to the photosensitive element of the camera, which converts the optical signal into an electrical signal and passes it to the ISP. The ISP processes the signal and converts it into an image visible to the naked eye.
  • the ISP can also optimize the noise, brightness, and skin color of the image, as well as parameters of the shooting scene such as exposure and color temperature.
  • the ISP may be provided in the camera 193.
  • the camera 193 is used to capture still images or videos.
  • an optical image of the object is generated through the lens and projected onto the photosensitive element.
  • the photosensitive element may be a charge coupled device (CCD) or a complementary metal-oxide-semiconductor (CMOS) phototransistor.
  • the photosensitive element converts the optical signal into an electrical signal, and then transfers the electrical signal to the ISP to convert it into a digital image signal.
  • the ISP outputs the digital image signals to the DSP for processing.
  • the DSP converts the digital image signals into image signals in standard formats such as RGB and YUV.
  • the electronic device 100 may include one or N cameras 193, and N is a positive integer greater than one.
  • Digital signal processors are used to process digital signals. In addition to digital image signals, they can also process other digital signals. For example, when the electronic device 100 selects a frequency point, the digital signal processor is used to perform Fourier transform on the energy of the frequency point.
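  • the frequency-point example above can be made concrete by computing the energy of a single discrete Fourier transform (DFT) bin. This is a minimal sketch, assuming a direct evaluation of one DFT coefficient; the function name `bin_energy` and the 64-sample test block are illustrative and are not taken from this embodiment, which does not say how the transform is computed.

```python
import cmath
import math


def bin_energy(samples, k):
    """Energy |X[k]|^2 of the k-th DFT bin of a block of real-valued samples."""
    n = len(samples)
    x_k = sum(s * cmath.exp(-2j * math.pi * k * i / n) for i, s in enumerate(samples))
    return abs(x_k) ** 2


# A block of 64 samples containing exactly 5 cycles of a cosine:
# its energy is concentrated at bin 5 (and the mirror bin 59).
block = [math.cos(2 * math.pi * 5 * i / 64) for i in range(64)]
energy_at_tone = bin_energy(block, 5)
energy_elsewhere = bin_energy(block, 3)
```

  • evaluating one bin directly costs one pass over the block, which is sufficient when only the selected frequency point is of interest.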
  • Video codecs are used to compress or decompress digital video.
  • the electronic device 100 may support one or more video codecs. In this way, the electronic device 100 can play or record videos in multiple encoding formats, such as moving picture experts group (MPEG) 1, MPEG2, MPEG3, and MPEG4.
  • the NPU is a neural-network (NN) computing processor.
  • through the NPU, applications such as intelligent cognition of the electronic device 100 can be realized, for example, image recognition, face recognition, voice recognition, and text understanding.
  • the external memory interface 120 may be used to connect an external memory card, such as a Micro SD card, to expand the storage capacity of the electronic device 100.
  • the external memory card communicates with the processor 110 through the external memory interface 120 to realize the data storage function. For example, save music, video and other files in an external memory card.
  • the internal memory 121 may be used to store computer executable program code, where the executable program code includes instructions.
  • the processor 110 executes various functional applications and data processing of the electronic device 100 by running instructions stored in the internal memory 121.
  • the internal memory 121 may include a storage program area and a storage data area.
  • the storage program area can store an operating system and an application program required by at least one function (such as a sound playback function or an image playback function).
  • the data storage area can store data (such as audio data, phone book, etc.) created during the use of the electronic device 100.
  • the internal memory 121 may include a high-speed random access memory, and may also include a non-volatile memory, such as at least one magnetic disk storage device, a flash memory device, or a universal flash storage (UFS).
  • the electronic device 100 can implement audio functions through the audio module 170, the speaker 170A, the receiver 170B, the microphone 170C, the earphone interface 170D, and the application processor. For example, music playback, recording, etc.
  • the audio module 170 is used to convert digital audio information into an analog audio signal for output, and is also used to convert an analog audio input into a digital audio signal.
  • the audio module 170 can also be used to encode and decode audio signals.
  • the audio module 170 may be provided in the processor 110, or part of the functional modules of the audio module 170 may be provided in the processor 110.
  • the speaker 170A, also called a “loudspeaker”, is used to convert audio electrical signals into sound signals.
  • the electronic device 100 can play music or a hands-free call through the speaker 170A.
  • the receiver 170B, also called an “earpiece”, is used to convert audio electrical signals into sound signals.
  • when the electronic device 100 answers a call or plays a voice message, the voice can be heard by bringing the receiver 170B close to the ear.
  • the microphone 170C, also called a “mic”, is used to convert sound signals into electrical signals.
  • the user can speak with the mouth close to the microphone 170C to input a sound signal into the microphone 170C.
  • the electronic device 100 may be provided with at least one microphone 170C.
  • the electronic device 100 may be provided with two microphones 170C, which can implement noise reduction functions in addition to collecting sound signals.
  • the electronic device 100 may also be provided with three, four or more microphones 170C to collect sound signals, reduce noise, identify sound sources, and realize directional recording functions.
  • the earphone interface 170D is used to connect wired earphones.
  • the earphone interface 170D may be the USB interface 130, a 3.5 mm open mobile terminal platform (OMTP) standard interface, or a cellular telecommunications industry association of the USA (CTIA) standard interface.
  • the pressure sensor 180A is used to sense the pressure signal and can convert the pressure signal into an electrical signal.
  • the pressure sensor 180A may be provided on the display screen 194.
  • the capacitive pressure sensor may include at least two parallel plates with conductive material. When a force is applied to the pressure sensor 180A, the capacitance between the electrodes changes.
  • the electronic device 100 determines the intensity of the pressure according to the change in capacitance.
  • the electronic device 100 detects the intensity of the touch operation according to the pressure sensor 180A.
  • the electronic device 100 may also calculate the touched position according to the detection signal of the pressure sensor 180A.
  • touch operations that act on the same touch position but have different intensities may correspond to different operation instructions. For example, when a touch operation whose intensity is less than the first pressure threshold acts on the short message application icon, an instruction to view the short message is executed. When a touch operation whose intensity is greater than or equal to the first pressure threshold acts on the short message application icon, an instruction to create a new short message is executed.
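  • the short message example above amounts to a threshold comparison on the touch intensity. A minimal sketch follows; the normalized intensity scale, the value chosen for the first pressure threshold, and the instruction names are hypothetical.

```python
FIRST_PRESSURE_THRESHOLD = 0.5   # hypothetical value on a normalized 0.0..1.0 scale


def sms_icon_instruction(touch_intensity):
    """Map the intensity of a touch on the short message icon to an instruction."""
    if touch_intensity < FIRST_PRESSURE_THRESHOLD:
        return "view_short_message"      # light press: view the short message
    return "create_short_message"        # intensity >= threshold: create a new one
```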
  • the gyro sensor 180B may be used to determine the movement posture of the electronic device 100.
  • the angular velocity of the electronic device 100 around three axes (i.e., the x, y, and z axes) can be determined through the gyro sensor 180B.
  • the gyro sensor 180B can be used for image stabilization.
  • the gyro sensor 180B detects the shake angle of the electronic device 100, calculates the distance that the lens module needs to compensate according to the angle, and allows the lens to counteract the shake of the electronic device 100 through reverse movement to achieve anti-shake.
  • the gyro sensor 180B can also be used for navigation and somatosensory game scenes.
  • the air pressure sensor 180C is used to measure air pressure.
  • the electronic device 100 calculates the altitude based on the air pressure value measured by the air pressure sensor 180C to assist positioning and navigation.
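  • one common way to turn an air pressure reading into an altitude estimate is the international barometric formula. This embodiment does not state which formula is used, so the sketch below is an assumption; the function name `altitude_m` and the standard sea-level reference pressure of 1013.25 hPa are illustrative choices.

```python
SEA_LEVEL_PRESSURE_HPA = 1013.25   # standard atmosphere; an assumed reference value


def altitude_m(pressure_hpa, sea_level_hpa=SEA_LEVEL_PRESSURE_HPA):
    """Approximate altitude in metres from air pressure (international barometric formula)."""
    return 44330.0 * (1.0 - (pressure_hpa / sea_level_hpa) ** (1.0 / 5.255))
```

  • in practice the reference pressure varies with the weather, so an estimate of this kind is better suited to assisting positioning than to giving an absolute height.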
  • the magnetic sensor 180D includes a Hall sensor.
  • the electronic device 100 may use the magnetic sensor 180D to detect the opening and closing of a flip holster.
  • when the electronic device 100 is a flip device, it can detect the opening and closing of the flip cover according to the magnetic sensor 180D.
  • features such as automatic unlocking when the flip cover is opened can then be set according to the detected open or closed state of the holster or the flip cover.
  • the acceleration sensor 180E can detect the magnitude of the acceleration of the electronic device 100 in various directions (generally three axes). When the electronic device 100 is stationary, the magnitude and direction of gravity can be detected. The sensor can also be used to identify the posture of the electronic device, and is applied to scenarios such as switching between landscape and portrait screens and pedometers.
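  • switching between landscape and portrait from the direction of gravity could be sketched as follows. The axis convention (x across the short edge of the screen, y along the long edge) and the function names are assumptions made for illustration; this embodiment does not describe the decision rule.

```python
import math


def gravity_magnitude(ax, ay, az):
    """Magnitude of the measured acceleration vector in m/s^2."""
    return math.sqrt(ax * ax + ay * ay + az * az)


def screen_orientation(ax, ay):
    """Pick portrait or landscape from the gravity components along the x and y axes."""
    # Gravity mostly along the long edge (y) means the device is held upright.
    return "portrait" if abs(ay) >= abs(ax) else "landscape"
```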
  • the electronic device 100 can measure the distance by infrared or laser. In some embodiments, when shooting a scene, the electronic device 100 may use the distance sensor 180F to measure the distance to achieve fast focusing.
  • the proximity light sensor 180G may include, for example, a light emitting diode (LED) and a light detector such as a photodiode.
  • the light emitting diode may be an infrared light emitting diode.
  • the electronic device 100 emits infrared light to the outside through the light emitting diode.
  • the electronic device 100 uses a photodiode to detect infrared reflected light from nearby objects. When sufficient reflected light is detected, it can be determined that there is an object near the electronic device 100. When insufficient reflected light is detected, the electronic device 100 can determine that there is no object near the electronic device 100.
  • the electronic device 100 can use the proximity light sensor 180G to detect that the user holds the electronic device 100 close to the ear to talk, so as to automatically turn off the screen to save power.
  • the proximity light sensor 180G can also be used in leather case mode and pocket mode to automatically unlock and lock the screen.
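  • the proximity decision described above (enough reflected infrared light means an object is near, and the screen is turned off when the device is held to the ear during a call) can be sketched as follows. The threshold value, the unit of the photodiode reading, and the function names are hypothetical.

```python
REFLECTION_THRESHOLD = 100   # hypothetical photodiode reading that counts as "sufficient"


def object_nearby(reflected_light):
    """True when sufficient reflected infrared light is detected."""
    return reflected_light >= REFLECTION_THRESHOLD


def screen_should_be_on(in_call, reflected_light):
    """Turn the screen off during a call while an object (such as the ear) is near."""
    return not (in_call and object_nearby(reflected_light))
```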
  • the ambient light sensor 180L is used to sense the brightness of the ambient light.
  • the electronic device 100 can adaptively adjust the brightness of the display screen 194 according to the perceived brightness of the ambient light.
  • the ambient light sensor 180L can also be used to automatically adjust the white balance when taking pictures.
  • the ambient light sensor 180L can also cooperate with the proximity light sensor 180G to detect whether the electronic device 100 is in the pocket to prevent accidental touch.
  • the fingerprint sensor 180H is used to collect fingerprints.
  • the electronic device 100 can use the collected fingerprint characteristics to implement fingerprint-based unlocking, application lock access, fingerprint-based photographing, fingerprint-based call answering, and so on.
  • the temperature sensor 180J is used to detect temperature.
  • the electronic device 100 uses the temperature detected by the temperature sensor 180J to execute a temperature processing strategy. For example, when the temperature reported by the temperature sensor 180J exceeds a threshold value, the electronic device 100 reduces the performance of the processor located near the temperature sensor 180J, so as to reduce power consumption and implement thermal protection.
  • when the temperature is lower than another threshold, the electronic device 100 heats the battery 142 to avoid abnormal shutdown of the electronic device 100 due to low temperature.
  • the electronic device 100 boosts the output voltage of the battery 142 to avoid abnormal shutdown caused by low temperature.
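  • the temperature processing strategies above can be combined into one illustrative policy function. This is a sketch under stated assumptions: the description presents these behaviors as separate possibilities and gives no threshold values, so the three thresholds, the use of a distinct threshold for voltage boosting, and the action names are all hypothetical.

```python
HIGH_TEMP_C = 45.0        # hypothetical threshold for thermal protection
LOW_TEMP_C = 0.0          # hypothetical threshold for heating the battery
VERY_LOW_TEMP_C = -10.0   # hypothetical threshold for boosting the battery output voltage


def thermal_actions(temp_c):
    """Return the list of actions for a temperature reported by the temperature sensor."""
    actions = []
    if temp_c > HIGH_TEMP_C:
        actions.append("throttle_processor")      # reduce performance near the sensor
    if temp_c < LOW_TEMP_C:
        actions.append("heat_battery")            # avoid abnormal shutdown at low temperature
    if temp_c < VERY_LOW_TEMP_C:
        actions.append("boost_battery_voltage")   # further protection at lower temperature
    return actions
```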
  • the touch sensor 180K is also called a “touch panel”.
  • the touch sensor 180K may be disposed on the display screen 194; together, the touch sensor 180K and the display screen 194 form a touchscreen, also called a “touch screen”.
  • the touch sensor 180K is used to detect touch operations acting on or near it.
  • the touch sensor can pass the detected touch operation to the application processor to determine the type of touch event.
  • the visual output related to the touch operation can be provided through the display screen 194.
  • the touch sensor 180K may also be disposed on the surface of the electronic device 100, at a position different from that of the display screen 194.
  • the bone conduction sensor 180M can acquire vibration signals.
  • the bone conduction sensor 180M can obtain the vibration signal of the bone mass that vibrates when a person speaks.
  • the bone conduction sensor 180M can also contact the human pulse and receive the blood pressure pulse signal.
  • the bone conduction sensor 180M may also be provided in an earphone to form a bone conduction earphone.
  • the audio module 170 can parse out the voice signal based on the vibration signal of the vibrating bone mass obtained by the bone conduction sensor 180M, to realize the voice function.
  • the application processor can parse heart rate information based on the blood pressure pulse signal obtained by the bone conduction sensor 180M, to realize the heart rate detection function.
  • the button 190 includes a power-on button, a volume button, and so on.
  • the button 190 may be a mechanical button. It can also be a touch button.
  • the electronic device 100 may receive key input, and generate key signal input related to user settings and function control of the electronic device 100.
  • the motor 191 can generate vibration prompts.
  • the motor 191 can be used for incoming call vibration notification, and can also be used for touch vibration feedback.
  • touch operations that act on different applications can correspond to different vibration feedback effects.
  • for touch operations acting on different areas of the display screen 194, the motor 191 can also produce different vibration feedback effects.
  • different application scenarios (for example, time reminders, receiving information, alarm clocks, and games) can also correspond to different vibration feedback effects.
  • the touch vibration feedback effect can also support customization.
  • the indicator 192 may be an indicator light, which may be used to indicate the charging status, power change, or to indicate messages, missed calls, notifications, and so on.
  • the SIM card interface 195 is used to connect to the SIM card.
  • the SIM card can be inserted into the SIM card interface 195 or pulled out from the SIM card interface 195 to achieve contact and separation with the electronic device 100.
  • the electronic device 100 may support 1 or N SIM card interfaces, and N is a positive integer greater than 1.
  • the SIM card interface 195 can support Nano SIM cards, Micro SIM cards, SIM cards, etc.
  • multiple cards can be inserted into the same SIM card interface 195 at the same time. The types of the multiple cards can be the same or different.
  • the SIM card interface 195 can also be compatible with different types of SIM cards.
  • the SIM card interface 195 may also be compatible with external memory cards.
  • the electronic device 100 interacts with the network through the SIM card to implement functions such as call and data communication.
  • the electronic device 100 adopts an eSIM, that is, an embedded SIM card.
  • the eSIM card can be embedded in the electronic device 100 and cannot be separated from the electronic device 100.
  • the software system of the electronic device 100 may adopt a layered architecture, an event-driven architecture, a microkernel architecture, a microservice architecture, or a cloud architecture.
  • an Android system with a layered architecture is taken as an example to illustrate the software structure of the electronic device 100.
  • FIG. 2 is a software structure block diagram of an electronic device 100 provided in this embodiment.
  • the layered architecture divides the software into several layers, and each layer has a clear role and division of labor. The layers communicate with each other through software interfaces.
  • the Android system is divided into four layers, from top to bottom, the application layer, the application framework layer, the Android runtime and system library, and the kernel layer.
  • the application layer can include a series of application packages.
  • the application package can include applications such as voice assistant, mail, gallery, calendar, call, map, navigation, WLAN, Bluetooth, music, video, short message, etc.
  • the application framework layer provides an application programming interface (application programming interface, API) and a programming framework for applications in the application layer.
  • the application framework layer includes some predefined functions.
  • the application framework layer can include a window manager, a content provider, a view system, a phone manager, a resource manager, and a notification manager.
  • the window manager is used to manage window programs.
  • the window manager can obtain the size of the display, determine whether there is a status bar, lock the screen, take a screenshot, etc.
  • the content provider is used to store and retrieve data and make these data accessible to applications.
  • the data may include videos, images, audios, phone calls made and received, browsing history and bookmarks, phone book, etc.
  • the view system includes visual controls, such as controls that display text, controls that display pictures, and so on.
  • the view system can be used to build applications.
  • the display interface can be composed of one or more views.
  • a display interface that includes a short message notification icon may include a view that displays text and a view that displays pictures.
  • the phone manager is used to provide the communication function of the electronic device 100. For example, the management of the call status (including connecting, hanging up, etc.).
  • the resource manager provides various resources for the application, such as localized strings, icons, pictures, layout files, video files, and so on.
  • the notification manager enables the application to display notification information in the status bar, which can be used to convey notification-type messages, and it can disappear automatically after a short stay without user interaction.
  • the notification manager is used to notify download completion, message reminders, and so on.
  • notifications can also appear in the status bar at the top of the system in the form of a chart or scroll-bar text, such as a notification of an application running in the background, or appear on the screen in the form of a dialog window. For example, text information is prompted in the status bar, a prompt sound is played, the electronic device vibrates, or the indicator light flashes.
  • Android Runtime includes core libraries and virtual machines. Android runtime is responsible for the scheduling and management of the Android system.
  • the core library consists of two parts: one part is the functions that the Java language needs to call, and the other part is the core library of Android.
  • the application layer and the application framework layer run in a virtual machine.
  • the virtual machine executes the Java files of the application layer and the application framework layer as binary files.
  • the virtual machine is used to perform functions such as object life cycle management, stack management, thread management, security and exception management, and garbage collection.
  • the system library can include multiple functional modules, for example: a surface manager, media libraries, a 3D graphics processing library (for example, OpenGL ES (open graphics library for embedded systems)), and a 2D graphics engine (for example, SGL).
  • the surface manager is used to manage the display subsystem and provides a combination of 2D and 3D layers for multiple applications.
  • the media library supports playback and recording of a variety of commonly used audio and video formats, as well as still image files.
  • the media library can support a variety of audio and video encoding formats, such as: MPEG4, H.264, MP3, AAC, AMR, JPG, PNG, etc.
  • the 3D graphics processing library is used to realize 3D graphics drawing, image rendering, synthesis, and layer processing.
  • the 2D graphics engine is a drawing engine for 2D drawing.
  • the kernel layer is the layer between hardware and software.
  • the kernel layer contains at least a display driver, a camera driver, an audio driver, and a sensor driver.
  • the embodiment of the present application provides three forms of the voice assistant, namely the half-screen state (H), the full-screen state (L), and the floating state (F).
  • according to the proportion of the overall display interface of the mobile phone that the voice assistant interface occupies, the display form of the voice assistant can be divided into the half-screen state (H) and the full-screen state (L).
  • when the voice assistant is in the half-screen state (H), the ratio of its display interface to the overall display interface of the mobile phone is greater than 0 and less than 1.
  • the ratio can be 1:2, as shown in Figure 3(a).
  • the ratio can be 3:8, which is less than 1:2. In this case, the form of the voice assistant can also be called the small half-screen state (H1), as shown in (b) in FIG. 3.
  • the ratio can be 5:8, which is greater than 1:2. In this case, the form of the voice assistant can also be called the large half-screen state (H2), as shown in Figure 3(c).
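  • the ratio thresholds above can be sketched as a small classifier. This is only an illustrative sketch: the function name `classify_half_screen` is hypothetical, and the labels simply reuse the H, H1, and H2 names from the text.

```python
from fractions import Fraction


def classify_half_screen(ratio: Fraction) -> str:
    """Return the half-screen label for an assistant-to-screen ratio.

    Hypothetical helper: the text only states that the half-screen ratio is
    greater than 0 and less than 1, with 1:2 as the dividing line.
    """
    if not 0 < ratio < 1:
        raise ValueError("a half-screen ratio must be greater than 0 and less than 1")
    half = Fraction(1, 2)
    if ratio < half:
        return "H1"  # small half-screen state, e.g. 3:8
    if ratio > half:
        return "H2"  # large half-screen state, e.g. 5:8
    return "H"       # exactly 1:2


print(classify_half_screen(Fraction(3, 8)))
print(classify_half_screen(Fraction(5, 8)))
```

  Exact fractions are used instead of floats so that the comparison with 1:2 is not affected by rounding.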
  • the half-screen (H) voice assistant is suitable for single-round voice interaction scenarios.
  • the single-round voice interaction scenario refers to an application scenario in which the voice assistant can complete corresponding operations according to the instruction information input by the user in a single time.
  • the voice assistant can detect that no keywords are missing from the instruction information, so it does not need to continue interacting with the user to obtain keywords in order to complete the operation indicated by the instruction information. For example, if the instruction information is "open Bluetooth connection", the voice assistant can directly perform the operation of opening the Bluetooth connection.
  • when the voice assistant is in the full-screen state (L), the voice assistant is displayed in full screen on the overall display interface of the mobile phone, that is, the ratio of the display interface of the voice assistant to the overall display interface of the mobile phone is 1:1, as shown in (d) in Figure 3.
  • the full-screen (L) voice assistant is suitable for scenarios with multiple rounds of voice conversations.
  • the scenario of multiple rounds of voice interaction refers to a scenario in which the voice assistant cannot complete the corresponding operation according to the instruction information input by the user a single time, and the user needs to interact with the voice assistant multiple times.
  • for example, when the voice assistant cannot clearly recognize the user's intention, that is, when the voice assistant detects that the instruction information lacks keywords, the voice assistant needs to continue to interact with the user to learn the keywords. It then automatically enters the process of multiple rounds of voice interaction and is displayed in the full-screen state (L). For example, if the instruction information is "buy a ticket", it does not specify the time and destination of the ticket that the user needs to buy. Therefore, the voice assistant continues to ask the user "When do you need to buy a ticket?", "Where is your destination?", and so on.
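  • the distinction between single-round and multi-round interaction can be illustrated with a minimal keyword (slot) check. The intent names, the `REQUIRED_KEYWORDS` table, and both function names are assumptions made for this sketch; the text does not specify how the voice assistant represents keywords internally.

```python
# Hypothetical table: which keywords each instruction needs before it can run.
REQUIRED_KEYWORDS = {
    "open_bluetooth": [],                   # nothing further is needed
    "buy_ticket": ["time", "destination"],  # the two items named in the text
}


def missing_keywords(intent: str, provided: dict) -> list:
    """List the required keywords that the user has not supplied yet."""
    return [name for name in REQUIRED_KEYWORDS[intent] if not provided.get(name)]


def interaction_mode(intent: str, provided: dict) -> str:
    """Single-round if nothing is missing, otherwise multi-round."""
    return "multi-round" if missing_keywords(intent, provided) else "single-round"


print(interaction_mode("open_bluetooth", {}))
print(missing_keywords("buy_ticket", {}))
print(interaction_mode("buy_ticket", {"time": "afternoon", "destination": "Shanghai"}))
```

  In this sketch, each name returned by `missing_keywords` would correspond to one follow-up question, such as "When do you need to buy a ticket?".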
  • when the voice assistant is in the floating state (F), the voice assistant floats on the display interface of the mobile phone. Exemplarily, as shown in (e) of FIG. 3, the voice assistant floats on the display interface of the mobile phone in the form of a floating ball.
  • the voice assistant in the floating state (F) occupies less space on the overall display interface of the mobile phone.
  • the voice assistant in the floating state (F) is mainly suitable for scenarios where the voice assistant and other applications complete operations in collaboration, as well as immersive scenarios where the user's attention is not interrupted.
  • the scenario where the voice assistant and other applications complete operations in cooperation refers to the scenario where the voice assistant needs to cooperate with the third-party application interface to complete the operation corresponding to the instruction information.
  • for example, when the voice assistant is displayed in the half-screen state (H) and the "photo" application is displayed in the lower part of the display interface of the mobile phone, the user inputs the instruction information "Send WeChat to ask Duoduo to eat on weekends". The voice assistant therefore closes the interface of the "photo" application and opens the interface of the "WeChat" application. At this time, the voice assistant is displayed in the floating state (F).
  • immersive scenarios in which the user's attention is not interrupted include, for example, reading and audio and video playback.
  • the half-screen state (H) of the voice assistant can be used as its default form, but the default form is not limited to the half-screen state (H). Users can set the default form of the voice assistant to the full-screen state (L) or the floating state (F) according to their actual needs.
  • the present application also provides a method for displaying a voice assistant, which can switch between different forms of the voice assistant according to certain rules. By switching between the different forms of the voice assistant, the system-level integration of the form of the voice assistant and the actual scene on the mobile phone can be realized.
  • the following description takes the half-screen state (H) as the default form of the voice assistant as an example.
  • the method includes steps S401-S405:
  • the display interface of the mobile phone may be a taskless interface, a single-task interface, or a multi-task interface.
  • after the voice assistant is turned on, it is displayed in the default form, that is, the half-screen state (H), and the voice assistant is in the radio (sound pickup) state at this time, that is, it is listening for user input.
  • the way to open the voice assistant can refer to the existing technology, for example: long press the power button of the mobile phone, or turn on the voice assistant through a voice wake-up word, etc.
  • the display interface of the mobile phone before opening the voice assistant is a taskless interface.
  • the display interface of the mobile phone is a taskless interface (or, described as a homepage interface), and icons of multiple applications are displayed on the interface, as shown in (a) in Figure 5.
  • the current taskless interface moves downwards, revealing the upper part of the mobile phone display interface.
  • the voice assistant is displayed in the upper half of the mobile phone display interface in half-screen mode (H) by default, and the voice assistant is in radio mode;
  • the application icons in the upper half of the original taskless interface are displayed in the lower half of the mobile phone display interface, as shown in (b) in Figure 5.
  • the voice assistant is in a radio state, and the voice assistant's prompt information (for example, "Hi, I'm listening") and prompt graphics (such as sound wave graphics) can be used to prompt the user to input instructions, as shown in 501.
  • the display interface of the voice assistant also displays the voice skill recommendation items "V", "Keyword 1", and "Keyword 2", as shown in 502. Among them, "V" is used to switch the input form of the voice assistant.
  • "Keyword 1" is different from "Keyword 2".
  • for example, "Keyword 1" and "Keyword 2" are "Open Bluetooth connection" and "Change ringtone", respectively.
  • the voice skill recommendation item is the instruction information recommended by the voice assistant for the user, which is used to call the service in the voice assistant, and is determined by the voice assistant according to the current time, the location of the mobile phone, the currently running application, and the user's usage habits.
  • the user can directly click the voice skill recommendation item to input the instruction information, or input the instruction information by voice, or the user can click "V" to switch the input form of the voice assistant, turn on the camera, and then input the instruction information through the video.
  • when the voice assistant is displayed in the half-screen state (H), the user can operate the lower part of the mobile phone display interface.
  • the display interface of the mobile phone is shown in Figure 5 (c).
  • the prompt graphic shown in 501 is changed to a floating ball, indicating that the voice assistant has stopped the radio and entered a sleep state, and the display interface of the voice assistant displays the updated voice skill recommendation items "V", "Keyword 3", and "Keyword 4", as shown in 502.
  • "Keyword 3" and "Keyword 4" may be the same as or different from "Keyword 1" and "Keyword 2".
  • "Keyword 3" and “Keyword 4" are recommended items related to the current application interface, such as "photos from the last weekend” and “sharing photos”.
  • if the mobile phone display interface is large, the user can manually turn on the one-handed operation mode of the mobile phone, or the mobile phone can automatically enter the one-handed operation mode after the voice assistant is turned on, so that the user can operate the lower half of the display interface of the mobile phone.
  • the display interface of the mobile phone before opening the voice assistant is a single task interface.
  • the display interface of the mobile phone is a single-task interface.
  • the task in the interface is a "photo" application, as shown in (a) in FIG. 6.
  • the current single-task interface moves downward as a whole, revealing the upper part of the mobile phone display interface.
  • the voice assistant is displayed in the upper half of the mobile phone display interface in half-screen mode (H), and the voice assistant is in radio mode;
  • the part of the "photo" application that occupied the upper part of the screen during full-screen display is now displayed in the lower part of the display interface of the mobile phone, as shown in Figure 6(b).
  • the voice assistant is in the radio state, and the voice assistant's prompt information (for example, "Hi, I'm listening") and prompt graphics (such as sound wave graphics) can be used to prompt the user to input instructions, as shown in 601.
  • the voice assistant display interface also displays the voice skill recommendation items "V", "Keyword 1", and "Keyword 2", as shown in 602. "V" is used to switch the input form of the voice assistant, and "Keyword 1" is different from "Keyword 2".
  • "Keyword 1" and “Keyword 2" are "Query Today's Shanghai Weather” and “Share Photos", respectively.
  • when the voice assistant is displayed in the half-screen state (H), the user can operate the lower part of the mobile phone display interface.
  • the voice assistant and the "photo” application are displayed on separate screens.
  • the display interface of the mobile phone is as shown in Figure 7 (a).
  • if the user clicks a picture in the "Photos" application (for example, the picture shown in 701), the picture shown in 701 is displayed in full screen in the lower half of the display interface of the mobile phone, and the "title bar" shown in the figure is the name or number of the picture shown in 701, as shown in (b) in Figure 7.
  • in FIG. 7(b), a toolbar for operating on the clicked picture is also located in the invisible area outside the screen.
  • when the display interface of the "Photo" application moves up as a whole, the photos originally located in the invisible area outside the screen are displayed in the lower part of the display interface of the mobile phone, and the photos originally located in the visible area of the screen move up and are no longer visible, as shown in (c) in FIG. 7.
  • the dotted line K is the boundary between the visible area on the screen and the invisible area outside the screen.
  • the display interface of the mobile phone before opening the voice assistant is a multitasking interface.
  • the display interface of the mobile phone is a multitasking interface: the "WeChat" application is located in the upper half of the mobile phone display interface, and the "Photo" application is located in the lower half of the mobile phone display interface, as shown in Figure 8(a).
  • the voice assistant is displayed in the upper half of the mobile phone display interface in the half-screen state (H) by default, the "WeChat" application interface is closed, and the voice assistant is in the radio state.
  • the "photo" application remains unchanged and is displayed on a split screen with the voice assistant, that is, the "photo" application is still displayed in the lower part of the mobile phone display interface, as shown in (b) in Figure 8.
  • the voice assistant is in a radio state, and the voice assistant's prompt information (for example, "Hi, I'm listening") and prompt graphics (such as sound wave graphics) can be used to prompt the user to input instructions, as shown in 801.
  • the voice skill recommendation items "V”, "Keyword 1", and “Keyword 2" as shown in 802 are also displayed on the display interface of the voice assistant.
  • “V” is used to switch the input form of the voice assistant.
  • “Keyword 1" is different from “Keyword 2".
  • for example, "Keyword 1" and "Keyword 2" are "Select weekend photos" and "Share photos", respectively. For the specific description of the voice skill recommendation items and the way the user enters the instruction information, refer to the description above, which is not repeated here.
  • the half-screen state (H) of the voice assistant can be further divided into a small half-screen state (H1) and a large half-screen state (H2).
  • the default form of the voice assistant is generally set to the small half-screen state (H1).
  • the default form of the voice assistant can also be set to the large half-screen state (H2) according to requirements.
  • the display interface of the large half-screen state (H2) of the voice assistant is larger and can display more content.
  • the voice assistant determines its display form according to the received instruction information, and enters a sleep state after performing the operation indicated by the instruction information.
  • after step S401, the user inputs instruction information to the voice assistant through voice or video.
  • the voice assistant determines the service involved in the instruction information, converts the instruction information into text information, and displays it. Subsequently, the voice assistant determines whether to switch the display mode of the voice assistant according to the current application scenario and the feedback form of the service involved in the instruction information. After the voice assistant completes the operation corresponding to the instruction message, it stops radio reception and enters the sleep state.
  • the services include services provided by applications on the voice assistant's own platform (such as turning on Bluetooth or inquiring about the weather) and services that the voice assistant provides by calling other applications (such as sending a WeChat message or opening the "Taobao" application).
  • the feedback forms of these services in the voice assistant include text feedback, voice feedback, card feedback (service-related setting items or application items in the feedback card), and split-screen feedback (application interface changes).
  • the feedback form of the service is determined by the setting items of the application related to the service. For example, if the "weather" application can only be displayed on the display interface of the voice assistant in the form of a card, the feedback form of the service that the voice assistant provides by calling the "weather" application is card feedback.
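  • as a rough illustration, the four feedback forms and the per-application lookup could be modeled as follows. The `APP_FEEDBACK_SETTINGS` table and the `feedback_form_for` function are hypothetical; only the "weather" entry (card feedback) is stated in the text, and the other entries merely mirror examples used elsewhere in this description.

```python
from enum import Enum


class FeedbackForm(Enum):
    TEXT = "text"
    VOICE = "voice"
    CARD = "card"
    SPLIT_SCREEN = "split-screen"


# Hypothetical setting items, keyed by the application related to the service.
APP_FEEDBACK_SETTINGS = {
    "weather": FeedbackForm.CARD,         # stated in the text
    "bluetooth": FeedbackForm.TEXT,       # assumption for illustration
    "wechat": FeedbackForm.SPLIT_SCREEN,  # assumption for illustration
}


def feedback_form_for(app: str) -> FeedbackForm:
    """Look up the feedback form from the application's setting items."""
    return APP_FEEDBACK_SETTINGS[app]


print(feedback_form_for("weather").value)
```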
  • the display form of the voice assistant may or may not change.
  • the display form of the voice assistant does not change.
  • the voice assistant is displayed in the upper half of the display interface of the mobile phone in its default form, that is, the half-screen state, and icons of some applications are displayed in the lower half of the display interface of the mobile phone.
  • the display interface of the voice assistant is as shown in Figure 5 (d).
  • the prompt information of the voice assistant (for example, "Hi, I'm listening!") is changed to the instruction information input by the user.
  • the prompt graphic shown in 501 at this time is a sound wave graphic, but there is no voice skill recommendation item.
  • the voice assistant determines that no keywords are missing from the instruction information "open Bluetooth connection", that is, the interaction process between the voice assistant and the user is a single round of voice interaction, and the feedback form of the service involved in the instruction information is text feedback or voice feedback. Therefore, after the operation corresponding to the instruction information is completed, the display form of the voice assistant is still the half-screen state (H), as shown in (e) in FIG. 5.
  • the prompt graphic shown in 501 is changed to a floating ball, indicating that the voice assistant has stopped receiving sound and entered a sleep state, and the prompt information shown in 501 is changed to the feedback text of the instruction, "OK, Bluetooth is turned on". The voice assistant also updates and displays the recommended voice skills, as shown in 502.
  • the updated voice skills recommendation items are "Turn off wireless connection” and "Open photos”. If the feedback form of the service involved in the instruction information is voice feedback, the voice assistant needs to output the feedback text by voice while displaying the feedback text.
  • the voice assistant is displayed in the upper half of the display interface of the mobile phone in its default form, that is, the half-screen state, and the "photo” application is displayed in the lower half of the display interface of the mobile phone.
  • the display interface of the voice assistant is as shown in Figure 8(c).
  • the prompt information of the voice assistant (for example, "Hi, I'm listening!") is changed to the instruction information input by the user, that is, "Select weekend photos".
  • the prompt graphic shown in 801 is a sound wave graphic, but there is no voice skill recommendation item.
  • the voice assistant determines that no keywords are missing from the instruction information "Select weekend photos", that is, the interaction process between the voice assistant and the user is a single round of voice interaction, and the feedback form of the service related to the instruction information is text feedback or voice feedback. Therefore, after the operation corresponding to the instruction information is completed, the display form of the voice assistant is still the half-screen state (H), and the weekend photos in the "Photos" application are selected (a picture with a tick in its lower right corner is a selected picture), as shown in (d) in Figure 8.
  • the prompt graphic shown in 801 is changed to a floating ball, which is used to indicate that the voice assistant has stopped receiving and is in a dormant state, and the prompt information shown in 801 is changed to the feedback text "selected".
  • the voice assistant updates and displays the recommended voice skills, as shown in 802.
  • the updated voice skill recommendation items are "share photos", “delete photos”, and so on. If the feedback form of the service involved in the instruction information is voice feedback, the voice assistant needs to output the feedback text by voice while displaying the feedback text.
  • when the voice assistant is displayed in the small half-screen state (H1), if the instruction information input by the user does not lack keywords, that is, the interaction process between the voice assistant and the user is a single round of voice interaction, and the feedback form of the service involved in the instruction information is text feedback or voice feedback, the display form of the voice assistant is still the small half-screen state (H1).
  • when the voice assistant is displayed in the large half-screen state (H2), if the instruction information input by the user does not lack keywords, that is, the interaction process between the voice assistant and the user is a single round of voice interaction, and the feedback form of the service involved in the instruction information is text feedback, voice feedback, or card feedback, the display form of the voice assistant is still the large half-screen state (H2).
  • when the voice assistant is displayed in the half-screen state (H), as shown in Figure 9(b), its display form is switched from the half-screen state (H) to the full-screen state (L), as shown in (c) in Figure 9, in either of two cases: the instruction information input by the user does not lack keywords (so the interaction is a single round of voice interaction) and the feedback form of the service involved in the instruction information is card feedback; or the instruction information input by the user lacks keywords.
  • the voice assistant is displayed in the upper half of the display interface of the mobile phone in a half-screen state (H) by default, and the "photo” application is displayed in the lower half of the display interface of the mobile phone.
  • the display interface of the voice assistant is as shown in Figure 6(c).
  • the voice assistant's prompt information (for example, "Hi, I'm listening") is changed to "Check the weather in Shanghai today"
  • the prompt graphic shown in 601 is a sound wave graphic, but there is no voice skill recommendation item.
  • the voice assistant determines that no keywords are missing from the instruction information "Query Today's Shanghai Weather", that is, the interaction process between the voice assistant and the user is a single round of voice interaction, and the feedback form of the service involved in the instruction information "Query Today's Shanghai Weather" is card feedback.
  • the display mode of the voice assistant is switched to the full-screen state (L), as shown in (d) in Figure 6.
  • the prompt graphic shown in 601 is a floating ball, which means that the voice assistant has stopped receiving the sound and enters the dormant state.
  • two buttons, "1" and "2", are displayed on both sides of the floating ball and are used to switch the input form of the voice assistant.
  • at this time, the input form of the voice assistant is voice input. Clicking the "1" button switches the input form of the instruction information to keyboard input (opening the keyboard), and clicking the "2" button switches the input form of the instruction information to video input (opening the camera).
  • the voice assistant updates and displays the voice skill recommendation items.
  • the updated voice skill recommendation items are "photos from last weekend” and "share photos”.
  • the prompt information of the voice assistant is changed to the feedback text of the instruction information "Today's weather in Shanghai is sunny", and the prompt information also includes the feedback weather card, which displays the detailed weather information of today's Shanghai.
  • the voice assistant may also output the feedback text "Today's weather in Shanghai is sunny" by voice while displaying the feedback text.
  • the voice assistant is displayed in the upper half of the display interface of the mobile phone in a half-screen state (H), and the "photo” application is displayed in the lower half of the display interface of the mobile phone, as shown in (a) of FIG. 10.
  • the voice assistant is in a radio state, and the voice assistant's prompt information (for example, "Hi, I'm listening") and prompt graphics (such as sound wave graphics) can be used to prompt the user to input instructions, as shown in 1001.
  • 1002 shows the voice skill recommendation items "share photos" and "buy tickets”.
  • the display interface of the voice assistant is as shown in Figure 10 (b).
  • the prompt information of the voice assistant (for example, "Hi, I'm listening!") is changed to the instruction information input by the user, "buy a ticket".
  • at this time, the prompt graphic shown in 1001 is a sound wave graphic, but there is no voice skill recommendation item.
  • the voice assistant determines that key slot information is missing from the instruction information "buy a ticket". Its interaction with the user automatically enters the process of multiple rounds of voice interaction, and it switches its display form to the full-screen state (L), as shown in Figure 10(c).
  • as shown in 1003, the prompt graphic is a sound wave graphic, indicating that the voice assistant is in the radio state, and two buttons, "1" and "2", are displayed on both sides of the sound wave graphic. "1" and "2" are used to switch the input form of the voice assistant.
  • at this time, the input form of the voice assistant is voice input. As described above, clicking the "1" button switches the input form to keyboard input (opening the keyboard), and clicking the "2" button switches the input form to video input (opening the camera).
  • the content shown in 1004 is a process of multiple rounds of voice interaction between the voice assistant and the user, where the voice assistant can determine that the user's intention is "buy a ticket to Shanghai on 8.24 pm”. At this time, the voice assistant determines that there is no shortage of keywords in "Buy a ticket to Shanghai on 8.24pm", and the feedback form of the service corresponding to the instruction information is card feedback, and the voice assistant's display form is still full screen (L).
  • the display form of the voice assistant is shown in Figure 10(d).
  • the prompt graphic is switched to a floating ball, which is used to indicate that the voice assistant stops receiving the sound and enters the dormant state.
  • the prompt information changes to the feedback text "Purchased the air ticket to Shanghai on the afternoon of 8.24" and the corresponding feedback card. As shown in 1006, the voice assistant updates and displays the voice skill recommendation items.
  • the updated voice skill recommendation items are "rebook", "refund", "share itinerary", and so on.
  • when the voice assistant is displayed in the full-screen state (L), the user can access the settings of the mobile phone and the history of the voice conversations between the user and the voice assistant on the mobile phone.
  • compared with the half-screen state (H), the full-screen state (L) displays more complete information, and the voice interaction process is more immersive, which means that the user's attention is more concentrated.
  • when the voice assistant is displayed in the small half-screen state (H1), if the instruction information input by the user does not lack keywords, that is, the interaction process between the voice assistant and the user is a single round of voice interaction, and the feedback form of the service involved in the instruction information is card feedback, the display form of the voice assistant is switched to the large half-screen state (H2).
  • the voice assistant is displayed in a small half-screen state (H1).
  • the voice assistant is in a radio state. As shown in 1101, the prompt graphic (such as a sound wave graphic) and the prompt text (such as "Hi, I'm listening...") are used to prompt the user to input instructions. As shown in 1102, the recommended voice skills are "V", "Photos from Last Weekend", "Share Photos", and so on.
  • the voice assistant receives instruction information through voice or other means, such as "open Bluetooth connection", and the display interface of the mobile phone is as shown in Figure 11 (b).
  • the prompt graphic is still a sound wave graphic, and the prompt information is changed to the instruction information "open Bluetooth connection".
  • the voice assistant determines that no keywords are missing from the instruction information and that the feedback form of the service corresponding to the instruction information is text feedback or voice feedback, so the display form of the voice assistant is still the small half-screen state (H1), as shown in (c) in Figure 11.
  • the voice assistant executes the service corresponding to the instruction information "open Bluetooth connection". As shown in 1101, the prompt information changes to the service feedback text "OK, Bluetooth is turned on". If the feedback form of the service involved in the instruction information is voice feedback, the voice assistant also needs to output the feedback text by voice while displaying it.
  • the prompt graphic shown in 1101 changes to a floating ball, indicating that the voice assistant has stopped receiving audio and is in a dormant state.
  • the voice assistant updates and displays the recommended item information, and the updated voice skill recommendation items are "V", "Turn off Bluetooth", "Photos from last weekend”, etc.
  • if the feedback form is card feedback, the display interface of the mobile phone is as shown in (d) in Figure 11: the form of the voice assistant is switched to the large half-screen state (H2), and the content shown in 1101 and 1102 remains unchanged.
  • as shown in 1103, the display interface of the voice assistant also includes a feedback card, and the content of the feedback card is the setting item of the Bluetooth switch (in this case, it indicates that Bluetooth is on).
  • when the voice assistant is displayed in the small half-screen state (H1) or the large half-screen state (H2), if the instruction information entered by the user lacks keywords, the interaction process between the voice assistant and the user is multiple rounds of voice interaction, and the display form of the voice assistant is switched to the full-screen state (L). For details, refer to the example of switching the voice assistant from the half-screen state (H) to the full-screen state (L), which is not repeated here.
  • when the voice assistant is in the half-screen state (H), as shown in Figure 9(a), if the instruction information entered by the user does not lack keywords and the feedback form of the service involved in the instruction information is split-screen feedback, the "simulated click" skill is triggered and the display form of the voice assistant is switched to the floating state (F), as shown in Figure 9(e).
  • the voice assistant is displayed in the upper half of the display interface of the mobile phone in a half-screen state (H), and the "photo” application is displayed in the lower half of the display interface of the mobile phone.
  • the prompt graphic is a sound wave graphic, and the prompt text (for example, "Hi, I'm listening.") changed to the instruction message "Send WeChat to Duoduo to ask if you have dinner on the weekend".
  • the voice assistant determines that no keywords are missing from the instruction information and that the feedback form of the service corresponding to the instruction information is split-screen feedback.
  • the voice assistant needs to cooperate with the third-party service "WeChat” application to trigger "simulated click” (deeplink) skill to complete the operation corresponding to the instruction message.
  • the voice assistant switches its display form to a floating state (F), as shown in (b) of Figure 12.
  • the voice assistant enters the floating state (F), as shown in 1201
  • the prompt graphic changes to a floating ball
  • the prompt information changes to the instruction message "Send WeChat to Duoduo to ask if you have dinner on weekends", and the "Photos" application resumes full-screen display.
  • the "simulation click" skill continues to be activated, the "photo” application is closed, and the "WeChat” application is opened and displayed in full screen, as shown in 1201, the prompt graphic is a floating ball, and the prompt information is "Send a WeChat to Duoduo to ask if you have dinner on the weekend.” If the floating ball is clicked before the "simulated click” skill is completed, the "simulated click” skill will be terminated, and the voice assistant's display form is shown in Figure 12 (d). Referring to (d) in FIG. 12, as shown in 1201, the prompt graphic is a floating ball and there is no prompt message, which means that the voice assistant has stopped receiving the sound and entered the dormant state.
  • the display form of the voice assistant is as shown in (e) in Figure 12.
  • the prompt graphic is a floating ball
  • the text prompt is the feedback text "Sent", which reports the execution result of the instruction message "Send WeChat to Duoduo to ask if you have dinner on the weekend".
  • the voice assistant stops receiving audio and enters the dormant state, as shown in (f) of Figure 12.
  • the prompt graphic shown in 1201 is a floating ball, and there is no prompt information.
  • the voice assistant should also determine the WeChat recipient contact before completing the operation corresponding to the above instructions, as shown in (h) of Figure 12.
  • the contact list includes "Zhou Duoduo" and "Li Duoduo".
  • the user can tap the contact's row to select "Zhou Duoduo" as the recipient of the WeChat message, and the corresponding information is sent to the contact "Zhou Duoduo" according to the instructions.
  • click the prompt message "First Contact” shown in 1201 determine that the recipient of the WeChat message is "Zhou Duoduo", and send corresponding information to the contact "Zhou Duoduo" according to the instructions.
  • the voice assistant switches its display form to the floating state (F)
  • displaying the application in full screen increases the success rate of the "simulated click" skill.
  • the floating state (F) of the voice assistant is automatically entered after the voice assistant recognizes and calculates the received instruction information, and cannot be entered manually.
  • the voice assistant when the voice assistant is displayed in the small half-screen state (H1) or the larger half-screen state (H2), if the instruction information input by the user does not lack keywords, the interaction process between the voice assistant and the user is a single round of voice interaction. And the feedback form of the service involved in the instruction information is split-screen feedback, and the display form of the voice assistant is switched to the floating state (F). Specifically, please refer to the example in which the half-screen state (H) of the voice assistant is switched to the floating state (F), which will not be repeated here.
  • when the voice assistant is in the full-screen state (L), as shown in Figure 9(d), if the feedback form of the service involved in the instruction information input by the user is split-screen feedback, the "simulated click" skill is triggered and the voice assistant enters the floating state (F), as shown in Figure 9(e).
  • the user's intention is determined through multiple rounds of voice interaction with the user.
  • the user's intention is "Send WeChat to Duoduo to ask for dinner on weekends". If the feedback form of the service corresponding to the user's intention is text feedback, voice feedback, or card feedback, the display form of the voice assistant remains the full-screen state (L). If the feedback form of the service corresponding to the user's intention is split-screen feedback, the display form of the voice assistant is switched to the floating state (F).
  • the voice assistant is displayed in the full-screen state (L), as shown in (a) of Figure 13.
  • the voice assistant is in the audio-receiving state, as shown in 1301
  • the prompt message is "What help is needed"
  • the prompt graphic is a sound wave graphic, and two buttons "1" and "2" are displayed on either side of the sound wave graphic, where "1" and "2" are used to switch the input form of the voice assistant, and the prompt information shown in 1301 and the prompt graphic shown in 1303 are used to prompt the user to input instruction information.
  • the recommended voice skills are "Turn off the wireless network", “Change the ringtone” and so on.
  • the user inputs the instruction information "Send WeChat to Duoduo to ask if you have dinner on weekends” through voice or other means, as shown in Figure 13(b).
  • the "simulated click" skill is activated, and the "WeChat" application is opened and displayed in full screen; as shown in 1303, the prompt graphic is a sound wave graphic, and buttons "1" and "2" are displayed on either side of the sound wave graphic.
  • the content shown in 1301 and 1302 has not changed, and 1304 shows the instruction message input by the user "Send WeChat to Duoduo to ask if you have dinner on the weekend".
  • the voice assistant determines that no keywords are missing from "Send WeChat to Duoduo to ask if you want to eat on weekends?" and that the feedback form of the operation corresponding to the instruction information is split-screen feedback.
  • the "WeChat” application cooperates to complete the operation corresponding to the user's intention.
  • the display mode of the voice assistant is switched to the floating state (F), as shown in Figure 13(c). Referring to Fig. 13(c), as shown in 1303, the prompt graphic is a floating ball, and the prompt message "Send WeChat to Duoduo to ask if you eat on weekends" is displayed on the side of the floating ball.
  • the voice assistant is displayed as shown in (c) of Figure 13, and the feedback text "Sent" is displayed beside the floating ball shown in 1303. The voice assistant then enters the dormant state, as shown in (d) of Figure 13, where the floating ball shown in 1303 indicates that the voice assistant has stopped receiving audio and entered the dormant state.
  • the voice assistant when the voice assistant is displayed in the half-screen state (H), the user can also touch the lower half of the display interface of the mobile phone to stop the voice assistant from receiving the sound and enter the dormant state.
  • the voice assistant is displayed in full screen mode (L)
  • the user can access the settings of the mobile phone and the history of the voice conversation between the user and the voice assistant on the mobile phone.
  • the functions it can provide are more complete than in the half-screen state (H), and the voice interaction process is more immersive, which means that the user's attention is more concentrated.
  • the user can also tap the graphic indicating that the voice assistant is in the audio-receiving state, or use voice interaction, so that the voice assistant stops receiving audio and enters the dormant state.
  • the voice assistant is in the floating state (F)
  • the user can use the method mentioned in the above example to click the floating ball before the "simulated click" skill is successfully executed, so that the voice assistant enters the dormant state.
  • the display form of the dormant voice assistant may be the half-screen state (H), the full-screen state (L), or the floating state (F). When the user subsequently wakes the voice assistant again, for example by tapping the floating ball, the display form of the voice assistant may or may not change, depending on its display form when it entered the dormant state. The two cases are described below with reference to Figure 6, Figure 9, and Figure 12:
  • if the display form of the voice assistant in the dormant state is the floating state (F), as shown in Figure 9(e), then after it is woken again its display form switches to its default form, the half-screen state (H), as shown in Figure 9(a).
  • the display interface of the mobile phone is as shown in (f) of FIG. 12.
  • the voice assistant is awakened, and the wake-up form is the default form, that is, half-screen state (H).
  • the display interface of the mobile phone is displayed in the lower half of the screen.
  • the displayed application is a "WeChat" application, as shown in (g) of FIG. 12.
  • the voice assistant is in the audio-receiving state; as shown in 1301, a prompt graphic (such as a sound wave graphic) and a prompt message (such as "Hi, I'm listening.") prompt the user to input instructions, and 1302 shows the recommended voice skills, such as "V", "Return to Photo", "Send WeChat", and "Send Red Envelope".
  • if the display form of the voice assistant in the dormant state is the half-screen state (H) or the full-screen state (L), as shown in (b) or (d) of Figure 9, then after the voice assistant is woken again its display form remains the half-screen state (H) or the full-screen state (L), as shown in (a) or (c) of Figure 9.
  • the voice assistant if the voice assistant is in the dormant state and its display form is the full-screen state (L), it will still be displayed in the full-screen state (L) after being awakened, as shown in (d) in FIG. 6.
  • the graphic shown in 601 is switched to a floating ball, and two buttons “1" and “2" are displayed on both sides of the floating ball.
  • 602 shows the updated voice skill recommendation items "Keyword 3" and "Keyword 4".
  • the weather card shown in 603 contains the feedback text "Today's weather in Shanghai is sunny” and detailed weather information of today's Shanghai. Among them, "Keyword 3" and “Keyword 4" may be the same as or different from “Keyword 1" and "Keyword 2".
  • "keyword 3" and “keyword 4" are "photos from last weekend” and “shared photos”, respectively.
  • the "1" and “2" on both sides of the floating ball shown in 601 are used to switch the input form of the voice assistant.
  • the input form of the voice assistant is voice input. Tapping the "1" button switches the input form of the instruction information to keyboard input (opens the keyboard), and tapping the "2" button switches the input form of the instruction information to video input (opens the camera).
  • the voice assistant will still be displayed in the half-screen state (H) after being awakened, as shown in (b) in FIG. 6.
  • the voice assistant is in a radio receiving state, and the graphics and/or text shown in 601 (for example, "Hi, I'm listening.") can be used to prompt the user to input instructions.
  • the voice assistant display interface also displays the voice skill recommendation items "V", "Keyword 1", and "Keyword 2", as shown in 602. "V" is used to switch the input form of the voice assistant, and "Keyword 1" is different from "Keyword 2".
  • "Keyword 1" and "Keyword 2" are "photos from the weekend" and "photo sharing", respectively.
  • for the voice skill recommendation items and the way the user inputs the instruction information, refer to the above description, which is not repeated here.
  • after the voice assistant enters the dormant state, the user can wake up the voice assistant by tapping the floating ball.
  • for the change of the display form after the voice assistant is woken again, refer to the description in step S402 above, which is not repeated here.
  • the voice assistant receives the instruction information again, determines its display form according to the new instruction information, and enters a sleep state after performing the operation indicated by the instruction information.
  • for the specific implementation process of step S404, refer to the description in step S402, which is not repeated here.
  • the switch of the voice assistant from the half-screen state (H) to the full-screen state (L), and the switch from the full-screen state (L) to the floating state (F), are irreversible. That is, the voice assistant cannot switch from the full-screen state (L) to the half-screen state (H), nor from the floating state (F) to the full-screen state (L).
  • when the voice assistant is in the half-screen state (H), its display form can be switched to the full-screen state (L) or the floating state (F).
  • the voice assistant When the voice assistant is in the full screen state (L), it can switch its display form to the floating state (F).
  • when the voice assistant is in the floating state (F), it can only switch its display form to the default form.
  • the default form of the voice assistant is the half-screen state (H).
  • the form conversion of the voice assistant can also be set to be reversible according to requirements, that is, the voice assistant can switch from a full-screen state (L) to a half-screen state (H).
  • the voice assistant can also be switched to the small half screen mode (H1) according to the instructions input by the user and the current scene.
  • when the voice assistant is displayed in the full-screen state (L), it can also be switched to the large half-screen state (H2) according to the instruction information input by the user and the current scene.
  • the display mode of the voice assistant can be half-screen (H), full-screen (L) or floating (F).
  • H half-screen
  • L full-screen
  • F floating
  • when the voice assistant is displayed in the half-screen state (H) or the full-screen state (L), the user can close and exit the voice assistant through voice interaction or by swiping up on the voice assistant display interface.
  • the voice assistant When the voice assistant is displayed in the floating state (F), the user can close and exit the voice assistant by swiping up or down the floating ball.
  • the mobile phone display interface is a taskless interface, as shown in Figure 5(a). If the voice assistant is displayed in the half-screen state and another application interface is displayed in the lower half of the mobile phone display interface (taking the "photo" application as an example), then after the voice assistant is exited, the application in the lower half is displayed in full screen and the display interface of the mobile phone is a single-task interface, as shown in Figure 6(a).
  • the voice assistant can switch the form of the voice assistant according to the actual scene of the mobile phone and the instruction information input by the user, so as to realize the system-level integration of the form of the voice assistant and the actual scene on the mobile phone, and improve the user experience.
  • the present application provides a voice assistant display method. After the voice assistant is turned on, the voice assistant is displayed in a preset default display form. Then, according to the instruction information of the input voice assistant and the service indicated by the instruction information, the display form of the voice assistant is determined, so that the voice assistant can determine the change of the actual scene according to the instruction information, and switch the corresponding form according to the actual scene, Make the voice assistant and the system work together, so as to realize the system-level integration of the voice assistant and the mobile phone.
  • the chip system includes at least one processor 1401 and at least one interface circuit 1402.
  • the processor 1401 and the interface circuit 1402 may be interconnected by wires.
  • the interface circuit 1402 may be used to receive signals from other devices (such as the memory of the electronic device 100).
  • the interface circuit 1402 may be used to send signals to other devices (such as the processor 1401).
  • the interface circuit 1402 may read instructions stored in the memory, and send the instructions to the processor 1401.
  • the electronic device can be made to execute each step executed by the electronic device 100 (for example, a mobile phone) in the above-mentioned embodiment.
  • the chip system may also include other discrete devices, which are not specifically limited in the embodiment of the present application.
  • the foregoing embodiments may be implemented in whole or in part by software, hardware, firmware, or any combination thereof.
  • the above-mentioned embodiments may appear in the form of a computer program product in whole or in part, and the computer program product includes one or more computer instructions.
  • the computer program instructions When the computer program instructions are loaded and executed on the computer, the processes or functions according to the embodiments of the present application are generated in whole or in part.
  • the computer may be a general-purpose computer, a special-purpose computer, a computer network, or other programmable devices.
  • Computer instructions may be stored in a computer-readable storage medium, or transmitted from one computer-readable storage medium to another computer-readable storage medium.
  • Computer instructions may be transmitted from one website, computer, server, or data center to another website, computer, server, or data center in a wired manner (for example, coaxial cable, optical fiber, or digital subscriber line (DSL)) or a wireless manner (for example, infrared, radio, or microwave).
  • the computer-readable storage medium may be any available medium that can be accessed by a computer or a data storage device such as a server or data center integrated with one or more available media.
  • the usable medium may be a magnetic medium (for example, a floppy disk, a hard disk, and a magnetic tape), an optical medium (for example, a DVD), or a semiconductor medium (for example, a solid state disk (SSD)).
  • the disclosed device and method can be implemented in other ways.
  • the device embodiments described above are merely illustrative.
  • the division of the modules or units is only a logical function division. In actual implementation, there may be other division methods; for example, multiple units or components may be combined or integrated into another device, or some features may be omitted or not implemented.
  • the displayed or discussed mutual coupling or direct coupling or communication connection may be indirect coupling or communication connection through some interfaces, devices or units, and may be in electrical, mechanical or other forms.
  • the units described as separate parts may be physically separated or not physically separated.
  • the parts displayed as a unit may be one physical unit or multiple physical units, that is, they may be located in one place or distributed across multiple different places. In practice, some or all of the units can be selected according to actual needs to achieve the purpose of the solution of the embodiment.
  • the functional units in the various embodiments of the present application may be integrated into one processing unit, or each unit may exist alone physically, or two or more units may be integrated into one unit.
  • the above-mentioned integrated unit can be implemented in the form of hardware or software functional unit.
  • if the integrated unit is implemented in the form of a software functional unit and sold or used as an independent product, it can be stored in a computer-readable storage medium.
  • the technical solutions of the embodiments of the present application, in essence, or the part that contributes to the prior art, or all or part of the technical solutions, can be embodied in the form of a software product. The computer software product is stored in a storage medium and includes several instructions that cause a device (which may be a personal computer, a server, a network device, a single-chip microcomputer, or a chip) or a processor to execute all or part of the steps of the methods described in the various embodiments of the present application.
  • the aforementioned storage media include media that can store program code, such as a USB flash drive, a removable hard disk, read-only memory (ROM), random access memory (RAM), a magnetic disk, or an optical disc.
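The form-switching constraints stated earlier in this list (H to L and L to F are irreversible, the floating state can only return to the default form, and the conversion may optionally be configured as reversible so that L can return to H) can be sketched as a small transition table. This is an illustrative sketch only: the name `can_switch`, the `reversible` flag, and the single-letter form codes are assumptions introduced here, not part of the application.

```python
# Hypothetical transition table for the three display forms.
# "H" = half-screen, "L" = full-screen, "F" = floating.
# The default form is assumed to be the half-screen state, as the text states.
DEFAULT_FORM = "H"

BASE_TRANSITIONS = {
    "H": {"L", "F"},        # half-screen may go full-screen or floating
    "L": {"F"},             # full-screen may only go floating
    "F": {DEFAULT_FORM},    # floating may only return to the default form
}


def can_switch(src: str, dst: str, reversible: bool = False) -> bool:
    """Return True if switching from src to dst is permitted.

    With reversible=True, the optional L -> H switch is also permitted.
    A "switch" to the same form is not treated as a transition.
    """
    if src not in BASE_TRANSITIONS or dst not in BASE_TRANSITIONS:
        raise ValueError(f"unknown display form: {src!r} -> {dst!r}")
    allowed = set(BASE_TRANSITIONS[src])
    if reversible and src == "L":
        allowed.add("H")
    return dst in allowed
```

For example, `can_switch("L", "H")` is rejected under the default irreversible setting, while `can_switch("L", "H", reversible=True)` is accepted.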

Landscapes

  • Engineering & Computer Science (AREA)
  • Theoretical Computer Science (AREA)
  • Physics & Mathematics (AREA)
  • General Engineering & Computer Science (AREA)
  • Human Computer Interaction (AREA)
  • Software Systems (AREA)
  • General Physics & Mathematics (AREA)
  • Multimedia (AREA)
  • Health & Medical Sciences (AREA)
  • Audiology, Speech & Language Pathology (AREA)
  • General Health & Medical Sciences (AREA)
  • Computational Linguistics (AREA)
  • Acoustics & Sound (AREA)
  • User Interface Of Digital Computer (AREA)
  • Telephone Function (AREA)

Abstract

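A voice assistant display method and apparatus, relating to the field of communications technology, are used to define the display forms of a voice assistant so that the voice assistant can switch to a corresponding form as the actual scenario changes, thereby achieving system-level integration of the voice assistant with an electronic device. The method includes: opening the voice assistant, where the voice assistant is displayed in a first display form, the first display form being a preset default display form of the voice assistant; and determining the display form of the voice assistant according to instruction information input to the voice assistant and the service indicated by the instruction information.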

Description

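Voice assistant display method and apparatus
This application claims priority to Chinese Patent Application No. 201910883296.9, filed with the China National Intellectual Property Administration on September 18, 2019 and entitled "Voice assistant display method and apparatus", which is incorporated herein by reference in its entirety.
Technical Field
This application relates to the field of electronic device technologies, and in particular, to a voice assistant display method and apparatus.
Background
As voice interaction technology matures, voice assistants are used in an increasingly wide range of scenarios. A voice assistant can hold intelligent conversations with a user and answer questions in real time. It can also recognize the user's voice commands and cause the mobile phone to execute the events corresponding to those commands. Taking a mobile phone as an example, if the voice assistant receives and recognizes the voice command "Call Mr. Li" input by the user, the mobile phone can automatically call the contact Mr. Li.
In the prior art, freeform multi-window technology is usually used to control the form of the voice assistant so that the voice assistant floats at any position on the display interface, which makes it convenient for the user to operate. However, the form of the voice assistant is relatively independent of the actual scenario on the electronic device, which results in a poor user experience.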
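Summary
This application provides a voice assistant display method and apparatus that define the display forms of a voice assistant, so that the voice assistant can switch to a corresponding form as the actual scenario changes, thereby achieving system-level integration of the voice assistant with an electronic device and improving the user experience.
To achieve the foregoing objective, this application adopts the following technical solutions:
According to a first aspect, this application provides a voice assistant display method applied to an electronic device. The display forms of the voice assistant include a half-screen state, a full-screen state, and a floating state. The half-screen state means that the ratio of the display interface of the voice assistant to the overall display interface of the electronic device is less than 1; the full-screen state means that the ratio of the display interface of the voice assistant to the overall display interface of the electronic device is 1; and the floating state means that the voice assistant is displayed floating on the current display interface of the electronic device. The method includes: opening the voice assistant, where the voice assistant is displayed in a first display form, the first display form being a preset default display form of the voice assistant; and determining the display form of the voice assistant according to instruction information input to the voice assistant and the service indicated by the instruction information.
Through the foregoing process, this application provides a voice assistant display method. After the voice assistant is opened, it is displayed in the preset default display form. The display form of the voice assistant is then determined according to the instruction information input to the voice assistant and the service indicated by the instruction information, so that the voice assistant can determine changes in the actual scenario from the instruction information and switch to the corresponding form for that scenario. The voice assistant and the system thus work as one, achieving system-level integration of the voice assistant with the mobile phone.
In a possible implementation, the first display form is the half-screen state, and opening the voice assistant, where the voice assistant is displayed in the first display form, specifically includes: after the voice assistant is opened, the current task interface moves down as a whole, and the voice assistant is displayed in the half-screen state in split screen with the current task interface.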
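In a possible implementation, determining the display form of the voice assistant according to the instruction information input to the voice assistant and the service indicated by the instruction information specifically includes: if the instruction information lacks keywords, the display form of the voice assistant is the full-screen state. If the instruction information does not lack keywords and the feedback form of the service indicated by the instruction information is text feedback or voice feedback, the display form of the voice assistant is the half-screen state. If the instruction information does not lack keywords and the feedback form of the service indicated by the instruction information is card feedback, the display form of the voice assistant is the full-screen state. If the instruction information does not lack keywords and the feedback form of the service indicated by the instruction information is split-screen feedback, the display form of the voice assistant is the floating state. If the application involved in the service indicated by the instruction information is displayed as a card in the display interface of the voice assistant, the feedback form of that service is card feedback. If the service indicated by the instruction information involves switching application interfaces, the feedback form of that service is split-screen feedback.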
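The four rules in the preceding paragraph amount to a small decision function. The following is a minimal sketch under stated assumptions: the enum names, the function name `decide_display_form`, and the boolean `missing_keywords` input are illustrative and do not come from the application, which does not prescribe any particular code structure.

```python
from enum import Enum


class DisplayForm(Enum):
    HALF_SCREEN = "H"
    FULL_SCREEN = "L"
    FLOATING = "F"


class FeedbackForm(Enum):
    TEXT = "text"
    VOICE = "voice"
    CARD = "card"                  # service's application shown as a card
    SPLIT_SCREEN = "split_screen"  # service switches application interfaces


def decide_display_form(missing_keywords: bool, feedback: FeedbackForm) -> DisplayForm:
    """Map one piece of instruction information to a display form (hypothetical helper)."""
    if missing_keywords:
        # Keywords are missing: the assistant is shown full screen.
        return DisplayForm.FULL_SCREEN
    if feedback in (FeedbackForm.TEXT, FeedbackForm.VOICE):
        return DisplayForm.HALF_SCREEN
    if feedback is FeedbackForm.CARD:
        return DisplayForm.FULL_SCREEN
    if feedback is FeedbackForm.SPLIT_SCREEN:
        return DisplayForm.FLOATING
    raise ValueError(f"unsupported feedback form: {feedback!r}")
```

Checking `missing_keywords` first mirrors the order of the rules in the text: the feedback form only matters once the instruction information is complete.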
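In a possible implementation, the half-screen state of the voice assistant further includes a small half-screen state and a large half-screen state. The small half-screen state means that the ratio of the display interface of the voice assistant to the overall display interface of the electronic device is less than 0.5; the large half-screen state means that the ratio of the display interface of the voice assistant to the overall display interface of the electronic device is greater than 0.5; and the first display form is the small half-screen state.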
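The ratio thresholds above can be illustrated with a short classifier. This is only a sketch: the function name `classify_by_ratio` and the returned codes are assumptions. The text does not say which state applies at a ratio of exactly 0.5, so the sketch raises an error there rather than guessing, and the floating state is omitted because it is not defined by a ratio.

```python
def classify_by_ratio(ratio: float) -> str:
    """Classify the assistant's share of the screen (hypothetical helper).

    Returns "H1" (small half-screen), "H2" (large half-screen), or "L" (full-screen).
    """
    if not 0 < ratio <= 1:
        raise ValueError("ratio must be in the interval (0, 1]")
    if ratio == 1:
        return "L"
    if ratio < 0.5:
        return "H1"
    if ratio > 0.5:
        return "H2"
    # Exactly 0.5 is not assigned to either half-screen variant in the text.
    raise ValueError("a ratio of exactly 0.5 is not assigned to H1 or H2")
```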
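In a possible implementation, determining the display form of the voice assistant according to the instruction information input to the voice assistant and the service indicated by the instruction information includes: if the instruction information does not lack keywords and the feedback form of the service indicated by the instruction information is text feedback or voice feedback, the display form of the voice assistant is the small half-screen state. If the instruction information does not lack keywords and the feedback form of the service indicated by the instruction information is card feedback, the display form of the voice assistant is the large half-screen state.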
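When the half-screen state is split into small and large variants, the two rules above select between them. A minimal sketch follows, assuming the instruction information is already known to be complete (no missing keywords); the name `half_screen_variant` and the string feedback codes are illustrative assumptions.

```python
def half_screen_variant(feedback: str) -> str:
    """Pick "H1" (small) or "H2" (large) for complete instruction information.

    feedback is one of "text", "voice", or "card" (hypothetical codes).
    """
    if feedback in ("text", "voice"):
        return "H1"  # text or voice feedback fits the small half-screen state
    if feedback == "card":
        return "H2"  # a feedback card needs the large half-screen state
    raise ValueError(f"feedback form not covered by this rule: {feedback!r}")
```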
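In a possible implementation, after opening the voice assistant, where the voice assistant is displayed in the first display form, the method further includes: the voice assistant enters a dormant state; and the voice assistant is woken up and the display form of the voice assistant is determined.
In a possible implementation, waking up the voice assistant and determining the display form of the voice assistant includes: if the display form of the voice assistant is the floating state when it enters the dormant state, the display form of the voice assistant after it is woken up is the first display form. If the display form of the voice assistant is the half-screen state when it enters the dormant state, the display form of the voice assistant after it is woken up is the half-screen state. If the display form of the voice assistant is the full-screen state when it enters the dormant state, the display form of the voice assistant after it is woken up is the full-screen state.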
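The wake-up rule reduces to: a floating assistant returns to the default (first) display form, and any other form is preserved. A minimal sketch, in which the function name `form_after_wake` and the single-letter form codes are assumptions made for illustration:

```python
def form_after_wake(form_at_sleep: str, first_display_form: str = "H") -> str:
    """Return the display form shown after waking (hypothetical helper).

    Forms are "H" (half-screen), "L" (full-screen), and "F" (floating).
    """
    if form_at_sleep not in ("H", "L", "F"):
        raise ValueError(f"unknown display form: {form_at_sleep!r}")
    if form_at_sleep == "F":
        # A floating assistant wakes in the preset default form.
        return first_display_form
    # Half-screen and full-screen are kept as they were.
    return form_at_sleep
```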
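In a possible implementation, the half-screen state of the voice assistant further includes a small half-screen state and a large half-screen state. The small half-screen state means that the ratio of the display interface of the voice assistant to the overall display interface of the electronic device is less than 0.5; the large half-screen state means that the ratio is greater than 0.5; and the first display form is the small half-screen state. Waking up the voice assistant and determining the display form of the voice assistant includes: if the display form of the voice assistant is the small half-screen state when it enters the dormant state, the display form of the voice assistant after it is woken up is the small half-screen state. If the display form of the voice assistant is the large half-screen state when it enters the dormant state, the display form of the voice assistant after it is woken up is the large half-screen state.
In a possible implementation, after waking up the voice assistant and determining the display form of the voice assistant, the method further includes: determining a new display form of the voice assistant according to new instruction information and the service indicated by the new instruction information.
In a possible implementation, determining the new display form of the voice assistant according to the new instruction information and the service corresponding to the new instruction information specifically includes: if the display form of the voice assistant is the full-screen state and the feedback form of the service indicated by the new instruction information is text feedback, voice feedback, or card feedback, the new display form of the voice assistant is the full-screen state. If the display form of the voice assistant is the full-screen state and the feedback form of the service indicated by the new instruction information is split-screen feedback, the new display form of the voice assistant is the floating state. If the display form of the voice assistant is the half-screen state and the new instruction information lacks keywords, the new display form of the voice assistant is the full-screen state. If the display form of the voice assistant is the half-screen state, the new instruction information does not lack keywords, and the feedback form of the service corresponding to the new instruction information is text feedback or voice feedback, the new display form of the voice assistant is the half-screen state. If the display form of the voice assistant is the half-screen state, the new instruction information does not lack keywords, and the feedback form of the service corresponding to the new instruction information is card feedback, the new display form of the voice assistant is the full-screen state. If the display form of the voice assistant is the half-screen state, the new instruction information does not lack keywords, and the feedback form of the service corresponding to the new instruction information is split-screen feedback, the new display form of the voice assistant is the floating state. If the application involved in the service indicated by the new instruction information is displayed as a card in the display interface of the voice assistant, the feedback form of that service is card feedback. If the service indicated by the new instruction information involves switching application interfaces, the feedback form of that service is split-screen feedback.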
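The rules for new instruction information depend on the current display form as well as on the instruction. The sketch below encodes them as a function of the current form; the name `next_display_form` and the string codes are illustrative assumptions. One case is not spelled out in the paragraph above, namely missing keywords while already in the full-screen state; the sketch assumes the assistant stays full screen there, and this is labeled as an assumption in the code.

```python
from typing import Optional

FEEDBACK_FORMS = ("text", "voice", "card", "split_screen")


def next_display_form(current: str, missing_keywords: bool,
                      feedback: Optional[str] = None) -> str:
    """Return the new display form ("H", "L", or "F") for new instruction information."""
    if current not in ("H", "L"):
        # The paragraph only covers the half-screen and full-screen states.
        raise ValueError(f"current form not covered by these rules: {current!r}")
    if missing_keywords:
        # From H this is stated explicitly; from L it is an assumption
        # (the assistant is taken to remain full screen).
        return "L"
    if feedback not in FEEDBACK_FORMS:
        raise ValueError(f"unsupported feedback form: {feedback!r}")
    if feedback == "split_screen":
        return "F"
    if current == "L":
        return "L"  # text, voice, and card feedback all keep full screen
    # current == "H": text or voice keeps half-screen, card goes full screen
    return "H" if feedback in ("text", "voice") else "L"
```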
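In a possible implementation, the half-screen state of the voice assistant further includes a small half-screen state and a large half-screen state. The small half-screen state means that the ratio of the display interface of the voice assistant to the overall display interface of the electronic device is less than 0.5; the large half-screen state means that the ratio is greater than 0.5; and the first display form is the small half-screen state. Determining the new display form of the voice assistant according to the new instruction information and the service corresponding to the new instruction information includes: if the display form of the voice assistant is the small half-screen state, the new instruction information does not lack keywords, and the feedback form of the service indicated by the new instruction information is text feedback or voice feedback, the display form of the voice assistant is the small half-screen state; if the display form of the voice assistant is the small half-screen state, the new instruction information does not lack keywords, and the feedback form of the service indicated by the new instruction information is card feedback, the display form of the voice assistant is the large half-screen state.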
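According to a second aspect, an electronic device is provided, including a processor, a memory, and a touchscreen. The memory and the touchscreen are coupled to the processor. The memory is configured to store computer program code, and the computer program code includes computer instructions. When the processor reads the computer instructions from the memory, the electronic device is caused to perform the following operations: opening the voice assistant, where the voice assistant is displayed in a first display form, the first display form being a preset default display form of the voice assistant; and determining the display form of the voice assistant according to instruction information input to the voice assistant and the service indicated by the instruction information. The display forms of the voice assistant include a half-screen state, a full-screen state, and a floating state. The half-screen state means that the ratio of the display interface of the voice assistant to the overall display interface of the electronic device is less than 1; the full-screen state means that the ratio is 1; and the floating state means that the voice assistant is displayed floating on the current display interface of the electronic device.
In a possible implementation, the first display form is the half-screen state, and when the processor reads the computer instructions from the memory, the electronic device is further caused to perform the following operations: after the voice assistant is opened, the current task interface moves down as a whole, and the voice assistant is displayed in the half-screen state in split screen with the current task interface.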
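In a possible implementation, when the processor reads the computer instructions from the memory, the electronic device is further caused to perform the following operations: if the instruction information lacks keywords, the display form of the voice assistant is the full-screen state. If the instruction information does not lack keywords and the feedback form of the service indicated by the instruction information is text feedback or voice feedback, the display form of the voice assistant is the half-screen state. If the instruction information does not lack keywords and the feedback form of the service indicated by the instruction information is card feedback, the display form of the voice assistant is the full-screen state. If the instruction information does not lack keywords and the feedback form of the service indicated by the instruction information is split-screen feedback, the display form of the voice assistant is the floating state. If the application involved in the service indicated by the instruction information is displayed as a card in the display interface of the voice assistant, the feedback form of that service is card feedback. If the service indicated by the instruction information involves switching application interfaces, the feedback form of that service is split-screen feedback.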
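In a possible implementation, the half-screen state of the voice assistant further includes a small half-screen state and a large half-screen state. The small half-screen state means that the ratio of the display interface of the voice assistant to the overall display interface of the electronic device is less than 0.5; the large half-screen state means that the ratio is greater than 0.5; and the first display form is the small half-screen state.
In a possible implementation, when the processor reads the computer instructions from the memory, the electronic device is further caused to perform the following operations: if the instruction information does not lack keywords and the feedback form of the service indicated by the instruction information is text feedback or voice feedback, the display form of the voice assistant is the small half-screen state. If the instruction information does not lack keywords and the feedback form of the service indicated by the instruction information is card feedback, the display form of the voice assistant is the large half-screen state.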
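In a possible implementation, when the processor reads the computer instructions from the memory, the electronic device is further caused to perform the following operations: the voice assistant enters a dormant state; and the voice assistant is woken up and the display form of the voice assistant is determined.
In a possible implementation, when the processor reads the computer instructions from the memory, the electronic device is further caused to perform the following operations: if the display form of the voice assistant is the floating state when it enters the dormant state, the display form of the voice assistant after it is woken up is the first display form. If the display form of the voice assistant is the half-screen state when it enters the dormant state, the display form of the voice assistant after it is woken up is the half-screen state. If the display form of the voice assistant is the full-screen state when it enters the dormant state, the display form of the voice assistant after it is woken up is the full-screen state.
In a possible implementation, the half-screen state of the voice assistant further includes a small half-screen state and a large half-screen state. The small half-screen state means that the ratio of the display interface of the voice assistant to the overall display interface of the electronic device is less than 0.5; the large half-screen state means that the ratio is greater than 0.5; and the first display form is the small half-screen state. When the processor reads the computer instructions from the memory, the electronic device is further caused to perform the following operations: if the display form of the voice assistant is the small half-screen state when it enters the dormant state, the display form of the voice assistant after it is woken up is the small half-screen state. If the display form of the voice assistant is the large half-screen state when it enters the dormant state, the display form of the voice assistant after it is woken up is the large half-screen state.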
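In a possible implementation, when the processor reads the computer instructions from the memory, the electronic device is further caused to perform the following operation: determining a new display form of the voice assistant according to new instruction information and the service indicated by the new instruction information.
In a possible implementation, when the processor reads the computer instructions from the memory, the electronic device is further caused to perform the following operations: if the display form of the voice assistant is the full-screen state and the feedback form of the service indicated by the new instruction information is text feedback, voice feedback, or card feedback, the new display form of the voice assistant is the full-screen state. If the display form of the voice assistant is the full-screen state and the feedback form of the service indicated by the new instruction information is split-screen feedback, the new display form of the voice assistant is the floating state. If the display form of the voice assistant is the half-screen state and the new instruction information lacks keywords, the new display form of the voice assistant is the full-screen state. If the display form of the voice assistant is the half-screen state, the new instruction information does not lack keywords, and the feedback form of the service corresponding to the new instruction information is text feedback or voice feedback, the new display form of the voice assistant is the half-screen state. If the display form of the voice assistant is the half-screen state, the new instruction information does not lack keywords, and the feedback form of the service corresponding to the new instruction information is card feedback, the new display form of the voice assistant is the full-screen state. If the display form of the voice assistant is the half-screen state, the new instruction information does not lack keywords, and the feedback form of the service corresponding to the new instruction information is split-screen feedback, the new display form of the voice assistant is the floating state. If the application involved in the service indicated by the new instruction information is displayed as a card in the display interface of the voice assistant, the feedback form of that service is card feedback. If the service indicated by the new instruction information involves switching application interfaces, the feedback form of that service is split-screen feedback.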
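In a possible implementation, the half-screen state of the voice assistant further includes a small half-screen state and a large half-screen state. The small half-screen state means that the ratio of the display interface of the voice assistant to the overall display interface of the electronic device is less than 0.5; the large half-screen state means that the ratio is greater than 0.5; and the first display form is the small half-screen state. When the processor reads the computer instructions from the memory, the electronic device is further caused to perform the following operations: if the display form of the voice assistant is the small half-screen state, the new instruction information does not lack keywords, and the feedback form of the service indicated by the new instruction information is text feedback or voice feedback, the display form of the voice assistant is the small half-screen state; if the display form of the voice assistant is the small half-screen state, the new instruction information does not lack keywords, and the feedback form of the service indicated by the new instruction information is card feedback, the display form of the voice assistant is the large half-screen state.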
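According to a third aspect, a graphical user interface on an electronic device is provided. The electronic device has a display screen, a camera, a memory, and one or more processors configured to execute one or more computer programs stored in the memory. The graphical user interface includes the graphical user interface displayed when the electronic device performs the method according to the foregoing aspects and any possible implementation thereof.
According to a fourth aspect, an apparatus is provided. The apparatus is included in an electronic device and has the function of implementing the behavior of the electronic device in any method of the foregoing aspects and possible implementations. The function may be implemented by hardware, or by hardware executing corresponding software. The hardware or software includes at least one module or unit corresponding to the foregoing function, for example, a receiving module or unit, a display module or unit, and a sending module or unit.
According to a fifth aspect, a computer storage medium is provided, including computer instructions. When the computer instructions are run on an electronic device, the electronic device is caused to perform the voice assistant display method according to the foregoing aspects and any possible implementation thereof.
According to a sixth aspect, a computer program product is provided. When the computer program product is run on a computer, the computer is caused to perform the voice assistant display method according to the foregoing aspects and any possible implementation thereof.
According to a seventh aspect, a chip system is provided, including a processor. When the processor executes instructions, the processor performs the voice assistant display method according to the foregoing aspects and any possible implementation thereof.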
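Brief Description of the Drawings
Figure 1 is a schematic structural diagram of an electronic device according to an embodiment of this application;
Figure 2 is a block diagram of the software structure of an electronic device according to an embodiment of this application;
Figure 3 is a schematic diagram of multiple forms of a voice assistant according to an embodiment of this application;
Figure 4 is a flowchart of a voice assistant display method according to an embodiment of this application;
Figure 5 shows display manner 1 of the half-screen state (H) of a voice assistant according to an embodiment of this application;
Figure 6 shows display manner 2 of the half-screen state (H) of a voice assistant according to an embodiment of this application;
Figure 7 shows display manner 3 of the half-screen state (H) of a voice assistant according to an embodiment of this application;
Figure 8 shows display manner 4 of the half-screen state (L) of a voice assistant according to an embodiment of this application;
Figure 9 is a schematic diagram of the form switching rules of a voice assistant according to an embodiment of this application;
Figure 10 is schematic diagram 1 of form switching of a voice assistant according to an embodiment of this application;
Figure 11 is schematic diagram 2 of form switching of a voice assistant according to an embodiment of this application;
Figure 12 is schematic diagram 3 of form switching of a voice assistant according to an embodiment of this application;
Figure 13 is schematic diagram 4 of form switching of a voice assistant according to an embodiment of this application;
Figure 14 is a schematic structural diagram of a chip system according to an embodiment of this application.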
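Detailed Description
The implementations of this application are described in detail below with reference to the accompanying drawings.
The embodiments of this application provide a voice assistant display method and apparatus that can be applied to the display of a voice assistant on an electronic device. The voice assistant may be an application (APP) installed in the electronic device. It may be an embedded application in the electronic device (that is, a system application of the electronic device) or a downloadable application. An embedded application is an application provided as part of the implementation of the electronic device (such as a mobile phone). For example, the embedded application may be the "Settings" application, the "Messages" application, or the "Camera" application. A downloadable application is an application that can provide its own Internet Protocol multimedia subsystem (IMS) connection; it may be an application pre-installed in the electronic device or a third-party application that the user downloads and installs on the electronic device. For example, the downloadable application may be the "WeChat" application, the "Alipay" application, or the "Mail" application.
The electronic device in the embodiments of this application may be a portable computer (such as a mobile phone), a notebook computer, a personal computer (PC), a tablet computer, a wearable electronic device (such as a smart watch), a smart home device, an artificial intelligence (AI) terminal (such as an intelligent robot), an augmented reality (AR) or virtual reality (VR) device, an in-vehicle computer, or the like. The following embodiments do not specifically limit the form of the device.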
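Refer to Figure 1, which shows a schematic structural diagram of an electronic device 100 according to this embodiment. The electronic device 100 may include a processor 110, an external memory interface 120, an internal memory 121, a universal serial bus (USB) interface 130, a charging management module 140, a power management module 141, a battery 142, an antenna 1, an antenna 2, a mobile communication module 150, a wireless communication module 160, an audio module 170, a speaker 170A, a receiver 170B, a microphone 170C, a headset jack 170D, a sensor module 180, a button 190, a motor 191, an indicator 192, a camera 193, a display screen 194, a subscriber identification module (SIM) card interface 195, and the like. The sensor module 180 may include a pressure sensor 180A, a gyroscope sensor 180B, a barometric pressure sensor 180C, a magnetic sensor 180D, an acceleration sensor 180E, a distance sensor 180F, a proximity light sensor 180G, a fingerprint sensor 180H, a temperature sensor 180J, a touch sensor 180K, an ambient light sensor 180L, a bone conduction sensor 180M, and the like.
It can be understood that the structure illustrated in this embodiment does not constitute a specific limitation on the electronic device 100. In other embodiments, the electronic device 100 may include more or fewer components than shown, combine some components, split some components, or arrange the components differently. The illustrated components may be implemented in hardware, software, or a combination of software and hardware.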
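The processor 110 may include one or more processing units. For example, the processor 110 may include an application processor (AP), a modem processor, a graphics processing unit (GPU), an image signal processor (ISP), a controller, a memory, a video codec, a digital signal processor (DSP), a baseband processor, and/or a neural-network processing unit (NPU). The different processing units may be independent devices or may be integrated into one or more processors.
The DSP can monitor voice data in real time. When the similarity between the voice data monitored by the DSP and the wake-up word registered in the electronic device satisfies a preset condition, the DSP can hand the voice data over to the AP. The AP performs text verification and voiceprint verification on the voice data. When the AP determines that the voice data matches the wake-up word registered by the user, the electronic device can open the voice assistant. In the embodiments of this application, after the voice assistant is woken up, it can be displayed on the interface of the electronic device in the small half-screen (H1) form. The small half-screen (H1) form of the voice assistant is shown in (b) of Figure 3 and is described in detail below.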
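The two-stage wake-up flow described above (a cheap similarity check on the DSP, followed by text and voiceprint verification on the AP) can be sketched as follows. Everything concrete here is an assumption made for illustration: the function name `should_open_assistant`, the callable parameters standing in for the DSP and AP checks, and the numeric threshold used as the "preset condition", which the text does not specify.

```python
from typing import Callable


def should_open_assistant(
    audio: bytes,
    dsp_similarity: Callable[[bytes], float],
    ap_text_check: Callable[[bytes], bool],
    ap_voiceprint_check: Callable[[bytes], bool],
    similarity_threshold: float = 0.8,  # assumed stand-in for the preset condition
) -> bool:
    """Return True if the voice data should open the voice assistant."""
    # Stage 1 (DSP): continuous, low-cost screening against the wake-up word.
    if dsp_similarity(audio) < similarity_threshold:
        return False  # the AP is never involved for non-matching audio
    # Stage 2 (AP): text verification and voiceprint verification.
    return ap_text_check(audio) and ap_voiceprint_check(audio)
```

Keeping the first stage separate means the more expensive verification only runs on audio that already resembles the wake-up word, which is the division of labor the paragraph describes between the DSP and the AP.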
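The controller may be the nerve center and command center of the electronic device 100. The controller can generate operation control signals according to instruction operation codes and timing signals to control instruction fetching and instruction execution.
A memory may also be provided in the processor 110 to store instructions and data. In some embodiments, the memory in the processor 110 is a cache. The memory can hold instructions or data that the processor 110 has just used or uses cyclically. If the processor 110 needs to use the instructions or data again, it can call them directly from the memory. This avoids repeated access and reduces the waiting time of the processor 110, thereby improving system efficiency.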
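In some embodiments, the processor 110 may include one or more interfaces. The interfaces may include an inter-integrated circuit (I2C) interface, an inter-integrated circuit sound (I2S) interface, a pulse code modulation (PCM) interface, a universal asynchronous receiver/transmitter (UART) interface, a mobile industry processor interface (MIPI), a general-purpose input/output (GPIO) interface, a subscriber identity module (SIM) interface, and/or a universal serial bus (USB) interface.
The I2C interface is a bidirectional synchronous serial bus that includes a serial data line (SDA) and a serial clock line (SCL). In some embodiments, the processor 110 may include multiple sets of I2C buses. The processor 110 may be coupled to the touch sensor 180K, a charger, a flash, the camera 193, and the like through different I2C bus interfaces. For example, the processor 110 may be coupled to the touch sensor 180K through an I2C interface so that the processor 110 and the touch sensor 180K communicate through the I2C bus interface to implement the touch function of the electronic device 100.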
I2S接口可以用于音频通信。在一些实施例中,处理器110可以包含多组I2S总线。处理器110可以通过I2S总线与音频模块170耦合,实现处理器110与音频模块170之间的通信。在一些实施例中,音频模块170可以通过I2S接口向无线通信模块160传递音频信号,实现通过蓝牙耳机接听电话的功能。
PCM接口也可以用于音频通信,将模拟信号抽样,量化和编码。在一些实施例中,音频模块170与无线通信模块160可以通过PCM总线接口耦合。在一些实施例中,音频模块170也可以通过PCM接口向无线通信模块160传递音频信号,实现通过蓝牙耳机接听电话的功能。所述I2S接口和所述PCM接口都可以用于音频通信。
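The UART interface is a universal serial data bus used for asynchronous communication. The bus may be a bidirectional communication bus. It converts the data to be transmitted between serial and parallel forms. In some embodiments, the UART interface is typically used to connect the processor 110 and the wireless communication module 160. For example, the processor 110 communicates with the Bluetooth module in the wireless communication module 160 through the UART interface to implement the Bluetooth function. In some embodiments, the audio module 170 may pass audio signals to the wireless communication module 160 through the UART interface, so that music can be played through a Bluetooth headset.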
MIPI接口可以被用于连接处理器110与显示屏194,摄像头193等外围器件。MIPI接口包括摄像头串行接口(camera serial interface,CSI),显示屏串行接口(display serial interface,DSI)等。在一些实施例中,处理器110和摄像头193通过CSI接口通信,实现电子设备100的拍摄功能。处理器110和显示屏194通过DSI接口通信,实现电子设备100的显示功能。
GPIO接口可以通过软件配置。GPIO接口可以被配置为控制信号,也可被配置为数据信号。在一些实施例中,GPIO接口可以用于连接处理器110与摄像头193,显示屏194,无线通信模块160,音频模块170,传感器模块180等。GPIO接口还可以被配置为I2C接口,I2S接口,UART接口,MIPI接口等。
USB接口130是符合USB标准规范的接口,具体可以是Mini USB接口,Micro USB接口,USB Type C接口等。USB接口130可以用于连接充电器为电子设备100充电,也可以用于电子设备100与外围设备之间传输数据。也可以用于连接耳机,通过耳机播放音频。该接口还可以用于连接其他电子设备,例如AR设备等。
可以理解的是,本实施例示意的各模块间的接口连接关系,只是示意性说明,并不构成对电子设备100的结构限定。在另一些实施例中,电子设备100也可以采用上述实施例中不同的接口连接方式,或多种接口连接方式的组合。
充电管理模块140用于从充电器接收充电输入。其中,充电器可以是无线充电器,也可以是有线充电器。在一些有线充电的实施例中,充电管理模块140可以通过USB接口130接收有线充电器的充电输入。在一些无线充电的实施例中,充电管理模块140可以通过电子设备100的无线充电线圈接收无线充电输入。充电管理模块140为电池142充电的同时,还可以通过电源管理模块141为电子设备供电。
电源管理模块141用于连接电池142,充电管理模块140与处理器110。电源管理模块141接收电池142和/或充电管理模块140的输入,为处理器110,内部存储器121,外部存储器,显示屏194,摄像头193,和无线通信模块160等供电。电源管理模块141还可以用于监测电池容量,电池循环次数,电池健康状态(漏电,阻抗)等参数。在其他一些实施例中,电源管理模块141也可以设置于处理器110中。在另一些实施例中,电源管理模块141和充电管理模块140也可以设置于同一个器件中。
移动通信模块150可以提供应用在电子设备100上的包括2G/3G/4G/5G等无线通信的解决方案。移动通信模块150可以包括至少一个滤波器,开关,功率放大器,低噪声放大器(low noise amplifier,LNA)等。移动通信模块150可以由天线1接收电磁波,并对接收的电磁波进行滤波,放大等处理,传送至调制解调处理器进行解调。移动通信模块150还可以对经调制解调处理器调制后的信号放大,经天线1转为电磁波辐射出去。在一些实施例中,移动通信模块150的至少部分功能模块可以被设置于处理器110中。在一些实施例中,移动通信模块150的至少部分功能模块可以与处理器110的至少部分模块被设置在同一个器件中。
调制解调处理器可以包括调制器和解调器。其中,调制器用于将待发送的低频基带信号调制成中高频信号。解调器用于将接收的电磁波信号解调为低频基带信号。随后解调器将解调得到的低频基带信号传送至基带处理器处理。低频基带信号经基带处理器处理后,被传递给应用处理器。应用处理器通过音频设备(不限于扬声器170A,受话器170B等)输出声音信号,或通过显示屏194显示图像或视频。在一些实施例中,调制解调处理器可以是独立的器件。在另一些实施例中,调制解调处理器可以独立于处理器110,与移动通信模块150或其他功能模块设置在同一个器件中。
无线通信模块160可以提供应用在电子设备100上的包括无线局域网(wireless local area networks,WLAN)(如无线保真(wireless fidelity,Wi-Fi)网络),蓝牙(bluetooth,BT),全球导航卫星系统(global navigation satellite system,GNSS),调频(frequency modulation,FM),近距离无线通信技术(near field communication,NFC),红外技术(infrared,IR)等无线通信的解决方案。无线通信模块160可以是集成至少一个通信处理模块的一个或多个器件。无线通信模块160经由天线2接收电磁波,将电磁波信号调频以及滤波处理,将处理后的信号发送到处理器110。无线通信模块160还可以从处理器110接收待发送的信号,对其进行调频,放大,经天线2转为电磁波辐射出去。
In some embodiments, antenna 1 of the electronic device 100 is coupled to the mobile communication module 150 and antenna 2 is coupled to the wireless communication module 160, so that the electronic device 100 can communicate with networks and other devices through wireless communication technologies. The wireless communication technologies may include global system for mobile communications (GSM), general packet radio service (GPRS), code division multiple access (CDMA), wideband code division multiple access (WCDMA), time-division synchronous code division multiple access (TD-SCDMA), long term evolution (LTE), BT, GNSS, WLAN, NFC, FM, and/or IR technologies. The GNSS may include the global positioning system (GPS), the global navigation satellite system (GLONASS), the BeiDou navigation satellite system (BDS), the quasi-zenith satellite system (QZSS), and/or satellite-based augmentation systems (SBAS).
电子设备100通过GPU,显示屏194,以及应用处理器等实现显示功能。GPU为图像处理的微处理器,连接显示屏194和应用处理器。GPU用于执行数学和几何计算,用于图形渲染。处理器110可包括一个或多个GPU,其执行程序指令以生成或改变显示信息。
显示屏194用于显示图像,视频等。显示屏194包括显示面板。显示面板可以采用液晶显示屏(liquid crystal display,LCD),有机发光二极管(organic light-emitting diode,OLED),有源矩阵有机发光二极体或主动矩阵有机发光二极体(active-matrix organic light emitting diode,AMOLED),柔性发光二极管(flex light-emitting diode,FLED),Miniled,MicroLed,Micro-oLed,量子点发光二极管(quantum dot light emitting diodes,QLED)等。在一些实施例中,电子设备100可以包括1个或N个显示屏194,N为大于1的正整数。
电子设备100可以通过ISP,摄像头193,视频编解码器,GPU,显示屏194以及应用处理器等实现拍摄功能。
ISP用于处理摄像头193反馈的数据。例如,拍照时,打开快门,光线通过镜头被传递到摄像头感光元件上,光信号转换为电信号,摄像头感光元件将所述电信号传递给ISP处理,转化为肉眼可见的图像。ISP还可以对图像的噪点,亮度,肤色进行算法优化。ISP还可以对拍摄场景的曝光,色温等参数优化。在一些实施例中,ISP可以设置在摄像头193中。
摄像头193用于捕获静态图像或视频。物体通过镜头生成光学图像投射到感光元件。感光元件可以是电荷耦合器件(charge coupled device,CCD)或互补金属氧化物半导体(complementary metal-oxide-semiconductor,CMOS)光电晶体管。感光元件把光信号转换成电信号,之后将电信号传递给ISP转换成数字图像信号。ISP将数字图像信号输出到DSP加工处理。DSP将数字图像信号转换成标准的RGB,YUV等格式的图像信号。在一些实施例中,电子设备100可以包括1个或N个摄像头193,N为大于1的正整数。
数字信号处理器用于处理数字信号,除了可以处理数字图像信号,还可以处理其他数字信号。例如,当电子设备100在频点选择时,数字信号处理器用于对频点能量进行傅里叶变换等。
视频编解码器用于对数字视频压缩或解压缩。电子设备100可以支持一种或多种视频编解码器。这样,电子设备100可以播放或录制多种编码格式的视频,例如:动态图像专家组(moving picture experts group,MPEG)1,MPEG2,MPEG3,MPEG4等。
NPU为神经网络(neural-network,NN)计算处理器,通过借鉴生物神经网络结构,例如借鉴人脑神经元之间传递模式,对输入信息快速处理,还可以不断的自学习。通过NPU可以实现电子设备100的智能认知等应用,例如:图像识别,人脸识别,语音识别,文本理解等。
外部存储器接口120可以用于连接外部存储卡,例如Micro SD卡,实现扩展电子设备100的存储能力。外部存储卡通过外部存储器接口120与处理器110通信,实现数据存储功能。例如将音乐,视频等文件保存在外部存储卡中。
内部存储器121可以用于存储计算机可执行程序代码,所述可执行程序代码包括指令。处理器110通过运行存储在内部存储器121的指令,从而执行电子设备100的各种功能应用以及数据处理。内部存储器121可以包括存储程序区和存储数据区。其中,存储程序区可存储操作系统,至少一个功能所需的应用程序(比如声音播放功能,图像播放功能等)等。存储数据区可存储电子设备100使用过程中所创建的数据(比如音频数据,电话本等)等。此外,内部存储器121可以包括高速随机存取存储器,还可以包括非易失性存储器,例如至少一个磁盘存储器件,闪存器件,通用闪存存储器(universal flash storage,UFS)等。
电子设备100可以通过音频模块170,扬声器170A,受话器170B,麦克风170C,耳机接口170D,以及应用处理器等实现音频功能。例如音乐播放,录音等。
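The audio module 170 is used to convert digital audio information into an analog audio signal for output, and also to convert analog audio input into a digital audio signal. The audio module 170 may also be used to encode and decode audio signals. In some embodiments, the audio module 170 may be provided in the processor 110, or some functional modules of the audio module 170 may be provided in the processor 110.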
扬声器170A,也称“喇叭”,用于将音频电信号转换为声音信号。电子设备100可以通过扬声器170A收听音乐,或收听免提通话。
受话器170B,也称“听筒”,用于将音频电信号转换成声音信号。当电子设备100接听电话或语音信息时,可以通过将受话器170B靠近人耳接听语音。
麦克风170C,也称“话筒”或者“传声器”,用于将声音信号转换为电信号。当拨打电话或发送语音信息或需要通过语音助手触发电子设备100执行某些功能时,用户可以通过人嘴靠近麦克风170C发声,将声音信号输入到麦克风170C。电子设备100可以设置至少一个麦克风170C。在另一些实施例中,电子设备100可以设置两个麦克风170C,除了采集声音信号,还可以实现降噪功能。在另一些实施例中,电子设备100还可以设置三个,四个或更多麦克风170C,实现采集声音信号,降噪,还可以识别声音来源,实现定向录音功能等。
耳机接口170D用于连接有线耳机。耳机接口170D可以是USB接口130,也可以是3.5mm的开放移动电子设备平台(open mobile terminal platform,OMTP)标准接口,美国蜂窝电信工业协会(cellular telecommunications industry association of the USA,CTIA)标准接口。
压力传感器180A用于感受压力信号,可以将压力信号转换成电信号。在一些实施例中,压力传感器180A可以设置于显示屏194。压力传感器180A的种类很多,如电阻式压力传感器,电感式压力传感器,电容式压力传感器等。电容式压力传感器可以是包括至少两个具有导电材料的平行板。当有力作用于压力传感器180A,电极之间的电容改变。电子设备100根据电容的变化确定压力的强度。当有触摸操作作用于显示屏194,电子设备100根据压力传感器180A检测所述触摸操作强度。电子设备100也可以根据压力传感器180A的检测信号计算触摸的位置。在一些实施例中,作用于相同触摸位置,但不同触摸操作强度的触摸操作,可以对应不同的操作指令。例如:当有触摸操作强度小于第一压力阈值的触摸操作作用于短消息应用图标时,执行查看短消息的指令。当有触摸操作强度大于或等于第一压力阈值的触摸操作作用于短消息应用图标时,执行新建短消息的指令。
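The pressure-dependent dispatch in the short-message example above can be sketched as a single threshold comparison. The normalised intensity scale, the threshold value, and the action names are assumptions introduced for illustration; the source only names a "first pressure threshold".

```python
# Hypothetical sketch: same touch position, different command by touch intensity.
# The 0.5 threshold and the action strings are illustrative assumptions.

FIRST_PRESSURE_THRESHOLD = 0.5  # assumed normalised value


def sms_icon_action(touch_intensity: float,
                    threshold: float = FIRST_PRESSURE_THRESHOLD) -> str:
    """Return the command for a touch on the short-message application icon."""
    if touch_intensity < threshold:
        return "view_sms"   # lighter touch: view the short message
    return "new_sms"        # intensity >= threshold: create a new short message
```

Note that the boundary case (intensity equal to the threshold) falls on the "new message" side, as stated in the paragraph.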
陀螺仪传感器180B可以用于确定电子设备100的运动姿态。在一些实施例中,可以通过陀螺仪传感器180B确定电子设备100围绕三个轴(即x,y和z轴)的角速度。陀螺仪传感器180B可以用于拍摄防抖。示例性的,当按下快门,陀螺仪传感器180B检测电子设备100抖动的角度,根据角度计算出镜头模组需要补偿的距离,让镜头通过反向运动抵消电子设备100的抖动,实现防抖。陀螺仪传感器180B还可以用于导航,体感游戏场景。
气压传感器180C用于测量气压。在一些实施例中,电子设备100通过气压传感器180C测得的气压值计算海拔高度,辅助定位和导航。
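The source does not say how altitude is derived from the measured pressure. One widely used approximation, shown here purely as an assumption, is the international barometric formula for the standard atmosphere; the sea-level reference pressure of 1013.25 hPa is likewise an assumed default.

```python
# Illustrative only: the formula and the reference pressure are assumptions,
# not the method stated in the source.

SEA_LEVEL_HPA = 1013.25  # assumed standard sea-level pressure


def altitude_m(pressure_hpa: float, sea_level_hpa: float = SEA_LEVEL_HPA) -> float:
    """Approximate altitude in metres from air pressure in hPa."""
    if pressure_hpa <= 0 or sea_level_hpa <= 0:
        raise ValueError("pressures must be positive")
    return 44330.0 * (1.0 - (pressure_hpa / sea_level_hpa) ** (1.0 / 5.255))
```

In practice the reference pressure varies with the weather, so a device would typically calibrate it rather than rely on the fixed default.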
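The magnetic sensor 180D includes a Hall sensor. The electronic device 100 may use the magnetic sensor 180D to detect the opening and closing of a flip leather case. In some embodiments, when the electronic device 100 is a flip phone, the electronic device 100 may detect the opening and closing of the flip cover by means of the magnetic sensor 180D, and then set features such as automatic unlocking on flip-open according to the detected open or closed state of the leather case or of the flip cover.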
加速度传感器180E可检测电子设备100在各个方向上(一般为三轴)加速度的大小。当电子设备100静止时可检测出重力的大小及方向。还可以用于识别电子设备姿态,应用于横竖屏切换,计步器等应用。
距离传感器180F,用于测量距离。电子设备100可以通过红外或激光测量距离。在一些实施例中,拍摄场景,电子设备100可以利用距离传感器180F测距以实现快速对焦。
接近光传感器180G可以包括例如发光二极管(LED)和光检测器,例如光电二极管。发光二极管可以是红外发光二极管。电子设备100通过发光二极管向外发射红外光。电子设备100使用光电二极管检测来自附近物体的红外反射光。当检测到充分的反射光时,可以确定电子设备100附近有物体。当检测到不充分的反射光时,电子设备100可以确定电子设备100附近没有物体。电子设备100可以利用接近光传感器180G检测用户手持电子设备100贴近耳朵通话,以便自动熄灭屏幕达到省电的目的。接近光传感器180G也可用于皮套模式,口袋模式自动解锁与锁屏。
环境光传感器180L用于感知环境光亮度。电子设备100可以根据感知的环境光亮度自适应调节显示屏194亮度。环境光传感器180L也可用于拍照时自动调节白平衡。环境光传感器180L还可以与接近光传感器180G配合,检测电子设备100是否在口袋里,以防误触。
指纹传感器180H用于采集指纹。电子设备100可以利用采集的指纹特性实现指纹解锁,访问应用锁,指纹拍照,指纹接听来电等。
温度传感器180J用于检测温度。在一些实施例中,电子设备100利用温度传感器180J检测的温度,执行温度处理策略。例如,当温度传感器180J上报的温度超过阈值,电子设备100执行降低位于温度传感器180J附近的处理器的性能,以便降低功耗实施热保护。在另一些实施例中,当温度低于另一阈值时,电子设备100对电池142加热,以避免低温导致电子设备100异常关机。在其他一些实施例中,当温度低于又一阈值时,电子设备100对电池142的输出电压执行升压,以避免低温导致的异常关机。
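The temperature handling strategies above can be sketched as a set of threshold checks. The three threshold values and the action names are assumptions chosen for illustration; the source describes the strategies as belonging to different embodiments and gives no numbers, so combining them in one function is itself an assumption.

```python
# Hypothetical sketch of a threshold-based thermal policy; all numbers and
# action names are illustrative assumptions.

from typing import List


def thermal_actions(temp_c: float,
                    high: float = 45.0,
                    heat_below: float = 0.0,
                    boost_below: float = -10.0) -> List[str]:
    """Return the protective actions to take for a reported temperature."""
    actions = []
    if temp_c > high:
        # Reduce the performance of the processor near the sensor.
        actions.append("throttle_nearby_processor")
    if temp_c < heat_below:
        # Heat the battery to avoid an abnormal shutdown.
        actions.append("heat_battery")
    if temp_c < boost_below:
        # Boost the battery output voltage.
        actions.append("boost_battery_output_voltage")
    return actions
```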
触摸传感器180K,也称“触控面板”。触摸传感器180K可以设置于显示屏194,由触摸传感器180K与显示屏194组成触摸屏,也称“触控屏”。触摸传感器180K用于检测作用于其上或附近的触摸操作。触摸传感器可以将检测到的触摸操作传递给应用处理器,以确定触摸事件类型。可以通过显示屏194提供与触摸操作相关的视觉输出。在另一些实施例中,触摸传感器180K也可以设置于电子设备100的表面,与显示屏194所处的位置不同。
骨传导传感器180M可以获取振动信号。在一些实施例中,骨传导传感器180M可以获取人体声部振动骨块的振动信号。骨传导传感器180M也可以接触人体脉搏,接收血压跳动信号。在一些实施例中,骨传导传感器180M也可以设置于耳机中,结合成骨传导耳机。音频模块170可以基于所述骨传导传感器180M获取的声部振动骨块的振动信号,解析出语音信号,实现语音功能。应用处理器可以基于所述骨传导传感器180M获取的血压跳动信号解析心率信息,实现心率检测功能。
按键190包括开机键,音量键等。按键190可以是机械按键。也可以是触摸式按键。电子设备100可以接收按键输入,产生与电子设备100的用户设置以及功能控制有关的键信号输入。
马达191可以产生振动提示。马达191可以用于来电振动提示,也可以用于触摸振动反馈。例如,作用于不同应用(例如拍照,音频播放等)的触摸操作,可以对应不同的振动反馈效果。作用于显示屏194不同区域的触摸操作,马达191也可对应不同的振动反馈效果。不同的应用场景(例如:时间提醒,接收信息,闹钟,游戏等)也可以对应不同的振动反馈效果。触摸振动反馈效果还可以支持自定义。
指示器192可以是指示灯,可以用于指示充电状态,电量变化,也可以用于指示消息,未接来电,通知等。
SIM卡接口195用于连接SIM卡。SIM卡可以通过插入SIM卡接口195,或从SIM卡接口195拔出,实现和电子设备100的接触和分离。电子设备100可以支持1个或N个SIM卡接口,N为大于1的正整数。SIM卡接口195可以支持Nano SIM卡,Micro SIM卡,SIM卡等。同一个SIM卡接口195可以同时插入多张卡。所述多张卡的类型可以相同,也可以不同。SIM卡接口195也可以兼容不同类型的SIM卡。SIM卡接口195也可以兼容外部存储卡。电子设备100通过SIM卡和网络交互,实现通话以及数据通信等功能。在一些实施例中,电子设备100采用eSIM,即:嵌入式SIM卡。eSIM卡可以嵌在电子设备100中,不能和电子设备100分离。
电子设备100的软件系统可以采用分层架构,事件驱动架构,微核架构,微服务架构,或云架构。本实施例以分层架构的Android系统为例,示例性说明电子设备100的软件结构。
请参考图2,其是本实施例提供的一种电子设备100的软件结构框图。其中,分层架构将软件分成若干个层,每一层都有清晰的角色和分工。层与层之间通过软件接口通信。在一些实施例中,将Android系统分为四层,从上至下分别为应用程序层,应用程序框架层,安卓运行时(Android runtime)和系统库,以及内核层。
应用程序层可以包括一系列应用程序包。
如图2所示,应用程序包可以包括语音助手,邮件,图库,日历,通话,地图,导航,WLAN,蓝牙,音乐,视频,短信息等应用程序。
应用程序框架层为应用程序层的应用程序提供应用编程接口(application programming interface,API)和编程框架。应用程序框架层包括一些预先定义的函数。
如图2所示,应用程序框架层可以包括窗口管理器,内容提供器,视图系统,电话管理器,资源管理器,通知管理器等。
窗口管理器用于管理窗口程序。窗口管理器可以获取显示屏大小,判断是否有状态栏,锁定屏幕,截取屏幕等。
内容提供器用来存放和获取数据,并使这些数据可以被应用程序访问。所述数据可以包括视频,图像,音频,拨打和接听的电话,浏览历史和书签,电话簿等。
视图系统包括可视控件,例如显示文字的控件,显示图片的控件等。视图系统可用于构建应用程序。显示界面可以由一个或多个视图组成的。例如,包括短信通知图标的显示界面,可以包括显示文字的视图以及显示图片的视图。
电话管理器用于提供电子设备100的通信功能。例如通话状态的管理(包括接通,挂断等)。
资源管理器为应用程序提供各种资源,比如本地化字符串,图标,图片,布局文件,视频文件等等。
通知管理器使应用程序可以在状态栏中显示通知信息,可以用于传达告知类型的消息,可以短暂停留后自动消失,无需用户交互。比如通知管理器被用于告知下载完成,消息提醒等。通知管理器还可以是以图表或者滚动条文本形式出现在系统顶部状态栏的通知,例如后台运行的应用程序的通知,还可以是以对话窗口形式出现在屏幕上的通知。例如在状态栏提示文本信息,发出提示音,电子设备振动,指示灯闪烁等。
Android Runtime包括核心库和虚拟机。Android runtime负责安卓系统的调度和管理。
核心库包含两部分:一部分是java语言需要调用的功能函数,另一部分是安卓的核心库。
应用程序层和应用程序框架层运行在虚拟机中。虚拟机将应用程序层和应用程序框架层的java文件执行为二进制文件。虚拟机用于执行对象生命周期的管理,堆栈管理,线程管理,安全和异常的管理,以及垃圾回收等功能。
系统库可以包括多个功能模块。例如:表面管理器(surface manager),媒体库(media libraries),三维图形处理库(例如:OpenGL ES(open graphics library for embedded systems)),2D图形引擎(例如:SGL)等。
表面管理器用于对显示子系统进行管理,并且为多个应用程序提供了2D和3D图层的融合。
媒体库支持多种常用的音频,视频格式回放和录制,以及静态图像文件等。媒体库可以支持多种音视频编码格式,例如:MPEG4,H.264,MP3,AAC,AMR,JPG,PNG等。
三维图形处理库用于实现三维图形绘图,图像渲染,合成,和图层处理等。
2D图形引擎是2D绘图的绘图引擎。
内核层是硬件和软件之间的层。内核层至少包含显示驱动,摄像头驱动,音频驱动,传感器驱动。
以下实施例中所涉及的技术方案均可以在具有上述硬件架构和软件架构的电子设备100中实现。以下结合附图和应用场景对本实施例提供的语音助手显示方法进行详细介绍。需要说明的是,以下实施例均以手机中的语音助手为例。
为了实现语音助手的形态与手机上的实际场景的系统级融合,本申请实施例提供了语音助手的三种形态,分别为半屏态(H)、全屏态(L)以及悬浮态(F)。
按照语音助手界面占手机整体显示界面的比例,其显示形态可以划分为半屏态(H)和全屏态(L)。
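When the voice assistant is in the half-screen state (H), the ratio of its display interface to the overall display interface of the phone is greater than 0 and less than 1. For example, the ratio may be 1:2, as shown in (a) of FIG. 3. As another example, the ratio may be 3:8, which is less than 1:2; in this case the form of the voice assistant may also be called the small half-screen state (H1), as shown in (b) of FIG. 3. As a further example, the ratio may be 5:8, which is greater than 1:2; in this case the form may also be called the large half-screen state (H2), as shown in (c) of FIG. 3. The half-screen state (H) is suited to single-round voice interaction, that is, scenarios in which the voice assistant can complete the corresponding operation from instruction information that the user enters once. For example, in a single-round scenario, after the user enters instruction information, the voice assistant detects that no keyword is missing from it and can complete the indicated operation without further interaction with the user to obtain keywords. If the instruction information is "turn on the Bluetooth connection", the voice assistant simply turns on the Bluetooth connection.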
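When the voice assistant is in the full-screen state (L), it is displayed full screen on the overall display interface of the phone, that is, the ratio of its display interface to the overall display interface is 1:1, as shown in (d) of FIG. 3. The full-screen state (L) is suited to multi-round voice interaction, that is, scenarios in which the voice assistant cannot complete the corresponding operation from instruction information entered once and the user has to interact with it several times. In other words, after the user enters instruction information, the voice assistant cannot clearly identify the user's intent: it detects that the instruction information is missing keywords and has to keep interacting with the user to learn them. The voice assistant then automatically enters the multi-round voice interaction process and is displayed in the full-screen state (L). For example, the instruction information "buy a plane ticket" does not say for what time or to which destination the ticket is needed, so the voice assistant goes on to ask the user "When do you need the ticket for?" and "Where is your destination?".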
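The ratio-based naming of the half-screen and full-screen forms can be sketched as a small classifier. The function name and the use of exact fractions are choices made here for illustration; the floating state (F) is not defined by a ratio in the source and is therefore not covered.

```python
# Illustrative sketch: map the assistant-to-screen ratio to the form labels
# used in the text (H1, H, H2, L). Function name is hypothetical.

from fractions import Fraction


def classify_form(ratio: Fraction) -> str:
    """Classify a display ratio in (0, 1] into H1, H, H2 or L."""
    if not (0 < ratio <= 1):
        raise ValueError("ratio must be greater than 0 and at most 1")
    if ratio == 1:
        return "L"    # full-screen state
    if ratio < Fraction(1, 2):
        return "H1"   # small half-screen state, e.g. 3:8
    if ratio > Fraction(1, 2):
        return "H2"   # large half-screen state, e.g. 5:8
    return "H"        # exactly 1:2
```

Using `Fraction` avoids floating-point rounding at the 1:2 boundary.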
语音助手处于悬浮态(F)时,语音助手悬浮显示在手机显示界面上。示例性的,如图3中的(e)所示,语音助手以悬浮球的形式悬浮在手机显示界面上。悬浮态(F)的语音助手在手机整体显示界面上占用空间较小。悬浮态(F)的语音助手主要适用于语音助手与其他应用协同完成操作的场景,以及不打断用户注意力的沉浸场景。语音助手与其它应用协同完成操作的场景是指,语音助手要完成该指示信息所对应的操作,需要与第三方应用界面相配合的场景。示例性的,语音助手以半屏态(H)显示时,手机下方显示界面中显示有“照片”应用,且用户输入的指示信息为“发送微信问多多周末吃饭吗”,因此,语音助手需要关闭“照片”应用的界面,并打开“微信”应用的界面,此时,语音助手以悬浮态(F)显示。示例性的,不打断用户注意力的沉浸场景如阅读,音视频播放等场景。
实际应用中,语音助手的半屏态(H)可以作为其默认形态,但其默认形态并不局限于上述半屏态(H),用户可以根据其实际需求将语音助手的默认形态设置为全屏态(L)或者悬浮态(F)。
本申请还提供了一种语音助手的显示方法,能够按照一定的规则实现语音助手在不同形态之间切换。通过语音助手的不同形态之间的切换,可以实现语音助手的形态与手机上的实际场景的系统级融合。
以语音助手的形态包括半屏态(H)、全屏态(L)以及悬浮态(F),半屏态(H)为语音助手的默认形态为例,如图4所示,该方法包括步骤S401-S405:
S401、打开语音助手,语音助手以默认形态(半屏态)显示。
在打开语音助手之前,手机的显示界面可能是无任务界面,也可能是单任务界面,或者多任务界面。另外,语音助手被打开之后,以默认形态,即半屏态(H)显示,且此时语音助手处于收音状态。语音助手的打开方式可以参照现有技术,例如:长按手机电源(power)键,或者通过语音唤醒词来打开语音助手等。
下面根据打开语音助手之前的手机的显示界面的不同,结合图5-图8对本申请中打开语音助手后的手机显示界面进行介绍:
1、打开语音助手之前的手机的显示界面为无任务界面。
在打开语音助手之前,手机的显示界面为无任务界面(或者,描述为主页界面),该界面上显示有多个应用的图标,如图5中的(a)所示。打开语音助手后,当前无任务界面整体向下移动,露出手机显示界面的上半部分,语音助手默认以半屏态(H)在手机显示界面的上半部分显示,且语音助手处于收音状态;原无任务界面的上半部分的应用图标在手机显示界面的下半部分显示,如图5中的(b)所示。参照图5的(b)所示,语音助手处于收音状态,语音助手的提示信息(例如“嗨,我在听…”)和提示图形(例如音波图形)可用于提示用户输入指示信息,如501所示。可选地,语音助手的显示界面上还显示有如502所示的语音技能推荐项“V”、“关键词1”、“关键词2”。其中,“V”用于切换语音助手的输入形式,“关键词1”与“关键词2”不同,示例性的,“关键词1”和“关键词2”为“打开蓝牙连接”和“更改铃声”。其中,语音技能推荐项是语音助手为用户推荐的指示信息,用于调用语音助手中的服务,由语音助手根据当前的时间、手机所处位置、当前运行应用、用户的使用习惯等确定。用户可以直接点击语音技能推荐项来输入指示信息,也可以通过语音来输入指示信息,或者用户也可以点击“V”来切换语音助手的输入形式,并打开摄像头,进而通过视频来输入指示信息。语音助手以半屏态(H)显示时,用户可以对手机显示界面的下半部分进行操作。示例性的,点击手机显示界面的下半部分上的应用图标,即可打开相应的应用软件(例如“照片”应用)。以“照片”应用与语音助手分屏显示为例,此时手机的显示界面如图5中的(c)所示。参照图5的(c)所示,501所示的提示图形改变为悬浮球,表示语音助手停止收音,进入休眠状态,且语音助手的显示界面上显示有如502所示的更新后的语音技能推荐项“V”、“关键词3”、“关键词4”,“关键词3”和“关键词4”可能与“关键词1”和“关键词2”相同,也可能不同。示例性的,“关键词3”和“关键词4”为与当前所处应用界面相关的推荐项,如“上周末的照片”和“分享照片”。
需要说明的是,若手机显示界面较大,用户可手动开启手机的单手操作模式,或者,在打开语音助手后,手机也可以自动进入单手操作模式,以便于用户对手机显示界面的下半部分进行操作。
2、打开语音助手之前的手机的显示界面为单任务界面。
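Before the voice assistant is opened, the display interface of the phone is a single-task interface; for example, the task in the interface is the "Photos" application, as shown in (a) of FIG. 6. After the voice assistant is opened, the current single-task interface moves down as a whole, exposing the upper half of the phone's display interface. The voice assistant is displayed by default in the half-screen state (H) in the upper half and is in the listening state, while the part of the "Photos" application that occupied the upper half of the full-screen view is now displayed in the lower half, as shown in (b) of FIG. 6. Referring to (b) of FIG. 6, the voice assistant is in the listening state, and its prompt information (for example, "Hi, I'm listening…") and prompt graphic (for example, a sound-wave graphic) prompt the user to enter instruction information, as shown at 601. Optionally, the voice assistant's display interface also shows the voice skill recommendation items "V", "Keyword 1", and "Keyword 2", as shown at 602. "V" is used to switch the input form of the voice assistant, and "Keyword 1" differs from "Keyword 2"; for example, "Keyword 1" and "Keyword 2" are "check today's weather in Shanghai" and "share photos", respectively. For details of the voice skill recommendation items and of the ways in which the user can enter instruction information, see the description above; they are not repeated here.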
语音助手以半屏态(H)显示时,用户可以对手机显示界面的下半部分进行操作。示例性的,语音助手与“照片”应用分屏显示,此时手机的显示界面如图7中的(a)所示。参照图7的(a)所示,若用户点击“照片”应用中的某一图片,例如点击701所示的图片,则701所示的图片在手机显示界面的下半部分中全屏显示,且图中所示“标题栏”为701所示图片的名称或编号等,如图7中的(b)所示。参照图7的(b)所示,在屏幕外不可视区域中,还显示有可对该被点击图片进行操作的工具栏。若用户在手机显示界面中的下半部分,即“照片”应用界面上进行上划操作,如702所示,则“照片”应用的显示界面整体上移,原本位于屏幕外不可视区域中的照片在手机显示界面的下半部分中显示,原本位于屏幕内可视区域中的照片上移且不可视,如图7中的(c)所示。其中,在图7的(a)-(c)中,虚线K为屏幕内可视区与屏幕外不可视区的划分界限。
3、打开语音助手之前的手机的显示界面为多任务界面。
在打开语音助手之前,手机的显示界面为多任务界面,示例性的,该多任务界面中显示有两个任务,这两个任务在手机的显示界面中分屏显示,分别为“照片”应用和“微信”应用,“微信”应用位于手机显示界面的上半部分,“照片”应用位于手机显示界面的下半部分,如图8中的(a)所示。打开语音助手后,语音助手默认以半屏态(H)在手机显示界面的上半部分显示,“微信”应用界面被关闭,且语音助手处于收音状态,“照片”应用保持不变,与语音助手分屏显示,即“照片”应用仍在手机显示界面的下半部分中显示,如图8中的(b)所示。参照图8的(b)所示,语音助手处于收音状态,语音助手的提示信息(例如“嗨,我在听…”)和提示图形(例如音波图形)可用于提示用户输入指示信息,如801所示。可选地,语音助手的显示界面上还显示有如802所示的语音技能推荐项“V”、“关键词1”、“关键词2”。“V”用于切换语音助手的输入形式,“关键词1”与“关键词2”不同,示例性的,“关键词1”和“关键词2”分别为“选定周末的照片”和“分享照片”,关于语音技能推荐项和用户输入指示信息的方式的具体描述,可以参见上述描述,在此不再赘述。
需要说明的是,语音助手的半屏态(H)可以进一步划分为小半屏态(H1)和大半屏态(H2)。此时,设定语音助手的默认形态一般为小半屏态(H1)。当然,语音助手的默认形态也可以根据需求设定为大半屏态(H2)。语音助手的大半屏态(H2)与其小半屏态(H1)相比,语音助手的大半屏态(H2)的显示界面更大,所能显示的内容较多。
S402、语音助手根据接收到的指示信息,确定其显示形态,且在执行指示信息所指示的操作后进入休眠状态。
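After step S401, the user enters instruction information into the voice assistant by voice or video. After receiving the instruction information, the voice assistant determines the service that it involves, converts it into text, and displays the text. The voice assistant then decides whether to switch its display form according to the current application scenario and the feedback form of the service involved. After completing the operation corresponding to the instruction information, the voice assistant stops listening and enters the dormant state.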
其中,所述服务包括语音助手自身平台上的应用所提供的服务(例如打开蓝牙、查询天气等)以及语音助手通过调用其它应用所提供的服务(例如发送微信、打开“淘宝”应用等),这些服务在语音助手中的反馈形式包括文本反馈、语音反馈、卡片反馈(反馈卡片中为与服务相关的设置项或应用项)及分屏反馈(应用界面发生变化)等。所述服务的反馈形式由与该服务相关的应用的设置项决定,例如,若“天气”应用只能以卡片形式在语音助手的显示界面中显示,则语音助手通过调用“天气”应用所提供的服务的反馈形式为卡片反馈。
语音助手的各个形态之间的切换过程可以参照图9所示。语音助手在接收到用户输入的指示信息后,根据当前的应用场景,以及用户所输入的指示信息的不同,其显示形态可能会发生变化,也可能不发生变化。下面对上述语音助手的显示形态发生变化和语音助手的显示形态不发生变化,这两种情况分别进行说明:
1、语音助手的显示形态不发生变化。
(1)、参照图9所示,语音助手为半屏态(H)时,如图9中的(a)所示,若用户输入的指示信息不缺少关键词,则语音助手与用户之间的交互过程为单轮语音交互,且该指示信息所涉及的服务的反馈形式为文本反馈、语音反馈,则语音助手的显示形态仍为半屏态(H),如图9中的(b)所示。
示例性的,打开语音助手后,语音助手以其默认形态,即半屏态,在手机显示界面的上半部分显示,部分应用的图标则在手机显示界面的下半部分显示。此时,若用户点击语音技能推荐项中的“打开蓝牙连接”或者通过语音或视频来输入“打开蓝牙连接”,此时,语音助手的显示界面如图5中的(d)所示。参照图5的(d)所示,语音助手接收到用户输入的指示信息后,如501所示,语音助手的提示信息(例如“嗨,我在听…”)改变为用户输入的指示信息,即“打开蓝牙连接”,此时501所示的提示图形为音波图形,但无语音技能推荐项。随后,语音助手确定指示信息“打开蓝牙连接”中不缺少关键词,即语音助手与用户之间的交互过程为单轮语音交互,且该指示信息所涉及的服务的反馈形式为文本反馈或者语音反馈,则在完成指示信息对应的操作后,语音助手的显示形态仍为半屏态(H),如图5中的(e)所示。在图5的(e)中,501所示的提示图形改变为悬浮球,表示语音助手停止收音,进入休眠状态,501所示的提示信息改变为该指示信息的反馈文本“好的,蓝牙已打开”,语音助手对语音技能推荐项进行更新并显示,如502所示。例如,更新后的语音技能推荐项为“关闭无线连接”、“打开照片”。若指示信息涉及到的服务的反馈形式为语音反馈,则语音助手需要在显示反馈文本的同时,语音输出反馈文本。
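For example, after the voice assistant is opened, it is displayed in its default form, the half-screen state, in the upper half of the phone's display interface, and the "Photos" application is displayed in the lower half. If the user now taps "select the weekend photos" among the voice skill recommendation items, or enters "select the weekend photos" by voice or video, the display interface of the voice assistant is as shown in (c) of FIG. 8. Referring to (c) of FIG. 8, after the voice assistant receives the instruction information, as shown at 801, its prompt information (for example, "Hi, I'm listening…") changes to the instruction information entered by the user, "select the weekend photos"; the prompt graphic shown at 801 is a sound-wave graphic, and no voice skill recommendation items are shown. The voice assistant then determines that no keyword is missing from "select the weekend photos", so the interaction with the user is a single-round voice interaction, and that the feedback form of the service involved is text feedback or voice feedback. After completing the corresponding operation, the voice assistant therefore remains in the half-screen state (H), and the weekend photos in the "Photos" application are selected (a picture with a tick in its lower right corner is a selected picture), as shown in (d) of FIG. 8. Referring to (d) of FIG. 8, the prompt graphic shown at 801 changes to a floating ball, indicating that the voice assistant has stopped listening and is dormant; the prompt information shown at 801 changes to the feedback text "The weekend photos have been selected"; and the voice assistant updates and displays the voice skill recommendation items, as shown at 802, for example "share photos" and "delete photos". If the feedback form of the service involved is voice feedback, the voice assistant also outputs the feedback text by voice while displaying it.
Similarly, when the voice assistant is displayed in the small half-screen state (H1), if the instruction information entered by the user is not missing keywords (so the interaction is a single-round voice interaction) and the feedback form of the service involved is text feedback or voice feedback, the display form remains the small half-screen state (H1). When the voice assistant is displayed in the large half-screen state (H2), if the instruction information is not missing keywords (again a single-round interaction) and the feedback form of the service involved is text feedback, voice feedback, or card feedback, the display form remains the large half-screen state (H2).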
(2)、参照图9所示,语音助手为全屏态(L)时,如图9中的(c)所示,不论用户输入的指示信息是否缺少关键词,若用户输入的指示信息所对应的服务的反馈形式为文本反馈、语音反馈或者卡片反馈,则语音助手的显示形态不发生改变,如图9中的(d)所示。语音助手接收到指示信息后,仍以全屏态(L)显示的示例,可参见下述内容。
2、语音助手的显示形态发生变化。
(1)、参照图9所示,语音助手以半屏态(H)显示,如图9中的(b)所示,若用户输入的指示信息不缺少关键词,则语音助手与用户之间的交互过程为单轮语音交互,且该指示信息所涉及的服务的反馈形式为卡片反馈,或者用户输入的指示信息缺少关键词,语音助手的显示形态由半屏态(H)切换为全屏态(L),如图9中的(c)所示。
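For example, after the voice assistant is opened, it is displayed by default in the half-screen state (H) in the upper half of the phone's display interface, and the "Photos" application is displayed in the lower half. If the user now taps "check today's weather in Shanghai" among the voice skill recommendation items, or enters it by voice or video, the display interface of the voice assistant is as shown in (c) of FIG. 6. Referring to (c) of FIG. 6, after the voice assistant receives the instruction information, as shown at 601, its prompt information (for example, "Hi, I'm listening…") changes to "check today's weather in Shanghai"; the prompt graphic shown at 601 is a sound-wave graphic, and no voice skill recommendation items are shown. The voice assistant then determines that no keyword is missing from "check today's weather in Shanghai", so the interaction is a single-round voice interaction, and that the feedback form of the service involved is card feedback. After completing the corresponding operation, the voice assistant therefore switches to the full-screen state (L), as shown in (d) of FIG. 6. Referring to (d) of FIG. 6, the prompt graphic shown at 601 is a floating ball, indicating that the voice assistant has stopped listening and entered the dormant state. Two buttons, "1" and "2", are displayed on either side of the floating ball and are used to switch the input form of the voice assistant. The input form is normally voice input; tapping button "1" switches it to keyboard input (the keyboard opens), and tapping button "2" switches it to video input (the camera opens). As shown at 602, the voice assistant updates and displays the voice skill recommendation items, for example "last weekend's photos" and "share photos". As shown at 603, the prompt information changes to the feedback text "Today's weather in Shanghai is sunny" and also includes the returned weather card, which shows today's detailed weather information for Shanghai. Optionally, if the feedback form of the service involved supports voice feedback, the voice assistant may output the feedback text "Today's weather in Shanghai is sunny" by voice while displaying it.
For example, the voice assistant is displayed in the half-screen state (H) in the upper half of the phone's display interface, and the "Photos" application is displayed in the lower half, as shown in (a) of FIG. 10. Referring to (a) of FIG. 10, the voice assistant is in the listening state, and its prompt information (for example, "Hi, I'm listening…") and prompt graphic (for example, a sound-wave graphic) prompt the user to enter instruction information, as shown at 1001. Optionally, the voice skill recommendation items "share photos" and "buy a plane ticket" are shown at 1002. If the user taps "buy a plane ticket" among the recommendation items, or enters "buy a plane ticket" by voice or video, the display interface is as shown in (b) of FIG. 10. Referring to (b) of FIG. 10, after the voice assistant receives the instruction information, as shown at 1001, its prompt information changes to "buy a plane ticket"; the prompt graphic shown at 1001 is a sound-wave graphic, and no recommendation items are shown. The voice assistant then determines that "buy a plane ticket" is missing information for key slots, so its interaction with the user automatically enters the multi-round voice interaction process and its display form switches to the full-screen state (L), as shown in (c) of FIG. 10. In (c) of FIG. 10, as shown at 1003, the prompt graphic is a sound-wave graphic, indicating that the voice assistant is in the listening state, and two buttons, "1" and "2", are displayed on either side of the sound-wave graphic. "1" and "2" are used to switch the input form of the voice assistant. The input form is normally voice input; tapping button "1" switches it to keyboard input (the keyboard opens), and tapping button "2" switches it to video input (the camera opens). The content shown at 1004 is the multi-round voice interaction between the voice assistant and the user, from which the voice assistant determines that the user's intent is "buy a ticket to Shanghai for the afternoon of 8.24". The voice assistant now determines that no keyword is missing from this intent and that the feedback form of the corresponding service is card feedback, so its display form remains the full-screen state (L). After the corresponding operation is performed according to the user's intent, the display is as shown in (d) of FIG. 10. Referring to (d) of FIG. 10, as shown at 1003, the prompt graphic switches to a floating ball, indicating that the voice assistant has stopped listening and entered the dormant state; as shown at 1005, the prompt information changes to the feedback text "The ticket to Shanghai for the afternoon of 8.24 has been purchased" together with the corresponding feedback card; and as shown at 1006, the voice assistant updates and displays the voice skill recommendation items, which become "rebook", "refund", "share itinerary", and so on.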
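The "buy a plane ticket" example turns on detecting which key slots are missing and asking for them one at a time. A minimal sketch of that idea follows; the intent names, the slot table, and the way user replies are simulated with a dictionary are all assumptions made for illustration.

```python
# Hypothetical slot-filling sketch; the intents, slots and helper names are
# illustrative assumptions, not the source's implementation.

from typing import Dict, List, Tuple

REQUIRED_SLOTS: Dict[str, List[str]] = {
    "open_bluetooth": [],                     # nothing missing: single round
    "buy_ticket": ["date", "destination"],    # needs follow-up questions
}


def missing_slots(intent: str, slots: Dict[str, str]) -> List[str]:
    """Return the required slots that the instruction did not supply."""
    return [name for name in REQUIRED_SLOTS[intent] if name not in slots]


def run_dialogue(intent: str, slots: Dict[str, str],
                 answers: Dict[str, str]) -> Tuple[Dict[str, str], int]:
    """Ask once per missing slot; `answers` simulates the user's replies."""
    filled = dict(slots)
    follow_up_turns = 0
    for name in missing_slots(intent, filled):
        filled[name] = answers[name]
        follow_up_turns += 1
    return filled, follow_up_turns
```

An instruction with no missing slots needs zero follow-up turns, which corresponds to the single-round case; any non-empty result from `missing_slots` corresponds to the multi-round case that moves the assistant to the full-screen state.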
需要说明的是,语音助手在以全屏态(L)显示时,用户可以访问手机的设置项以及用户与手机上的语音助手进行语音对话的历史等内容,其所能实现的功能相对于其半屏态(H)来说更加完整,语音交互过程也更加沉浸,也就是说用户的注意力会更加集中。另外,语音助手在以半屏态(H)显示时,也可以通过下划语音助手界面的方式,切换语音助手的显示形态为全屏态(L)。
相类似的,当语音助手以小半屏态(H1)显示时,若用户输入的指示信息不缺少关键词,则语音助手与用户之间的交互过程为单轮语音交互。且该指示信息所涉及的服务的反馈形式为卡片反馈,则语音助手的显示形态切换为大半屏态(H2)。
示例性的,在图11的(a)中,语音助手以小半屏态(H1)显示,此时,语音助手处于收音状态,如1101所示,提示图形(例如音波图形)和文字(例如“嗨,我在听…”)用于提醒用户输入指示信息,如1102所示,语音技能推荐项为“V”、“上周末的照片”、“分享照片”等。语音助手通过语音等方式接收到指示信息,例如“打开蓝牙连接”,则手机的显示界面如图11中的(b)所示。参照图11的(b)所示,用户输入指示信息后,如1101所示,提示图形仍为音波图形,提示信息改变为该指示信息“打开蓝牙连接”。随后,语音助手确定该指示信息中不缺少关键词,且该指示信息对应的服务的反馈形式为文本反馈或语音反馈,则语音助手的显示状态仍为小半屏态(H1),如图11中的(c)所示。参照图11的(c)所示,语音助手在执行指示信息“打开蓝牙连接”所对应的服务后,如1101所示,提示信息改变为该服务的反馈文本“好的,蓝牙已打开”。若指示信息涉及到的服务的反馈形式为语音反馈,则语音助手需要在显示反馈文本的同时,语音输出反馈文本。另外,1101所示的提示图形改变为悬浮球,表示语音助手停止收音,处于休眠状态。如1102所示,语音助手对推荐项信息进行更新并显示,更新后的语音技能推荐项为“V”、“关闭蓝牙”、“上周末的照片”等。若反馈信息为卡片反馈,则手机的显示界面如图11中的(d)所示,语音助手的形态切换为大半屏态(H2)显示,1101和1102所示内容不变,如1103所示,语音助手的显示界面中还包括反馈卡片,该反馈卡片的内容为蓝牙开关的设置项(此时表示蓝牙处于打开状态)。
需要说明的是,语音助手的大半屏态(H2)是由语音助手对接收到的指示信息的反馈内容进行识别和计算后自动进入的,无法手动进入。
另外,当语音助手以小半屏态(H1)或者大半屏态(H2)显示时,若用户输入的指示信息缺少关键词,则语音助手与用户之间的交互过程为多轮语音交互,语音助手的显示形态切换为全屏态(L)。具体的,可参见语音助手的半屏态(H)切换为全屏态(L)的示例,在此不再赘述。
(2)、参照图9所示,语音助手为半屏态(H),如图9的(a)所示,若用户输入的指示信息不缺少关键词,且该指示信息所涉及的服务的反馈形式为分屏反馈,则触发“模拟点击”技能,语音助手的显示形态切换为悬浮态(F),如图9的(e)所示。
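For example, the voice assistant is displayed in the half-screen state (H) in the upper half of the phone's display interface, and the "Photos" application is displayed in the lower half. Referring to (a) of FIG. 12, after the voice assistant receives the instruction information "send a WeChat message to Duoduo asking whether to have dinner at the weekend", as shown at 1201, the prompt graphic is a sound-wave graphic and the prompt text (for example, "Hi, I'm listening…") changes to that instruction information. The voice assistant then determines that no key information is missing from the instruction information and that the feedback form of the corresponding service is split-screen feedback: it has to work with the third-party "WeChat" application and trigger the "simulated click" (deeplink) skill to complete the corresponding operation. The voice assistant therefore switches its display form to the floating state (F), as shown in (b) of FIG. 12. Referring to (b) of FIG. 12, the voice assistant enters the floating state (F); as shown at 1201, the prompt graphic changes to a floating ball, the prompt information changes to the instruction information "send a WeChat message to Duoduo asking whether to have dinner at the weekend", and the "Photos" application returns to full-screen display. Referring to (c) of FIG. 12, the "simulated click" skill continues to run: the "Photos" application is closed and the "WeChat" application opens and is displayed full screen; as shown at 1201, the prompt graphic is a floating ball and the prompt information is "send a WeChat message to Duoduo asking whether to have dinner at the weekend". If the floating ball is tapped before the "simulated click" skill completes, the skill is terminated and the display is as shown in (d) of FIG. 12. Referring to (d) of FIG. 12, as shown at 1201, the prompt graphic is a floating ball with no prompt information, indicating that the voice assistant has stopped listening and entered the dormant state. If the "simulated click" skill executes successfully and the voice assistant completes the indicated operation, the display is as shown in (e) of FIG. 12. Referring to (e) of FIG. 12, as shown at 1201, the prompt graphic is a floating ball and its text prompt is the feedback text for the execution result, "Sent", indicating that the operation corresponding to the instruction information has been completed. The voice assistant then stops listening and enters the dormant state, as shown in (f) of FIG. 12, where the prompt graphic shown at 1201 is a floating ball with no prompt information. Note that if the WeChat contact list contains more than one "Duoduo", the voice assistant must determine the recipient before completing the operation, as shown in (h) of FIG. 12. Referring to (h) of FIG. 12, the contact list includes "Zhou Duoduo" and "Li Duoduo". As shown at 1202, the user can tap the row of a contact to select "Zhou Duoduo" as the recipient of the WeChat message, or tap the prompt "first contact" shown at 1201 to select "Zhou Duoduo"; in either case the message is then sent to "Zhou Duoduo" in accordance with the instruction information.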
需要说明的是,语音助手在切换其显示形态为悬浮态(F)的过程中,应用全屏显示会使“模拟点击”技能的成功率提高。另外,语音助手的悬浮态(F)是由语音助手对接收到的指示信息经过识别和计算后自动进入的,无法手动进入。
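Two behaviours from the "simulated click" example lend themselves to a short sketch: stopping the skill when the floating ball is tapped before it finishes, and asking the user to choose when several contacts match. The helper names, the step strings, and the substring match used for contacts are assumptions for illustration only; no real deeplink or WeChat API is modelled.

```python
# Hypothetical sketch; nothing here calls a real deeplink or messaging API.

from typing import List, Optional, Tuple


def resolve_contact(name: str, contacts: List[str],
                    choice: Optional[int] = None) -> Optional[str]:
    """Return the recipient, or None when the user still has to pick one."""
    matches = [c for c in contacts if name in c]
    if not matches:
        raise LookupError(f"no contact matches {name!r}")
    if len(matches) == 1:
        return matches[0]
    if choice is None:
        return None              # ambiguous: show the list and wait
    return matches[choice]       # e.g. 0 for "first contact"


def run_simulated_click(steps: List[str],
                        cancelled_before: Optional[int] = None) -> Tuple[List[str], str]:
    """Run steps in order; a tap on the ball before step `cancelled_before` aborts."""
    done: List[str] = []
    for index, step in enumerate(steps):
        if cancelled_before is not None and index >= cancelled_before:
            return done, "dormant"   # terminated, no feedback text shown
        done.append(step)
    return done, "sent"              # completed, feedback text "Sent"
```

For instance, with contacts "Zhou Duoduo" and "Li Duoduo", `resolve_contact("Duoduo", contacts)` returns `None` until the user supplies a choice.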
相类似的,语音助手以小半屏态(H1)或者大半屏态(H2)显示时,若用户输入的指示信息不缺少关键词,语音助手与用户之间的交互过程为单轮语音交互。且该指示信息所涉及的服务的反馈形式为分屏反馈,则语音助手的显示形态切换为悬浮态(F)。具体的,可参见语音助手的半屏态(H)切换为悬浮态(F)的示例,在此不再赘述。
(3)、参照图9所示,语音助手为全屏态(L),如图9的(d)所示,若用户输入的指示信息所涉及的服务的反馈形式为分屏反馈,则触发“模拟点击”技能,语音助手进入悬浮态(F),如图9的(e)所示。
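Optionally, after the voice assistant switches from the half-screen state (H) to the full-screen state (L), it determines the user's intent through multi-round voice interaction; for example, the intent is "send a WeChat message to Duoduo asking whether to have dinner at the weekend". If the feedback form of the service corresponding to that intent is text feedback, voice feedback, or card feedback, the display form of the voice assistant remains the full-screen state (L); if it is split-screen feedback, the display form switches to the floating state (F).
For example, the voice assistant is displayed in the full-screen state (L), as shown in (a) of FIG. 13. Referring to (a) of FIG. 13, the voice assistant is in the listening state. As shown at 1301, the prompt information is "What can I help you with?"; as shown at 1303, the prompt graphic is a sound-wave graphic with two buttons, "1" and "2", on either side, which are used to switch the input form of the voice assistant. The prompt information shown at 1301 and the prompt graphic shown at 1303 prompt the user to enter instruction information. As shown at 1302, the voice skill recommendation items are "turn off the wireless network", "change the ringtone", and so on. The user enters the instruction information "send a WeChat message to Duoduo asking whether to have dinner at the weekend" by voice or another means, as shown in (b) of FIG. 13. Referring to (b) of FIG. 13, the "simulated click" skill starts and the "WeChat" application opens and is displayed full screen; as shown at 1303, the prompt graphic is a sound-wave graphic with buttons "1" and "2" on either side, the content shown at 1301 and 1302 is unchanged, and 1304 shows the instruction information entered by the user. The voice assistant then determines that no keyword is missing from the instruction information and that the feedback form of the corresponding operation is split-screen feedback; it triggers the "simulated click" skill and works with the third-party "WeChat" application to complete the operation corresponding to the user's intent. The display form of the voice assistant switches to the floating state (F), as shown in (c) of FIG. 13. Referring to (c) of FIG. 13, as shown at 1303, the prompt graphic is a floating ball, with the prompt information "send a WeChat message to Duoduo asking whether to have dinner at the weekend" displayed on one side. If the "simulated click" skill executes successfully and the voice assistant completes the indicated operation, the display form is as shown in (c) of FIG. 13, with the feedback text "Sent" displayed on one side of the floating ball shown at 1303. The voice assistant then enters the dormant state, as shown in (d) of FIG. 13, where the floating ball shown at 1303 indicates that the voice assistant has stopped listening and entered the dormant state.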
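The switching rules described in this step (stay in place for text or voice feedback, go full screen for missing keywords or card feedback, go floating for split-screen feedback) can be collected into one decision function. This is a sketch under stated assumptions: the enum and function names are invented here, only the three top-level forms are modelled (the H1/H2 split is left out), and raising an error for the floating state reflects that the text describes no instruction handling while floating.

```python
# Hypothetical sketch of the display-form decision; names are illustrative.

from enum import Enum


class Form(Enum):
    HALF = "H"
    FULL = "L"
    FLOATING = "F"


class Feedback(Enum):
    TEXT = "text"
    VOICE = "voice"
    CARD = "card"
    SPLIT_SCREEN = "split_screen"


def next_form(current: Form, missing_keyword: bool, feedback: Feedback) -> Form:
    """Decide the display form after an instruction has been received."""
    if current is Form.HALF:
        if missing_keyword:
            return Form.FULL          # multi-round interaction
        if feedback in (Feedback.TEXT, Feedback.VOICE):
            return Form.HALF          # single round, lightweight feedback
        if feedback is Feedback.CARD:
            return Form.FULL
        return Form.FLOATING          # split-screen feedback, simulated click
    if current is Form.FULL:
        # Per the text, text/voice/card feedback keeps the full-screen state
        # whether or not keywords are missing; split-screen goes floating.
        if feedback is Feedback.SPLIT_SCREEN:
            return Form.FLOATING
        return Form.FULL
    raise ValueError("no rule is described for instructions received while floating")
```

Keeping the rules in a single pure function makes it straightforward to check each case from the text against the code.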
另外,语音助手在以半屏态(H)显示时,用户还可以通过触摸手机显示界面的下半部分的形式,使得语音助手停止收音,进入休眠状态。语音助手在以全屏态(L)显示时,用户可以访问手机的设置项以及用户与手机上的语音助手进行语音对话的历史等内容,其所能实现的功能相对于其半屏态(H)来说更加完整,语音交互过程也更加沉浸,也就是说用户的注意力会更加集中。此外,对于语音助手的半屏态(H)和全屏态(L),用户也可以通过点击表示语音助手处于收音状态的图形,或者语音交互的方式,使语音助手停止收音,进入休眠状态。语音助手以悬浮态(F)时,用户可以利用上述示例中所提到的方式,在“模拟点击”技能成功执行之前,点击悬浮球,以使得语音助手进入休眠状态。
另外,语音助手完成指示信息所对应的操作后,会进入休眠状态,休眠状态的语音助手的显示形态可能为半屏态(H)、全屏态(L)或者悬浮态(F)。随后,用户通过点击悬浮球等方式,重新唤醒语音助手,根据语音助手处于休眠时的显示状态的不同,语音助手的显示形态,可能会发生变化,也可能不发生变化,下面结合图6、图9以及图12,对上述两种情况分别进行说明:
1、重新唤醒语音助手后,语音助手的显示形态发生变化。
若语音助手处于休眠时,其显示形态为悬浮态(F),如图9的(e)所示,则语音助手的显示形态切换为其默认形态,即半屏态(H),如图9的(a)所示。
示例性的,语音助手处于休眠状态时,手机显示界面如图12的(f)所示。参照图12的(f)所示,若点击1201所示的悬浮球,则语音助手被唤醒,其唤醒后的形态为默认形态,即半屏态(H),手机显示界面的下半部分所显示的应用为“微信”应用,如图12的(g)所示。参照图12的(g)所示,语音助手处于收音状态,如1301所示,提示图形(如音波图形)和提示信息(例如“嗨,我在听…”)来提示用户输入指示信息,1302所示为语音技能推荐项,例如“V”、“返回照片”、“发微信”“发红包”等。
2、重新唤醒语音助手后,语音助手的显示形态不发生变化。
若语音助手处于休眠时,其显示形态为半屏态(H)或者全屏态(L),如图9的(b)或(d)所示,则语音助手重新被唤醒后,其显示形态依旧为半屏态(H),或者全屏态(L),如图9的(a)或(c)所示。
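For example, if the voice assistant is in the full-screen state (L) when dormant, it is still displayed in the full-screen state (L) after being woken, as shown in (d) of FIG. 6. Referring to (d) of FIG. 6, the graphic shown at 601 switches to a floating ball, with two buttons, "1" and "2", displayed on either side. 602 shows the updated voice skill recommendation items "Keyword 3" and "Keyword 4", and the weather card shown at 603 contains the feedback text "Today's weather in Shanghai is sunny" and today's detailed weather information for Shanghai. "Keyword 3" and "Keyword 4" may be the same as or different from "Keyword 1" and "Keyword 2"; for example, they are "last weekend's photos" and "share photos", respectively. The buttons "1" and "2" on either side of the floating ball shown at 601 are used to switch the input form of the voice assistant. The input form is normally voice input; tapping button "1" switches it to keyboard input (the keyboard opens), and tapping button "2" switches it to video input (the camera opens).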
示例性的,若语音助手处于休眠时,其显示形态为半屏态(H),则语音助手被唤醒后,依旧以半屏态(H)显示,如图6中的(b)所示。参照图6的(b)所示,语音助手处于收音状态,如601所示的图形和/或文字(例如“嗨,我在听…”)可用于提示用户输入指示信息。可选的,语音助手显示界面上还显示有如602所示的语音技能推荐项“V”、“关键词1”、“关键词2”,“V”用于切换语音助手的输入形式,“关键词1”与“关键词2”不同,示例性的,“关键词1”和“关键词2”分别为“周末的照片”和“分享照片”。关于语音技能推荐项和用户输入指示信息的方式的具体描述,可以参见上述描述,在此不再赘述。
S403、唤醒语音助手,根据语音助手休眠时的显示形态,确定其唤醒后的形态。
语音助手进入休眠状态后,用户可以通过点击悬浮球的方式,来唤醒语音助手。关于语音助手重新被唤醒后的显示形态的变化,可以参见上述步骤S402中的描述,在此不再赘述。
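The wake-up rule of step S403 reduces to a small lookup: a floating assistant returns to the default form, while half-screen and full-screen forms are kept. The function name and the string labels are assumptions chosen to match the abbreviations used in the text.

```python
# Hypothetical sketch of the form chosen on wake-up; labels follow the text.

def form_after_wake(form_when_dormant: str, default_form: str = "H") -> str:
    """Return the display form after the dormant assistant is woken."""
    if form_when_dormant == "F":
        return default_form              # floating -> default form
    if form_when_dormant in ("H", "H1", "H2", "L"):
        return form_when_dormant         # half-screen and full-screen are kept
    raise ValueError(f"unknown display form: {form_when_dormant!r}")
```

The `default_form` parameter reflects the statement that the user may set the default to a form other than the half-screen state.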
S404、语音助手重新接收指示信息,根据新的指示信息确定其显示形态,并且在执行指示信息所指示的操作后进入休眠状态。
关于本步骤S404的具体实现过程可以参见步骤S402中的描述,在此也不再赘述。
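In general, in the form-switching process, the switch from the half-screen state (H) to the full-screen state (L) and the switch from the full-screen state (L) to the floating state (F) are irreversible. That is, the voice assistant cannot switch from the full-screen state (L) to the half-screen state (H), nor from the floating state (F) to the full-screen state (L). In the half-screen state (H), the voice assistant can switch its display form to the full-screen state (L) or the floating state (F). In the full-screen state (L), it can switch to the floating state (F). In the floating state (F), it can only switch to the default form, which in this embodiment is the half-screen state (H).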
需要说明的是,语音助手的形态转换也可以根据需求设置为可逆,即语音助手可从全屏态(L)切换为半屏态(H)。语音助手以大半屏态(H2)显示时,也可以根据用户输入的指示信息和当前场景切换为小半屏态(H1),语音助手以全屏态(L)显示时,也可以根据用户输入的指示信息和当前场景切换为大半屏态(H2)。
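The permitted transitions, including the optional reversible configuration, can be expressed as a small predicate. The function name and the `reversible` flag are assumptions introduced here; only the three top-level forms are modelled, so the H2-to-H1 and L-to-H2 variants mentioned for the reversible case are not covered.

```python
# Hypothetical sketch of which form switches are permitted.

def can_switch(src: str, dst: str, reversible: bool = False,
               default_form: str = "H") -> bool:
    """Return True if the assistant may go from form `src` to form `dst`."""
    if src == dst:
        return True                       # staying in place is always allowed
    if src == "H":
        return dst in ("L", "F")
    if src == "L":
        # L -> H is only available when switching is configured as reversible.
        return dst == "F" or (reversible and dst == "H")
    if src == "F":
        return dst == default_form        # floating can only return to the default
    raise ValueError(f"unknown display form: {src!r}")
```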
S405、关闭并退出语音助手。
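The display form of the voice assistant may be the half-screen state (H), the full-screen state (L), or the floating state (F). When the voice assistant is displayed in the half-screen state (H) or the full-screen state (L), the user can close and exit it through voice interaction or by swiping up on the voice assistant's display interface. When it is displayed in the floating state (F), the user can close and exit it by swiping the floating ball up or down.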
需要说明的是,若语音助手在以半屏态(H)显示时,手机显示界面的下半部分中无其他应用界面显示,则语音助手在退出后,手机的显示界面为无任务界面,如图5中的(a)所示。若语音助手在以半屏态显示时,手机显示界面的下半部分中有其他应用界面显示(以“照片”应用为例),则语音助手在退出后,手机显示界面下半部分中的应用为全屏显示,手机的显示界面为单任务界面,如图6中的(a)所示。
通过上述过程,语音助手可以根据手机的实际场景,以及用户所输入的指示信息进行语音助手的形态切换,从而实现语音助手的形态与手机上的实际场景的系统级融合,提升用户体验。
通过上述过程,本申请提供了一种语音助手显示方法,打开语音助手后,语音助手以预设的默认显示形态显示。随后根据输入语音助手的指示信息,以及该指示信息所指示的服务,确定语音助手的显示形态,以使得语音助手可以根据指示信息来确定实际场景的变化,并根据该实际场景切换相应的形态,使得语音助手与系统协同一体,从而实现语音助手与手机的系统级融合。
本申请实施例还提供一种芯片系统,如图14所示,该芯片系统包括至少一个处理器1401和至少一个接口电路1402。处理器1401和接口电路1402可通过线路互联。例如,接口电路1402可用于从其它装置(例如电子设备100的存储器)接收信号。又例如,接口电路1402可用于向其它装置(例如处理器1401)发送信号。示例性的,接口电路1402可读取存储器中存储的指令,并将该指令发送给处理器1401。当所述指令被处理器1401执行时,可使得电子设备执行上述实施例中的电子设备100(比如,手机)执行的各个步骤。当然,该芯片系统还可以包含其他分立器件,本申请实施例对此不作具体限定。
所属领域的技术人员可以清楚地了解到,为描述的方便和简洁,上述描述的系统、装置和单元的具体工作过程,可以参考前述方法实施例中的对应过程,在此不再赘述。
上述实施例可以全部或部分通过软件,硬件,固件或者其任意组合实现。当使用软件程序实现时,上述实施例可以全部或部分地以计算机程序产品的形式出现,计算机程序产品包括一个或多个计算机指令。在计算机上加载和执行计算机程序指令时,全部或部分地产生按照本申请实施例的流程或功能。
其中,所述计算机可以是通用计算机、专用计算机、计算机网络、或者其他可编程装置。计算机指令可以存储在计算机可读存储介质中,或者从一个计算机可读存储介质向另一个计算机可读存储介质传输,例如,计算机指令可以从一个网站站点、计算机、服务器或数据中心通过有线(例如同轴电缆、光纤、数字用户线(digital subscriber line,DSL))或无线(例如红外、无线、微波等)方式向另一个网站站点、计算机、服务器或数据中心传输。计算机可读存储介质可以是计算机能够存取的任何可用介质或者是包含一个或多个可用介质集成的服务器、数据中心等数据存储设备。该可用介质可以是磁性介质,(例如,软盘,硬盘、磁带)、光介质(例如,DVD)或者半导体介质(例如固态硬盘(solid state disk,SSD))等。
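From the description of the foregoing implementations, those skilled in the art can clearly understand that, for convenience and brevity of description, only the division into the functional modules described above is used as an example. In practical applications, the functions can be assigned to different functional modules as needed, that is, the internal structure of the apparatus can be divided into different functional modules to complete all or some of the functions described above.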
在本申请所提供的几个实施例中,应该理解到,所揭露的装置和方法,可以通过其它的方式实现。例如,以上所描述的装置实施例仅仅是示意性的,例如,所述模块或单元的划分,仅仅为一种逻辑功能划分,实际实现时可以有另外的划分方式,例如多个单元或组件可以结合或者可以集成到另一个装置,或一些特征可以忽略,或不执行。另一点,所显示或讨论的相互之间的耦合或直接耦合或通信连接可以是通过一些接口,装置或单元的间接耦合或通信连接,可以是电性,机械或其它的形式。
所述作为分离部件说明的单元可以是物理上分开的,或者也可以不是物理上分开的,作为单元显示的部件可以是一个物理单元或多个物理单元,即可以位于一个地方,或者也可以分布到多个不同地方。在应用过程中,可以根据实际的需要选择其中的部分或者全部单元来实现本实施例方案的目的。
另外,在本申请各个实施例中的各功能单元可以集成在一个处理单元中,也可以是各个单元单独物理存在,也可以两个或两个以上单元集成在一个单元中。上述集成的单元既可以采用硬件的形式实现,也可以采用软件功能单元的形式实现。
所述集成的单元如果以软件功能单元的形式实现并作为独立的产品销售或使用时,可以存储在一个计算机可读取存储介质中。基于这样的理解,本申请实施例的技术方案本质上或者说对现有技术做出贡献的部分或者该技术方案的部分可以以软件产品的形式体现出来,该计算机软件产品存储在一个存储介质中,包括若干指令用以使得一个设备(可以是个人计算机,服务器,网络设备,单片机或者芯片等)或处理器(processor)执行本申请各个实施例所述方法的全部或部分步骤。而前述的存储介质包括:U盘、移动硬盘、只读存储器(read-only memory,ROM)、随机存取存储器(random access memory,RAM)、磁碟或者光盘等各种可以存储程序代码的介质。
以上所述,仅为本申请的具体实施方式,但本申请的保护范围并不局限于此,任何在本申请揭露的技术范围内的变化或替换,都应涵盖在本申请的保护范围之内。

Claims (25)

  1. 一种语音助手显示方法,应用于电子设备,其特征在于,所述语音助手的显示形态包括半屏态、全屏态以及悬浮态;其中,所述半屏态是指所述语音助手的显示界面占电子设备的整体显示界面的比例小于1;所述全屏态是指所述语音助手的显示界面占电子设备的整体显示界面的比例为1;所述悬浮态是指所述语音助手悬浮显示于电子设备的当前显示界面;所述方法包括:
    打开语音助手,语音助手以第一显示形态显示;其中,所述第一显示形态为语音助手预设的默认显示形态;
    根据输入语音助手的指示信息以及所述指示信息所指示的服务,确定语音助手的显示形态。
  2. 根据权利要求1所述的语音助手显示方法,其特征在于,所述第一显示形态为半屏态;
    所述打开语音助手,语音助手以第一显示形态显示,具体包括:
    打开语音助手,当前任务界面整体向下移动;
    语音助手以半屏态与当前任务界面分屏显示。
  3. 根据权利要求1或2所述的语音助手显示方法,其特征在于,所述根据输入语音助手的指示信息以及所述指示信息所指示的服务,确定语音助手的显示形态,具体包括:
    若所述指示信息缺少关键词,则语音助手的显示形态为全屏态;
    若所述指示信息不缺少关键词,且所述指示信息所指示的服务的反馈形式为文本反馈或语音反馈,则语音助手的显示形态为半屏态;
    若所述指示信息不缺少关键词,且所述指示信息所指示的服务的反馈形式为卡片反馈,则语音助手的显示形态为全屏态;所述指示信息所指示的服务所涉及到的应用在语音助手的显示界面中以卡片形式显示,则所述指示信息所指示的服务的反馈形式为卡片反馈;
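    if the instruction information is not missing keywords, and the feedback form of the service indicated by the instruction information is split-screen feedback, the display form of the voice assistant is the floating state; wherein, if the service indicated by the instruction information involves switching of application interfaces, the feedback form of the service indicated by the instruction information is split-screen feedback.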
  4. 根据权利要求1-3任一项所述的语音助手显示方法,其特征在于,语音助手的半屏态还包括小半屏态和大半屏态,其中,所述小半屏态是指所述语音助手的显示界面占电子设备的整体显示界面的比例小于0.5;所述大半屏态是指所述语音助手的显示界面占电子设备的整体显示界面的比例大于0.5;所述第一显示形态为小半屏态。
  5. 根据权利要求4所述的语音助手显示方法,其特征在于,所述根据输入语音助手的指示信息以及所述指示信息所指示的服务,确定语音助手的显示形态包括:
    若所述指示信息不缺少关键词,且所述指示信息所指示的服务的反馈形式为文本反馈或语音反馈,则语音助手的显示形态为小半屏态;
    若所述指示信息不缺少关键词,且所述指示信息所指示的服务的反馈形式为卡片反馈,则语音助手的显示形态为大半屏态。
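  6. The voice assistant display method according to any one of claims 1 to 3, characterized in that, after the opening of the voice assistant with the voice assistant displayed in the first display form, the method further comprises: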
    语音助手进入休眠状态;
    唤醒语音助手,并确定语音助手的显示形态。
  7. 根据权利要求6所述的语音助手显示方法,其特征在于,所述唤醒语音助手,并确定语音助手的显示形态,包括:
    若语音助手进入休眠状态时,其显示形态为悬浮态,则唤醒语音助手后,语音助手的显示形态为所述第一显示形态;
    若语音助手进入休眠状态时,其显示形态为半屏态,则唤醒语音助手后,语音助手的显示形态为半屏态;
    若语音助手进入休眠状态时,其显示形态为全屏态,则唤醒语音助手后,语音助手的显示形态为全屏态。
  8. 根据权利要求7所述的语音助手显示方法,其特征在于,语音助手的半屏态还包括小半屏态和大半屏态,其中,所述小半屏态是指所述语音助手的显示界面占电子设备的整体显示界面的比例小于0.5;所述大半屏态是指所述语音助手的显示界面占电子设备的整体显示界面的比例大于0.5;所述第一显示形态为小半屏态;
    所述唤醒语音助手,并确定语音助手的显示形态,包括:
    所述若语音助手进入休眠状态时,其显示形态为小半屏态,则唤醒语音助手后,语音助手的显示形态为小半屏态;
    所述若语音助手进入休眠状态时,其显示形态为大半屏态,则唤醒语音助手后,语音助手的显示形态为大半屏态。
  9. 根据权利要求6或8所述的语音助手显示方法,其特征在于,唤醒语音助手,并确定语音助手的显示形态之后,所述方法还包括:
    根据新的指示信息以及所述新的指示信息指示的服务,确定语音助手的新的显示形态。
  10. 根据权利要求9所述的语音助手显示方法,其特征在于,所述根据新的指示信息以及所述新的指示信息对应的服务,确定语音助手的新的显示形态,具体包括:
    若语音助手的显示形态为全屏态,且所述新的指示信息指示的服务的反馈形式为文本反馈、语音反馈或者卡片反馈,则语音助手的新的显示形态为全屏态;所述新的指示信息所指示的服务所涉及到的应用在语音助手的显示界面中以卡片形式显示,则所述新的指示信息所指示的服务的反馈形式为卡片反馈;
    若语音助手的显示形态为全屏态,且所述新的指示信息指示的服务的反馈形式为分屏反馈,则语音助手的新的显示形态为悬浮态;所述新的指示信息所指示的服务涉及到应用界面切换,则所述新的指示信息所指示的服务的反馈形式为分屏反馈;
    若语音助手的显示形态为半屏态,且所述新的指示信息缺少关键词,则语音助手的新的显示形态为全屏态;
    若语音助手的显示形态为半屏态,且所述新的指示信息不缺少关键词,所述新的指示信息对应的服务的反馈形式为文本反馈或语音反馈,则语音助手的新的显示形态为半屏态;
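    if the display form of the voice assistant is the half-screen state, the new instruction information is not missing keywords, and the feedback form of the service corresponding to the new instruction information is card feedback, the new display form of the voice assistant is the full-screen state;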
    若语音助手的显示形态为半屏态,且所述新的指示信息不缺少关键词,所述新的指示信息对应的服务的反馈形式为分屏反馈,则语音助手的新的显示形态为悬浮态。
  11. 根据权利要求9或10所述的语音助手显示方法,其特征在于,语音助手的半屏态还包括小半屏态和大半屏态,其中,所述小半屏态是指所述语音助手的显示界面占电子设备的整体显示界面的比例小于0.5;所述大半屏态是指所述语音助手的显示界面占电子设备的整体显示界面的比例大于0.5;所述第一显示形态为小半屏态;
    所述根据新的指示信息以及所述新的指示信息对应的服务,确定语音助手的新的显示形态,包括:
    若语音助手的显示形态为小半屏态,且所述新的指示信息不缺少关键词,所述新的指示信息所指示的服务的反馈形式为文本反馈或语音反馈,则语音助手的显示形态为小半屏态;
    若语音助手的显示形态为小半屏态,所述新的指示信息不缺少关键词,所述新的指示信息所指示的服务的反馈形式为卡片反馈,则语音助手的显示形态为大半屏态。
  12. 一种电子设备,其特征在于,包括:处理器、存储器和触摸屏,所述存储器、所述触摸屏与所述处理器耦合,所述存储器用于存储计算机程序代码,所述计算机程序代码包括计算机指令,当所述处理器从所述存储器中读取所述计算机指令,以使得所述电子设备执行如下操作:
    打开语音助手,语音助手以第一显示形态显示;其中,所述第一显示形态为语音助手预设的默认显示形态;
    根据输入语音助手的指示信息以及所述指示信息所指示的服务,确定语音助手的显示形态;
    其中,所述语音助手的显示形态包括半屏态、全屏态以及悬浮态;其中,所述半屏态是指所述语音助手的显示界面占电子设备的整体显示界面的比例小于1;所述全屏态是指所述语音助手的显示界面占电子设备的整体显示界面的比例为1;所述悬浮态是指所述语音助手悬浮显示于电子设备的当前显示界面。
  13. 根据权利要求12所述的电子设备,其特征在于,所述第一显示形态为半屏态,当所述处理器从所述存储器中读取所述计算机指令,以使得所述电子设备还执行如下操作:
    打开语音助手,当前任务界面整体向下移动;
    语音助手以半屏态与当前任务界面分屏显示。
  14. 根据权利要求12或13所述的电子设备,其特征在于,当所述处理器从所述存储器中读取所述计算机指令,以使得所述电子设备还执行如下操作:
    若所述指示信息缺少关键词,则语音助手的显示形态为全屏态;
    若所述指示信息不缺少关键词,且所述指示信息所指示的服务的反馈形式为文本反馈或语音反馈,则语音助手的显示形态为半屏态;
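    if the instruction information is not missing keywords, and the feedback form of the service indicated by the instruction information is card feedback, the display form of the voice assistant is the full-screen state; wherein, if the application involved in the service indicated by the instruction information is displayed in the form of a card in the display interface of the voice assistant, the feedback form of the service indicated by the instruction information is card feedback;
    if the instruction information is not missing keywords, and the feedback form of the service indicated by the instruction information is split-screen feedback, the display form of the voice assistant is the floating state; wherein, if the service indicated by the instruction information involves switching of application interfaces, the feedback form of the service indicated by the instruction information is split-screen feedback.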
  15. 根据权利要求12-14任一项所述的电子设备,其特征在于,语音助手的半屏态还包括小半屏态和大半屏态,其中,所述小半屏态是指所述语音助手的显示界面占电子设备的整体显示界面的比例小于0.5;所述大半屏态是指所述语音助手的显示界面占电子设备的整体显示界面的比例大于0.5;所述第一显示形态为小半屏态。
  16. 根据权利要求15所述的电子设备,其特征在于,当所述处理器从所述存储器中读取所述计算机指令,以使得所述电子设备还执行如下操作:
    若所述指示信息不缺少关键词,且所述指示信息所指示的服务的反馈形式为文本反馈或语音反馈,则语音助手的显示形态为小半屏态;
    若所述指示信息不缺少关键词,且所述指示信息所指示的服务的反馈形式为卡片反馈,则语音助手的显示形态为大半屏态。
  17. 根据权利要求12-14任一项所述的电子设备,其特征在于,当所述处理器从所述存储器中读取所述计算机指令,以使得所述电子设备还执行如下操作:
    语音助手进入休眠状态;
    唤醒语音助手,并确定语音助手的显示形态。
  18. 根据权利要求17所述的电子设备,其特征在于,当所述处理器从所述存储器中读取所述计算机指令,以使得所述电子设备还执行如下操作:
    若语音助手进入休眠状态时,其显示形态为悬浮态,则唤醒语音助手后,语音助手的显示形态为所述第一显示形态;
    若语音助手进入休眠状态时,其显示形态为半屏态,则唤醒语音助手后,语音助手的显示形态为半屏态;
    若语音助手进入休眠状态时,其显示形态为全屏态,则唤醒语音助手后,语音助手的显示形态为全屏态。
  19. 根据权利要求18所述的电子设备,其特征在于,语音助手的半屏态还包括小半屏态和大半屏态,其中,所述小半屏态是指所述语音助手的显示界面占电子设备的整体显示界面的比例小于0.5;所述大半屏态是指所述语音助手的显示界面占电子设备的整体显示界面的比例大于0.5;所述第一显示形态为小半屏态;当所述处理器从所述存储器中读取所述计算机指令,以使得所述电子设备还执行如下操作:
    所述唤醒语音助手,并确定语音助手的显示形态,包括:
    所述若语音助手进入休眠状态时,其显示形态为小半屏态,则唤醒语音助手后,语音助手的显示形态为小半屏态;
    所述若语音助手进入休眠状态时,其显示形态为大半屏态,则唤醒语音助手后,语音助手的显示形态为大半屏态。
  20. 根据权利要求17或19所述的电子设备,其特征在于,当所述处理器从所述存储器中读取所述计算机指令,以使得所述电子设备还执行如下操作:
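    determining a new display form of the voice assistant according to new instruction information and the service indicated by the new instruction information.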
  21. 根据权利要求20所述的电子设备,其特征在于,当所述处理器从所述存储器中读取所述计算机指令,以使得所述电子设备还执行如下操作:
    若语音助手的显示形态为全屏态,且所述新的指示信息指示的服务的反馈形式为文本反馈、语音反馈或者卡片反馈,则语音助手的新的显示形态为全屏态;所述指示信息所指示的服务所涉及到的应用在语音助手的显示界面中以卡片形式显示,则所述指示信息所指示的服务的反馈形式为卡片反馈;
    若语音助手的显示形态为全屏态,且所述新的指示信息指示的服务的反馈形式为分屏反馈,则语音助手的新的显示形态为悬浮态;所述新的指示信息所指示的服务涉及到应用界面切换,则所述新的指示信息所指示的服务的反馈形式为分屏反馈;
    若语音助手的显示形态为半屏态,且所述新的指示信息缺少关键词,则语音助手的新的显示形态为全屏态;
    若语音助手的显示形态为半屏态,且所述新的指示信息不缺少关键词,所述新的指示信息对应的服务的反馈形式为文本反馈或语音反馈,则语音助手的新的显示形态为半屏态;
    若语音助手的显示形态为半屏态,且所述新的指示信息不缺少关键词,所述新的指示信息对应的服务的反馈形式为卡片反馈,则语音助手的新的显示形态为全屏态;
    若语音助手的显示形态为半屏态,且所述新的指示信息不缺少关键词,所述新的指示信息对应的服务的反馈形式为分屏反馈,则语音助手的新的显示形态为悬浮态。
  22. 根据权利要求20或21所述的电子设备,其特征在于,语音助手的半屏态还包括小半屏态和大半屏态,其中,所述小半屏态是指所述语音助手的显示界面占电子设备的整体显示界面的比例小于0.5;所述大半屏态是指所述语音助手的显示界面占电子设备的整体显示界面的比例大于0.5;所述第一显示形态为小半屏态;当所述处理器从所述存储器中读取所述计算机指令,以使得所述电子设备还执行如下操作:
    所述根据新的指示信息以及所述新的指示信息对应的服务,确定语音助手的新的显示形态,包括:
    若语音助手的显示形态为小半屏态,且所述新的指示信息不缺少关键词,所述新的指示信息所指示的服务的反馈形式为文本反馈或语音反馈,则语音助手的显示形态为小半屏态;
    若语音助手的显示形态为小半屏态,所述新的指示信息不缺少关键词,所述新的指示信息所指示的服务的反馈形式为卡片反馈,则语音助手的显示形态为大半屏态。
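  23. A computer storage medium, comprising computer instructions, wherein when the computer instructions are run on an electronic device, the electronic device is caused to perform the voice assistant display method according to any one of claims 1-11.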
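  24. A chip system, comprising one or more processors, wherein when the one or more processors execute instructions, the one or more processors perform the voice assistant display method according to any one of claims 1-11.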
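  25. A graphical user interface on an electronic device, wherein the electronic device has a display screen, a camera, a memory, and one or more processors, the one or more processors being configured to execute one or more computer programs stored in the memory, and the graphical user interface comprises a graphical user interface displayed when the electronic device performs the voice assistant display method according to any one of claims 1-11.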
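
The display-form rules recited in claims 18, 19, 21 and 22 can be read as a small state machine. The following Python sketch is illustrative only and is not part of the claims: the names `Form`, `Feedback`, `wake` and `next_form` are hypothetical, and two points are assumptions of this sketch rather than statements in the claims: a large half-screen state is kept as-is for text or voice feedback, and the card rule of claim 22 is taken to refine the card rule of claim 21 when the current form is the small half-screen state.

```python
from enum import Enum
from typing import Optional


class Form(Enum):
    """Display forms of the voice assistant (names are hypothetical)."""
    FLOATING = "floating"
    SMALL_HALF = "small half-screen"
    LARGE_HALF = "large half-screen"
    FULL = "full-screen"


class Feedback(Enum):
    """Feedback forms of the service indicated by the instruction information."""
    TEXT = "text"
    VOICE = "voice"
    CARD = "card"
    SPLIT_SCREEN = "split-screen"


HALF_FORMS = (Form.SMALL_HALF, Form.LARGE_HALF)
FIRST_FORM = Form.SMALL_HALF  # claims 19 and 22: the first display form is small half-screen


def wake(form_at_sleep: Form) -> Form:
    """Claims 18-19: form after wake-up, given the form when the assistant went dormant."""
    return FIRST_FORM if form_at_sleep is Form.FLOATING else form_at_sleep


def next_form(current: Form, missing_keyword: bool,
              feedback: Optional[Feedback]) -> Form:
    """Claims 21-22: new display form for new instruction information."""
    if current is Form.FULL:
        if feedback is Feedback.SPLIT_SCREEN:
            return Form.FLOATING
        if feedback in (Feedback.TEXT, Feedback.VOICE, Feedback.CARD):
            return Form.FULL
    elif current in HALF_FORMS:
        if missing_keyword:
            return Form.FULL
        if feedback in (Feedback.TEXT, Feedback.VOICE):
            return current  # assumption: a large half-screen stays large
        if feedback is Feedback.CARD:
            # assumption: claim 22 refines claim 21 for the small half-screen state
            return Form.LARGE_HALF if current is Form.SMALL_HALF else Form.FULL
        if feedback is Feedback.SPLIT_SCREEN:
            return Form.FLOATING
    # Combinations the claims do not address (e.g. a floating current form)
    raise ValueError(f"not covered by claims 21-22: {current}, {missing_keyword}, {feedback}")
```

For example, `next_form(Form.SMALL_HALF, False, Feedback.CARD)` yields the large half-screen state under these assumptions, while the same call with `Form.LARGE_HALF` yields the full-screen state. Combinations that the claims leave open raise an error instead of guessing a result.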
PCT/CN2020/114899 2019-09-18 2020-09-11 Voice assistant display method and apparatus WO2021052263A1 (zh)

Applications Claiming Priority (2)

Application Number Priority Date Filing Date Title
CN201910883296.9 2019-09-18
CN201910883296.9A CN110825469A (zh) 2019-09-18 2019-09-18 Voice assistant display method and apparatus

Publications (1)

Publication Number Publication Date
WO2021052263A1 (zh) 2021-03-25

Family

ID=69548053

Family Applications (1)

Application Number Title Priority Date Filing Date
PCT/CN2020/114899 WO2021052263A1 (zh) 2019-09-18 2020-09-11 Voice assistant display method and apparatus

Country Status (2)

Country Link
CN (1) CN110825469A (zh)
WO (1) WO2021052263A1 (zh)

Families Citing this family (60)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
US9318108B2 (en) 2010-01-18 2016-04-19 Apple Inc. Intelligent automated assistant
US8977255B2 (en) 2007-04-03 2015-03-10 Apple Inc. Method and system for operating a multi-function portable electronic device using voice-activation
US8676904B2 (en) 2008-10-02 2014-03-18 Apple Inc. Electronic devices with voice command and contextual data processing capabilities
US10706373B2 (en) 2011-06-03 2020-07-07 Apple Inc. Performing actions associated with task items that represent tasks to perform
US10417037B2 (en) 2012-05-15 2019-09-17 Apple Inc. Systems and methods for integrating third party services with a digital assistant
CN113470640B (zh) 2013-02-07 2022-04-26 苹果公司 Voice trigger for a digital assistant
US10652394B2 (en) 2013-03-14 2020-05-12 Apple Inc. System and method for processing voicemail
US10748529B1 (en) 2013-03-15 2020-08-18 Apple Inc. Voice activated device for use with a voice-based digital assistant
US10176167B2 (en) 2013-06-09 2019-01-08 Apple Inc. System and method for inferring user intent from speech inputs
CN105453026A (zh) 2013-08-06 2016-03-30 苹果公司 Automatically activating intelligent responses based on activity from a remote device
US10170123B2 (en) 2014-05-30 2019-01-01 Apple Inc. Intelligent assistant for home automation
US9715875B2 (en) 2014-05-30 2017-07-25 Apple Inc. Reducing the need for manual start/end-pointing and trigger phrases
US9966065B2 (en) 2014-05-30 2018-05-08 Apple Inc. Multi-command single utterance input method
US9338493B2 (en) 2014-06-30 2016-05-10 Apple Inc. Intelligent automated assistant for TV user interactions
US9886953B2 (en) 2015-03-08 2018-02-06 Apple Inc. Virtual assistant activation
US10460227B2 (en) 2015-05-15 2019-10-29 Apple Inc. Virtual assistant in a communication session
US10200824B2 (en) 2015-05-27 2019-02-05 Apple Inc. Systems and methods for proactively identifying and surfacing relevant content on a touch-sensitive device
US20160378747A1 (en) 2015-06-29 2016-12-29 Apple Inc. Virtual assistant for media playback
US10671428B2 (en) 2015-09-08 2020-06-02 Apple Inc. Distributed personal assistant
US10331312B2 (en) 2015-09-08 2019-06-25 Apple Inc. Intelligent automated assistant in a media environment
US10740384B2 (en) 2015-09-08 2020-08-11 Apple Inc. Intelligent automated assistant for media search and playback
US10747498B2 (en) 2015-09-08 2020-08-18 Apple Inc. Zero latency digital assistant
US10691473B2 (en) 2015-11-06 2020-06-23 Apple Inc. Intelligent automated assistant in a messaging environment
US10956666B2 (en) 2015-11-09 2021-03-23 Apple Inc. Unconventional virtual assistant interactions
US10223066B2 (en) 2015-12-23 2019-03-05 Apple Inc. Proactive assistance based on dialog communication between devices
US10586535B2 (en) 2016-06-10 2020-03-10 Apple Inc. Intelligent digital assistant in a multi-tasking environment
DK201670540A1 (en) 2016-06-11 2018-01-08 Apple Inc Application integration with a digital assistant
DK179415B1 (en) 2016-06-11 2018-06-14 Apple Inc Intelligent device arbitration and control
DK180048B1 (en) 2017-05-11 2020-02-04 Apple Inc. MAINTAINING THE DATA PROTECTION OF PERSONAL INFORMATION
US10726832B2 (en) 2017-05-11 2020-07-28 Apple Inc. Maintaining privacy of personal information
DK179496B1 (en) 2017-05-12 2019-01-15 Apple Inc. USER-SPECIFIC Acoustic Models
DK179745B1 (en) 2017-05-12 2019-05-01 Apple Inc. SYNCHRONIZATION AND TASK DELEGATION OF A DIGITAL ASSISTANT
DK201770427A1 (en) 2017-05-12 2018-12-20 Apple Inc. LOW-LATENCY INTELLIGENT AUTOMATED ASSISTANT
US20180336892A1 (en) 2017-05-16 2018-11-22 Apple Inc. Detecting a trigger of a digital assistant
US20180336275A1 (en) 2017-05-16 2018-11-22 Apple Inc. Intelligent automated assistant for media exploration
US10818288B2 (en) 2018-03-26 2020-10-27 Apple Inc. Natural assistant interaction
US11145294B2 (en) 2018-05-07 2021-10-12 Apple Inc. Intelligent automated assistant for delivering content from user experiences
US10928918B2 (en) 2018-05-07 2021-02-23 Apple Inc. Raise to speak
DK179822B1 (da) 2018-06-01 2019-07-12 Apple Inc. Voice interaction at a primary device to access call functionality of a companion device
DK180639B1 (en) 2018-06-01 2021-11-04 Apple Inc DISABILITY OF ATTENTION-ATTENTIVE VIRTUAL ASSISTANT
US10892996B2 (en) 2018-06-01 2021-01-12 Apple Inc. Variable latency device coordination
US11462215B2 (en) 2018-09-28 2022-10-04 Apple Inc. Multi-modal inputs for voice commands
US11348573B2 (en) 2019-03-18 2022-05-31 Apple Inc. Multimodality in digital assistant systems
DK201970509A1 (en) 2019-05-06 2021-01-15 Apple Inc Spoken notifications
US11307752B2 (en) 2019-05-06 2022-04-19 Apple Inc. User configurable task triggers
US11140099B2 (en) 2019-05-21 2021-10-05 Apple Inc. Providing message response suggestions
DK180129B1 (en) 2019-05-31 2020-06-02 Apple Inc. USER ACTIVITY SHORTCUT SUGGESTIONS
DK201970511A1 (en) 2019-05-31 2021-02-15 Apple Inc Voice identification in digital assistant systems
US11468890B2 (en) 2019-06-01 2022-10-11 Apple Inc. Methods and user interfaces for voice-based control of electronic devices
CN110825469A (zh) * 2019-09-18 2020-02-21 华为技术有限公司 Voice assistant display method and apparatus
US11038934B1 (en) 2020-05-11 2021-06-15 Apple Inc. Digital assistant hardware abstraction
US11061543B1 (en) 2020-05-11 2021-07-13 Apple Inc. Providing relevant data items based on context
CN111833868A (zh) * 2020-06-30 2020-10-27 北京小米松果电子有限公司 Voice assistant control method, apparatus, and computer-readable storage medium
US11490204B2 (en) 2020-07-20 2022-11-01 Apple Inc. Multi-device audio adjustment coordination
US11438683B2 (en) 2020-07-21 2022-09-06 Apple Inc. User identification using headphones
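CN111813491B (zh) * 2020-08-19 2020-12-18 广州汽车集团股份有限公司 Anthropomorphic interaction method and apparatus for an in-vehicle assistant, and automobile
CN115700451A (zh) * 2021-07-28 2023-02-07 华为技术有限公司 Service recommendation method and electronic device
CN113805747B (zh) * 2021-08-12 2023-07-25 荣耀终端有限公司 Information reminder method, electronic device, and computer-readable storage medium
CN113778315A (zh) * 2021-08-27 2021-12-10 京东方科技集团股份有限公司 Data interaction method, apparatus, system, and electronic device
CN114327349B (zh) * 2021-12-13 2024-03-22 青岛海尔科技有限公司 Smart card determination method and apparatus, storage medium, and electronic apparatus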

Citations (6)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
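CN109151200A (zh) * 2018-08-27 2019-01-04 维沃移动通信有限公司 Communication method and mobile terminal
CN109491562A (zh) * 2018-10-09 2019-03-19 珠海格力电器股份有限公司 Interface display method for a voice assistant application, and terminal device
CN109584879A (zh) * 2018-11-23 2019-04-05 华为技术有限公司 Voice control method and electronic device
WO2019124841A1 (ko) * 2017-12-22 2019-06-27 삼성전자 주식회사 Electronic device and method for executing a function according to stroke input
CN110018858A (zh) * 2019-04-02 2019-07-16 北京蓦然认知科技有限公司 Application management method and apparatus based on voice control
CN110825469A (zh) * 2019-09-18 2020-02-21 华为技术有限公司 Voice assistant display method and apparatus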

Family Cites Families (8)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
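CN102655554B (zh) * 2012-04-19 2016-08-17 惠州Tcl移动通信有限公司 Wireless communication device and control method thereof during navigation
CN105302837A (zh) * 2014-07-31 2016-02-03 腾讯科技(深圳)有限公司 Method and terminal for querying information
CN104731613A (zh) * 2015-01-30 2015-06-24 深圳市中兴移动通信有限公司 Application quick-start method and system
CN104898952B (zh) * 2015-06-16 2019-05-28 魅族科技(中国)有限公司 Method for implementing split screen on a terminal, and terminal
CN107102806A (zh) * 2017-01-25 2017-08-29 维沃移动通信有限公司 Split-screen input method and mobile terminal
CN107315518A (zh) * 2017-06-27 2017-11-03 努比亚技术有限公司 Terminal split-screen method, apparatus, and computer-readable storage medium
CN109243462A (zh) * 2018-11-20 2019-01-18 广东小天才科技有限公司 Voice wake-up method and apparatus
CN109669754A (zh) * 2018-12-25 2019-04-23 苏州思必驰信息科技有限公司 Dynamic display method for a voice interaction window, and voice interaction method and apparatus with a retractable interaction window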

Patent Citations (6)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
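WO2019124841A1 (ko) * 2017-12-22 2019-06-27 삼성전자 주식회사 Electronic device and method for executing a function according to stroke input
CN109151200A (zh) * 2018-08-27 2019-01-04 维沃移动通信有限公司 Communication method and mobile terminal
CN109491562A (zh) * 2018-10-09 2019-03-19 珠海格力电器股份有限公司 Interface display method for a voice assistant application, and terminal device
CN109584879A (zh) * 2018-11-23 2019-04-05 华为技术有限公司 Voice control method and electronic device
CN110018858A (zh) * 2019-04-02 2019-07-16 北京蓦然认知科技有限公司 Application management method and apparatus based on voice control
CN110825469A (zh) * 2019-09-18 2020-02-21 华为技术有限公司 Voice assistant display method and apparatus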

Also Published As

Publication number Publication date
CN110825469A (zh) 2020-02-21

Similar Documents

Publication Publication Date Title
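WO2021052263A1 (zh) Voice assistant display method and apparatus
KR102470275B1 (ko) Voice control method and electronic device
WO2021063343A1 (zh) Voice interaction method and apparatus
WO2020259452A1 (zh) Full-screen display method and device for a mobile terminal
WO2021213164A1 (zh) Application interface interaction method, electronic device, and computer-readable storage medium
WO2021000807A1 (zh) Method and apparatus for processing a waiting scenario in an application
WO2021000804A1 (zh) Display method and apparatus in a locked state
WO2021036770A1 (zh) Split-screen processing method and terminal device
WO2021052282A1 (zh) Data processing method, Bluetooth module, electronic device, and readable storage medium
WO2020073288A1 (zh) Method for triggering an electronic device to execute a function, and electronic device
US20230021994A1 (en) Cross-Device Content Projection Method and Electronic Device
WO2020150917A1 (zh) Application permission management method and electronic device
WO2022037726A1 (zh) Split-screen display method and electronic device
WO2021052139A1 (zh) Gesture input method and electronic device
WO2021218429A1 (zh) Application window management method, terminal device, and computer-readable storage medium
WO2021143391A1 (zh) Video-call-based screen sharing method and mobile device
WO2022127130A1 (zh) Method for adding an operation sequence, electronic device, and system
WO2021129453A1 (zh) Screenshot method and related device
CN115206308A (zh) Human-computer interaction method and electronic device
WO2024012346A1 (zh) Task migration method, electronic device, and system
WO2022052767A1 (zh) Method for controlling a device, electronic device, and system
WO2024109573A1 (zh) Floating window display method and electronic device
WO2022042774A1 (zh) Profile picture display method and electronic device
CN118131891A (zh) Human-computer interaction method and apparatus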

Legal Events

Date Code Title Description
121 Ep: the epo has been informed by wipo that ep was designated in this application. Ref document number: 20866242; Country of ref document: EP; Kind code of ref document: A1
NENP Non-entry into the national phase. Ref country code: DE
122 Ep: pct application non-entry in european phase. Ref document number: 20866242; Country of ref document: EP; Kind code of ref document: A1