CN113608715A - Display device and voice service switching method - Google Patents

Info

Publication number
CN113608715A
Authority
CN
China
Prior art keywords
far
control function
voice
field
voice service
Prior art date
Legal status (The legal status is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the status listed.)
Pending
Application number
CN202110934009.XA
Other languages
Chinese (zh)
Inventor
黄媛媛
鲁亚凯
Current Assignee (The listed assignees may be inaccurate. Google has not performed a legal analysis and makes no representation or warranty as to the accuracy of the list.)
Vidaa Netherlands International Holdings BV
Vidaa USA Inc
Original Assignee
Vidaa Netherlands International Holdings BV
Vidaa USA Inc
Priority date (The priority date is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the date listed.)
Filing date
Publication date
Application filed by Vidaa Netherlands International Holdings BV, Vidaa USA Inc filed Critical Vidaa Netherlands International Holdings BV
Priority to CN202110934009.XA priority Critical patent/CN113608715A/en
Publication of CN113608715A publication Critical patent/CN113608715A/en
Priority to PCT/EP2022/071907 priority patent/WO2023012260A1/en
Pending legal-status Critical Current

Classifications

    • G PHYSICS
    • G06 COMPUTING; CALCULATING OR COUNTING
    • G06F ELECTRIC DIGITAL DATA PROCESSING
    • G06F3/00 Input arrangements for transferring data to be processed into a form capable of being handled by the computer; Output arrangements for transferring data from processing unit to output unit, e.g. interface arrangements
    • G06F3/16 Sound input; Sound output
    • G06F3/167 Audio in a user interface, e.g. using voice commands for navigating, audio feedback
    • G PHYSICS
    • G06 COMPUTING; CALCULATING OR COUNTING
    • G06F ELECTRIC DIGITAL DATA PROCESSING
    • G06F3/00 Input arrangements for transferring data to be processed into a form capable of being handled by the computer; Output arrangements for transferring data from processing unit to output unit, e.g. interface arrangements
    • G06F3/14 Digital output to display device; Cooperation and interconnection of the display device with other functional units
    • G PHYSICS
    • G06 COMPUTING; CALCULATING OR COUNTING
    • G06F ELECTRIC DIGITAL DATA PROCESSING
    • G06F3/00 Input arrangements for transferring data to be processed into a form capable of being handled by the computer; Output arrangements for transferring data from processing unit to output unit, e.g. interface arrangements
    • G06F3/16 Sound input; Sound output
    • G06F3/162 Interface to dedicated audio devices, e.g. audio drivers, interface to CODECs

Abstract

With the display device and the voice service switching method provided by the embodiments of the application, when a user switches the target voice service from the second voice service to the first voice service, the far-field voice control function is closed. This prevents the far-field voice collector from continuing to acquire voice data, avoids interference of the far-field voice control function with the near-field voice control function of the first voice service, and optimizes the user experience. The switch state of the far-field voice control function at the time the target voice service is switched to the first voice service is recorded as the historical switch state of the far-field voice control function. When the user switches the target voice service from the first voice service to the second voice service, the far-field voice control function is turned on or off according to the recorded historical switch state, so that after the target voice service is switched to the second voice service, the switch state of the far-field voice control function is consistent with its historical switch state.

Description

Display device and voice service switching method
Technical Field
The present application relates to the field of display device technologies, and in particular, to a display device and a voice service switching method.
Background
The display device may provide a user with playback of content such as audio, video, and pictures. Nowadays, display devices can provide users with not only live television program content received through data broadcasting, but also various applications and service content such as network video programs, network games, and the like.
Some display devices support voice service products offered by multiple third-party providers. For example, the AVS (Alexa Voice Service) voice service and the Google Assistant built-in voice service may coexist in the operating system of the display device. When a user controls the display device through voice, the voice control function is realized by either the AVS voice service or the Google Assistant built-in voice service. The AVS voice service supports at least a near-field voice control function and can also support a far-field voice control function, whereas the Google Assistant built-in voice service supports only the near-field voice control function.
For a display device on which multiple voice services coexist, how to optimize the user experience of the voice control function, especially the user experience when switching voice services, is a problem to be solved urgently.
Disclosure of Invention
The application provides a display device and a voice service switching method, which can optimize the user experience of a user on a voice control function, particularly the user experience when switching voice services.
In a first aspect, the present application provides a display device comprising:
a display for displaying a user interface;
the far-field voice collector is used for collecting far-field voice data;
a controller configured to:
receiving an input voice service selection operation, determining a first voice service or a second voice service as a target voice service according to the selection operation, and realizing a voice control function through the target voice service, wherein the first voice service only supports a near-field voice control function, the second voice service supports a far-field voice control function and a near-field voice control function, and the far-field voice control function can be controlled to be turned on or turned off;
after the first voice service is determined as a target voice service according to the selection operation, recording the current switch state of the far-field voice control function as the historical switch state of the far-field voice control function, and controlling to close the far-field voice control function when the current switch state is in an open state;
and after the second voice service is determined as the target voice service according to the selection operation, if the historical switch state of the far-field voice control function is the opening state, the far-field voice control function is controlled to be opened.
In a second aspect, the present application further provides a voice service switching method, which is applied to a display device configured to be capable of operating a first voice service and a second voice service, where the first voice service supports only a near-field voice control function, the second voice service supports a far-field voice control function and a near-field voice control function, and the far-field voice control function can be controlled to be turned on or off; the method comprises the following steps:
receiving input voice service selection operation, determining a first voice service or a second voice service as a target voice service according to the selection operation, and realizing a voice control function through the target voice service;
after the first voice service is determined as a target voice service according to the selection operation, recording the current switch state of the far-field voice control function as the historical switch state of the far-field voice control function, and controlling to close the far-field voice control function when the current switch state is in an open state;
and after the second voice service is determined as the target voice service according to the selection operation, if the historical switch state of the far-field voice control function is the opening state, the far-field voice control function is controlled to be opened.
With the display device and the voice service switching method provided by the embodiments of the application, when the user switches the target voice service from the second voice service to the first voice service, the far-field voice control function is closed. This prevents the far-field voice collector from continuing to collect voice data, avoids interference of the far-field voice control function with the near-field voice control function of the first voice service, and optimizes the user experience. The switch state of the far-field voice control function at the time the target voice service is switched to the first voice service is recorded as the historical switch state of the far-field voice control function. When the user switches the target voice service from the first voice service to the second voice service, the far-field voice control function is turned on or off according to the recorded historical switch state, so that after the target voice service is switched to the second voice service, the switch state of the far-field voice control function is consistent with its historical switch state.
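The record-and-restore behavior described above can be sketched as a small state holder. This is only an illustrative sketch, not the implementation of the application: the class name `VoiceServiceSwitcher`, its field names, and the choice to ignore re-selection of the already active service are assumptions introduced here.

```python
from dataclasses import dataclass
from enum import Enum


class VoiceService(Enum):
    FIRST = "first"    # supports near-field voice control only
    SECOND = "second"  # supports near-field and far-field voice control


@dataclass
class VoiceServiceSwitcher:
    """Hypothetical model of the controller's switching logic."""

    target: VoiceService = VoiceService.SECOND
    far_field_on: bool = False       # current switch state
    far_field_history: bool = False  # recorded historical switch state

    def select(self, service: VoiceService) -> None:
        if service is self.target:
            # Assumption: re-selecting the active service changes nothing,
            # so a previously recorded history is not overwritten.
            return
        if service is VoiceService.FIRST:
            # Record the current state, then close far-field control.
            self.far_field_history = self.far_field_on
            self.far_field_on = False
        else:
            # Reopen far-field control only if it was open before the switch.
            self.far_field_on = self.far_field_history
        self.target = service


switcher = VoiceServiceSwitcher(far_field_on=True)
switcher.select(VoiceService.FIRST)   # far-field control is closed
switcher.select(VoiceService.SECOND)  # far-field control is restored
```

Keeping the history in a separate field means the user's earlier far-field choice survives any number of intermediate operations while the first voice service is active.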
Drawings
In order to more clearly explain the technical solution of the present application, the drawings needed to be used in the embodiments will be briefly described below, and it is obvious to those skilled in the art that other drawings can be obtained according to the drawings without any creative effort.
FIG. 1 illustrates an operational scenario between a display device and a control apparatus, in accordance with some embodiments;
FIG. 2 illustrates a hardware configuration block diagram of the control apparatus 100, according to some embodiments;
FIG. 3 illustrates a hardware configuration block diagram of the display device 200, according to some embodiments;
FIG. 4 illustrates a software configuration diagram in the display device 200, according to some embodiments;
FIG. 5 illustrates a voice settings interface, according to some embodiments;
FIG. 6 illustrates a media asset application home page entered by the display device after power-on, in accordance with some embodiments;
FIG. 7 illustrates another media asset application home page, according to some embodiments;
FIG. 8 illustrates a far-field voice settings interface, according to some embodiments;
FIG. 9 illustrates another far-field voice settings interface, in accordance with some embodiments;
FIG. 10 illustrates another media asset application home page entered after the display device is powered on, in accordance with some embodiments;
FIG. 11 illustrates a flow diagram of a voice service switching method, according to some embodiments.
Detailed Description
To make the purpose and embodiments of the present application clearer, the following will clearly and completely describe the exemplary embodiments of the present application with reference to the attached drawings in the exemplary embodiments of the present application, and it is obvious that the described exemplary embodiments are only a part of the embodiments of the present application, and not all of the embodiments.
It should be noted that the brief descriptions of the terms in the present application are only for the convenience of understanding the embodiments described below, and are not intended to limit the embodiments of the present application. These terms should be understood in their ordinary and customary meaning unless otherwise indicated.
The terms "first," "second," "third," and the like in the description and claims of this application and in the above-described drawings are used for distinguishing between similar or analogous objects or entities and not necessarily for describing a particular sequential or chronological order, unless otherwise indicated. It is to be understood that the terms so used are interchangeable under appropriate circumstances.
The terms "comprises" and "comprising," and any variations thereof, are intended to cover a non-exclusive inclusion, such that a product or apparatus that comprises a list of elements is not necessarily limited to all elements expressly listed, but may include other elements not expressly listed or inherent to such product or apparatus.
The term "module" refers to any known or later developed hardware, software, firmware, artificial intelligence, fuzzy logic, or combination of hardware and/or software code that is capable of performing the functionality associated with that element.
Fig. 1 is a schematic diagram of an operation scenario between a display device and a control apparatus according to an embodiment. As shown in fig. 1, a user may operate the display apparatus 200 through the smart device 300 or the control device 100.
In some embodiments, the control apparatus 100 may be a remote controller. The remote controller communicates with the display device through infrared protocol communication, bluetooth protocol communication, or other short-distance communication methods, and controls the display device 200 in a wireless or wired manner. The user may input a user instruction through keys on the remote controller, voice input, control panel input, etc., to control the display device 200.
In some embodiments, the smart device 300 (e.g., mobile terminal, tablet, computer, laptop, etc.) may also be used to control the display device 200. For example, the display device 200 is controlled using an application program running on the smart device.
In some embodiments, the display device 200 may also be controlled in a manner other than through the control apparatus 100 and the smart device 300. For example, a voice command of the user may be received directly by a module configured inside the display device 200, or by a voice control device provided outside the display device 200.
In some embodiments, the display device 200 is also in data communication with a server 400. The display device 200 may be communicatively connected through a Local Area Network (LAN), a Wireless Local Area Network (WLAN), and other networks. The server 400 may provide various contents and interactions to the display device 200. The server 400 may be a cluster or a plurality of clusters, and may include one or more types of servers.
Fig. 2 exemplarily shows a block diagram of a configuration of the control apparatus 100 according to an exemplary embodiment. As shown in fig. 2, the control apparatus 100 includes a controller 110, a communication interface 130, a user input/output interface 140, a memory, and a power supply. The control apparatus 100 may receive an input operation instruction from a user and convert the operation instruction into an instruction that the display device 200 can recognize and respond to, serving as an interaction intermediary between the user and the display device 200.
Fig. 3 shows a hardware configuration block diagram of the display apparatus 200 according to an exemplary embodiment.
In some embodiments, the display apparatus 200 includes at least one of a tuner demodulator 210, a communicator 220, a detector 230, an external device interface 240, a controller 250, a display 260, an audio output interface 270, a memory, a power supply, a user interface.
In some embodiments the controller comprises a processor, a video processor, an audio processor, a graphics processor, a RAM, a ROM, a first interface to an nth interface for input/output.
In some embodiments, the display 260 includes a display screen component for presenting a picture and a driving component for driving image display. The display 260 receives image signals output from the controller and displays video content, image content, a menu manipulation interface, and a user manipulation UI interface.
In some embodiments, the display 260 may be a liquid crystal display, an OLED display, or a projection display, and may also be a projection device and a projection screen.
In some embodiments, communicator 220 is a component for communicating with external devices or servers according to various communication protocol types. For example: the communicator may include at least one of a Wifi module, a bluetooth module, a wired ethernet module, and other network communication protocol chips or near field communication protocol chips, and an infrared receiver. The display apparatus 200 may establish transmission and reception of control signals and data signals with the external control apparatus 100 or the server 400 through the communicator 220.
In some embodiments, the user interface may be configured to receive control signals from the control apparatus 100 (e.g., an infrared remote controller, etc.).
In some embodiments, the detector 230 is used to collect signals of the external environment or interaction with the outside. For example, detector 230 includes a light receiver, a sensor for collecting ambient light intensity; alternatively, the detector 230 includes an image collector, such as a camera, which may be used to collect external environment scenes, attributes of the user, or user interaction gestures, or the detector 230 includes a sound collector, such as a microphone, which is used to receive external sounds.
In some embodiments, the external device interface 240 may include, but is not limited to, the following: High Definition Multimedia Interface (HDMI), analog or digital high definition component input interface (component), composite video input interface (CVBS), USB input interface (USB), RGB port, and the like. The interface may also be a composite input/output interface formed by a plurality of the above interfaces.
In some embodiments, the tuner demodulator 210 receives broadcast television signals via wired or wireless reception, and demodulates audio/video signals, such as EPG data signals, from a plurality of wireless or wired broadcast television signals.
In some embodiments, the controller 250 and the tuner demodulator 210 may be located in separate devices, that is, the tuner demodulator 210 may also be located in a device external to the main device where the controller 250 is located, such as an external set-top box.
In some embodiments, the controller 250 controls the operation of the display device and responds to user operations through various software control programs stored in memory. The controller 250 controls the overall operation of the display apparatus 200. For example: in response to receiving a user command for selecting a UI object to be displayed on the display 260, the controller 250 may perform an operation related to the object selected by the user command.
In some embodiments, the object may be any one of selectable objects, such as a hyperlink, an icon, or other actionable control. Operations related to the selected object include, for example, displaying the page, document, or image linked to by a hyperlink, or running the program corresponding to an icon.
In some embodiments, the controller comprises at least one of a Central Processing Unit (CPU), a video processor, an audio processor, a Graphics Processing Unit (GPU), a Random Access Memory (RAM), a Read-Only Memory (ROM), a first to nth interface for input/output, a communication bus (Bus), and the like.
The CPU is used for executing operating system and application program instructions stored in the memory, and for executing various application programs, data, and contents according to various interactive instructions received from external input, so as to finally display and play various audio-video contents. The CPU may include a plurality of processors, e.g., a main processor and one or more sub-processors.
In some embodiments, the graphics processor is used for generating various graphics objects, such as icons, operation menus, and graphics displayed for user input instructions. The graphics processor comprises an arithmetic unit, which performs operations on the various interactive instructions input by a user and displays various objects according to their display attributes. It also comprises a renderer, which renders the objects obtained by the arithmetic unit for display on the display.
In some embodiments, the video processor is configured to receive an external video signal and perform video processing such as decompression, decoding, scaling, noise reduction, frame rate conversion, resolution conversion, and image synthesis according to a standard codec protocol of the input signal, so as to obtain a signal that can be displayed or played directly on the display device 200.
In some embodiments, the video processor includes a demultiplexing module, a video decoding module, an image synthesis module, a frame rate conversion module, a display formatting module, and the like. The demultiplexing module demultiplexes the input audio and video data stream. The video decoding module processes the demultiplexed video signal, including decoding, scaling, and the like. The image synthesis module superimposes and mixes the GUI signal, which is generated by the graphics generator either on its own or in response to user input, with the scaled video image, so as to generate an image signal for display. The frame rate conversion module converts the frame rate of the input video. The display formatting module converts the frame-rate-converted video output signal into a signal that conforms to the display format, such as an output RGB data signal.
In some embodiments, the audio processor is configured to receive an external audio signal, decompress and decode the received audio signal according to a standard codec protocol of the input signal, and perform noise reduction, digital-to-analog conversion, and amplification processing to obtain an audio signal that can be played in the speaker.
In some embodiments, a user may enter user commands on a Graphical User Interface (GUI) displayed on display 260, and the user input interface receives the user input commands through the Graphical User Interface (GUI). Alternatively, the user may input the user command by inputting a specific sound or gesture, and the user input interface receives the user input command by recognizing the sound or gesture through the sensor.
In some embodiments, a "user interface" is a media interface for interaction and information exchange between an application or operating system and a user that enables conversion between an internal form of information and a form that is acceptable to the user. A commonly used presentation form of the User Interface is a Graphical User Interface (GUI), which refers to a User Interface related to computer operations and displayed in a graphical manner. It may be an interface element such as an icon, a window, a control, etc. displayed in the display screen of the electronic device, where the control may include a visual interface element such as an icon, a button, a menu, a tab, a text box, a dialog box, a status bar, a navigation bar, a Widget, etc.
In some embodiments, as shown in fig. 4, the system of the display device may include a kernel (Kernel), a command parser (shell), a file system, and application programs. The kernel, shell, and file system together make up the basic operating system structure that allows users to manage files, run programs, and use the system. After power-on, the kernel is started, kernel space is activated, hardware is abstracted, hardware parameters are initialized, and virtual memory, the scheduler, signals, and interprocess communication (IPC) are operated and maintained. After the kernel is started, the shell and the user application programs are loaded. An application program is compiled into machine code after being started, forming a process.
As shown in fig. 4, the system of the display device is divided into three layers, i.e., an application layer, a middleware layer and a hardware layer from top to bottom.
The application layer mainly includes common applications on the television and an Application Framework. The common applications are mainly applications developed based on the browser, such as HTML5 apps, and native apps. The Application Framework is a complete program model that has all the basic functions required by standard application software, such as file access and data exchange, together with the interfaces for using these functions (toolbars, status lists, menus, dialog boxes). Native apps may support online or offline operation, message push, or local resource access.
The middleware layer comprises various television protocols, multimedia protocols, system components and other middleware. The middleware can use basic service (function) provided by system software to connect each part of an application system or different applications on a network, and can achieve the purposes of resource sharing and function sharing.
The hardware layer mainly comprises an HAL interface, hardware, and drivers. The HAL interface is a unified interface for interfacing with all television chips, and the specific logic is implemented by each chip. The drivers mainly include: an audio driver, a display driver, a bluetooth driver, a camera driver, a WIFI driver, a USB driver, an HDMI driver, sensor drivers (such as for a fingerprint sensor, a temperature sensor, a pressure sensor, etc.), a power driver, and the like.
In some embodiments, the middleware layer includes various types of system services, such as voice services. The voice service enables the user to control the display device using voice instructions, i.e. the display device may respond to the user's voice. The voice service operates in a similar manner to a general application program, except that the voice service program processes input user voice data, derives a user instruction based on the user voice data, and responds to the derived user instruction.
In some embodiments, a first voice service and a second voice service based on different engines are deployed in the operating system of the display device. For example, the first voice service may be the Google Assistant built-in voice service based on Google, which supports a near-field voice control function; the second voice service may be the Alexa Voice Service (AVS for short) based on Alexa, which supports at least a near-field voice control function and may also support a far-field voice control function. It should be understood that the two voice services belong to different third-party providers, and their functions are implemented based on different engines. The engine here may be a server.
It should be noted that the far-field voice control function and the near-field voice control function use different voice data input channels. Specifically, in the implementation of the near-field voice control function, a near-field voice acquisition device independent of the display device acquires user voice data and sends the acquired user voice data to a voice service on the display device. The near-field voice acquisition device is a device that has a voice acquisition function and can communicate with the display device, such as a voice remote controller. The voice remote controller is usually provided with a voice key. When a user presses the voice key, the remote controller is triggered to enter a sound pickup state, and at the same time the display device is triggered to enter a state of waiting to receive voice data sent by the remote controller. If the user then speaks a voice command, the voice command is collected by the voice remote controller and sent to a voice service of the display device. In the implementation of the far-field voice control function, user voice data is collected by a far-field voice collector built into the display device, and the collected user voice data is sent to a voice service on the display device. The far-field voice collector may be a sound collector. Generally, a sound collector on a display device has a physical switch. When the physical switch is in an on state, the sound collector is always ready to collect user voice data at any time; when the physical switch is in an off state, the sound collector does not collect voice data.
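The difference between the two input channels can be illustrated with a minimal sketch. The class names `FarFieldCollector` and `VoiceRemote`, and the assumption that one press of the voice key covers exactly one utterance, are hypothetical and are not taken from the application.

```python
from typing import Optional


class FarFieldCollector:
    """Built-in sound collector gated by a physical switch."""

    def __init__(self) -> None:
        self.switch_on = False

    def capture(self, utterance: str) -> Optional[str]:
        # With the switch on, the collector is always ready to collect;
        # with the switch off, nothing is collected.
        return utterance if self.switch_on else None


class VoiceRemote:
    """Near-field device that collects only after the voice key is pressed."""

    def __init__(self) -> None:
        self.listening = False

    def press_voice_key(self) -> None:
        # Pressing the voice key puts the remote into its pickup state.
        self.listening = True

    def capture(self, utterance: str) -> Optional[str]:
        if not self.listening:
            return None
        # Assumption: one key press covers a single utterance.
        self.listening = False
        return utterance


collector = FarFieldCollector()
collector.switch_on = True
remote = VoiceRemote()
remote.press_voice_key()
```

The sketch makes the source of interference visible: an enabled far-field collector picks up every utterance, including one that the user intended only for the remote controller.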
In some embodiments, the second voice service is started during the start-up of the display device system, and the first voice service is initialized and started after the second voice service is started. In other embodiments, the second voice service is started during the start-up of the display device system, and when the user switches from the second voice service to the first voice service, the first voice service is initialized and started while the second voice service is kept running. In other embodiments, the first voice service is turned off when the user switches from the first voice service to the second voice service. For a display device on which multiple voice services coexist, when user voice data is received, the display device needs to send the voice data to one of the voice services for processing; that voice service is the target voice service for realizing the voice control function. The operation of switching the voice service is an operation in which the user selects a certain voice service as the target voice service, that is, an operation in which the user sets the target voice service.
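One possible combination of these lifecycle variants can be sketched as follows. The sketch assumes, beyond what the text states, that the second voice service is the target right after boot and that the "start on first switch" and "turn off on switch back" variants are used together; the class name `VoiceServiceManager` is hypothetical.

```python
from typing import Optional, Set


class VoiceServiceManager:
    """Hypothetical lifecycle manager for two coexisting voice services."""

    def __init__(self) -> None:
        self.running: Set[str] = set()
        self.target: Optional[str] = None

    def boot(self) -> None:
        # The second voice service is started during system start-up.
        self.running.add("second")
        # Assumption: it is also the initial target voice service.
        self.target = "second"

    def switch_to(self, service: str) -> None:
        if service == "first":
            # Start the first service on demand; keep the second running.
            self.running.add("first")
        elif service == "second":
            # Turn the first service off when switching back.
            self.running.discard("first")
        else:
            raise ValueError(f"unknown voice service: {service}")
        self.target = service


manager = VoiceServiceManager()
manager.boot()
manager.switch_to("first")
manager.switch_to("second")
```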
In some embodiments, a user first controls the display device to display a voice service setting interface, which comprises a first voice service option (Google Assistant built-in) and a second voice service option (Alexa Voice Service). The user then sets a voice service as the target voice service by inputting a selection operation on the corresponding voice service option. Fig. 5 shows a voice service setting interface according to some exemplary embodiments of the present application. When the user selects the first voice service option (Google Assistant built-in) in fig. 5, the first voice service is set as the target voice service; when the user selects the second voice service option (Alexa Voice Service) in fig. 5, the second voice service is set as the target voice service.
In some implementations, the value of a designated flag bit in the system is set to a third value when the display device receives a selection operation of the first voice service option, and is set to a fourth value when the display device receives a selection operation of the second voice service option. For convenience of explanation, the flag bit characterizing the target voice service is referred to as the second flag bit. Illustratively, when the display device receives the selection operation of the first voice service option, the value of the second flag bit in the system is set to True, and when the display device receives the selection operation of the second voice service option, the value of the second flag bit in the system is set to False. In other implementations, when the display device receives a selection operation of the first voice service option, the value of the flag bit corresponding to the first voice service in the system is set to True, and the value of the flag bit corresponding to the second voice service in the system is set to False. When the display device receives the selection operation of the second voice service option, the value of the flag bit corresponding to the second voice service in the system is set to True, and the value of the flag bit corresponding to the first voice service in the system is set to False. Thus, when the display device receives user voice data, it can determine from the value of the designated flag bit whether the current target voice service is the first voice service or the second voice service, and send the user voice data to the target voice service.
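The two flag schemes described above can be sketched as follows. This is only an illustration under stated assumptions: the class names, the `VoiceService` enum, and the in-memory storage are hypothetical and do not come from the application; the True/False encoding follows the illustrative values in the text.

```python
from dataclasses import dataclass
from enum import Enum


class VoiceService(Enum):
    FIRST = "first"    # near-field only
    SECOND = "second"  # near-field and far-field


@dataclass
class SingleFlagStore:
    """Scheme 1: one 'second flag bit'; True selects the first service."""
    second_flag: bool = False

    def select(self, service: VoiceService) -> None:
        self.second_flag = service is VoiceService.FIRST

    def target(self) -> VoiceService:
        return VoiceService.FIRST if self.second_flag else VoiceService.SECOND


@dataclass
class PerServiceFlagStore:
    """Scheme 2: one flag per service; a selection sets one True, the other False."""
    first_service_flag: bool = False
    second_service_flag: bool = True

    def select(self, service: VoiceService) -> None:
        self.first_service_flag = service is VoiceService.FIRST
        self.second_service_flag = service is VoiceService.SECOND

    def target(self) -> VoiceService:
        return VoiceService.FIRST if self.first_service_flag else VoiceService.SECOND
```

Both stores answer the same question (which service is the target), so the rest of the system can stay unaware of which encoding is used.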
In some embodiments, the second voice service supports a far-field voice control function and a near-field voice control function, and the far-field voice control function may be controlled to be turned on or off. Wherein a user can turn on or off far-field speech control functions by interacting with interactive interface objects specified in the user interface. When the display equipment receives the interactive operation of closing the far-field voice control function by the user, controlling a physical switch of the far-field voice collector to be closed so that the far-field voice collector does not collect the voice data of the user any more; when the display equipment receives the interactive operation of starting the far-field voice control function by a user, the display equipment controls the physical switch of the far-field voice collector to be opened so that the far-field voice collector can collect the voice data of the user.
In some embodiments, the switch state of the far-field voice control function is marked by a flag bit in the system. For convenience of explanation, the flag bit that marks the switch state of the far-field voice control function is referred to as the first flag bit. Specifically, the process in which the display device turns on the far-field voice control function in response to a user operation may include setting the value of the first flag bit to a first value and turning on the far-field voice collector; the process in which the display device turns off the far-field voice control function in response to a user operation may include setting the value of the first flag bit to a second value and turning off the far-field voice collector. Therefore, the display device can obtain the switch state of the far-field voice control function by reading the value of the first flag bit.
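A minimal sketch of how the first flag bit and the collector might be kept in step is given below. The constants `FIRST_VALUE` and `SECOND_VALUE`, the `FarFieldCollector` class, and its `enabled` attribute are hypothetical names chosen for illustration; the application does not specify concrete values or interfaces.

```python
from dataclasses import dataclass, field

FIRST_VALUE = 1   # assumed encoding: far-field voice control is on
SECOND_VALUE = 0  # assumed encoding: far-field voice control is off


@dataclass
class FarFieldCollector:
    """Stand-in for the built-in far-field voice collector."""
    enabled: bool = False


@dataclass
class FarFieldControl:
    collector: FarFieldCollector = field(default_factory=FarFieldCollector)
    first_flag: int = SECOND_VALUE

    def turn_on(self) -> None:
        # Set the flag and enable the collector together.
        self.first_flag = FIRST_VALUE
        self.collector.enabled = True

    def turn_off(self) -> None:
        # Clear the flag and disable the collector together.
        self.first_flag = SECOND_VALUE
        self.collector.enabled = False

    def is_on(self) -> bool:
        # The switch state is read back from the flag alone.
        return self.first_flag == FIRST_VALUE
```

Updating the flag and the collector in the same method keeps the two from drifting apart, which is what lets later steps rely on the flag as the single source of the switch state.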
In some embodiments, near-field voice data sent by a near-field voice acquisition device is received; if the value of the second flag bit is the third value, the near-field voice data is sent to the first voice service; and if the value of the second flag bit is the fourth value, the near-field voice data is sent to the second voice service.
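The routing step can be illustrated with a short sketch. It assumes that routing keys off the flag bit that marks the target voice service (the second flag bit introduced earlier), with the illustrative encoding True for the first voice service and False for the second. The function name and the callback-style services are hypothetical.

```python
from typing import Callable

THIRD_VALUE = True    # assumed: target is the first voice service
FOURTH_VALUE = False  # assumed: target is the second voice service


def route_near_field_audio(
    second_flag: bool,
    audio: bytes,
    first_service: Callable[[bytes], None],
    second_service: Callable[[bytes], None],
) -> str:
    """Forward near-field audio to the current target service.

    Returns the name of the service that received the audio.
    """
    if second_flag == THIRD_VALUE:
        first_service(audio)
        return "first"
    second_service(audio)
    return "second"
```

For example, passing `list.append` methods as the two services makes it easy to check which one received a given buffer.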
In some embodiments, the display device sets a value of a flag corresponding to the first voice service to True, a value of a flag corresponding to the second voice service to False, or a value of the second flag to a third value in response to a selection operation of the first voice service input by a user at the voice service setting interface, thereby determining the first voice service as the target voice service. And after the first voice service is determined as the target voice service, recording the current switch state of the far-field voice control function as the historical switch state of the far-field voice control function, and controlling to close the far-field voice control function when the current switch state is in an open state. Because the user has switched the target voice service from the second voice service to the first voice service, the far-field voice control function supported by the second voice service cannot be used, the far-field voice control function is closed, the far-field voice collector can be prevented from continuously collecting voice data, the interference of the far-field voice control function on the near-field voice control function of the first voice service is avoided, and the user experience is optimized. In addition, by recording the switch state of the far-field voice control function when switching to the first voice service as the historical switch state of the far-field voice control function, when the user switches the target voice service from the first voice service back to the second voice service, the far-field voice control function can be turned on or off according to the recorded historical switch state of the far-field voice control function, so that the switch state of the far-field voice control function is consistent with the historical switch state of the far-field voice control function after switching back to the second voice service.
In some embodiments, the display apparatus determines the second voice service as the target voice service by setting a value of a flag corresponding to the second voice service to True, setting a value of a flag corresponding to the first voice service to False, or setting a value of the second flag to a fourth value in response to a selection operation of the second voice service input by the user at the voice service setting interface. And after the second voice service is determined as the target voice service, acquiring the recorded historical switch state of the far-field voice control function, and if the historical switch state of the far-field voice control function is in an open state, controlling the far-field voice control function to be opened. And if the historical switch state of the far-field voice control function is the closed state, keeping the switch state of the far-field voice control function in the closed state. In the embodiments, the far-field voice control function is turned on or off according to the recorded historical on-off state of the far-field voice control function, so that after the user switches back to the second voice service, the on-off state of the far-field voice control function is consistent with the historical on-off state of the far-field voice control function, and the user experience is optimized.
In some embodiments, after the second voice service is determined as the target voice service according to the selection operation, if the historical switch state of the far-field voice control function is the off state, a first prompt message is displayed in the user interface to prompt the user to turn on the far-field voice control function. Therefore, when the user only selects the AVS near-field voice interaction mode, the user can be timely and reasonably guided to upgrade the near-field voice control function into the far-field voice control function. When the AVS far-field voice control function is used, no extra equipment (such as a near-field voice acquisition device like a voice remote controller) is needed, the voice service can be awakened only by directly speaking the awakening words, and a voice instruction is directly input to control the display equipment.
In some embodiments, when the switch state of the far-field voice control function is the off state, a pattern of far-field voice control function switch state identifiers is displayed in the user interface to reveal the switch state of the far-field voice control function as the off state.
Fig. 6 is a schematic user interface diagram according to some exemplary embodiments of the present application, showing the home page of a media asset application that is entered after the display device is turned on when the target voice service is the second voice service and the far-field voice control function is in an off state. As shown in fig. 6, the home page of the media asset application displays time information, application icons and function controls. In addition, since the target voice service is the second voice service and the far-field voice control function is in the off state, a first prompt message, specifically "the far-field voice control function is off", is displayed on the home page of the media asset application, indicating that the far-field voice collector built into the display device is turned off. Furthermore, a gray far-field voice control function switch state indicator is displayed in the upper right corner of the home page shown in fig. 6; displaying the indicator in gray characterizes the switch state of the far-field voice control function as the off state.
In some embodiments, the first prompt message is presented in the user interface in the form of a function control, e.g., the first prompt message is displayed on an operable function control. The user can input a far-field voice control function starting instruction by operating the function control, so as to instruct the display device to turn on the far-field voice control function. The display device receives the far-field voice control function starting instruction input by the user based on the first prompt message, and displays a far-field voice setting interface according to the switch state of the far-field voice collector, wherein the far-field voice setting interface comprises a function control for turning on the far-field voice control function.
It should be noted that the user may input the far-field voice control function starting instruction based on the first prompt message through a series of consecutive operations, that is, two or more consecutive steps. For example, referring to fig. 6, the user first moves the focus to the function control on which the first prompt message is displayed. When the display device detects that the focus is on that function control, it displays an expand control, such as "expand detail" shown in fig. 7. Next, referring to fig. 7, after the user moves the focus to the expand control and inputs a confirmation operation, the input of the far-field voice control function starting instruction is complete.
And the display equipment responds to the far-field voice control function starting instruction and displays a far-field voice setting interface according to the switching state of the far-field voice collector. If the far-field voice collector is in an opening state, the function control used for opening the far-field voice control function is configured to be in an operable state, and if the far-field voice collector is in a closing state, the function control used for opening the far-field voice control function is configured to be in an inoperable state. When the function control for starting the far-field voice control function is in an operable state, a user can start the far-field voice control function by operating the function control.
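The dependence of the function control on the collector's physical switch can be sketched as a small view-model builder. This is an assumption-laden illustration: `FarFieldSettingsView`, `build_far_field_settings`, and the notice string are hypothetical and are not taken from the application.

```python
from dataclasses import dataclass


@dataclass(frozen=True)
class FarFieldSettingsView:
    control_operable: bool  # whether the "turn on far-field voice" control can be operated
    notice: str             # hint shown above the control, empty if none


def build_far_field_settings(collector_switch_on: bool) -> FarFieldSettingsView:
    """Build the far-field voice setting interface from the physical switch state."""
    if collector_switch_on:
        return FarFieldSettingsView(control_operable=True, notice="")
    # Hypothetical wording; the control stays inoperable until the switch is turned on.
    return FarFieldSettingsView(
        control_operable=False,
        notice="The built-in microphone is off. Turn on its physical switch.",
    )
```

Rebuilding the view whenever the switch state changes would give the automatic refresh behavior: the same function is simply called again with the new switch state.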
Fig. 8 illustrates a far-field voice setting interface according to some exemplary embodiments of the present application, specifically the far-field voice setting interface displayed when the far-field voice collector is in an on state. As shown in fig. 8, the far-field voice setting interface includes a function control "far-field voice control function" for turning on the far-field voice control function, together with its explanatory information: "Turn on the built-in microphone. Talk to the TV: just say 'Alexa' to wake up Alexa on the TV. Enjoy Alexa voice services hands-free!" Because the far-field voice collector is in the on state, the function control for turning on the far-field voice control function is in an operable state. Referring to fig. 8, when the user selects the function control and presses the confirm ("OK") key, an operation of turning on the far-field voice control function is input, and the display device turns on the far-field voice control function in response to that operation.
Fig. 9 illustrates a far-field voice setting interface according to some exemplary embodiments of the present application, specifically the far-field voice setting interface displayed when the far-field voice collector is in an off state. As shown in fig. 9, unlike the far-field voice setting interface shown in fig. 8, the upper area of this interface displays a prompt message prompting the user to turn on the physical switch of the far-field voice collector, specifically: "The built-in microphone is currently turned off. Please turn it on using the switch near the bottom of the TV. This interface will refresh automatically." In addition, since the far-field voice collector is in the off state, the function control for turning on the far-field voice control function is in an inoperable state. When it is detected that the user has turned on the physical switch of the far-field voice collector, the function control for turning on the far-field voice control function is updated to the operable state.
In some embodiments, when the switch state of the far-field voice control function is in the on state, a second prompt message and/or another pattern of far-field voice control function switch state identifiers is displayed in the user interface to reveal that the switch state of the far-field voice control function is in the on state.
Fig. 10 is a schematic user interface diagram according to some exemplary embodiments of the present application, showing the home page of a media asset application that is entered after the display device is turned on when the target voice service is the second voice service and the far-field voice control function is in an on state. As shown in fig. 10, the home page of the media asset application displays time information, application icons and function controls. Moreover, since the target voice service is the second voice service and the far-field voice control function is in the on state, a second prompt message, specifically "the far-field voice control function is on", is displayed on the home page of the media asset application, indicating that the far-field voice collector built into the display device is on. In addition, a colored far-field voice control function switch state indicator (the specific color is not shown in the figure) is displayed in the upper right corner of the home page shown in fig. 10; displaying the indicator in color characterizes the switch state of the far-field voice control function as the on state.
In the above embodiment, after the display duration of the first prompt message or the second prompt message reaches the preset duration, the display device cancels the display of the first prompt message or the second prompt message. Or when the first prompt message or the second prompt message is displayed, if the operation of closing the first prompt message or the second prompt message is received, the display of the first prompt message or the second prompt message is cancelled.
In some embodiments, when the switch state of the far-field voice control function is in the on state, the second prompt message and/or the far-field voice control function switch state identification of the first pattern is displayed in the user interface to reveal that the switch state of the far-field voice control function is in the on state. And when the switch state of the far-field voice control function is the closed state, displaying a second pattern of far-field voice control function switch state identification in the user interface so as to show that the switch state of the far-field voice control function is the closed state. The first pattern and the second pattern refer to different patterns, including but not limited to differences in color, contrast, size, content, and the like.
In some embodiments, after the display device is turned on for the first time, it is determined whether the far-field voice control function is in the off state; if it is, the first prompt message is displayed on the home page after the home page of the display device is entered. If no far-field voice control function starting instruction input by the user based on the first prompt message is received within a preset time (such as 3 seconds), the display of the first prompt message is cancelled. In some implementation scenarios, until the user has used the display device for a certain length of time, the first prompt message is displayed once every preset interval as long as the far-field voice control function remains in the off state. For example, the first prompt message is displayed every 7 days during the first two months of the user's use of the display device. After the user has used the display device for that length of time, the first prompt message may no longer be displayed, or the interval between displays of the first prompt message may be suitably extended.
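One way the reminder schedule in the example above could be expressed is sketched below. The 7-day interval comes from the example in the text; treating "two months" as 60 days, stopping the reminders entirely after that window (rather than lengthening the interval), and the function and parameter names are all assumptions made for illustration.

```python
from datetime import datetime, timedelta
from typing import Optional

PROMPT_INTERVAL = timedelta(days=7)   # from the example: once every 7 days
PROMPT_WINDOW = timedelta(days=60)    # assumption: "first two months" taken as 60 days


def should_show_first_prompt(
    now: datetime,
    first_use: datetime,
    last_shown: Optional[datetime],
    far_field_on: bool,
) -> bool:
    """Decide whether to show the 'far-field voice control is off' reminder."""
    if far_field_on:
        return False                  # nothing to remind about
    if now - first_use > PROMPT_WINDOW:
        return False                  # assumed policy: stop after the window
    if last_shown is None:
        return True                   # never shown yet
    return now - last_shown >= PROMPT_INTERVAL
```

The alternative mentioned in the text, extending the interval instead of stopping, would replace the second early return with a longer interval for later periods.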
In some embodiments, when a selection operation of the second voice service input by the user in the voice service setting interface is received, whether the second voice service supports the far-field voice control function is determined, and in a case that it is determined that the second voice service supports the far-field voice control function, the steps described in the above embodiments are executed again. And if the second voice service is judged not to support the far-field voice control function, the process is ended after the second voice service is determined as the target voice service.
According to the display device provided by the above embodiment, the embodiment of the present application further provides a voice service switching method, which is applied to a display device, the display device is configured to be capable of operating a first voice service and a second voice service, the first voice service only supports a near-field voice control function, the second voice service supports a far-field voice control function and a near-field voice control function, and the far-field voice control function can be controlled to be turned on or turned off.
Fig. 11 is a flowchart of a voice service switching method shown in some embodiments of the present application, and as shown in fig. 11, the method may include:
S111: receiving an input voice service selection operation; if the selection operation selects the first voice service, performing S112, and if the selection operation selects the second voice service, performing S114.
S112: determining the first voice service as the target voice service, so as to realize the voice control function through the first voice service.
S113: recording the current switch state of the far-field voice control function as the historical switch state of the far-field voice control function, and turning off the far-field voice control function when the current switch state is the on state.
S114: determining the second voice service as the target voice service, so as to realize the voice control function through the second voice service.
S115: if the historical switch state of the far-field voice control function is the on state, turning on the far-field voice control function.
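The steps above can be sketched as a single handler that returns the step labels it executed. This is a minimal illustration, not the application's implementation: the `SwitchState` dataclass, the string service names, and the returned trace are hypothetical, and the sketch follows the text literally, so selecting the first voice service always overwrites the recorded history.

```python
from dataclasses import dataclass
from typing import List, Optional


@dataclass
class SwitchState:
    target: str = "second"               # current target voice service
    far_field_on: bool = False           # current far-field switch state
    history_on: Optional[bool] = None    # recorded historical switch state


def handle_selection(state: SwitchState, selection: str) -> List[str]:
    """Apply a voice service selection and return the steps that ran."""
    steps = ["S111"]
    if selection == "first":
        state.target = "first"
        steps.append("S112")
        # Record the current state as history, then make sure far-field is off.
        state.history_on = state.far_field_on
        state.far_field_on = False
        steps.append("S113")
    elif selection == "second":
        state.target = "second"
        steps.append("S114")
        # Restore far-field only if it was on when the user switched away.
        if state.history_on:
            state.far_field_on = True
            steps.append("S115")
    else:
        raise ValueError(f"unknown voice service: {selection!r}")
    return steps
```

Starting from a state with far-field on, selecting the first service and then the second brings far-field back on; starting with far-field off, the same round trip leaves it off.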
In some embodiments, after the second voice service is determined as the target voice service, if the historical switch state of the far-field voice control function is the off state, a first prompt message is displayed in the user interface to prompt the user to turn on the far-field voice control function.
In some embodiments, the method further comprises: receiving a far-field voice control function starting instruction input by a user based on the first prompt information, and displaying a far-field voice setting interface according to the switching state of the far-field voice collector, wherein the far-field voice setting interface comprises a function control for starting the far-field voice control function; if the far-field voice collector is in an open state, the function control is configured to be in an operable state, and if the far-field voice collector is in a closed state, the function control is configured to be in a non-operable state, and the function control in the operable state is used for triggering the far-field voice control function to be opened.
In some embodiments, the method further comprises: receiving a starting operation input by a user based on the function control under the condition that the function control is configured to be in an operable state; and responding to the opening operation, and controlling to open the far-field voice control function.
In some embodiments, when the switch state of the far-field voice control function is in the on state, a second prompt message and/or a first pattern of far-field voice control function switch state identifiers is displayed in the user interface to reveal that the switch state of the far-field voice control function is in the on state. And when the switch state of the far-field voice control function is the closed state, displaying a second pattern of far-field voice control function switch state identification in the user interface so as to show that the switch state of the far-field voice control function is the closed state.
In some embodiments, controlling the far-field speech control function to be turned on includes setting a value of a first flag to a first value and controlling the far-field speech collector to be turned on; the controlling of the far-field voice control function to be shut down comprises setting a value of a first flag bit to be a second value and controlling the far-field voice collector to be shut down so as to obtain the on-off state of the far-field voice control function by reading the value of the first flag bit.
In some embodiments, receiving an input voice service selection operation, according to which the first voice service or the second voice service is determined to be the target voice service, includes: displaying a voice service setting interface, wherein the voice service setting interface comprises a first voice service option and a second voice service option; and when a selection operation of the first voice service option is received, setting the value of the second flag bit to a third value, and when a selection operation of the second voice service option is received, setting the value of the second flag bit to a fourth value, so as to send the received near-field voice data to the target voice service according to the value of the second flag bit.
In some embodiments, the method further comprises: receiving near-field voice data sent by a near-field voice acquisition device; if the value of the second flag bit is the third value, sending the near-field voice data to the first voice service; and if the value of the second flag bit is the fourth value, sending the near-field voice data to the second voice service.
According to the voice service switching method provided by the above embodiments, when the user switches the target voice service from the second voice service to the first voice service, the far-field voice control function is turned off, so that the far-field voice collector is prevented from continuing to collect voice data, the far-field voice control function is prevented from interfering with the near-field voice control function of the first voice service, and the user experience is optimized. The switch state of the far-field voice control function at the time the target voice service is switched to the first voice service is recorded as the historical switch state of the far-field voice control function. When the user switches the target voice service from the first voice service back to the second voice service, the far-field voice control function is turned on or off according to the recorded historical switch state, so that after the switch the switch state of the far-field voice control function is consistent with its historical switch state.
Finally, it should be noted that: the above embodiments are only used for illustrating the technical solutions of the present application, and not for limiting the same; although the present application has been described in detail with reference to the foregoing embodiments, it should be understood by those of ordinary skill in the art that: the technical solutions described in the foregoing embodiments may still be modified, or some or all of the technical features may be equivalently replaced; and the modifications or the substitutions do not make the essence of the corresponding technical solutions depart from the scope of the technical solutions of the embodiments of the present application.
The foregoing description, for purposes of explanation, has been presented in conjunction with specific embodiments. However, the illustrative discussions above are not intended to be exhaustive or to limit the embodiments to the precise forms disclosed above. Many modifications and variations are possible in light of the above teaching. The embodiments were chosen and described in order to best explain the principles and the practical application, to thereby enable others skilled in the art to best utilize the embodiments and various embodiments with various modifications as are suited to the particular use contemplated.

Claims (10)

1. A display device, comprising:
a display for displaying a user interface;
a far-field voice collector for collecting far-field voice data;
a controller configured to:
receiving an input voice service selection operation, determining a first voice service or a second voice service as a target voice service according to the selection operation, and realizing a voice control function through the target voice service, wherein the first voice service only supports a near-field voice control function, the second voice service supports a far-field voice control function and a near-field voice control function, and the far-field voice control function can be controlled to be turned on or turned off;
after the first voice service is determined as a target voice service according to the selection operation, recording the current switch state of the far-field voice control function as the historical switch state of the far-field voice control function, and controlling to close the far-field voice control function when the current switch state is in an open state;
and after the second voice service is determined as the target voice service according to the selection operation, if the historical switch state of the far-field voice control function is the opening state, the far-field voice control function is controlled to be opened.
2. The display device of claim 1, wherein the controller is further configured to:
after the second voice service is determined as the target voice service according to the selection operation, if the historical switch state of the far-field voice control function is the closed state, displaying first prompt information in a user interface to prompt a user to start the far-field voice control function.
3. The display device of claim 1, wherein the controller is further configured to:
receiving a far-field voice control function starting instruction input by a user based on the first prompt information, and displaying a far-field voice setting interface according to the switching state of the far-field voice collector, wherein the far-field voice setting interface comprises a function control for starting the far-field voice control function;
if the far-field voice collector is in an open state, the function control is configured to be in an operable state, and if the far-field voice collector is in a closed state, the function control is configured to be in a non-operable state, and the function control in the operable state is used for triggering the far-field voice control function to be opened.
4. The display device of claim 3, wherein the controller is further configured to:
receiving a starting operation input by a user based on the function control under the condition that the function control is configured to be in an operable state;
and responding to the opening operation, and controlling to open the far-field voice control function.
5. The display device of claim 1, wherein the controller is further configured to:
and when the switch state of the far-field voice control function is the opening state, displaying second prompt information and/or a first pattern of far-field voice control function switch state identification in the user interface so as to show that the switch state of the far-field voice control function is the opening state.
6. The display device of claim 1, wherein the controller is further configured to:
and when the switch state of the far-field voice control function is the closed state, displaying a second pattern of far-field voice control function switch state identification in the user interface so as to show that the switch state of the far-field voice control function is the closed state.
7. The display device of claim 5, wherein controlling the far-field voice control function to be turned on comprises setting a value of a first flag to a first value and controlling the far-field voice collector to be turned on;
the controlling of the far-field voice control function to be shut down comprises setting a value of a first flag bit to be a second value and controlling the far-field voice collector to be shut down so as to obtain the on-off state of the far-field voice control function by reading the value of the first flag bit.
8. The display device according to claim 1, wherein receiving an input voice service selection operation according to which the first voice service or the second voice service is determined as the target voice service comprises:
displaying a voice service setting interface, wherein the voice service setting interface comprises a first voice service option and a second voice service option;
and when a selection operation of the first voice service option is received, setting the value of the second flag bit to a third value, and when a selection operation of the second voice service option is received, setting the value of the second flag bit to a fourth value, so as to send the received near-field voice data to the target voice service according to the value of the second flag bit.
9. The display device of claim 8, wherein the controller is further configured to:
receiving near-field voice data sent by a near-field voice acquisition device;
if the value of the second flag bit is the third value, sending the near-field voice data to the first voice service;
and if the value of the second flag bit is the fourth value, sending the near-field voice data to the second voice service.
10. A voice service switching method, applied to a display device, wherein the display device is configured to run a first voice service and a second voice service, the first voice service supports only a near-field voice control function, the second voice service supports both a far-field voice control function and a near-field voice control function, and the far-field voice control function can be turned on or off;
the method comprises the following steps:
receiving an input voice service selection operation, determining the first voice service or the second voice service as a target voice service according to the selection operation, and implementing a voice control function through the target voice service;
after the first voice service is determined as the target voice service according to the selection operation, recording the current on-off state of the far-field voice control function as a historical on-off state of the far-field voice control function, and turning off the far-field voice control function if the current state is on;
and after the second voice service is determined as the target voice service according to the selection operation, turning on the far-field voice control function if the historical on-off state of the far-field voice control function is on.
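The save-and-restore behaviour of the method in claim 10 can be sketched as a small state holder. This is an illustrative reading rather than the patented implementation: the `Service` enum, the `VoiceServiceSwitcher` class, the initial target service, and the choice to ignore a re-selection of the already active service are all assumptions not stated in the claim.

```python
from enum import Enum


class Service(Enum):
    FIRST = "first"    # supports near-field voice control only
    SECOND = "second"  # supports far-field and near-field voice control


class VoiceServiceSwitcher:
    """Remembers the far-field on-off state across voice service switches."""

    def __init__(self, far_field_on: bool = False) -> None:
        self.far_field_on = far_field_on
        self.historical_far_field_on = False
        self.target = Service.SECOND  # assumed initial target service

    def select(self, service: Service) -> None:
        # Assumption: re-selecting the active service changes nothing, so the
        # recorded historical state is not overwritten by accident.
        if service is self.target:
            return
        if service is Service.FIRST:
            # Record the current state as the historical state, then turn
            # far-field control off if it was on.
            self.historical_far_field_on = self.far_field_on
            if self.far_field_on:
                self.far_field_on = False
        else:
            # Switching back: restore far-field control only if it was on before.
            if self.historical_far_field_on:
                self.far_field_on = True
        self.target = service
```

With this arrangement a user who had far-field control enabled gets it back automatically after returning to the second voice service, while a user who had it disabled is not surprised by it turning on.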
CN202110934009.XA 2021-08-05 2021-08-13 Display device and voice service switching method Pending CN113608715A (en)

Priority Applications (2)

Application Number Priority Date Filing Date Title
CN202110934009.XA CN113608715A (en) 2021-08-13 2021-08-13 Display device and voice service switching method
PCT/EP2022/071907 WO2023012260A1 (en) 2021-08-05 2022-08-04 Display apparatus, terminal device and communication connection method

Applications Claiming Priority (1)

Application Number Priority Date Filing Date Title
CN202110934009.XA CN113608715A (en) 2021-08-13 2021-08-13 Display device and voice service switching method

Publications (1)

Publication Number Publication Date
CN113608715A true CN113608715A (en) 2021-11-05

Family

ID=78308602

Family Applications (1)

Application Number Title Priority Date Filing Date
CN202110934009.XA Pending CN113608715A (en) 2021-08-05 2021-08-13 Display device and voice service switching method

Country Status (1)

Country Link
CN (1) CN113608715A (en)

Cited By (1)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN114554265A (en) * 2022-02-23 2022-05-27 四川长虹电器股份有限公司 Television voice control system

Similar Documents

Publication Publication Date Title
CN112672195A (en) Remote controller key setting method and display equipment
CN113630656B (en) Display device, terminal device and communication connection method
CN112817680B (en) Upgrade prompting method and display device
CN113163258A (en) Channel switching method and display device
CN112860331A (en) Display device and voice interaction prompting method
CN112905149A (en) Processing method of voice instruction on display device, display device and server
CN113608715A (en) Display device and voice service switching method
CN113064691B (en) Display method and display equipment for starting user interface
CN112272320B (en) Display device and duplicate name detection method thereof
CN112565867B (en) Display device
CN113064534A (en) Display method and display equipment of user interface
CN114302101A (en) Display apparatus and data sharing method
CN112882780A (en) Setting page display method and display device
CN113596559A (en) Method for displaying information in information bar and display equipment
CN113573112A (en) Display device and remote controller
CN113014979A (en) Content display method and display equipment
CN112732396A (en) Media asset data display method and display device
CN113038255B (en) Channel information updating method and display device
CN113784222B (en) Interaction method of application and digital television program and display equipment
WO2022213807A1 (en) Display method and display device
CN112988023A (en) Application display method and display equipment
CN112882631A (en) Display method of electronic specification on display device and display device
CN116208793A (en) Application program language synchronization method and display device
CN113296668A (en) Display method and display device of electronic specification
CN112947888A (en) Display method and display equipment of voice function page

Legal Events

Date Code Title Description
PB01 Publication
SE01 Entry into force of request for substantive examination