CN114302197A - Voice separation control method and display device - Google Patents

Publication number
CN114302197A
Authority
CN
China
Prior art keywords
voice
external audio
display
determined
control instruction
Legal status
Pending
Application number
CN202110294224.8A
Other languages
Chinese (zh)
Inventor
孟祥菲
Current Assignee
Hisense Visual Technology Co Ltd
Original Assignee
Hisense Visual Technology Co Ltd
Application filed by Hisense Visual Technology Co Ltd
Priority to CN202110294224.8A
Publication of CN114302197A

Landscapes

  • Controls And Circuits For Display Device (AREA)

Abstract

The application discloses a voice separation control method and a display device, which enable a television to be controlled by voice during a remote video call and improve the user experience. The method comprises: receiving external audio; if the display device is in a remote video call state and the far-field microphone switch is on, transmitting the external audio to the object end of the video call; determining whether the external audio includes a wake-up word and, if so, suspending transmission of the external audio to the object end of the video call and listening for a voice to be determined; and receiving the voice to be determined input by a user and, if the voice to be determined includes a control instruction for controlling the display device, executing the corresponding operation according to the control instruction and resuming transmission of the external audio to the object end of the video call.

Description

Voice separation control method and display device
Technical Field
The present application relates to the field of voice control technologies, and in particular, to a voice separation control method and a display device.
Background
With the rapid development of smart televisions, intelligent voice interaction has emerged. Meanwhile, as the concept of the television as a social device has spread, remote video calls between television ends, and between a television end and a mobile phone end, have gradually appeared. How to control the television by voice during a remote video call has therefore become a problem to be solved urgently by those skilled in the art.
Disclosure of Invention
The embodiments of the application provide a voice separation control method and a display device, so that a television can be controlled by voice during a remote video call and the user experience is improved.
In a first aspect, there is provided a display device comprising:
a display for displaying a user interface;
a user interface for receiving an input signal;
a controller coupled to the display and to the user interface, respectively, and configured to perform:
receiving external audio; if the display device is in a remote video call state and the far-field microphone switch is on, transmitting the external audio to the object end of the video call;
determining whether the external audio includes a wake-up word and, if so, suspending transmission of the external audio to the object end of the video call and listening for a voice to be determined; receiving the voice to be determined input by a user and, if the voice to be determined includes a control instruction for controlling the display device, executing the corresponding operation according to the control instruction and resuming transmission of the external audio to the object end of the video call.
In some embodiments, the controller is configured to perform determining whether the external audio includes a wake-up word according to the following steps:
converting the external audio into voice text, and determining whether a wake-up word is present in the voice text;
if the wake-up word exists, determining that the external audio comprises the wake-up word;
if the wake-up word is not present, determining that the external audio does not include the wake-up word.
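The wake-up word check described above can be sketched as follows. This is only an illustrative sketch: it assumes the external audio has already been converted to voice text by some speech recognizer, and the function name `contains_wake_word` and the wake-up word "hello tv" are hypothetical, since the application does not name either.

```python
def contains_wake_word(voice_text: str, wake_words: tuple[str, ...]) -> bool:
    """Return True if any wake-up word occurs in the transcribed voice text."""
    # Normalize case and whitespace so that "Hello  TV" matches "hello tv".
    normalized = " ".join(voice_text.casefold().split())
    return any(word.casefold() in normalized for word in wake_words)


# Hypothetical wake-up word; the application does not specify one.
WAKE_WORDS = ("hello tv",)

print(contains_wake_word("Hello TV, turn the volume up", WAKE_WORDS))
print(contains_wake_word("See you next week", WAKE_WORDS))
```

A plain substring test is used here for brevity; a real device would more likely run a dedicated keyword-spotting model on the audio itself.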
In some embodiments, the controller is configured to perform the determining whether the voice to be determined includes a control instruction for controlling the display device according to the following steps:
converting the voice to be determined into text content;
determining whether a control instruction identical to the text content exists in a control instruction set in the display device;
if such a control instruction exists, determining that the voice to be determined includes a control instruction for controlling the display device;
if not, it is determined that the voice to be determined does not include a control instruction for controlling the display apparatus.
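The lookup against the control instruction set can be illustrated with a minimal sketch. The instruction phrases, the action names, and the helper `match_control_instruction` are assumptions made for illustration; the application only states that the text content is compared with the instructions stored in the display device.

```python
from typing import Optional

# Hypothetical control instruction set mapping spoken phrases to actions.
CONTROL_INSTRUCTIONS = {
    "volume up": "VOLUME_UP",
    "volume down": "VOLUME_DOWN",
    "mute": "MUTE",
}


def match_control_instruction(text_content: str) -> Optional[str]:
    """Return the action for an exactly matching instruction, else None."""
    # Exact match after normalizing case and whitespace, mirroring the
    # "identical to the text content" test described in the text.
    key = " ".join(text_content.casefold().split())
    return CONTROL_INSTRUCTIONS.get(key)


print(match_control_instruction("Volume Up"))
print(match_control_instruction("what time is it"))
```

Returning `None` corresponds to the branch in which the voice to be determined does not include a control instruction.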
In some embodiments, the controller is further configured to perform: if the voice to be determined does not include a control instruction for controlling the display device, continuing to transmit the external audio to the object end of the video call, and repeating the step of determining whether the external audio includes a wake-up word.
In some embodiments, the controller is further configured to perform: if the voice to be determined does not include a control instruction for controlling the display device, controlling the display to display a prompt message.
In some embodiments, the controller is further configured to perform: if the external audio does not include a wake-up word, continuing to transmit the external audio to the object end of the video call, and repeating the step of determining whether the external audio includes a wake-up word.
In some embodiments, the controller is further configured to perform:
if the display device is not in a remote video call state and the external audio includes a wake-up word, receiving the voice to be determined input by the user and, if the voice to be determined includes a control instruction for controlling the display device, executing the corresponding operation according to the control instruction.
In some embodiments, the controller is further configured to perform:
if the display device is in a remote video call state and the far-field microphone switch is off, not transmitting the external audio to the object end of the video call; and if the voice text converted from the external audio includes a wake-up word, receiving the voice to be determined input by the user and, if the voice to be determined includes a control instruction for controlling the display device, executing the corresponding operation according to the control instruction.
In some embodiments, the controller is further configured to perform: controlling the display of a microphone-off message on the display.
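The three cases distinguished above (no call, call with the far-field microphone switch on, call with the switch off) can be summarized in a small routing sketch. The names `AudioRouting` and `route_external_audio` are hypothetical; in every case the device is assumed to keep checking the external audio for a wake-up word.

```python
from dataclasses import dataclass


@dataclass(frozen=True)
class AudioRouting:
    forward_to_object_end: bool   # send external audio to the other party
    show_mic_off_message: bool    # display a microphone-off message


def route_external_audio(in_video_call: bool, far_field_mic_on: bool) -> AudioRouting:
    """Decide how external audio is handled; wake-word detection runs regardless."""
    if not in_video_call:
        # No call in progress: nothing to forward, ordinary voice control only.
        return AudioRouting(forward_to_object_end=False, show_mic_off_message=False)
    if far_field_mic_on:
        # Call in progress with the microphone switch on: forward the audio.
        return AudioRouting(forward_to_object_end=True, show_mic_off_message=False)
    # Call in progress with the switch off: keep the audio local and notify the user.
    return AudioRouting(forward_to_object_end=False, show_mic_off_message=True)


print(route_external_audio(in_video_call=True, far_field_mic_on=False))
```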
In a second aspect, a voice separation control method is provided, including:
receiving external audio; if the display device is in a remote video call state and the far-field microphone switch is on, transmitting the external audio to the object end of the video call;
determining whether the external audio includes a wake-up word and, if so, suspending transmission of the external audio to the object end of the video call and listening for a voice to be determined; receiving the voice to be determined input by a user and, if the voice to be determined includes a control instruction for controlling the display device, executing the corresponding operation according to the control instruction and resuming transmission of the external audio to the object end of the video call.
In the above embodiments, the voice separation control method and the display device enable a television to be controlled by voice during a remote video call and improve the user experience. The method comprises: receiving external audio; if the display device is in a remote video call state and the far-field microphone switch is on, transmitting the external audio to the object end of the video call; determining whether the external audio includes a wake-up word and, if so, suspending transmission of the external audio to the object end of the video call and listening for a voice to be determined; and receiving the voice to be determined input by a user and, if the voice to be determined includes a control instruction for controlling the display device, executing the corresponding operation according to the control instruction and resuming transmission of the external audio to the object end of the video call.
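The overall flow during a call with the far-field microphone switch on can be sketched as a two-state machine. This is an illustrative sketch, not the applicant's implementation: the class and attribute names are hypothetical, the audio is represented by already-transcribed utterances, and two behaviors are assumptions the text does not settle, namely that the utterance containing the wake-up word is not forwarded and that a non-matching voice to be determined is dropped rather than forwarded.

```python
from enum import Enum, auto


class Mode(Enum):
    FORWARDING = auto()        # external audio goes to the object end
    AWAITING_COMMAND = auto()  # transmission suspended, listening for a command


class VoiceSeparationController:
    def __init__(self, wake_word: str, instructions: list[str]) -> None:
        self.wake_word = wake_word.casefold()
        self.instructions = {item.casefold() for item in instructions}
        self.mode = Mode.FORWARDING
        self.sent_to_object_end: list[str] = []
        self.executed: list[str] = []

    def on_audio(self, utterance: str) -> None:
        text = " ".join(utterance.casefold().split())
        if self.mode is Mode.FORWARDING:
            if self.wake_word in text:
                # Wake-up word heard: suspend transmission (assumption: the
                # wake-up utterance itself is not forwarded).
                self.mode = Mode.AWAITING_COMMAND
            else:
                self.sent_to_object_end.append(utterance)
            return
        # Voice to be determined: execute it only if it is a known instruction.
        if text in self.instructions:
            self.executed.append(text)
        # In either case transmission resumes and wake-word checking repeats.
        self.mode = Mode.FORWARDING


controller = VoiceSeparationController("hello tv", ["volume up"])
for utterance in ["nice to see you", "hello tv", "volume up", "where were we"]:
    controller.on_audio(utterance)
print(controller.sent_to_object_end, controller.executed)
```

With this arrangement the other party hears only the conversation, while the command spoken after the wake-up word stays local to the display device.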
Drawings
FIG. 1 illustrates an operational scenario between a display device and a control apparatus according to some embodiments;
FIG. 2 illustrates a hardware configuration block diagram of the control apparatus 100 according to some embodiments;
FIG. 3 illustrates a hardware configuration block diagram of the display apparatus 200 according to some embodiments;
FIG. 4 illustrates a software configuration diagram in the display device 200 according to some embodiments;
FIG. 5 illustrates an icon control interface display of an application in display device 200, in accordance with some embodiments;
FIG. 6 illustrates a flow diagram of a voice separation control method according to some embodiments;
FIG. 7 illustrates a user interface diagram with a far-field microphone switch on in accordance with some embodiments;
FIG. 8 illustrates a user interface diagram with a far-field microphone switch off according to some embodiments.
Detailed Description
To make the objects, embodiments and advantages of the present application clearer, the exemplary embodiments of the present application are described clearly and completely below with reference to the accompanying drawings. It is to be understood that the described exemplary embodiments are only some, and not all, of the embodiments of the present application.
All other embodiments that a person skilled in the art can derive from the exemplary embodiments described herein without inventive effort are intended to fall within the scope of the appended claims. In addition, while the disclosure herein is presented in terms of one or more exemplary examples, it should be appreciated that individual aspects of the disclosure may each constitute a complete embodiment on their own.
It should be noted that the brief descriptions of the terms in the present application are only for the convenience of understanding the embodiments described below, and are not intended to limit the embodiments of the present application. These terms should be understood in their ordinary and customary meaning unless otherwise indicated.
The terms "first", "second", "third", and the like in the description and claims of this application and in the above-described drawings are used for distinguishing between similar or analogous objects or entities and are not necessarily meant to define a particular order or sequence unless otherwise indicated. It is to be understood that the terms so used are interchangeable under appropriate circumstances such that the embodiments described herein are, for example, capable of operation in sequences other than those illustrated or otherwise described herein.
Furthermore, the terms "comprises" and "comprising," as well as any variations thereof, are intended to cover a non-exclusive inclusion, such that a product or device that comprises a list of elements is not necessarily limited to those elements explicitly listed, but may include other elements not expressly listed or inherent to such product or device.
The term "module" as used herein refers to any known or later developed hardware, software, firmware, artificial intelligence, fuzzy logic, or combination of hardware and/or software code that is capable of performing the functionality associated with that element.
The term "remote control" as used in this application refers to a component of an electronic device, such as the display device disclosed in this application, that can typically control the electronic device wirelessly over a relatively short distance. It typically connects to the electronic device using infrared and/or radio frequency (RF) signals and/or Bluetooth, and may also include functional modules such as WiFi, wireless USB, and motion sensors. For example, a hand-held touch remote controller replaces most of the physical built-in hard keys of a common remote control device with a user interface on a touch screen.
The term "gesture" as used in this application refers to a user's behavior through a change in hand shape or an action such as hand motion to convey a desired idea, action, purpose, or result.
Fig. 1 is a schematic diagram illustrating an operation scenario between a display device and a control apparatus according to an embodiment. As shown in fig. 1, a user may operate the display device 200 through the mobile terminal 300 and the control apparatus 100.
In some embodiments, the control apparatus 100 may be a remote controller. Communication between the remote controller and the display device includes infrared protocol communication, Bluetooth protocol communication, and other short-distance communication methods, and the display device 200 is controlled wirelessly or by wire. The user may input user commands through keys on the remote controller, voice input, control panel input, and the like to control the display device 200. For example, the user can input corresponding control commands through the volume up/down keys, channel control keys, up/down/left/right movement keys, voice input key, menu key, power key, and the like on the remote controller to control the display device 200.
In some embodiments, mobile terminals, tablets, computers, laptops, and other smart devices may also be used to control the display device 200. For example, the display device 200 is controlled using an application program running on the smart device. The application, through configuration, may provide the user with various controls in an intuitive User Interface (UI) on a screen associated with the smart device.
In some embodiments, the mobile terminal 300 and the display device 200 may each install a software application and establish connection communication through a network communication protocol, for the purpose of one-to-one control operation and data communication. For example, the mobile terminal 300 and the display device 200 can establish a control instruction protocol, the remote control keyboard can be synchronized to the mobile terminal 300, and the display device 200 can be controlled by operating the user interface on the mobile terminal 300. The audio and video content displayed on the mobile terminal 300 can also be transmitted to the display device 200 to realize a synchronous display function.
As also shown in fig. 1, the display apparatus 200 also performs data communication with the server 400 through various communication means. The display device 200 may be communicatively connected through a Local Area Network (LAN), a Wireless Local Area Network (WLAN), and other networks. The server 400 may provide various contents and interactions to the display apparatus 200. Illustratively, the display device 200 sends and receives information to receive software program updates, access a remotely stored digital media library, and perform Electronic Program Guide (EPG) interactions. The server 400 may be one cluster or a plurality of clusters, and may include one or more types of servers. Other web service contents such as video on demand and advertisement services are also provided through the server 400.
The display device 200 may be a liquid crystal display, an OLED display, a projection display device. The particular display device type, size, resolution, etc. are not limiting, and those skilled in the art will appreciate that the display device 200 may be modified in performance and configuration as desired.
In addition to the broadcast receiving television function, the display apparatus 200 may provide smart network television functions supported by a computer, including, but not limited to, network television, smart television, Internet Protocol television (IPTV), and the like.
A hardware configuration block diagram of a display device 200 according to an exemplary embodiment is exemplarily shown in fig. 2.
In some embodiments, at least one of the controller 250, the tuner demodulator 210, the communicator 220, the detector 230, the input/output interface 255, the display 275, the audio output interface 285, the memory 260, the power supply 290, the user interface 265, and the external device interface 240 is included in the display apparatus 200.
In some embodiments, the display 275 receives image signals output by the first processor and displays video content, images, and components of the menu manipulation interface.
In some embodiments, the display 275 includes a display screen assembly for presenting a picture and a driving assembly that drives the display of an image.
In some embodiments, the video content is displayed from broadcast television content, or alternatively, from various broadcast signals that may be received via wired or wireless communication protocols. Alternatively, various image contents received from the network communication protocol and sent from the network server side can be displayed.
In some embodiments, the display 275 is used to present a user-manipulated UI interface generated in the display apparatus 200 and used to control the display apparatus 200.
In some embodiments, a driver assembly for driving the display is also included, depending on the type of display 275.
In some embodiments, display 275 is a projection display and may also include a projection device and a projection screen.
In some embodiments, the communicator 220 is a component for communicating with external devices or external servers according to various communication protocol types. For example, the communicator may include at least one of a WiFi chip, a Bluetooth communication protocol chip, a wired Ethernet communication protocol chip, other network communication protocol chips or near field communication protocol chips, and an infrared receiver.
In some embodiments, the display apparatus 200 may establish control signal and data signal transmission and reception with the external control apparatus 100 or the content providing apparatus through the communicator 220.
In some embodiments, the user interface 265 may be configured to receive infrared control signals from a control device 100 (e.g., an infrared remote control, etc.).
In some embodiments, the detector 230 is a component used by the display device 200 to collect signals from the external environment or to interact with the outside.
In some embodiments, the detector 230 includes a light receiver, that is, a sensor for collecting the intensity of ambient light, so that display parameters and the like can be adapted according to the collected ambient light.
In some embodiments, the detector 230 may further include an image collector, such as a camera, etc., which may be configured to collect external environment scenes, collect attributes of the user or gestures interacted with the user, adaptively change display parameters, and recognize user gestures, so as to implement a function of interaction with the user.
In some embodiments, the detector 230 may also include a temperature sensor or the like for sensing the ambient temperature.
In some embodiments, the display apparatus 200 may adaptively adjust the display color temperature of an image according to the sensed temperature. For example, the display apparatus 200 may be adjusted to display a cool tone when the ambient temperature is high, or to display a warm tone when the ambient temperature is low.
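The temperature-dependent tone selection can be illustrated with a minimal sketch. The thresholds of 18 and 28 degrees Celsius, the intermediate "neutral" tone, and the function name `select_color_tone` are assumptions for illustration only; the application gives no numeric values.

```python
def select_color_tone(ambient_celsius: float,
                      low_threshold: float = 18.0,
                      high_threshold: float = 28.0) -> str:
    """Pick a display tone from the ambient temperature (thresholds are hypothetical)."""
    if ambient_celsius >= high_threshold:
        return "cool"     # hot environment: display a cool tone
    if ambient_celsius <= low_threshold:
        return "warm"     # cold environment: display a warm tone
    return "neutral"      # assumed default between the two thresholds


print(select_color_tone(32.0))
print(select_color_tone(10.0))
```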
In some embodiments, the detector 230 may also include a sound collector, such as a microphone, which may be used to receive the user's voice, for example a voice signal including a control instruction of the user to control the display device 200, or to collect ambient sound for recognizing the type of ambient scene, so that the display device 200 can adapt to ambient noise.
In some embodiments, as shown in fig. 2, the input/output interface 255 is configured to allow data transfer between the controller 250 and other external devices or other controllers 250, for example, to receive video signal data and audio signal data of an external device, or command instruction data.
In some embodiments, the external device interface 240 may include, but is not limited to, any one or more of a high-definition multimedia interface (HDMI), an analog or data high-definition component input interface, a composite video input interface, a USB input interface, an RGB port, and the like. The plurality of interfaces may form a composite input/output interface.
In some embodiments, as shown in fig. 2, the tuner demodulator 210 is configured to receive broadcast television signals in a wired or wireless manner, perform modulation and demodulation processing such as amplification, mixing, and resonance, and demodulate audio and video signals from a plurality of wireless or wired broadcast television signals. The audio and video signals may include the television audio and video signals carried on the television channel frequency selected by the user, as well as EPG data signals.
In some embodiments, the frequency points demodulated by the tuner demodulator 210 are controlled by the controller 250. The controller 250 can send out control signals according to the user's selection, so that the tuner demodulator responds to the television signal frequency selected by the user and modulates and demodulates the television signal carried on that frequency.
In some embodiments, broadcast television signals may be classified into terrestrial broadcast signals, cable broadcast signals, satellite broadcast signals, internet broadcast signals, and the like according to the broadcasting system of the television signal; into digital modulation signals, analog modulation signals, and the like according to the modulation type; or into digital signals, analog signals, and the like according to the signal type.
In some embodiments, the controller 250 and the tuner demodulator 210 may be located in separate devices, that is, the tuner demodulator 210 may be located in a device external to the main device where the controller 250 is located, such as an external set-top box. In this case, the set-top box outputs the television audio and video signals obtained by modulating and demodulating the received broadcast television signals to the main device, and the main device receives the audio and video signals through the first input/output interface.
In some embodiments, the controller 250 controls the operation of the display device and responds to user operations through various software control programs stored in memory. The controller 250 may control the overall operation of the display apparatus 200. For example: in response to receiving a user command for selecting a UI object to be displayed on the display 275, the controller 250 may perform an operation related to the object selected by the user command.
In some embodiments, the object may be any selectable object, such as a hyperlink or an icon. Operations related to the selected object include, for example, displaying the page, document, or image that a hyperlink points to, or running the program corresponding to an icon. The user command for selecting the UI object may be a command input through various input means (e.g., a mouse, a keyboard, a touch pad, etc.) connected to the display apparatus 200, or a voice command corresponding to a voice spoken by the user.
As shown in fig. 2, the controller 250 includes at least one of a Random Access Memory 251 (RAM), a Read-Only Memory 252 (ROM), a video processor 270, an audio processor 280, other processors 253 (e.g., a Graphics Processing Unit (GPU)), a Central Processing Unit 254 (CPU), a communication interface, and a communication bus 256 that connects the respective components.
In some embodiments, the RAM 251 is used to store temporary data for the operating system or other programs that are running.
In some embodiments, the ROM 252 is used to store instructions for various system boots.
In some embodiments, the ROM 252 is used to store a Basic Input Output System (BIOS), which completes the power-on self-test of the system, the initialization of each functional module in the system, the loading of the system's basic input/output drivers, and the booting of the operating system.
In some embodiments, when the display apparatus 200 is powered on upon receiving a power-on signal, the CPU executes the system boot instructions in the ROM 252 and copies the temporary data of the operating system stored in the memory into the RAM 251 so as to boot or run the operating system. After the operating system has started, the CPU copies the temporary data of the various application programs in the memory into the RAM 251, and then starts or runs the various application programs.
In some embodiments, the CPU 254 is used to execute the operating system and application program instructions stored in the memory, and to execute various application programs, data, and contents according to the interactive instructions received from the outside, so as to finally display and play various audio and video contents.
In some example embodiments, the CPU 254 may comprise a plurality of processors, including a main processor and one or more sub-processors. The main processor performs some operations of the display apparatus 200 in a pre-power-up mode and/or displays the screen in a normal mode. The one or more sub-processors perform operations in a standby mode or the like.
In some embodiments, the graphics processor 253 is used to generate various graphics objects, such as icons, operation menus, and graphics displayed for user input instructions. The graphics processor includes an arithmetic unit, which performs operations on the various interactive instructions input by the user and displays various objects according to their display attributes, and a renderer, which renders the objects obtained by the arithmetic unit for display on the display.
In some embodiments, the video processor 270 is configured to receive an external video signal and perform video processing such as decompression, decoding, scaling, noise reduction, frame rate conversion, resolution conversion, and image synthesis according to the standard codec protocol of the input signal, so as to obtain a signal that can be displayed or played directly on the display device 200.
In some embodiments, video processor 270 includes a demultiplexing module, a video decoding module, an image synthesis module, a frame rate conversion module, a display formatting module, and the like.
The demultiplexing module is used for demultiplexing the input audio and video data stream; for example, if an MPEG-2 stream is input, the demultiplexing module demultiplexes it into a video signal and an audio signal.
The video decoding module is used for processing the demultiplexed video signal, including decoding, scaling, and the like.
The image synthesis module is used for superimposing and mixing the GUI signal, which is generated by the graphics generator in response to user input or by itself, with the scaled video image, so as to generate an image signal for display.
The frame rate conversion module is used for converting the frame rate of the input video, for example converting a 60 Hz frame rate into a 120 Hz or 240 Hz frame rate, which is commonly implemented by frame interpolation.
The display formatting module is used for converting the video output signal received after frame rate conversion into a signal that conforms to the display format, for example by outputting an RGB data signal.
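As a toy illustration of frame rate conversion by frame interpolation, the following sketch doubles the frame rate (for example from 60 Hz to 120 Hz) by inserting one blended frame between each pair of neighbors. Linear blending is a stand-in chosen for simplicity and is an assumption; the application does not state which interpolation method is used, and practical converters typically rely on motion-compensated interpolation. Frames are modeled as flat lists of pixel values, and `double_frame_rate` is a hypothetical name.

```python
def double_frame_rate(frames: list[list[float]]) -> list[list[float]]:
    """Return twice as many frames by inserting the average of each neighboring pair."""
    if not frames:
        return []
    output: list[list[float]] = []
    for current, following in zip(frames, frames[1:]):
        output.append(current)
        # Interpolated frame: per-pixel midpoint of the two neighbors.
        output.append([(a + b) / 2 for a, b in zip(current, following)])
    # The last frame has no successor, so it is simply repeated.
    output.append(frames[-1])
    output.append(list(frames[-1]))
    return output


source = [[0.0, 0.0], [2.0, 4.0]]
print(double_frame_rate(source))
```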
In some embodiments, the graphics processor 253 and the video processor may be integrated or configured separately. When integrated, they jointly process the graphics signals output to the display; when configured separately, they perform different functions, for example in a GPU + FRC (Frame Rate Conversion) architecture.
In some embodiments, the audio processor 280 is configured to receive an external audio signal, decompress and decode the received audio signal according to a standard codec protocol of the input signal, and perform noise reduction, digital-to-analog conversion, and amplification processes to obtain an audio signal that can be played in a speaker.
In some embodiments, video processor 270 may comprise one or more chips. The audio processor may also comprise one or more chips.
In some embodiments, the video processor 270 and the audio processor 280 may be separate chips or may be integrated together with the controller in one or more chips.
In some embodiments, the audio output interface 285 receives, under the control of the controller 250, the sound signals output by the audio processor 280. In addition to the speaker 286 carried by the display device 200 itself, it may include an external sound output terminal that outputs to a sound-producing device of an external device, such as an external sound interface or an earphone interface, and may also include a near field communication module in the communication interface, such as a Bluetooth module for outputting sound through a Bluetooth speaker.
The power supply 290 supplies the display device 200 with power input from an external power source under the control of the controller 250. The power supply 290 may include a built-in power supply circuit installed inside the display apparatus 200, or may be a power supply installed outside the display apparatus 200, with a power supply interface in the display apparatus 200 for connecting the external power supply.
The user interface 265 is used for receiving an input signal of a user and then transmitting the received user input signal to the controller 250. The user input signal may be a remote controller signal received through an infrared receiver, and various user control signals may also be received through the network communication module.
In some embodiments, the user inputs a user command through the control apparatus 100 or the mobile terminal 300, the user input interface receives the user input, and the display device 200 responds to the user input through the controller 250.
In some embodiments, a user may enter user commands on a Graphical User Interface (GUI) displayed on the display 275, and the user input interface receives the user input commands through the Graphical User Interface (GUI). Alternatively, the user may input the user command by inputting a specific sound or gesture, and the user input interface receives the user input command by recognizing the sound or gesture through the sensor.
In some embodiments, a "user interface" is a media interface for interaction and information exchange between an application or operating system and a user that enables conversion between an internal form of information and a form that is acceptable to the user. A commonly used presentation form of the User Interface is a Graphical User Interface (GUI), which refers to a User Interface related to computer operations and displayed in a graphical manner. It may be an interface element such as an icon, a window, a control, etc. displayed in the display screen of the electronic device, where the control may include a visual interface element such as an icon, a button, a menu, a tab, a text box, a dialog box, a status bar, a navigation bar, a Widget, etc.
The memory 260 stores various software modules for driving the display device 200. For example, the software modules stored in the first memory include at least one of a basic module, a detection module, a communication module, a display control module, a browser module, and various service modules.
The basic module is a bottom-layer software module used for signal communication between the various hardware components in the display device 200 and for sending processing and control signals to upper-layer modules. The detection module collects various information from sensors or user input interfaces, and the management module performs digital-to-analog conversion and analysis management.
For example, the voice recognition module comprises a voice analysis module and a voice instruction database module. The display control module is used for controlling the display to display image content, and can be used for playing information such as multimedia image content and UI interfaces. The communication module is used for control and data communication with external devices. The browser module is used for data communication with browsing servers. The service module is used for providing various services and includes various application programs. Meanwhile, the memory 260 may also store received external data and user data, images of various items in various user interfaces, visual effect maps of the focus object, and the like.
Fig. 3 exemplarily shows a block diagram of a configuration of the control apparatus 100 according to an exemplary embodiment. As shown in fig. 3, the control apparatus 100 includes a controller 110, a communication interface 130, a user input/output interface, a memory, and a power supply source.
The control device 100 is configured to control the display device 200. It may receive an input operation instruction of a user and convert the operation instruction into an instruction that the display device 200 can recognize and respond to, serving as an interaction intermediary between the user and the display device 200. For example, the user operates the channel up and down keys on the control device 100, and the display device 200 responds to the channel up and down operation.
In some embodiments, the control device 100 may be a smart device. For example, the control device 100 may install various applications that control the display device 200 according to user demands.
In some embodiments, as shown in fig. 1, a mobile terminal 300 or other intelligent electronic device may function similarly to the control device 100 after installing an application that manipulates the display device 200. For example, by installing an application, the user may implement the functions of the physical keys of the control device 100 through the function keys or virtual buttons of a graphical user interface available on the mobile terminal 300 or other intelligent electronic device.
The controller 110 includes a processor 112 and RAM 113 and ROM 114, a communication interface 130, and a communication bus. The controller is used to control the operation of the control device 100, as well as the communication cooperation between the internal components and the external and internal data processing functions.
The communication interface 130 enables communication of control signals and data signals with the display device 200 under the control of the controller 110. For example, the received user input signal is transmitted to the display device 200. The communication interface 130 may include at least one of a WiFi chip 131, a Bluetooth module 132, an NFC module 133, and other near field communication modules.
The user input/output interface 140 includes an input interface and an output interface. The input interface includes at least one of a microphone 141, a touch pad 142, a sensor 143, keys 144, and other input interfaces. For example, the user can input a user instruction through actions such as voice, touch, gesture, and pressing; the input interface converts the received analog signal into a digital signal, converts the digital signal into a corresponding instruction signal, and sends the instruction signal to the display device 200.
The output interface includes an interface that transmits the received user instruction to the display device 200. In some embodiments, the interface may be an infrared interface or a radio frequency interface. For example, when the infrared signal interface is used, the user input instruction needs to be converted into an infrared control signal according to an infrared control protocol, and the infrared control signal is sent to the display device 200 through the infrared sending module. As another example, when the radio frequency signal interface is used, the user input instruction needs to be converted into a digital signal, which is then modulated according to the radio frequency control signal modulation protocol and transmitted to the display device 200 through the radio frequency transmitting terminal.
In some embodiments, the control device 100 includes at least one of a communication interface 130 and an input-output interface 140. When the control device 100 is provided with a communication interface 130, such as a WiFi, Bluetooth, or NFC module, the user input command may be encoded according to the WiFi protocol, the Bluetooth protocol, or the NFC protocol and transmitted to the display device 200.
The memory 190 stores various operation programs, data and applications for driving and controlling the control apparatus 200 under the control of the controller. The memory 190 may store various control signal commands input by a user.
The power supply 180 provides operational power support to the various elements of the control device 100 under the control of the controller, and may include a battery and associated control circuitry.
In some embodiments, the system may include a Kernel (Kernel), a command parser (shell), a file system, and an application program. The kernel, shell, and file system together make up the basic operating system structure that allows users to manage files, run programs, and use the system. After power-on, the kernel is started, kernel space is activated, hardware is abstracted, hardware parameters are initialized, and virtual memory, a scheduler, signals and interprocess communication (IPC) are operated and maintained. And after the kernel is started, loading the Shell and the user application program. The application program is compiled into machine code after being started, and a process is formed.
Referring to fig. 4, in some embodiments, the system is divided into four layers, which are, from top to bottom, an Application (Applications) layer (referred to as an "Application layer"), an Application Framework (Application Framework) layer (referred to as a "Framework layer"), an Android runtime (Android runtime) layer and a system library layer (referred to as a "system runtime library layer"), and a kernel layer.
In some embodiments, at least one application program runs in the application program layer, and the application programs can be Window (Window) programs carried by an operating system, system setting programs, clock programs, camera applications and the like; or may be an application developed by a third party developer such as a hi program, a karaoke program, a magic mirror program, or the like. In specific implementation, the application packages in the application layer are not limited to the above examples, and may actually include other application packages, which is not limited in this embodiment of the present application.
The framework layer provides an Application Programming Interface (API) and a programming framework for the applications of the application layer. The application framework layer includes a number of predefined functions. The application framework layer acts as a processing center that decides the actions of the applications in the application layer. Through the API, an application can access the resources in the system and obtain the services of the system during execution.
As shown in fig. 4, in the embodiment of the present application, the application framework layer includes a manager (Managers), a Content Provider (Content Provider), and the like, where the manager includes at least one of the following modules: an Activity Manager (Activity Manager) is used for interacting with all activities running in the system; the Location Manager (Location Manager) is used for providing the system service or application with the access of the system Location service; a Package Manager (Package Manager) for retrieving various information related to an application Package currently installed on the device; a Notification Manager (Notification Manager) for controlling display and clearing of Notification messages; a Window Manager (Window Manager) is used to manage the icons, windows, toolbars, wallpapers, and desktop components on a user interface.
In some embodiments, the activity manager is used to manage the life cycle of each application and the general navigation back function, such as controlling the exit of an application (including switching the user interface currently displayed in the display window to the system desktop), opening an application, and going back (including switching the user interface currently displayed in the display window to the previous user interface).
In some embodiments, the window manager is configured to manage all window programs, such as obtaining the display size, determining whether there is a status bar, locking the screen, capturing the screen, and controlling display changes (e.g., zooming out, dithering, distorting, etc.).
In some embodiments, the system runtime layer provides support for the upper layer, i.e., the framework layer. When the framework layer is used, the Android operating system runs the C/C++ libraries included in the system runtime layer to implement the functions required by the framework layer.
In some embodiments, the kernel layer is a layer between hardware and software. As shown in fig. 4, the kernel layer includes at least one of the following drivers: an audio driver, a display driver, a Bluetooth driver, a camera driver, a WiFi driver, a USB driver, an HDMI driver, sensor drivers (such as for a fingerprint sensor, temperature sensor, touch sensor, pressure sensor, etc.), and so on.
In some embodiments, the kernel layer further comprises a power driver module for power management.
In some embodiments, software programs and/or modules corresponding to the software architecture of fig. 4 are stored in the first memory or the second memory shown in fig. 2 or 3.
In some embodiments, taking the magic mirror application (a photographing application) as an example, when the remote control receiving device receives an input operation from the remote control, a corresponding hardware interrupt is sent to the kernel layer. The kernel layer processes the input operation into a raw input event (including information such as the value of the input operation and the timestamp of the input operation). The raw input event is stored at the kernel layer. The application framework layer obtains the raw input event from the kernel layer and identifies the control corresponding to the input event according to the current position of the focus. Taking the input operation as a confirmation operation whose corresponding control is the magic mirror application icon, the magic mirror application calls an interface of the application framework layer to start itself, and then calls the kernel layer to start the camera driver, so that a still image or a video is captured through the camera.
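The event flow described above can be illustrated with a minimal Python sketch. It is not the implementation of the embodiment: the class names KernelLayer, FrameworkLayer and RawInputEvent, the key value "OK", and the control name "magic_mirror_icon" are all hypothetical and are introduced only for illustration.

```python
from dataclasses import dataclass
import time


@dataclass
class RawInputEvent:
    key_value: str    # value of the input operation, e.g. "OK" (hypothetical)
    timestamp: float  # time at which the input operation occurred


class KernelLayer:
    """Hypothetical stand-in for the kernel layer."""

    def __init__(self):
        self.events = []             # raw input events are stored at the kernel layer
        self.camera_started = False

    def on_hardware_interrupt(self, key_value, timestamp=None):
        # Process the input operation into a raw input event and store it.
        stamp = time.time() if timestamp is None else timestamp
        event = RawInputEvent(key_value, stamp)
        self.events.append(event)
        return event

    def start_camera_driver(self):
        self.camera_started = True


class FrameworkLayer:
    """Hypothetical stand-in for the application framework layer."""

    def __init__(self, kernel, controls):
        self.kernel = kernel
        self.controls = controls     # list index = focus position
        self.focus = 0

    def dispatch(self):
        # Obtain the oldest raw input event from the kernel layer.
        event = self.kernel.events.pop(0)
        if event.key_value != "OK":  # only the confirmation operation is modeled
            return None
        control = self.controls[self.focus]
        if control == "magic_mirror_icon":
            # Starting the application ends with starting the camera driver.
            self.kernel.start_camera_driver()
        return control
```

In this sketch the focus position alone decides which control a confirmation operation applies to, which mirrors the description above; a real framework would also track windows and key mappings.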
In some embodiments, for a display device with a touch function, taking a split screen operation as an example, the display device receives an input operation (such as a split screen operation) that the user performs on the display screen, and the kernel layer generates a corresponding input event according to the input operation and reports the event to the application framework layer. The activity manager of the application framework layer sets the window mode (such as multi-window mode) corresponding to the input operation, as well as the position and size of the window. The window manager of the application framework layer draws the window according to the settings of the activity manager and sends the drawn window data to the display driver of the kernel layer, and the display driver displays the corresponding application interfaces in different display areas of the display screen.
In some embodiments, as shown in fig. 5, the application layer containing at least one application may display a corresponding icon control in the display, such as: the system comprises a live television application icon control, a video on demand application icon control, a media center application icon control, an application center icon control, a game application icon control and the like.
In some embodiments, the live television application may provide live television via different signal sources. For example, a live television application may provide television signals using input from cable television, radio broadcasts, satellite services, or other types of live television services. And, the live television application may display video of the live television signal on the display device 200.
In some embodiments, a video-on-demand application may provide video from different storage sources. Unlike a live television application, video on demand provides video from some storage source. For example, the video on demand may come from a cloud storage server, or from local hard disk storage containing stored video programs.
In some embodiments, the media center application may provide various applications for multimedia content playback. For example, a media center, which is distinct from live television and video on demand, may provide services through which a user can access various images or audio.
In some embodiments, an application center may provide storage for various applications. An application may be a game, an application program, or some other application associated with a computer system or other device that can be run on the smart television. The application center may obtain these applications from different sources and store them in local storage, after which they can be run on the display device 200.
With the rapid development of smart televisions, intelligent voice interaction has emerged. Meanwhile, as the concept of social television continues to spread, remote video calls between television ends, and between a television end and a mobile phone end, have gradually appeared. Therefore, how to control the television through voice during a remote video call has become a problem to be solved urgently by those skilled in the art.
The embodiment of the application provides a voice separation control method, which comprises the following steps:
Receiving external audio. In the embodiment of the application, the display device continuously receives external audio after it is started. In some embodiments, after the display device is powered on, a voice activity detection (VAD) service is started and the VAD identifies the external audio. When user voice is detected in the external audio, a voice wake-up service (VT) is started. The voice wake-up service is configured to start the voice control system after a wake-up word is recognized, and the voice control system then executes the corresponding operation according to the control instruction.
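The staged start-up described above (voice activity detection first, the voice wake-up service only once user voice is present) might be sketched as follows. The embodiment does not specify how the VAD works; the mean-amplitude threshold used here is only a stand-in assumption, and has_voice, VoicePipeline and the threshold value are hypothetical names and values. The sketch also assumes that the wake-up service keeps running once it has been started.

```python
def has_voice(frame, threshold=0.1):
    """Stand-in VAD (assumption): mean absolute amplitude above a threshold."""
    if not frame:
        return False
    return sum(abs(sample) for sample in frame) / len(frame) > threshold


class VoicePipeline:
    """Starts the (hypothetical) voice wake-up service on first detected voice."""

    def __init__(self):
        self.wake_service_running = False

    def on_audio(self, frame):
        # The wake-up service is started only after the VAD reports user voice.
        if not self.wake_service_running and has_voice(frame):
            self.wake_service_running = True
        return self.wake_service_running
```

A production VAD would typically rely on spectral features or a trained model rather than raw amplitude; the threshold here only makes the two-stage structure visible.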
In some embodiments, the far-field module receives external audio, the far-field module includes two channels, and a first channel is responsible for transmitting the external audio to an object end of a video call when the display device is in a remote video call state; the second channel is responsible for transmitting data related to voice control commands operating the display device.
In the embodiment of the application, it is judged whether the display device is in a remote video call state and whether the external audio includes the wake-up word. The control logic of the embodiment of the application differs depending on whether the display device is in the remote video call state and on whether the external audio includes the wake-up word.
In some embodiments, if the display device is not in a remote video call state and the external audio includes a wakeup word, receiving a voice to be determined input by a user, and if the voice to be determined includes a control instruction for controlling the display device, executing a corresponding operation according to the control instruction.
In some embodiments, the step of determining whether the external audio includes a wake-up word includes: converting the external audio into speech text, and judging whether a wake-up word exists in the speech text; if the wake-up word exists, determining that the external audio includes the wake-up word; if the wake-up word does not exist, determining that the external audio does not include the wake-up word.
In some embodiments, the voice wake-up service determines whether the external audio includes a wake-up word, wherein the external audio is sent to the voice wake-up service through the second channel. In some embodiments, the far-field module writes the received external audio into the recording device, the voice engine converts the external audio into speech text, and it is then determined whether the speech text contains a wake-up word. In some embodiments, determining whether the speech text contains the wake-up word specifically includes: the voice wake-up service obtains the speech text, which is transmitted to it through the second channel, and searches the speech text for the wake-up word. If the wake-up word is found, it is determined that the wake-up word exists; if it is not found, it is determined that the wake-up word does not exist. Illustratively, the wake-up word may be "hi minuscule".
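The search for the wake-up word in the speech text can be sketched in Python as below. The sketch assumes the speech text has already been produced by the voice engine; the function name contains_wake_word and the case and whitespace normalization are assumptions that the embodiment does not state.

```python
def contains_wake_word(speech_text, wake_word):
    """Return True if the wake-up word occurs in the speech text.

    Normalization (lower-casing, collapsing whitespace) is an assumption
    made for this sketch only.
    """
    normalized = " ".join(speech_text.lower().split())
    return wake_word.lower() in normalized
```

For example, contains_wake_word("Hi  Minuscule open homepage", "hi minuscule") evaluates to True, while the same call on "open homepage" evaluates to False.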
When the external audio includes the wake-up word, the voice to be determined is monitored. In the embodiment of the application, when the wake-up word is detected, the voice control system is notified to start, and the voice control system monitors the voice to be determined. Illustratively, "open homepage" spoken by the user is received, and "open homepage" is then taken as the voice to be determined. It is then determined whether the voice to be determined includes a control instruction for controlling the display device.
In some embodiments, if the external audio does not include a wake-up word, the display device continues to determine whether the external audio includes a wake-up word. It should be noted that, in the embodiment of the present application, the display device continuously receives the external audio. Therefore, each time this determination is repeated, it is made on the updated audio data.
In some embodiments, the controller is configured to determine whether the voice to be determined includes a control instruction for controlling the display device according to the following steps:
converting the voice to be determined into text content, and uploading the text content to a cloud server, so that the cloud server analyzes the text content to obtain an analysis result and transmits the analysis result to the display device.
In the embodiment of the application, because the user may use non-standard language, the display device may not accurately understand what the user requires. Therefore, the voice to be determined is converted into text content and uploaded to the cloud server, which analyzes the text content and screens out the content that approximates an instruction. In some embodiments, the far-field module writes the voice to be determined into the recording device, and the voice is converted into text. The cloud server analyzes the text content, screens out the real semantics from the instruction text, and sends the analysis result to the display device.
If the control instruction which is the same as the analysis result does not exist in the control instruction set in the display equipment, the voice to be determined does not include the control instruction for controlling the display equipment; and if the control instruction which is the same as the analysis result exists in the control instruction set in the display equipment, the voice to be determined comprises a control instruction for controlling the display equipment. In some embodiments, the control instruction set is a set of preset control instructions, and the preset control instructions are uniformly stored in the control instruction set.
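The comparison of the analysis result with the control instruction set might look like the following Python sketch. The instruction set contents and the function fake_cloud_analyze are hypothetical: the real analysis is performed by the cloud server and is not specified here, so a simple filler-word filter stands in for it.

```python
# Hypothetical preset control instruction set stored on the display device.
CONTROL_INSTRUCTION_SET = {"open homepage", "volume up", "volume down"}


def fake_cloud_analyze(text):
    """Stand-in for the cloud server analysis (assumption): drop filler words."""
    fillers = {"please", "the", "could", "you"}
    return " ".join(word for word in text.lower().split() if word not in fillers)


def find_control_instruction(text, analyze=fake_cloud_analyze,
                             instruction_set=CONTROL_INSTRUCTION_SET):
    """Return the matching control instruction, or None if there is none."""
    result = analyze(text)
    # The voice includes a control instruction only if an identical
    # instruction exists in the control instruction set.
    return result if result in instruction_set else None
```

Because the analysis step is passed in as a parameter, the same matching logic applies whether the analysis result comes from a remote server or from a function running on the device.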
In some embodiments, the step of determining whether the voice to be determined includes a control instruction for controlling the display device includes:
converting the voice to be determined into text content;
judging whether a control instruction which is the same as the text content exists in a control instruction set in the display equipment;
if so, determining that the voice to be determined includes a control instruction for controlling the display device;
if not, it is determined that the voice to be determined does not include a control instruction for controlling the display apparatus.
In some embodiments, the function of analyzing the text content may be integrated locally, and the text content is analyzed locally to obtain an analysis result. If the control instruction which is the same as the analysis result does not exist in the control instruction set in the display equipment, the voice to be determined does not include the control instruction for controlling the display equipment; and if the control instruction which is the same as the analysis result exists in the control instruction set in the display equipment, the voice to be determined comprises a control instruction for controlling the display equipment.
In some embodiments, if the voice to be determined includes a control instruction for controlling the display device, the corresponding operation is performed according to the control instruction. Illustratively, the control instructions instruct the display device to open a home page, and then control the display to display a home interface.
In some embodiments, if the voice to be determined does not include a control instruction for controlling the display device, the step of determining whether the external audio includes a wake-up word is performed again. It should be noted that, in the embodiment of the present application, the display device continuously receives the external audio. Therefore, each time this determination is repeated, it is made on the updated audio data.
In some embodiments, the controller is further configured to perform: if the voice to be determined does not include a control instruction for controlling the display device, controlling the display to display a prompt message. Because the user may use non-standard language, a prompt message is displayed on the display when the voice to be determined does not include a control instruction for controlling the display device. For example, the prompt message may state that a correct control instruction was not obtained. In addition, the user may be prompted with the content of a correct control instruction, which improves the user experience.
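One possible way to build such a prompt message is sketched below in Python. The embodiment does not say how the correct control instruction to suggest is chosen; using the closest-match search from the standard library module difflib is an assumption of this sketch, and build_prompt and the message wording are hypothetical.

```python
import difflib


def build_prompt(text, instruction_set, max_suggestions=3):
    """Build a prompt message for a voice input that matched no instruction."""
    # Closest-match lookup is an assumed technique, not taken from the embodiment.
    suggestions = difflib.get_close_matches(
        text.lower().strip(), sorted(instruction_set),
        n=max_suggestions, cutoff=0.6,
    )
    message = "No correct control instruction was obtained."
    if suggestions:
        message += " Did you mean: " + ", ".join(suggestions) + "?"
    return message
```

The cutoff of 0.6 is the library default and would need tuning for real speech recognition output.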
The above describes how a voice control instruction operates the display device when the display device is not in the remote video call state.
Another voice separation control method is described below. As shown in FIG. 6, the method includes:
S100, receiving external audio. This step is the same as the corresponding step described above for operating the display device by voice control instruction when the display device is not in the remote video call state, and is not described herein again.
In some embodiments, S200: if the display device is in the remote video call state and the far-field microphone switch is on, the external audio is transmitted to the object end of the video call. In the embodiment of the application, the external audio is transmitted to the object end of the video call through the first channel.
In the embodiment of the application, when the far-field microphone switch is turned on, the external audio is transmitted to the object end of the video call, and when the far-field microphone switch is turned off, the external audio is not transmitted to the object end of the video call. As shown in fig. 7 and 8, the far-field mic switch is on in fig. 7 and off in fig. 8.
In some embodiments, when the display device is in the remote video call state and the far-field microphone switch is on: S300, it is determined whether the external audio includes a wake-up word. S400, if the external audio is detected to include the wake-up word, the transmission of the external audio to the object end of the video call is suspended, and the voice to be determined is monitored. S500, if the external audio does not include the wake-up word, the external audio continues to be transmitted to the object end of the video call, and the step of determining whether the external audio includes the wake-up word is performed again.
In some embodiments, the external audio is transmitted through the second channel, and it is determined whether the external audio includes a wake-up word. When the external audio includes the wake-up word, the first channel suspends the transmission of the external audio to the object end of the video call. In the embodiment of the application, two channels are used, each responsible for different data: in parallel, one transmits the external audio to the object end of the video call, and the other transmits the data related to operating the display device by voice control instruction. In this way, the timing of suspending and resuming data transmission in the first channel can be controlled more accurately. If there were only one channel, it would have to transmit both the external audio destined for the object end of the video call and the data related to operating the display device by voice control instruction. When an instruction to suspend or resume transmission arrived, the presence of the voice control data on the same channel could prevent the transmission from being suspended or resumed in time, which would degrade the user experience.
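The benefit of two channels can be made concrete with a small Python sketch in which only the first channel is ever paused. The class TwoChannelAudio and its method names are hypothetical, and audio frames are represented by plain strings for readability.

```python
class TwoChannelAudio:
    """Hypothetical model of the far-field module's two channels."""

    def __init__(self):
        self.sent_to_peer = []      # delivered by the first channel (video call)
        self.sent_to_control = []   # delivered by the second channel (voice control)
        self.first_channel_paused = False

    def pause_call_audio(self):
        # Called when the wake-up word is detected.
        self.first_channel_paused = True

    def resume_call_audio(self):
        # Called once the voice to be determined has been handled.
        self.first_channel_paused = False

    def on_frame(self, frame):
        # The second channel always carries audio to the voice control path.
        self.sent_to_control.append(frame)
        # The first channel forwards to the call peer unless it is paused.
        if not self.first_channel_paused:
            self.sent_to_peer.append(frame)
```

Pausing affects only the first channel's flag, so the voice control path keeps receiving audio while the call audio is suspended; with a single shared channel this separation would not be available.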
In the embodiment of the present application, the step of determining whether the external audio includes the wake-up word is the same as the corresponding step described above for the case in which the display device is not in the remote video call state, so details are not repeated herein.
S600, receiving the voice to be determined input by the user. S700, judging whether the voice to be determined comprises a control instruction for controlling the display equipment.
In the embodiment of the application, after it is detected that the external audio includes the wake-up word, the speech spoken by the user is taken as the voice to be determined input by the user. Because speech spoken during a remote video call does not necessarily follow any standard form, the display device cannot necessarily recognize it as a control instruction. It is therefore necessary to judge whether the voice to be determined includes a control instruction for controlling the display device.
S800, if the voice to be determined includes a control instruction for controlling the display device, the corresponding operation is executed according to the control instruction, and the transmission of the external audio to the object end of the video call is resumed. In the embodiment of the application, after the control instruction is confirmed, the first channel is again used to transmit the external audio to the object end of the video call.
In the embodiment of the present application, the step of determining whether the voice to be determined includes a control instruction for controlling the display device is the same as the corresponding step described above for the case in which the display device is not in the remote video call state, and is therefore not described again here.
In some embodiments, S900: if the voice to be determined does not include a control instruction for controlling the display device, the transmission of the external audio to the object end of the video call is resumed, and the step of determining whether the external audio includes a wake-up word is performed again. In some embodiments, if the voice to be determined does not include a control instruction for controlling the display device, the display is controlled to display a prompt message.
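Steps S100 to S900 can be summarized in one illustrative Python sketch. It assumes the display device is in the remote video call state with the far-field microphone switch on, and it works on already-recognized utterance text rather than raw audio. The function name run_call_session, the exact-match rule, and the choice not to forward the utterance containing the wake-up word are assumptions of this sketch.

```python
def run_call_session(utterances, wake_word, instruction_set):
    """Simulate S100-S900 over a sequence of recognized utterances.

    Returns (forwarded, executed, prompts):
      forwarded - utterances transmitted to the object end of the video call
      executed  - control instructions that were carried out
      prompts   - prompt messages shown on the display
    """
    forwarded, executed, prompts = [], [], []
    awaiting_command = False
    for text in utterances:                       # S100: receive external audio
        normalized = text.lower().strip()
        if awaiting_command:                      # S600/S700: voice to be determined
            if normalized in instruction_set:
                executed.append(normalized)       # S800: execute the instruction
            else:                                 # S900: no instruction, show prompt
                prompts.append("No correct control instruction was obtained.")
            awaiting_command = False              # transmission resumes either way
            continue
        if wake_word in normalized:               # S300/S400: pause transmission
            awaiting_command = True
            continue
        forwarded.append(text)                    # S200/S500: transmit to the peer
    return forwarded, executed, prompts
```

For instance, with the wake-up word "hi minuscule" and the instruction set {"open homepage"}, the sequence "hello grandma", "hi minuscule", "open homepage", "see you" forwards only the first and last utterances to the call peer and executes "open homepage".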
In some embodiments, if the display device is in a remote video call state and the far-field microphone switch is off, the display is controlled to display a microphone-off message, and the external audio is not transmitted to the object end of the video call; if the speech text includes a wake-up word, the voice to be determined input by the user is received, and if the voice to be determined includes a control instruction for controlling the display device, the corresponding operation is executed according to the control instruction. In some embodiments, if the far-field microphone switch is off, the microphone-off message is displayed on the display interface. Illustratively, the microphone-off message is that the user can not hear sound from the other party. In this way, the remote audio control state is clearly displayed, the problem of the user being unclear about the state is solved, and the user experience is improved.
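The combinations of call state and far-field microphone switch described in this application can be laid out as a small decision function. The names AudioPolicy and audio_policy are hypothetical. The sketch assumes that wake-up word listening stays enabled in every combination; the text above states this for the in-call cases and for the not-in-call case, but does not discuss the microphone switch outside a call.

```python
from typing import NamedTuple


class AudioPolicy(NamedTuple):
    forward_to_peer: bool        # transmit external audio to the call peer
    show_mic_off_message: bool   # display the microphone-off message
    listen_for_wake_word: bool   # keep checking for the wake-up word


def audio_policy(in_video_call, far_field_mic_on):
    """Decide how external audio is handled for a given device state."""
    return AudioPolicy(
        forward_to_peer=in_video_call and far_field_mic_on,
        show_mic_off_message=in_video_call and not far_field_mic_on,
        listen_for_wake_word=True,   # assumption: always on in this sketch
    )
```

With the microphone switch off during a call, audio_policy(True, False) yields no forwarding, a visible microphone-off message, and continued wake-up word listening, so voice control remains usable.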
In the above embodiments, the voice separation control method and the display device allow the television to be controlled by voice during a remote video call, which improves the user experience. The method includes: receiving external audio; if the display device is in a remote video call state and the far-field microphone switch is on, transmitting the external audio to the object end of the video call; determining whether the external audio includes a wake-up word and, if so, suspending transmission of the external audio to the object end of the video call and listening for the voice to be determined; receiving the voice to be determined input by the user and, if it includes a control instruction for controlling the display device, executing the corresponding operation according to the control instruction and resuming transmission of the external audio to the object end of the video call.
Finally, it should be noted that the above embodiments are only intended to illustrate the technical solutions of the present application, not to limit them. Although the present application has been described in detail with reference to the foregoing embodiments, those of ordinary skill in the art should understand that the technical solutions described in the foregoing embodiments may still be modified, or some or all of their technical features may be equivalently replaced, and that such modifications or substitutions do not cause the essence of the corresponding technical solutions to depart from the scope of the technical solutions of the embodiments of the present application.
The foregoing description, for purposes of explanation, has been presented in conjunction with specific embodiments. However, the illustrative discussions above are not intended to be exhaustive or to limit the embodiments to the precise forms disclosed above. Many modifications and variations are possible in light of the above teaching. The embodiments were chosen and described in order to best explain the principles and the practical application, to thereby enable others skilled in the art to best utilize the embodiments and various embodiments with various modifications as are suited to the particular use contemplated.

Claims (10)

1. A display device, comprising:
a display for displaying a user interface;
a user interface for receiving an input signal;
a controller respectively coupled to the display and the user interface for performing:
receiving external audio; if the display device is in a remote video call state and the far-field microphone switch is on, transmitting the external audio to the object end of the video call;
determining whether the external audio comprises a wake-up word and, if so, suspending transmission of the external audio to the object end of the video call and listening for the voice to be determined; receiving the voice to be determined input by a user and, if the voice to be determined comprises a control instruction for controlling the display device, executing the corresponding operation according to the control instruction and resuming transmission of the external audio to the object end of the video call.
2. The display device according to claim 1, wherein the controller is configured to determine whether the external audio comprises a wake-up word according to the following steps:
converting the external audio into voice text, and determining whether the wake-up word is present in the voice text;
if the wake-up word is present, determining that the external audio comprises the wake-up word;
if the wake-up word is not present, determining that the external audio does not comprise the wake-up word.
3. The display device according to claim 1, wherein the controller is configured to determine whether the voice to be determined comprises a control instruction for controlling the display device according to the following steps:
converting the voice to be determined into text content;
determining whether a control instruction that is the same as the text content exists in a control instruction set in the display device;
if such a control instruction exists, determining that the voice to be determined comprises a control instruction for controlling the display device;
if not, determining that the voice to be determined does not comprise a control instruction for controlling the display device.
4. The display device according to claim 1, wherein the controller is further configured to perform: if the voice to be determined does not comprise a control instruction for controlling the display device, resuming transmission of the external audio to the object end of the video call, and repeating the step of determining whether the external audio comprises the wake-up word.
5. The display device according to claim 1, wherein the controller is further configured to perform: if the voice to be determined does not comprise a control instruction for controlling the display device, controlling the display to display a prompt message.
6. The display device according to claim 1, wherein the controller is further configured to perform: if the external audio does not comprise the wake-up word, continuing to transmit the external audio to the object end of the video call, and repeating the step of determining whether the external audio comprises the wake-up word.
7. The display device according to claim 1, wherein the controller is further configured to perform:
if the display device is not in a remote video call state and the external audio comprises a wake-up word, receiving the voice to be determined input by the user and, if the voice to be determined comprises a control instruction for controlling the display device, executing the corresponding operation according to the control instruction.
8. The display device according to claim 1, wherein the controller is further configured to perform:
if the display device is in a remote video call state and the far-field microphone switch is off, not transmitting the external audio to the object end of the video call; and if the voice text converted from the external audio comprises a wake-up word, receiving the voice to be determined input by the user and, if the voice to be determined comprises a control instruction for controlling the display device, executing the corresponding operation according to the control instruction.
9. The display device according to claim 8, wherein the controller is further configured to perform: controlling the display to display a microphone-off message.
10. A voice separation control method, comprising:
receiving external audio; if the display device is in a remote video call state and the far-field microphone switch is on, transmitting the external audio to the object end of the video call;
determining whether the external audio comprises a wake-up word and, if so, suspending transmission of the external audio to the object end of the video call and listening for the voice to be determined; receiving the voice to be determined input by a user and, if the voice to be determined comprises a control instruction for controlling the display device, executing the corresponding operation according to the control instruction and resuming transmission of the external audio to the object end of the video call.
CN202110294224.8A 2021-03-19 2021-03-19 Voice separation control method and display device Pending CN114302197A (en)

Priority Applications (1)

Application Number Priority Date Filing Date Title
CN202110294224.8A CN114302197A (en) 2021-03-19 2021-03-19 Voice separation control method and display device

Publications (1)

Publication Number Publication Date
CN114302197A true CN114302197A (en) 2022-04-08

Family

ID=80964600

Family Applications (1)

Application Number Title Priority Date Filing Date
CN202110294224.8A Pending CN114302197A (en) 2021-03-19 2021-03-19 Voice separation control method and display device

Country Status (1)

Country Link
CN (1) CN114302197A (en)

Cited By (1)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN117174089A (en) * 2023-11-01 2023-12-05 中电科新型智慧城市研究院有限公司 Control method, control device, electronic equipment and storage medium

Citations (6)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN105513596A (en) * 2013-05-29 2016-04-20 华为技术有限公司 Voice control method and control device
CN108665895A (en) * 2018-05-03 2018-10-16 百度在线网络技术(北京)有限公司 Methods, devices and systems for handling information
CN109688269A (en) * 2019-01-03 2019-04-26 百度在线网络技术(北京)有限公司 The filter method and device of phonetic order
US10649727B1 (en) * 2018-05-14 2020-05-12 Amazon Technologies, Inc. Wake word detection configuration
CN111556197A (en) * 2020-04-26 2020-08-18 北京小米松果电子有限公司 Method and device for realizing voice assistant and computer storage medium
CN112511882A (en) * 2020-11-13 2021-03-16 海信视像科技股份有限公司 Display device and voice call-up method

Legal Events

Date Code Title Description
PB01 Publication
SE01 Entry into force of request for substantive examination