CN110740367A - Display device and voice instruction processing method - Google Patents

Display device and voice instruction processing method Download PDF

Info

Publication number
CN110740367A
CN110740367A CN201911014297.6A CN201911014297A CN110740367A CN 110740367 A CN110740367 A CN 110740367A CN 201911014297 A CN201911014297 A CN 201911014297A CN 110740367 A CN110740367 A CN 110740367A
Authority
CN
China
Prior art keywords
display
user
voice
controller
command
Prior art date
Legal status (The legal status is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the status listed.)
Pending
Application number
CN201911014297.6A
Other languages
Chinese (zh)
Inventor
吴松
Current Assignee (The listed assignees may be inaccurate. Google has not performed a legal analysis and makes no representation or warranty as to the accuracy of the list.)
Hisense Electronic Technology (wuhan) Co Ltd
Original Assignee
Hisense Electronic Technology (wuhan) Co Ltd
Priority date (The priority date is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the date listed.)
Filing date
Publication date
Application filed by Hisense Electronic Technology (wuhan) Co Ltd filed Critical Hisense Electronic Technology (wuhan) Co Ltd
Priority to CN201911014297.6A priority Critical patent/CN110740367A/en
Publication of CN110740367A publication Critical patent/CN110740367A/en
Pending legal-status Critical Current

Links

Images

Classifications

    • HELECTRICITY
    • H04ELECTRIC COMMUNICATION TECHNIQUE
    • H04NPICTORIAL COMMUNICATION, e.g. TELEVISION
    • H04N21/00Selective content distribution, e.g. interactive television or video on demand [VOD]
    • H04N21/40Client devices specifically adapted for the reception of or interaction with content, e.g. set-top-box [STB]; Operations thereof
    • H04N21/41Structure of client; Structure of client peripherals
    • H04N21/422Input-only peripherals, i.e. input devices connected to specially adapted client devices, e.g. global positioning system [GPS]
    • H04N21/42203Input-only peripherals, i.e. input devices connected to specially adapted client devices, e.g. global positioning system [GPS] sound input device, e.g. microphone
    • GPHYSICS
    • G06COMPUTING; CALCULATING OR COUNTING
    • G06FELECTRIC DIGITAL DATA PROCESSING
    • G06F3/00Input arrangements for transferring data to be processed into a form capable of being handled by the computer; Output arrangements for transferring data from processing unit to output unit, e.g. interface arrangements
    • G06F3/14Digital output to display device ; Cooperation and interconnection of the display device with other functional units
    • G06F3/1423Digital output to display device ; Cooperation and interconnection of the display device with other functional units controlling a plurality of local displays, e.g. CRT and flat panel display
    • GPHYSICS
    • G10MUSICAL INSTRUMENTS; ACOUSTICS
    • G10LSPEECH ANALYSIS TECHNIQUES OR SPEECH SYNTHESIS; SPEECH RECOGNITION; SPEECH OR VOICE PROCESSING TECHNIQUES; SPEECH OR AUDIO CODING OR DECODING
    • G10L15/00Speech recognition
    • G10L15/22Procedures used during a speech recognition process, e.g. man-machine dialogue
    • HELECTRICITY
    • H04ELECTRIC COMMUNICATION TECHNIQUE
    • H04NPICTORIAL COMMUNICATION, e.g. TELEVISION
    • H04N21/00Selective content distribution, e.g. interactive television or video on demand [VOD]
    • H04N21/40Client devices specifically adapted for the reception of or interaction with content, e.g. set-top-box [STB]; Operations thereof
    • H04N21/41Structure of client; Structure of client peripherals
    • H04N21/422Input-only peripherals, i.e. input devices connected to specially adapted client devices, e.g. global positioning system [GPS]
    • H04N21/42204User interfaces specially adapted for controlling a client device through a remote control device; Remote control devices therefor
    • H04N21/42206User interfaces specially adapted for controlling a client device through a remote control device; Remote control devices therefor characterized by hardware details
    • H04N21/4222Remote control device emulator integrated into a non-television apparatus, e.g. a PDA, media center or smart toy
    • HELECTRICITY
    • H04ELECTRIC COMMUNICATION TECHNIQUE
    • H04NPICTORIAL COMMUNICATION, e.g. TELEVISION
    • H04N21/00Selective content distribution, e.g. interactive television or video on demand [VOD]
    • H04N21/40Client devices specifically adapted for the reception of or interaction with content, e.g. set-top-box [STB]; Operations thereof
    • H04N21/41Structure of client; Structure of client peripherals
    • H04N21/422Input-only peripherals, i.e. input devices connected to specially adapted client devices, e.g. global positioning system [GPS]
    • H04N21/42204User interfaces specially adapted for controlling a client device through a remote control device; Remote control devices therefor
    • H04N21/42206User interfaces specially adapted for controlling a client device through a remote control device; Remote control devices therefor characterized by hardware details
    • H04N21/42221Transmission circuitry, e.g. infrared [IR] or radio frequency [RF]
    • GPHYSICS
    • G10MUSICAL INSTRUMENTS; ACOUSTICS
    • G10LSPEECH ANALYSIS TECHNIQUES OR SPEECH SYNTHESIS; SPEECH RECOGNITION; SPEECH OR VOICE PROCESSING TECHNIQUES; SPEECH OR AUDIO CODING OR DECODING
    • G10L15/00Speech recognition
    • G10L15/22Procedures used during a speech recognition process, e.g. man-machine dialogue
    • G10L2015/223Execution procedure of a spoken command
    • GPHYSICS
    • G10MUSICAL INSTRUMENTS; ACOUSTICS
    • G10LSPEECH ANALYSIS TECHNIQUES OR SPEECH SYNTHESIS; SPEECH RECOGNITION; SPEECH OR VOICE PROCESSING TECHNIQUES; SPEECH OR AUDIO CODING OR DECODING
    • G10L15/00Speech recognition
    • G10L15/22Procedures used during a speech recognition process, e.g. man-machine dialogue
    • G10L2015/225Feedback of the input speech

Landscapes

  • Engineering & Computer Science (AREA)
  • Human Computer Interaction (AREA)
  • Multimedia (AREA)
  • Signal Processing (AREA)
  • Physics & Mathematics (AREA)
  • Theoretical Computer Science (AREA)
  • General Physics & Mathematics (AREA)
  • General Engineering & Computer Science (AREA)
  • Computational Linguistics (AREA)
  • Health & Medical Sciences (AREA)
  • Audiology, Speech & Language Pathology (AREA)
  • Acoustics & Sound (AREA)
  • Controls And Circuits For Display Device (AREA)

Abstract

The embodiment of the application provides display equipment and a voice instruction processing method, wherein the display equipment comprises a th display, a second display, a voice recognition part and a controller, wherein the voice recognition part is used for receiving voice input of a user, the controller is configured to control the th display and the second display to display a recognition result of the voice input on the second display in response to the voice instruction input by the user, and is also used for displaying summary information of the response result of the voice input on the second display.

Description

Display device and voice instruction processing method
Technical Field
The embodiment of the application relates to the technology of intelligent display equipment, in particular to display equipment and a voice instruction processing method.
Background
With the continuous development of television technology and internet technology, an internet-based smart television is appeared. The smart television is based on the internet technology, is provided with an open operating system and a chip, is provided with an open application platform, and can support various functions such as audio and video, entertainment, data and the like, so that the diversified demands of users are met, and brand new use experience is brought to the users. In the aspect of user interaction, the smart television can support a voice interaction mode, and a user can control the smart television through voice.
In the prior art, the smart television comprises screens, and when a user controls the smart television through voice, the response information of the smart television to the voice command of the user is displayed on the screens.
Disclosure of Invention
The embodiment of the application provides display devices and a voice instruction processing method, and the technical scheme is as follows.
, embodiments of the present application provide a display device, comprising:
th display;
a second display;
a voice recognition part for receiving a voice input of a user;
a controller configured to:
the control unit is used for controlling the th display and the second display to display the recognition result of the voice input on the second display in response to the voice command input by the user and displaying the summary information of the response result of the voice input on the second display.
, the controller is configured to:
and responding to th preset key instruction input by a user, and displaying the response result on a th display.
, the controller is configured to:
and responding to the fact that the voice instruction is an instruction in a preset instruction set and the th display is in a bright screen state, and outputting a response result on the th display.
, the controller is configured to:
and responding to the voice instruction as an instruction in a preset instruction set, the th display in a breath screen state and the audio data in the response result, and playing the audio data.
, the controller is configured to:
and responding to a second preset key instruction input by the user, and displaying the summary information of the adjusted response result on the second display.
Further , the voice command is a play music command.
In a second aspect, an embodiment of the present application provides methods for processing a voice instruction, including:
receiving a voice input of a user;
in response to a voice instruction input by a user, a recognition result of the voice input is displayed on a second display, and summary information of a response result of the voice input is displayed on the second display.
Further to step , the method further comprises:
receiving th preset key instruction input by a user;
and responding to the th preset key instruction, and displaying the response result on a th display.
Further to step , the method further comprises:
and responding to the fact that the voice instruction is an instruction in a preset instruction set and the th display is in a bright screen state, and outputting a response result on the th display.
In a third aspect, an embodiment of the present application provides kinds of computing devices, including:
a memory for storing program instructions;
and the processor is used for calling the program instructions stored in the memory and executing the method of the second aspect according to the obtained program.
According to the display device and the voice instruction processing method provided by the embodiment of the application, after a user inputs a voice instruction to the display device, the second display in the display device displays the recognition result of the voice input, meanwhile, the controller acquires the response result, and then the summary information of the response result is displayed on the second display, in the process, the voice instruction and the summary information of the response result are displayed on the second display, the display can not be influenced by the voice interaction process, if the display is in a bright screen state, the information before the voice instruction input can be continuously displayed, and if the display is in a breath screen state, the breath screen state can be kept unchanged.
Drawings
In order to more clearly illustrate the technical solutions of the present invention or the prior art, is briefly introduced in the following for the drawings needed to be used in the description of the embodiments or the prior art, it is obvious that the drawings in the following description are embodiments of the present invention, and other drawings can be obtained according to these drawings for those skilled in the art without creative efforts.
Fig. 1 is a schematic diagram illustrating an operation scenario between a display device and a control apparatus according to an embodiment;
FIG. 2 is a block diagram of a hardware configuration of a display device 200 shown in accordance with an exemplary embodiment at ;
fig. 3 is a block diagram illustrating a hardware configuration of a display device 200 according to another exemplary embodiment ;
fig. 4 is a block diagram illustrating a hardware configuration of the control apparatus 100 according to an exemplary embodiment of the present application;
FIG. 5 is a schematic diagram of a functional configuration of a display device 200 shown in accordance with an exemplary embodiment at ;
fig. 6 is a schematic flowchart of a voice instruction processing method according to an embodiment of the present application;
fig. 7 is an exemplary diagram of module partitioning and message interaction in the controller 210 and the controller 310;
fig. 8 is a flowchart illustrating a voice command processing method according to an embodiment of the present application.
Detailed Description
To make the objects, technical solutions and advantages of the present invention clearer and more complete description of the technical solutions in the embodiments of the present application will be given below with reference to the drawings in the embodiments of the present application, and it is obvious that the described embodiments are some, but not all embodiments of the present invention.
In the prior art, the intelligent television comprises screens, and when the user controls the intelligent television through voice, the response information of the intelligent television to the voice command of the user is displayed on the screens.
As possible implementations, the smart television may further include multiple screens, i.e., multiple displays, which is called a multi-screen smart television, in the multi-screen smart television, main screens and at least auxiliary screens are included.
When the multi-screen smart television supports a voice interaction mode, how to respond to a voice instruction of a user through the main screen and the auxiliary screen is a problem to be solved.
When a user sends a voice instruction to the display equipment, the secondary screen displays summary information of a response result of the voice instruction without influencing the current state of the primary screen, so that good user experience is realized.
For convenience of description, the following embodiments of the present application will simply refer to a multi-screen display device as a display device.
Fig. 1 is a schematic diagram illustrating an operation scenario between a display device and a control apparatus according to an embodiment. As shown in fig. 1, a user may operate the display apparatus 200 through the control device 100.
The control device 100 may be a remote controller 100A, which can communicate with the display device 200 through an infrared protocol communication, a bluetooth protocol communication, a ZigBee (ZigBee) protocol communication, or other short-range communication, and is used to control the display device 200 in a wireless or other wired manner. The user may input a user instruction through a key on a remote controller, voice input, control panel input, etc., to control the display apparatus 200. Such as: the user can input a corresponding control command through a volume up/down key, a channel control key, up/down/left/right moving keys, a voice input key, a menu key, a power on/off key, etc. on the remote controller, to implement the function of controlling the display device 200.
The control apparatus 100 may also be a smart device, such as a mobile terminal 100B, a tablet computer, a notebook computer, etc., which may communicate with the display device 200 through a Local Area Network (LAN), a domain Network (WAN), a Wireless Local Area Network (WLAN), or other networks, and implement control of the display device 200 through an application program corresponding to the display device 200.
For example, the mobile terminal 100B and the display device 200 may both be installed with software applications, so that connection communication between the two may be implemented through a network communication protocol, and further, the purpose of controlling operations and data communication of may be implemented by .
As shown in FIG. 1, the display device 200 may also be in data communication with the server 300 via a variety of communication means, the display device 200 may be permitted to be communicatively coupled via a Local Area Network (LAN), a Wireless Local Area Network (WLAN), and other networks the server 300 may provide various content and interactions to the display device 200. illustratively, the display device 200 receives software program updates by sending and receiving information, as well as Electronic Program Guide (EPG) interactions, or accesses a remotely stored digital media library. the server 300 may be sets, or multiple sets, or or more types of servers, other network service content such as video-on-demand and announcement services are provided via the server 300.
The display device 200, which may be a liquid crystal display, an OLED display, a projection display device, the specific display device type, size and resolution, etc. are not limiting, and it will be appreciated by those skilled in the art that the display device 200 may be modified in performance and configuration as desired.
The display device 200 may additionally provide an intelligent network television function of a computer support function in addition to the broadcast receiving television function, examples include a network television, a display device, an Internet Protocol Television (IPTV), and the like.
As shown in fig. 1, a camera may be connected or disposed on the display device, and is used to present a picture taken by the camera on a display interface of the display device or other display devices, so as to implement interactive chat between users. Specifically, the picture shot by the camera can be displayed on the display device in a full screen mode, a half screen mode or any optional area.
In other examples, more or less functions may be added or subtracted.
Fig. 2 is a block diagram illustrating a hardware configuration of a display apparatus 200 according to an exemplary embodiment of the present application, as shown in fig. 2, the display apparatus 200 may include a tuner demodulator 220, a communicator 230, a detector 240, an external device interface 250, a controller 210, a memory 290, a user input interface, a video processor 260-1, an audio processor 260-2, a voice recognition part 260-3, a th display 280-1, a second display 280-2, an audio output interface 272, and a power supply.
In the present embodiment, the th display 280-1 may correspond to the aforementioned main screen, and the second display 280-2 may correspond to the aforementioned sub screen.
The tuner/demodulator 220 receives broadcast television signals in a wired or wireless manner, and may perform modulation/demodulation processes such as amplification, mixing, resonance, etc., so as to demodulate the audio/video signal carried in the frequency of the television channel selected by the user and additional information (e.g., EPG data signal) from a plurality of wireless or wired broadcast television signals.
The tuner demodulator 220 is responsive to the user-selected television channel frequency and the television signal carried thereby, as selected by the user and as controlled by the controller 210.
The tuner demodulator 220 may receive signals according to different broadcast formats of the television signal , such as terrestrial broadcast, cable broadcast, satellite broadcast, internet broadcast, etc., and may demodulate analog signals and digital signals according to different modulation types, digital modulation formats, analog modulation formats, and different types of received television signals.
In other exemplary embodiments, the tuning demodulator 220 may be in an external device, such as an external set-top box, so that the set-top box outputs television audio/video signals through modulation and demodulation, and then inputs the television audio/video signals into the display device 200 through the external device interface 250.
The communicator 230 is a component for communicating with an external device or an external server according to various communication protocol types. For example: the communicator 230 may include a WIFI module 231, a bluetooth communication protocol module 232, a wired ethernet communication protocol module 233, and other network communication protocol modules or near field communication protocol modules.
The display apparatus 200 may establish a connection of a control signal and a data signal with an external control apparatus or a content providing apparatus through the communicator 230. For example, the communicator may receive a control signal of the remote controller 100 according to the control of the controller.
The detector 240 is a component of the display apparatus 200 for collecting signals of an external environment or interaction with the outside. The detector 240 may include a light receiver 242, a sensor for collecting the intensity of ambient light, which may be used to adapt to display parameter changes, etc.; the system can further include an image collector 241, such as a camera, etc., which can be used for collecting external environment scenes, collecting attributes of the user or interacting gestures with the user, adaptively changing display parameters, and recognizing user gestures, so as to realize the function of interaction with the user.
In other exemplary embodiments, detector 240 may further include a temperature sensor, such as by sensing ambient temperature, and display device 200 may be adapted to adjust the color temperature of the image displayed, such as by adjusting display device 200 to display a cooler color temperature of the image when the temperature is higher, or to adjust display device 200 to display a warmer color temperature when the temperature is lower.
In other exemplary embodiments, detector 240 may further include a sound collector, such as a microphone, that may be used to receive a user's voice, a voice signal including control instructions for the user to control display device 200, or to collect ambient sound for identifying an ambient scene type, and display device 200 may adapt to ambient noise.
The external device interface 250 provides a component for the controller 210 to control data transmission between the display apparatus 200 and other external apparatuses. The external device interface may be connected with an external apparatus such as a set-top box, a game device, a notebook computer, etc. in a wired/wireless manner, and may receive data such as a video signal (e.g., moving image), an audio signal (e.g., music), additional information (e.g., EPG), etc. of the external apparatus.
The external device interface 250 may include or more of a High Definition Multimedia Interface (HDMI) terminal 251, a Composite Video Blanking Sync (CVBS) terminal 252, an analog or digital component terminal 253, a Universal Serial Bus (USB) terminal 254, a red, green, blue (RGB) terminal (not shown), and the like.
The controller 210 controls the operation of the display device 200 and responds to the operation of the user by running various software control programs (such as an operating system and various application programs) stored on the memory 290.
As shown in fig. 2, the controller 210 includes a random access memory RAM213, a read only memory ROM214, a graphics processor 216, a CPU processor 212, a communication interface 218, and a communication bus. The RAM213 and the ROM214, the graphic processor 216, the CPU processor 212, and the communication interface 218 are connected via a bus.
A ROM213 for storing instructions for various system boots. If the display device 200 is powered on upon receipt of the power-on signal, the CPU processor 212 executes a system boot instruction in the ROM and copies the operating system stored in the memory 290 to the RAM214 to start running the boot operating system. After the start of the operating system is completed, the CPU processor 212 copies the various application programs in the memory 290 to the RAM214, and then starts running and starting the various application programs.
The graphics processor 216 is used for generating various graphic objects such as icons, operation menus, and user input instructions to display graphics, etc., includes an operator for performing operations by receiving various interactive instructions input by a user to display various objects according to display attributes, and includes a renderer for generating various objects based on the operator, and the result of rendering is displayed on the th display 280-1 or the second display 280-2.
A CPU processor 212 for executing operating system and application program instructions stored in memory 290. And executing various application programs, data and contents according to various interactive instructions received from the outside so as to finally display and play various audio and video contents.
In exemplary embodiments, the CPU processor 212 may include a plurality of processors, the plurality of processors may include main processors and a plurality of or sub-processors, the main processor for performing operations of the display apparatus 200 in a pre-power-up mode and/or operations of displaying a screen in a normal mode, the plurality of or sub-processors for performing operations in a standby mode or the like.
The communication interfaces, which may include th interface 218-1 through nth interface 218-n, may be network interfaces that are connected to external devices via a network.
For example, in response to receiving a user command for selecting a UI object to be displayed on the th display 280-1 or the second display 280-2, the controller 210 may perform an operation related to the object selected by the user command.
The user command for selecting the UI object may be a command input through various input means (e.g., a mouse, a keyboard, a touch pad, etc.) connected to the display apparatus 200 or a voice command corresponding to a voice spoken by the user.
The memory 290 includes a memory for storing various software modules for driving and controlling the display apparatus 200. Such as: various software modules stored in memory 290, including: the system comprises a basic module, a detection module, a communication module, a display control module, a browser module, various service modules and the like.
The basic module is a bottom layer software module for signal communication between hardware in the display device 200 and sending processing and control signals to an upper layer module. The detection module is a management module used for collecting various information from various sensors or user input interfaces, and performing digital-to-analog conversion and analysis management.
For example, the voice recognition module includes a voice parsing module and a voice command database module, the display control module is a module for controlling th display 280-1 or second display 280-2 to display image content, and may be used to play multimedia image content and information such as UI interface, the communication module is a module for controlling and data communication with external devices, the browser module is a module for performing data communication between browsing servers, and the service module is a module for providing various services and various applications.
Meanwhile, the memory 290 is also used to store visual effect maps and the like for receiving external data and user data, images of respective items in various user interfaces, and a focus object.
A user input interface for transmitting an input signal of a user to the controller 210 or transmitting a signal output from the controller to the user. For example, the control device (e.g., a mobile terminal or a remote controller) may send an input signal, such as a power switch signal, a channel selection signal, a volume adjustment signal, etc., input by a user to the user input interface, and then the input signal is forwarded to the controller by the user input interface; alternatively, the control device may receive an output signal such as audio, video, or data output from the user input interface via the controller, and display the received output signal or output the received output signal in audio or vibration form.
In embodiments, the user may input a user command on a Graphical User Interface (GUI) displayed on the th display 280-1 or the second display 280-2, the user input interface receiving the user input command through the Graphical User Interface (GUI), or alternatively, the user may input a user command by inputting a specific sound or gesture, the user input interface receiving the user input command by recognizing the sound or gesture through the sensor.
The video processor 260-1 is configured to receive the video signal, and perform video data processing such as decompression, decoding, scaling, noise reduction, frame rate conversion, resolution conversion, and image synthesis according to a standard codec protocol of the input signal, so as to obtain a video signal that is directly displayed or played on the -th display 280-1 or the second display 280-2.
Illustratively, the video processor 260-1 includes a demultiplexing module, a video decoding module, an image synthesizing module, a frame rate conversion module, a display formatting module, and the like.
The demultiplexing module is used for demultiplexing the input audio and video data stream, and if the input MPEG-2 is input, the demultiplexing module demultiplexes the input audio and video data stream into a video signal and an audio signal.
And the video decoding module is used for processing the video signal after demultiplexing, including decoding, scaling and the like.
And the image synthesis module is used for carrying out superposition mixing processing on the GUI signal input by the user or generated by the user and the video image after the zooming processing by the graphic generator so as to generate an image signal for display.
The frame rate conversion module is configured to convert a frame rate of an input video, such as a 24Hz, 25Hz, 30Hz, or 60Hz video, into a 60Hz, 120Hz, or 240Hz frame rate, where the input frame rate may be related to a source video stream, and the output frame rate may be related to an update rate of a display. The input is realized in a common format by using a frame insertion mode.
And a display formatting module for converting the signal output by the frame rate conversion module into a signal conforming to a display format of a display, such as converting the format of the signal output by the frame rate conversion module to output an RGB data signal.
-th display 280-1 and second display 280-2 for receiving image signals from the video processor 260-1 for displaying video content and images and menu manipulation interfaces, respectively -th display 280-1 and second display 280-2 include display components for presenting pictures and driving components for driving the display of images, respectively, display video content, either from broadcast signals received by the tuner demodulator 220 or from communicator or external device interface inputs, display 220, and simultaneously display a user manipulation interface UI generated in the display device 200 for controlling the display device 200.
And the th display 280-1 and the second display 280-2 further include driving components for driving the display, respectively, according to the th display 280-1 and the second display 280-2, or kinds of projection devices and projection screens may be further included, provided that the th display 280-1 and/or the second display 280-2 are kinds of projection displays.
The audio processor 260-2 is configured to receive an audio signal, decompress and decode the audio signal according to a standard codec protocol of the input signal, and perform noise reduction, digital-to-analog conversion, amplification and other audio data processing to obtain an audio signal that can be played in the speaker 272.
An audio output interface 270 for receiving the audio signal output by the audio processor 260-2 under the control of the controller 210, wherein the audio output interface may include a speaker 272 or an external sound output terminal 274 for outputting to a generating device of an external device, such as: external sound terminal or earphone output terminal.
In other exemplary embodiments, video processor 260-1 may include or more chips.Audio processor 260-2 may also include or more chips.
And, in other exemplary embodiments, video processor 260-1 and audio processor 260-2 may be separate chips or may be integrated with controller 210 in or more chips.
The voice recognition unit 260-3 is used for receiving voice input and analyzing the voice input into commands that can be recognized by the controller.
And a power supply for supplying power supply support to the display apparatus 200 from the power input from the external power source under the control of the controller 210. The power supply may include a built-in power supply circuit installed inside the display apparatus 200, or may be a power supply installed outside the display apparatus 200, such as a power supply interface for providing an external power supply in the display apparatus 200.
In FIG. 2, the controller 210 controls the th display 280-1 and the display of the second display 280-2.
In this mode, the recognition unit 260-3 receives a voice input from a user, and the controller 210 displays a recognition result of the voice input on the second display 280-2 and summary information of a response result of the voice input on the second display in response to a voice command input from the user.
As an alternative to , after the summary information is displayed on the second display, if the th preset key command is inputted by the user, the controller 210 displays the above response result on the th display 280-1 in response to the th preset key command.
In this alternative, the controller 210 first adjusts the state of the th display 280-1 to a bright screen state if the th display 280-1 is in a screen-down state before the controller 210 displays the above-described response result.
alternatively, if the voice command input by the user is a command in the preset command set and the th display 280-1 is in a bright screen state, the controller 210 outputs the result of the response on the th display 280-1.
As an alternative to , if the voice command is a command in the preset command set, the th display 280-1 is in the screen saver state, and the above response result includes audio data, the controller 210 plays the audio data.
In this alternative, after the summary information is displayed on the second display, if the user inputs a second preset key command, the controller 210 adjusts the response result according to the second preset key command, and displays the summary information of the adjusted response result on the second display 280-2.
The detailed processing of the controller will be described in detail in the following method embodiments.
In another alternative, the controller 210 may control the th display 280-1, and the controller 310 controls the display of the second display 280-2 based thereon.
Fig. 3 is a block diagram illustrating a hardware configuration of the display device 200 according to another exemplary embodiment , where the display device 200 may include a chip with the controller 210 and a chip with the controller 310 as shown in fig. 3.
The N-chip may include a tuner demodulator 220, a communicator 230, an external device interface 250, a controller 210, a memory 290, a user input interface 260-3, a video processor 260-1, an audio processor 260-2, a voice recognition portion 260-3, an th display 280-1, an audio output interface 270, and a power supply module 240.
The structure and function of the tuning demodulator 220 and the communicator 230 are similar to those of fig. 2 , and are not repeated here.
The external device interface 250 is a component for providing data transmission between the controller 210 and the controller 310 and other external devices. The external device interface 250 may be connected with an external apparatus such as a set-top box, a game device, a notebook computer, etc. in a wired/wireless manner, and may receive data such as a video signal (e.g., moving image), an audio signal (e.g., music), additional information (e.g., EPG), etc. of the external apparatus.
The external device interface 250 may include any or more of a High Definition Multimedia Interface (HDMI) terminal also referred to as HDMI251, a Composite Video Blanking Sync (CVBS) terminal also referred to as AV 252, an analog or digital component terminal also referred to as component 253, a Universal Serial Bus (USB) terminal 254, a red, green, blue (RGB) terminal (not shown), and the like.
The controller 210 controls the operation of the display device 200 and responds to the user's operation by running various software control programs (e.g., an operating system and/or various application programs) stored on the memory 290.
As shown in fig. 5, the controller 210 includes a read only memory RAM213, a random access memory ROM214, a graphics processor 216, a CPU processor 212, a communication interface 218, and a communication bus. The RAM213 and the ROM214, the graphic processor 216, the CPU processor 212, and the communication interface 218 are connected via a bus.
A ROM213 for storing instructions for various system boots. If the display device 200 is powered on upon receipt of the power-on signal, the CPU processor 212 executes a system boot instruction in the ROM and copies the operating system stored in the memory 290 to the RAM214 to start running the boot operating system. After the start of the operating system is completed, the CPU processor 212 copies the various application programs in the memory 290 to the RAM214, and then starts running and starting the various application programs.
The graphics processor 216 is used for generating various graphic objects such as icons, operation menus, and user input instructions to display graphics, etc., includes an operator for performing operations by receiving various interactive instructions input by a user to display various objects according to display attributes, and includes a renderer for generating various objects based on the operator, and the result of rendering is displayed on the -th display 280-1.
A CPU processor 212 for executing operating system and application program instructions stored in memory 290. And executing various application programs, data and contents according to various interactive instructions received from the outside so as to finally display and play various audio and video contents.
In exemplary embodiments, the CPU processor 212 may include a plurality of processors, which may include main processors and a plurality of or sub-processors, a main processor for performing operations of the display apparatus 200 in a pre-power-up mode and/or operations of displaying a screen in a normal mode, a plurality of or sub-processors for performing operations in a standby mode or the like.
The communication interfaces 218, may include interfaces 218-1 through n interfaces 218-n-these interfaces may be network interfaces that are connected to external devices via a network.
The controller 210 may control operations of the display device 200 associated with the th display 280-1, for example, in response to receiving a user command to select a UI object to be displayed on the th display 280-1, the controller 210 may perform operations associated with the object selected by the user command.
The user command for selecting the UI object may be a command input through various input means (e.g., a mouse, a keyboard, a touch pad, etc.) connected to the display apparatus 200 or a voice command corresponding to a voice spoken by the user.
The memory 290 includes a memory for storing various software modules for driving and controlling the display apparatus 200. Such as: various software modules stored in memory 290, including: a base module, a detection module, a communication module, a display control module, a browser module, and various service modules, etc. (not shown in the figure).
The basic module is a bottom software module used for signal communication among various hardware in the display device 200 and sending processing and control signals to an upper layer module, the detection module is a management module used for collecting various information from various sensors or user input interfaces, performing digital-to-analog conversion and analysis management, the voice recognition module comprises a voice analysis module and a voice instruction database module, the display control module is a module used for controlling the th display 280-1 to display image content and can be used for playing multimedia image content, UI (user interface) and other information, the communication module is a module used for performing control and data communication with external equipment, the browser module is a module used for performing data communication among browsing servers, and the service module is a module used for providing various services and various application programs.
Meanwhile, the memory 290 is also used to store visual effect maps and the like for receiving external data and user data, images of respective items in various user interfaces, and a focus object.
The user input interface 260-3 serves to transmit an input signal of a user to the controller 210 or transmit a signal output from the controller 210 to the user. For example, the control device (e.g., a mobile terminal or a remote controller) may transmit an input signal, such as a power switch signal, a channel selection signal, a volume adjustment signal, etc., input by the user to the user input interface, and then the input signal is forwarded to the controller 210 through the user input interface 260-3; alternatively, the control device may receive an output signal such as audio, video, or data output from the user input interface 260-3 via the controller 210, and display the received output signal or output the received output signal in audio or vibration form.
In embodiments, the user may input a user command on a Graphical User Interface (GUI) displayed on the display 280-1 and the user input interface 260-3 receives the user input command through the Graphical User Interface (GUI). alternatively, the user may input a user command by inputting a particular sound or gesture and the user input interface 260-3 receives the user input command by recognizing the sound or gesture through a sensor.
The video processor 260-1 is configured to receive the video signal, and perform video data processing such as decompression, decoding, scaling, noise reduction, frame rate conversion, resolution conversion, and image synthesis according to a standard codec protocol of the input signal, so as to obtain a video signal that is directly displayed or played on the -th display 280-1.
Illustratively, the video processor 260-1 includes a demultiplexing module, a video decoding module, an image synthesizing module, a frame rate conversion module, a display formatting module, and the like (not shown in the figure).
The demultiplexing module is used for demultiplexing the input audio and video data stream, and if the input MPEG-2 is input, the demultiplexing module demultiplexes the input audio and video data stream into a video signal and an audio signal.
And the video decoding module is used for processing the video signal after demultiplexing, including decoding, scaling and the like.
And the image synthesis module, such as an image synthesizer, is used for performing superposition mixing processing on the GUI signal input by the user or generated by the user and the video picture after the zooming processing by the graphics generator so as to generate an image signal for display.
The frame rate conversion module is configured to convert a frame rate of an input video, such as a 24Hz, 25Hz, 30Hz, or 60Hz video, into a 60Hz, 120Hz, or 240Hz frame rate, wherein the input frame rate may be related to a source video stream, and the output frame rate may be related to a refresh rate of a display. And a display formatting module for converting the signal output by the frame rate conversion module into a signal conforming to a display format of a display, such as converting the format of the signal output by the frame rate conversion module to output an RGB data signal.
-th display 280-1 for receiving image signals from the video processor 260-1 for displaying video content and images and menu manipulation interface -th display 280-1 includes display components for presenting pictures and driving components for driving the display of images-video content is displayed, either from broadcast signals received by the tuner demodulator 220 or from communicator or external device interface inputs- -th display 280-1, together with a user manipulation interface UI generated in the display device 200 and used for controlling the display device 200.
And a driving assembly for driving the display according to the type of the th display 280-1, or, in case that the th display 280-1 is the projection display, projection devices and projection screens may be further included.
The audio processor 260-2 is configured to receive an audio signal, decompress and decode the audio signal according to a standard codec protocol of the input signal, and perform noise reduction, digital-to-analog conversion, amplification and other audio data processing to obtain an audio signal that can be played in the speaker 272.
An audio output interface 270 for receiving the audio signal output by the audio processor 260-2 under the control of the controller 210, wherein the audio output interface may include a speaker 272 or an external sound output terminal 274 for outputting to a generating device of an external device, such as: external sound terminal or earphone output terminal.
In other exemplary embodiments, video processor 260-1 may include or more chips.Audio processor 260-2 may also include or more chips.
And, in other exemplary embodiments, video processor 260-1 and audio processor 260-2 may be separate chips or may be integrated with controller 210 in or more chips.
The voice recognition unit 260-3 is used for receiving voice input and analyzing the voice input into commands that can be recognized by the controller.
And a power supply module 240 for providing power supply support for the display device 200 by the power input from the external power source under the control of the controller 210. The power supply module 240 may include a built-in power supply circuit installed inside the display apparatus 200, or may be a power supply installed outside the display apparatus 200, such as a power supply interface for providing an external power supply in the display apparatus 200.
As shown in fig. 5, the a-chip may include a controller 310, a communicator 330, a detector 340, a memory 390, and a second display 280-2. A user input interface, a video processor, an audio output interface (not shown) may also be included in some embodiments. In some embodiments, there may also be a power supply module (not shown) that independently powers the a chip.
The communicator 330 is a component for communicating with an external device or an external server according to various communication protocol types. For example: the communicator 330 may include a WIFI module 331, a bluetooth communication protocol module 332, a wired ethernet communication protocol module 333, and other network communication protocol modules such as an infrared communication protocol module or a near field communication protocol module (not shown).
The communicator 330 and the communicator 230 also interact with each other, for example, a WiFi module 231 in an N-chip hardware system is used to connect to an external network, to generate network communication with an external server, etc. a WiFi module 331 in an A-chip hardware system is used to connect to the WiFi module 231 of the N-chip, not to generate a direct connection with an external network, etc. the A-chip is connected to the external network through the N-chip.
The detector 340 is a component of the display device a chip for collecting signals of an external environment or interacting with the outside. The detector 340 may include a light receiver 342, a sensor for collecting the intensity of ambient light, which may be used to adapt to display parameter changes, etc.; the system may further include an image collector 341, such as a camera, a video camera, etc., which may be configured to collect external environment scenes, collect attributes of the user or interact gestures with the user, adaptively change display parameters, and identify user gestures, so as to implement a function of interaction with the user.
An external device interface 350, which provides a component for data transmission between the controller 310 and the N-chip or other external devices. The external device interface may be connected with an external apparatus such as a set-top box, a game device, a notebook computer, etc. in a wired/wireless manner.
A video processor 360 for processing the associated video signal.
The controller 310 controls the operation of the display device 200 and responds to the user's operation by running various software control programs stored on the memory 390 (e.g., using installed third party applications, etc.), and interacting with the N-chip.
As shown in fig. 5, the controller 310 includes a read only memory ROM 313, a random access memory RAM314, a graphics processor 316, a CPU processor 312, a communication interface 318, and a communication bus. The ROM 313 and the RAM314, the graphic processor 316, the CPU processor 312, and the communication interface 318 are connected via a bus.
A ROM 313 for storing instructions for various system boots. CPU processor 312 executes system boot instructions in ROM and copies the operating system stored in memory 390 to RAM314 to begin running the boot operating system. After the start of the operating system is completed, the CPU processor 312 copies various application programs in the memory 390 to the RAM314, and then starts running and starting various application programs.
The CPU processor 312 is used for executing the operating system and application program instructions stored in the memory 390, communicating with the N chip, transmitting and interacting signals, data, instructions, etc., and executing various application programs, data and contents according to various interaction instructions received from the outside, so as to finally display and play various audio and video contents.
The communication interface 318 is a plurality of interfaces, which may include -th interface 318-1 to N-th interface 318-N. these interfaces may be network interfaces connected to external devices via a network, or may be network interfaces connected to an N-chip via a network.
The controller 310 may control operations of the display device 200 in relation to the second display 280-2. For example: in response to receiving a user command for selecting a UI object to be displayed on the second display 280-2, the controller 310 may perform an operation related to the object selected by the user command.
A graphics processor 316 for generating various graphics objects, such as: icons, operation menus, user input instruction display graphics, and the like. The display device comprises an arithmetic unit which carries out operation by receiving various interactive instructions input by a user and displays various objects according to display attributes. And a renderer for generating various objects based on the operator, and displaying the rendered result on the second display 280-2.
Both the A-chip graphics processor 316 and the N-chip graphics processor 216 are capable of generating various graphics objects. In distinction, if application 1 is installed on the a-chip and application 2 is installed on the N-chip, the a-chip graphics processor 316 generates a graphics object when a user performs a command input by the user in application 1 at the interface of application 1. When a user makes a command input by the user in the interface of the application 2 and within the application 2, a graphic object is generated by the graphic processor 216 of the N chip.
In FIG. 3, the controller 210 controls the display of the th display 280-1 and the controller 310 controls the display of the second display 280-2.
In this mode, the voice input of the user is received by the audio recognition unit 260-3, the controller 210 transmits the recognition result of the voice input to the controller 310 in response to the voice command input by the user, the recognition result of the voice input is displayed on the second display 280-2 by the controller 310, and the summary information of the response result of the voice input is transmitted to the controller 310 by the controller 210 and displayed on the second display by the controller 310.
As an alternative to , after the summary information is displayed on the second display, if the th preset key command is inputted by the user, the controller 210 displays the above response result on the th display 280-1 in response to the th preset key command.
In this alternative, the controller 210 first adjusts the state of the th display 280-1 to a bright screen state if the th display 280-1 is in a screen-down state before the controller 210 displays the above-described response result.
alternatively, if the voice command input by the user is a command in the preset command set and the th display 280-1 is in a bright screen state, the controller 210 outputs the result of the response on the th display 280-1.
As an alternative to , if the voice command is a command in the preset command set, the th display 280-1 is in the screen saver state, and the above response result includes audio data, the controller 210 plays the audio data.
In this alternative, after the summary information is displayed on the second display, if the user inputs a second preset key command, the controller 210 adjusts the response result according to the second preset key command, and transmits the summary information of the adjusted response result to the controller 310, and the controller 310 displays the summary information of the adjusted response result on the second display 280-2.
The detailed processing of the controller 210 and the controller 310 will be described in detail in the following method embodiments.
Fig. 4 is a block diagram illustrating a hardware configuration of the control device 100 according to an exemplary embodiment of the present application, and as shown in fig. 4, the control device 100 includes a controller 110, a communicator 130, a user input/output interface 140, a memory 190, and a power supply 180.
The control apparatus 100 is configured to control the display device 200 and may receive an input operation instruction of a user and convert the operation instruction into an instruction recognizable and responsive by the display device 200, serving as an interaction intermediary between the user and the display device 200. Such as: the user operates the channel up/down key on the control device 100, and the display device 200 responds to the channel up/down operation.
In , the control device 100 may be a type of smart device, for example, the control device 100 may be installed with various applications for controlling the display device 200 according to the user's needs.
In embodiments, as shown in FIG. 1, the mobile terminal 100B or other intelligent electronic device may function similarly to the control apparatus 100 after installation of an application for operating the display device 200. for example, a user may implement functions for controlling physical keys of the apparatus 100 by installing various function keys or virtual buttons of a graphical user interface available on the mobile terminal 100B or other intelligent electronic device.
The controller 110 includes a processor 112, a RAM113 and a ROM114, a communication interface, and a communication bus. The controller 110 is used to control the operation of the control device 100, as well as the internal components for communication and coordination and external and internal data processing functions.
The communicator 130 is controlled by the controller 110 to communicate control signals and data signals with the display device 200, for example, to transmit a received user input signal to the display device 200, the communicator 130 may include at least of communication modules, such as a WIFI module 131, a bluetooth module 132, and an NFC module 133.
The user input/output interface 140 includes at least of the input interfaces such as the microphone 141, the touch pad 142, the sensor 143, and the button 144, for example, the user can input a user command by voice, touch, gesture, pressing, etc., and the input interface converts the received analog signal into a digital signal and converts the digital signal into a corresponding command signal, and sends the digital signal to the display device 200.
The embodiments may include an infrared interface or a radio frequency interface, for example, in the case of an infrared signal interface, a user input command needs to be converted into an infrared control signal according to an infrared control protocol, and the infrared control signal is transmitted to the display device 200 through an infrared transmission module, or in the case of a radio frequency signal interface, a user input command needs to be converted into a digital signal, and then the digital signal is modulated according to a radio frequency control signal modulation protocol, and then the digital signal is transmitted to the display device 200 through a radio frequency transmission terminal.
In , the control device 100 includes at least of the communicator 130 and the output interface, the communicator 130 is configured in the control device 100, such as WIFI, bluetooth, NFC, etc., and the user input command can be encoded by WIFI protocol, bluetooth protocol, or NFC protocol and transmitted to the display device 200.
And a memory 190 for storing various operation programs, data and applications for driving and controlling the control apparatus 100 under the control of the controller 110. The memory 190 may store various control signal commands input by a user.
And a power supply 180 for providing operational power support to the components of the control device 100 under the control of the controller 110. A battery and associated control circuitry.
Fig. 5 is a functional configuration diagram of the display device 200 according to an exemplary embodiment of the present application, and as shown in fig. 5, the memory 290 is used to store an operating system, an application program, contents, user data, etc., and performs various operations for driving the system operation of the display device 200 and responding to a user under the control of the controller 210.
The memory 290 is specifically used for storing an operating program for driving the controller 210 in the display device 200, and storing various applications installed in the display device 200, various applications downloaded by a user from an external device, various graphical user interfaces related to the applications, various objects related to the graphical user interfaces, user data information, and internal data of various supported applications. The memory 290 is used to store system software such as an Operating System (OS) kernel, middleware, and applications, and to store input video data and audio data, and other user data.
The memory 290 is specifically used for storing drivers and related data of the video processor 260-1, the audio processor 260-2, the th display 280-1 (if the hardware structure of the display device is the structure shown in fig. 2, the display 280-2 is also included), the communication interface 230, the tuning demodulator 220, the detector 240, the input/output interface, and the like.
In embodiments, memory 290 may store software and/or programs representing software programs for an Operating System (OS) including, for example, a kernel, middleware, an Application Programming Interface (API), and/or an application program.
Illustratively, the memory 290 includes a broadcast reception module 2901, a channel control module 2902, a volume control module 2903, an image control module 2904, a display control module 2905, an audio control module 2906, an external command recognition module 2907, a communication control module 2908, a light reception module 2909, a power control module 2910, an operating system 2911, and other application programs 2912, a browser module, etc. in this case, the external command recognition module 2907 includes a pattern recognition module 2907-1, a voice recognition module 2907-2, and a key command recognition module 2907-3, the controller 210 executes other applications such as a broadcast signal reception demodulation function, a television channel selection control function, a volume selection control function, an image control function, a display control function, an audio control function, an external command recognition function, a communication control function, a light signal reception function, a power control function, a software manipulation platform supporting various functions, and a browser function by operating various software programs in the memory 290.
The following describes a processing flow of the display device 200 in response to a voice input by a user.
Fig. 6 is a flowchart illustrating a voice command processing method according to an embodiment of the present application, where an execution subject of the method is the controller 210 in fig. 2 or fig. 3, as shown in fig. 6, the method includes:
s601, receiving voice input of a user.
S602, in response to a voice command input by a user, displaying a recognition result of the voice input on a second display, and displaying summary information of a response result of the voice input on the second display.
Alternatively, if the hardware configuration of the display device 200 is the configuration shown in fig. 3, when the step S602 is executed, the controller 210 first sends the recognition result of the voice input and the summary information of the response result to the controller 310, and then the controller 310 displays the recognition result of the voice instruction and the summary information of the response result on the second display.
The following describes the implementation of the present embodiment by taking this alternative as an example.
For example, when a user wants to inquire information such as weather and vehicle restriction conditions, a voice instruction such as "how much weather is in city a today" and "how many restriction numbers are in today" may be issued to the display device.
Alternatively, before performing step S601, the user may wake up the display device using a specific wake-up word and display the wokened-up information on the second display. The wake-up procedure will be described in detail in the complete interaction flow described below.
Optionally, the recognition result of the voice input may be text information corresponding to the voice input. The recognition result of the voice input may be encapsulated in a command that can be interpreted by the controller 310 and transmitted to the controller 310.
The controller 310 parses the command after receiving the command encapsulating the recognition result of the voice input, and displays the parsed result on the second display.
For example, the voice command input by the user is a voice of "what weather is in city a today", the recognition result is a text of "what weather is in city a today", and the controller 310 receives the recognition result and displays the text of "what weather is in city a today" on the second display.
In the above process, after the user inputs the voice command, only the recognition result of the voice command is displayed on the second display, and the th display can be in the operation state directly before the voice command is sent out, i.e. the user can input the voice command without influencing or interfering the display of the th display, and at the same time, the user can normally view the voice command input by the user on the second display.
The controller 210 obtains a response result of the voice input according to the voice input.
For example, after the controller 210 parses and identifies the voice command to obtain an identification result, the controller may obtain a response result of the voice command from the server through the network according to the identification result. Illustratively, the voice instruction is "how much weather exists in city a today", the weather information in city a is queried from the server according to the recognition result of the instruction, and the queried weather information is the response result of the voice input.
The response result of the voice input may include a large amount of data, and the summary information of the response result of the voice input may be key information in the response result. For example, when inquiring weather of city a, the obtained response result may include various information such as weather, temperature, wind power, air quality, dressing advice, sports advice at various times of the current date and a plurality of subsequent dates, and the second display may display only two types of summary information of the weather and the temperature of the current date.
, the summary information of the response result can be extracted from the response result by the controller 210.
In another ways, the summary information of the response result can be used as part of the response result, and the controller 210 can directly obtain the summary information.
Optionally, the content and the representation manner of the summary information of the corresponding response result may be different for different voice inputs. The content of the summary information may include, for example: status information such as offline, error, prompt, etc., summary information of query or search results, etc. The summary information may be presented in the form of characters, animation, or the like.
Alternatively, the controller 310 may output the summary information in a form of voice in addition to displaying the summary information on the second display, so that the user can hear the summary information in response to the result.
In the above process, after the response result of the voice input is obtained, only the summary information of the response result is displayed on the second display, and the th display may be in the operation state of directly before the voice command is sent out, that is, the voice command input by the user does not affect or interfere with the display of the th display, and meanwhile, the user may normally view the summary information of the response result of the voice command input by the user on the second display.
In this embodiment, after the user inputs the voice instruction to the display device, the second display in the display device displays the recognition result of the voice input, and the controller acquires the response result, and displays the summary information of the response result on the second display, in this process, the voice instruction and the summary information of the response result are both displayed on the second display, and the display may not be affected by the voice interaction process, and if the display is in a bright screen state, the information before the voice instruction input may be continuously displayed, and if the display is in a breath screen state, the breath screen state may be kept unchanged.
As an alternative , after the second display displays the summary information of the response result, the user can press the preset button to trigger display to display the complete response result.
In the alternative embodiment, in cases, if the display is currently in the bright screen state, the user may press the th preset key to input the th preset key command, and after receiving the th preset key command, the controller 210 may display the response result on the th display according to the th preset key command.
In another cases, if the display is currently in the screen-saving state, the user may press the th preset key to input the th preset key command, and after receiving the th preset key command, the controller 210 first adjusts the th display to the bright screen state, and then displays the above response result on the th display according to the th preset key command.
The th preset key command may be a command issued by pressing a key in the remote controller, and the key corresponding to the th preset key command may be an "OK" key, an up key, etc.
Alternatively, when the th display shows the above response result, the information shown on the second display may be kept unchanged, and at the same time, the user may view the contents of the response information through the operation of the remote controller.
As an alternative to , for the instructions in the preset instruction set, the response operation of the voice instruction may be performed by the controller 210 while the second display displays the summary information of the response result.
The instructions in the preset instruction set can be instructions satisfying the formats of "play music", "i want to listen to XX music", "i" play song XX ", and the like, and if the format of the voice instruction input by the user satisfies of the formats, the voice instruction can be determined to be an instruction in the preset instruction set.
In the alternative embodiment, in case , if the controller 210 determines that the voice command is a command in the preset command set and the th display is currently in the bright screen state, the response result may be output on the th display.
Optionally, the above-mentioned outputting the response result on the th display may be directly displaying the response result by the controller 210, or may also be displaying the response result by calling a corresponding Application program (APP for short) by the controller 210.
Illustratively, assuming that the preset instruction set is an instruction set satisfying the formats of "play music", "i want to listen to XX music", "i" play song XX ", etc., when the user inputs a voice instruction of" i want to listen to a music ", the controller 210 obtains response information from the server, including summary information of the name, singer, duration, etc. of the a music, and lyrics and audio data of the a music, and displays the summary information of the a music through the second display, and at the same time, the controller displays the response result and calls the APP of the play music, and the APP of the play music displays the interface of the APP on the th display, and displays the text information of the name, singer, duration, lyrics, etc., and at the same time, plays the audio data of the a music.
In this embodiment, when the th display is in a bright screen state and the voice command is a command in a preset command set, for example, a music playing command, the controller 210 may directly output a response result of the voice command on the th display, which does not affect the display state of the th display, and simultaneously enables a user to normally obtain a complete response result of the voice command, thereby achieving good user experience.
In another cases, if the controller 210 determines that the voice command is a command in the preset command set and the th display is currently in the screen saver state and the response result includes audio data, the controller 210 plays the audio data.
For example, assuming that the voice command is a command for playing music a, the controller 210 may determine that the command is a command in the preset command set, and when the controller 210 determines that the th display is in the breath screen state currently and the response result of the voice command includes audio data, the controller 210 may trigger playing the audio data without affecting the display state of the th display.
Optionally, the controller 210 triggers playing of the audio data, which may be the controller 210 calling a corresponding APP to play the audio data.
Further , in this case, after the summary information is displayed on the second display by the controller 210 or the controller 310, the user may further input a second preset key instruction, the controller 210 receives the second preset key instruction input by the user and adjusts the response result according to the second preset key instruction, and further the controller 210 plays the audio data in the adjusted response result, and when the hardware configuration of the display device is the configuration shown in fig. 2, the controller 210 displays the summary information of the adjusted response result on the second display, and when the hardware configuration of the display device is the configuration shown in fig. 3, the controller 210 sends the summary information of the adjusted response result to the controller 310, and the controller 310 displays the summary information of the adjusted response result on the second display.
The second preset key instruction may be an instruction issued by pressing a key in the remote controller, and the key corresponding to the second preset key instruction may be an "OK" key, a left key, a right key, or the like.
For example, when the controller 210 receives the second preset key instruction, it may determine that the user needs to adjust the audio data, and the user presses the left key while playing music a currently, the controller 210 may determine that the user wishes to play pieces of music before music a, and the controller 210 may obtain the adjusted response result from the server according to the second preset key instruction.
It should be noted that, in the above case, since the th display is mainly responsible for displaying the response result because of the screen-saving state, the summary information of the response result may contain more information than in the case of the screen-saving state of the th display.
In this embodiment, when the th display is in the screen turning state and the voice command is a command in the preset command set, for example, a music playing command, the controller 210 may play audio data in the response result, and simultaneously display the summary information of the response result on the second display, so that the display state of the th display is not affected, and at the same time, the user can normally obtain the complete response result of the voice command, thereby achieving good user experience.
Alternatively, if the hardware configuration of the display apparatus 200 is the configuration shown in fig. 3 described above, the process shown in fig. 6 described above may be implemented by interaction of a plurality of processing modules in the controller 210 and the controller 310.
FIG. 7 is an exemplary diagram of the division of modules and the interaction of messages in the controller 210 and the controller 310, as shown in FIG. 7, an th command receiving module, a control module and a display module can be included in the controller 210, and an th command receiving module and a display module can be included in the controller 310. in the controller 210, the control module is used for processing data related to voice commands and service logic, and the display module is used for receiving control commands from the control module to display complete information of the response results of the voice commands on the th display.
After the user issues the voice command, the command receiving module of the controller 210 receives the voice command, the command receiving module of the controller 210 sends the voice command to the control module of the controller 210, the control module sends the recognition result of the voice command to the second command receiving module of the controller 310, the second command receiving module of the controller 310 sends the recognition result of the voice command to the display module of the controller 310, and the display module displays the recognition result of the voice command on the second display.
For the sake of distinction, hereinafter, the display module of the controller 210 is referred to as an -th display module, and the display module of the controller 310 is referred to as a second display module.
The following describes an example of a process flow based on the foregoing embodiments.
Fig. 8 is a schematic flowchart of a voice instruction processing method according to an embodiment of the present application, and as shown in fig. 6, an interaction flow based on the module division in fig. 7 includes:
s801, a user inputs a preset awakening instruction.
S802, , the command receiving module receives the wake-up command and sends the wake-up command to the control module.
And S803, the control module analyzes the instruction, encapsulates the instruction into a command and sends the command to the second command receiving module.
After the control module sends the command, the long connection service is started to maintain the real-time connection with the second control system, meanwhile, the th display keeps the running state unchanged before receiving the awakening command.
And S804, the second command receiving module forwards the command to the second display module.
And S805, the second display module displays the awakening response on the second display according to the command.
For example, the second display module may display a text of "good, what can help you" and a preset animation.
And S806, inputting a voice instruction by the user.
S807, the control module judges whether the th display is in a screen turning state, if so, the step S608 is executed, and if not, the step S623 is executed.
S808, the control module judges whether the voice command is a command in a preset command set, if so, the step S809 is executed, otherwise, the step S823 is executed.
And S809, the control module informs the command receiving module to lock the second preset key and the return key to the control module, obtains a response result of the voice instruction, calls the APP to play audio data in the response result, and sends the summary information of the response result to the second command receiving module.
At the same time, the th display remains on the screen.
The second preset key may be, for example, an OK key, a left key, a right key, or the like.
And S810, the second command receiving module forwards the summary information of the response result to the second display module.
And S811, the second display module displays summary information of the response result on the second display.
Taking the voice instruction as "i want to listen to music a" as an example, the second display module displays a picture of the music player on the second display, and displays a song name, a singer, a duration, a progress bar, animation, and the like on the music player.
And S812, pressing a second preset key by the user.
S813, the th command receiving module receives a second preset key instruction and sends the instruction to the control module.
S814, the control module adjusts the response result according to the second preset key instruction, and sends the summary information of the adjusted response result to the second command receiving module.
At the same time, the th display remains on the screen.
For example, when the user presses the left key, the control module acquires information on the first pieces of music currently playing music, and transmits summary information on the first pieces of music to the second command receiving module.
And S815, the second command receiving module sends the summary information to the second display module.
And S816, the second display module displays the summary information of the adjusted response result on the second display.
S817, the user presses the return key.
S818, th order receiving module receives the return key and sends the return key instruction to the control module.
S819, the control module stops playing the audio data and quits APP according to the instruction of the return key, and sends a quit instruction to the second command receiving module.
At the same time, the th display remains on the screen.
S820, the second command receiving module forwards the quit instruction to the second display module.
And S821, the second display module closes the display screen and quits.
S822, the second display module displays the screen before the voice command is received (step S811), and ends.
At the same time, the th display remains on the screen.
S823, the control module judges whether the search content result is in a showing stage, if so, S824 is executed, and if not, S825 is executed.
S824, the control module informs the command receiving module to lock th preset key and the return key to the control module.
The th preset key may be, for example, an OK key, an up key, etc.
If the search content result is in the showing stage, the response result is shown, and therefore the control module needs to acquire the focus information.
And S825, the control module acquires the response result and sends the response result to the second display module through the second command receiving module.
Meanwhile, the th display keeps the operation state before receiving the wake-up command.
S826, determining whether the voice command is a command in the preset command set, if so, performing S836, otherwise, performing S827.
S827, the second display module displays summary information of the response result on the second display.
And S828, pressing th preset key by the user.
S829 and the command receiving module receives the th preset key command and sends the command to the control module.
S830, the control module judges whether the th preset key is locked currently, if yes, S831 is executed, and if not, S834 is executed.
And S831, judging whether the displayer is in a screen turning state, if so, executing S832, otherwise, executing S833.
S832, the control module adjusts the th display to be in a bright screen state.
And S833, controlling the th display module to display the response result on the th display by the control module.
And simultaneously, all subsequent remote controller keys are responded.
And S834, judging whether the voice command is not received or a return key command is received when the preset time length is exceeded, if so, executing S835, otherwise, executing S806.
And S835, the th display module and the second display module quit, the control module runs in a low-power-consumption mode, the th display and the second display both display the main interface, and the process is finished.
And S836, the second display module displays summary information of the response result on the second display.
S837, the control module outputs the response result on the th display.
S838, the control module releases the lock on the th preset and return key and proceeds to S834.
Optionally, the kinds of computing devices are further provided, including a memory and a processor, where the memory is used for storing program instructions, and the processor is used for calling the program instructions stored in the memory and executing the method described in the above method embodiments according to the obtained program.
Optionally, the present application further provides storage media, where the storage media stores instructions that, when executed on a computer, cause the computer to perform the method in the above method embodiment.
Optionally, an chip for executing the instructions is further provided in the embodiment of the present application, where the chip is configured to execute the method in the foregoing method embodiment.
The present application further provides program products, where the program products include computer programs, the computer programs are stored in a storage medium, at least processors can read the computer programs from the storage medium, and the at least processors can implement the methods in the above method embodiments when executing the computer programs.
In the embodiments of the present application, "at least " means or more, "a plurality" means two or more "and/or" describing the association relationship of the associated objects, and means that there may be three relationships, for example, a and/or B, and may mean that a exists alone, a exists together with a and B, and B exists alone, where a and B may be singular or plural, the character "/" generally means the relationship that the preceding and following associated objects are "or", in the formula, the character "/" represents the relationship that the preceding and following associated objects are "divisible", that "the following at least item(s)" or its similar expression, means any combination of these items, including any combination of single item(s) or plural item(s), for example, at least item(s) of a, B, or c, and may mean a, B, c, a-B, a-c, B-c, or a-B-c, where a, B, c may be single or plural.
It is to be understood that the various numerical references referred to in the embodiments of the present application are merely for convenience of description and distinction and are not intended to limit the scope of the embodiments of the present application.
It should be understood that, in the embodiment of the present invention, the sequence numbers of the above-mentioned processes do not mean the execution sequence, and the execution sequence of each process should be determined by its function and inherent logic, and should not constitute any limitation to the implementation process of the embodiment of the present application.
Finally, it should be noted that: the above embodiments are only used to illustrate the technical solution of the present invention, and not to limit the same; while the invention has been described in detail and with reference to the foregoing embodiments, it will be understood by those skilled in the art that: the technical solutions described in the foregoing embodiments may still be modified, or some or all of the technical features may be equivalently replaced; and the modifications or the substitutions do not make the essence of the corresponding technical solutions depart from the scope of the technical solutions of the embodiments of the present invention.

Claims (10)

  1. A display device of , comprising:
    th display;
    a second display;
    a voice recognition part for receiving a voice input of a user;
    a controller configured to:
    the control unit is used for controlling the th display and the second display to display the recognition result of the voice input on the second display in response to the voice command input by the user and displaying the summary information of the response result of the voice input on the second display.
  2. 2. The display device according to claim 1, wherein the controller is configured to:
    and responding to th preset key instruction input by a user, and displaying the response result on a th display.
  3. 3. The display device according to claim 1, wherein the controller is configured to:
    and responding to the fact that the voice instruction is an instruction in a preset instruction set and the th display is in a bright screen state, and outputting a response result on the th display.
  4. 4. The display device according to claim 1, wherein the controller is configured to:
    and responding to the voice instruction as an instruction in a preset instruction set, the th display in a breath screen state and the response result comprising audio data, and playing the audio data.
  5. 5. The display device according to claim 4, wherein the controller is configured to:
    and responding to a second preset key instruction input by the user, and displaying the summary information of the adjusted response result on the second display.
  6. 6. The display device of claim 5, wherein the voice command is a play music command.
  7. 7, A method for processing a voice command, comprising:
    receiving a voice input of a user;
    in response to a voice instruction input by a user, a recognition result of the voice input is displayed on a second display, and summary information of a response result of the voice input is displayed on the second display.
  8. 8. The method of claim 7, further comprising:
    receiving th preset key instruction input by a user;
    and responding to the th preset key instruction, and displaying the response result on a th display.
  9. 9. The method of claim 7, further comprising:
    and responding to the fact that the voice command is a command in a preset command set and the th display is in a bright screen state, and outputting a response result on the th display.
  10. 10, a computing device, comprising:
    a memory for storing program instructions;
    a processor for calling the program instructions stored in the memory and executing the method of any of claims 7-9 according to the obtained program.
CN201911014297.6A 2019-10-23 2019-10-23 Display device and voice instruction processing method Pending CN110740367A (en)

Priority Applications (1)

Application Number Priority Date Filing Date Title
CN201911014297.6A CN110740367A (en) 2019-10-23 2019-10-23 Display device and voice instruction processing method

Applications Claiming Priority (1)

Application Number Priority Date Filing Date Title
CN201911014297.6A CN110740367A (en) 2019-10-23 2019-10-23 Display device and voice instruction processing method

Publications (1)

Publication Number Publication Date
CN110740367A true CN110740367A (en) 2020-01-31

Family

ID=69271062

Family Applications (1)

Application Number Title Priority Date Filing Date
CN201911014297.6A Pending CN110740367A (en) 2019-10-23 2019-10-23 Display device and voice instruction processing method

Country Status (1)

Country Link
CN (1) CN110740367A (en)

Cited By (16)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN111526402A (en) * 2020-05-06 2020-08-11 海信电子科技(武汉)有限公司 Method for searching video resources through voice of multi-screen display equipment and display equipment
CN111524515A (en) * 2020-04-30 2020-08-11 海信电子科技(武汉)有限公司 Voice interaction method and device, electronic equipment and readable storage medium
CN111901683A (en) * 2020-07-24 2020-11-06 海信视像科技股份有限公司 Fault warning information display method and display equipment
CN112201232A (en) * 2020-08-28 2021-01-08 星络智能科技有限公司 Voice output control method, electronic device and computer readable storage medium
CN112492371A (en) * 2020-11-18 2021-03-12 海信视像科技股份有限公司 Display device
CN112788375A (en) * 2019-11-04 2021-05-11 海信视像科技股份有限公司 Display device, display method and computing device
CN113365124A (en) * 2020-03-06 2021-09-07 海信视像科技股份有限公司 Display device and display method
CN113453079A (en) * 2020-03-24 2021-09-28 海信视像科技股份有限公司 Control method for returning double-system-size double-screen application and display equipment
CN113495702A (en) * 2020-03-18 2021-10-12 聚好看科技股份有限公司 Interactive invitation processing method and display equipment
CN113573108A (en) * 2020-04-28 2021-10-29 海信视像科技股份有限公司 Display device
CN113573107A (en) * 2020-04-28 2021-10-29 海信视像科技股份有限公司 Display apparatus and control method
CN113782021A (en) * 2021-09-14 2021-12-10 海信电子科技(武汉)有限公司 Display device and prompt tone playing method
WO2022032916A1 (en) * 2020-08-12 2022-02-17 海信电子科技(武汉)有限公司 Display system
CN114296669A (en) * 2021-03-11 2022-04-08 海信视像科技股份有限公司 Display device
CN114945103A (en) * 2022-05-13 2022-08-26 深圳创维-Rgb电子有限公司 Voice interaction system and voice interaction method
CN115134643A (en) * 2021-03-24 2022-09-30 腾讯科技(深圳)有限公司 Bullet screen display method and device for vehicle-mounted terminal, terminal and medium

Citations (4)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
US20080010610A1 (en) * 2006-03-07 2008-01-10 Samsung Electronics Co., Ltd. Method and device for providing quick menu in menu screen of mobile commnunication terminal
CN105721931A (en) * 2016-01-21 2016-06-29 青岛海信电器股份有限公司 Voice control method of TV set, TV set, and terminal device
CN108282688A (en) * 2018-01-12 2018-07-13 山东浪潮商用系统有限公司 A kind of voice interactive method, set-top box and system
CN109922289A (en) * 2019-03-25 2019-06-21 深圳创维-Rgb电子有限公司 A kind of double-screen television

Patent Citations (4)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
US20080010610A1 (en) * 2006-03-07 2008-01-10 Samsung Electronics Co., Ltd. Method and device for providing quick menu in menu screen of mobile commnunication terminal
CN105721931A (en) * 2016-01-21 2016-06-29 青岛海信电器股份有限公司 Voice control method of TV set, TV set, and terminal device
CN108282688A (en) * 2018-01-12 2018-07-13 山东浪潮商用系统有限公司 A kind of voice interactive method, set-top box and system
CN109922289A (en) * 2019-03-25 2019-06-21 深圳创维-Rgb电子有限公司 A kind of double-screen television

Cited By (26)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN112788375A (en) * 2019-11-04 2021-05-11 海信视像科技股份有限公司 Display device, display method and computing device
CN113365124B (en) * 2020-03-06 2023-01-20 海信视像科技股份有限公司 Display device and display method
CN113365124A (en) * 2020-03-06 2021-09-07 海信视像科技股份有限公司 Display device and display method
CN113495702B (en) * 2020-03-18 2023-10-24 聚好看科技股份有限公司 Interactive invitation processing method and display equipment
CN113495702A (en) * 2020-03-18 2021-10-12 聚好看科技股份有限公司 Interactive invitation processing method and display equipment
CN113453079B (en) * 2020-03-24 2022-08-26 海信视像科技股份有限公司 Control method for returning double-system-size double-screen application and display equipment
CN113453079A (en) * 2020-03-24 2021-09-28 海信视像科技股份有限公司 Control method for returning double-system-size double-screen application and display equipment
CN113573108B (en) * 2020-04-28 2023-04-25 海信视像科技股份有限公司 Display apparatus
CN113573108A (en) * 2020-04-28 2021-10-29 海信视像科技股份有限公司 Display device
CN113573107A (en) * 2020-04-28 2021-10-29 海信视像科技股份有限公司 Display apparatus and control method
CN111524515A (en) * 2020-04-30 2020-08-11 海信电子科技(武汉)有限公司 Voice interaction method and device, electronic equipment and readable storage medium
CN111526402A (en) * 2020-05-06 2020-08-11 海信电子科技(武汉)有限公司 Method for searching video resources through voice of multi-screen display equipment and display equipment
CN111901683A (en) * 2020-07-24 2020-11-06 海信视像科技股份有限公司 Fault warning information display method and display equipment
CN111901683B (en) * 2020-07-24 2023-02-28 海信视像科技股份有限公司 Fault warning information display method and display equipment
WO2022032916A1 (en) * 2020-08-12 2022-02-17 海信电子科技(武汉)有限公司 Display system
CN112201232A (en) * 2020-08-28 2021-01-08 星络智能科技有限公司 Voice output control method, electronic device and computer readable storage medium
CN112492371B (en) * 2020-11-18 2023-02-17 海信视像科技股份有限公司 Display device
CN112492371A (en) * 2020-11-18 2021-03-12 海信视像科技股份有限公司 Display device
CN114327326A (en) * 2021-03-11 2022-04-12 海信视像科技股份有限公司 Display device and display method
CN114296668A (en) * 2021-03-11 2022-04-08 海信视像科技股份有限公司 Display device
CN114296669A (en) * 2021-03-11 2022-04-08 海信视像科技股份有限公司 Display device
CN115134643A (en) * 2021-03-24 2022-09-30 腾讯科技(深圳)有限公司 Bullet screen display method and device for vehicle-mounted terminal, terminal and medium
CN113782021A (en) * 2021-09-14 2021-12-10 海信电子科技(武汉)有限公司 Display device and prompt tone playing method
CN113782021B (en) * 2021-09-14 2023-10-24 Vidaa(荷兰)国际控股有限公司 Display equipment and prompt tone playing method
CN114945103A (en) * 2022-05-13 2022-08-26 深圳创维-Rgb电子有限公司 Voice interaction system and voice interaction method
WO2023216414A1 (en) * 2022-05-13 2023-11-16 深圳创维-Rgb电子有限公司 Speech interaction system and speech interaction method

Similar Documents

Publication Publication Date Title
CN110740367A (en) Display device and voice instruction processing method
CN111200746B (en) Method for awakening display equipment in standby state and display equipment
CN111757171A (en) Display device and audio playing method
CN111343489B (en) Display device and method for playing music in terminal
CN112492371B (en) Display device
CN112055240B (en) Display device and operation prompt display method for pairing display device with remote controller
CN112073865B (en) Volume setting method and device of Bluetooth headset and electronic equipment
US11288033B2 (en) Method for outputting audio data of applications and display device
CN112019782B (en) Control method and display device of enhanced audio return channel
CN111277884A (en) Video playing method and device
CN111479145A (en) Display device and television program pushing method
CN112188279A (en) Channel switching method and display equipment
CN112243141B (en) Display method and display equipment for screen projection function
CN112153406A (en) Live broadcast data generation method, display equipment and server
CN112437334A (en) Display device
CN111176603A (en) Image display method for display equipment and display equipment
CN112153440A (en) Display device and display system
CN112199064A (en) Interaction method of browser application and system platform and display equipment
CN111954059A (en) Screen saver display method and display device
CN111083538A (en) Background image display method and device
CN111836083B (en) Display device and screen sounding method
CN113489938A (en) Virtual conference control method, intelligent device and terminal device
CN112017415A (en) Recommendation method of virtual remote controller, display device and mobile terminal
CN111641856A (en) Prompt message display method for guiding user operation in display equipment and display equipment
CN114501158B (en) Display device, external sound equipment and audio output method of external sound equipment

Legal Events

Date Code Title Description
PB01 Publication
PB01 Publication
SE01 Entry into force of request for substantive examination
SE01 Entry into force of request for substantive examination
RJ01 Rejection of invention patent application after publication

Application publication date: 20200131

RJ01 Rejection of invention patent application after publication