CN112073775A - Sound processing method and display device - Google Patents

Sound processing method and display device Download PDF

Info

Publication number
CN112073775A
CN112073775A CN201911231348.0A CN201911231348A CN112073775A CN 112073775 A CN112073775 A CN 112073775A CN 201911231348 A CN201911231348 A CN 201911231348A CN 112073775 A CN112073775 A CN 112073775A
Authority
CN
China
Prior art keywords
chip
audio
video
sound
volume
Prior art date
Legal status (The legal status is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the status listed.)
Pending
Application number
CN201911231348.0A
Other languages
Chinese (zh)
Inventor
陈俊宁
卢平光
初德进
齐消消
Current Assignee (The listed assignees may be inaccurate. Google has not performed a legal analysis and makes no representation or warranty as to the accuracy of the list.)
Hisense Visual Technology Co Ltd
Original Assignee
Hisense Visual Technology Co Ltd
Priority date (The priority date is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the date listed.)
Filing date
Publication date
Application filed by Hisense Visual Technology Co Ltd filed Critical Hisense Visual Technology Co Ltd
Publication of CN112073775A publication Critical patent/CN112073775A/en
Pending legal-status Critical Current

Links

Images

Classifications

    • HELECTRICITY
    • H04ELECTRIC COMMUNICATION TECHNIQUE
    • H04NPICTORIAL COMMUNICATION, e.g. TELEVISION
    • H04N21/00Selective content distribution, e.g. interactive television or video on demand [VOD]
    • H04N21/40Client devices specifically adapted for the reception of or interaction with content, e.g. set-top-box [STB]; Operations thereof
    • H04N21/41Structure of client; Structure of client peripherals
    • H04N21/422Input-only peripherals, i.e. input devices connected to specially adapted client devices, e.g. global positioning system [GPS]
    • H04N21/42204User interfaces specially adapted for controlling a client device through a remote control device; Remote control devices therefor
    • HELECTRICITY
    • H04ELECTRIC COMMUNICATION TECHNIQUE
    • H04NPICTORIAL COMMUNICATION, e.g. TELEVISION
    • H04N21/00Selective content distribution, e.g. interactive television or video on demand [VOD]
    • H04N21/40Client devices specifically adapted for the reception of or interaction with content, e.g. set-top-box [STB]; Operations thereof
    • H04N21/41Structure of client; Structure of client peripherals
    • H04N21/422Input-only peripherals, i.e. input devices connected to specially adapted client devices, e.g. global positioning system [GPS]
    • H04N21/42204User interfaces specially adapted for controlling a client device through a remote control device; Remote control devices therefor
    • H04N21/42206User interfaces specially adapted for controlling a client device through a remote control device; Remote control devices therefor characterized by hardware details
    • H04N21/42224Touch pad or touch panel provided on the remote control
    • HELECTRICITY
    • H04ELECTRIC COMMUNICATION TECHNIQUE
    • H04NPICTORIAL COMMUNICATION, e.g. TELEVISION
    • H04N21/00Selective content distribution, e.g. interactive television or video on demand [VOD]
    • H04N21/40Client devices specifically adapted for the reception of or interaction with content, e.g. set-top-box [STB]; Operations thereof
    • H04N21/43Processing of content or additional data, e.g. demultiplexing additional data from a digital video stream; Elementary client operations, e.g. monitoring of home network or synchronising decoder's clock; Client middleware
    • H04N21/439Processing of audio elementary streams
    • HELECTRICITY
    • H04ELECTRIC COMMUNICATION TECHNIQUE
    • H04NPICTORIAL COMMUNICATION, e.g. TELEVISION
    • H04N21/00Selective content distribution, e.g. interactive television or video on demand [VOD]
    • H04N21/40Client devices specifically adapted for the reception of or interaction with content, e.g. set-top-box [STB]; Operations thereof
    • H04N21/47End-user applications
    • H04N21/478Supplemental services, e.g. displaying phone caller identification, shopping application
    • H04N21/4788Supplemental services, e.g. displaying phone caller identification, shopping application communicating with other users, e.g. chatting

Abstract

The embodiment of the application shows a sound processing method and display equipment, and is particularly suitable for a social television. According to the technical scheme, when the chat is started, the audio and video input port is distinguished, the second chip sends feedback information to the first chip in response to the fact that the input port is connected with the first chip, and the feedback information is used for indicating the first chip to reduce the volume of the audio and video sent to the second chip; and in response to the input port being the second port, the second chip reduces the volume of the received audio and video. According to the technical scheme, when a user chats, the problem that the sound (sound of input audio) of the chatting in a video window conflicts with the sound (sound of audio and video) of a program played by a television (sound of audio and video) in the current chatting situation is avoided by turning down the volume of the audio and video.

Description

Sound processing method and display device
This application claims priority to a chinese patent application filed by the national intellectual property office on 10/06/10 in 2019 under application number 201910498197.9. The entire contents of which are incorporated by reference in the present application.
Technical Field
The embodiment of the application relates to a display technology. And more particularly, to a sound processing method and a display apparatus.
Background
Currently, since a display device can provide a user with a play picture such as audio, video, picture, and the like, it is receiving a wide attention of the user. With the development of big data and artificial intelligence, the functional requirements of users on display devices are increasing day by day. For example, the user performs a video chat with a friend while the user wants to play the display screen; or when the user is in a game scene, chatting with game teammates in real time; or, the user learns the current screen content in the education application program and simultaneously performs remote audio-video interaction with parents/teachers in real time, so as to meet the requirement of chatting while watching.
In the current chat situation, because the sound of the chat in the video window and the sound of the program played by the television are in conflict, when the played sound is not very different, the interaction of the user is interfered, and therefore the user wants to highlight the sound of the chat in the scene.
Therefore, a sound processing method is urgently needed to solve the problem of sound conflict in the scene of chatting while looking, and provide good user experience for users.
Disclosure of Invention
In view of the above technical problems, an object of the present application is to provide a sound processing method and a display device.
A first aspect of the embodiments of the present application shows a sound processing method, which is applied to a display device, where the display device includes: a first chip, a second chip coupled to the first chip, and a speaker coupled to the second chip, the method comprising:
the second chip detects a chat sound corresponding to the chat application;
the second chip determines the input port of the current audio and video in response to the detection of the chat sound;
in response to the input port being a first port connected with the first chip on the second chip, the second chip sends feedback information to the first chip, wherein the feedback information is used for instructing the first chip to reduce the volume of the audio and video sent to the second chip so as to reduce the output volume of the loudspeaker;
and in response to the input port being a second port on the second chip and externally connected with the display device, the second chip reduces the volume of the received audio and video to reduce the output volume of the loudspeaker.
A second aspect of embodiments of the present application shows a display device, including:
the display screen is used for displaying the images of the audios and the videos;
the loudspeaker is used for outputting the sound of the audio and video;
a second chip connected to the speaker and the display screen, the second chip configured to detect a chat sound corresponding to a chat application;
the second chip responds to the chat sound detected and determines the input port of the current audio and video; in response to the input port being a port on the second chip connected to the first chip in the display device, the second chip sends feedback information to the first chip, where the feedback information is used to instruct the first chip to reduce the volume of the audio and video sent to the second chip; and in response to the input port being a second port on the second chip which is externally connected with the display device, the second chip reduces the volume of the received audio and video.
According to the technical scheme, when the chat is started, the audio and video input ports are distinguished, the second chip sends feedback information to the first chip in response to the fact that the input ports are the ports connected with the first chip, and the feedback information is used for indicating the first chip to reduce the volume of the audio and video sent to the second chip; and in response to the input port being the second port, the second chip reduces the volume of the received audio and video. According to the technical scheme, when a user chats, the problem that the sound (sound of input audio) of the chatting in a video window conflicts with the sound (sound of audio and video) of a program played by a television (sound of audio and video) in the current chatting situation is avoided by turning down the volume of the audio and video.
Drawings
In order to more clearly illustrate the embodiments of the present application or the technical solutions in the prior art, the drawings needed to be used in the embodiments will be briefly described below, and it is obvious that the drawings in the following description are only some embodiments of the present application, and it is obvious for those skilled in the art to obtain other drawings without creative efforts.
Fig. 1 is a schematic diagram illustrating an operation scenario between a display device and a control apparatus according to an embodiment;
fig. 2 is a block diagram exemplarily showing a hardware configuration of the control apparatus 100 according to the embodiment;
fig. 3 is a block diagram exemplarily showing a hardware configuration of the display device 200 according to the embodiment;
a block diagram of the hardware architecture of the display device 200 according to fig. 3 is exemplarily shown in fig. 4;
fig. 5 is a diagram exemplarily showing a functional configuration of the display device 200 according to the embodiment;
fig. 6a schematically shows a software configuration in the display device 200 according to an embodiment;
fig. 6b schematically shows a configuration of an application in the display device 200 according to an embodiment;
fig. 7 schematically illustrates a user interface in the display device 200 according to an embodiment;
FIG. 8 is a flow chart illustrating a method of sound processing;
a flow chart of network audio signal processing is shown in fig. 9 by way of example;
fig. 10 is a block diagram illustrating a structure of a display device.
Detailed Description
To make the objects, technical solutions and advantages of the exemplary embodiments of the present application clearer, the technical solutions in the exemplary embodiments of the present application will be clearly and completely described below with reference to the drawings in the exemplary embodiments of the present application, and it is obvious that the described exemplary embodiments are only a part of the embodiments of the present application, but not all the embodiments.
For the convenience of users, various external device interfaces are usually provided on the display device to facilitate connection of different peripheral devices or cables to implement corresponding functions. When a high-definition camera is connected to an interface of the display device, if a hardware system of the display device does not have a hardware interface of a high-pixel camera receiving the source code, data received by the camera cannot be displayed on a display screen of the display device.
Furthermore, due to the hardware structure, the hardware system of the conventional display device only supports one path of hard decoding resources, and usually only supports video decoding with a resolution of 4K at most, so when a user wants to perform video chat while watching a network television, the user needs to use the hard decoding resources (usually GPU in the hardware system) to decode the network video without reducing the definition of the network video screen, and in this case, the user can only process the video chat screen by using a general-purpose processor (e.g. CPU) in the hardware system to perform soft decoding on the video.
The soft decoding is adopted to process the video chat picture, so that the data processing burden of a CPU (central processing unit) can be greatly increased, and when the data processing burden of the CPU is too heavy, the problem of picture blocking or unsmooth flow can occur. Further, due to the data processing capability of the CPU, when the CPU performs soft decoding on the video chat screen, multi-channel video calls cannot be generally implemented, and when a user wants to perform video chat with multiple other users in the same chat scene, access is blocked.
In view of the above aspects, to overcome the above drawbacks, the present application discloses a dual hardware system architecture to implement multiple channels of video chat data (at least one channel of local video).
The concept to which the present application relates will be first explained below with reference to the drawings. It should be noted that the following descriptions of the concepts are only for the purpose of facilitating understanding of the contents of the present application, and do not represent limitations on the scope of the present application.
The term "module" as used in various embodiments of the present application may refer to any known or later developed hardware, software, firmware, artificial intelligence, fuzzy logic, or combination of hardware and/or software code that is capable of performing the functionality associated with that element.
The term "remote control" as used in the various embodiments of the present application refers to a component of an electronic device, such as the display device disclosed in the present application, that is capable of wirelessly controlling the electronic device, typically over a relatively short distance. The component may typically be connected to the electronic device using infrared and/or Radio Frequency (RF) signals and/or bluetooth, and may also include functional modules such as WiFi, wireless USB, bluetooth, motion sensors, etc. For example: the hand-held touch remote controller replaces most of the physical built-in hard keys in the common remote control device with the user interface in the touch screen.
The term "gesture" as used in the embodiments of the present application refers to a user's behavior through a change in hand shape or an action such as hand movement to express an intended idea, action, purpose, or result.
The term "hardware system" used in the embodiments of the present application may refer to a physical component having computing, controlling, storing, inputting and outputting functions, which is formed by a mechanical, optical, electrical and magnetic device such as an Integrated Circuit (IC), a Printed Circuit Board (PCB) and the like. In various embodiments of the present application, a hardware system may also be referred to as a motherboard (or chip).
Fig. 1 is a schematic diagram illustrating an operation scenario between a display device and a control apparatus according to an embodiment. As shown in fig. 1, a user may operate the display apparatus 200 through the control device 100.
The control device 100 may be a remote controller 100A, which can communicate with the display device 200 through an infrared protocol communication, a bluetooth protocol communication, a ZigBee (ZigBee) protocol communication, or other short-range communication, and is used to control the display device 200 in a wireless or other wired manner. The user may input a user instruction through a key on a remote controller, voice input, control panel input, etc., to control the display apparatus 200. Such as: the user can input a corresponding control command through a volume up/down key, a channel control key, up/down/left/right moving keys, a voice input key, a menu key, a power on/off key, etc. on the remote controller, to implement the function of controlling the display device 200.
The control apparatus 100 may also be a smart device, such as a mobile terminal 100B, a tablet computer, a notebook computer, etc., which may communicate with the display device 200 through a Local Area Network (LAN), a Wide Area Network (WAN), a Wireless Local Area Network (WLAN), or other networks, and implement control of the display device 200 through an application program corresponding to the display device 200.
For example, the mobile terminal 100B and the display device 200 may each have a software application installed thereon, so that connection communication between the two can be realized through a network communication protocol, and the purpose of one-to-one control operation and data communication can be further realized. Such as: a control instruction protocol can be established between the mobile terminal 100B and the display device 200, a remote control keyboard is synchronized to the mobile terminal 100B, and the function of controlling the display device 200 is realized by controlling a user interface on the mobile terminal 100B; the audio and video content displayed on the mobile terminal 100B may also be transmitted to the display device 200, so as to implement a synchronous display function.
As shown in fig. 1, the display apparatus 200 may also perform data communication with the server 300 through various communication means. In various embodiments of the present application, the display device 200 may be allowed to be communicatively coupled to the server 300 via a local area network, a wireless local area network, or other network. The server 300 may provide various contents and interactions to the display apparatus 200.
Illustratively, the display device 200 receives software Program updates, or accesses a remotely stored digital media library by sending and receiving information, and Electronic Program Guide (EPG) interactions. The servers 300 may be a group or groups, and may be one or more types of servers. Other web service contents such as a video on demand and an advertisement service are provided through the server 300.
The display device 200 may be a liquid crystal display, an oled (organic Light Emitting diode) display, a projection display device, or an intelligent tv. The specific display device type, size, resolution, etc. are not limiting, and those skilled in the art will appreciate that the display device 200 may be modified in performance and configuration as desired.
The display apparatus 200 may additionally provide an intelligent network tv function that provides a computer support function in addition to the broadcast receiving tv function. Examples include a web tv, a smart tv, an Internet Protocol Tv (IPTV), and the like.
As shown in fig. 1, the display device may be connected or provided with a camera, and is configured to present a picture taken by the camera on a display interface of the display device or other display devices, so as to implement interactive chat between users. Specifically, the picture shot by the camera can be displayed on the display device in a full screen mode, a half screen mode or any optional area.
As an optional connection mode, the camera is connected with the display rear shell through the connecting plate, is fixedly installed in the middle of the upper side of the display rear shell, and can be fixedly installed at any position of the display rear shell as an installable mode, so that an image acquisition area is ensured not to be shielded by the rear shell, for example, the display orientation of the image acquisition area is the same as that of the display equipment.
As another alternative connection mode, the camera is connected to the display rear shell through a connection board or other conceivable connector, the camera is capable of lifting, the connector is provided with a lifting motor, when a user wants to use the camera or an application program wants to use the camera, the camera is lifted out of the display, and when the camera is not needed, the camera can be embedded in the rear shell to protect the camera from being damaged.
As an embodiment, the camera adopted in the present application may have 1600 ten thousand pixels, so as to achieve the purpose of ultra high definition display. In actual use, cameras higher or lower than 1600 ten thousand pixels may also be used.
After the camera is installed on the display device, the contents displayed by different application scenes of the display device can be fused in various different modes, so that the function which cannot be realized by the traditional display device is achieved.
Illustratively, a user may conduct a video chat with at least one other user while watching a video program. The presentation of the video program may be as a background frame over which a window for video chat is displayed. Pictorially, this function may be referred to as "chat while looking".
Optionally, in the scene of "chat while watching", at least one video chat is performed across the terminals while watching the live video or the network video.
In another example, a user can conduct a video chat with at least one other user while entering the educational application for learning. For example, a student may interact remotely with a teacher while learning content in an educational application. Figuratively, this function may be referred to as "chatting while learning".
In another example, a user conducts a video chat with a player entering a card game while playing the game. For example, a player may enable remote interaction with other players when entering a gaming application to participate in a game. Figuratively, the function may be referred to as "play while watching".
Optionally, the game scene is fused with the video picture, the portrait in the video picture is scratched and displayed in the game picture, and the user experience is improved.
Optionally, in the motion sensing game (such as ball hitting, boxing, running and dancing), the human posture and motion, limb detection and tracking and human skeleton key point data detection are obtained through the camera, and then the human posture and motion, the limb detection and tracking and the human skeleton key point data detection are fused with the animation in the game, so that the game of scenes such as sports and dancing is realized.
In another example, a user may interact with at least one other user in a karaoke application in video and voice. Figuratively, this function may be referred to as "sing while looking". Preferably, when at least one user enters the application in a chat scenario, a plurality of users can jointly complete recording of a song.
In another example, a user may turn on a camera locally to take pictures and videos, figurative, which may be referred to as "looking into the mirror".
In other examples, more or less functionality may be added. The function of the display device is not particularly limited in the present application.
Fig. 2 is a block diagram schematically showing the configuration of the control apparatus 100 according to the exemplary embodiment. As shown in fig. 2, the control device 100 includes a controller 110, a communicator 130, a user input/output interface 140, a memory 190, and a power supply 180.
The control apparatus 100 is configured to control the display device 200, and to receive an input operation instruction from a user, and convert the operation instruction into an instruction recognizable and responsive by the display device 200, and to mediate interaction between the user and the display device 200. Such as: the user operates the channel up/down key on the control device 100, and the display device 200 responds to the channel up/down operation.
In some embodiments, the control device 100 may be a smart device. Such as: the control apparatus 100 may install various applications that control the display device 200 according to user demands.
In some embodiments, as shown in fig. 1, the mobile terminal 100B or other intelligent electronic device may function similar to the control apparatus 100 after installing an application for manipulating the display device 200. Such as: the user may implement the functions of controlling the physical keys of the apparatus 100 by installing applications, various function keys or virtual buttons of a graphical user interface available on the mobile terminal 100B or other intelligent electronic devices.
The controller 110 includes a processor 112, a RAM113 and a ROM114, a communication interface, and a communication bus. The controller 110 is used to control the operation of the control device 100, as well as the internal components for communication and coordination and external and internal data processing functions.
The communicator 130 enables communication of control signals and data signals with the display apparatus 200 under the control of the controller 110. Such as: the received user input signal is transmitted to the display apparatus 200. The communicator 130 may include at least one of a WIFI module 131, a bluetooth module 132, an NFC module 133, and the like.
A user input/output interface 140, wherein the input interface includes at least one of a microphone 141, a touch pad 142, a sensor 143, a key 144, and the like. Such as: the user can realize a user instruction input function through actions such as voice, touch, gesture, pressing, and the like, and the input interface converts the received analog signal into a digital signal and converts the digital signal into a corresponding instruction signal, and sends the instruction signal to the display device 200.
The output interface includes an interface that transmits the received user instruction to the display apparatus 200. In some embodiments, it may be an infrared interface or a radio frequency interface. Such as: when the infrared signal interface is used, the user input instruction needs to be converted into an infrared control signal according to an infrared control protocol, and the infrared control signal is sent to the display device 200 through the infrared sending module. The following steps are repeated: when the rf signal interface is used, a user input command needs to be converted into a digital signal, and then the digital signal is modulated according to the rf control signal modulation protocol and then transmitted to the display device 200 through the rf transmitting terminal.
In some embodiments, the control device 100 includes at least one of a communicator 130 and an output interface. The communicator 130 is configured in the control device 100, such as: the modules of WIFI, bluetooth, NFC, etc. may send the user input command to the display device 200 through the WIFI protocol, or the bluetooth protocol, or the NFC protocol code.
And a memory 190 for storing various operation programs, data and applications for driving and controlling the control apparatus 100 under the control of the controller 110. The memory 190 may store various control signal commands input by a user.
And a power supply 180 for providing operational power support to the components of the control device 100 under the control of the controller 110. A battery and associated control circuitry.
A hardware configuration block diagram of a hardware system in the display apparatus 200 according to an exemplary embodiment is exemplarily shown in fig. 3.
When a dual hardware system architecture is adopted, the mechanism relationship of the hardware system can be shown in fig. 3. For convenience of description, one hardware system in the dual hardware system architecture will be referred to as a first hardware system or a system, a-chip, and the other hardware system will be referred to as a second hardware system or N-system, N-chip. The chip A comprises a controller of the chip A and various interfaces, and the chip N comprises a controller of the chip N and various interfaces. The a-chip and the N-chip may each have a separate operating system installed therein, so that there are two separate but interrelated subsystems in the display apparatus 200.
As shown in fig. 3, the a chip and the N chip may be connected, communicated and powered through a plurality of different types of interfaces. The interface type of the interface between the a chip and the N chip may include a General-purpose input/output (GPIO) interface, a USB interface, an HDMI interface, a UART interface, and the like. One or more of these interfaces may be used for communication or power transfer between the a-chip and the N-chip. For example, as shown in fig. 3, in the dual hardware system architecture, the N chip may be powered by an external power source (power), and the a chip may not be powered by the external power source but by the N chip.
In addition to the interface for connecting with the N chip, the a chip may further include an interface for connecting other devices or components, such as an MIPI interface for connecting a Camera (Camera) shown in fig. 3, a bluetooth interface, and the like.
Similarly, in addition to the interface for connecting with the N chip, the N chip may further include an VBY interface for connecting with a display screen tcon (timer Control register), and an i2S interface for connecting with a power Amplifier (AMP) and a Speaker (Speaker); and an IR/Key interface, a USB interface, a Wifi interface, a bluetooth interface, an HDMI interface, a Tuner interface, and the like.
The dual hardware system architecture of the present application is further described below with reference to fig. 4. It should be noted that fig. 4 is only an exemplary illustration of the dual hardware system architecture of the present application, and does not represent a limitation of the present application. In actual practice, both hardware systems may contain more or less hardware or interfaces as desired.
A block diagram of the hardware architecture of the display device 200 according to fig. 3 is exemplarily shown in fig. 4. As shown in fig. 4, the hardware system of the display device 200 may include an a chip and an N chip, and a module connected to the a chip or the N chip through various interfaces.
The N-chip may include a tuner demodulator 220, a communicator 230, an external device interface 250, a controller 210, a memory 290, a user input interface, a video processor 260-1, an audio processor 260-2, a display 280, an audio output interface 272, and a power supply. The N-chip may also include more or fewer modules in other embodiments.
The tuning demodulator 220 is configured to perform modulation and demodulation processing such as amplification, mixing, resonance and the like on a broadcast television signal received in a wired or wireless manner, so as to demodulate an audio/video signal carried in a frequency of a television channel selected by a user and additional information (e.g., an EPG data signal) from a plurality of wireless or wired broadcast television signals. Depending on the broadcast system of the television signal, the signal path of the tuner 220 may be various, such as: terrestrial broadcasting, cable broadcasting, satellite broadcasting, internet broadcasting, or the like; according to different modulation types, the adjustment mode of the signal can be a digital modulation mode or an analog modulation mode; and depending on the type of television signal being received, tuner demodulator 220 may demodulate analog and/or digital signals.
The tuner demodulator 220 is also operative to respond to the user-selected television channel frequency and the television signals carried thereby, in accordance with the user selection, and as controlled by the controller 210.
In other exemplary embodiments, the tuner/demodulator 220 may be in an external device, such as an external set-top box. In this way, the set-top box outputs television audio/video signals after modulation and demodulation, and the television audio/video signals are input into the display device 200 through the external device interface 250.
The communicator 230 is a component for communicating with an external device or an external server according to various communication protocol types. For example: the communicator 230 may include a WIFI module 231, a bluetooth communication protocol module 232, a wired ethernet communication protocol module 233, and other network communication protocol modules such as an infrared communication protocol module or a near field communication protocol module.
The display apparatus 200 may establish a connection of a control signal and a data signal with an external control apparatus or a content providing apparatus through the communicator 230. For example, the communicator may receive a control signal of the remote controller 100 according to the control of the controller.
The external device interface 250 is a component for providing data transmission between the N-chip controller 210 and the a-chip and other external devices. The external device interface may be connected with an external apparatus such as a set-top box, a game device, a notebook computer, etc. in a wired/wireless manner, and may receive data such as a video signal (e.g., moving image), an audio signal (e.g., music), additional information (e.g., EPG), etc. of the external apparatus.
The external device interface 250 may include: a High Definition Multimedia Interface (HDMI) terminal 251, a Composite Video Blanking Sync (CVBS) terminal 252, an analog or digital component terminal 253, a Universal Serial Bus (USB) terminal 254, a red, green, blue (RGB) terminal (not shown), and the like. The number and type of external device interfaces are not limited by this application.
The controller 210 controls the operation of the display device 200 and responds to the user's operation by running various software control programs (e.g., an operating system and/or various application programs) stored on the memory 290.
As shown in fig. 4, the controller 210 includes a read only memory RAM213, a random access memory ROM214, a graphics processor 216, a CPU processor 212, a communication interface 218, and a communication bus. The RAM213 and the ROM214, the graphic processor 216, the CPU processor 212, and the communication interface 218 are connected via a bus.
A ROM213 for storing instructions for various system boots. If the display device 200 is powered on upon receipt of the power-on signal, the CPU processor 212 executes a system boot instruction in the ROM and copies the operating system stored in the memory 290 to the RAM214 to start running the boot operating system. After the start of the operating system is completed, the CPU processor 212 copies the various application programs in the memory 290 to the RAM214, and then starts running and starting the various application programs.
A graphics processor 216 for generating various graphics objects, such as: icons, operation menus, user input instruction display graphics, and the like. The display device comprises an arithmetic unit which carries out operation by receiving various interactive instructions input by a user and displays various objects according to display attributes. And a renderer for generating various objects based on the operator and displaying the rendered result on the display 280.
A CPU processor 212 for executing operating system and application program instructions stored in memory 290. And executing various application programs, data and contents according to various interactive instructions received from the outside so as to finally display and play various audio and video contents.
In some exemplary embodiments, the CPU processor 212 may include a plurality of processors. The plurality of processors may include a main processor and a plurality of or a sub-processor. A main processor for performing some operations of the display apparatus 200 in a pre-power-up mode and/or operations of displaying a screen in a normal mode. A plurality of or one sub-processor for performing an operation in a standby mode or the like.
The communication interfaces may include a first communication interface 218-1 through an nth interface 218-n. These interfaces may be network interfaces that are connected to external devices via a network.
The controller 210 may control the overall operation of the display apparatus 200. For example: in response to receiving a user command for selecting a UI object to be displayed on the display 280, the controller 210 may perform an operation related to the object selected by the user command.
Wherein the object may be any one of selectable objects, such as a hyperlink or an icon. Operations related to the selected object, such as: displaying an operation connected to a hyperlink page, document, image, or the like, or performing an operation of a program corresponding to an icon. The user command for selecting the UI object may be a command input through various input means (e.g., a mouse, a keyboard, a touch pad, etc.) connected to the display apparatus 200 or a voice command corresponding to a voice spoken by the user.
The memory 290 includes a memory for storing various software modules for driving and controlling the display apparatus 200. Such as: various software modules stored in memory 290, including: the system comprises a basic module, a detection module, a communication module, a display control module, a browser module, various service modules and the like.
The basic module is a bottom layer software module for signal communication between hardware in the display device 200 and sending processing and control signals to an upper layer module. The detection module is a management module used for collecting various information from various sensors or user input interfaces, and performing digital-to-analog conversion and analysis management.
For example: the voice recognition module comprises a voice analysis module and a voice instruction database module. The display control module is a module for controlling the display 280 to display image content, and may be used to play information such as multimedia image content and UI interface. The communication module is used for carrying out control and data communication with external equipment. And the browser module is used for executing data communication between the browsing servers. The service module is a module for providing various services and various application programs.
Meanwhile, the memory 290 is also used to store visual effect maps and the like for receiving external data and user data, images of respective items in various user interfaces, and a focus object.
A user input interface for transmitting an input signal of a user to the controller 210 or transmitting a signal output from the controller to the user. For example, the control device (e.g., a mobile terminal or a remote controller) may send an input signal, such as a power switch signal, a channel selection signal, a volume adjustment signal, etc., input by a user to the user input interface, and then the input signal is forwarded to the controller by the user input interface; alternatively, the control device may receive an output signal such as audio, video, or data output from the user input interface via the controller, and display the received output signal or output the received output signal in audio or vibration form.
In some embodiments, a user may enter a user command on a Graphical User Interface (GUI) displayed on the display 280, and the user input interface receives the user input command through the Graphical User Interface (GUI). Alternatively, the user may input the user command by inputting a specific sound or gesture, and the user input interface receives the user input command by recognizing the sound or gesture through the sensor.
The video processor 260-1 is configured to receive a video signal, and perform video data processing such as decompression, decoding, scaling, noise reduction, frame rate conversion, resolution conversion, and image synthesis according to a standard codec protocol of the input signal, so as to obtain a video signal that is directly displayed or played on the display 280.
Illustratively, the video processor 260-1 includes a demultiplexing module, a video decoding module, an image synthesizing module, a frame rate conversion module, a display formatting module, and the like.
The demultiplexing module is used for demultiplexing the input audio and video data stream, and if the input MPEG-2 is input, the demultiplexing module demultiplexes the input audio and video data stream into a video signal and an audio signal.
And the video decoding module is used for processing the video signal after demultiplexing, including decoding, scaling and the like.
And the image synthesis module is used for carrying out superposition mixing processing on the GUI signal input by the user or generated by the user and the video image after the zooming processing by the graphic generator so as to generate an image signal for display.
The frame rate conversion module is configured to convert a frame rate of an input video, such as a 24Hz, 25Hz, 30Hz, or 60Hz video, into a 60Hz, 120Hz, or 240Hz frame rate, where the input frame rate may be related to a source video stream, and the output frame rate may be related to an update rate of a display. The input is realized in a common format by using a frame insertion mode.
And a display formatting module for converting the signal output by the frame rate conversion module into a signal conforming to a display format of a display, such as converting the format of the signal output by the frame rate conversion module to output an RGB data signal.
And a display 280 for receiving the image signal input from the video processor 260-1 and displaying the video content and image and the menu manipulation interface. The display 280 includes a display component for presenting a picture and a driving component for driving the display of an image. The video content may be displayed from the video in the broadcast signal received by the tuner/demodulator 220, or from the video content input from the communicator or the external device interface. And a display 220 simultaneously displaying a user manipulation interface UI generated in the display apparatus 200 and used to control the display apparatus 200.
And, a driving component for driving the display according to the type of the display 280. Alternatively, in case the display 280 is a projection display, it may also comprise a projection device and a projection screen.
The audio processor 260-2 is configured to receive an audio signal, decompress and decode the audio signal according to a standard codec protocol of the input signal, and perform noise reduction, digital-to-analog conversion, amplification and other audio data processing to obtain an audio signal that can be played in the speaker 272.
An audio output interface 270 for receiving the audio signal output by the audio processor 260-2 under the control of the controller 210, wherein the audio output interface may include a speaker 272 or an external sound output terminal 274 for outputting to a generating device of an external device, such as: external sound terminal or earphone output terminal.
In other exemplary embodiments, video processor 260-1 may comprise one or more chip components. The audio processor 260-2 may also include one or more chips.
And, in other exemplary embodiments, the video processor 260-1 and the audio processor 260-2 may be separate chips or may be integrated in one or more chips with the controller 210.
And a power supply for supplying power supply support to the display apparatus 200 from the power input from the external power source under the control of the controller 210. The power supply may include a built-in power supply circuit installed inside the display apparatus 200, or may be a power supply installed outside the display apparatus 200, such as a power supply interface for providing an external power supply in the display apparatus 200.
Similar to the N-chip, as shown in fig. 4, the a-chip may include a controller 310, a communicator 330, a detector 340, and a memory 390. A user input interface, a video processor, an audio processor, a display, an audio output interface may also be included in some embodiments. In some embodiments, there may also be a power supply that independently powers the A-chip.
The communicator 330 is a component for communicating with an external device or an external server according to various communication protocol types. For example: the communicator 330 may include a WIFI module 331, a bluetooth communication protocol module 332, a wired ethernet communication protocol module 333, and other network communication protocol modules such as an infrared communication protocol module or a near field communication protocol module.
The communicator 330 of the a-chip and the communicator 230 of the N-chip also interact with each other. For example, the N-chip WiFi module 231 is used to connect to an external network, generate network communication with an external server, and the like. The WiFi module 331 of the a chip is used to connect to the WiFi module 231 of the N chip without making a direct connection with an external network or the like. Therefore, for the user, a display device as in the above embodiment displays a WiFi account to the outside.
The detector 340 is a component of the display device a chip for collecting signals of an external environment or interacting with the outside. The detector 340 may include a light receiver 342, a sensor for collecting the intensity of ambient light, which may be used to adapt to display parameter changes, etc.; the system may further include an image collector 341, such as a camera, a video camera, etc., which may be configured to collect external environment scenes, collect attributes of the user or interact gestures with the user, adaptively change display parameters, and identify user gestures, so as to implement a function of interaction with the user.
An external device interface 350, which provides a component for data transmission between the controller 310 and the N-chip or other external devices. The external device interface may be connected with an external apparatus such as a set-top box, a game device, a notebook computer, etc. in a wired/wireless manner.
The controller 310 controls the operation of the display device 200 and responds to the user's operation by running various software control programs stored on the memory 390 (e.g., using installed third party applications, etc.), and interacting with the N-chip.
As shown in fig. 4, the controller 310 includes a read only memory ROM313, a random access memory RAM314, a graphics processor 316, a CPU processor 312, a communication interface 318, and a communication bus. The ROM313 and the RAM314, the graphic processor 316, the CPU processor 312, and the communication interface 318 are connected via a bus.
A ROM313 for storing instructions for various system boots. CPU processor 312 executes system boot instructions in ROM and copies the operating system stored in memory 390 to RAM314 to begin running the boot operating system. After the start of the operating system is completed, the CPU processor 312 copies various application programs in the memory 390 to the RAM314, and then starts running and starting various application programs.
The CPU processor 312 is used for executing the operating system and application program instructions stored in the memory 390, communicating with the N chip, transmitting and interacting signals, data, instructions, etc., and executing various application programs, data and contents according to various interaction instructions received from the outside, so as to finally display and play various audio and video contents.
The communication interfaces may include a first communication interface 318-1 through an nth interface 318-n. These interfaces may be network interfaces connected to external devices via a network, or may be network interfaces connected to the N-chip via a network.
The controller 310 may control the overall operation of the display apparatus 200. For example: in response to receiving a user command for selecting a UI object to be displayed on the display 280, the controller 210 may perform an operation related to the object selected by the user command.
A graphics processor 316 for generating various graphics objects, such as: icons, operation menus, user input instruction display graphics, and the like. The display device comprises an arithmetic unit which carries out operation by receiving various interactive instructions input by a user and displays various objects according to display attributes. And a renderer for generating various objects based on the operator and displaying the rendered result on the display 280.
Both the A-chip graphics processor 316 and the N-chip graphics processor 216 are capable of generating various graphics objects. In distinction, if application 1 is installed on the a-chip and application 2 is installed on the N-chip, the a-chip graphics processor 316 generates a graphics object when a user performs a command input by the user in application 1 at the interface of application 1. When a user makes a command input by the user in the interface of the application 2 and within the application 2, a graphic object is generated by the graphic processor 216 of the N chip.
Fig. 5 is a diagram schematically illustrating a functional configuration of a display device according to an exemplary embodiment.
As shown in fig. 5, the memory 390 of the a-chip and the memory 290 of the N-chip are used to store an operating system, an application program, contents, user data, and the like, respectively, and perform system operations for driving the display device 200 and various operations in response to a user under the control of the controller 310 of the a-chip and the controller 210 of the N-chip. The A-chip memory 390 and the N-chip memory 290 may include volatile and/or non-volatile memory.
The memory 290 is specifically configured to store an operating program for driving the controller 210 in the display device 200, and store various applications installed in the display device 200, various applications downloaded by a user from an external device, various graphical user interfaces related to the applications, various objects related to the graphical user interfaces, user data information, and internal data of various supported applications. The memory 290 is used to store system software such as an Operating System (OS) kernel, middleware, and applications, and to store input video data and audio data, and other user data.
The memory 290 is specifically used for storing drivers and related data such as the video processor 260-1 and the audio processor 260-2, the display 280, the communication interface 230, the tuner demodulator 220, the input/output interface, and the like.
In some embodiments, memory 290 may store software and/or programs, software programs for representing an Operating System (OS) including, for example: a kernel, middleware, an Application Programming Interface (API), and/or an application program. For example, the kernel may control or manage system resources, or functions implemented by other programs (e.g., the middleware, APIs, or applications), and the kernel may provide interfaces to allow the middleware and APIs, or applications, to access the controller to implement controlling or managing system resources.
The memory 290, for example, includes a broadcast receiving module 2901, a channel control module 2902, a volume control module 2903, an image control module 2904, a display control module 2905, an audio control module 2906, an external instruction recognition module 2907, a communication control module 2908, a light receiving module 2909, a power control module 2910, an operating system 2911, and other applications 2912, a browser module, and the like. The controller 210 performs functions such as: the system comprises a broadcast television signal receiving and demodulating function, a television channel selection control function, a volume selection control function, an image control function, a display control function, an audio control function, an external instruction identification function, a communication control function, an optical signal receiving function, an electric power control function, a software control platform supporting various functions, a browser function and other various functions.
The memory 390 includes a memory storing various software modules for driving and controlling the display apparatus 200. Such as: various software modules stored in memory 390, including: the system comprises a basic module, a detection module, a communication module, a display control module, a browser module, various service modules and the like. Since the functions of the memory 390 and the memory 290 are similar, reference may be made to the memory 290 for relevant points, and thus, detailed description thereof is omitted here.
Illustratively, the memory 390 includes an image control module 3904, an audio control module 2906, an external instruction recognition module 3907, a communication control module 3908, a light receiving module 3909, an operating system 3911, and other application programs 3912, a browser module, and the like. The controller 210 performs functions such as: the system comprises an image control function, a display control function, an audio control function, an external instruction identification function, a communication control function, an optical signal receiving function, an electric power control function, a software control platform supporting various functions, a browser function and other various functions.
Differently, the external instruction recognition module 2907 of the N-chip and the external instruction recognition module 3907 of the a-chip can recognize different instructions.
Illustratively, since the image receiving device such as a camera is connected with the a-chip, the external instruction recognition module 3907 of the a-chip may include the pattern recognition module 2907-1, a pattern database is stored in the pattern recognition module 3907-1, and when the camera receives an external pattern instruction, the camera corresponds to the instruction in the pattern database to perform instruction control on the display device. Since the voice receiving device and the remote controller are connected to the N-chip, the external command recognition module 2907 of the N-chip may include a voice recognition module 2907-2, a voice database is stored in the graphic recognition module 2907-2, and when the voice receiving device receives an external voice command or the like, the voice receiving device and the like perform a corresponding relationship with a command in the voice database to perform command control on the display device. Similarly, a control device 100 such as a remote controller is connected to the N-chip, and a key command recognition module performs command interaction with the control device 100.
A block diagram of a configuration of a software system in a display device 200 according to an exemplary embodiment is exemplarily shown in fig. 6 a.
For an N-chip, as shown in fig. 6a, the operating system 2911, which includes executing operating software for handling various basic system services and for performing hardware related tasks, serves as an intermediary between applications and hardware components for data processing.
In some embodiments, portions of the operating system kernel may contain a series of software to manage the display device hardware resources and provide services to other programs or software code.
In other embodiments, portions of the operating system kernel may include one or more device drivers, which may be a set of software code in the operating system that assists in operating or controlling the devices or hardware associated with the display device. The drivers may contain code that operates the video, audio, and/or other multimedia components. Examples include a display, a camera, Flash, WiFi, and audio drivers.
The accessibility module 2911-1 is configured to modify or access the application program to achieve accessibility and operability of the application program for displaying content.
A communication module 2911-2 for connection to other peripherals via associated communication interfaces and a communication network.
The user interface module 2911-3 is configured to provide an object for displaying a user interface, so that each application program can access the object, and user operability can be achieved.
Control applications 2911-4 for controlling process management, including runtime applications and the like.
The event transmission system 2914 may be implemented within the operating system 2911 or within the application 2912. In some embodiments, an aspect is implemented within the operating system 2911, while implemented in the application 2912, for listening for various user input events, and will implement one or more sets of predefined operations in response to various events referring to the recognition of various types of events or sub-events.
The event monitoring module 2914-1 is configured to monitor an event or a sub-event input by the user input interface.
The event identification module 2914-1 is configured to input definitions of various types of events for various user input interfaces, identify various events or sub-events, and transmit the same to a process for executing one or more corresponding sets of processes.
The event or sub-event refers to an input detected by one or more sensors in the display device 200 and an input of an external control device (e.g., the control apparatus 100). Such as: the method comprises the following steps of inputting various sub-events through voice, inputting a gesture sub-event through gesture recognition, inputting a remote control key command of a control device and the like. Illustratively, the one or more sub-events in the remote control include a variety of forms including, but not limited to, one or a combination of key presses up/down/left/right/, ok keys, key presses, and the like. And non-physical key operations such as move, hold, release, etc.
The interface layout management module 2913, directly or indirectly receiving the input events or sub-events from the event transmission system 2914, monitors the input events or sub-events, and updates the layout of the user interface, including but not limited to the position of each control or sub-control in the interface, and the size, position, and level of the container, which are related to the layout of the interface.
Since the functions of the operating system 3911 of the a chip are similar to those of the operating system 2911 of the N chip, reference may be made to the operating system 2911 for relevant points, and details are not repeated here.
As shown in fig. 6b, the application layer of the display device contains various applications that can be executed at the display device 200.
The N-chip application layer 2912 may include, but is not limited to, one or more applications such as: a video-on-demand application, an application center, a game application, and the like. The application layer 3912 of the a-chip may include, but is not limited to, one or more applications such as: live television applications, media center applications, and the like. It should be noted that what applications are respectively contained in the a chip and the N chip is determined according to an operating system and other designs, and the present invention does not need to make specific limitations and divisions on the applications contained in the a chip and the N chip.
The live television application program can provide live television through different signal sources. For example, a live television application may provide television signals using input from cable television, radio broadcasts, satellite services, or other types of live television services. And, the live television application may display video of the live television signal on the display device 200.
A video-on-demand application may provide video from different storage sources. Unlike live television applications, video on demand provides a video display from some storage source. For example, the video on demand may come from a server side of the cloud storage, from a local hard disk storage containing stored video programs.
The media center application program can provide various applications for playing multimedia contents. For example, a media center, which may be other than live television or video on demand, may provide services that a user may access to various images or audio through a media center application.
The application program center can provide and store various application programs. The application may be a game, an application, or some other application associated with a computer system or other device that may be run on a display device. The application center may obtain these applications from different sources, store them in local storage, and then be operable on the display device 200.
A schematic diagram of a user interface in a display device 200 according to an exemplary embodiment is illustrated in fig. 7. As shown in fig. 7, the user interface includes a plurality of view display areas, illustratively, a first view display area 201 and a play screen 202, wherein the play screen includes a layout of one or more different items. And a selector in the user interface indicating that the item is selected, the position of the selector being movable by user input to change the selection of a different item.
It should be noted that the multiple view display areas may present display screens of different hierarchies. For example, a first view display area may present video chat project content and a second view display area may present application layer project content (e.g., web page video, VOD presentations, application screens, etc.).
Optionally, the different view display areas are presented with different priorities, and the display priorities of the view display areas are different among the view display areas with different priorities. If the priority of the system layer is higher than that of the application layer, when the user uses the acquisition selector and picture switching in the application layer, the picture display of the view display area of the system layer is not blocked; and when the size and the position of the view display area of the application layer are changed according to the selection of the user, the size and the position of the view display area of the system layer are not influenced.
The display frames of the same hierarchy can also be presented, at this time, the selector can switch between the first view display area and the second view display area, and when the size and the position of the first view display area are changed, the size and the position of the second view display area can be changed along with the change.
Since the a-chip and the N-chip may have independent operating systems installed therein, there are two independent but interrelated subsystems in the display device 200. For example, Android and various APPs can be independently installed on the chip a and the chip N, so that each chip can realize a certain function, and the chip a and the chip N cooperatively realize a certain function.
In the current chat situation, because the sound of the chat in the video window and the sound of the program played by the television are in conflict, when the played sound is not very different, the interaction of the user is interfered, and therefore the user wants to highlight the sound of the chat in the scene.
Therefore, a sound processing method is urgently needed to solve the problem of sound conflict in the scene of chatting while looking, and provide good user experience for users.
In order to solve the foregoing technical problem, a first aspect of an embodiment of the present application shows a sound processing method, and specifically, referring to fig. 8, the method shown in the embodiment of the present application includes the following steps:
s101, detecting a chat sound corresponding to the chat application by a second chip;
the chat sound includes: the voice received through the voice entry module of the display device and the opposite-end voice received from the opposite end and required to be output from the speaker, which is output by the chat application.
The detecting of the chat sound corresponding to the chat application comprises: and detecting the sound received by the sound recording module and/or detecting the opposite-end sound output to a loudspeaker by the chat application.
And the chat application outputs the opposite-end sound of the loudspeaker and the sound of the audio and video to the loudspeaker to be played through the loudspeaker at the same time. Therefore, the audio and video sound may interfere with the opposite end sound output to the speaker by the chat application. In this case, it is necessary to reduce the audio and video sounds so that the user can better receive the opposite-end sound output by the chat application to the speaker.
S102 the second chip responds to the input audio and determines the input port of the current audio and video.
And when the second chip detects the chat sound, triggering the second chip to determine the input port of the audio and video at the moment. Wherein, the input port of audio frequency and video includes: the display device comprises a first port connected with the first chip and a second port connected with the exterior of the display device.
And the first port (also referred to as a first port in the application) is connected with the first chip and is used for receiving the sound of the audio and video transmitted by the third-party application installed on the first chip. In a feasible embodiment, the first port connected to the first chip may be a USB port (Universal Serial Bus) or an HDMI port (High Definition Multimedia Interface). And a first port connected with the first chip sends audio and video to the second chip. The HDMI port mainly sends audio and video signals received by a third-party application installed on the first chip to the second chip, and the audio and video signals comprise: network video signals, network audio data, image information collected by a camera, and the like. The signal received by the first chip is transmitted to the second chip through the HDMIO, and it should be noted that in the technical solution shown in the embodiment of the present application, the HDMI port is not open to the user. The USB is mainly used for transmitting the image data of the camera to the second chip through the USB.
And a second port (also referred to as a second port in this application) externally connected to the display device for receiving the audio-video data. In a feasible embodiment, the second chip is connected with the external set-top box through the second port. The second port may be an AV composite Video port, an S-terminal port (independent Video terminal port), a color difference port, a VGA port (Video Graphics Array, Video transmission standard port), a DVI port (Digital Video Interface), an HDMI port, or the like. And audio and video signals received by the external set top box are transmitted to the second chip through the second port.
And responding to the input port connected with the first chip, executing S103, sending feedback information to the first chip by the second chip, wherein the feedback information is used for indicating the first chip to reduce the volume of the audio and video sent to the second chip so as to reduce the output volume of the loudspeaker.
The second chip also comprises a communication port different from the port for inputting the audio and video, and a connecting channel between the first chip and the second chip comprises a first channel passing through the communication port; the sending, by the second chip, the feedback information to the first chip includes: and the second chip sends feedback information to the first chip through the first channel.
The communication interface may be a UART (Universal Asynchronous Receiver/Transmitter), the first channel is a communication channel formed by connecting UART interfaces, and the second chip sends feedback information to the first chip through the first channel.
The communication interface may also be connected via a network interface. And the second chip sends feedback information to the first chip through the network channel.
And in response to that the input port is a second port, executing S104 to reduce the volume of the received audio and video by a second chip so as to reduce the output volume of the loudspeaker.
Therefore, in the technical scheme shown in the embodiment of the application, the audio and video received by the second chip include the audio and video transmitted by the first chip through the first port and the audio and video transmitted by the external set-top box through the second port. In order to prevent the audio and video sound from influencing the input voice of the user in the process of chatting, the user is required to reduce the audio and video sound in the process of chatting, but the technical scheme shown in the embodiment of the application needs to determine the audio and video source and reduce the audio and video sound in a targeted manner because the number of the video sources is two.
The following describes the method of the embodiment of the present application in detail with reference to specific application scenarios.
In the process that a user watches videos independently, the user selects a corresponding signal source according to the requirement, for example: if the user wants to watch the live television, the second chip signal source can be switched to the second port. At the moment, the second chip receives the audio and video transmitted by the set top box through the second port. For another example, if the user wants to watch the network television, the signal source can be switched to the first port, and at this time, the second chip receives the audio and video transmitted by the first chip through the first port.
And the second chip receives the audio and video in real time under the condition that the user normally watches the video. When the user enters a chat-while-watching scenario (i.e., the communication function is enabled while watching the video), the second chip detects the input audio corresponding to the instant messaging function and input by the microphone. In response to detecting the input audio, the second chip determines an input port of the current audio and video, and meanwhile, receives chat voice information (which may also be referred to as input audio) input by a user.
The triggering of the communication function may be in response to the activation of a wakeup word, for example, a user may speak a wakeup word before activating the chat function, and the communication function is activated. The second chip starts to receive the chat voice information input by the user or the chat voice information of the user sent by other terminals. The triggering of the communication function may also be in response to a communication function initiated by a user via a remote control. For example, if the user wants to chat, the user presses a corresponding key on the remote controller, and at this time, the remote controller sends a chat start signal to the second chip. The second chip starts to receive chat voice information input by a user or chat voice information sent by other terminals.
The second chip, in response to detecting the input audio, determines a port to which a signal source of audio data is connected. In the technical solution shown in the embodiment of the present application, the port connected to the audio/video signal source may be a first port connected to the first chip, or may be a second port connected to an external electromechanical box.
(1) And if the signal source is connected with the second port, the second chip reduces the volume of the received audio and video.
Specifically, the process of turning down the volume of the wired audio data is as follows: the second chip reduces the front-end gain value of the second port.
The gain value is the degree to which the current, voltage or power of an electrical component, circuit, device or system is increased. The gain is reduced, and the current, voltage or power of the electrical element, circuit, equipment or system is reduced. Specifically, when the second chip reduces the gain value of the front end of the second port, the decibel value of the audio and video output by the second chip is reduced correspondingly. At the moment, the user carries out video chat, and the sound of the audio and video does not generate certain interference to the user.
Optionally, the embodiment of the present application shows that the technical solution further includes a fault tolerance mechanism. For example, if the user has finished the chat state, the video sound of the tv is always in a low decibel state, which may cause a bad experience for the user. According to the technical scheme shown in the embodiment of the application, the second chip counts the time length of the chat voice message which is not received in real time, and if the time length is greater than the preset time threshold, the fact that the user finishes the chat state is proved. In this case, the second chip recovers the input amplitude of the signal source, i.e. the gain value of the front end of the second port is adjusted back to the default value.
The technical scheme shown in the embodiment of the application can judge whether the user finishes the chat state in a polling mode. Specifically, the second chip judges whether the second chip has chat voice information input every a first preset time interval, and if the results of two polling times are that the chat voice information is not received, the second chip proves that the user carelessly sends a chat starting signal or the user finishes the chat state. At this time, the second chip recovers the input amplitude of the signal source, i.e., the gain value at the front end of the second port is adjusted back to the default value.
(2) If the signal source is connected with a port connected with a first chip; the second chip sends feedback information to the first chip, and the feedback information is used for indicating the first chip to reduce the volume of the audio and video sent to the second chip.
The following is a detailed description of how the first chip adjusts the volume according to the volume-down feedback information after receiving the feedback information.
In a feasible embodiment, when a user watches a network television, the first chip receives APP transmission audio and video data in real time, wherein the audio and video data comprise original audio and video data, and the first chip generates processed audio data after performing audio mixing processing on the original audio data and sends the processed audio data to the second chip.
Specifically, the original audio data transmitted by the APP needs to be processed on the first chip in a series of processes to generate processed audio data, and the specific processing process refers to fig. 9. As shown in fig. 9, the network video App writes original audio data into the AudioMixer through the AudioTrack port to mix, and performs algorithm mixing according to the decibel value (Db) of the original audio data sent by the App and the track volume default gain value, the stream volume alias default gain value, and the master volume default gain value corresponding to the App in the AudioMixer mixing process. The adjusted processed audio data is sent to the hardware device for output through the AudioHAL layer.
According to the technical scheme, the decibel value of the processed audio data can be reduced before the processed audio data are transmitted to the second chip.
Specifically, if the first chip receives the volume-down feedback information sent by the second chip, the first chip directly adjusts the corresponding parameter according to the volume-down feedback information. Wherein turning down the volume feedback information comprises: one or more combinations of a track volume adjusting gain value, a stream volume alias adjusting gain value and a master volume adjusting gain value; the track volume adjusting gain value is smaller than the track volume default gain value, and the track volume adjusting gain value is smaller than the track volume default gain value; the stream volume adjustment gain value is smaller than the stream volume default gain value; the stream volume alias adjustment gain value is smaller than the stream volume alias default gain value; the master volume adjustment gain value is less than the master volume adjustment gain value.
In the technical scheme shown in the embodiment of the application, the first chip is an Android system, wherein each APP installed in the first chip supports 10 StreamType in the Android system, and each StreamType corresponds to one stream volume and each stream volume can adjust the volume value of the audio data corresponding to the stream volume by adjusting the gain value.
Alternatively, 10 streams may be divided into groups according to the relevance of the stream volumes. A specific grouping rule may be that streams belonging to the same group have the same alias (alias). Each set of streams corresponds to one volume adjustment slider having one stream volume alias. The adjustment of the volume value of the audio data corresponding to the stream volume alias can be realized by adjusting the gain value of the stream volume alias.
Each APP corresponds to one track volume, and the adjustment of the volume value of the audio data corresponding to the APP is realized through the adjustment of the gain value of the track volume.
In the embodiment of the present application, the setting of the master volume gain value is equivalent to the common setting of the stream volume gain value and the track volume gain value. The master volume gain value can be written into the sound card to control the volume of all sounds. It is also possible not to write into the sound card but to influence all the volumes as a multiplier factor.
The following provides a brief description of the process of turning down the volume of audio data by the first chip with reference to a specific example.
In a feasible embodiment, the second chip receives audio data in real time, when the second chip receives a chat start signal, the second chip receives chat voice information input by a user, the second chip determines that the signal source is connected to a port connected to the first chip, and the second chip sends a track volume adjustment gain value to the first chip.
The method comprises the steps that the first chip adjusts the gain value of the track volume of the APP to be the track volume adjustment gain value through the default gain value of the track volume, the network video App writes original audio data into an Audio mixer of the first chip through an Audio track port to perform sound mixing, the Audio mixer performs sound mixing according to the adjusted gain value in the sound mixing process, and the first chip finally sends the adjusted audio data to the second chip.
Since the track volume adjusted gain value is smaller than the track volume default gain value, the decibel value of the adjusted audio data is decreased in this case. At the moment, the user carries out video chat, and the sound of the network television does not generate certain interference to the user.
Optionally, in the technical solution shown in the embodiment of the present application, the second chip counts the time length of the chat voice message that is not received in real time, the second chip responds to that the time length of the chat sound is not detected to exceed the preset time threshold, at this time, it is proved that the chat is finished, the second chip sends a recovery instruction to the first chip, and the recovery instruction is used for instructing the first chip to increase the volume of the audio and video that is sent to the second chip to the volume before the audio and video is reduced. In this case, the second chip sends a resume instruction to the first chip, where the resume instruction in this embodiment is to adjust the gain value of the track volume of the APP to the default gain value of the track volume, and at this time, the volume of the network television returns to normal.
The technical scheme shown in the embodiment of the application can judge whether the chat state is finished or not in a polling mode. Specifically, the second chip does not interval the first preset time to judge whether the second chip has the chat voice information, if the results of two polling are that the chat voice information is not received, the chat state is proved to be ended, at this moment, the second chip sends a recovery instruction to the first chip, the gain value of the track volume of the APP, which is the recovery instruction, is adjusted to be the default gain value of the track volume, and at this moment, the volume of the network television is recovered to be normal.
In another feasible embodiment, the second chip receives audio data in real time, and when the second chip receives a chat start signal, the second chip receives chat voice information input by a user, and determines that the signal source is connected to the second port, and sends a stream volume adjustment gain value to the first chip.
The first chip adjusts the gain value of the stream volume of the APP to the gain value adjustment of the stream volume, the network video App writes original audio data into an Audio mixer of the first chip through an Audio track port to perform sound mixing, the Audio mixer performs sound mixing according to the adjusted gain value in the sound mixing process, and the first chip finally sends the adjusted audio data to the second chip.
Since the stream volume adjustment gain value is smaller than the stream volume default gain value, the decibel value of the processed audio data after adjustment is lowered in this case. At the moment, the user carries out video chat, and the sound of the network television does not generate certain interference to the user.
And the second chip continuously detects whether a chat starting signal is input or not in the process of watching while chatting, if the data is detected to be empty for a long time, the chat is judged to be finished, and the related gain value is recovered to be the default gain value. The specific recovery process is similar to that shown in the embodiments and will not be described in detail herein for reasons of space.
In another possible embodiment, the second chip receives audio data in real time, and when the second chip receives the chat start signal, the second chip receives the chat voice information input by the user, and the second chip determines that the signal source is connected to the second port, and sends a stream volume alias adjustment gain value to the first chip.
The first chip adjusts the gain value of the stream volume alias of the APP to the gain value adjustment of the stream volume alias, the network video App writes original audio data into an Audio Mixer of the first chip through an Audio track port to perform sound mixing, the Audio Mixer performs sound mixing according to the adjusted gain value in the sound mixing process, and the first chip finally sends the processed audio data to the second chip.
Since the stream volume alias adjustment gain value is smaller than the stream volume alias default gain value, the decibel value of the audio data after adjustment is lowered in this case. At the moment, the user carries out video chat, and the sound of the network television does not generate certain interference to the user.
And the second chip continuously detects whether data are input or not in the process of chatting while watching, if the data are detected to be null for a long time, the application is judged to be abnormal in the chatting process, and the related gain value is restored to the default gain value. The specific recovery process is similar to that shown in the embodiments and will not be described in detail herein for reasons of space.
In another feasible embodiment, the second chip receives audio data in real time, receives chat voice information input by a user when the second chip receives a chat start signal, determines that the signal source is connected with the second port, and sends a master volume adjustment gain value to the first chip.
The first chip adjusts the gain value of the master volume of the APP to be adjusted to the master volume adjustment gain value through the master volume default gain value, the network video App writes original audio data into an Audio mixer of the first chip through an Audio track port to perform sound mixing, the Audio mixer performs sound mixing according to the adjusted gain value in the sound mixing process, and the first chip finally sends the adjusted audio data to the second chip.
Since the master volume adjustment gain value is smaller than the master volume default gain value, the decibel value of the processed audio data after adjustment is decreased in this case. At the moment, the user carries out video chat, and the sound of the network television does not generate certain interference to the user.
And the second chip continuously detects whether data are input or not in the process of chatting while watching, if the data are detected to be null for a long time, the application is judged to be abnormal in the chatting process, and the related gain value is restored to the default gain value. The specific recovery process is similar to that shown in the embodiments and will not be described in detail herein for reasons of space.
According to the technical scheme, when the chat is started, signal input ends are distinguished, if the signal source is connected with the second port, the volume of wired audio data is reduced, and if the signal source is connected with the port connected with the first chip; sending the volume-down feedback information to the first chip. And the first chip reduces the volume of the processed audio data according to the volume reduction feedback information. According to the technical scheme, when a user chats, the problem that the sound of the video window chats conflicts with the sound of the television playing program in the current chatting situation is avoided by turning down the volume of the audio data.
A second aspect of the embodiment of the present application shows a display device, and specifically, referring to fig. 10, the system includes: the display screen is used for displaying the images of the audios and the videos;
the loudspeaker is used for outputting the sound of the audio and video;
a second chip connected to the speaker and the display screen, the second chip configured to detect a chat sound corresponding to a chat application.
The second chip responds to the chat sound detected and determines the input port of the current audio and video; in response to the input port being a port on the second chip connected to the first chip in the display device, the second chip sends feedback information to the first chip, where the feedback information is used to instruct the first chip to reduce the volume of the audio and video sent to the second chip; and in response to the input port being a second port on the second chip which is externally connected with the display device, the second chip reduces the volume of the received audio and video.
The input audio and the audio and video are played simultaneously, the chat sound comprises sound received through a sound input module of the display device and opposite-end sound which is output by the chat application and received from an opposite end and needs to be output from the loudspeaker, and the detection of the chat sound corresponding to the chat application comprises detection of the sound received by the sound input module and/or detection of the opposite-end sound output by the chat application to the loudspeaker.
The loudspeaker is also used for outputting the opposite end sound.
Wherein, the input port of audio frequency and video includes: the display device comprises a first port connected with the first chip and a second port connected with the exterior of the display device.
And the first port (also referred to as a first port in the application) is connected with the first chip and is used for receiving the sound of the audio and video transmitted by the third-party application installed on the first chip. In a feasible embodiment, the first port connected to the first chip may be a USB port (Universal Serial Bus) or an HDMI port (High Definition Multimedia Interface). And a first port connected with the first chip sends audio and video to the second chip. The HDMI port mainly sends audio and video signals received by a third-party application installed on the first chip to the second chip, and the audio and video signals comprise: network video signals, network audio data, image information collected by a camera, and the like. The signal received by the first chip is transmitted to the second chip through the HDMIO, and it should be noted that in the technical solution shown in the embodiment of the present application, the HDMI port is not open to the user. The USB is mainly used for transmitting the image data of the camera to the second chip through the USB.
And a second port (also referred to as a second port in this application) externally connected to the display device for receiving the audio-video data. In a feasible embodiment, the second chip is connected with the external set-top box through the second port. The second port may be an AV composite Video port, an S-terminal port (independent Video terminal port), a color difference port, a VGA port (Video Graphics Array, Video transmission standard port), a DVI port (Digital Video Interface), an HDMI port, or the like. And audio and video signals received by the external set top box are transmitted to the second chip through the second port.
In a feasible embodiment, the second chip further includes a communication port different from the port for inputting the audio/video, and the connection channel between the first chip and the second chip includes a first channel passing through the communication port; the sending, by the second chip, the feedback information to the first chip includes: and the second chip sends feedback information to the first chip through the first channel.
The communication interface may be a UART (Universal Asynchronous Receiver/Transmitter), the first channel is a communication channel formed by connecting UART interfaces, and the second chip sends feedback information to the first chip through the first channel.
The communication interface may also be connected via a network interface. And the second chip sends feedback information to the first chip through the network channel.
Optionally, the step of using the feedback information to instruct the first chip to reduce the volume of the audio and video sent to the second chip specifically includes: the feedback information is used for indicating the first chip to reduce a first gain parameter, and the first gain parameter participates in the gain parameter related to the volume of the audio and video. For the adjustment method of the specific gain parameter, reference may be made to the foregoing embodiments, which are not described herein again.
Optionally, in response to that the second chip does not receive the input audio when exceeding the preset time threshold, the second chip is further configured to send a recovery instruction to the first chip, where the recovery instruction is used to instruct the first chip to increase the volume of the audio and video sent to the second chip to the volume before reduction. For a specific sound reducing method, reference may be made to the foregoing embodiments, which are not described herein again.
According to the technical scheme, when the chat is started, the technical scheme distinguishes signal input ends, if the signal source is connected with the second port, the volume of wired audio data is reduced, and if the signal source is connected with the port connected with the first chip, the signal source is connected with the second port; sending the volume-down feedback information to the first chip. And the first chip reduces the volume of the processed audio data according to the volume reduction feedback information. According to the technical scheme, when a user chats, the problem that the sound of the video window chats conflicts with the sound of the television playing program in the current chatting situation is avoided by turning down the volume of the audio data.
It should be understood that the terms "first," "second," "third," and the like in the description and in the claims of the present application and in the drawings described above are used for distinguishing between similar elements and not necessarily for describing a particular sequential or chronological order. It is to be understood that the data so used are interchangeable under appropriate circumstances and can be implemented in sequences other than those illustrated or otherwise described herein with respect to the embodiments of the application, for example.
Furthermore, the terms "comprises" and "comprising," as well as any variations thereof, are intended to cover a non-exclusive inclusion, such that a product or device that comprises a list of elements is not necessarily limited to those elements explicitly listed, but may include other elements not expressly listed or inherent to such product or device.
Finally, it should be noted that: the above embodiments are only used for illustrating the technical solutions of the present application, and not for limiting the same; although the present application has been described in detail with reference to the foregoing embodiments, it should be understood by those of ordinary skill in the art that: the technical solutions described in the foregoing embodiments may still be modified, or some or all of the technical features may be equivalently replaced; and the modifications or the substitutions do not make the essence of the corresponding technical solutions depart from the scope of the technical solutions of the embodiments of the present application.

Claims (10)

1. A sound processing method is applied to a display device, and the display device comprises: a first chip, a second chip connected to the first chip, and a speaker connected to the second chip, wherein the method comprises:
the second chip detects a chat sound corresponding to the chat application;
the second chip determines the input port of the current audio and video in response to the detection of the chat sound;
in response to the input port being a first port connected with the first chip on the second chip, the second chip sends feedback information to the first chip, wherein the feedback information is used for instructing the first chip to reduce the volume of the audio and video sent to the second chip so as to reduce the output volume of the loudspeaker;
and in response to the input port being a second port on the second chip which is externally connected with the display device, the second chip reduces the volume of the received audio and video to reduce the output volume of the loudspeaker.
2. The method according to claim 1, wherein the second chip further comprises a communication port different from the port of the input audio/video, and the connection channel between the first chip and the second chip comprises a first channel passing through the communication port; the sending, by the second chip, the feedback information to the first chip includes:
and the second chip sends feedback information to the first chip through the first channel.
3. The method of claim 1, the chat sound comprising: the method comprises the steps that voice received through a voice recording module of the display device and opposite-end voice which is output by a chat application and is received from an opposite end and required to be output from a loudspeaker are output;
the detecting of the chat sound corresponding to the chat application comprises: and detecting the sound received by the sound recording module and/or detecting the opposite-end sound output to a loudspeaker by the chat application.
4. The method of claim 3, wherein the peer-to-peer sound and the audio-visual sound output by the chat application to a speaker are played simultaneously through the speaker.
5. The method according to any one of claims 1-4, wherein: the feedback information is used for indicating the first chip to reduce the volume of the audio and video sent to the second chip, and specifically comprises the following steps:
the feedback information is used for indicating the first chip to reduce a first gain parameter, and the first gain parameter is used for reducing the volume-related gain parameter of the audio and video sent to the second chip by the first chip.
6. The method of claim 1, wherein: further comprising:
and the second chip responds to the fact that the duration of the chat sound does not exceed a preset time threshold, and sends a recovery instruction to the first chip, wherein the recovery instruction is used for guiding the first chip to increase the volume of the audio and video sent to the second chip to the volume before reduction.
7. A display device, comprising:
the display screen is used for displaying the images of the audios and the videos;
the loudspeaker is used for outputting the sound of the audio and video;
a second chip connected to the speaker and the display screen, the second chip configured to detect a chat sound corresponding to a chat application;
the second chip responds to the chat sound detected and determines the input port of the current audio and video; in response to the input port being a port on the second chip connected to the first chip in the display device, the second chip sends feedback information to the first chip, where the feedback information is used to instruct the first chip to reduce the volume of the audio and video sent to the second chip; and in response to the input port being a second port on the second chip which is externally connected with the display device, the second chip reduces the volume of the received audio and video.
8. The display device according to claim 7, wherein the input audio and the audio and video play the chat sound simultaneously, and the chat sound comprises a sound received by a sound input module of the display device and an opposite-end sound output by a chat application and received from an opposite end and required to be output from a speaker, and the detection of the chat sound corresponding to the chat application comprises detection of the sound received by the sound input module and/or detection of the opposite-end sound output by the chat application to the speaker;
the loudspeaker is also used for outputting the opposite end sound.
9. The display device according to claim 7, wherein: the feedback information is used for indicating the first chip to reduce the volume of the audio and video sent to the second chip, and specifically comprises the following steps: the feedback information is used for indicating the first chip to reduce a first gain parameter, and the first gain parameter participates in the gain parameter related to the volume of the audio and video.
10. The display device according to claim 7, characterized in that:
and responding to the fact that the second chip does not receive the input audio when the second chip exceeds the preset time threshold, the second chip is also used for sending a recovery instruction to the first chip, and the recovery instruction is used for guiding the first chip to increase the volume of the audio and video sent to the second chip to the volume before reduction.
CN201911231348.0A 2019-06-10 2019-12-05 Sound processing method and display device Pending CN112073775A (en)

Applications Claiming Priority (2)

Application Number Priority Date Filing Date Title
CN2019104981979 2019-06-10
CN201910498197 2019-06-10

Publications (1)

Publication Number Publication Date
CN112073775A true CN112073775A (en) 2020-12-11

Family

ID=73658521

Family Applications (1)

Application Number Title Priority Date Filing Date
CN201911231348.0A Pending CN112073775A (en) 2019-06-10 2019-12-05 Sound processing method and display device

Country Status (1)

Country Link
CN (1) CN112073775A (en)

Citations (5)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN104780430A (en) * 2015-03-19 2015-07-15 四川长虹电器股份有限公司 Interactive method of complete machine system and extended system in multisystem television equipment
US20160094878A1 (en) * 2014-09-29 2016-03-31 Sony Corporation Device and method for generating metadata log for video data
CN105872834A (en) * 2015-12-15 2016-08-17 乐视致新电子科技(天津)有限公司 Method and device for volume automatic adjustment in presence of double pictures and intelligent device
CN106791099A (en) * 2016-12-21 2017-05-31 珠海市魅族科技有限公司 Volume adjustment device and method
CN107071152A (en) * 2017-03-20 2017-08-18 广州视源电子科技股份有限公司 Method and system based on verbal system state adjust automatically stereo set volume

Patent Citations (5)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
US20160094878A1 (en) * 2014-09-29 2016-03-31 Sony Corporation Device and method for generating metadata log for video data
CN104780430A (en) * 2015-03-19 2015-07-15 四川长虹电器股份有限公司 Interactive method of complete machine system and extended system in multisystem television equipment
CN105872834A (en) * 2015-12-15 2016-08-17 乐视致新电子科技(天津)有限公司 Method and device for volume automatic adjustment in presence of double pictures and intelligent device
CN106791099A (en) * 2016-12-21 2017-05-31 珠海市魅族科技有限公司 Volume adjustment device and method
CN107071152A (en) * 2017-03-20 2017-08-18 广州视源电子科技股份有限公司 Method and system based on verbal system state adjust automatically stereo set volume

Similar Documents

Publication Publication Date Title
CN113330736B (en) Display and image processing method
CN112073797B (en) Volume adjusting method and display device
CN112073662A (en) Display device
CN112399243A (en) Playing method and display device
CN112073778A (en) Display device and fault-tolerant method for key transmission
CN112073789B (en) Sound processing method and display device
CN112068741B (en) Display device and display method for Bluetooth switch state of display device
CN111385631B (en) Display device, communication method and storage medium
CN112995733B (en) Display device, device discovery method and storage medium
CN112533056B (en) Display device and sound reproduction method
CN112073666B (en) Power supply control method of display equipment and display equipment
CN112073776B (en) Voice control method and display device
CN112802440B (en) Display device and sound low-delay processing method
CN112463267B (en) Method for presenting screen saver information on display device screen and display device
CN112399245A (en) Playing method and display device
CN112073803A (en) Sound reproduction method and display equipment
CN113448529A (en) Display apparatus and volume adjustment method
CN112073808A (en) Color space switching method and display device
CN112073769A (en) Display device and method for applying common display
CN112073773A (en) Screen interaction method and device and display equipment
CN112073816A (en) Dual-system USB upgrading method and device and display equipment
CN112399223B (en) Method for improving moire fringe phenomenon and display device
CN112073779B (en) Display device and fault-tolerant method for key transmission
CN112073772B (en) Key seamless transmission method based on dual systems and display equipment
CN112995113B (en) Display device, port control method and storage medium

Legal Events

Date Code Title Description
PB01 Publication
PB01 Publication
SE01 Entry into force of request for substantive examination
SE01 Entry into force of request for substantive examination
RJ01 Rejection of invention patent application after publication
RJ01 Rejection of invention patent application after publication

Application publication date: 20201211