CN105744347A - Method of network media terminal for improving user audio-visual experience - Google Patents
Method of network media terminal for improving user audio-visual experience Download PDFInfo
- Publication number
- CN105744347A CN105744347A CN201610089275.6A CN201610089275A CN105744347A CN 105744347 A CN105744347 A CN 105744347A CN 201610089275 A CN201610089275 A CN 201610089275A CN 105744347 A CN105744347 A CN 105744347A
- Authority
- CN
- China
- Prior art keywords
- audio
- network media
- user
- media terminal
- stream
- Prior art date
- Legal status (The legal status is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the status listed.)
- Pending
Links
Classifications
-
- H—ELECTRICITY
- H04—ELECTRIC COMMUNICATION TECHNIQUE
- H04N—PICTORIAL COMMUNICATION, e.g. TELEVISION
- H04N21/00—Selective content distribution, e.g. interactive television or video on demand [VOD]
- H04N21/40—Client devices specifically adapted for the reception of or interaction with content, e.g. set-top-box [STB]; Operations thereof
- H04N21/43—Processing of content or additional data, e.g. demultiplexing additional data from a digital video stream; Elementary client operations, e.g. monitoring of home network or synchronising decoder's clock; Client middleware
- H04N21/439—Processing of audio elementary streams
- H04N21/4394—Processing of audio elementary streams involving operations for analysing the audio stream, e.g. detecting features or characteristics in audio streams
-
- H—ELECTRICITY
- H04—ELECTRIC COMMUNICATION TECHNIQUE
- H04N—PICTORIAL COMMUNICATION, e.g. TELEVISION
- H04N21/00—Selective content distribution, e.g. interactive television or video on demand [VOD]
- H04N21/40—Client devices specifically adapted for the reception of or interaction with content, e.g. set-top-box [STB]; Operations thereof
- H04N21/43—Processing of content or additional data, e.g. demultiplexing additional data from a digital video stream; Elementary client operations, e.g. monitoring of home network or synchronising decoder's clock; Client middleware
- H04N21/434—Disassembling of a multiplex stream, e.g. demultiplexing audio and video streams, extraction of additional data from a video stream; Remultiplexing of multiplex streams; Extraction or processing of SI; Disassembling of packetised elementary stream
- H04N21/4341—Demultiplexing of audio and video streams
-
- H—ELECTRICITY
- H04—ELECTRIC COMMUNICATION TECHNIQUE
- H04N—PICTORIAL COMMUNICATION, e.g. TELEVISION
- H04N21/00—Selective content distribution, e.g. interactive television or video on demand [VOD]
- H04N21/40—Client devices specifically adapted for the reception of or interaction with content, e.g. set-top-box [STB]; Operations thereof
- H04N21/43—Processing of content or additional data, e.g. demultiplexing additional data from a digital video stream; Elementary client operations, e.g. monitoring of home network or synchronising decoder's clock; Client middleware
- H04N21/439—Processing of audio elementary streams
- H04N21/4398—Processing of audio elementary streams involving reformatting operations of audio signals
-
- H—ELECTRICITY
- H04—ELECTRIC COMMUNICATION TECHNIQUE
- H04N—PICTORIAL COMMUNICATION, e.g. TELEVISION
- H04N21/00—Selective content distribution, e.g. interactive television or video on demand [VOD]
- H04N21/40—Client devices specifically adapted for the reception of or interaction with content, e.g. set-top-box [STB]; Operations thereof
- H04N21/43—Processing of content or additional data, e.g. demultiplexing additional data from a digital video stream; Elementary client operations, e.g. monitoring of home network or synchronising decoder's clock; Client middleware
- H04N21/44—Processing of video elementary streams, e.g. splicing a video clip retrieved from local storage with an incoming video stream, rendering scenes according to MPEG-4 scene graphs
- H04N21/4402—Processing of video elementary streams, e.g. splicing a video clip retrieved from local storage with an incoming video stream, rendering scenes according to MPEG-4 scene graphs involving reformatting operations of video signals for household redistribution, storage or real-time display
- H04N21/440218—Processing of video elementary streams, e.g. splicing a video clip retrieved from local storage with an incoming video stream, rendering scenes according to MPEG-4 scene graphs involving reformatting operations of video signals for household redistribution, storage or real-time display by transcoding between formats or standards, e.g. from MPEG-2 to MPEG-4
-
- H—ELECTRICITY
- H04—ELECTRIC COMMUNICATION TECHNIQUE
- H04N—PICTORIAL COMMUNICATION, e.g. TELEVISION
- H04N21/00—Selective content distribution, e.g. interactive television or video on demand [VOD]
- H04N21/60—Network structure or processes for video distribution between server and client or between remote clients; Control signalling between clients, server and network components; Transmission of management data between server and client, e.g. sending from server to client commands for recording incoming content stream; Communication details between server and client
- H04N21/63—Control signaling related to video distribution between client, server and network components; Network processes for video distribution between server and clients or between remote clients, e.g. transmitting basic layer and enhancement layers over different transmission paths, setting up a peer-to-peer communication via Internet between remote STB's; Communication protocols; Addressing
- H04N21/643—Communication protocols
Abstract
The invention discloses a method of a network media terminal for improving a user audio-visual experience. The method comprises the following steps of step 1, code stream editing of editing a program source code stream into audio tracks in multiple audio formats which are used for satisfying different audio format demands, different bandwidths and different peripheral equipment; step 2, front-end configuration of placing the edited audio-visual program stream containing multiple audios at a front-end live video server or a RTSP server; and step 3, terminal processing of directly obtaining the audio formats of the program stream for preferred selection by the terminal or obtaining the audio formats of the program stream based on an audio configuration file placed at the front end for preferred selection. According to the method of the network media terminal for improving the user audio-visual experience, a set-top box and the front end can cooperate with each other for preferred selection of audio resources, and a better audio-visual experience can be brought to a user.
Description
Technical field
The present invention relates to computer software and numeral TV technology, be specifically related to a kind of method that network media terminal improves user's audiovisual experience.
Background technology
Universal along with the development of network technology and home network, IPTV set top box application is more and more extensive, high bandwidth, and high definition, 4K video resource bring the audiovisual experience compared with the more flexible personalized customization of SD and conventional set-top box and more high-quality to user.User bandwidth, Set Top Box, TV are because family's difference is different, in this case, it is intended that media termination can bring different user more accurate best audiovisual experience, it is to avoid with minimizing without phenomenons such as output, broadcasting card pause.
Summary of the invention
Instant invention overcomes the deficiencies in the prior art, it is provided that a kind of network media terminal improves the method for user's audiovisual experience.
For solving above-mentioned technical problem, the present invention by the following technical solutions:
A kind of network media terminal improves the method for user's audiovisual experience, and described method comprises the following steps:
Step one, code stream editor
Program source code stream is compiled as the track of multiple audio format, for meeting different audio format demand, different bandwidth, different ancillary equipment use;
Step 2, front-end configuration
The audio/video program comprising Multi-audio-frequency edited is banished to head end video direct broadcast server or RTSP server;
Step 3, terminal processes
Terminal directly obtains the audio format of program stream and carries out preferably;Or the audio format of the audio configuration file acquisition program stream according to front end placement carries out preferably.
Further technical scheme is that step one also includes code stream is fabricated to multiple sound accompaniment format step.
Further technical scheme is that step 2 includes: front end assigned catalogue places a configuration file, the audio-frequency information of online resource is recorded and safeguards;Terminal reads the audio-frequency information that the code stream currently play in this document is corresponding.
Further technical scheme is that step 3 sound intermediate frequency form preferably includes: when chip is supported Doby decoding and has mandate, if comprising Doby track in decoded stream, then preferentially broadcast Doby track.
Further technical scheme is that step 3 also includes, according to currently playing smooth degree, network condition, switching the accompanying audio soundtrack of different code check for user and pointing out user steps.
Further technical scheme is that step 3 also includes: when Set Top Box does not support the audio coding formats of decoding, the transparent transmission option in being arranged by Set Top Box, and HDMI is transparent to TV decoding, and SPDIF is transparent to power amplifying device decoding step.
Compared with prior art, the invention has the beneficial effects as follows: the present invention can allow end in front of Set Top Box coordinate, it is preferable that audio resource, brings better audiovisual experience for user.
Accompanying drawing explanation
Fig. 1 is code stream editor in one embodiment of the invention, code stream building form flow chart.
Fig. 2 is front-end configuration flow chart in one embodiment of the invention.
Fig. 3 is that in one embodiment of the invention, STB terminal processes workflow diagram.
Detailed description of the invention
All features disclosed in this specification, or the step in disclosed all methods or process, except mutually exclusive feature and/or step, all can combine by any way.
This specification (include any accessory claim, summary and accompanying drawing) disclosed in any feature, unless specifically stated otherwise, all can by other equivalences or there is the alternative features of similar purpose replaced.That is, unless specifically stated otherwise, each feature is an example in a series of equivalence or similar characteristics.
Below in conjunction with drawings and Examples, the specific embodiment of the present invention is described in detail.
As shown in Figures 1 to 3, according to one embodiment of present invention, the present embodiment discloses a kind of method that network media terminal improves user's audiovisual experience, and program source code stream is compiled as multiple track by the method, is available for different demand, different bandwidth, distinct device user use.Place a configuration file in front end, its content is the necessary information of program and program, and program comprise track quantity, encoding format information etc..STB terminal reads the audio-frequency information of actual program stream, audio-frequency informations as multiple in actual program band from front end configuration file, selects best audio track to decode: if any Dolby Audio, it is possible to preferential with Doby.Can arrange interface at STB terminal provides Multi-audio-frequency user preferably to arrange, user can select according to network, equipment situation, this selection can be combined with transparent transmission option, and the codec format that Set Top Box is not supported can be transparent to the equipment such as television set, the audio amplifier that can support and decode.
Concrete, the method is by code stream editor, front-end configuration, terminal processes, coming for user's preferably best track.Comprise the steps of
Step one, code stream editor: an audio and video resources configures the track of multiple audio formats, and code stream is fabricated to multiple sound accompaniment form, including the form such as Doby form, AAC, DTS, PCM.Different audio formats meets different user requirement, compatible different bandwidth (different audio format compression ratios are different with data volume), the chip (Doby, DTS are likely to not support) of different decoding capabilities, different ancillary equipment (pass through TV, audio amplifier decoding such as by HDMI, SPDIF), different audio contents (multilingual sound accompaniment or broadcast).The user that bandwidth is high can select the form that compression ratio low bit-rate is big, can decoding by prioritizing selection Doby of Doby decoding supported by subscriber equipment, IPTV set top box does not support that Doby decodes and television set or digital sound box support, then optional HDMI transparent transmission TV or the decoding of SPDIF transparent transmission audio amplifier.User selects different sound accompanying languages or broadcast listening program according to hobby.
Code stream transcoding or editor are put in the video server of IPTV system after completing as live, order video program source.
The existing general video of Internet resources is a corresponding audio frequency only, uses the also fewer of Dolby Audio, increases the multi-audio of program stream and will bring to user and more rich select freely.For the high-definition program stream comprising Multi-audio-frequency, its TS code stream can comprise a video H.264 compressed, one Doby format audio (primary sound such as English, Guangdong language, Korean), one AAC format audio (Chinese translation is dubbed), one AAC format audio (primary sound such as English, Guangdong language, Korean), naturally it is also possible to comprise other or more Multi-audio-frequency or broadcast.TS stream comprises audio frequency quantity information, different audio frequency are corresponding different Audio PID.Can be the multi-audio in IPTV programme televised live source by the multi-audio corresponding conversion of wideband direct broadcast satellite TV, current CCTV direct broadcasting satellite high definition channel just contains Doby at two interior audio frequency, after high definition DVB receives the program containing Multi-audio-frequency, after H.264 transcoder transcoding, pass to Living streaming server and broadcast.Local high-definition program source is live then adds Multi-audio-frequency resource with request program source when program source editor, after being encoded by H.264 high definition encoder server, passes to Living streaming server or RTSP server.TS stream containing Multi-audio-frequency presses the stream media protocol transmission such as RTSP, TS.Preferential for Doby, operator provide situation according to local IPTV set top box, whether main flow IPTV supports that Doby decodes, and determines code stream whether band Dolby Audio, and requires that terminal realizes Doby priority function.Carrier Requirements is pressed by terminal producer, supports that the chip of Doby realizes Doby and preferentially decodes.Audio decoder transparent transmission option gives tacit consent to not transparent transmission, user oneself decide whether transparent transmission and transparent transmission mode.
Step 2, front-end configuration: the audio/video program comprising Multi-audio-frequency edited is banished to head end video direct broadcast server or RTSP server.RTSP server is as video request program streaming media server, by RTSP protocol realization point multicast function between RTSP server and IPTV set top box.The method how effectively transmitting integrating multimedia data by IP network under RTSP protocol definition one-to-many communication applications.RTSP agreement is to transmit content in unicast stream mode, and this is an other agreement of application-level, is create specially for controlling the transmission of real time data (such as Voice & Video content).This agreement is to realize on the host-host protocol basis of error correction, and support stops, suspending, refunds and F.F. index.Front end assigned catalogue (such as http: // 116.210.255.160/config/AudioConfig.xml) places a configuration file, the audio-frequency information of online resource is recorded and safeguards.Terminal reads the audio-frequency information that the code stream currently play in this document is corresponding: audio frequency quantity, PID, form etc..
The use of audio configuration file: configuration file comprises all audio frequency relevant information, it is possible to be XML or alternative document form.Configuration file can be pushed to Set Top Box assigned catalogue by front end and preserve, or Set Top Box goes to appointment address, front end to take file to be stored to this locality.Set Top Box checks acquisition program audio information and preferred to local profile before playing program.Obtain audio-frequency information from the Audio PID of code stream when without configuration file and preferably also may be used.
Step 3, terminal processes: terminal directly obtains the audio format of program stream and carries out preferably, and this kind of situation front end can be configured without file;Or the audio format of the audio configuration file acquisition program stream according to front end placement carries out preferably.Audio frequency preferred version: when chip is supported Doby decoding and has mandate, if comprising Doby track in decoded stream, then preferentially broadcast Doby track, it is possible to be called " Doby is preferential ".One tier 2 cities, 4K Set Top Box, support Doby decoding ancillary equipment, support that the fiber entering household that high code check transmits is relatively broad, reaching the standard grade the program containing Doby code stream enable " Doby is preferential " scheme at Set Top Box in front end, coordinates 4K video that average family network media terminal can be allowed to have fever levels effect.Arranging interface at STB terminal provides many tracks user preferably to arrange, user can select according to the situation such as bandwidth, equipment, this selection can be combined with transparent transmission option, and the codec format that Set Top Box is not supported can be transparent to the equipment such as television set, the audio amplifier that can support and decode.
IPTV set top box is according to currently playing smooth degree, network condition, switch the accompanying audio soundtrack of different code check for user and point out user, certainly, need to support scope switching at subscriber equipment, for suspending download state, user then considers that audio is preferential, also may be configured as non-dynamic switching in arranging, and individual subscriber hobby is fixed and set, pursue the fluency of network or high-quality sound accompaniment effect.
Set Top Box does not support the audio coding formats of decoding, it is possible to the transparent transmission option in being arranged by Set Top Box, and HDMI is transparent to TV and decodes (decoding that this audio coding formats supported by TV), and SPDIF is transparent to power amplifying device decoding.Decoding function after transparent transmission, volume adjusting, the sound control such as quiet are controlled by peripheral decoding device.
" embodiment ", " another embodiment ", " embodiment " spoken of in this manual etc., refer to the specific features, structure or the feature that describe in conjunction with this embodiment and include at least one embodiment that the application generality describes.Multiple local appearance statement of the same race is not necessarily refer to same embodiment in the description.Furthermore, it is understood that when describing a specific features, structure or feature in conjunction with any one embodiment, what advocate is also fall within the scope of the present invention to realize this feature, structure or feature in conjunction with other embodiments.
Although reference be made herein to invention has been described for the multiple explanatory embodiment invented, but, it should be understood that those skilled in the art can be designed that a lot of other amendments and embodiment, these amendments and embodiment will drop within spirit disclosed in the present application and spirit.More specifically, in disclosure scope of the claims, it is possible to building block and/or layout to theme composite configuration carry out multiple modification and improvement.Except the modification that building block and/or layout are carried out and improvement, to those skilled in the art, other purposes also will be apparent from.
Claims (6)
1. the method that a network media terminal improves user's audiovisual experience, it is characterised in that described method comprises the following steps:
Step one, code stream editor
Program source code stream is compiled as the track of multiple audio format, for meeting different audio format demand, different bandwidth, different ancillary equipment use;
Step 2, front-end configuration
The audio/video program comprising Multi-audio-frequency edited is banished to head end video direct broadcast server or RTSP server;
Step 3, terminal processes
Terminal directly obtains the audio format of program stream and carries out preferably;Or the audio format of the audio configuration file acquisition program stream according to front end placement carries out preferably.
2. the method that network media terminal according to claim 1 improves user's audiovisual experience, it is characterised in that described step one also includes being fabricated to code stream multiple sound accompaniment format step.
3. the method that network media terminal according to claim 1 improves user's audiovisual experience, it is characterised in that described step 2 includes: front end assigned catalogue places a configuration file, the audio-frequency information of online resource is recorded and safeguards;Terminal reads the audio-frequency information that the code stream currently play in this document is corresponding.
4. the method that network media terminal according to claim 1 improves user's audiovisual experience, it is characterized in that described step 3 sound intermediate frequency form preferably includes: when chip is supported Doby decoding and has mandate, if decoded stream comprises Doby track, then preferentially broadcast Doby track.
5. the method that network media terminal according to claim 2 improves user's audiovisual experience, it is characterised in that described step 3 also includes, according to currently playing smooth degree, network condition, switching the accompanying audio soundtrack of different code check for user and pointing out user steps.
6. the method that network media terminal according to claim 1 improves user's audiovisual experience, it is characterized in that described step 3 also includes: when Set Top Box does not support the audio coding formats of decoding, transparent transmission option in being arranged by Set Top Box, HDMI is transparent to TV decoding, and SPDIF is transparent to power amplifying device decoding step.
Priority Applications (1)
Application Number | Priority Date | Filing Date | Title |
---|---|---|---|
CN201610089275.6A CN105744347A (en) | 2016-02-17 | 2016-02-17 | Method of network media terminal for improving user audio-visual experience |
Applications Claiming Priority (1)
Application Number | Priority Date | Filing Date | Title |
---|---|---|---|
CN201610089275.6A CN105744347A (en) | 2016-02-17 | 2016-02-17 | Method of network media terminal for improving user audio-visual experience |
Publications (1)
Publication Number | Publication Date |
---|---|
CN105744347A true CN105744347A (en) | 2016-07-06 |
Family
ID=56246060
Family Applications (1)
Application Number | Title | Priority Date | Filing Date |
---|---|---|---|
CN201610089275.6A Pending CN105744347A (en) | 2016-02-17 | 2016-02-17 | Method of network media terminal for improving user audio-visual experience |
Country Status (1)
Country | Link |
---|---|
CN (1) | CN105744347A (en) |
Cited By (4)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
CN106331846A (en) * | 2016-08-17 | 2017-01-11 | 青岛海信电器股份有限公司 | Audio transmission method and apparatus |
CN112189344A (en) * | 2018-05-29 | 2021-01-05 | 华为技术有限公司 | Method and device for selecting audio track from audio/video file |
CN112735445A (en) * | 2020-12-25 | 2021-04-30 | 广州朗国电子科技有限公司 | Method, apparatus and storage medium for adaptively selecting audio track |
CN113542883A (en) * | 2021-07-12 | 2021-10-22 | 广州浩传网络科技有限公司 | Intelligent pushing method, device and equipment for playing content of audio-video equipment |
Citations (3)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
WO2012019272A1 (en) * | 2010-08-13 | 2012-02-16 | Simon Fraser University | System and method for multiplexing of variable bit-rate video streams in mobile video systems |
CN103093776A (en) * | 2011-11-04 | 2013-05-08 | 腾讯科技(深圳)有限公司 | Method and system of multi-audio-track content play in network seeing and hearing |
CN104754366A (en) * | 2015-03-03 | 2015-07-01 | 腾讯科技(深圳)有限公司 | Audio and video file live broadcasting method, device and system |
-
2016
- 2016-02-17 CN CN201610089275.6A patent/CN105744347A/en active Pending
Patent Citations (3)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
WO2012019272A1 (en) * | 2010-08-13 | 2012-02-16 | Simon Fraser University | System and method for multiplexing of variable bit-rate video streams in mobile video systems |
CN103093776A (en) * | 2011-11-04 | 2013-05-08 | 腾讯科技(深圳)有限公司 | Method and system of multi-audio-track content play in network seeing and hearing |
CN104754366A (en) * | 2015-03-03 | 2015-07-01 | 腾讯科技(深圳)有限公司 | Audio and video file live broadcasting method, device and system |
Cited By (5)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
CN106331846A (en) * | 2016-08-17 | 2017-01-11 | 青岛海信电器股份有限公司 | Audio transmission method and apparatus |
CN106331846B (en) * | 2016-08-17 | 2019-04-12 | 青岛海信电器股份有限公司 | The method and device of audio transparent transmission |
CN112189344A (en) * | 2018-05-29 | 2021-01-05 | 华为技术有限公司 | Method and device for selecting audio track from audio/video file |
CN112735445A (en) * | 2020-12-25 | 2021-04-30 | 广州朗国电子科技有限公司 | Method, apparatus and storage medium for adaptively selecting audio track |
CN113542883A (en) * | 2021-07-12 | 2021-10-22 | 广州浩传网络科技有限公司 | Intelligent pushing method, device and equipment for playing content of audio-video equipment |
Similar Documents
Publication | Publication Date | Title |
---|---|---|
US20200177931A1 (en) | Receiving device, transmitting device, and data processing method | |
US10674229B2 (en) | Enabling personalized audio in adaptive streaming | |
JP2007018496A (en) | Content integration platform | |
JP2007020144A (en) | Content integration method with format and protocol conversion | |
KR100728256B1 (en) | Homenetwork/Broadcast Linkage System and Method for using Multimedia Contents between Home Network and Broadcast | |
CN105744347A (en) | Method of network media terminal for improving user audio-visual experience | |
US9942620B2 (en) | Device and method for remotely controlling the rendering of multimedia content | |
WO2007128194A1 (en) | Method, apparatus and system for playing audio/video data | |
JP7100052B2 (en) | Electronic device and its control method | |
CN103597840A (en) | Systems and methods for processing timed text in video programming | |
CN103067747A (en) | Interactive digital TV display mode | |
US20140112636A1 (en) | Video Playback System and Related Method of Sharing Video from a Source Device on a Wireless Display | |
CN113301359A (en) | Audio and video processing method and device and electronic equipment | |
KR102630037B1 (en) | Information processing device, information processing method, transmission device, and transmission method | |
KR102640835B1 (en) | Transmitting devices, receiving devices, and data processing methods | |
CN104284227A (en) | Mobile phone remote control television system | |
CN113923510B (en) | Method, device, equipment and readable storage medium for forwarding digital television content | |
Bleidt et al. | Building the world’s most complex TV network: a test bed for broadcasting immersive and interactive audio | |
CN103281585A (en) | Set top box (STB) device of Internet protocol television (IPTV) | |
JP6425423B2 (en) | Recording and reproducing apparatus and recording and reproducing system | |
KR102628917B1 (en) | Transmitting devices, receiving devices, and data processing methods | |
CN107005745B (en) | Method and apparatus for encapsulating a stream of audiovisual content | |
Grewe et al. | MPEG-H Audio System for SBTVD TV 3.0 Call for Proposals | |
CN103796049A (en) | Dual core-based smart media player system design method | |
KR101435834B1 (en) | IPTV receiver, method for reproducing contents in the IPTV receiver and recording contents in IPTV environment |
Legal Events
Date | Code | Title | Description |
---|---|---|---|
C06 | Publication | ||
PB01 | Publication | ||
C10 | Entry into substantive examination | ||
SE01 | Entry into force of request for substantive examination | ||
RJ01 | Rejection of invention patent application after publication |
Application publication date: 20160706 |
|
RJ01 | Rejection of invention patent application after publication |