WO2007080500A1 - Extensions to rich media container format for use by mobile broadcast/multicast streaming servers - Google Patents

Extensions to rich media container format for use by mobile broadcast/multicast streaming servers Download PDF

Info

Publication number
WO2007080500A1
WO2007080500A1 PCT/IB2007/000073 IB2007000073W WO2007080500A1 WO 2007080500 A1 WO2007080500 A1 WO 2007080500A1 IB 2007000073 W IB2007000073 W IB 2007000073W WO 2007080500 A1 WO2007080500 A1 WO 2007080500A1
Authority
WO
WIPO (PCT)
Prior art keywords
alc
media file
iso base
information
flute
Prior art date
Application number
PCT/IB2007/000073
Other languages
French (fr)
Inventor
Ramakrishna Vedantham
Vidya Setlur
Original Assignee
Nokia Corporation
Priority date (The priority date is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the date listed.)
Filing date
Publication date
Application filed by Nokia Corporation filed Critical Nokia Corporation
Priority to EP07705423.7A priority Critical patent/EP1974526B1/en
Priority to ES07705423.7T priority patent/ES2526814T3/en
Publication of WO2007080500A1 publication Critical patent/WO2007080500A1/en

Links

Classifications

    • HELECTRICITY
    • H04ELECTRIC COMMUNICATION TECHNIQUE
    • H04WWIRELESS COMMUNICATION NETWORKS
    • H04W4/00Services specially adapted for wireless communication networks; Facilities therefor
    • H04W4/12Messaging; Mailboxes; Announcements
    • HELECTRICITY
    • H04ELECTRIC COMMUNICATION TECHNIQUE
    • H04LTRANSMISSION OF DIGITAL INFORMATION, e.g. TELEGRAPHIC COMMUNICATION
    • H04L12/00Data switching networks
    • H04L12/02Details
    • H04L12/16Arrangements for providing special services to substations
    • H04L12/18Arrangements for providing special services to substations for broadcast or conference, e.g. multicast
    • H04L12/1836Arrangements for providing special services to substations for broadcast or conference, e.g. multicast with heterogeneous network architecture
    • HELECTRICITY
    • H04ELECTRIC COMMUNICATION TECHNIQUE
    • H04LTRANSMISSION OF DIGITAL INFORMATION, e.g. TELEGRAPHIC COMMUNICATION
    • H04L65/00Network arrangements, protocols or services for supporting real-time applications in data packet communication
    • H04L65/1066Session management
    • H04L65/1101Session protocols
    • HELECTRICITY
    • H04ELECTRIC COMMUNICATION TECHNIQUE
    • H04LTRANSMISSION OF DIGITAL INFORMATION, e.g. TELEGRAPHIC COMMUNICATION
    • H04L65/00Network arrangements, protocols or services for supporting real-time applications in data packet communication
    • H04L65/1066Session management
    • H04L65/1101Session protocols
    • H04L65/1104Session initiation protocol [SIP]
    • HELECTRICITY
    • H04ELECTRIC COMMUNICATION TECHNIQUE
    • H04LTRANSMISSION OF DIGITAL INFORMATION, e.g. TELEGRAPHIC COMMUNICATION
    • H04L65/00Network arrangements, protocols or services for supporting real-time applications in data packet communication
    • H04L65/60Network streaming of media packets
    • H04L65/61Network streaming of media packets for supporting one-way streaming services, e.g. Internet radio
    • H04L65/611Network streaming of media packets for supporting one-way streaming services, e.g. Internet radio for multicast or broadcast
    • HELECTRICITY
    • H04ELECTRIC COMMUNICATION TECHNIQUE
    • H04LTRANSMISSION OF DIGITAL INFORMATION, e.g. TELEGRAPHIC COMMUNICATION
    • H04L65/00Network arrangements, protocols or services for supporting real-time applications in data packet communication
    • H04L65/60Network streaming of media packets
    • H04L65/65Network streaming protocols, e.g. real-time transport protocol [RTP] or real-time control protocol [RTCP]
    • HELECTRICITY
    • H04ELECTRIC COMMUNICATION TECHNIQUE
    • H04NPICTORIAL COMMUNICATION, e.g. TELEVISION
    • H04N21/00Selective content distribution, e.g. interactive television or video on demand [VOD]
    • H04N21/20Servers specifically adapted for the distribution of content, e.g. VOD servers; Operations thereof
    • H04N21/25Management operations performed by the server for facilitating the content distribution or administrating data related to end-users or client devices, e.g. end-user or client device authentication, learning user preferences for recommending movies
    • H04N21/262Content or additional data distribution scheduling, e.g. sending additional data at off-peak times, updating software modules, calculating the carousel transmission frequency, delaying a video stream transmission, generating play-lists
    • H04N21/26283Content or additional data distribution scheduling, e.g. sending additional data at off-peak times, updating software modules, calculating the carousel transmission frequency, delaying a video stream transmission, generating play-lists for associating distribution time parameters to content, e.g. to generate electronic program guide data
    • HELECTRICITY
    • H04ELECTRIC COMMUNICATION TECHNIQUE
    • H04NPICTORIAL COMMUNICATION, e.g. TELEVISION
    • H04N21/00Selective content distribution, e.g. interactive television or video on demand [VOD]
    • H04N21/60Network structure or processes for video distribution between server and client or between remote clients; Control signalling between clients, server and network components; Transmission of management data between server and client, e.g. sending from server to client commands for recording incoming content stream; Communication details between server and client 
    • H04N21/61Network physical structure; Signal processing
    • H04N21/6106Network physical structure; Signal processing specially adapted to the downstream path of the transmission network
    • H04N21/6131Network physical structure; Signal processing specially adapted to the downstream path of the transmission network involving transmission via a mobile phone network
    • HELECTRICITY
    • H04ELECTRIC COMMUNICATION TECHNIQUE
    • H04NPICTORIAL COMMUNICATION, e.g. TELEVISION
    • H04N21/00Selective content distribution, e.g. interactive television or video on demand [VOD]
    • H04N21/80Generation or processing of content or additional data by content creator independently of the distribution process; Content per se
    • H04N21/83Generation or processing of protective or descriptive data associated with content; Content structuring
    • H04N21/84Generation or processing of descriptive data, e.g. content descriptors
    • HELECTRICITY
    • H04ELECTRIC COMMUNICATION TECHNIQUE
    • H04NPICTORIAL COMMUNICATION, e.g. TELEVISION
    • H04N21/00Selective content distribution, e.g. interactive television or video on demand [VOD]
    • H04N21/80Generation or processing of content or additional data by content creator independently of the distribution process; Content per se
    • H04N21/85Assembly of content; Generation of multimedia applications
    • H04N21/854Content authoring
    • H04N21/85406Content authoring involving a specific file format, e.g. MP4 format
    • HELECTRICITY
    • H04ELECTRIC COMMUNICATION TECHNIQUE
    • H04NPICTORIAL COMMUNICATION, e.g. TELEVISION
    • H04N7/00Television systems
    • H04N7/24Systems for the transmission of television signals using pulse code modulation
    • HELECTRICITY
    • H04ELECTRIC COMMUNICATION TECHNIQUE
    • H04WWIRELESS COMMUNICATION NETWORKS
    • H04W28/00Network traffic management; Network resource management
    • H04W28/02Traffic management, e.g. flow control or congestion control
    • H04W28/10Flow control between communication endpoints

Definitions

  • the present invention relates generally to the extension of the ISO Base Media File Format to include Asynchronous Layered Coding (ALC) as a broadcast/multicast protocol while streaming rich media content. More particularly, the present invention relates to the inclusion of file session description protocol (SDP), metadata and hint tracks for broadcast/multicast downloading of rich media content using ALC.
  • ALC Asynchronous Layered Coding
  • Rich media content is generally referred to content that is graphically rich and contains compound (or multiple) media, including graphics, text, video and audio, and is preferably delivered through a single interface.
  • the streaming of rich media content is becoming increasingly important for delivering visually rich content for real-time content, especially within the MBMS and Packet Switched Streaming (PSS) service architecture.
  • Multimedia Broadcast Multicast Service (MBMS) streaming services facilitate resource-efficient delivery of popular real-time content to multiple receivers in a 3 G mobile environment.
  • the content may be pre-recorded or generated from a live feed.
  • Scalable Vector Graphics Mobile 1.2 is a language for describing two-dimensional graphics in XML.
  • Scalable Vector Graphics allow for three types of graphic objects: vector graphic shapes (e.g., paths consisting of straight lines and curves), multimedia (such as raster images, video, etc.) and text.
  • SVG drawings can be interactive (using the document object model (DOM) event model) and dynamic. Animations can be defined and triggered either declaratively (i.e., by embedding SVG animation elements in SVG content) or via scripting.
  • Sophisticated applications of SVG are possible by the use of a supplemental scripting language which accesses the SVG Micro Document Object Model (uDOM), which provides complete access to all elements, attributes and properties.
  • uDOM SVG Micro Document Object Model
  • a rich set of event handlers can be assigned to any SVG graphical object.
  • features like scripting can be performed on XHTML and SVG elements simultaneously within the same Web page.
  • the Synchronized Multimedia Integration Language (SMIL) version 2.0 enables the simple authoring of interactive audiovisual presentations. SMIL is typically used for "rich media'Vmultimedia presentations which integrate streaming audio and video with images, text or any other media type.
  • RTP Real-Time Transport Protocol
  • RTP is used for unicast streaming (e.g., 3GPP PSS, 3GPP2 MSS (Multimedia Streaming Services)), broadcast/multicast streaming (e.g., 3GPP multimedia broadcast/multicast service (MBMS), 3GPP2 BCMCS (Broadcast Multicast Services)) and rich media conferencing applications.
  • Asynchronous layered coding is a massively scalable and reliable content delivery protocol.
  • ALC is the base protocol for the reliable multicast delivery of arbitrary binary objects.
  • ALC is adopted as the mandatory protocol for broadcast/multicast file delivery in 3GPP2 BCMCS and OMA BAC BCAST.
  • the file metadata (that is carried as part of a File Delivery Table (FDT) in file delivery over unidirectional transport (FLUTE)) is now delivered to the clients as part of the OMA BCAST electronic service guide (ESG).
  • This metadata is divided into various ESG fragments according to a service guide data model. The fragments are identified as: Service, Schedule, Content, Access, Session Description, Purchase Item, Purchase Data, Purchase Channel, Service Guide Context, Service Guide Delivery Descriptor and Preview Data.
  • the OMA BCAST ESG data model is illustrated in prior art Figure 3.
  • the ESG is normally delivered to the clients well in advance of the ALC session. Therefore, clients have the file metadata before the ALC session starts.
  • FLUTE builds on top of ALC and defines FDT that stores the metadata associated with the files being delivered in an ALC session. FLUTE also provides mechanisms for in-band delivery and updates of FDT. FLUTE is adopted by 3GPP MBMS and DVB-H IPDC as the mandatory protocol for broadcast/multicast file delivery.
  • SVG is designed to describe resolution independent 2D vector graphics, allowing for interactivity using the event model and animation concepts borrowed from SMIL. SVG also allows for infinite zoomability and enhances the power of user interfaces on mobile devices. In addition, SVG supports embedding of media elements similar to SMIL media elements.
  • the embedded media can be divided into two parts - discrete media (e.g. images) and continuous media (e.g. audio, video). Continuous media elements define their own timelines within their time container. SVG is therefore, gaining importance and becoming one of the core elements to drive a multimedia presentation, especially for rich media services such as Mobile TV, live updates of traffic information, weather, news, etc. SVG is XML-based, allowing for more transparent integration with other existing web technologies.
  • RTP can be used to deliver continuous media such as audio, video and SVG scenes/updates.
  • SVG presentations also include discrete media, and it currently makes more sense to use file download protocols rather than RTP for the delivery of discrete media.
  • ALC and FLUTE are currently the preferred transport layer protocols for file delivery over broadcast/multicast networks.
  • a broadcast/multicast streaming server should be able to generate rich media packets (RTP and ALC, or RTP and FLUTE) by reading the contents from a rich media container file.
  • a container file may include (1) media tracks for continuous media, i.e., SVG tracks, audio tracks, video tracks, raster images, etc.; (2) hint tracks that hold synchronization information; and (3) internally embedded discrete media.
  • a broadcast/multicast streaming server creates RTP packets to carry continuous media by using the media tracks and hint tracks.
  • the server also needs to create ALC or FLUTE packets to carry internally embedded discrete media.
  • the server may also decide to acquire some or all of the externally referenced discrete media, and send them to the clients using ALC or FLUTE. After reception, these images (a) could be rendered when the corresponding SVG content is being played or (b) can be locally stored/cached and rendered by user interaction.
  • the rich media application can make sure that the images are rendered only if the user buys them (i.e., the digital rights management (DRM) rights).
  • DRM digital rights management
  • the externally referenced discrete media files are broadcast to all users whether they subscribed to their access or not.
  • the broadcast download of these files wastes radio and memory resources for users that do not subscribe to these files.
  • the broadcast download of these files reduces the usage of radio resources and also enhances their user experience.
  • the server needs the metadata associated with the discrete media (image) files. This file metadata also needs to be included in the rich media container file.
  • the invention solves the problem of a lack of a mechanism for storing this metadata in the rich media container files.
  • the present invention includes an extension to the ISO Base Media File Format to support ALC as a broadcast protocol, as well as the extension of the ESG to include metadata specific to the transport of SVG over mobile broadcast/multicast networks.
  • a "BMFDP hint track" is introduced in the container file format, with the required file metadata being in these hint tracks.
  • Figure 1 is a representation showing how ALC-specific information can be included in the ISO Base Media File Format to realize a rich media broadcast/multicast streaming/download service using RTP and ALC protocols;
  • Figure 2 is a representation of the protocol stack for rich media streaming and file downloading over mobile broadcast/multicast networks
  • Figure 3 is a representation of a conventional data model of the OMA
  • Figure 4 is a perspective view of an electronic device that can be used in the implementation of the present invention.
  • Figure 5 is a schematic representation of the telephone circuitry of the mobile telephone of Figure 4.
  • the present invention includes an extension to the ISO Base Media File Format to support ALC as a broadcast protocol, as well as the extension of the ESG to include metadata specific to the transport of SVG over mobile broadcast/multicast networks.
  • a "BMFDP hint track" is introduced in the container file format, with the required file metadata being in these hint tracks.
  • the implementation of the present invention involves the extension of the ISO Base Media File Format to support ALC protocol for transmission. This involves (1) adding session description information for ALC in the ISO Base Media File Format; (2) extending the ISO Base Media File Format to include metadata information for the ESG; and (3) extending the ISO Base Media File Format for including hint track information to form ALC packets for transmission.
  • Figure 1 is a representation of how ALC-specific information can be included in the ISO Base Media File Format to realize a rich media broadcast/multicast streaming/downloading service using RTP and ALC protocols.
  • rich media 110 SVG with discrete and continuous media
  • SDP information 115 SDP information 115
  • ALC-based metadata information 120 SDP information 115
  • hint track information for ALC packetization 125 is entered into an ISO Base Media File generator 130, which produces a rich media ISO Base Media File 135.
  • the rich media ISO Base Media File 135 is provided to a rich media server 140.
  • the rich media player 140 then transmits (1) metadata of discrete files (ESG) 145; (2) session description information 150; and (3) RTP and ALC packets 155 to a rich media player 160, which can then decode the encoded information for subsequent exhibition.
  • Figure 2 shows the protocol stack for rich media streaming and file downloading over mobile broadcast/multicast networks.
  • SVG supports media elements similar to SMIL media elements. As discussed above, all of the embedded media can be divided into two parts—discrete and continuous media.
  • SVG can also embed other SVG documents, which in turn can embed yet more SVG documents through nesting.
  • the animation element specifies an external embedded SVG document or an SVG document fragment providing synchronized animated vector graphics.
  • the animation element is a graphical object with size determined by its x, y, width and height attributes. The following is one such example:
  • the media in SVG can be internally or externally referenced.
  • the embedded media elements can be linked through internal or external URLs in the SVG content.
  • internal URLs refer to file paths within the ISO Base Media File itself.
  • External URLs refer to file paths outside of the ISO Base Media File either present on the same server containing the source SVG file or on other servers.
  • the present invention is directed to ALC transport mechanisms for internally embedded discrete media.
  • Session Description Protocol (SDP) is specified for internally embedded media.
  • both discrete and continuous embedded media can be transported by HTTP, FLUTE or ALC depending on whether it is point-to-point or broadcast.
  • the continuous media can be transported by RTP in a broadcast streaming scenario.
  • the following transport combinations are envisioned: (1) broadcast streaming + HTTP downloading of discrete media; (2) unicast streaming + HTTP downloading of discrete media; (3) broadcast streaming + FLUTE/ALC downloading of discrete media; and (4) unicast streaming + FLUTE/ ALC downloading of discrete media.
  • the present invention is directed to items (3) and (4).
  • the discrete media files can be explicitly transmitted by transmitting them to the UE in advance via an ALC/FLUTE session; (2) transmitting them to each client on a point-to-point bearer before the streaming session, in a manner similar to the way security keys are sent to clients prior to an MBMS session; (3) having a parallel ALC/FLUTE transmission session independent of the RTP transmission session, if enough radio resources are available; or (4) having non-parallel transmission sessions to transmit all of the data due to the limited radio resources.
  • Each transmission session contains either ALC/FLUTE data or RTP data.
  • SDP Session Description Protocol
  • ALC packets can be used to transport the scene description, as well as discrete and/or continuous embedded media depending upon whether it is a pure download scenario or it is shared with an RTP streaming session.
  • the URIs of the internally embedded media are indicated in the ESG that is sent prior to the beginning of the ALC session.
  • the syntax of the SDP description for ALC is similar to that of FLUTE, and has been defined in the Internet-Draft: SDP Descriptors for FLUTE.
  • Item BMFDP Information An item level hint information container is defined within an 'ihib' box, dedicated for ALC or FLUTE. This can be used when all of the content in the current item is sent via ALC or FLUTE.
  • a movie level hint information container is defined within a 'hnti' box, dedicated for ALC or FLUTE. This can be used when all of the content in the current movie is sent via ALC or FLUTE.
  • a track level hint information container is defined within the 'hnti' box, dedicated for ALC or FLUTE. This can be used when all of the content in the current track is sent via ALC or FLUTE.
  • RTP stream may be combined together. This case may occur when SVG media contains both discrete and continuous embedded media.
  • the discrete media is transmitted via ALC, and continuous media is transmitted via RTP.
  • the SDP information can then be saved in the following boxes.
  • United States Provisional Patent Application No. 60/;713,303, filed September 1, 2005 defines boxes for FLUTE + RTP at different levels (e.g., presentation, movie, item). However, the same boxes can be generalized for broadcast download to include ALC as well. Therefore, boxes are prefixed with 'BMFDP' to be more generic to store SDP for ALC + RTP or FLUTE + RTP SDP.
  • a flag called 'protocol' indicates whether the boxes are being used for FLUTE or ALC.
  • the boxes at the three different levels are as follows:
  • the metadata is sent as part of the FDT if FLUTE is used as a broadcast protocol, or the metadata is sent as part of OMA BCAST ESG if ALC is used in conjunction with OMA BCAST ESG.
  • the ESG provides a mechanism for describing various metadata associated with files that are to be delivered through ALC in the Digital Mobile Broadcast Service.
  • the ESG specifies the use of Service Guide Delivery Units (SGDUs) and Service Guide Delivery Descriptors (SGDDs) for consistent control of the ESG while it is being downloaded via PtM and/or PtP channels.
  • SGDUs Service Guide Delivery Units
  • SGDDs Service Guide Delivery Descriptors
  • Metadata updated during broadcast/multicast file download can be performed either in-band or out-of-band.
  • In-band metadata updates are performed using FDT of FLUTE, where the FDT Instance ID identifies the latest version of the metadata in FDT.
  • the metadata is signaled as an SGDU object in ALC, then the ESG is updated according to the SGDU or SG fragment versioning system.
  • PtM transport of the new SGDU There are two options for the PtM transport of the new SGDU. With the first option, the new SGDU object is sent in a separate ALC session. With the second option, the new SGDU object is added to the same ALC session as the original ALC session.
  • 60/713,303 describes the extension of the ISO Base Media File format by defining boxes to store the data of FDT instances. However, the same boxes can be generalized to store metadata for broadcast download, including ALC as well. As in the first implementation, the boxes are prefixed with 'BMFDP' ' to be more generic to store metadata for broadcast/multicast downloads. These file parameters are required by any protocol used for broadcast/multicast downloading. Boxes are defined for all of the four levels— presentation, movie, track and item as depicted below. [0048] (a) Presentation Metadata Information.
  • a presentation level metadata container is defined within 'bdph' or 'bdrp' boxes, dedicated for ALC/FLUTE or ALC/FLUTE + RTP transport schemes, respectively. aligned(8) class BMFDPpresentationmetadatainformation extends box('bmpm') ⁇
  • the Content-Location of embedded media resources may be referred by using the URL forms defined in Section 8.44.7 in ISO/IEC 15444-12:2005.
  • (b) Item Metadata Information An item level metadata container is defined within 'bdih' or 'bdri' boxes, dedicated for ALC/FLUTE or ALC/FLUTE+RTP transport schemes, respectively. aligned(8) class BMFDPitemmetadatainformation extends box('bmim') ⁇
  • a movie level metadata container is defined within the 'hnti' box, dedicated for ALC/FLUTE. aligned(8) class BMFDPmoviemetadatainformation extends box('bmmm') ⁇
  • a track level metadata container is defined within the 'bdth' box, dedicated for ALC/FLUTE. This can be used when all of the content in the current track is sent via ALC/FLUTE.
  • aligned(8) class BMFDPtrackmetadatainformation extends box('bmtm') ⁇
  • a third implementation of the present invention involves the use of boxes to store hint track information.
  • the hint track structure is generalized to support hint samples in multiple data formats.
  • the hint track sample contains any data needed to build the packet header of the correct type.
  • the hint track sample also contains a pointer to the block of data that belongs in the packet.
  • Such data can be SVG, continuous embedded media, and discrete embedded media.
  • Hint track samples are not part of the hint track box structure, although they are usually found in the same file.
  • the hint track data reference box ('dref ) and sample table box ('stbl') are used to find the file specification and byte offset for a particular sample.
  • Hint track sample data is byte-aligned and is always in big-endian format.
  • boxes can be prefixed with 'BMFDP' to be more generic to store hint track information for either ALC or FLUTE.
  • BMFDPsample are defined. In addition, some related structures and constructors are also defined. A 'protocol' is also added to indicates whether the boxes are being used for FLUTE or ALC.
  • BMFDP hint tracks are hint tracks (media handler 'hint') with an entry-format in the sample description of 'bmfd'.
  • BMFDPHintSampleEntry is contained in the SampleDescriptionBox ('stsd').
  • the sample-delta between first sample of current media/SVG and the final sample of previous media/SVG has the same value as the difference between start-time of the scene/update to which current and previous media/SVG belong.
  • the sample-deltas for the rest of the successive samples in current media/SVG are zero.
  • a BMFDP sample represents an entire media or SVG content, then there will be no successive samples (containing the successive data from the same media/SVG) with deltas equal to zero following this BMFDP sample. Therefore, only one sample-delta is present for current BMFDP sample.
  • Each sample contains two areas—the instructions to compose the packets, and any extra data needed when sending those packets (e.g. an encrypted version of the media data). It should be noted that the size of the sample is known from the sample size table. aligned(8) class BMFDPsample ⁇ uint protocol; // 0 for ALC, 1 for FLUTE unsigned int( 16) packetcount; unsigned int( 16) reserved;
  • Each packet in the packet entry table has the following structure: aligned(8) class BMFDPpacket ⁇ uint protocol; // 0 for ALC, 1 for FLUTE
  • BMFDPheader bmfdpjieader unsigned int(l 6) entrycount; dataentry constructors [entrycount];
  • the "bmfdpjieader" contains the header for current BMFDP packet.
  • Entry_count is the count of following constructors, with a constructor being a structure which is used to construct the BMFDP packets. aligned(8) class BMFDPheader ⁇ uint protocol; // 0 for ALC, 1 for FLUTE
  • the FEC_payload_lD is determined by the FEC Encoding ID that must be communicated in the Session Description, class pseudoheader ⁇ unsigned int(32) source_address; unsigned int(32) destination_address; unsigned int(8) zero; unsigned int(8) protocol; unsigned int(16) UDPJength; ⁇
  • class UDPheader ⁇ pseudoheader pheader; unsigned int(16) source_port; unsigned int(16) destination_port; unsigned int(16) length; unsigned int(16) checksum; ⁇
  • the FEC_encoding_ID used below must be signalled in the session description.
  • the header_extension_type 193 is valid only for FLUTE. It is used for EXT_CENC, the header extension that indicates the content encoding used by the FDT Instance.
  • class BMFDPnoopconstructor extends BMFDPconstructor(O)
  • signed int(8) trackrefindex; unsigned int(16) length; unsigned int(32) sampledescriptionindex; unsigned int(32) sampledescriptionoffset; unsigned int(32) reserved; ⁇
  • class BMFDPitemconstructor extends BMFDPconstructor(4)
  • FDT data is one part of the whole FLUTE/ALC data stream. This data is either transmitted in-band during the FLUTE session in the form of FLUTE packets, or out-of-band during an ALC session using ESG or by other means.
  • a constructor defined herein is used to map SGDU to the ALC packet. The syntax of the constructor is provided as follows: aligned(8) class ALCsgduconstructor extends BMFDPconstructor( ⁇ )
  • BMFDP+RTP hint tracks are hint tracks (media handler 'hint') with an entry-format in the sample description of 'frhs': BMFDPRtpHintSampleEntry is defined within the SampledDescriptionBox 'stsd'
  • the hinttrackversion is currently 1.
  • the highest compatible version field specifies the oldest version with which this track is backward compatible.
  • the maxpacketsize indicates the size of the largest packet that this track will generate.
  • the additional data is a set of boxes ('tims', and 'tsro' ) which are defined in the ISO Base Media File Format.
  • BMFDPRTPSample is defined within the MediaDataBox ('mdat'). This box contains multiple BMFDP samples, RTP samples, possible FDT/SGDU and SDP information and any extra data.
  • One BMFDPRTPSample may contain either one FDT/SGDU data, SDP data, BMFDP sample, or RTP sample.
  • BMFDPRTPSamples that contain BMFDP samples are used here only to transmit the discrete media. Such media are always embedded in the Scene or Scene Update among the SVG presentation. Their start-times are the same as the start-time of Scene/Scene Update they belong to.
  • BMFDP samples do not have their own specific timestamps, but instead are sent sequentially, immediately after the RTP samples of the Scene/Scene Update they belong to. Therefore in the TimeToSampleBox, the sample-deltas of the BMFDPRTPSample for discrete media are all set to zero. Their sequential order represents their sending-time order.
  • a fourth implementation of the present invention is similar to the first implementation discussed above. However, other description formats such as DCCPtext may be stored, in which case the sdptext field will change accordingly.
  • a fifth implementation is similar to the second implementation. However, in this embodiment, a single fragment field can contain all of the fragment data in the ESG. The application can then choose to either split this data into different fields for all levels or for files.
  • the hinttrackversion and highestcompatibleversion fields may have different values, and a minpacketsize field may be added in addition to the maxpacketsize field.
  • the packetcount field can be made 32 bits by removing the reserved field.
  • the hierarchical structure of the different header boxes (BMFDPheader, UDPheader, LCTheader, etc.) could be different.
  • the ALCsgdutconstructor syntax can have separate field definitions for each sgdu_box, the BMFDPitemconstructor may have item_id replaced by itemjname, the BMFDPxmlboxconstructor can have the data_length field to be made to 64 bytes by removing the reserved field, and the BMFDPxmlboxconstructor can have the data_length field to be made to 16 bytes and adjust reserved field to 64 bytes.
  • the BMFDPRtpHintSampleEntry can have the hinttrackversion and highestcompatibleversion fields to be of different values, the BMFDPRtpHintSampleEntry can add a minpacketsize field in addition to the maxpacketsize field, and the BMFDPRTPSample box can have separate field definitions for each sample__type.
  • ALC ALC-based language
  • a first such use case involves the preview of long cartoon animations.
  • the service of the present invention allows an end-user to progressively download small portions of each animation before deciding which one he or she wishes to view in its entirety.
  • a second use case for the present invention involves interactive Mobile TV services.
  • a deterministic rendering and behavior of rich- media content can be delivered together in the end user interface.
  • the content can include audio-video content, text, graphics, images, and TV and radio channels.
  • the service must provide convenient navigation through content in a single application or service, and the service must allow synchronized interaction in local or in distant settings, such as voting and personalization (e.g., related menus or sub-menus, advertising and content in function of the end-user profile or service subscription).
  • This use case is described in four steps corresponding to four services and sub- services available in an iTV mobile service — a mosaic menu showing the TV channel landscape, an electronic program guide and triggering of a related iTV service, the iTV service, and personalized menus, such as "sports news.”
  • a third use case for the present invention involves the use of a live enterprise data feed.
  • This service includes, for example, stock tickers that provide streaming of real-time quotes, intra-day charts with technical indicators, news monitoring, weather alerts, charts, business updates, sports scores, etc.
  • a fourth use case for the present invention involves live chat services.
  • Live chat services can be incorporated within a web cam, video channel, or a rich-media blog service.
  • End-users can register, save their surname and exchange messages. Messages appear dynamically in the live chat service, along with rich-media data provided by the end-user.
  • the chat service can be either private or public in one or more multiple channels at the same time. End users are dynamically alerted of new messages from other users. Dynamic updates of messages within the service occur without reloading a complete page.
  • a fifth use case for the present invention involves karaoke services.
  • Karaoke services display a music TV channel or video clip catalog, along with the speech of a song with fluid-like animation on the text characters to be singing (e.g., a smooth color transition of fonts, scrolling of text, etc)
  • the end-user can download a song of his or her choice, along with the complete animation by selecting an interactive button.
  • Similar systems can also be used for the reenactment of movie or television shows or clips.
  • FIGS 4 and 5 show one representative electronic device 12 within which the present invention may be implemented. It should be understood, however, that the present invention is not intended to be limited to one particular type of electronic device.
  • the electronic device 12 of Figures 4 and 5 includes a housing 30, a display 32 in the form of a liquid crystal display, a keypad 34, a microphone 36, an ear-piece 38, a battery 40, an infrared port 42, an antenna 44, a smart card 46 in the form of a UICC according to one embodiment of the invention, a card reader 48, radio interface circuitry 52, codec circuitry 54, a controller 56 and a memory 58.
  • Individual circuits and elements are all of a type well known in the art, for example in the Nokia range of mobile telephones.
  • the present invention is described in the general context of method steps, which may be implemented in one embodiment by a program product including computer-executable instructions, such as program code, executed by computers in networked environments.
  • program modules include routines, programs, objects, components, data structures, etc. that perform particular tasks or implement particular abstract data types.
  • Computer-executable instructions, associated data structures, and program modules represent examples of program code for executing steps of the methods disclosed herein.
  • the particular sequence of such executable instructions or associated data structures represents examples of corresponding acts for implementing the functions described in such steps. '

Landscapes

  • Engineering & Computer Science (AREA)
  • Multimedia (AREA)
  • Signal Processing (AREA)
  • Computer Networks & Wireless Communication (AREA)
  • Business, Economics & Management (AREA)
  • General Business, Economics & Management (AREA)
  • Computer Security & Cryptography (AREA)
  • Databases & Information Systems (AREA)
  • Two-Way Televisions, Distribution Of Moving Picture Or The Like (AREA)
  • Information Transfer Between Computers (AREA)

Abstract

An extension to the ISO Base Media File Format to support ALC as a broadcast protocol. The present invention also provides for the extension of the ESG to include metadata specific to the transport of SVG over mobile broadcast/multicast networks. A 'BMFDP hint track' is introduced in the container file format, with the required file metadata being in these hint tracks. The present invention can be used in applications such as the preview of long cartoon animations, interactive Mobile TV services, live enterprise data feeds, live chat services, and karaoke programs.

Description

EXTENSIONS TO RICH MEDIA CONTAINER FORMAT FOR USE BY MOBILE BROADCAST/MULTICAST STREAMING
SERVERS
FIELD OF THE INVENTION
[0001] The present invention relates generally to the extension of the ISO Base Media File Format to include Asynchronous Layered Coding (ALC) as a broadcast/multicast protocol while streaming rich media content. More particularly, the present invention relates to the inclusion of file session description protocol (SDP), metadata and hint tracks for broadcast/multicast downloading of rich media content using ALC.
BACKGROUND OF THE INVENTION
[0002] This section is intended to provide a background or context to the invention that is recited in the claims. The description herein may include concepts that could be pursued, but are not necessarily ones that have been previously conceived or pursued. Therefore, unless otherwise indicated herein, what is described in this section is not prior art to the description and claims in this application and is not admitted to be prior art by inclusion in this section.
[0003] Rich media content is generally referred to content that is graphically rich and contains compound (or multiple) media, including graphics, text, video and audio, and is preferably delivered through a single interface. The streaming of rich media content is becoming increasingly important for delivering visually rich content for real-time content, especially within the MBMS and Packet Switched Streaming (PSS) service architecture. Multimedia Broadcast Multicast Service (MBMS) streaming services facilitate resource-efficient delivery of popular real-time content to multiple receivers in a 3 G mobile environment. The content may be pre-recorded or generated from a live feed.
[0004] There are currently several existing systems for representing rich media, particularly in the Web services domain. Scalable Vector Graphics (SVG) Mobile 1.2 is a language for describing two-dimensional graphics in XML. Scalable Vector Graphics allow for three types of graphic objects: vector graphic shapes (e.g., paths consisting of straight lines and curves), multimedia (such as raster images, video, etc.) and text. SVG drawings can be interactive (using the document object model (DOM) event model) and dynamic. Animations can be defined and triggered either declaratively (i.e., by embedding SVG animation elements in SVG content) or via scripting. Sophisticated applications of SVG are possible by the use of a supplemental scripting language which accesses the SVG Micro Document Object Model (uDOM), which provides complete access to all elements, attributes and properties. A rich set of event handlers can be assigned to any SVG graphical object. Because of its compatibility and leveraging of other Web standards (such as the compound documents format (CDF)), features like scripting can be performed on XHTML and SVG elements simultaneously within the same Web page. [0005] The Synchronized Multimedia Integration Language (SMIL) version 2.0 enables the simple authoring of interactive audiovisual presentations. SMIL is typically used for "rich media'Vmultimedia presentations which integrate streaming audio and video with images, text or any other media type. [0006] For CDF, there are currently efforts underway to combine separate component languages (e.g. XML-based languages, elements and attributes from separate vocabularies), such as XHTML, SVG, MathML, and SMIL, with a focus on user interface markups. When combining user interface markups, specific problems have to be resolved that are not addressed by the individual markups specifications. These issues include the propagation of events across markups and the combination of rendering or the user interaction model with a combined document. [0007] Real-Time Transport Protocol (RTP) is currently the preferred transport layer protocol for the streaming delivery of continuous media, such as audio, video and SVG. RTP is used for unicast streaming (e.g., 3GPP PSS, 3GPP2 MSS (Multimedia Streaming Services)), broadcast/multicast streaming (e.g., 3GPP multimedia broadcast/multicast service (MBMS), 3GPP2 BCMCS (Broadcast Multicast Services)) and rich media conferencing applications. [0008] Asynchronous layered coding (ALC) is a massively scalable and reliable content delivery protocol. ALC is the base protocol for the reliable multicast delivery of arbitrary binary objects. ALC is adopted as the mandatory protocol for broadcast/multicast file delivery in 3GPP2 BCMCS and OMA BAC BCAST. The file metadata (that is carried as part of a File Delivery Table (FDT) in file delivery over unidirectional transport (FLUTE)) is now delivered to the clients as part of the OMA BCAST electronic service guide (ESG). This metadata is divided into various ESG fragments according to a service guide data model. The fragments are identified as: Service, Schedule, Content, Access, Session Description, Purchase Item, Purchase Data, Purchase Channel, Service Guide Context, Service Guide Delivery Descriptor and Preview Data. The OMA BCAST ESG data model is illustrated in prior art Figure 3. The ESG is normally delivered to the clients well in advance of the ALC session. Therefore, clients have the file metadata before the ALC session starts. If the file metadata needs to be updated during the ALC session, then those fragments of ESG that contain the file metadata are updated by using the ESG delivery/update channels. Therefore, file metadata update is not performed in-band of the ALC session. [0009] FLUTE builds on top of ALC and defines FDT that stores the metadata associated with the files being delivered in an ALC session. FLUTE also provides mechanisms for in-band delivery and updates of FDT. FLUTE is adopted by 3GPP MBMS and DVB-H IPDC as the mandatory protocol for broadcast/multicast file delivery.
[0010] In addition to the above, There also exists an ISO Base Media File container format for storage of rich media content and subsequent transport of such content over HTTP, FLUTE and RTP. These formats are discussed in detail in United States Provisional Patent Application No. 60/713,303, filed September 1, 2005, and United States Provisional Patent Application No. 60/694,440, filed June 27, 2005, both of which are incorporated herein by reference in their entirety. However, there are currently no mechanisms for including ALC as a broadcast protocol, particularly for storing file metadata in the ESG.
[0011] Until recently, applications for mobile devices were text-based with limited interactivity. However, as more wireless devices are coming equipped with color displays and more advanced graphics rendering libraries, consumers are increasingly demanding a rich media experience from all of their wireless applications. A realtime rich media content streaming service is imperative for mobile terminals, especially in the area of MBMS, PSS, and MMS services. [0012] SVG is designed to describe resolution independent 2D vector graphics, allowing for interactivity using the event model and animation concepts borrowed from SMIL. SVG also allows for infinite zoomability and enhances the power of user interfaces on mobile devices. In addition, SVG supports embedding of media elements similar to SMIL media elements.
[0013] AU the embedded media can be divided into two parts - discrete media (e.g. images) and continuous media (e.g. audio, video). Continuous media elements define their own timelines within their time container. SVG is therefore, gaining importance and becoming one of the core elements to drive a multimedia presentation, especially for rich media services such as Mobile TV, live updates of traffic information, weather, news, etc. SVG is XML-based, allowing for more transparent integration with other existing web technologies.
[0014] For rich media streaming over broadcast/multicast networks, RTP can be used to deliver continuous media such as audio, video and SVG scenes/updates. However and as discussed previously, SVG presentations also include discrete media, and it currently makes more sense to use file download protocols rather than RTP for the delivery of discrete media. ALC and FLUTE are currently the preferred transport layer protocols for file delivery over broadcast/multicast networks. A broadcast/multicast streaming server should be able to generate rich media packets (RTP and ALC, or RTP and FLUTE) by reading the contents from a rich media container file. A container file may include (1) media tracks for continuous media, i.e., SVG tracks, audio tracks, video tracks, raster images, etc.; (2) hint tracks that hold synchronization information; and (3) internally embedded discrete media. [0015] A broadcast/multicast streaming server creates RTP packets to carry continuous media by using the media tracks and hint tracks. The server also needs to create ALC or FLUTE packets to carry internally embedded discrete media. The server may also decide to acquire some or all of the externally referenced discrete media, and send them to the clients using ALC or FLUTE. After reception, these images (a) could be rendered when the corresponding SVG content is being played or (b) can be locally stored/cached and rendered by user interaction. This results in a satisfactory user experience and does not require another simultaneous point-to-point (PtP) connection from the client. However, this optimization does not preclude the possibility of simultaneous PtP connections, but instead merely minimizes the need for simultaneous PtP connections. If these discrete media are not free to access, for example, CNN imagery, then the rich media application can make sure that the images are rendered only if the user buys them (i.e., the digital rights management (DRM) rights). Thus, the externally referenced discrete media files are broadcast to all users whether they subscribed to their access or not. The broadcast download of these files wastes radio and memory resources for users that do not subscribe to these files. For users that subscribe to these files, the broadcast download of these files reduces the usage of radio resources and also enhances their user experience. To create ALC or FLUTE packets, the server needs the metadata associated with the discrete media (image) files. This file metadata also needs to be included in the rich media container file.
SUMMARY OF THE INVENTION
[0016] The invention solves the problem of a lack of a mechanism for storing this metadata in the rich media container files. The present invention includes an extension to the ISO Base Media File Format to support ALC as a broadcast protocol, as well as the extension of the ESG to include metadata specific to the transport of SVG over mobile broadcast/multicast networks. A "BMFDP hint track" is introduced in the container file format, with the required file metadata being in these hint tracks. [0017] There are several use cases for rich media services that can benefit from using ALC as a protocol. These uses include the preview of long cartoon animations, interactive Mobile TV services, live enterprise data feeds, live chat services, and karaoke programs. As there has been no previous solution to include ALC-specific content in the ISO Base Media File format, the inclusion of ALC facilitates greater leverage for being able to send out-of-band information during a rich media session to 3G mobile devices for downloading applications. Given the benefits for services offering interactive and dynamic rich media streaming and progressive downloading provided by the present invention, the invention can be incorporated into a wide variety of products, and the present invention can be used in services such as 3GPP MBMS, 3GPP2 BCMCS, DVB-H IPDC and OMA BCAST. [0018] These and other advantages and features of the invention, together with the organization and manner of operation thereof, will become apparent from the following detailed description when taken in conjunction with the accompanying drawings, wherein like elements have like numerals throughout the several drawings described below.
BRIEF DESCRIPTION OF THE DRAWINGS
[0019] Figure 1 is a representation showing how ALC-specific information can be included in the ISO Base Media File Format to realize a rich media broadcast/multicast streaming/download service using RTP and ALC protocols;
[0020] Figure 2 is a representation of the protocol stack for rich media streaming and file downloading over mobile broadcast/multicast networks;
[0021] Figure 3 is a representation of a conventional data model of the OMA
BCAST ESG;
[0022] Figure 4 is a perspective view of an electronic device that can be used in the implementation of the present invention; and
[0023] Figure 5 is a schematic representation of the telephone circuitry of the mobile telephone of Figure 4.
DETAILED DESCRIPTION OF THE PREFERRED EMBODIMENTS
[0024] The present invention includes an extension to the ISO Base Media File Format to support ALC as a broadcast protocol, as well as the extension of the ESG to include metadata specific to the transport of SVG over mobile broadcast/multicast networks. A "BMFDP hint track" is introduced in the container file format, with the required file metadata being in these hint tracks. [0025] The implementation of the present invention involves the extension of the ISO Base Media File Format to support ALC protocol for transmission. This involves (1) adding session description information for ALC in the ISO Base Media File Format; (2) extending the ISO Base Media File Format to include metadata information for the ESG; and (3) extending the ISO Base Media File Format for including hint track information to form ALC packets for transmission. [0026] Figure 1 is a representation of how ALC-specific information can be included in the ISO Base Media File Format to realize a rich media broadcast/multicast streaming/downloading service using RTP and ALC protocols. As can be observed in Figure 1, rich media 110 (SVG with discrete and continuous media), SDP information 115, ALC-based metadata information 120, and hint track information for ALC packetization 125 is entered into an ISO Base Media File generator 130, which produces a rich media ISO Base Media File 135. The rich media ISO Base Media File 135 is provided to a rich media server 140. The rich media player 140 then transmits (1) metadata of discrete files (ESG) 145; (2) session description information 150; and (3) RTP and ALC packets 155 to a rich media player 160, which can then decode the encoded information for subsequent exhibition. Figure 2 shows the protocol stack for rich media streaming and file downloading over mobile broadcast/multicast networks. It should be understood that, although text and examples contained herein may specifically describe an encoding process, one skilled in the art would readily understand that the same concepts and principles also apply to the corresponding decoding process and vice versa.
[0027] The following is a description of the capability of SVG to embed media, as well as the different transport scenarios for rich media transmission. This information provides a basis for the usage of ALC as a protocol for rich media download, as well as the scope of the issues addressed by the present invention. SVG supports media elements similar to SMIL media elements. As discussed above, all of the embedded media can be divided into two parts—discrete and continuous media. Discrete media such as images are embedded in SVG using the 'image' element, such as: <image x="200" y="200" width="100px" height="100px" xlink:href="myimage.png"> [0028] Examples for including continuous media such as audio and video are as follows: <audio xlink:href="l.ogg" volume="0.7" type="audio/vorbis" begin="mybutton.click" repeatCount="3" />
<video xlink:href="ski.avi" volume=".8" type="video/x-msvideo" x="10" y="170'7>
[0029] SVG can also embed other SVG documents, which in turn can embed yet more SVG documents through nesting. The animation element specifies an external embedded SVG document or an SVG document fragment providing synchronized animated vector graphics. Like the video element, the animation element is a graphical object with size determined by its x, y, width and height attributes. The following is one such example:
<animation
Figure imgf000009_0001
x="100" y="100" xlink:href="myIcon.svg"/>
[0030] Similarly, the media in SVG can be internally or externally referenced.
While the above examples are internally referenced, the following example shows externally referenced media:
<animate attributeName="xlink:href ' values="http://www.example.com/images/l .png; http://www.example.eom/images/2.png; http://www.example.eom/images/3.png" begin="15s" dur="30s" />
[0031] The embedded media elements can be linked through internal or external URLs in the SVG content. In this scenario, internal URLs refer to file paths within the ISO Base Media File itself. External URLs refer to file paths outside of the ISO Base Media File either present on the same server containing the source SVG file or on other servers. The present invention is directed to ALC transport mechanisms for internally embedded discrete media. Correspondingly, Session Description Protocol (SDP) is specified for internally embedded media.
[0032] The transport mechanisms discussed herein are provided for embedded discrete and continuous media residing on the same or another server. These media could be in raw or ISO Base Media File format. However, appropriate steps should be taken to ensure the proper synchronization of these media elements during transmission and presentation.
[0033] For download, both discrete and continuous embedded media can be transported by HTTP, FLUTE or ALC depending on whether it is point-to-point or broadcast. However, only the continuous media can be transported by RTP in a broadcast streaming scenario. For this reason, the following transport combinations are envisioned: (1) broadcast streaming + HTTP downloading of discrete media; (2) unicast streaming + HTTP downloading of discrete media; (3) broadcast streaming + FLUTE/ALC downloading of discrete media; and (4) unicast streaming + FLUTE/ ALC downloading of discrete media. The present invention is directed to items (3) and (4). Furthermore, the discrete media files can be explicitly transmitted by transmitting them to the UE in advance via an ALC/FLUTE session; (2) transmitting them to each client on a point-to-point bearer before the streaming session, in a manner similar to the way security keys are sent to clients prior to an MBMS session; (3) having a parallel ALC/FLUTE transmission session independent of the RTP transmission session, if enough radio resources are available; or (4) having non-parallel transmission sessions to transmit all of the data due to the limited radio resources. Each transmission session contains either ALC/FLUTE data or RTP data.
[0034] The following is a description of a first implementation of the present invention, involving the use of boxes to store Session Description Protocol (SDP) information. SDP is a common practical format for specifying the session description. ALC packets can be used to transport the scene description, as well as discrete and/or continuous embedded media depending upon whether it is a pure download scenario or it is shared with an RTP streaming session. The URIs of the internally embedded media are indicated in the ESG that is sent prior to the beginning of the ALC session. The syntax of the SDP description for ALC is similar to that of FLUTE, and has been defined in the Internet-Draft: SDP Descriptors for FLUTE. [0035] There may be various description formats for ALC. SDP is defined is defined below. The text present in SDP is correctly formatted as a series of lines, each terminated by <crlf>, as required by SDP. This case arises for the transmission of SVG scene and scene updates and discrete embedded media. Boxes for FLUTE at different levels (presentation, movie, track, item) are described in detail in United States Provisional Patent Application No. 60/;713,303, filed September 1, 2005. However, the same boxes can be generalized for broadcast download to include ALC as well. Therefore, boxes are prefixed with 'BMFDP' (Broadcast Multicast File Download Protocol) to be more generic to store SDP for either ALC or FLUTE SDP. A flag called 'protocol' is also added to indicate whether the boxes are being used for FLUTE or ALC. The boxes at the four different levels are as follows; [0036] (a) Presentation BMFDP Information. A presentation level hint information container is defined within a 'phib' box, dedicated for ALC or FLUTE. This can be used when all of the content in the current presentation is sent via ALC or FLUTE, aligned(δ) class BMFDPpresentationhintinformation extends box('bdph') { uint protocol; // 0 for ALC, 1 for FLUTE uint(32) descriptionformat = 'sdp '; char sdptextfj;
}
[0037] (b) Item BMFDP Information. An item level hint information container is defined within an 'ihib' box, dedicated for ALC or FLUTE. This can be used when all of the content in the current item is sent via ALC or FLUTE. aligned(8) class BMFDPitemhintinformation extends box('bdih') { uint protocol; // 0 for ALC, 1 for FLUTE uint(32) descriptionformat = 'sdp '; char sdptext[];
}
[0038] (c) Movie Broadcast Download Information. A movie level hint information container is defined within a 'hnti' box, dedicated for ALC or FLUTE. This can be used when all of the content in the current movie is sent via ALC or FLUTE. aligned(8) class BMFDPmoviehintinformation extends box('bdmh') { uint protocol; // 0 for ALC, 1 for FLUTE uint(32) descriptionformat = 'sdp '; char sdptext[];
}
[0039] (d) Track Broadcast Download Information. A track level hint information container is defined within the 'hnti' box, dedicated for ALC or FLUTE. This can be used when all of the content in the current track is sent via ALC or FLUTE. aligned(8) class BMFDPtrackhintinformation extends box('bdth') { uint protocol; // 0 for ALC, 1 for FLUTE uint(32) descriptionformat = 'sdp '; char sdptext[];
}
[0040] In the ALC + RTP transport system, the SDP information for the ALC and
RTP stream may be combined together. This case may occur when SVG media contains both discrete and continuous embedded media. The discrete media is transmitted via ALC, and continuous media is transmitted via RTP. The SDP information can then be saved in the following boxes. United States Provisional Patent Application No. 60/;713,303, filed September 1, 2005 defines boxes for FLUTE + RTP at different levels (e.g., presentation, movie, item). However, the same boxes can be generalized for broadcast download to include ALC as well. Therefore, boxes are prefixed with 'BMFDP' to be more generic to store SDP for ALC + RTP or FLUTE + RTP SDP. A flag called 'protocol' indicates whether the boxes are being used for FLUTE or ALC. The boxes at the three different levels are as follows:
[0041] (a) Presentation SDP Information. aligned(8) class BMFDPrtppresentationhintinformation extends box('bdrp') { uint protocol; // 0 for ALC, 1 for FLUTE uint(32) descriptionformat = 'sdp '; char sdptext[];
}
[0042] (b) Movie SDP Information. aligned(8) class BMFDPrtpmoviehintinformation extends box('bdrm') { uint protocol; // 0 for ALC, 1 for FLUTE uint(32) descriptionformat = 'sdp '; char sdptext[];
}
[0043] (c) Item SDP Information. aligned(8) class BMFDPrtpitemhintinformation extends box('bdri') { uint protocol; // 0 for ALC, 1 for FLUTE uint(32) descriptionformat = 'sdp '; char sdptextQ; } [0044] A second implementation of the present invention involves the use of boxes to store metadata information. In order to transmit internally embedded discrete media using broadcast/multicast file download protocol (ALC/FLUTE), it is required for the server to also transmit some metadata corresponding to the discrete media. Hence, this implementation of the invention involves the encapsulation of this metadata in the ISO Base Media File Format.
[0045] The metadata is sent as part of the FDT if FLUTE is used as a broadcast protocol, or the metadata is sent as part of OMA BCAST ESG if ALC is used in conjunction with OMA BCAST ESG. The ESG provides a mechanism for describing various metadata associated with files that are to be delivered through ALC in the Digital Mobile Broadcast Service. In addition, the ESG specifies the use of Service Guide Delivery Units (SGDUs) and Service Guide Delivery Descriptors (SGDDs) for consistent control of the ESG while it is being downloaded via PtM and/or PtP channels.
[0046] Metadata updated during broadcast/multicast file download can be performed either in-band or out-of-band. In-band metadata updates are performed using FDT of FLUTE, where the FDT Instance ID identifies the latest version of the metadata in FDT. If the metadata is signaled as an SGDU object in ALC, then the ESG is updated according to the SGDU or SG fragment versioning system. There are two options for the PtM transport of the new SGDU. With the first option, the new SGDU object is sent in a separate ALC session. With the second option, the new SGDU object is added to the same ALC session as the original ALC session. [0047] United States Provisional Patent Application No. 60/713,303 describes the extension of the ISO Base Media File format by defining boxes to store the data of FDT instances. However, the same boxes can be generalized to store metadata for broadcast download, including ALC as well. As in the first implementation, the boxes are prefixed with 'BMFDP' ' to be more generic to store metadata for broadcast/multicast downloads. These file parameters are required by any protocol used for broadcast/multicast downloading. Boxes are defined for all of the four levels— presentation, movie, track and item as depicted below. [0048] (a) Presentation Metadata Information. A presentation level metadata container is defined within 'bdph' or 'bdrp' boxes, dedicated for ALC/FLUTE or ALC/FLUTE + RTP transport schemes, respectively. aligned(8) class BMFDPpresentationmetadatainformation extends box('bmpm') {
String Content-Location; unsignedLong Content-Length; unsignedLong Transfer-Length;
String Content-Type'
String Content-Encoding;
Base64Binary Content-MD5;
}
[0049] The Content-Location of embedded media resources may be referred by using the URL forms defined in Section 8.44.7 in ISO/IEC 15444-12:2005.
[0050] (b) Item Metadata Information. An item level metadata container is defined within 'bdih' or 'bdri' boxes, dedicated for ALC/FLUTE or ALC/FLUTE+RTP transport schemes, respectively. aligned(8) class BMFDPitemmetadatainformation extends box('bmim') {
String Content-Location; unsignedLong Content-Length; unsignedLong Transfer-Length;
String Content-Type'
String Content-Encoding;
Base64Binary Content-MD5;
}
[0051] (c) Movie Metadata Information. A movie level metadata container is defined within the 'hnti' box, dedicated for ALC/FLUTE. aligned(8) class BMFDPmoviemetadatainformation extends box('bmmm') {
String Content-Location; unsignedLong Content-Length; unsignedLong Transfer-Length;
String Content-Type'
String Content-Encoding;
Base64Binary Content-MD5; }
[0052] (d) Track Metadata Information. A track level metadata container is defined within the 'bdth' box, dedicated for ALC/FLUTE. This can be used when all of the content in the current track is sent via ALC/FLUTE. aligned(8) class BMFDPtrackmetadatainformation extends box('bmtm') {
String Content-Location; unsignedLong Content-Length; unsignedLong Transfer-Length;
String Content-Type'
String Content-Encoding;
Base64Binary Content-MD5;
}
[0053] A third implementation of the present invention involves the use of boxes to store hint track information. The hint track structure is generalized to support hint samples in multiple data formats. The hint track sample contains any data needed to build the packet header of the correct type. The hint track sample also contains a pointer to the block of data that belongs in the packet. Such data can be SVG, continuous embedded media, and discrete embedded media.
[0054] Hint track samples are not part of the hint track box structure, although they are usually found in the same file. The hint track data reference box ('dref ) and sample table box ('stbl') are used to find the file specification and byte offset for a particular sample. Hint track sample data is byte-aligned and is always in big-endian format.
[0055] The following is a discussion of the hint track format for ALC. United
States Provisional Patent Application No. 60/713,303 describes boxes for FLUTE hint track information at differenflevels (e.g., presentation, movie, track, item). However, the same boxes can be generalized for broadcast download to include ALC as well.
Hence, boxes can be prefixed with 'BMFDP' to be more generic to store hint track information for either ALC or FLUTE.
[0056] Similar to the hierarchy of RTP hint track, the BMFDPHintSampleEntry and
BMFDPsample are defined. In addition, some related structures and constructors are also defined. A 'protocol' is also added to indicates whether the boxes are being used for FLUTE or ALC.
[0057] (a) Sample Description Format. BMFDP hint tracks are hint tracks (media handler 'hint') with an entry-format in the sample description of 'bmfd'. The
BMFDPHintSampleEntry is contained in the SampleDescriptionBox ('stsd'). class BMFDPHintSampleEntryQ extends SampleEntry (bmfd') { uint protocol; // 0 for ALC, 1 for FLUTE uint(16) hinttrackversion = 1 ; uint( 16) highestcompatibleversion = 1 ; uint(32) maxpacketsize; box additionaldata[]; //optional
}
[0058] The fields "hinttrackversion", "highestcompatibleversion" and
"maxpacketsize" have the same interpretation as that in the "RtpHintSampleEntry" described in section 10.2 of the ISO/IEC 15444-12:2005 specification. The additional data is a set of boxes from "timescaleentry" and "timeoffsef", which are referenced in ISO/IEC 15444-12:2005 section 10.2. These boxes are optional for ALC/FLUTE. [0059] (b) Sample Format. Each BMFDP sample in the hint track will generate one or more BMFDP packets, respectively. Compared to RTP samples, BMFDPsamples do not have their own specific timestamps, but instead are sent sequentially. Considering the sample-delta saved in the TimeToSampleBox, if the BMFDP samples represent fragments of the embedded media or SVG content, then the sample-delta between first sample of current media/SVG and the final sample of previous media/SVG has the same value as the difference between start-time of the scene/update to which current and previous media/SVG belong. The sample-deltas for the rest of the successive samples in current media/SVG are zero. However, if a BMFDP sample represents an entire media or SVG content, then there will be no successive samples (containing the successive data from the same media/SVG) with deltas equal to zero following this BMFDP sample. Therefore, only one sample-delta is present for current BMFDP sample.
[0060] Each sample contains two areas—the instructions to compose the packets, and any extra data needed when sending those packets (e.g. an encrypted version of the media data). It should be noted that the size of the sample is known from the sample size table. aligned(8) class BMFDPsample { uint protocol; // 0 for ALC, 1 for FLUTE unsigned int( 16) packetcount; unsigned int( 16) reserved;
BMFDPpacket packets[packetcount] ; byte extradata[]; //optional
}
[0061] (c) Packet Entry Format. Each packet in the packet entry table has the following structure: aligned(8) class BMFDPpacket { uint protocol; // 0 for ALC, 1 for FLUTE
BMFDPheader bmfdpjieader; unsigned int(l 6) entrycount; dataentry constructors [entrycount];
}
[0062] The "bmfdpjieader" contains the header for current BMFDP packet. The
"entry_count" is the count of following constructors, with a constructor being a structure which is used to construct the BMFDP packets. aligned(8) class BMFDPheader { uint protocol; // 0 for ALC, 1 for FLUTE
UDPheader header;
LCTheader lct__header; variable FEC_payload_ID;
}
[0063] The FEC_payload_lD is determined by the FEC Encoding ID that must be communicated in the Session Description, class pseudoheader { unsigned int(32) source_address; unsigned int(32) destination_address; unsigned int(8) zero; unsigned int(8) protocol; unsigned int(16) UDPJength; }
class UDPheader { pseudoheader pheader; unsigned int(16) source_port; unsigned int(16) destination_port; unsigned int(16) length; unsigned int(16) checksum; }
class LCTheader { unsigned int(4) V_bits; unsigned int(2) C_bits; unsigned int(2) reserved; unsigned int(l) S_bit; unsigned int(2) O_bits; unsigned int(l) H_bit; unsigned int(l) T_bit; unsigned int(2) R_bit; unsigned int(2) A_bit; unsigned int(2) B_bit; unsigned int(8) header_length; unsigned int(8) codepoint unsigned int((C_bits+l)*32) congestion_control_information; unsigned int(S_bit*32 + H_bit*16) transport_session_identifier; unsigned int(O_bits*32 + H_bit* 16) transport_object_identifier; //For EXTJFDT, TOI=O if(T_bit == 1) { unsigned int(32) sender_current_time;
} if (TJbit = l) { unsigned int(32) expected_residual_time;
} if (headerjength > (32 + (C_bits+1)*32 + S_bit*32 + H_bit*16 + O_bits*32 + H_bit*16) ) {
LCTheaderextentions header_extention;
} }
class LCTheaderextentions { unsigned int(8) header_extention_type; //192- EXT_FDT, 193- EXT_CENC, 64- EXT_FTI if (header_extention_type <= 127) { unsigned int(8) header_extention_length;
}
if (header_extention_type = 64) { unsigned int(48) transfer_length; if ((FEC_encoding_ID == 0)||(FEC_encoding_ID == 128)||(FEC_encoding_ID == 130)) { unsigned int(16) encoding_symbol_length; unsigned int(32) max_source_block_length;
} else if ((FECjmcodingJD >= 128)||(FEC_encoding_ID <= 255)) { unsigned int(16) FEC_instance_ID;
} else if (FEC_encoding_ID = 129) { unsigned int(16) encoding_symbol_length; unsigned int(16) max_source_block_length; unsigned int(16) max_num_of_encoding_symbol; }
} else if (header_extention_type == 192) { unsigned int(4) version = 1 ; unsigned int(20) FDTJnstanceJD;
} else if (header_extention_type === 193){ unsigned int(8) content_encoding_algorithm; //ZLIB,DEFLATE,GZIP unsigned int(16) reserved = 0;
} else { byte other_extentions_content[]; } } [0064] The FEC_encoding_ID used below must be signalled in the session description. The header_extension_type = 64 is valid for both ALC and FLUTE. It is used for EXTJ7TI, the header extension that carries the FEC Object Transmission Information. The header_extension_type = 192 is valid only for FLUTE. It is used for EXT_FDT, the header extension that informs that the particular FLUTE packet contains the FDT Instance as a payload, rather than the file. The header_extension_type = 193 is valid only for FLUTE. It is used for EXT_CENC, the header extension that indicates the content encoding used by the FDT Instance. [0065] (d) Constructor Format. There are various forms of the constructor. Each constructor is 16 bytes in order to make iteration easier. The first byte is a union discriminator. This structure is based on section 10.3.2 from ISO/IEC 15444- 12:2005. aligned(8) class BMFDPconstructor(type) { uint protocol; // 0 for ALC, 1 for FLUTE unsigned int(8) constructorjype = type; }
aligned(8) class BMFDPnoopconstructor extends BMFDPconstructor(O)
{ uint(8) pad[15];
}
aligned(8) class BMFDPimmediateconstructor extends BMFDPconstructor(l)
{ unsigned int(8) count; unsigned int(8) data[count]; unsigned int(8) pad[14 - count]; }
aligned(8) class BMFDPsampleconstructor extends BMFDPconstructor(2)
{ signed int(8) trackrefindex; unsigned int(16) length; unsigned int(32) samplenumber; unsigned int(32) sampleoffset; unsigned int(16) bytesperblock = 1; unsigned int(l 6) samplesperblock = 1; }
aligned(8) class BMFDPsampledescriptionconstructor extends BMFDPconstructor(3)
{ signed int(8) trackrefindex; unsigned int(16) length; unsigned int(32) sampledescriptionindex; unsigned int(32) sampledescriptionoffset; unsigned int(32) reserved; }
aligned(8) class BMFDPitemconstructor extends BMFDPconstructor(4)
{ unsigned int(16) item_ID; unsigned int(16) extent_index; unsigned int(64) data_offset; //offset in byte within extent unsigned int(32) data_length; //length in byte within extent }
aligned(8) class BMFDPxmlboxconstructor extends BMFDPconstructor(5)
{ unsigned int(64) data_offset; //offset in byte within XMLBox or BinaryXMLBox unsigned int(32) data_length; unsigned int(32) reserved; }
[0066] FDT data is one part of the whole FLUTE/ALC data stream. This data is either transmitted in-band during the FLUTE session in the form of FLUTE packets, or out-of-band during an ALC session using ESG or by other means. A constructor defined herein is used to map SGDU to the ALC packet. The syntax of the constructor is provided as follows: aligned(8) class ALCsgduconstructor extends BMFDPconstructor(θ)
{ unsigned int(2) sgdu_box; //O-'sgdp', l-'sgdnϊ, 2-'sgdi', 3-'sgdt' if ((sgdu_box==0)||(sgdu_box==l) ||(sgdu__box==2)) { unsigned int(30) instance_index; //index of the SGDU unsigned int(64) data_offset; //offset in byte within the given SGDU unsigned int(32) datajength; //length in byte within the given SGDU
} else { unsigned int(64) data_offset; //offset in byte within the given SGDU box unsigned int(32) data_length; //length in byte within the given SGDU box bit pad[30]; //padding bits
} }
[0067] The following is a discussion of the use of hint tracks for ALC + RTP. In the case that both RTP and BMFDP (ALC/FLUTE) packets are transmitted simultaneously during a presentation, both constructors for RTP and BMFDP are used. RTP packets are used to transmit the continuous media and SVG content, while BMFDP packets are used to transmit the discrete media. A different hint system is used for this case. Such a system can combine all of the RTP and BMFDP samples in a correct time order.
[0068] United States Provisional Patent Application No. 60/713,303 describes boxes for FLUTE + RTP hint track information at different levels. However, the same boxes can be generalized for broadcast download to include ALC as well. Hence, the boxes below are prefixed with 'BMFDP' to be more generic. [0069] In order to facilitate the generation of BMFDP and RTP packets for a presentation, the hint track format for BMFDP + RTP is defined below. Similar to the hierarchy of RTP and BMFDP hint track, the BMFDPRtpHintSampleEntry and BMFDPRTPsample are defined. In addition, the data in TimeToSampleBox gives the time information for each packet.
[0070] (a) Sample Description Format. BMFDP+RTP hint tracks are hint tracks (media handler 'hint') with an entry-format in the sample description of 'frhs': BMFDPRtpHintSampleEntry is defined within the SampledDescriptionBox 'stsd'
class BMFDPRtpHintSampleEntryO extends SampleEntry ('brhs') { uint protocol; // 0 for ALC, 1 for FLUTE uint( 16) hinttrackversion = 1 ; uint(l 6) highestcompatibleversion = 1 ; uint(32) maxpacketsize; box additionaldataQ,"
}
[0071] The hinttrackversion is currently 1. The highest compatible version field specifies the oldest version with which this track is backward compatible. The maxpacketsize indicates the size of the largest packet that this track will generate. The additional data is a set of boxes ('tims', and 'tsro' ) which are defined in the ISO Base Media File Format.
[0072] (b) Sample Format. BMFDPRTPSample is defined within the MediaDataBox ('mdat'). This box contains multiple BMFDP samples, RTP samples, possible FDT/SGDU and SDP information and any extra data. One BMFDPRTPSample may contain either one FDT/SGDU data, SDP data, BMFDP sample, or RTP sample. BMFDPRTPSamples that contain BMFDP samples are used here only to transmit the discrete media. Such media are always embedded in the Scene or Scene Update among the SVG presentation. Their start-times are the same as the start-time of Scene/Scene Update they belong to. BMFDP samples do not have their own specific timestamps, but instead are sent sequentially, immediately after the RTP samples of the Scene/Scene Update they belong to. Therefore in the TimeToSampleBox, the sample-deltas of the BMFDPRTPSample for discrete media are all set to zero. Their sequential order represents their sending-time order. [0073] UE may have limited power and can support only one transmission session at any time instant, and the BMFDP sessions and RTP sessions need to be interleaved one by one. One shall be started immediately after the other is finished. In this case, the descriptionjextl, description_text2 and descriptioη_text3 fields below are used to provide SDP and FDT/SGDU information for each session. aligned(8) class BMFDPRTPSample { uint protocol; // 0 for ALC, 1 for FLUTE unit(2) samplejype; unsigned int(6) reserved; if (samplejype == 0) {
//FDT instance info for the FLUTE samples or SGDU info for the ALC //samples char metadatatextrj;
} else if (samplejype == 1) { char sdptextQ; //SDP info for the samples
} else if (samplejype = 2 && protocol = 0) { BMFDPsample alc_sample;
} else if (sample Jype == 2 && protocol == 1) { BMFDPsample flute_sample;
} else {
RTPsample rtp_sample;
} byte extradata[];
}
[0074] In addition to the above, there are several other potential implementations of the present invention. The following are a discussion of some such alternative implementations. A fourth implementation of the present invention is similar to the first implementation discussed above. However, other description formats such as DCCPtext may be stored, in which case the sdptext field will change accordingly. A fifth implementation is similar to the second implementation. However, in this embodiment, a single fragment field can contain all of the fragment data in the ESG. The application can then choose to either split this data into different fields for all levels or for files.
[0075] There are several potential implementations of the present invention is similar in many respects to the third implementation discuss above, with various differences. For example, one may redefine the 'hnti' box at other levels, for example to contain presentation-level or item-level session information. For a sample description format for hint track information for ALC, the hinttrackversion and highestcompatibleversion fields may have different values, and a minpacketsize field may be added in addition to the maxpacketsize field. For sample formats for the hint track format for ALC, the packetcount field can be made 32 bits by removing the reserved field. For the packet entry format for the hint track format for ALC, the hierarchical structure of the different header boxes (BMFDPheader, UDPheader, LCTheader, etc.) could be different. For the constructor format for the hint track format for ALC, the ALCsgdutconstructor syntax can have separate field definitions for each sgdu_box, the BMFDPitemconstructor may have item_id replaced by itemjname, the BMFDPxmlboxconstructor can have the data_length field to be made to 64 bytes by removing the reserved field, and the BMFDPxmlboxconstructor can have the data_length field to be made to 16 bytes and adjust reserved field to 64 bytes. For hint tracks for ALC + RTP, the BMFDPRtpHintSampleEntry can have the hinttrackversion and highestcompatibleversion fields to be of different values, the BMFDPRtpHintSampleEntry can add a minpacketsize field in addition to the maxpacketsize field, and the BMFDPRTPSample box can have separate field definitions for each sample__type.
[0076] There are several use cases for rich media services that can benefit from using ALC as a protocol. A first such use case involves the preview of long cartoon animations. The service of the present invention allows an end-user to progressively download small portions of each animation before deciding which one he or she wishes to view in its entirety.
[0077] A second use case for the present invention involves interactive Mobile TV services. With the present invention a deterministic rendering and behavior of rich- media content can be delivered together in the end user interface. The content can include audio-video content, text, graphics, images, and TV and radio channels. The service must provide convenient navigation through content in a single application or service, and the service must allow synchronized interaction in local or in distant settings, such as voting and personalization (e.g., related menus or sub-menus, advertising and content in function of the end-user profile or service subscription). This use case is described in four steps corresponding to four services and sub- services available in an iTV mobile service — a mosaic menu showing the TV channel landscape, an electronic program guide and triggering of a related iTV service, the iTV service, and personalized menus, such as "sports news."
[0078] A third use case for the present invention involves the use of a live enterprise data feed. This service includes, for example, stock tickers that provide streaming of real-time quotes, intra-day charts with technical indicators, news monitoring, weather alerts, charts, business updates, sports scores, etc.
[0079] A fourth use case for the present invention involves live chat services. Live chat services can be incorporated within a web cam, video channel, or a rich-media blog service. End-users can register, save their surname and exchange messages. Messages appear dynamically in the live chat service, along with rich-media data provided by the end-user. The chat service can be either private or public in one or more multiple channels at the same time. End users are dynamically alerted of new messages from other users. Dynamic updates of messages within the service occur without reloading a complete page.
[0080] A fifth use case for the present invention involves karaoke services. Karaoke services display a music TV channel or video clip catalog, along with the speech of a song with fluid-like animation on the text characters to be singing (e.g., a smooth color transition of fonts, scrolling of text, etc) The end-user can download a song of his or her choice, along with the complete animation by selecting an interactive button. Similar systems can also be used for the reenactment of movie or television shows or clips.
[0081] Figures 4 and 5 show one representative electronic device 12 within which the present invention may be implemented. It should be understood, however, that the present invention is not intended to be limited to one particular type of electronic device. The electronic device 12 of Figures 4 and 5 includes a housing 30, a display 32 in the form of a liquid crystal display, a keypad 34, a microphone 36, an ear-piece 38, a battery 40, an infrared port 42, an antenna 44, a smart card 46 in the form of a UICC according to one embodiment of the invention, a card reader 48, radio interface circuitry 52, codec circuitry 54, a controller 56 and a memory 58. Individual circuits and elements are all of a type well known in the art, for example in the Nokia range of mobile telephones.
[0082] The present invention is described in the general context of method steps, which may be implemented in one embodiment by a program product including computer-executable instructions, such as program code, executed by computers in networked environments. Generally, program modules include routines, programs, objects, components, data structures, etc. that perform particular tasks or implement particular abstract data types. Computer-executable instructions, associated data structures, and program modules represent examples of program code for executing steps of the methods disclosed herein. The particular sequence of such executable instructions or associated data structures represents examples of corresponding acts for implementing the functions described in such steps. '
[0083] Software and web implementations of the present invention could be accomplished with standard programming techniques with rule based logic and other logic to accomplish the various database searching steps, correlation steps, comparison steps and decision steps. It should also be noted that the words "component" and "module," as used herein and in the claims, is intended to encompass implementations using one or more lines of software code, and/or hardware implementations, and/or equipment for receiving manual inputs. [0084] The foregoing description of embodiments of the present invention have been presented for purposes of illustration and description. It is not intended to be exhaustive or to limit the present invention to the precise form disclosed, and modifications and variations are possible in light of the above teachings or may be acquired from practice of the present invention. The embodiments were chosen and described in order to explain the principles of the present invention and its practical application to enable one skilled in the art to utilize the present invention in various embodiments and with various modifications as are suited to the particular use contemplated.

Claims

WHAT IS CLAIMED IS:
1. A method for progressively providing rich media content to a client device, comprising: encoding an ISO Base Media File, in accordance with an ISO Base Media File format, from input information including: scalable vector graphics, file metadata information for one of ALC and FLUTE, and hint track information for one of ALC and FLUTE packetization; and transmitting the encoded ISO Base Media File in a plurality of RTP packets and packets selected from the group consisting of ALC packets and FLUTE packets to the client device, the ISO Base Media File including metadata of discrete files.
2. The method of claim 1 , wherein the input information further includes session description protocol (SDP) information for one of ALC and FLUTE, and wherein the ISO Base Media File further includes session description information.
3. The method of claim 1 , wherein the ISO Base Media File includes a plurality of SDP boxes for containing either ALC or FLUTE information.
4. The method of claim 1 , wherein the scalable vector graphics contain both discrete and continuous media, and wherein the ISO Base Media File includes a plurality of SDP boxes for containing either ALC or FLUTE information, in addition to RTP information.
5. The method of claim 1 , wherein the ISO Base Media File includes a plurality of SDP boxes for containing either ALC or FLUTE metadata information.
6. The method of claim 3, wherein the scalable vector graphics contain both discrete and continuous media, and wherein the plurality of SDP boxes are for containing either ALC or FLUTE information, in addition to RTP metadata information.
7. The method of claim 1 , wherein the ISO Base Media File includes a plurality of SDP boxes for containing either ALC or FLUTE hint track information.
8. The method of claim 7, wherein the scalable vector graphics contain both discrete and continuous media, and wherein the plurality of SDP boxes are for containing either ALC or FLUTE information, in addition to RTP hint track information.
9. A computer program product for progressively providing rich media content to a client device, comprising: computer code for encoding an ISO Base Media File, in accordance with an ISO Base Media File format, from input information including: scalable vector graphics, file metadata information for one of ALC and FLUTE, and hint track information for one of ALC and FLUTE packetization; and computer code for transmitting the encoded ISO Base Media File in a plurality of RTP packets and packets selected from the group consisting of ALC packets and FLUTE packets to the client device, the ISO Base Media File including metadata of discrete files .
10. The computer program product of claim 9, wherein the input information further includes session description protocol (SDP) information for one of ALC and FLUTE, and wherein the ISO Base Media File further includes session description information.
11. The computer program product of claim 9, wherein the ISO Base Media File includes a plurality of SDP boxes for containing either ALC or FLUTE information.
12. The computer program product of claim 11 , wherein the scalable vector graphics contain both discrete and continuous media, and wherein the ISO Base Media File includes a plurality of SDP boxes for containing either ALC or FLUTE information, in addition to RTP information.
13. The computer program product of claim 9, wherein the ISO Base Media File includes a plurality of SDP boxes for containing either ALC or FLUTE metadata information.
14. The computer program product of claim 13, wherein the scalable vector graphics contain both discrete and continuous media, and wherein the ISO Base Media File includes a plurality of SDP boxes for containing either ALC or FLUTE information, in addition to RTP metadata information.
15. The computer program product of claim 9, wherein the ISO Base Media File includes a plurality of SDP boxes for containing either ALC or FLUTE hint track information.
16. The computer program product of claim 15, wherein the scalable vector graphics contain both discrete and continuous media, and wherein the ISO Base Media File includes a plurality of SDP boxes for containing either ALC or FLUTE information, in addition to RTP hint track information.
17. An electronic device, comprising: a processor; and a memory unit communicatively connected to a processor and including: computer code for encoding an ISO Base Media File, in accordance with an ISO Base Media File format, from input information including: scalable vector graphics, file metadata information for one of ALC and FLUTE, and hint track information for one of ALC and FLUTE packetization; and computer code for transmitting the encoded ISO Base Media File in a plurality of RTP packets and packets selected from the group consisting of ALC packets and FLUTE packets to the client device, the ISO Base Media File including metadata of discrete files.
18. The electronic device of claim 17, wherein the input information further includes session description protocol (SDP) information for one of ALC and FLUTE, and wherein the ISO Base Media File further includes session description information.
19. The electronic device of claim 17, wherein the ISO Base Media File includes a plurality of SDP boxes for containing either ALC or FLUTE information.
20. The electronic device of claim 19, wherein the scalable vector graphics contain both discrete and continuous media, and wherein the ISO Base Media File includes a plurality of SDP boxes for containing either ALC or FLUTE information, in addition to RTP information.
21. The electronic device of claim 17, wherein the ISO Base Media File includes a plurality of SDP boxes for containing either ALC or FLUTE metadata information.
22. The electronic device of claim 21 , wherein the scalable vector graphics contain both discrete and continuous media, and wherein the ISO Base Media File includes a plurality of SDP boxes for containing either ALC or FLUTE information, in addition to RTP metadata information.
23. The electronic device of claim 17, wherein the ISO Base Media File includes a plurality of SDP boxes for containing either ALC or FLUTE hint track information.
24. The electronic device of claim 23, wherein the scalable vector graphics contain both discrete and continuous media, and wherein the ISO Base Media File includes a plurality of SDP boxes for containing either ALC or FLUTE information, in addition to RTP hint track information.
25. A method of exhibiting rich media content on a client device, comprising: receiving from a rich media server an ISO Base Media File in a plurality of RTP packets and a plurality of packets selected from the group consisting of ALC packets and FLUTE packets, the ISO Base Media File including metadata of discrete files; decoding the ISO Base Media File; and playing the decoded ISO Base Media File.
26. The method of claim 25, wherein the ISO Base Media File further includes session description information.
27. The method of claim 25, wherein the ISO Base Media File includes a plurality of SDP boxes for containing either ALC or FLUTE information.
28. The method of claim 27, wherein the ISO Base Media File is generated from scalable vector graphics containing both discrete and continuous media, and wherein the ISO Base Media File includes a plurality of SDP boxes for containing either ALC or FLUTE information, in addition to RTP information.
29. The method of claim 25, wherein the ISO Base Media File includes a plurality of SDP boxes for containing either ALC or FLUTE metadata information.
30. The method of claim 29, wherein the ISO Base Media File is generated from scalable vector graphics containing both discrete and continuous media, and wherein the ISO Base Media File includes a plurality of SDP boxes for containing either ALC or FLUTE information, in addition to RTP metadata information.
31. The method of claim 25, wherein the ISO Base Media File includes a plurality of SDP boxes for containing either ALC or FLUTE hint track information.
32. The method of claim 31 , wherein the ISO Base Media File is generated from scalable vector graphics containing both discrete and continuous media, and wherein the ISO Base Media File includes a plurality of SDP boxes for containing either ALC or FLUTE information, in addition to RTP hint track information.
33. A method for progressively providing rich media content to a client device, comprising: creating an ISO Base Media File, in accordance with an ISO Base Media File format, from input information; encoding the ISO Base Media File; and transmitting the encoded ISO Base Media File to the client device.
34. A computer program product, included in a computer-readable media, for progressively providing rich media content to a client device, comprising: computer code for creating an ISO Base Media File, in accordance with an ISO Base Media File format, from input information; computer code for encoding the ISO Base Media File; and computer code for transmitting the encoded ISO Base Media File to the client device.
35. An electronic device, comprising: a processor; and a memory unit communicatively connected to the processor and including a computer program product for progressively providing rich media content to a client device, comprising: computer code for creating an ISO Base Media File, in accordance with an ISO Base Media File format, from input information; computer code for encoding the ISO Base Media File; and computer code for transmitting the encoded ISO Base Media File to the client device.
PCT/IB2007/000073 2006-01-11 2007-01-11 Extensions to rich media container format for use by mobile broadcast/multicast streaming servers WO2007080500A1 (en)

Priority Applications (2)

Application Number Priority Date Filing Date Title
EP07705423.7A EP1974526B1 (en) 2006-01-11 2007-01-11 Extensions to rich media container format for use by mobile broadcast/multicast streaming servers
ES07705423.7T ES2526814T3 (en) 2006-01-11 2007-01-11 Extensions for Rich Media Containers Format for use by general stream / multicast streaming servers

Applications Claiming Priority (2)

Application Number Priority Date Filing Date Title
US75825606P 2006-01-11 2006-01-11
US60/758,256 2006-01-11

Publications (1)

Publication Number Publication Date
WO2007080500A1 true WO2007080500A1 (en) 2007-07-19

Family

ID=38256031

Family Applications (1)

Application Number Title Priority Date Filing Date
PCT/IB2007/000073 WO2007080500A1 (en) 2006-01-11 2007-01-11 Extensions to rich media container format for use by mobile broadcast/multicast streaming servers

Country Status (6)

Country Link
US (1) US7917644B2 (en)
EP (2) EP2790380A1 (en)
KR (1) KR100959574B1 (en)
CN (2) CN105024852B (en)
ES (1) ES2526814T3 (en)
WO (1) WO2007080500A1 (en)

Cited By (7)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
WO2009024926A1 (en) * 2007-08-20 2009-02-26 Nokia Corporation Segmented metadata and indexes for streamed multimedia data
EP2076032A2 (en) * 2007-12-26 2009-07-01 Lg Electronics Inc. Method and apparatus for processing service guide information
EP2184889A1 (en) * 2007-08-02 2010-05-12 Huawei Technologies Co., Ltd. Media service showing method and communication system and relative device
EP2301174A2 (en) * 2008-07-16 2011-03-30 Samsung Electronics Co., Ltd. Method and apparatus for providing rich media service
CN102158811A (en) * 2008-04-11 2011-08-17 华为技术有限公司 Method and device of present mode of notification information in BCAST (broadcast)
US8510375B2 (en) 2009-12-11 2013-08-13 Nokia Corporation Apparatus and methods for time mapping media segments in streaming media files
CN113170238A (en) * 2018-09-12 2021-07-23 诺基亚技术有限公司 Apparatus, method and computer program for video encoding and decoding

Families Citing this family (44)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
EP1760933B1 (en) * 2004-08-06 2012-03-14 Panasonic Corporation Feedback control for multicast or broadcast services
FR2907627B1 (en) * 2006-10-20 2008-12-19 Alcatel Sa TRANSPORT CHANNEL TYPE SELECTION DEVICE FOR CONTENT BROADCAST TO COMMUNICATION TERMINALS
US8391775B2 (en) * 2007-03-09 2013-03-05 Airbiquity Inc. Mobile digital radio playlist system
US8678896B2 (en) 2007-06-14 2014-03-25 Harmonix Music Systems, Inc. Systems and methods for asynchronous band interaction in a rhythm action game
US8678895B2 (en) 2007-06-14 2014-03-25 Harmonix Music Systems, Inc. Systems and methods for online band matching in a rhythm action game
KR101418591B1 (en) * 2007-10-05 2014-07-10 삼성전자주식회사 Apparatus and method for announcing service guides in mobile communication system
CN101911693A (en) * 2007-12-03 2010-12-08 诺基亚公司 Systems and methods for storage of notification messages in ISO base media file format
US20110093880A1 (en) * 2008-02-22 2011-04-21 Nokia Corporation Apparatus and method of providing an integrated rich media environment
WO2009114111A2 (en) * 2008-03-12 2009-09-17 Packetvideo Corp. System and method for reformatting digital broadcast multimedia for a mobile device
KR20090119412A (en) * 2008-05-16 2009-11-19 엘지전자 주식회사 Mobile terminal and method for controlling broadcast contents purchase therein
US8526350B2 (en) * 2008-05-23 2013-09-03 Qualcomm Incorporated Systems and methods for carrying broadcast services over a mobile broadcast network
WO2010006054A1 (en) * 2008-07-08 2010-01-14 Harmonix Music Systems, Inc. Systems and methods for simulating a rock and band experience
US20100037258A1 (en) * 2008-08-07 2010-02-11 Research In Motion Limited Mobile broadcasting system and method for enhancing mobile broadcasting services with rich media including an enhanced service guide
AU2009311364A1 (en) * 2008-10-28 2010-05-14 Airbiquity Inc. Purchase of a piece of music being played on a radio in a vehicle
US9235572B2 (en) * 2008-10-31 2016-01-12 Disney Enterprises, Inc. System and method for updating digital media content
US8315994B2 (en) 2008-10-31 2012-11-20 Disney Enterprises, Inc. System and method for updating digital media content
CA2744183C (en) 2008-11-18 2014-05-27 Lg Electronics Inc. Non-real time service processing method and broadcast receiver
US8904191B2 (en) * 2009-01-21 2014-12-02 Microsoft Corporation Multiple content protection systems in a file
US8465366B2 (en) 2009-05-29 2013-06-18 Harmonix Music Systems, Inc. Biasing a musical performance input to a part
US8449360B2 (en) 2009-05-29 2013-05-28 Harmonix Music Systems, Inc. Displaying song lyrics and vocal cues
US9002574B2 (en) 2009-10-15 2015-04-07 Airbiquity Inc. Mobile integration platform (MIP) integrated handset application proxy (HAP)
US8942888B2 (en) 2009-10-15 2015-01-27 Airbiquity Inc. Extensible scheme for operating vehicle head unit as extended interface for mobile device
US8838332B2 (en) * 2009-10-15 2014-09-16 Airbiquity Inc. Centralized management of motor vehicle software applications and services
US8831823B2 (en) * 2009-10-15 2014-09-09 Airbiquity Inc. Centralized management of motor vehicle software applications and services
US9981193B2 (en) 2009-10-27 2018-05-29 Harmonix Music Systems, Inc. Movement based recognition and evaluation
US10357714B2 (en) * 2009-10-27 2019-07-23 Harmonix Music Systems, Inc. Gesture-based user interface for navigating a menu
US8568234B2 (en) 2010-03-16 2013-10-29 Harmonix Music Systems, Inc. Simulating musical instruments
US9358456B1 (en) 2010-06-11 2016-06-07 Harmonix Music Systems, Inc. Dance competition game
EP2579955B1 (en) 2010-06-11 2020-07-08 Harmonix Music Systems, Inc. Dance game and tutorial
US8562403B2 (en) 2010-06-11 2013-10-22 Harmonix Music Systems, Inc. Prompting a player of a dance game
CN101950427B (en) * 2010-09-08 2011-11-16 东莞电子科技大学电子信息工程研究院 Vector line segment contouring method applicable to mobile terminal
US9024166B2 (en) 2010-09-09 2015-05-05 Harmonix Music Systems, Inc. Preventing subtractive track separation
US9438883B2 (en) * 2012-04-09 2016-09-06 Intel Corporation Quality of experience reporting for combined unicast-multicast/broadcast streaming of media content
JP6154894B2 (en) 2012-06-08 2017-06-28 エアビクティ インコーポレイテッド A method for evaluating electronic sensor data to remotely identify vehicles and monitor driver behavior
WO2014079027A1 (en) * 2012-11-22 2014-05-30 Thomson Licensing Apparatus and method for extending tv services with rich media services
EP2936861B1 (en) 2012-12-20 2021-07-21 Airbiquity, Inc. Efficient headunit communication integration
WO2014152820A1 (en) 2013-03-14 2014-09-25 Vdopia Inc. Systems and methods for layering content
US9781181B2 (en) * 2013-06-17 2017-10-03 Qualcomm Incorporated Multiple file delivery over unidirectional transport protocol sessions for a service
US10476693B2 (en) * 2014-02-24 2019-11-12 Lg Electronics Inc. Apparatus for transmitting broadcast signals, apparatus for receiving broadcast signals, method for transmitting broadcast signals and method for receiving broadcast signals
US9740792B2 (en) * 2014-06-18 2017-08-22 Vmware, Inc. Connection paths for application topology
US9836284B2 (en) 2014-06-18 2017-12-05 Vmware, Inc. HTML5 graph layout for application topology
US9852114B2 (en) 2014-06-18 2017-12-26 Vmware, Inc. HTML5 graph overlays for application topology
US11088893B2 (en) * 2014-11-12 2021-08-10 Telefonaktiebolaget Lm Ericsson (Publ) Methods and devices for negotiating session descriptor parameters
TW201931863A (en) * 2018-01-12 2019-08-01 圓剛科技股份有限公司 Multimedia signal synchronization apparatus and synchronization method thereof

Citations (3)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
WO2004056096A1 (en) * 2002-12-18 2004-07-01 Nokia Corporation Method of announcing sessions
WO2005039131A1 (en) 2003-10-17 2005-04-28 Telefonaktiebolaget Lm Ericsson (Publ) Container format for multimedia presentations
WO2005060339A2 (en) * 2003-12-19 2005-07-07 Nokia Corporation Method and system for header compression

Family Cites Families (10)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
US6516435B1 (en) * 1997-06-04 2003-02-04 Kabushiki Kaisha Toshiba Code transmission scheme for communication system using error correcting codes
US7979886B2 (en) * 2003-10-17 2011-07-12 Telefonaktiebolaget Lm Ericsson (Publ) Container format for multimedia presentations
US7536622B2 (en) * 2004-03-29 2009-05-19 Nokia Corporation Data repair enhancements for multicast/broadcast data distribution
US20050216472A1 (en) * 2004-03-29 2005-09-29 David Leon Efficient multicast/broadcast distribution of formatted data
MXPA06014093A (en) * 2004-06-25 2007-02-15 Nokia Corp File delivery session handling.
EP1760933B1 (en) * 2004-08-06 2012-03-14 Panasonic Corporation Feedback control for multicast or broadcast services
US20070192818A1 (en) * 2004-10-12 2007-08-16 Mikael Bourges-Sevenier System and method for creating, distributing, and executing rich multimedia applications
US20060193337A1 (en) * 2005-02-25 2006-08-31 Toni Paila Device management broadcast operation
EP2894831B1 (en) * 2005-06-27 2020-06-03 Core Wireless Licensing S.a.r.l. Transport mechanisms for dynamic rich media scenes
CN101300810A (en) * 2005-09-01 2008-11-05 诺基亚公司 Method for embedding SVG content into an ISO base media file format for progressive downloading and streaming of rich media content

Patent Citations (3)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
WO2004056096A1 (en) * 2002-12-18 2004-07-01 Nokia Corporation Method of announcing sessions
WO2005039131A1 (en) 2003-10-17 2005-04-28 Telefonaktiebolaget Lm Ericsson (Publ) Container format for multimedia presentations
WO2005060339A2 (en) * 2003-12-19 2005-07-07 Nokia Corporation Method and system for header compression

Non-Patent Citations (4)

* Cited by examiner, † Cited by third party
Title
"3GPP DRAFT; 29N70112, 3RD GENERATION PARTNERSHIP PROJECT (3GPP", vol. SA WG4, 9 November 2005, MOBILE COMPETENCE CENTRE, article "INTERNATIONAL ORGANISATION FOR STANDARDISATION ORGANISATION INTERNATIONALE DE NORMALISATION ISOIIEC JTC1/SC29/WG11 MPEG/N7583"
"3GPP STANDARD; 3GPP TS 26.234 3RD GENERATION PARTNERSHIP PROJECT", 1 September 2005, MOBILE COMPETENCE CENTRE, article "3rd Generation Partnership Project; Technical Specification Group Services and System Aspects; Transparent end-to-end Packet-switched Streaming Service (PSS); Protocols and codecs (Release 6", pages: 1 - 126
PAILA NOKIA M LUBY, FLUTE - FILE DELIVERY OVER UNIDIRECTIONAL TRANSPORT; DRAFT-IETF-RMT-FILUTE-08.TXT, vol. RMT, no. 8, 5 June 2004 (2004-06-05)
See also references of EP1974526A4

Cited By (16)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
EP2184889A1 (en) * 2007-08-02 2010-05-12 Huawei Technologies Co., Ltd. Media service showing method and communication system and relative device
EP2184889A4 (en) * 2007-08-02 2012-07-11 Huawei Tech Co Ltd Media service showing method and communication system and relative device
US9277181B2 (en) 2007-08-02 2016-03-01 Huawei Technologies Co., Ltd. Media service presentation method and communication system and related device
EP2191402A4 (en) * 2007-08-20 2014-05-21 Nokia Corp Segmented metadata and indexes for streamed multimedia data
EP2191402A1 (en) * 2007-08-20 2010-06-02 Nokia Corporation Segmented metadata and indexes for streamed multimedia data
WO2009024926A1 (en) * 2007-08-20 2009-02-26 Nokia Corporation Segmented metadata and indexes for streamed multimedia data
US9852219B2 (en) 2007-08-20 2017-12-26 Nokia Technologies Oy Segmented metadata and indexes for streamed multimedia data
EP2076032A2 (en) * 2007-12-26 2009-07-01 Lg Electronics Inc. Method and apparatus for processing service guide information
EP2076032A3 (en) * 2007-12-26 2011-12-28 LG Electronics Inc. Method and apparatus for processing service guide information
CN102158811A (en) * 2008-04-11 2011-08-17 华为技术有限公司 Method and device of present mode of notification information in BCAST (broadcast)
EP2301174A2 (en) * 2008-07-16 2011-03-30 Samsung Electronics Co., Ltd. Method and apparatus for providing rich media service
AU2009271869B2 (en) * 2008-07-16 2013-10-17 Samsung Electronics Co., Ltd. Method and apparatus for providing rich media service
EP2301174A4 (en) * 2008-07-16 2013-01-16 Samsung Electronics Co Ltd Method and apparatus for providing rich media service
US8510375B2 (en) 2009-12-11 2013-08-13 Nokia Corporation Apparatus and methods for time mapping media segments in streaming media files
CN113170238A (en) * 2018-09-12 2021-07-23 诺基亚技术有限公司 Apparatus, method and computer program for video encoding and decoding
CN113170238B (en) * 2018-09-12 2023-08-01 诺基亚技术有限公司 Apparatus, method and computer program for video encoding and decoding

Also Published As

Publication number Publication date
EP2790380A1 (en) 2014-10-15
EP1974526A1 (en) 2008-10-01
CN101390367A (en) 2009-03-18
KR20080083353A (en) 2008-09-17
US20070180133A1 (en) 2007-08-02
CN105024852B (en) 2020-01-03
ES2526814T3 (en) 2015-01-15
US7917644B2 (en) 2011-03-29
KR100959574B1 (en) 2010-05-27
EP1974526B1 (en) 2014-10-29
EP1974526A4 (en) 2012-11-28
CN105024852A (en) 2015-11-04

Similar Documents

Publication Publication Date Title
US7917644B2 (en) Extensions to rich media container format for use by mobile broadcast/multicast streaming servers
US20070186005A1 (en) Method to embedding SVG content into ISO base media file format for progressive downloading and streaming of rich media content
US11785289B2 (en) Receiving device, transmitting device, and data processing method
JP6441521B2 (en) Control message composition apparatus and method in broadcast system
US7746882B2 (en) Method and device for assembling forward error correction frames in multimedia streaming
KR100939030B1 (en) Auxiliary content handling over digital communication systems
Lim et al. New MPEG transport standard for next generation hybrid broadcasting system with IP
Herpel et al. MPEG-4 Systems: Elementary stream management
Lee et al. Converged mobile TV services supporting rich media in cellular and DVB-H systems
Setlur et al. More: a mobile open rich media environment
Herpel et al. MPEG-4 systems: elementary stream management and delivery

Legal Events

Date Code Title Description
121 Ep: the epo has been informed by wipo that ep was designated in this application
NENP Non-entry into the national phase

Ref country code: DE

WWE Wipo information: entry into national phase

Ref document number: 6188/DELNP/2008

Country of ref document: IN

WWE Wipo information: entry into national phase

Ref document number: 2007705423

Country of ref document: EP

WWE Wipo information: entry into national phase

Ref document number: 1020087019296

Country of ref document: KR

WWE Wipo information: entry into national phase

Ref document number: 200780006785.1

Country of ref document: CN

WWP Wipo information: published in national office

Ref document number: 2007705423

Country of ref document: EP