WO2012049305A1 - Method for synchronizing multimedia flows and corresponding device - Google Patents

Method for synchronizing multimedia flows and corresponding device Download PDF

Info

Publication number
WO2012049305A1
WO2012049305A1 PCT/EP2011/068016 EP2011068016W WO2012049305A1 WO 2012049305 A1 WO2012049305 A1 WO 2012049305A1 EP 2011068016 W EP2011068016 W EP 2011068016W WO 2012049305 A1 WO2012049305 A1 WO 2012049305A1
Authority
WO
WIPO (PCT)
Prior art keywords
data
multimedia data
network
flow
flows
Prior art date
Application number
PCT/EP2011/068016
Other languages
French (fr)
Inventor
Anthony Laurent
Eric Gautier
Yvon Legallais
Original Assignee
Thomson Licensing
Priority date (The priority date is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the date listed.)
Filing date
Publication date
Application filed by Thomson Licensing filed Critical Thomson Licensing
Priority to CN2011800497729A priority Critical patent/CN103155584A/en
Priority to KR1020137009471A priority patent/KR101828639B1/en
Priority to US13/878,216 priority patent/US9729762B2/en
Priority to BR112013008587-8A priority patent/BR112013008587B1/en
Priority to JP2013533234A priority patent/JP6053686B2/en
Priority to AU2011315435A priority patent/AU2011315435B2/en
Priority to EP11770759.6A priority patent/EP2628297B1/en
Publication of WO2012049305A1 publication Critical patent/WO2012049305A1/en

Links

Classifications

    • HELECTRICITY
    • H04ELECTRIC COMMUNICATION TECHNIQUE
    • H04NPICTORIAL COMMUNICATION, e.g. TELEVISION
    • H04N5/00Details of television systems
    • H04N5/04Synchronising
    • HELECTRICITY
    • H04ELECTRIC COMMUNICATION TECHNIQUE
    • H04NPICTORIAL COMMUNICATION, e.g. TELEVISION
    • H04N21/00Selective content distribution, e.g. interactive television or video on demand [VOD]
    • H04N21/20Servers specifically adapted for the distribution of content, e.g. VOD servers; Operations thereof
    • H04N21/23Processing of content or additional data; Elementary server operations; Server middleware
    • H04N21/242Synchronization processes, e.g. processing of PCR [Program Clock References]
    • HELECTRICITY
    • H04ELECTRIC COMMUNICATION TECHNIQUE
    • H04NPICTORIAL COMMUNICATION, e.g. TELEVISION
    • H04N21/00Selective content distribution, e.g. interactive television or video on demand [VOD]
    • H04N21/20Servers specifically adapted for the distribution of content, e.g. VOD servers; Operations thereof
    • H04N21/23Processing of content or additional data; Elementary server operations; Server middleware
    • H04N21/236Assembling of a multiplex stream, e.g. transport stream, by combining a video stream with other content or additional data, e.g. inserting a URL [Uniform Resource Locator] into a video stream, multiplexing software data into a video stream; Remultiplexing of multiplex streams; Insertion of stuffing bits into the multiplex stream, e.g. to obtain a constant bit-rate; Assembling of a packetised elementary stream
    • HELECTRICITY
    • H04ELECTRIC COMMUNICATION TECHNIQUE
    • H04NPICTORIAL COMMUNICATION, e.g. TELEVISION
    • H04N21/00Selective content distribution, e.g. interactive television or video on demand [VOD]
    • H04N21/20Servers specifically adapted for the distribution of content, e.g. VOD servers; Operations thereof
    • H04N21/23Processing of content or additional data; Elementary server operations; Server middleware
    • H04N21/238Interfacing the downstream path of the transmission network, e.g. adapting the transmission rate of a video stream to network bandwidth; Processing of multiplex streams
    • H04N21/2389Multiplex stream processing, e.g. multiplex stream encrypting
    • HELECTRICITY
    • H04ELECTRIC COMMUNICATION TECHNIQUE
    • H04NPICTORIAL COMMUNICATION, e.g. TELEVISION
    • H04N21/00Selective content distribution, e.g. interactive television or video on demand [VOD]
    • H04N21/40Client devices specifically adapted for the reception of or interaction with content, e.g. set-top-box [STB]; Operations thereof
    • H04N21/43Processing of content or additional data, e.g. demultiplexing additional data from a digital video stream; Elementary client operations, e.g. monitoring of home network or synchronising decoder's clock; Client middleware
    • H04N21/4302Content synchronisation processes, e.g. decoder synchronisation
    • H04N21/4307Synchronising the rendering of multiple content streams or additional data on devices, e.g. synchronisation of audio on a mobile phone with the video output on the TV screen
    • H04N21/43072Synchronising the rendering of multiple content streams or additional data on devices, e.g. synchronisation of audio on a mobile phone with the video output on the TV screen of multiple content streams on the same device
    • HELECTRICITY
    • H04ELECTRIC COMMUNICATION TECHNIQUE
    • H04NPICTORIAL COMMUNICATION, e.g. TELEVISION
    • H04N21/00Selective content distribution, e.g. interactive television or video on demand [VOD]
    • H04N21/40Client devices specifically adapted for the reception of or interaction with content, e.g. set-top-box [STB]; Operations thereof
    • H04N21/43Processing of content or additional data, e.g. demultiplexing additional data from a digital video stream; Elementary client operations, e.g. monitoring of home network or synchronising decoder's clock; Client middleware
    • H04N21/434Disassembling of a multiplex stream, e.g. demultiplexing audio and video streams, extraction of additional data from a video stream; Remultiplexing of multiplex streams; Extraction or processing of SI; Disassembling of packetised elementary stream
    • HELECTRICITY
    • H04ELECTRIC COMMUNICATION TECHNIQUE
    • H04NPICTORIAL COMMUNICATION, e.g. TELEVISION
    • H04N21/00Selective content distribution, e.g. interactive television or video on demand [VOD]
    • H04N21/40Client devices specifically adapted for the reception of or interaction with content, e.g. set-top-box [STB]; Operations thereof
    • H04N21/43Processing of content or additional data, e.g. demultiplexing additional data from a digital video stream; Elementary client operations, e.g. monitoring of home network or synchronising decoder's clock; Client middleware
    • H04N21/438Interfacing the downstream path of the transmission network originating from a server, e.g. retrieving encoded video stream packets from an IP network
    • H04N21/4385Multiplex stream processing, e.g. multiplex stream decrypting
    • HELECTRICITY
    • H04ELECTRIC COMMUNICATION TECHNIQUE
    • H04NPICTORIAL COMMUNICATION, e.g. TELEVISION
    • H04N21/00Selective content distribution, e.g. interactive television or video on demand [VOD]
    • H04N21/40Client devices specifically adapted for the reception of or interaction with content, e.g. set-top-box [STB]; Operations thereof
    • H04N21/43Processing of content or additional data, e.g. demultiplexing additional data from a digital video stream; Elementary client operations, e.g. monitoring of home network or synchronising decoder's clock; Client middleware
    • H04N21/44Processing of video elementary streams, e.g. splicing a video clip retrieved from local storage with an incoming video stream or rendering scenes according to encoded video stream scene graphs
    • HELECTRICITY
    • H04ELECTRIC COMMUNICATION TECHNIQUE
    • H04NPICTORIAL COMMUNICATION, e.g. TELEVISION
    • H04N21/00Selective content distribution, e.g. interactive television or video on demand [VOD]
    • H04N21/40Client devices specifically adapted for the reception of or interaction with content, e.g. set-top-box [STB]; Operations thereof
    • H04N21/43Processing of content or additional data, e.g. demultiplexing additional data from a digital video stream; Elementary client operations, e.g. monitoring of home network or synchronising decoder's clock; Client middleware
    • H04N21/44Processing of video elementary streams, e.g. splicing a video clip retrieved from local storage with an incoming video stream or rendering scenes according to encoded video stream scene graphs
    • H04N21/44004Processing of video elementary streams, e.g. splicing a video clip retrieved from local storage with an incoming video stream or rendering scenes according to encoded video stream scene graphs involving video buffer management, e.g. video decoder buffer or video display buffer
    • HELECTRICITY
    • H04ELECTRIC COMMUNICATION TECHNIQUE
    • H04NPICTORIAL COMMUNICATION, e.g. TELEVISION
    • H04N21/00Selective content distribution, e.g. interactive television or video on demand [VOD]
    • H04N21/60Network structure or processes for video distribution between server and client or between remote clients; Control signalling between clients, server and network components; Transmission of management data between server and client, e.g. sending from server to client commands for recording incoming content stream; Communication details between server and client 
    • H04N21/61Network physical structure; Signal processing
    • H04N21/6106Network physical structure; Signal processing specially adapted to the downstream path of the transmission network
    • H04N21/6112Network physical structure; Signal processing specially adapted to the downstream path of the transmission network involving terrestrial transmission, e.g. DVB-T
    • HELECTRICITY
    • H04ELECTRIC COMMUNICATION TECHNIQUE
    • H04NPICTORIAL COMMUNICATION, e.g. TELEVISION
    • H04N21/00Selective content distribution, e.g. interactive television or video on demand [VOD]
    • H04N21/60Network structure or processes for video distribution between server and client or between remote clients; Control signalling between clients, server and network components; Transmission of management data between server and client, e.g. sending from server to client commands for recording incoming content stream; Communication details between server and client 
    • H04N21/61Network physical structure; Signal processing
    • H04N21/6106Network physical structure; Signal processing specially adapted to the downstream path of the transmission network
    • H04N21/6125Network physical structure; Signal processing specially adapted to the downstream path of the transmission network involving transmission via Internet
    • HELECTRICITY
    • H04ELECTRIC COMMUNICATION TECHNIQUE
    • H04NPICTORIAL COMMUNICATION, e.g. TELEVISION
    • H04N21/00Selective content distribution, e.g. interactive television or video on demand [VOD]
    • H04N21/60Network structure or processes for video distribution between server and client or between remote clients; Control signalling between clients, server and network components; Transmission of management data between server and client, e.g. sending from server to client commands for recording incoming content stream; Communication details between server and client 
    • H04N21/63Control signaling related to video distribution between client, server and network components; Network processes for video distribution between server and clients or between remote clients, e.g. transmitting basic layer and enhancement layers over different transmission paths, setting up a peer-to-peer communication via Internet between remote STB's; Communication protocols; Addressing
    • HELECTRICITY
    • H04ELECTRIC COMMUNICATION TECHNIQUE
    • H04NPICTORIAL COMMUNICATION, e.g. TELEVISION
    • H04N21/00Selective content distribution, e.g. interactive television or video on demand [VOD]
    • H04N21/60Network structure or processes for video distribution between server and client or between remote clients; Control signalling between clients, server and network components; Transmission of management data between server and client, e.g. sending from server to client commands for recording incoming content stream; Communication details between server and client 
    • H04N21/63Control signaling related to video distribution between client, server and network components; Network processes for video distribution between server and clients or between remote clients, e.g. transmitting basic layer and enhancement layers over different transmission paths, setting up a peer-to-peer communication via Internet between remote STB's; Communication protocols; Addressing
    • H04N21/631Multimode Transmission, e.g. transmitting basic layers and enhancement layers of the content over different transmission paths or transmitting with different error corrections, different keys or with different transmission protocols
    • HELECTRICITY
    • H04ELECTRIC COMMUNICATION TECHNIQUE
    • H04NPICTORIAL COMMUNICATION, e.g. TELEVISION
    • H04N21/00Selective content distribution, e.g. interactive television or video on demand [VOD]
    • H04N21/60Network structure or processes for video distribution between server and client or between remote clients; Control signalling between clients, server and network components; Transmission of management data between server and client, e.g. sending from server to client commands for recording incoming content stream; Communication details between server and client 
    • H04N21/63Control signaling related to video distribution between client, server and network components; Network processes for video distribution between server and clients or between remote clients, e.g. transmitting basic layer and enhancement layers over different transmission paths, setting up a peer-to-peer communication via Internet between remote STB's; Communication protocols; Addressing
    • H04N21/643Communication protocols
    • HELECTRICITY
    • H04ELECTRIC COMMUNICATION TECHNIQUE
    • H04NPICTORIAL COMMUNICATION, e.g. TELEVISION
    • H04N21/00Selective content distribution, e.g. interactive television or video on demand [VOD]
    • H04N21/60Network structure or processes for video distribution between server and client or between remote clients; Control signalling between clients, server and network components; Transmission of management data between server and client, e.g. sending from server to client commands for recording incoming content stream; Communication details between server and client 
    • H04N21/63Control signaling related to video distribution between client, server and network components; Network processes for video distribution between server and clients or between remote clients, e.g. transmitting basic layer and enhancement layers over different transmission paths, setting up a peer-to-peer communication via Internet between remote STB's; Communication protocols; Addressing
    • H04N21/643Communication protocols
    • H04N21/6437Real-time Transport Protocol [RTP]
    • HELECTRICITY
    • H04ELECTRIC COMMUNICATION TECHNIQUE
    • H04NPICTORIAL COMMUNICATION, e.g. TELEVISION
    • H04N21/00Selective content distribution, e.g. interactive television or video on demand [VOD]
    • H04N21/80Generation or processing of content or additional data by content creator independently of the distribution process; Content per se
    • H04N21/83Generation or processing of protective or descriptive data associated with content; Content structuring
    • H04N21/845Structuring of content, e.g. decomposing content into time segments
    • H04N21/8455Structuring of content, e.g. decomposing content into time segments involving pointers to the content, e.g. pointers to the I-frames of the video stream

Definitions

  • the present invention relates to the processing of multimedia flows from different multimedia sources or transmitted via different transport protocols and restored on a rendering device.
  • the audio and video flows are traditionally broadcast together. They are generally provided by a single multimedia source, for example a supplier of multimedia contents, then transported by a single transport protocol over a given transport network then delivered to a single end user device, for example a decoder or a television, in charge of reading these flows, displaying the video data on a screen and broadcasting the audio data on a loudspeaker.
  • a single multimedia source for example a supplier of multimedia contents
  • a single transport protocol over a given transport network then delivered to a single end user device, for example a decoder or a television, in charge of reading these flows, displaying the video data on a screen and broadcasting the audio data on a loudspeaker.
  • An example of new multimedia application is the generation of an audio flow by a source different from that of the video flow, this audio flow being intended to substitute itself for a basic audio flow which would be provided with the video flow.
  • this audio flow being intended to substitute itself for a basic audio flow which would be provided with the video flow.
  • the said flows must contain common or equivalent timing references.
  • the transport protocol provides these references or timestamps to the rendering device so that it regulates and synchronizes the rendering of the two flows.
  • the timestamp is generally a counter value which indicates the time during which the event associated with this timestamp occurs.
  • the clock frequency of the counter must be a value known by the rendering device so that it correctly regulates the flow rendering. The manner in which this clock frequency is given to the rendering device is described in the specifications of the transport layers (MPEG-TS, RTP, etc.).
  • the rendering device In order that the rendering device can synchronize the two flows, the latter generally refer to a common clock commonly called "wall clock".
  • a common clock commonly called "wall clock”.
  • RTP protocol for Real-Time Transport Protocol
  • RTCP for Real-time Transport Control Protocol
  • NTP protocol for Network Time Protocol
  • This synchronization problem can also exist between two video flows which are displayed on a single rendering device, one of the video contents being displayed by picture in picture in the other one, when the two flows are not provided by the same source or the same transport protocol.
  • the Picture in Picture function is an example of this.
  • Another example concerns cases of 2D to 3D transition where a 2D video is received in the broadcast flow and the 3D complement enabling a 3D display is received in the broadband flow.
  • the present invention relates to an object capable of solving synchronization problems when flows are distributed over networks having different time bases.
  • the purpose of the invention is a method for the processing of multimedia data flows in said device comprising an interface to a first network and an interface to a second network, said method comprising the following steps: receive, from the first network, a first flow comprising multimedia data through a first transport protocol fitted with a first data synchronization system; receive, from the second network, a second flow comprising multimedia data through a second transport protocol fitted with a second data synchronization system, said second synchronization system being based on timing references different from those of the first data synchronization system; said first and second flows transporting the same synchronization information in the data field of the first and of the second transport protocol, said synchronization information comprising data indicating the moment from which said multimedia data must be rendered; synchronize said first and second flows by using said synchronization information; and perform a step for rendering said first and second multimedia flows.
  • the invention has the advantages of proposing an independent synchronization system complementary to those used by the respective transport protocols.
  • said transport protocols are of the MPEG-2 TS and/or RTP type.
  • the synchronization information is based on the timing references of one of the transport protocols.
  • the synchronization step comprises a step for delaying the contents received in advance by a memorization, until the contents can be rendered synchronously.
  • the first flow is received on the first network in broadcast mode, and comprises countdown information indicating when first multimedia data to be received on the first network will have to be rendered, the second multimedia data of the second flow being pre-loaded upon request of said reception device on the second network sufficiently in advance to enable the reception device to render them synchronously with the first multimedia data when they are received.
  • a reception device comprising an interface to a first network enabling a first flow comprising multimedia data to be received through a first transport protocol fitted with a first data synchronization system; and an interface to a second network, enabling a second flow comprising multimedia data to be received through a second transport protocol fitted with a second data synchronization system, said second synchronization system being based on timing references different from those of the first data synchronization system; a synchronizer to synchronize said first and second flows by using synchronization information transported in the first and second flows and comprising data indicating when said multimedia data must be rendered; and a third interface enabling said first and second multimedia flows to be transmitted synchronously.
  • Computer program product is understood to mean a computer program medium that can consist not only in a storage space containing the program, such as a computer memory, but also a signal, such as an electrical or optical signal.
  • FIG. 2 is a content distribution block diagram according to an embodiment of the invention.
  • FIG. 3 is a content distribution block diagram according to another embodiment of the invention.
  • FIG. 4 is a decoder according to an embodiment of the invention.
  • modules shown are functional units that may or may not correspond to physically distinguishable units.
  • these modules or some of them can be grouped together in a single component, or constitute functions of the same software.
  • some modules may be composed of separate physical entities.
  • a content distribution system is illustrated in figure 1 . It comprises a server 1 of the audio-video server type.
  • the server broadcasts an audio-video content through a broadcast network noted as broadcast network 7.
  • the broadcast network is of the DVB-T type and transports audio-video data according to the MPEG-2 transport stream transport protocol, noted as MPEG-2 TS, and specified in ISO standard ISO/IEC 13818-1 standard: "Information technology - generic coding of moving pictures and associated audio information: Systems", and noted as ISO/IEC 13818-1 .
  • the broadcast network is here of the unidirectional type, but the invention naturally applies to any type of broadcast network and is also not limited to the MPEG-2 TS transport protocol.
  • the audio-video content is received by a video decoder 4, noted as STB (for set-top box).
  • STB for set-top box
  • the STB decodes the audio-video content and transmits the decoded content to a television set, TV 5.
  • timing references are transmitted to the decoder.
  • the transport stream comprises Program Clock Reference (also noted as PCR) packets that are samples of the system time clock (STC) contained in the server. They are used to synchronize the clock system in the decoder.
  • STC system time clock
  • the elementary streams of a given service contain packets that contain a presentation timestamp (also noted as PTS) which represent the time during which the data transported in the packet should be rendered.
  • PTS presentation timestamp
  • the same server is accessible by the STB through a broadband distribution network, or broadband network 2.
  • the broadband network uses the Internet network 2 accessible to the STB through a home gateway, noted as HG 3.
  • This network is of the bidirectional type and enables the gateway to transmit and to receive data to the server and more generally to any type of server accessible through the Internet network.
  • the server makes accessible to the STB a second audio flow complementary to the broadcast audio-video content.
  • This second audio flow can be received by the STB by means of the RTP / RTCP transport protocol. Hence, once received, the audio flow is decoded and transmitted to the television set.
  • the RTCP protocol indicates to the decoder the equivalence between the presentation timestamp and the time given by the common clock according to the Network Time Protocol (NTP).
  • NTP Network Time Protocol
  • the audio- video flow and the second audio flow are transmitted by two different networks using different transport protocols, they are played synchronously in the television set. Indeed, an item of synchronization information is transmitted by the server in the form of auxiliary data packet. This information enables the STB to synchronize the audio-video flow rendering time on that of the second audio. In practice, only the video part of the audio-video flow is rendered, and the second audio is rendered synchronously.
  • Figure 2 illustrates the data transport according to the MPEG-2 TS protocol in broadcast mode and according to the RTP/RTCP protocol in broadband mode.
  • broadcast mode
  • video frames V0 to V3 are broadcast with corresponding presentation timestamps packets PTSvO to PTSv3.
  • broadband mode
  • the audio frames AO to A2 are transported with corresponding presentation timestamps packets TSA0 to TSA2.
  • auxiliary data packets TO TO to
  • T2 are associated with the video and audio flows and are transmitted over the two networks by using the same transport protocol as the flows with which they are associated.
  • the auxiliary data packets and the multimedia flow are synchronized by means of the synchronization protocol specific to the transport protocol.
  • the auxiliary data packets transport temporal synchronization data. This data can be assimilated with a timeline and represents a counter value from a given time, for example the beginning of the current event.
  • a video frame broadcast over the broadcast network and an audio frame broadcast over the broadband network that must be rendered at the same time will refer to the same timeline value.
  • the synchronization data transported over the two networks use the same format. This format can be different from those of the timing references used by the transport protocols.
  • the decoder can adjust in this way the audio presentation with respect to the video.
  • the video is received in advance, it is memorized for a sufficiently long time to enable audio rendering synchronously.
  • the receiver memorizes the data received in advance to present them synchronously with the data received late.
  • the decoder receives information indicating to it in advance when a content to come will be played. These are countdown information indicating the beginning of a timeline. More precisely, as illustrated in figure 3, if the content must begin to be rendered at the time corresponding to that indicated in the auxiliary data TO, an item of information T -3 , T -2 , T-i is received in advance by the receiver indicating to it when this time TO will begin. This makes it possible for it to recover in advance, in a mode called pre-fetch mode, the audio data corresponding at the video that will be rendered at time TO. These information are received over the networks of the broadcast type and are intended to enable the terminals to pre-load in advance the broadband component.
  • a STB according to the embodiments is illustrated in figure 4. It comprises an interface 44 to the broadcast distribution network 7, an interface 43 to the broadband network 6, and an interface 47 to a television set 5. It comprises an audio-video decoder 45. It also comprises a synchronizer 46 that enables it to synchronize the contents rendering according to what is indicated above. More particularly; the synchronizer identifies the auxiliary data associated with the audio and video flows. It measures in this way the time between the rendering of the two flows. According to the time value, the synchronizer memorizes 42 one of the contents so that the renderings are synchronized. The contents are decoded and synchronized in this way to be delivered to a television set through the TV interface 47.
  • the STB naturally comprises a processor 41 which enables it to implement various applications such as decoding or the synchronizer. Naturally, the synchronized contents could be transmitted to any type of device enabling these contents to be reproduced.
  • the packets of auxiliary data are now described in detail for the MPEG-2 TS and for the RTP.
  • the fields are indicated in the English language in order to facilitate the reading with respect to the indicated specifications.
  • the Broadcast Timeline Descriptor is a linear content element with an inherent timeline that increases at the same rate as its flow. It is used to indicate the value of the timeline.
  • the Content Labelling Descriptor is a means of associating a label, in the form of an identifier, with a content element.
  • the packet of auxiliary date comprises the following data: Transport Stream packet header information, Elementary Stream packet header information and auxiliary data structure.
  • Transport Stream packet header information comprises the Broadcast Timeline Descriptor and the Content Labelling Descriptor.
  • the format of the transport packet is such as defined in the ISO/IEC specification 13818-1 , section 2.4.3.2.
  • the auxiliary data packet comprises no adaptation field.
  • the PCR is transported in a separate component or in the adaptation field of the audio or video component of the same program.
  • the transport packet fields and values are indicated in the following table:
  • the format of the Elementary Stream packet is such as defined in the ISO/IEC specification 13818-1 , section 2.4.3.6.
  • the fields and values are indicated in the following table:
  • auxiliary data structure is such as defined in the TS102823 specification, section 4.5.
  • the fields and values are indicated in the following table: Name No. of Mnemo Value Comment
  • Payload_format 4 bslbf 0x1 Payload field consists of 0 or more
  • the value of the Payloadjormat field is 0x1 to indicate that the payload field is one of the descriptors defined in the TS102823 specification, section 5, and more particularly this involves the broadcast timeline descriptor.
  • the broadcast timeline descriptor is defined in the TS102823 specification, section 5.2.2. According to the embodiment, the broadcast timeline is of the direct type, it is linear and not subject to discontinuities.
  • the value of the Running status field is 'running'.
  • the tick format is 90,000 ticks per second. The fields and values are indicated in the following table:
  • the broadcast timeline descriptor is different on the following points.
  • the value of the Running status field is 'countdown'. This is a private value which is not specified in the standard.
  • the value of a broadcast timeline information field is 'Prefetch_period_duration_ticks' to indicate the pre-fetch period in number of ticks.
  • An "absolute_ticks" field represents the counter advance. The combination of the value of this field with the 'Prefetch_period_duration_ticks' enables to indicate the moment at which the content will be played. Hence, a value equal to zero means that the content will be played in 'Prefetch_period_duration_ticks' ticks.
  • a 'Prefetch_period_duration_ticks' value means that the content begins to be played.
  • the fields and values are indicated in the following table:
  • Running_status 3 Uimsbf 0x5 Status is
  • the format of the Content Labelling descriptor is such as defined in TS102823, section 5.2.4.
  • the fields and values are indicated in the following table:
  • the content_reference_id_data field is defined in the following table:
  • the content_labeling_id_type field specifies the identifier type. The values are indicated in the following table:
  • the contentjabelingjdjength field indicates the length of the content_labeling_id field.
  • the content_labeling_id_byte field identifies the content.
  • the content labeling field can be referenced in the content identifier descriptor such as defined in section 12.1 of standard ETSI TS 102323 on "Digital Video Broadcasting (DVB); Carriage and signalling of TV-Anytime information in DVB transport streams". This enables the content labeling ID to be made public. That is, an external entity can refer to the content referenced by the timeline associated with the content identifier descriptor without having to analyse the auxiliary timeline component.
  • the auxiliary data are transported in the payload field part of the RTP packet.
  • the payload field contains a Broadcast Timeline Descriptor and a Content Labelling Descriptor similar to those indicated above.
  • the RTP packet fields and values are indicated in the following table:
  • BT desc length (8 bits): broadcast timeline descriptorjength. Represents, for this descriptor, the total number of bytes following the byte which defines the value of this field. Value set to 0x8.
  • BT Id (8 bits): broadcast timeline Id.
  • T of the broadcast timeline type. Value set to '0' to indicate a direct type encoding.
  • C Continuity_indicator. Value set to '0' to indicate a linear broadcast timeline not subject to discontinuities.
  • Prev_discontinuity_flag Value set to '0' to indicate that there is no previous discontinuity.
  • Next_discontinuity_flag Value set to '0' to indicate that there is no future discontinuity.
  • RS running status. Value set to "0x4" to indicate that the status is "running”. res (2 bits): reserved. Value set to ⁇ 1 '. tick format (6 bits): tickjormat. Value set to "0x1 1 " to indicate 90,000 ticks per second.
  • Absolute ticks (32 bits): absolute ticks value.
  • Info length 8 bits: value set to 0x0 to indicate that no broadcast timeline info is present.
  • CL desc type 8 bits: descriptor type. Value set to 0x4 to indicate that the descriptor is of the content labelling type.
  • CL desc length 8 bits: content labeling descriptor length. Represents, for this descriptor, the total number of bytes following the byte which defines the value of this field.
  • Metadata application format (16 bits): metadata application format. Value set to 0x4444 to indicate that it is user-defined.
  • CTBI (4 bits): content time base indicator. Value set to 0x8 to indicate use of the DVB broadcast timeline.
  • CRIR length (8 bits): content reference id record length. Specifies the number of bytes of the content_reference_id following this field
  • CLI_type (8 bits): content labeling id type. Specifies the type of content's reference identification.
  • time base mapping flag Value set to '0' to indicate that no explicit mapping from external time base to broadcast timeline is provided.
  • BT ID (8 bits): broadcast timeline ID the content labeling refers to.
  • the 'BT desc length' field represents the length of the broadcast timeline descriptor, in particular the total number of bytes following the byte which defines the value of this field.
  • the RS field represents the running status and its value is 'countdown'.
  • the info length field represents the length of the 'broadcast timeline info' field.
  • the Prefetch_period_duration_ticks and 'absolute ticks' fields have the same meaning as in the MPEG-2 TS case indicated above.
  • the timeline is based on the same clock whatever the transport protocol and the network used.
  • the embodiments of the invention are based on two different transport protocols with different timing references. Naturally, the invention applies in the same manner to two identical transport protocols being based on different timing references. Likewise, the first embodiment could be based on two broadband networks or two broadcast networks.
  • the synchronization data transmitted in the auxiliary data packets is based on one of the clocks, namely that relating to the MPEG-2 TS.
  • this synchronization data could just as well be based on a clock independent of those of the transport protocols.
  • the embodiments are based on a receiver.
  • the audio and video contents could be rendered in distinct equipment.
  • a first equipment is then considered as the master. It periodically communicates to the second equipment the temporal information corresponding to the commonly rendered content.
  • the synchronizer of the second equipment uses this information as a reference. The communication time between the two equipment is presumed to be known or sufficiently low.
  • the embodiment is placed within the framework of a broadcast network and of a broadband network, and for audio and video contents.

Landscapes

  • Engineering & Computer Science (AREA)
  • Multimedia (AREA)
  • Signal Processing (AREA)
  • Two-Way Televisions, Distribution Of Moving Picture Or The Like (AREA)

Abstract

The present invention relates to a reception device and a method for the processing of multimedia data flows in said device comprising an interface to a first network and an interface to a second network, said method comprising the following steps: receive, from the first network, a first flow comprising multimedia data through a first transport protocol fitted with a first data synchronization system; receive, from the second network, a second flow comprising multimedia data through a second transport protocol fitted with a second data synchronization system, said second synchronization system being based on timing references different from those of the first data synchronization system; characterized in that said first and second flows transport the same synchronization information in the payload field of the first and of the second transport protocol, said synchronization information comprising data indicating the moment from which said multimedia data must be rendered; synchronize said first and second flows by using said synchronization information; and perform a step for rendering said first and second multimedia flows.

Description

METHOD FOR SYNCHRONIZING MULTIMEDIA FLOWS AND
CORRESPONDING DEVICE
Technical domain of the invention
The present invention relates to the processing of multimedia flows from different multimedia sources or transmitted via different transport protocols and restored on a rendering device.
Technological background of the invention
In broadcast TV, the audio and video flows are traditionally broadcast together. They are generally provided by a single multimedia source, for example a supplier of multimedia contents, then transported by a single transport protocol over a given transport network then delivered to a single end user device, for example a decoder or a television, in charge of reading these flows, displaying the video data on a screen and broadcasting the audio data on a loudspeaker.
With the rapid development of the Internet network and mobile telecommunication networks new multimedia applications have appeared in which the sources and/or the transport protocols can be different for the audio flows and the video flows. Interactive applications have also appeared for which the sources and transport protocols can also be different from those of the audio-video contents to which they refer. These applications can in this way be transported through broadband networks.
For these new applications, it is necessary to make sure that the rendering of the audio flow is synchronous with the rendering of the video flow, or that the interactive application is synchronously rendered with the audio-video flow.
An example of new multimedia application is the generation of an audio flow by a source different from that of the video flow, this audio flow being intended to substitute itself for a basic audio flow which would be provided with the video flow. For example, in the case of a football match broadcast on the television, it is possible to substitute for the basic audio flow provided with the video flow of the match an audio flow comprising the example comments in a language other than that of the basic audio flow which would be delivered by another multimedia supplier than the match broadcaster. In order that the audio flow can be synchronized with the video flow, the said flows must contain common or equivalent timing references. As a general rule, the transport protocol provides these references or timestamps to the rendering device so that it regulates and synchronizes the rendering of the two flows.
The timestamp is generally a counter value which indicates the time during which the event associated with this timestamp occurs. The clock frequency of the counter must be a value known by the rendering device so that it correctly regulates the flow rendering. The manner in which this clock frequency is given to the rendering device is described in the specifications of the transport layers (MPEG-TS, RTP, etc.).
In order that the rendering device can synchronize the two flows, the latter generally refer to a common clock commonly called "wall clock". For example, in the case of the RTP protocol (for Real-Time Transport Protocol) specified by the IETF according to RFC 3550, a transmitter periodically transmits a message called RTCP (for Real-time Transport Control Protocol ) broadcast report indicating the equivalence between the timestamp and the time given by the common clock. If the audio and video flows are provided by different sources, these two sources must share the same common clock. The NTP protocol (for Network Time Protocol) is typically used to synchronize the two sources on the same clock.
However, when the two sources are not connected by a sufficiently reliable network in terms of transport time, another synchronization mechanism is then necessary.
This synchronization problem can also exist between two video flows which are displayed on a single rendering device, one of the video contents being displayed by picture in picture in the other one, when the two flows are not provided by the same source or the same transport protocol. The Picture in Picture function is an example of this. Another example concerns cases of 2D to 3D transition where a 2D video is received in the broadcast flow and the 3D complement enabling a 3D display is received in the broadband flow. General description of the invention
The present invention relates to an object capable of solving synchronization problems when flows are distributed over networks having different time bases. For this purpose, the purpose of the invention is a method for the processing of multimedia data flows in said device comprising an interface to a first network and an interface to a second network, said method comprising the following steps: receive, from the first network, a first flow comprising multimedia data through a first transport protocol fitted with a first data synchronization system; receive, from the second network, a second flow comprising multimedia data through a second transport protocol fitted with a second data synchronization system, said second synchronization system being based on timing references different from those of the first data synchronization system; said first and second flows transporting the same synchronization information in the data field of the first and of the second transport protocol, said synchronization information comprising data indicating the moment from which said multimedia data must be rendered; synchronize said first and second flows by using said synchronization information; and perform a step for rendering said first and second multimedia flows.
The invention has the advantages of proposing an independent synchronization system complementary to those used by the respective transport protocols.
According to one embodiment, said transport protocols are of the MPEG-2 TS and/or RTP type. According to one embodiment, the synchronization information is based on the timing references of one of the transport protocols.
According to one embodiment, the synchronization step comprises a step for delaying the contents received in advance by a memorization, until the contents can be rendered synchronously.
According to one embodiment, the first flow is received on the first network in broadcast mode, and comprises countdown information indicating when first multimedia data to be received on the first network will have to be rendered, the second multimedia data of the second flow being pre-loaded upon request of said reception device on the second network sufficiently in advance to enable the reception device to render them synchronously with the first multimedia data when they are received.
Another purpose of the invention relates to a reception device comprising an interface to a first network enabling a first flow comprising multimedia data to be received through a first transport protocol fitted with a first data synchronization system; and an interface to a second network, enabling a second flow comprising multimedia data to be received through a second transport protocol fitted with a second data synchronization system, said second synchronization system being based on timing references different from those of the first data synchronization system; a synchronizer to synchronize said first and second flows by using synchronization information transported in the first and second flows and comprising data indicating when said multimedia data must be rendered; and a third interface enabling said first and second multimedia flows to be transmitted synchronously. The invention also applies to a computer program product comprising program code instructions for the execution of the steps of the method according to the invention, when this program is executed on a computer. "Computer program product" is understood to mean a computer program medium that can consist not only in a storage space containing the program, such as a computer memory, but also a signal, such as an electrical or optical signal. Brief description of the figures
The invention will be better understood and illustrated by means of the following embodiments and implementations, by no means limiting, with reference to the figures attached in the appendix, wherein: - Figure 1 is a content distribution system according to an embodiment of the invention;
- Figure 2 is a content distribution block diagram according to an embodiment of the invention;
- Figure 3 is a content distribution block diagram according to another embodiment of the invention; and
- Figure 4 is a decoder according to an embodiment of the invention.
In Figures 1 and 4, the modules shown are functional units that may or may not correspond to physically distinguishable units. For example, these modules or some of them can be grouped together in a single component, or constitute functions of the same software. On the contrary, some modules may be composed of separate physical entities.
Description of a preferred embodiment of the invention
The embodiments are placed within the framework of content distribution through two communication networks and according to two different data transport protocols, namely RTP and MPEG-2 TS, but the invention is not limited to this particular environment and can apply within other data distribution systems. A content distribution system according to an embodiment of the invention is illustrated in figure 1 . It comprises a server 1 of the audio-video server type. The server broadcasts an audio-video content through a broadcast network noted as broadcast network 7. The broadcast network is of the DVB-T type and transports audio-video data according to the MPEG-2 transport stream transport protocol, noted as MPEG-2 TS, and specified in ISO standard ISO/IEC 13818-1 standard: "Information technology - generic coding of moving pictures and associated audio information: Systems", and noted as ISO/IEC 13818-1 . The broadcast network is here of the unidirectional type, but the invention naturally applies to any type of broadcast network and is also not limited to the MPEG-2 TS transport protocol.
The audio-video content is received by a video decoder 4, noted as STB (for set-top box). The STB decodes the audio-video content and transmits the decoded content to a television set, TV 5.
According to the MPEG-2 TS protocol, timing references are transmitted to the decoder. The transport stream comprises Program Clock Reference (also noted as PCR) packets that are samples of the system time clock (STC) contained in the server. They are used to synchronize the clock system in the decoder. The elementary streams of a given service contain packets that contain a presentation timestamp (also noted as PTS) which represent the time during which the data transported in the packet should be rendered.
The same server is accessible by the STB through a broadband distribution network, or broadband network 2. In particular, the broadband network uses the Internet network 2 accessible to the STB through a home gateway, noted as HG 3. This network is of the bidirectional type and enables the gateway to transmit and to receive data to the server and more generally to any type of server accessible through the Internet network. According to the embodiment, the server makes accessible to the STB a second audio flow complementary to the broadcast audio-video content. This second audio flow can be received by the STB by means of the RTP / RTCP transport protocol. Hence, once received, the audio flow is decoded and transmitted to the television set. The RTCP protocol indicates to the decoder the equivalence between the presentation timestamp and the time given by the common clock according to the Network Time Protocol (NTP). According to the embodiment of the invention, although the audio- video flow and the second audio flow are transmitted by two different networks using different transport protocols, they are played synchronously in the television set. Indeed, an item of synchronization information is transmitted by the server in the form of auxiliary data packet. This information enables the STB to synchronize the audio-video flow rendering time on that of the second audio. In practice, only the video part of the audio-video flow is rendered, and the second audio is rendered synchronously.
Figure 2 illustrates the data transport according to the MPEG-2 TS protocol in broadcast mode and according to the RTP/RTCP protocol in broadband mode. In broadcast, mode, the video frames V0 to V3 are broadcast with corresponding presentation timestamps packets PTSvO to PTSv3. In broadband, mode, the audio frames AO to A2 are transported with corresponding presentation timestamps packets TSA0 to TSA2.
In accordance with the embodiment, auxiliary data packets TO to
T2 are associated with the video and audio flows and are transmitted over the two networks by using the same transport protocol as the flows with which they are associated. On each network, the auxiliary data packets and the multimedia flow are synchronized by means of the synchronization protocol specific to the transport protocol. The auxiliary data packets transport temporal synchronization data. This data can be assimilated with a timeline and represents a counter value from a given time, for example the beginning of the current event. A video frame broadcast over the broadcast network and an audio frame broadcast over the broadband network that must be rendered at the same time will refer to the same timeline value.
The synchronization data transported over the two networks use the same format. This format can be different from those of the timing references used by the transport protocols.
They enable the receiver to match the PCR clock reference with the NTP clock reference. In other words, this enables to measure the time between the contents presentation.
The decoder can adjust in this way the audio presentation with respect to the video. In particular if the video is received in advance, it is memorized for a sufficiently long time to enable audio rendering synchronously. More generally, the receiver memorizes the data received in advance to present them synchronously with the data received late.
However, within the framework of MPEG-2 TS, a temporal model imposes not to delay the data rendering. More precisely, the time between the data encoding and the data presentation must remain constant. Therefore, this does not allow to delay the rendering of video data in the case of the embodiment.
This is why, in a second embodiment, the decoder receives information indicating to it in advance when a content to come will be played. These are countdown information indicating the beginning of a timeline. More precisely, as illustrated in figure 3, if the content must begin to be rendered at the time corresponding to that indicated in the auxiliary data TO, an item of information T-3, T-2, T-i is received in advance by the receiver indicating to it when this time TO will begin. This makes it possible for it to recover in advance, in a mode called pre-fetch mode, the audio data corresponding at the video that will be rendered at time TO. These information are received over the networks of the broadcast type and are intended to enable the terminals to pre-load in advance the broadband component. They enable the decoder to issue requests to receive the data in advance. A STB according to the embodiments is illustrated in figure 4. It comprises an interface 44 to the broadcast distribution network 7, an interface 43 to the broadband network 6, and an interface 47 to a television set 5. It comprises an audio-video decoder 45. It also comprises a synchronizer 46 that enables it to synchronize the contents rendering according to what is indicated above. More particularly; the synchronizer identifies the auxiliary data associated with the audio and video flows. It measures in this way the time between the rendering of the two flows. According to the time value, the synchronizer memorizes 42 one of the contents so that the renderings are synchronized. The contents are decoded and synchronized in this way to be delivered to a television set through the TV interface 47. The STB naturally comprises a processor 41 which enables it to implement various applications such as decoding or the synchronizer. Naturally, the synchronized contents could be transmitted to any type of device enabling these contents to be reproduced.
The packets of auxiliary data are now described in detail for the MPEG-2 TS and for the RTP. The fields are indicated in the English language in order to facilitate the reading with respect to the indicated specifications.
Concerning the MPEG-2 TS, the "ETSI TS 102 823 v1 .1 .1 (2005- 1 1 ) over Digital Video Broadcasting (DVB); Specification for the carriage of synchronized auxiliary data in DVB transport streams"specification, noted as TS102823, describes a method used to transport, in a transport stream of the DVB type, auxiliary data that must be synchronized with so-called linear data such as audio or video data. It offers the possibility to encode the payload field of the auxiliary data structure under several formats. In particular, the formats used are those indicated in sections 5.2.2 and 5.2.4 of TS102823. These are the Broadcast Timeline Descriptor and the Content Labelling Descriptor. The Broadcast Timeline Descriptor is a linear content element with an inherent timeline that increases at the same rate as its flow. It is used to indicate the value of the timeline. The Content Labelling Descriptor is a means of associating a label, in the form of an identifier, with a content element.
i.e. the packet of auxiliary date comprises the following data: Transport Stream packet header information, Elementary Stream packet header information and auxiliary data structure. The latter comprises the Broadcast Timeline Descriptor and the Content Labelling Descriptor.
The format of the transport packet is such as defined in the ISO/IEC specification 13818-1 , section 2.4.3.2. The auxiliary data packet comprises no adaptation field. The PCR is transported in a separate component or in the adaptation field of the audio or video component of the same program. The transport packet fields and values are indicated in the following table:
Figure imgf000012_0001
The format of the Elementary Stream packet is such as defined in the ISO/IEC specification 13818-1 , section 2.4.3.6. The fields and values are indicated in the following table:
Figure imgf000013_0001
The format of the auxiliary data structure is such as defined in the TS102823 specification, section 4.5. The fields and values are indicated in the following table: Name No. of Mnemo Value Comment
bits
nics
Payload_format 4 bslbf 0x1 Payload field consists of 0 or more
descriptors
Reserved 3 bslbf '000'
CRCJIag 1 bslbf Ό' No CRC field
Broadcast timeline Defined in section descriptor "Broadcast timeline descriptor"
Content labeling Defined in section descriptor "Content labeling
descriptor"
The value of the Payloadjormat field is 0x1 to indicate that the payload field is one of the descriptors defined in the TS102823 specification, section 5, and more particularly this involves the broadcast timeline descriptor.
The broadcast timeline descriptor is defined in the TS102823 specification, section 5.2.2. According to the embodiment, the broadcast timeline is of the direct type, it is linear and not subject to discontinuities. The value of the Running status field is 'running'. The tick format is 90,000 ticks per second. The fields and values are indicated in the following table:
Name No. of MnemoValue comment bits nics
Descriptor_tag 8 Uimsbf 0x02
Descriptorjength 8 Uimsbf 0x08 Total number of bytes following the byte defining the value of this field
Broadcast timeline id 8 Uimsbf
Reserved 1 Uimsbf '1 '
Broadcast_timeline_typ 1 Uimsbf '0' Direct encoding e
Continuity_indicator 1 Uimsbf '0'
Prev_discontinuity_flag 1 Uimsbf '0' No previous
discontinuity
Next_discontinuity_flag 1 Uimsbf '0' No next discontinuity
Running_status 3 Uimsbf 0x4 Status is "running"
Reserved 2 Uimsbf '1 1 ' Tick_format 6 Uimsbf 0x1 1 90,000 ticks per second
Absolute ticks 32 Uimsbf
Broadcast_timeline_info 8 Uimsbf 0x0 No broadcast timeline Jength info
Concerning the second embodiment, the broadcast timeline descriptor is different on the following points. The value of the Running status field is 'countdown'. This is a private value which is not specified in the standard. The value of a broadcast timeline information field is 'Prefetch_period_duration_ticks' to indicate the pre-fetch period in number of ticks. An "absolute_ticks" field represents the counter advance. The combination of the value of this field with the 'Prefetch_period_duration_ticks' enables to indicate the moment at which the content will be played. Hence, a value equal to zero means that the content will be played in 'Prefetch_period_duration_ticks' ticks. A 'Prefetch_period_duration_ticks' value means that the content begins to be played. The fields and values are indicated in the following table:
Name No. of Mnemo Value Comment bits
nics
Descriptor_tag 8 Uimsbf 0x02
Descriptorjength 8 Uimsbf OxOC Total number of bytes following the byte defining the value of this field
Broadcast timeline id 8 Uimsbf
Reserved 1 Uimsbf '1 '
Broadcast_timeline_type 1 Uimsbf Ό' Direct encoding
Continuity_indicator 1 Uimsbf Ό'
Prev_discontinuity_flag 1 Uimsbf Ό' No previous
discontinuity
Next_discontinuity_flag 1 Uimsbf Ό' No next
discontinuity
Running_status 3 Uimsbf 0x5 Status is
"countdown"
Reserved 2 Uimsbf '1 1 '
Tick_format 6 Uimsbf 0x1 1 90,000 ticks per second
Absolute ticks 32 Uimsbf Broadcast_timeline_info_l 8 Uimsbf 0x4
ength
Prefetch_period_duration_ 32 Uimsbf
ticks
The format of the Content Labelling descriptor is such as defined in TS102823, section 5.2.4. The fields and values are indicated in the following table:
Figure imgf000016_0001
The content_reference_id_data field is defined in the following table:
Name No. of bits Mnemonics content_reference_id_data () {
for (i=0; i<N; i++) {
content_labeling_id_type 8 Uimsbf content_labeling_id_length 8 Uimsbf for (i=0; klength; i++) {
content labeling id byte 8 Uimsbf
}
}
I The content_labeling_id_type field specifies the identifier type. The values are indicated in the following table:
Figure imgf000017_0001
The contentjabelingjdjength field indicates the length of the content_labeling_id field. The content_labeling_id_byte field identifies the content. The content labeling field can be referenced in the content identifier descriptor such as defined in section 12.1 of standard ETSI TS 102323 on "Digital Video Broadcasting (DVB); Carriage and signalling of TV-Anytime information in DVB transport streams". This enables the content labeling ID to be made public. That is, an external entity can refer to the content referenced by the timeline associated with the content identifier descriptor without having to analyse the auxiliary timeline component.
Concerning the transport by RTP, the auxiliary data are transported in the payload field part of the RTP packet. The payload field contains a Broadcast Timeline Descriptor and a Content Labelling Descriptor similar to those indicated above. The RTP packet fields and values are indicated in the following table:
o 1 2 3 0 1 2 3 4 5 6 7 8 9 0 1 2 3 4 5 6 7 8 9 0 1 2 3 4 5 6 7 8 9 0 1
— h— h— h— h— h— h— h— h- -+— h— h- - +— h— h— h- — h- -+— h— h— h— h- -+ — h— h— - +— h— h— h-
BT desc type BT desc length BT Id R 1 T 1 c 1 : P 1 N 1 RS
+ — h— h— h— h— h— h— h— h- -+— h— h- - +— h— h— h- — h- - +— h— h— h— h- -+ — h— h— h- - +— h— h— h- + resltick format absolute
— h— h— h— h— h— h— h— h- -+— h— h- - +— h— h— h- — h- - +— h— h— h— h- -+ — h— h— h- - +— h— h— h- ticks info length CL desc type CL desc length
— h— h— h— h— h— h— h— h- -+— h— h- -+— h— h— h- — h- -+— h— h— h— h- -+ — h— h— h- - +— h— h— h- metadata application format F CTBI 1 res CRIR length
+ — h— h— h— h— h— h— h— h- -+— h— h- -+— h— h— h- — h- -+— h— h— h— h- -+ — h— h— -+— h— h— h- +
CLI_type CLI_ _length CLI byte 1 CLI byte 2
— h— h— h— h— h— h— h— h- - +— h— h- -+— h— h— h- — h- -+— h— h— h— h- -+ — h— h— h- -+— h— h— h-
CLI byte 3 CLI byte N
+ — h— h— h— h— h— h— h— h- - +— h— h- -+— h— h— h- — h- -+— h— h— h— h- -+ — h— h— h- -+— h— h— h- +
TBAD length reserved M 3T ID BT desc type (8 bits): descriptor type. Value set to 0x2 to indicate that the descriptor is of the broadcast timeline type.
BT desc length (8 bits): broadcast timeline descriptorjength. Represents, for this descriptor, the total number of bytes following the byte which defines the value of this field. Value set to 0x8.
BT Id (8 bits): broadcast timeline Id. R (1 bit): reserved, Ί ' value.
T (1 bit): of the broadcast timeline type. Value set to '0' to indicate a direct type encoding. C (1 bit): Continuity_indicator. Value set to '0' to indicate a linear broadcast timeline not subject to discontinuities.
P (1 bit): Prev_discontinuity_flag. Value set to '0' to indicate that there is no previous discontinuity.
N (1 bit): Next_discontinuity_flag. Value set to '0' to indicate that there is no future discontinuity.
RS (3 bits): running status. Value set to "0x4" to indicate that the status is "running". res (2 bits): reserved. Value set to Ί 1 '. tick format (6 bits): tickjormat. Value set to "0x1 1 " to indicate 90,000 ticks per second.
Absolute ticks (32 bits): absolute ticks value.
Info length (8 bits): value set to 0x0 to indicate that no broadcast timeline info is present.
CL desc type (8 bits): descriptor type. Value set to 0x4 to indicate that the descriptor is of the content labelling type. CL desc length (8 bits): content labeling descriptor length. Represents, for this descriptor, the total number of bytes following the byte which defines the value of this field.
Metadata application format (16 bits): metadata application format. Value set to 0x4444 to indicate that it is user-defined. F (1 bit): content reference id record flag. Value set to Ί ' to signal the presence of a content_reference_id_record in this descriptor.
CTBI (4 bits): content time base indicator. Value set to 0x8 to indicate use of the DVB broadcast timeline.
Res (3 bits): reserved. Value set to '1 1 1 '
CRIR length (8 bits): content reference id record length. Specifies the number of bytes of the content_reference_id following this field
CLI_type (8 bits): content labeling id type. Specifies the type of content's reference identification. CLIJength (8 bits): content labeling id length. Specify the length in number of bytes of the content labeling id, i.e. the CLI bytes.
CLI bytes {CLIJength bytes): content labeling id that identifies the content. TBAD length (8 bits): time base association data length. Set to 0x2. Reserved (7 bits): reserved. Value set to Ί 1 1 1 1 1 '.
M (1 bit): time base mapping flag. Value set to '0' to indicate that no explicit mapping from external time base to broadcast timeline is provided.
BT ID (8 bits): broadcast timeline ID the content labeling refers to.
For the second embodiment, the differences in comparison with the preceding RTP packet are indicated below. The RTP packet fields and values are indicated in the following table:
3
0 1 2 3 4 5 9 0 1 2 3 4 5 9 0 1 2 3 4 5 9 0 1
+— h— h— h— h— h— h— h— h— h— h— h— h— h— h— h— h— h— h— h— h— h— h— h— h— h— h— h— h— h— h— h— h BT desc type | BT desc length] BT Id |R|T|C|P|N| RS
— h— h— h— h— h— h— h— h— h— h— h— h— h— h— h— h— h— h— h— h— h— h— h— h— h— h— h— h— h— h— h— h res I tick format! absolute — h— h— h— h— h— h— h— h— h— h— h— h— h— h— h— h— h— h— h— h— h— h— h— h— h— h— h— h— h— h— h— h ticks I info length | prefetch period
+— h— h— h— h— h— h— h— h— h— h— h— h— h— h— h— h— h— h— h— h— h— h— h— h— h— h— h— h— h— h— h— h duration ticks CL desc type CL desc length
+— h— h— h— h— h— h— h— h— h— h— h— h— h— h— h— h— h— h— h— h— h— h— h— h— h— h— h— h— h— h— h— h metadata application format CTBI res CRIR length
+— h— h— h— h— h— h— h— h— h— h— h— h— h— h— h— h— h— h— h— h— h— h— h— h— h— h— h— h— h— h— h— h
CLI_type CLI_length CLI byte 1 CLI byte 2
+— h— h— h— h— h— h— h— h— h— h— h— h— h— h— h— h— h— h— h— h— h— h— h— h— h— h— h— h— h— h— h— h
CLI byte 3 | ... | CLI byte N
+— h— h— h— h— h— h— h— h— h— h— h— h— h— h— h— h— h— h— h— h— h— h— h— h— h— h— h— h— h— h— h— h TBAD length | reserved |M| BT ID
+= =+ The 'BT desc length' field represents the length of the broadcast timeline descriptor, in particular the total number of bytes following the byte which defines the value of this field. The RS field represents the running status and its value is 'countdown'. The info length field represents the length of the 'broadcast timeline info' field. The Prefetch_period_duration_ticks and 'absolute ticks' fields have the same meaning as in the MPEG-2 TS case indicated above.
Hence, the timeline is based on the same clock whatever the transport protocol and the network used.
The embodiments of the invention are based on two different transport protocols with different timing references. Naturally, the invention applies in the same manner to two identical transport protocols being based on different timing references. Likewise, the first embodiment could be based on two broadband networks or two broadcast networks.
In the embodiments, the synchronization data transmitted in the auxiliary data packets is based on one of the clocks, namely that relating to the MPEG-2 TS. Naturally, this synchronization data could just as well be based on a clock independent of those of the transport protocols.
The embodiments are based on a receiver. Alternatively, the audio and video contents could be rendered in distinct equipment. A first equipment is then considered as the master. It periodically communicates to the second equipment the temporal information corresponding to the commonly rendered content. The synchronizer of the second equipment uses this information as a reference. The communication time between the two equipment is presumed to be known or sufficiently low.
The invention is described in the preceding text as an example. It is understood that those skilled in the art are capable of producing variants of the invention without leaving the scope of the patent. In particular, the embodiment is placed within the framework of a broadcast network and of a broadband network, and for audio and video contents.

Claims

1 . Method for the processing of multimedia data flows in a reception device (4) comprising an interface (43) to a first network (6) and an interface (44) to a second network (7), said method comprising the following steps:
- receive and synchronize, from the first network, a first flow comprising first multimedia data, through a first transport protocol fitted with a first data synchronization system, said first flow comprising countdown information indicating when said first multimedia data will have to be rendered;
- receive and synchronize, from the second network, a second flow comprising second multimedia data, through a second transport protocol fitted with a second data synchronization system, said second synchronization system being different from the first data synchronization system, said second multimedia data being pre-loaded upon request of said reception device to enable the reception device to render it synchronously with the first multimedia data when it is received;
said first and second flows each transporting a same synchronization information of a same format in the payload field of the first and of the second transport protocol, said synchronization information comprising data indicating the moment from which said multimedia data must be rendered; said method comprising the following steps:
- synchronize rendering said multimedia data from said first and second flows by using said synchronization information.
2. Method according to claim 1 , said transport protocols being of the MPEG-2 TS and/or RTP type.
3. Method according to one of the preceding claims, the synchronization information being based on the synchronization system of one of the transport protocols.
4. Method according to one of the preceding claims, the synchronization step comprising a step for delaying the content received in advance by memorization, until the contents can be rendered synchronously.
5. Method according to claim 1 , in which the first flow is received on the first network in broadcast mode.
6. Reception device (4) comprising an interface (43) to a first network enabling a first flow comprising multimedia data to be received and synchronized, through a first transport protocol fitted with a first data synchronization system, said first flow comprising countdown information indicating when said first multimedia data will have to be rendered;
- an interface to a second network (44), enabling a second flow comprising multimedia data to be received and synchronized, through a second transport protocol fitted with a second data synchronization system, said second synchronization system being different from the first data synchronization system, said second multimedia data being pre-loaded upon request of said reception device to enable the reception device to render it synchronously with the first multimedia data when it is received;
- a synchronizer (46) to synchronize rendering said multimedia data from said first and second flows by using synchronization information transported in the first and second flows and comprising data indicating the moment from which said multimedia data must be rendered.
7. Computer program product, characterized in that it comprises program code instructions for executing the steps of the method according to claims 1 to 5 when said programme is executed on a computer.
PCT/EP2011/068016 2010-10-15 2011-10-14 Method for synchronizing multimedia flows and corresponding device WO2012049305A1 (en)

Priority Applications (7)

Application Number Priority Date Filing Date Title
CN2011800497729A CN103155584A (en) 2010-10-15 2011-10-14 Method for synchronizing multimedia flows and corresponding device
KR1020137009471A KR101828639B1 (en) 2010-10-15 2011-10-14 Method for synchronizing multimedia flows and corresponding device
US13/878,216 US9729762B2 (en) 2010-10-15 2011-10-14 Method for synchronizing multimedia flows and corresponding device
BR112013008587-8A BR112013008587B1 (en) 2010-10-15 2011-10-14 METHOD FOR PROCESSING MULTIMEDIA DATA ON A RECEIVING DEVICE, AND RECEIVING DEVICE
JP2013533234A JP6053686B2 (en) 2010-10-15 2011-10-14 Methods and corresponding devices for synchronizing multimedia flows
AU2011315435A AU2011315435B2 (en) 2010-10-15 2011-10-14 Method for synchronizing multimedia flows and corresponding device
EP11770759.6A EP2628297B1 (en) 2010-10-15 2011-10-14 Method for synchronizing multimedia flows and corresponding device

Applications Claiming Priority (2)

Application Number Priority Date Filing Date Title
FR1058421 2010-10-15
FR1058421 2010-10-15

Publications (1)

Publication Number Publication Date
WO2012049305A1 true WO2012049305A1 (en) 2012-04-19

Family

ID=44063507

Family Applications (1)

Application Number Title Priority Date Filing Date
PCT/EP2011/068016 WO2012049305A1 (en) 2010-10-15 2011-10-14 Method for synchronizing multimedia flows and corresponding device

Country Status (8)

Country Link
US (1) US9729762B2 (en)
EP (1) EP2628297B1 (en)
JP (1) JP6053686B2 (en)
KR (1) KR101828639B1 (en)
CN (1) CN103155584A (en)
AU (1) AU2011315435B2 (en)
BR (1) BR112013008587B1 (en)
WO (1) WO2012049305A1 (en)

Cited By (3)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
EP2704449A1 (en) 2012-08-30 2014-03-05 Thomson Licensing Rendering time control
WO2014204225A1 (en) * 2013-06-19 2014-12-24 엘지전자 주식회사 Broadcasting transmission/reception apparatus and broadcasting transmission/reception method
EP3032766A1 (en) 2014-12-08 2016-06-15 Thomson Licensing Method and device for generating personalized video programs

Families Citing this family (21)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
EP2680599A1 (en) 2012-06-29 2014-01-01 Thomson Licensing Provision of a personalized media content
JP6625318B2 (en) * 2013-08-29 2019-12-25 パナソニック インテレクチュアル プロパティ コーポレーション オブ アメリカPanasonic Intellectual Property Corporation of America Transmission method and reception method
JP6505996B2 (en) * 2013-08-30 2019-04-24 パナソニック インテレクチュアル プロパティ コーポレーション オブ アメリカPanasonic Intellectual Property Corporation of America Receiving method and receiving apparatus
CN105580386A (en) * 2013-09-30 2016-05-11 索尼公司 Receiver device, broadcast device, server device and reception method
EP3087747A4 (en) 2013-12-23 2017-08-16 Lg Electronics Inc. Apparatuses and methods for transmitting or receiving a broadcast content via one or more networks
CN105917654B (en) * 2014-01-13 2019-07-26 Lg电子株式会社 The device and method of broadcasted content are sent or received via one or more network
WO2015105391A1 (en) * 2014-01-13 2015-07-16 Lg Electronics Inc. Apparatuses and methods for transmitting or receiving a broadcast content via one or more networks
WO2015108309A1 (en) * 2014-01-14 2015-07-23 Lg Electronics Inc. Broadcast transmission device and operating method thereof, broadcast reception device and operating method thereof
US9560421B2 (en) * 2014-03-27 2017-01-31 Samsung Electronics Co., Ltd. Broadcast and broadband hybrid service with MMT and DASH
CN106031181B (en) * 2014-04-18 2019-06-14 Lg电子株式会社 Broadcast singal sending device, broadcasting signal receiving, broadcast singal sending method and broadcast signal received method
CN104135667B (en) 2014-06-10 2015-06-24 腾讯科技(深圳)有限公司 Video remote explanation synchronization method, terminal equipment and system
CN104079957B (en) * 2014-06-25 2017-09-01 广东欧珀移动通信有限公司 A kind of method and system of multimedia equipment simultaneously operating
EP3171605B1 (en) * 2014-07-18 2020-10-14 Sony Corporation Transmission device, transmission method, reception device, and reception method
JP6318953B2 (en) * 2014-07-30 2018-05-09 ソニー株式会社 Transmitting apparatus, transmitting method, receiving apparatus, and receiving method
DE102015001622A1 (en) 2015-02-09 2016-08-11 Unify Gmbh & Co. Kg Method for transmitting data in a multimedia system, and software product and device for controlling the transmission of data in a multimedia system
US10986421B2 (en) * 2017-07-03 2021-04-20 Dolby Laboratories Licensing Corporation Identification and timing data for media content
US10009862B1 (en) * 2017-09-06 2018-06-26 Texas Instruments Incorporated Bluetooth media device time synchronization
US11122310B2 (en) * 2017-09-14 2021-09-14 Media Links Co., Ltd. Video switching system
JP6504294B2 (en) * 2018-03-23 2019-04-24 ソニー株式会社 Transmission apparatus, transmission method, reception apparatus and reception method
JP6743931B2 (en) * 2019-03-26 2020-08-19 ソニー株式会社 Transmission device, transmission method, reception device, and reception method
WO2021065605A1 (en) * 2019-10-01 2021-04-08 ソニー株式会社 Information processing device and information processing method

Citations (3)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
US20070118850A1 (en) * 2003-09-16 2007-05-24 France Telecom Television signal reception method and module
WO2008055420A1 (en) * 2006-11-09 2008-05-15 Huawei Technologies Co., Ltd. A synchronizing method between different medium streams and a system
EP2073548A1 (en) * 2007-12-17 2009-06-24 Alcatel Lucent Method for synchronizing at least two streams

Family Cites Families (14)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
JP2001069459A (en) * 1999-06-22 2001-03-16 Matsushita Electric Ind Co Ltd Countdown voice generator and countdown voice generating system
US6778756B1 (en) 1999-06-22 2004-08-17 Matsushita Electric Industrial Co., Ltd. Countdown audio generation apparatus and countdown audio generation system
US6415438B1 (en) * 1999-10-05 2002-07-02 Webtv Networks, Inc. Trigger having a time attribute
US6583821B1 (en) 1999-07-16 2003-06-24 Thomson Licensing S.A. Synchronizing apparatus for a compressed audio/video signal receiver
CA2313979C (en) * 1999-07-21 2012-06-12 Thomson Licensing Sa Synchronizing apparatus for a compressed audio/video signal receiver
JP4407007B2 (en) 2000-05-02 2010-02-03 ソニー株式会社 Data transmission apparatus and method
US7313313B2 (en) 2002-07-25 2007-12-25 Microsoft Corporation Audio/video synchronization with no clean points
JP2004088366A (en) * 2002-08-26 2004-03-18 Matsushita Electric Ind Co Ltd Digital broadcast receiver and digital broadcast system
US7657645B2 (en) 2004-02-05 2010-02-02 Sharp Laboratories Of America, Inc. System and method for transporting MPEG2TS in RTP/UDP/IP
JP4718275B2 (en) * 2005-08-26 2011-07-06 三菱電機株式会社 Multiple media synchronized playback system and synchronized playback method
EP1855402A1 (en) 2006-05-11 2007-11-14 Koninklijke Philips Electronics N.V. Transmission, reception and synchronisation of two data streams
EP1954054A1 (en) 2007-02-02 2008-08-06 Thomson Licensing System and method for transporting interactive marks
CN102057687B (en) 2008-06-11 2013-06-12 皇家飞利浦电子股份有限公司 Synchronization of media stream components
US8776144B2 (en) * 2008-10-16 2014-07-08 Industrial Technology Research Institute Mobile TV system and method for synchronizing the rendering of streaming services thereof

Patent Citations (3)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
US20070118850A1 (en) * 2003-09-16 2007-05-24 France Telecom Television signal reception method and module
WO2008055420A1 (en) * 2006-11-09 2008-05-15 Huawei Technologies Co., Ltd. A synchronizing method between different medium streams and a system
EP2073548A1 (en) * 2007-12-17 2009-06-24 Alcatel Lucent Method for synchronizing at least two streams

Non-Patent Citations (3)

* Cited by examiner, † Cited by third party
Title
DR UWE RAUSCHENBACH ET AL: "A Scalable Interactive TV Service Supporting Synchronized Delivery Over Broadcast and Broadband Networks", INTERNATIONAL BROADCASTING CONFERENCE 2004; 10-9-2004 - 14-9-2004; AMSTERDAM,, 10 September 2004 (2004-09-10), XP030081445 *
HURST N ET AL: "MPEG SPLICING: A NEW STANDARD FOR TELEVISION - SMPTE 312M", SMPTE - MOTION IMAGING JOURNAL, SOCIETY OF MOTION PICTURE AND TELEVISION ENGINEERS, WHITE PLAINS, NY, US, vol. 107, no. 11, 1 November 1998 (1998-11-01), pages 978 - 988, XP000804761, ISSN: 0036-1682 *
J. BRASSIL AND H. SCHULZRINNE: "Structuring Internet Media Streams With Cueing Protocols", IEEE/ACM TRANSACTIONS ON NETWORKING, vol. 10, no. 5, August 2002 (2002-08-01), pages 466 - 476, XP040140121 *

Cited By (5)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
EP2704449A1 (en) 2012-08-30 2014-03-05 Thomson Licensing Rendering time control
WO2014033140A1 (en) 2012-08-30 2014-03-06 Thomson Licensing Rendering time control
US10057624B2 (en) 2012-08-30 2018-08-21 Thomson Licensing Synchronization of content rendering
WO2014204225A1 (en) * 2013-06-19 2014-12-24 엘지전자 주식회사 Broadcasting transmission/reception apparatus and broadcasting transmission/reception method
EP3032766A1 (en) 2014-12-08 2016-06-15 Thomson Licensing Method and device for generating personalized video programs

Also Published As

Publication number Publication date
AU2011315435B2 (en) 2016-05-12
JP2013545355A (en) 2013-12-19
JP6053686B2 (en) 2016-12-27
EP2628297A1 (en) 2013-08-21
AU2011315435A1 (en) 2013-05-09
EP2628297B1 (en) 2017-07-12
BR112013008587B1 (en) 2021-09-21
BR112013008587A2 (en) 2016-07-12
CN103155584A (en) 2013-06-12
KR101828639B1 (en) 2018-03-22
KR20130138777A (en) 2013-12-19
US9729762B2 (en) 2017-08-08
US20130335629A1 (en) 2013-12-19

Similar Documents

Publication Publication Date Title
AU2011315435B2 (en) Method for synchronizing multimedia flows and corresponding device
KR101689616B1 (en) Method for transmitting/receiving media segment and transmitting/receiving apparatus thereof
US10349091B2 (en) Transmitting method, receiving method, transmitting device, and receiving device
KR101946861B1 (en) Method and apparatus for synchronizing media data of multimedia broadcast service
CN100401784C (en) Data synchronization method and apparatus for digital multimedia data receiver
US10305617B2 (en) Transmission apparatus, transmission method, reception apparatus, and reception method
US11758201B2 (en) Transmitting method, receiving method, transmitting device, and receiving device
US10797811B2 (en) Transmitting device and transmitting method, and receiving device and receiving method
JP2018182677A (en) Information processing apparatus, information processing method, program, and recording medium manufacturing method
JP2024040224A (en) Method for transmission, transmitter, method for reception, and receiver
JP2018182617A (en) Information processing apparatus, information processing method, program, and recording medium manufacturing method
CN101489122B (en) Method, apparatus and system for implementing transmission stream time mapping
CN110719244B (en) Method and system for transmitting media in heterogeneous network
JP2021010194A (en) Transmitter, transmission method, receiver and reception method
JP2020053999A (en) Transmitter, transmission method, receiver and reception method
KR20140004045A (en) Transmission apparatus and method, and reception apparatus and method for providing 3d service using the content and additional image seperately transmitted with the reference image transmitted in real time

Legal Events

Date Code Title Description
WWE Wipo information: entry into national phase

Ref document number: 201180049772.9

Country of ref document: CN

121 Ep: the epo has been informed by wipo that ep was designated in this application

Ref document number: 11770759

Country of ref document: EP

Kind code of ref document: A1

REEP Request for entry into the european phase

Ref document number: 2011770759

Country of ref document: EP

WWE Wipo information: entry into national phase

Ref document number: 2011770759

Country of ref document: EP

WWE Wipo information: entry into national phase

Ref document number: 13878216

Country of ref document: US

ENP Entry into the national phase

Ref document number: 2013533234

Country of ref document: JP

Kind code of ref document: A

ENP Entry into the national phase

Ref document number: 20137009471

Country of ref document: KR

Kind code of ref document: A

NENP Non-entry into the national phase

Ref country code: DE

ENP Entry into the national phase

Ref document number: 2011315435

Country of ref document: AU

Date of ref document: 20111014

Kind code of ref document: A

REG Reference to national code

Ref country code: BR

Ref legal event code: B01A

Ref document number: 112013008587

Country of ref document: BR

ENP Entry into the national phase

Ref document number: 112013008587

Country of ref document: BR

Kind code of ref document: A2

Effective date: 20130409