WO2007110283A1 - Method for generating a digital data stream - Google Patents

Method for generating a digital data stream


Publication number
WO2007110283A1
Authority
WO
WIPO (PCT)
Prior art keywords
metadata
media data
data stream
stream
section
Application number
PCT/EP2007/051571
Other languages
German (de)
English (en)
Inventor
Peter Amon
Andreas Hutter
Thomas Rathgen
Original Assignee
Nokia Siemens Networks Gmbh & Co. Kg
Application filed by Nokia Siemens Networks Gmbh & Co. Kg
Priority to EP07704647A (patent EP2002654A1)
Publication of WO2007110283A1


Classifications

    • H ELECTRICITY
    • H04 ELECTRIC COMMUNICATION TECHNIQUE
    • H04N PICTORIAL COMMUNICATION, e.g. TELEVISION
    • H04N21/00 Selective content distribution, e.g. interactive television or video on demand [VOD]
    • H04N21/40 Client devices specifically adapted for the reception of or interaction with content, e.g. set-top-box [STB]; Operations thereof
    • H04N21/43 Processing of content or additional data, e.g. demultiplexing additional data from a digital video stream; Elementary client operations, e.g. monitoring of home network or synchronising decoder's clock; Client middleware
    • H04N21/434 Disassembling of a multiplex stream, e.g. demultiplexing audio and video streams, extraction of additional data from a video stream; Remultiplexing of multiplex streams; Extraction or processing of SI; Disassembling of packetised elementary stream
    • H04N21/4348 Demultiplexing of additional data and video streams
    • H ELECTRICITY
    • H04 ELECTRIC COMMUNICATION TECHNIQUE
    • H04N PICTORIAL COMMUNICATION, e.g. TELEVISION
    • H04N21/00 Selective content distribution, e.g. interactive television or video on demand [VOD]
    • H04N21/20 Servers specifically adapted for the distribution of content, e.g. VOD servers; Operations thereof
    • H04N21/23 Processing of content or additional data; Elementary server operations; Server middleware
    • H04N21/234 Processing of video elementary streams, e.g. splicing of video streams or manipulating encoded video stream scene graphs
    • H ELECTRICITY
    • H04 ELECTRIC COMMUNICATION TECHNIQUE
    • H04N PICTORIAL COMMUNICATION, e.g. TELEVISION
    • H04N19/00 Methods or arrangements for coding, decoding, compressing or decompressing digital video signals
    • H04N19/46 Embedding additional information in the video signal during the compression process
    • H ELECTRICITY
    • H04 ELECTRIC COMMUNICATION TECHNIQUE
    • H04N PICTORIAL COMMUNICATION, e.g. TELEVISION
    • H04N21/00 Selective content distribution, e.g. interactive television or video on demand [VOD]
    • H04N21/20 Servers specifically adapted for the distribution of content, e.g. VOD servers; Operations thereof
    • H04N21/23 Processing of content or additional data; Elementary server operations; Server middleware
    • H04N21/234 Processing of video elementary streams, e.g. splicing of video streams or manipulating encoded video stream scene graphs
    • H04N21/2343 Processing of video elementary streams, e.g. splicing of video streams or manipulating encoded video stream scene graphs involving reformatting operations of video signals for distribution or compliance with end-user requests or end-user device requirements
    • H04N21/234327 Processing of video elementary streams, e.g. splicing of video streams or manipulating encoded video stream scene graphs involving reformatting operations of video signals for distribution or compliance with end-user requests or end-user device requirements by decomposing into layers, e.g. base layer and one or more enhancement layers
    • H ELECTRICITY
    • H04 ELECTRIC COMMUNICATION TECHNIQUE
    • H04N PICTORIAL COMMUNICATION, e.g. TELEVISION
    • H04N21/00 Selective content distribution, e.g. interactive television or video on demand [VOD]
    • H04N21/20 Servers specifically adapted for the distribution of content, e.g. VOD servers; Operations thereof
    • H04N21/23 Processing of content or additional data; Elementary server operations; Server middleware
    • H04N21/235 Processing of additional data, e.g. scrambling of additional data or processing content descriptors
    • H ELECTRICITY
    • H04 ELECTRIC COMMUNICATION TECHNIQUE
    • H04N PICTORIAL COMMUNICATION, e.g. TELEVISION
    • H04N21/00 Selective content distribution, e.g. interactive television or video on demand [VOD]
    • H04N21/20 Servers specifically adapted for the distribution of content, e.g. VOD servers; Operations thereof
    • H04N21/23 Processing of content or additional data; Elementary server operations; Server middleware
    • H04N21/236 Assembling of a multiplex stream, e.g. transport stream, by combining a video stream with other content or additional data, e.g. inserting a URL [Uniform Resource Locator] into a video stream, multiplexing software data into a video stream; Remultiplexing of multiplex streams; Insertion of stuffing bits into the multiplex stream, e.g. to obtain a constant bit-rate; Assembling of a packetised elementary stream
    • H ELECTRICITY
    • H04 ELECTRIC COMMUNICATION TECHNIQUE
    • H04N PICTORIAL COMMUNICATION, e.g. TELEVISION
    • H04N21/00 Selective content distribution, e.g. interactive television or video on demand [VOD]
    • H04N21/20 Servers specifically adapted for the distribution of content, e.g. VOD servers; Operations thereof
    • H04N21/23 Processing of content or additional data; Elementary server operations; Server middleware
    • H04N21/236 Assembling of a multiplex stream, e.g. transport stream, by combining a video stream with other content or additional data, e.g. inserting a URL [Uniform Resource Locator] into a video stream, multiplexing software data into a video stream; Remultiplexing of multiplex streams; Insertion of stuffing bits into the multiplex stream, e.g. to obtain a constant bit-rate; Assembling of a packetised elementary stream
    • H04N21/23614 Multiplexing of additional data and video streams
    • H ELECTRICITY
    • H04 ELECTRIC COMMUNICATION TECHNIQUE
    • H04N PICTORIAL COMMUNICATION, e.g. TELEVISION
    • H04N21/00 Selective content distribution, e.g. interactive television or video on demand [VOD]
    • H04N21/20 Servers specifically adapted for the distribution of content, e.g. VOD servers; Operations thereof
    • H04N21/25 Management operations performed by the server for facilitating the content distribution or administrating data related to end-users or client devices, e.g. end-user or client device authentication, learning user preferences for recommending movies
    • H04N21/266 Channel or content management, e.g. generation and management of keys and entitlement messages in a conditional access system, merging a VOD unicast channel into a multicast channel
    • H04N21/2662 Controlling the complexity of the video stream, e.g. by scaling the resolution or bitrate of the video stream based on the client capabilities
    • H ELECTRICITY
    • H04 ELECTRIC COMMUNICATION TECHNIQUE
    • H04N PICTORIAL COMMUNICATION, e.g. TELEVISION
    • H04N21/00 Selective content distribution, e.g. interactive television or video on demand [VOD]
    • H04N21/40 Client devices specifically adapted for the reception of or interaction with content, e.g. set-top-box [STB]; Operations thereof
    • H04N21/43 Processing of content or additional data, e.g. demultiplexing additional data from a digital video stream; Elementary client operations, e.g. monitoring of home network or synchronising decoder's clock; Client middleware
    • H04N21/435 Processing of additional data, e.g. decrypting of additional data, reconstructing software from modules extracted from the transport stream
    • H ELECTRICITY
    • H04 ELECTRIC COMMUNICATION TECHNIQUE
    • H04N PICTORIAL COMMUNICATION, e.g. TELEVISION
    • H04N21/00 Selective content distribution, e.g. interactive television or video on demand [VOD]
    • H04N21/80 Generation or processing of content or additional data by content creator independently of the distribution process; Content per se
    • H04N21/83 Generation or processing of protective or descriptive data associated with content; Content structuring
    • H04N21/84 Generation or processing of descriptive data, e.g. content descriptors
    • H ELECTRICITY
    • H04 ELECTRIC COMMUNICATION TECHNIQUE
    • H04N PICTORIAL COMMUNICATION, e.g. TELEVISION
    • H04N7/00 Television systems
    • H04N7/24 Systems for the transmission of television signals using pulse code modulation
    • H04N7/52 Systems for transmission of a pulse code modulated video signal with one or more other pulse code modulated signals, e.g. an audio signal or a synchronizing signal
    • H04N7/54 Systems for transmission of a pulse code modulated video signal with one or more other pulse code modulated signals, e.g. an audio signal or a synchronizing signal the signals being synchronous
    • H04N7/56 Synchronising systems therefor

Definitions

  • the invention relates to a method for generating a digital data stream and to a method for processing a data stream generated in this way.
  • the invention relates to a transmitter or a receiver for transmitting or receiving a digital data stream and a corresponding digital information carrier.
  • Metadata streams, which deliver a content-related description of the corresponding data packets of the media data stream, are often transmitted synchronized in time with the actual media data stream.
  • video sample also referred to as "video sample"
  • metadata stream portion also referred to as a "meta data sample"
  • the provisional MPEG4-SVC standard
  • MPEG Moving Picture Experts Group
  • the generated video stream is constructed so that the video images can be extracted in different resolutions.
  • An R/D-optimal extraction of the video data is to be achieved here.
  • R/D stands for "rate distortion". The R/D-optimal extraction, which is well known to the person skilled in the art and is explained in more detail in the detailed description, is intended to ensure the best possible quality of the video data stream at a given
  • Such an optimal extraction is very complex and requires certain information to be provided. It is not enough to provide metadata only for each video sample; parameters must also be provided for the individual video data packets within the video samples for the R/D-optimal extraction of video images.
  • the document WO 03/091905 A2 discloses the classification of a data stream by dividing the data stream into data stream subareas, wherein each data stream subarea is assigned to a class.
  • the classes can be selected such that they are independent of the structure and syntax of the data stream to be described, in particular the encoding format of the data stream.
  • the document does not address the synchronized generation of metadata for a media data stream.
  • the object of the invention is to provide a method for generating a digital data stream and a method for processing such a data stream, which allow easy access to metadata of a media data stream. It is another object of the invention to provide a corresponding
  • a media data stream is generated which contains a plurality of media data stream sections sorted temporally in a specific manner, wherein each media data stream section contains one or more media data packets.
  • the media data stream sections may be sorted in temporal succession.
  • a media data stream section is, for example, a predetermined video image in a video data stream.
  • the generated media data stream is assigned a metadata stream that is synchronized with it in time, the metadata stream comprising metadata stream sections, and one or more of the media data stream sections, in particular each media data stream section, each being assigned a metadata stream section.
  • a metadata stream section is generated by storing metadata relating to the corresponding media data stream section in the metadata stream section.
  • the invention is characterized in that one or more first metadata elements are arranged in a respective metadata flow section such that each media data packet of the media data flow section to which the respective metadata flow section is assigned is assigned a first metadata element of the respective metadata flow section.
  • This assignment ensures that the structure of a respective media data flow section is mapped in the metadata flow section.
  • the assignment according to the invention of metadata elements to media data packets ensures that for each media data packet a unique first metadata element exists, which is assigned to the media data packet.
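  • The one-to-one assignment described above can be sketched as follows. This is a minimal illustration only: the class names, the `describe` callback and the `size` field are hypothetical and do not come from the application. It merely shows a metadata stream section that holds one first metadata element per media data packet, in the same order.

```python
from dataclasses import dataclass, field


@dataclass
class MediaPacket:
    """Hypothetical stand-in for one media data packet."""
    payload: bytes


@dataclass
class MetadataElement:
    """Hypothetical stand-in for one first metadata element."""
    values: dict = field(default_factory=dict)


def build_metadata_section(media_section, describe):
    """Return one metadata element per media data packet, in packet order."""
    return [MetadataElement(describe(packet)) for packet in media_section]


# One media data stream section with three packets (payloads are made up).
section = [MediaPacket(b"\x01\x02"), MediaPacket(b"\x03"), MediaPacket(b"\x04\x05\x06")]
metadata = build_metadata_section(section, lambda p: {"size": len(p.payload)})

# The metadata section mirrors the media section packet by packet.
sizes = [element.values["size"] for element in metadata]
```

  • Because both sections have the same length and order, the i-th metadata element always belongs to the i-th packet and no lookup table is needed.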
  • the method according to the invention can also be used in one embodiment if in each case a plurality of media data packets are combined into one or more aggregators in one or more of the media data flow sections.
  • the term aggregator is to be understood here in general terms and includes any type of combination of several media data packets into a group of media data packets. Nevertheless, the term aggregator is also to be understood in its special meaning known from the MPEG4 data formats. There, aggregators are used to group media data packets with the same quality levels, which are described, for example, by the so-called DTQ values.
  • the metadata flow section which is assigned to the respective media data flow section, contains second metadata elements, which are arranged in such a way that a second metadata element is assigned to each aggregator.
  • this uniquely assigns a metadata element to an aggregator, so that a defined mapping of an aggregator in the media data stream also takes place in the corresponding metadata stream.
  • the criteria according to which media data packages are aggregated into aggregators can be arbitrary and play no role in the invention.
  • aggregators are used to aggregate media data packets having substantially the same data quality.
  • the second metadata element includes a metadata element for describing the respective aggregator and the metadata contained in the one or more metadata elements assigned to the one or more media data packets that are aggregated into the respective aggregator.
  • the data stream contains a plurality of media data streams and correspondingly assigned metadata streams.
  • at least one of the media data streams may contain one or more so-called extractors, each of which references one or more media data packets of another media data stream, according to the invention each extractor being assigned a third metadata element in the metadata stream associated with the at least one media data stream.
  • the third metadata element assigned to the extractor is preferably designed such that it contains a metadata element for describing the extractor itself as well as the metadata contained in the first or second metadata element(s) of the media data packets that are referenced by the extractor.
  • it is specified in one or more of the first metadata elements whether and/or how many subsequent first and/or second and/or third metadata elements have the same content.
  • the reading out of the metadata from the data stream can be simplified in this way, since for several first and/or second and/or third metadata elements the contents of the individual metadata elements do not have to be accessed each time; instead, the content of the first metadata element can be reused for the following metadata elements.
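  • A reader could exploit such a repetition hint roughly as sketched below. The representation is an assumption made for illustration: each stored entry is a pair of the element content and the number of following elements with the same content, and the `quality_id` values are invented. The application does not prescribe this encoding.

```python
def expand_repeats(stored):
    """Expand (content, following_same) pairs into one content entry per element.

    `following_same` is the assumed count of subsequent metadata elements
    that share the content of the current one.
    """
    expanded = []
    for content, following_same in stored:
        # The content is read once and reused for the following elements.
        expanded.extend([content] * (1 + following_same))
    return expanded


# Hypothetical section: the first content applies to three elements in a row.
stored = [({"quality_id": 3}, 2), ({"quality_id": 1}, 0)]
per_element = expand_repeats(stored)
```

  • With this input the reader parses two contents but obtains metadata for four elements.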
  • the media data stream sections and / or the metadata stream sections are referenced in the data stream. Furthermore, the description of one or more quality levels and the instructions may be included in the data stream for direct access to the media data packages of the corresponding quality level. In a further embodiment of the method according to the invention, the first and / or second and / or third metadata elements contained in a metadata stream section can be accessed in the data stream.
  • each media stream portion represents a video image at a predetermined time and the media data packets are video data packets.
  • the video data stream may be an MPEG4 SVC video data stream well known in the art that enables scalable encoding and decoding of video data.
  • the metadata elements contain, for example, content that allows optimized transmission of the data stream at a given data rate.
  • the first and/or second and/or third metadata elements contain parameters relating to a rate control of the data stream, preferably parameters of an R/D-optimal extraction of media data, and/or parameters for adapting the quality of the data stream.
  • the invention further relates to a method for processing a digital data stream generated in this way.
  • the data stream is read such that the assignment of the media data packets to the first metadata elements and / or the aggregators to the second metadata elements and / or the extractors to the third metadata elements by numbering the read media data packets and / or aggregators and / or extractors and simultaneous numbering of the read first and / or second and / or third metadata elements is determined.
  • no separate indices for the association between metadata elements and media data packets have to be stored; rather, the assignment already results from the mapping of the structure of the media data in the metadata.
  • the invention further relates to a transmitter for transmitting a data stream according to the invention, comprising: a first means for generating a media data stream containing a plurality of time-sorted media data stream sections, each media data stream section containing one or more media data packets; second means for associating a metadata stream synchronized with the media data stream with the media data stream, the metadata stream including metadata flow portions, and each of the media data flow portions having a metadata stream portion associated therewith; third means for arranging one or more first metadata items in a respective metadata stream portion such that each media data packet of the media data stream portion to which the respective metadata stream portion is assigned is assigned a first metadata item of the respective metadata stream portion.
  • the transmitter further comprises means for processing the data stream according to the above processing method.
  • the transmitter can extract parts from the data stream according to predetermined criteria, in particular according to criteria relating to the data rate and/or the data quality, and can send out the extracted parts.
  • the invention further relates to a receiver for receiving a data stream generated according to the invention, comprising: a reading means for reading out the data stream such that the assignment of the media data packets to the first metadata elements and / or the aggregators to the second metadata elements and / or the extractors to the third metadata elements by numbering the read media data packets and / or aggregators and / or extractors and simultaneous numbering of the read first and / or second and / or third metadata elements is determined.
  • the receiver can in this case be designed such that it can read parts from the data stream according to predetermined criteria, in particular criteria with regard to the data rate and / or the data quality.
  • the invention further comprises a corresponding digital information carrier containing digital information with which a data stream according to the invention
  • the digital information carrier includes a first data storage area for generating a media data stream including a plurality of time-sorted media data stream sections, each media data stream section containing one or more media data packets. Furthermore, a second data storage area is provided for generating a metadata stream synchronized in time with the media data stream, wherein the metadata stream comprises metadata stream sections and one or more of the media data stream sections, and in particular each media data stream section, is assigned a respective metadata stream section.
  • the information carrier is characterized in that one or more first metadata elements are arranged in a respective metadata stream section such that each media data packet of the media data stream section to which the respective metadata stream section is assigned is assigned a first metadata element of the respective metadata stream section.
  • the digital information carrier comprises a track for referencing media data stream sections and / or one or more tracks for referencing metadata stream sections.
  • the tracks for referencing metadata flow sections may here comprise descriptions of groupings of first and / or second and / or third metadata elements with which, for example, predetermined quality levels for the data stream transmission are determined.
  • FIG. 1 shows a schematic representation of a file format known from the prior art, in which stored video data are referenced via a video track;
  • Figure 2 is a schematic representation of a file format known in the art in which video data is referenced via a video track and also metadata is time synchronized with the video data via a metadata track;
  • FIG. 3 shows a schematic representation of the assignment according to the invention of metadata elements to media data packets
  • FIG. 4 shows a schematic representation of a file format according to the invention in which inter alia a media data stream is used which contains so-called extractors;
  • FIG. 5 shows a schematic representation of a file format according to the invention, in which the referencing of the media data stream and the time-synchronized metadata stream takes place via corresponding tracks;
  • FIG. 6 shows a schematic representation of a file format according to the invention, in which metadata are provided in several hierarchical levels of the media data stream;
  • FIG. 7 is a schematic representation of a file format according to the invention, in which additional information can be used to access groups of metadata elements in the metadata track directly;
  • Fig. 8 is a schematic representation of a transmitter, a
  • SVC Scalable Video Coding
  • SNR Signal to Noise Ratio
  • the D value represents the spatial resolution
  • the T value the temporal resolution
  • the Q value the SNR scaling direction for each video packet. The larger the values, the better the quality of the corresponding resolutions.
  • Fig. 1 shows a file format described in the standardization document [1] concerning the format for storing MPEG4 SVC data.
  • the format contains a media data container MD, which contains the area "mdat" in which one or more media data streams MDS are stored, and a description container MT, which includes an area "moov", which contains one or more video tracks VT.
  • a video track stores information that allows access to the media data (or portions of it).
  • a media data stream MDS contains, for example, the temporally organized individual video data packets, wherein the video data packets are designated as NALU (Network Abstraction Layer Unit).
  • NALU Network Abstraction Layer Unit
  • the video track VT contains data with the aid of which a predetermined operating point with defined resolution and frame rate can be accessed, the operating points being grouped in defined "tiers" so that the operating points in a tier have a defined resolution and frame rate.
  • the evaluation of this data results in extraction instructions illustrated by the remaining (not bold) arrows in FIG.
  • the quality of a tier may vary within certain ranges, as may the resulting data transfer rate of the video data stream.
  • the video track VT thus contains metadata for the individual tiers, which describe the resulting quality. This static description refers to the entire representation of the data stream.
  • the metadata can be, for example,
  • the video track VT includes a number of data structures that allow efficient access to the desired media data.
  • the media data stream section S0 contains six video data packets NALu, whereas the section S1 comprises five NALus. For each video data packet NALu a corresponding DTQ value is stored. For example, the first video data packet from the left in the media data stream section S0 has a DTQ value in the form of the triple (0,0,0). This means that the spatial resolution has the value or index 0, the temporal resolution has the index 0, and the SNR scalability also has the index 0. Further, in the file format of Fig.
  • adjacent video data packets are grouped into a so-called aggregator if they have the same DTQ values.
  • the second and the third video data packet from the left in the first media data stream section S0 are combined to form an aggregator A, since both video data packets have the same DTQ value (1, 0, 0).
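  • The grouping rule can be sketched as follows. The first three DTQ triples are the ones named in the text; the last three are invented for illustration, as are the function name and the tuple-based representation. Adjacent packets with equal DTQ values are collected into one aggregator, all others stay single packets.

```python
from itertools import groupby


def group_into_aggregators(dtq_values):
    """Group runs of adjacent, equal DTQ triples into aggregators."""
    structure = []
    for dtq, run in groupby(dtq_values):
        run = list(run)
        if len(run) > 1:
            # Two or more adjacent packets with the same DTQ form an aggregator.
            structure.append(("aggregator", run))
        else:
            structure.append(("nalu", dtq))
    return structure


# Six packets of one section; the last three DTQ triples are hypothetical.
section_dtq = [(0, 0, 0), (1, 0, 0), (1, 0, 0), (1, 1, 0), (1, 2, 0), (1, 2, 1)]
structure = group_into_aggregators(section_dtq)
```

  • For this input the six packets collapse into five top-level entries: one packet, one aggregator holding two packets, and three further packets.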
  • the file format contains a description container MT and a media data container MD.
  • TMT timed meta data track
  • This track in turn has references in the form of arrows to a metadata stream MTS within the media data container MD.
  • the metadata stream MTS contains a plurality of metadata stream sections in the form of so-called "meta data samples" M0, M1, ..., M7, which are synchronized in time with the individual media data stream sections S0, S1, ..., S7.
  • the file format of Fig. 2 is described in the standardization document [4], for example. With this format, metadata can be encoded using, for example, binarized XML (BiM).
  • AVC Advanced Video Coding
  • the other quality levels are coded by reducing the quantization parameter, for example by six.
  • the individual FGS refinements are in turn coded in two stages, which are designated as "significant pass" and "refinement pass" and are stored in a so-called FGS NALu.
  • the MPEG4 SVC standard now allows the video stream to be adjusted to a predetermined bit rate by truncating the individual FGS NALus.
  • the NALus can always be cut off at the same bit position here, which, however, leads to non-optimal decoding.
  • the FGS NALus may be "pre-cut" at defined locations, these precut fragments (so-called aka
  • In R/D-optimal extraction, the R/D values for a video image of a given spatial resolution level are plotted on an R/D diagram over the bit rate. This results in a convex hull, and for each point on the convex hull an angle can be determined which indicates how much the distortion varies in response to a change in the data rate. The value of this angle can be generated for each NALu after appropriate scaling and stored appropriately.
  • the NALus to be retained can then be selected optimally for a given spatial resolution and a given data rate. Details of the R/D-optimal extraction of video data packets can be found in the publication [3].
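  • As a rough illustration of the idea, and not the procedure of publication [3], the sketch below ranks packets by how much distortion they remove per bit and keeps the steepest ones until a bit budget is exhausted. The packet names, sizes and distortion values are invented, and dependencies between packets are ignored for brevity.

```python
def select_packets(packets, bit_budget):
    """Greedily keep the packets with the largest distortion drop per bit."""
    ranked = sorted(
        packets,
        key=lambda p: p["delta_distortion"] / p["bits"],
        reverse=True,
    )
    kept, used = [], 0
    for packet in ranked:
        # Skip any packet that would exceed the available bit budget.
        if used + packet["bits"] <= bit_budget:
            kept.append(packet["name"])
            used += packet["bits"]
    return kept, used


# Hypothetical packets: size in bits and distortion reduction when kept.
packets = [
    {"name": "base", "bits": 400, "delta_distortion": 80.0},
    {"name": "fgs1", "bits": 300, "delta_distortion": 30.0},
    {"name": "fgs2", "bits": 300, "delta_distortion": 12.0},
]
kept, used = select_packets(packets, bit_budget=800)
```

  • Storing a per-packet slope value in the metadata, as the text describes, would let an extractor perform such a selection without decoding the video data.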
  • The parameter used in selecting the NALus according to the R/D-optimal extraction is referred to in the above document [3] as "quality_id".
  • individual fields for metadata elements are stored in the time-synchronized metadata stream MTS, wherein each metadata element is assigned to exactly one video data packet NALu from the media data stream.
  • Any information concerning the individual video data packets can be stored in these metadata elements; in particular, the corresponding parameter for the R/D-optimal extraction of video data can be stored for each video data packet (e.g. the aforementioned quality_id).
  • the invention is illustrated in FIG. 3 for the media data stream section S0 and the metadata stream section M0 shown in FIG.
  • the media data stream section S0 has the same structure as the media data stream section S0 shown in FIG.
  • the metadata stream section M0 has the same structure as the media data stream section S0.
  • a metadata element ME in M0 is provided for each NALu in S0, wherein the assignment of the individual metadata elements ME to the NALus is indicated in FIG. 3 by the arrows P.
  • with the method according to the invention, the aggregator A in S0, which combines the second and third NALu with the same DTQ values, is also assigned a corresponding metadata element, which is designated in FIG. 3 as MA' and whose assignment to the aggregator A is indicated by the arrow P'.
  • the metadata element MA' contains a metadata element MEA which describes the properties of the aggregator, for example properties shared by all the NALus contained in the aggregator. In the option shown in FIG.
  • the metadata element MA' contains, after the metadata element MEA, first metadata elements ME for each NALu contained in the aggregator.
  • MEA describes the aggregator and any NALu contained therein.
  • an essential aspect of the invention is that the structure of a media data stream section is mapped in a corresponding metadata stream section.
  • the corresponding assignment between metadata elements and video data packets can be achieved here by parallel counting of the metadata elements in the metadata stream and the video data packets in the media data stream. This counting is shown in FIG. 3 by corresponding numbering of the video data packets NALu and the metadata elements ME and of the metadata element MEA. The numbering is as follows:
  • the first NALu from the left receives the count index 1, the following aggregator A the count index 2, and the NALus following the aggregator the count indices 3, 4 and 5. The metadata stream is counted analogously. That is to say, the first metadata element, which corresponds to the NALu with the count index 1, likewise receives the count index 1; the metadata element MA' for the aggregator A receives the count index 2, which is also assigned to the aggregator A; and the metadata elements following MA' receive the count indices 3, 4 and 5, analogous to the corresponding NALus.
  • the numbering restarts within the aggregator, so that the NALus in the aggregator A and the corresponding metadata elements ME have the indices 1 and 2. It should be noted here that the numbers shown in FIG. 3 are not stored in the data format, but result automatically from the use of count indices when accessing the data.
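  • The parallel counting can be sketched as below. The nested-list representation is an assumption for illustration: a string stands for a single NALu or metadata element ME, a list stands for an aggregator, and a dictionary with the keys `MEA` and `ME` stands for the aggregator's metadata element MA'. The item names are placeholders.

```python
def pair_by_counting(media_items, metadata_items, prefix=()):
    """Pair media items and metadata items by counting both in parallel."""
    if len(media_items) != len(metadata_items):
        raise ValueError("metadata section does not mirror the media section")
    pairs = []
    for count, (media, meta) in enumerate(zip(media_items, metadata_items), start=1):
        index = prefix + (count,)
        if isinstance(media, list):
            # Aggregator: pair it with MEA, then restart counting inside it.
            pairs.append((index, "aggregator", meta["MEA"]))
            pairs.extend(pair_by_counting(media, meta["ME"], index))
        else:
            pairs.append((index, media, meta))
    return pairs


# One section with a NALu, an aggregator of two NALus, and three more NALus.
media_section = ["nalu_a", ["nalu_b", "nalu_c"], "nalu_d", "nalu_e", "nalu_f"]
metadata_section = ["me_a", {"MEA": "mea", "ME": ["me_b", "me_c"]}, "me_d", "me_e", "me_f"]
pairs = pair_by_counting(media_section, metadata_section)
```

  • The indices exist only while reading; nothing in either section stores them, which matches the remark that the numbers are not part of the data format.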
  • FIG. 4 shows a further embodiment of the file format according to the invention, wherein not only a single media data stream MDS and a single synchronized metadata stream MTS are present, but a further media data stream MDS' is provided to which a corresponding metadata stream MTS' is assigned.
  • the media data stream section S0 shown in the media data stream MDS here corresponds to the media data stream section S0 of FIG. 3.
  • the metadata stream section M0 in the metadata stream MTS corresponds to the metadata stream section M0 of FIG. 3.
  • the additionally provided media data stream MDS' contains so-called extractors. The use of such extractors is described in Section 7.6 of standardization document [1]. These extractors represent packets that serve to reference media data in other media data streams. In the example of FIG.
  • the left extractor E in the media data stream MDS' references the first NALu, the subsequent aggregator A and the two NALus contained in this aggregator in the media data stream MDS.
  • the second extractor E in the media data stream MDS' references the last three NALus in the media data stream MDS.
  • a further type of metadata elements is provided for the extractors in the metadata stream MTS'.
  • These metadata elements are reproduced in the stream MTS 'of FIG. 4 as MEI' for the left extractor of MDS 'and as ME2' for the right tractor of MDS '.
  • these metadata elements are also referred to as ME '.
  • the assignment between the extractors and the metadata elements ME1 'and ME2' in turn follows by numbering the extractors E with the counting indexes 1 and 2 and the corresponding metadata elements ME1 ', ME2' with the same indices 1 and 2.
  • the metadata elements ME1 'and ME2' contain a metadata element MEE at the beginning describes the entire extractor. This metadata element MEE can be followed by the metadata elements ME of those media data packets (NALus) which are referenced by the extractor.
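The layout of an extractor metadata element ME' (an MEE followed by the ME of each referenced NALu) can be sketched as follows. This is a hedged illustration only: the function `build_extractor_metadata`, the string representation of the elements, and the NALu names `n1`, `a1`, `a2`, `n3`, `n4`, `n5` are assumptions made for the example. The sketch also assumes that only the metadata of NALus follows the MEE, and leaves open how metadata for a referenced aggregator itself would be carried.

```python
from typing import Dict, List


def build_extractor_metadata(extractor_info: str,
                             referenced: List[str],
                             me_by_nalu: Dict[str, str]) -> List[str]:
    """Return an ME' as a list: the MEE first, then the ME of each referenced NALu."""
    return [f"MEE({extractor_info})"] + [me_by_nalu[n] for n in referenced]


# Hypothetical names for the NALus of MDS: n1, the two NALus a1 and a2 inside
# aggregator A, and the last three NALus n3, n4, n5.
me_by_nalu = {n: f"ME({n})" for n in ["n1", "a1", "a2", "n3", "n4", "n5"]}

# Left extractor: first NALu plus the NALus contained in the aggregator.
me1 = build_extractor_metadata("E1", ["n1", "a1", "a2"], me_by_nalu)
# Right extractor: the last three NALus.
me2 = build_extractor_metadata("E2", ["n3", "n4", "n5"], me_by_nalu)

print(me1)
print(me2)
```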
  • FIG. 5 again shows, in a schematic overall view, the file format generated according to the invention.
  • The structure of Fig. 5 corresponds to the structure of Fig. 2, but the contents of the video track VT, the metadata track TMT, the media data stream MDS and the metadata stream MTS are shown in detail.
  • The video track VT contains a number of metadata structures that describe the video track globally. Such metadata structures concern, for example, the following descriptions:
  • The metadata track TMT contains data structures which globally describe the time-synchronized metadata in the metadata stream MTS.
  • The metadata track contains in particular a description of the metadata stream (so-called "metadata sample entry") as well as further necessary structures (such as the "sample size box").
  • Each NALu in S1 is uniquely associated with exactly one metadata element ME in the metadata stream section M1.
  • FIG. 6 shows a data format similar to FIG. 5, but additionally the metadata is stored in several hierarchical levels of the media data stream.
  • Metadata concerning the entire metadata track TMT is stored as so-called "track meta data" TMD.
  • Different tiers or layers are formed, which are stored as so-called "tier meta data" TI in the metadata track TMT, wherein different tiers are provided for different quality levels.
  • FIG. 6 shows by way of example the tiers TI1 and TI2.
  • Corresponding metadata elements MS are provided in the metadata stream MTS, each of which describes the corresponding sample or video image at a point in time.
  • The metadata stream sections M0' and M1' are thus formed in the metadata stream MTS according to FIG. 6, these metadata stream sections each containing a metadata element MS and the metadata stream section M0 or M1.
  • The above-described metadata elements ME, MA' or ME' are provided at the underlying video packet level.
  • The metadata at the sample level and at the data packet level are thus stored in the metadata stream MTS with temporal and structural synchronization to the media data.
  • FIG. 7 shows a file format similar to FIG. 6, wherein the metadata in the metadata track TMT can be grouped such that individual metadata elements ME or MA' in the metadata stream MTS can be accessed via the metadata track TMT.
  • The access mechanisms in the metadata track TMT correspond to the access mechanisms in the video track VT.
  • These metadata can, for example, be sent together with the media data via a network (e.g. in the "RTP payload format" or in special SEI messages), whereby further adaptation can take place in this network or in a corresponding terminal.
  • The extraction of metadata from the data stream can be done, for example, by the extraction method described in document [3].
  • The method is used in such a way that a data reading device accesses the corresponding metadata in addition to the media data in order to achieve increased quality when extracting parts of the bit stream. It may not be necessary to access all metadata, since the description in the tracks indicates, for example, which media data belong to a certain local scaling level whose processing requires no additional metadata for the currently selected quality.
  • For the metadata, the same mechanisms for thinning out the data stream (so-called stream thinning) or for data extraction can be used as for the media data.
  • The grouping shown there can, inter alia, be used to generate a scalable metadata stream, i.e. it is possible to group the metadata in such a way that they are, for example, scalable by relevance.
  • Temporal scaling of the metadata, for example, is also possible. If the metadata is grouped like the media data, the remaining relevant metadata can easily be extracted when adapting the data stream.
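The benefit of grouping the metadata like the media data can be shown with a short sketch of stream thinning. It is an illustration under stated assumptions, not the method of the cited documents: the function `thin_streams` and the representation of a packet as a `(name, temporal_level)` tuple are hypothetical. Because packets and metadata elements are paired by position, applying one selection rule to both keeps them aligned after thinning.

```python
from typing import List, Tuple


def thin_streams(packets: List[Tuple[str, int]],
                 metas: List[str],
                 max_level: int) -> Tuple[List[Tuple[str, int]], List[str]]:
    """Keep only packets up to max_level, together with their metadata elements."""
    if len(packets) != len(metas):
        raise ValueError("each packet needs exactly one metadata element")
    kept = [(p, m) for p, m in zip(packets, metas) if p[1] <= max_level]
    return [p for p, _ in kept], [m for _, m in kept]


# Each packet carries an assumed temporal level; metadata is listed in the same order.
packets = [("NALu1", 0), ("NALu2", 1), ("NALu3", 2), ("NALu4", 0)]
metas = ["ME1", "ME2", "ME3", "ME4"]

thin_packets, thin_metas = thin_streams(packets, metas, max_level=1)
print(thin_packets)
print(thin_metas)
```

After thinning, the remaining packets and metadata elements can again be paired by counting from 1, since both lists were reduced by the same rule.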
  • Fig. 8 is a schematic diagram showing a transmitter 1, a receiver 2 and a digital information carrier 3 according to an embodiment of the invention.
  • The transmitter 1 comprises a first means 4 for generating a media data stream which contains a plurality of temporally sorted media data stream sections, wherein each media data stream section contains one or more media data packets.
  • The transmitter further comprises a second means for allocating to the media data stream a metadata stream synchronized with the media data stream, wherein the metadata stream contains metadata stream sections and one or more of the media data stream sections are each assigned a metadata stream section.
  • The transmitter further comprises third means 6 for arranging one or more first metadata elements in a respective metadata stream section such that each media data packet of the media data stream section to which the respective metadata stream section is assigned is assigned a first metadata element of the respective metadata stream section.
  • The data stream generated in the transmitter 1 may be sent to the receiver 2 via any communication path, which may be wireless or wired, as indicated by the arrow 10.
  • The receiver 2 can then suitably process the digital data stream according to the invention.
  • The receiver 2 has a reading means 7, which reads out the data stream such that the assignment of the media data packets to the first metadata elements and/or of the aggregators to the second metadata elements and/or of the extractors to the third metadata elements is determined by numbering the read media data packets and/or aggregators and/or extractors and simultaneously numbering the read first and/or second and/or third metadata elements.
  • The transmitter 1 preferably interacts with a digital information carrier 3, as indicated by the double arrow 11 in FIG. 8.
  • On the digital information carrier 3, a first data storage area 8 and a second data storage area 9 are provided.
  • The information in the first data storage area 8 can be used to generate a media data stream containing a plurality of time-sorted media data stream sections, each media data stream section containing one or more media data packets.
  • The information in the second data storage area 9 can be used to generate a metadata stream which is synchronized in time with the media data stream, wherein the metadata stream contains metadata stream sections and one or more of the media data stream sections are each associated with a metadata stream section.
  • One or more first metadata elements are arranged in the digital information carrier in a respective metadata stream section such that each media data packet of the media data stream section to which the respective metadata stream section is assigned is assigned a first metadata element of the respective metadata stream section.
  • TMD track meta data

Landscapes

  • Engineering & Computer Science (AREA)
  • Multimedia (AREA)
  • Signal Processing (AREA)
  • Databases & Information Systems (AREA)
  • Two-Way Televisions, Distribution Of Moving Picture Or The Like (AREA)

Abstract

The invention relates to a method for generating a digital data stream. According to this method, a media data stream (MDS) containing a plurality of chronologically sorted media data stream sections (S0,..., S7) is generated, each media data stream section containing one or more media data packets (NALu). A metadata stream (MTS) synchronized with the media data stream (MDS) is associated with the media data stream (MDS), the metadata stream (MTS) containing metadata stream sections (M0,..., M7), and a metadata stream section (M0,..., M7) being associated in each case with one or more of the media data stream sections (S0,..., S7). The method according to the invention is characterized in that one or more first metadata elements (ME) are arranged in a respective metadata stream section (M0,..., M7) such that a first metadata element (ME) of the respective metadata stream section (M0,..., M7) is assigned to each media data packet (NALu) of the media data stream section (S0,..., S7) with which the respective metadata stream section (M0,..., M7) is associated.
PCT/EP2007/051571 2006-03-27 2007-02-19 Procédé pour générer un flux de données numérique WO2007110283A1 (fr)

Priority Applications (1)

Application Number Priority Date Filing Date Title
EP07704647A EP2002654A1 (fr) 2006-03-27 2007-02-19 Procédé pour générer un flux de données numérique

Applications Claiming Priority (4)

Application Number Priority Date Filing Date Title
DE102006014036.2 2006-03-27
DE102006014036 2006-03-27
DE102006045140.6 2006-09-25
DE102006045140A DE102006045140A1 (de) 2006-03-27 2006-09-25 Verfahren zur Erzeugung eines digitalen Datenstroms

Publications (1)

Publication Number Publication Date
WO2007110283A1 true WO2007110283A1 (fr) 2007-10-04

Family

ID=38055167

Family Applications (1)

Application Number Title Priority Date Filing Date
PCT/EP2007/051571 WO2007110283A1 (fr) 2006-03-27 2007-02-19 Procédé pour générer un flux de données numérique

Country Status (4)

Country Link
EP (1) EP2002654A1 (fr)
KR (1) KR20090009832A (fr)
DE (1) DE102006045140A1 (fr)
WO (1) WO2007110283A1 (fr)

Citations (2)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
US20040006575A1 (en) * 2002-04-29 2004-01-08 Visharam Mohammed Zubair Method and apparatus for supporting advanced coding formats in media files
US20050110662A1 (en) * 2003-11-26 2005-05-26 Debargha Mukherjee Method and apparatus for updating sequences in a bitstream

Non-Patent Citations (1)

* Cited by examiner, † Cited by third party
Title
SINGER D ET AL: "WD 3.0 of ISO/IEC 14496-15/PDAM2 (SVC File Format)", ISO/IEC JTC1/SC29/WG11 MPEG06/N7906, 16 January 2006 (2006-01-16) - 20 January 2006 (2006-01-20), Bangkok meeting, pages 1 - 32, XP002436531 *

Cited By (4)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
EP2150060A1 (fr) * 2008-07-28 2010-02-03 Alcatel, Lucent Procédé et agencement pour codage vidéo
WO2010012501A1 (fr) * 2008-07-28 2010-02-04 Alcatel Lucent Procédé et système de codage vidéo
WO2010060442A1 (fr) * 2008-11-26 2010-06-03 Telefonaktiebolaget Lm Ericsson (Publ) Technique de manipulation de contenu multimédia devant être accessible par l'intermédiaire de multiples pistes multimédias
US8798264B2 (en) 2008-11-26 2014-08-05 Telefonaktiebolaget Lm Ericsson (Publ) Technique for handling media content to be accessible via multiple media tracks

Also Published As

Publication number Publication date
EP2002654A1 (fr) 2008-12-17
DE102006045140A1 (de) 2007-10-18
KR20090009832A (ko) 2009-01-23

Similar Documents

Publication Publication Date Title
DE112012000421B4 (de) Wiedergabevorrichtung, Verfahren zum Steuern der Wiedergabevorrichtung, Erzeugungsvorrichtung, Verfahren zum Steuern der Erzeugungsvorrichtung, Aufzeichnungsmedium, Datenstruktur, Steuerprogramm und Aufzeichnungsmedium, auf welchem das Programm gespeichert ist
DE3639026C2 (de) Hochauflösendes Bildübertragungsverfahren
DE69823168T2 (de) Verfahren und gerät zur kodierung und formatierung von daten eines videoprogramms zur erzeugung sich mehrfach überlappender darstellungen des videoprogramms
DE60109423T2 (de) Videokodierung mit prädiktiver bitebenenkodierung und progressiver fein-granularitätsskalierung (pfgs)
DE69233411T2 (de) Verfahren und Einrichtung zur Kompression von sich bewegenden Videobildern mit adaptiver Bitzuordnung und Quantisierung
DE10392598T5 (de) Unterstützung von fortschrittlichen Codierungsformaten in Mediendateien
DE102005001287A1 (de) Verfahren und Vorrichtung zur Verarbeitung von skalierbaren Daten
DE102005032952A1 (de) Statistischer Multiplexer mit schützenden Charakteristika vor durch redundante Systemelemente erzeugten äußeren Nachrichten
DE10392282T5 (de) Verfahren und Vorrichtung zum Unterstützen von AVC in MP4
WO2011144615A2 (fr) Fournisseur de flux de transport, fournisseur de signal dab, analyseur de flux de transport, récepteur dab, procédé, programme informatique et signal de flux de transport
EP1371229A1 (fr) Procede de compression et de decompression de donnees video
DE60211790T2 (de) Videokodierung mit konstanter Qualität
EP2002654A1 (fr) Procédé pour générer un flux de données numérique
EP1247402A1 (fr) Procede pour modifier le contenu informatif d'un flux de transport de donnees video dans des regions selectionnees
DE69934206T2 (de) Übertragungssystem für multiplex signale
DE60008141T2 (de) Digitalfernsehrundfunkverfahren, entsprechendes digitalsignal und entsprechende vorrichtung
DE60223054T2 (de) Verbesserungen bezüglich der inhaltsauswahl
EP1554884A1 (fr) Ensemble circuit et procede pour la reception de signaux de television numeriques dans des recepteurs de television mobile
DE102020106101B4 (de) Verfahren zur automatisierten Anpassung und/oder Erzeugung eines Encoders und/oder zur Erstellung eines Medienstreams, Recheneinheit und Computerprogrammprodukt
EP1334621B1 (fr) Procede et dispositif d'enregistrement de donnees numeriques audiovisuelles codees
DE102016216346A1 (de) Vorrichtung und verfahren zum bestimmen einer qualität eines mediadatenstroms
WO2007009849A1 (fr) Procede d'enregistrement de differents elements donnees d'un flux de donnees modulable dans un fichier et un dispositif associe
AT503668B1 (de) Verfahren und vorrichtung zur darstellung von signalen auf einer anzeigevorrichtung
EP1588564A1 (fr) Procede de reduction de donnees
DE102011102348B4 (de) Verfahren und Vorrichtung zum Bereitstellen eines IP-Fernsehprogramms

Legal Events

Date Code Title Description
WWE Wipo information: entry into national phase

Ref document number: 200780010925.2

Country of ref document: CN

121 Ep: the epo has been informed by wipo that ep was designated in this application

Ref document number: 07704647

Country of ref document: EP

Kind code of ref document: A1

WWE Wipo information: entry into national phase

Ref document number: 2007704647

Country of ref document: EP

NENP Non-entry into the national phase

Ref country code: DE

WWE Wipo information: entry into national phase

Ref document number: 1020087026259

Country of ref document: KR