WO2008088132A1 - Time-stamping apparatus and method for rtp packetization of svc coded video, and rtp packetization system using the same - Google Patents

Time-stamping apparatus and method for rtp packetization of svc coded video, and rtp packetization system using the same Download PDF

Info

Publication number
WO2008088132A1
WO2008088132A1 PCT/KR2007/006636 KR2007006636W WO2008088132A1 WO 2008088132 A1 WO2008088132 A1 WO 2008088132A1 KR 2007006636 W KR2007006636 W KR 2007006636W WO 2008088132 A1 WO2008088132 A1 WO 2008088132A1
Authority
WO
WIPO (PCT)
Prior art keywords
nal unit
rtp
timestamp
picture
value
Prior art date
Application number
PCT/KR2007/006636
Other languages
French (fr)
Inventor
Soon-Heung Jung
Jae-Gon Kim
Jin-Woo Hong
Kwang-Deok Seo
Chul-Wook Moon
Jin-Won Lee
Original Assignee
Electronics And Telecommunications Research Institute
Priority date (The priority date is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the date listed.)
Filing date
Publication date
Priority claimed from KR1020070096872A external-priority patent/KR100897525B1/en
Application filed by Electronics And Telecommunications Research Institute filed Critical Electronics And Telecommunications Research Institute
Priority to US12/523,375 priority Critical patent/US20100046552A1/en
Publication of WO2008088132A1 publication Critical patent/WO2008088132A1/en

Links

Classifications

    • HELECTRICITY
    • H04ELECTRIC COMMUNICATION TECHNIQUE
    • H04NPICTORIAL COMMUNICATION, e.g. TELEVISION
    • H04N21/00Selective content distribution, e.g. interactive television or video on demand [VOD]
    • H04N21/60Network structure or processes for video distribution between server and client or between remote clients; Control signalling between clients, server and network components; Transmission of management data between server and client, e.g. sending from server to client commands for recording incoming content stream; Communication details between server and client 
    • H04N21/63Control signaling related to video distribution between client, server and network components; Network processes for video distribution between server and clients or between remote clients, e.g. transmitting basic layer and enhancement layers over different transmission paths, setting up a peer-to-peer communication via Internet between remote STB's; Communication protocols; Addressing
    • H04N21/643Communication protocols
    • H04N21/6437Real-time Transport Protocol [RTP]
    • HELECTRICITY
    • H04ELECTRIC COMMUNICATION TECHNIQUE
    • H04NPICTORIAL COMMUNICATION, e.g. TELEVISION
    • H04N21/00Selective content distribution, e.g. interactive television or video on demand [VOD]
    • H04N21/40Client devices specifically adapted for the reception of or interaction with content, e.g. set-top-box [STB]; Operations thereof
    • H04N21/43Processing of content or additional data, e.g. demultiplexing additional data from a digital video stream; Elementary client operations, e.g. monitoring of home network or synchronising decoder's clock; Client middleware
    • H04N21/4302Content synchronisation processes, e.g. decoder synchronisation
    • H04N21/4307Synchronising the rendering of multiple content streams or additional data on devices, e.g. synchronisation of audio on a mobile phone with the video output on the TV screen
    • H04N21/43072Synchronising the rendering of multiple content streams or additional data on devices, e.g. synchronisation of audio on a mobile phone with the video output on the TV screen of multiple content streams on the same device
    • HELECTRICITY
    • H04ELECTRIC COMMUNICATION TECHNIQUE
    • H04NPICTORIAL COMMUNICATION, e.g. TELEVISION
    • H04N21/00Selective content distribution, e.g. interactive television or video on demand [VOD]
    • H04N21/20Servers specifically adapted for the distribution of content, e.g. VOD servers; Operations thereof
    • H04N21/23Processing of content or additional data; Elementary server operations; Server middleware
    • H04N21/234Processing of video elementary streams, e.g. splicing of video streams or manipulating encoded video stream scene graphs
    • H04N21/2343Processing of video elementary streams, e.g. splicing of video streams or manipulating encoded video stream scene graphs involving reformatting operations of video signals for distribution or compliance with end-user requests or end-user device requirements
    • H04N21/234318Processing of video elementary streams, e.g. splicing of video streams or manipulating encoded video stream scene graphs involving reformatting operations of video signals for distribution or compliance with end-user requests or end-user device requirements by decomposing into objects, e.g. MPEG-4 objects
    • HELECTRICITY
    • H04ELECTRIC COMMUNICATION TECHNIQUE
    • H04NPICTORIAL COMMUNICATION, e.g. TELEVISION
    • H04N21/00Selective content distribution, e.g. interactive television or video on demand [VOD]
    • H04N21/20Servers specifically adapted for the distribution of content, e.g. VOD servers; Operations thereof
    • H04N21/23Processing of content or additional data; Elementary server operations; Server middleware
    • H04N21/242Synchronization processes, e.g. processing of PCR [Program Clock References]
    • HELECTRICITY
    • H04ELECTRIC COMMUNICATION TECHNIQUE
    • H04NPICTORIAL COMMUNICATION, e.g. TELEVISION
    • H04N21/00Selective content distribution, e.g. interactive television or video on demand [VOD]
    • H04N21/60Network structure or processes for video distribution between server and client or between remote clients; Control signalling between clients, server and network components; Transmission of management data between server and client, e.g. sending from server to client commands for recording incoming content stream; Communication details between server and client 
    • H04N21/63Control signaling related to video distribution between client, server and network components; Network processes for video distribution between server and clients or between remote clients, e.g. transmitting basic layer and enhancement layers over different transmission paths, setting up a peer-to-peer communication via Internet between remote STB's; Communication protocols; Addressing
    • H04N21/647Control signaling between network components and server or clients; Network processes for video distribution between server and clients, e.g. controlling the quality of the video stream, by dropping packets, protecting content from unauthorised alteration within the network, monitoring of network load, bridging between two different networks, e.g. between IP and wireless
    • H04N21/64784Data processing by the network
    • H04N21/64792Controlling the complexity of the content stream, e.g. by dropping packets
    • HELECTRICITY
    • H04ELECTRIC COMMUNICATION TECHNIQUE
    • H04NPICTORIAL COMMUNICATION, e.g. TELEVISION
    • H04N21/00Selective content distribution, e.g. interactive television or video on demand [VOD]
    • H04N21/80Generation or processing of content or additional data by content creator independently of the distribution process; Content per se
    • H04N21/85Assembly of content; Generation of multimedia applications
    • H04N21/854Content authoring
    • H04N21/8547Content authoring involving timestamps for synchronizing content

Definitions

  • the present invention relates to a time-stamping apparatus and method for real time transport protocol (RTP) packetization of a scalable video coding (SVC) coded video, and a RTP packetization system using the same; and, more particularly, to a time- stamping apparatus and method for the RTP packetization of a SVC coded video, and a RTP packetization system using the same, which set a timestamp value for an instantaneous decoding refresh (IDR) picture that is the first picture of a SVC bit stream and generate a timestamp of a network abstraction layer (NAL) unit using a picture property and a temporal_level (TL) among header information of an inputted NAL unit.
  • RTP real time transport protocol
  • Scalable video coding is a H.264 scalable coding technology that was developed to overcome the disadvantages of the scalability of scalable coding in MPEG-2 and MPEG-4, such as a low compression rate, the incapability of supporting integrated scalability, and high embodying complexity.
  • the SVC encodes a plurality of video layers to one bit sequence.
  • the layers of SVC are constituted of a base layer and scalable layers that can be stacked on the base layer consecutively.
  • Each of the scalable layers can express the maximum bit rate, the maximum frame rate, and a resolution based on the information of lower layers.
  • the SVC if a plurality of scalable layers are stacked, the SVC is a coding technology suitable to a multimedia contents service of a universal multimedia access (UMA) that can solve the diversity problems related to bandwidths, the performance of a receiving terminal, resolutions in a heterogeneous network environment.
  • UMA universal multimedia access
  • a SVC coder in a video coding layer generates the base layer coding information and the scalable coding information of a scalable layer in a unit of a slice.
  • Each of the generated slices is generated as a network abstraction layer (NAL) unit in a NAL layer again and stored in a SVC bit-stream.
  • NAL network abstraction layer
  • a RTP packetization step is performed to transmit the SVC bit-stream through an Internet protocol (IP) network.
  • IP Internet protocol
  • RTP timestamp information must be transmitted to a receiving end by inserting the RTP timestamp information into a RTP header in order to synchronize with different types of media information.
  • An embodiment of the present invention is directed to providing a time- stamping apparatus and method for the RTP packetization of a SVC coded video, and a RTP packetization system using the same, which set a timestamp value for an instantaneous decoding refresh (IDR) picture that is the first picture of a SVC bit stream and generate a timestamp of a network abstraction layer (NAL) unit using a picture property and a temporal_level (TL) among header information of an inputted NAL unit.
  • IDR instantaneous decoding refresh
  • a method for generating a timestamp for a real time transport protocol (RTP) packetization of a scalable video coding (SVC) video including the steps of: a) setting a RTP timestamp value for an instantaneous decoding refresh (IDR) picture; and b) generating a RTP timestamp of a corresponding NAL unit using picture properties and a temporal_level (TL) value among header information of an input network abstraction layer (NAL) unit.
  • RTP real time transport protocol
  • IDR instantaneous decoding refresh
  • a time stamping apparatus for a real time transport protocol (RTP) packetization of a scalable video coding (SVC) video including: a network abstraction layer (NAL) unit classifying unit for checking a header of an input NAL unit and classifying the input NAL units based on a picture property; a first timestamp calculating unit for cal- culating a RTP timestamp value for a NAL unit classified as a key picture by the NAL unit classifying unit; a second timestamp calculating unit for calculating a RTP timestamp value for a NAL unit classified as a non-key picture by the NAL unit classifying unit; and a controlling unit for setting a RTP timestamp value for an instantaneous decoding refresh (IDR) picture and controlling the first and second timestamp calculating unit for calculating a RTP timestamp value of a corresponding NAL unit.
  • NAL network abstraction layer
  • a system for real time transport protocol (RTP) packetization of a scalable video coding (SVC) bit- stream including: a SVC encoding unit for storing coding information, which is generated when an input video sequence is coded based on SVC, in a SVC bit-stream in a form of a network abstraction layer (NAL) unit; a RTP timestamp generating unit for generating a RTP timestamp with reference to a header of a NAL unit generated in the SVC encoding unit; and a RTP packetizer for generating a RTP packet by inserting the generated RTP timestamp in a header of a RTP packet when a RTP packet is generated using the generated NAL unit.
  • NAL network abstraction layer
  • a time-stamping apparatus and method for the RTP packetization of a SVC coded video, and a RTP packetization system using the same can packetize a SVC video based on a real-time transport protocol (RTP) by setting a timestamp value for an instantaneous decoding refresh (IDR) picture that is the first picture of a SVC bit stream and generating a timestamp of a network abstraction layer (NAL) unit using a picture property and a temporal_level (TL) among header information of an inputted NAL unit although a display order of pictures is different from a coding order of the pictures or a transmit order.
  • RTP real-time transport protocol
  • a time-stamping apparatus and method for the RTP packetization of a SVC coded video, and a RTP packetization system using the same according to the present invention can automatically generate a RTP timestamp value that is required for the RTP packetization in order to transmit NAL units having a SVC bit stream through an IP network such as Internet.
  • FIG. 1 is a diagram illustrating a RTP packetization system of a SVC bit-stream in accordance with an embodiment of the present invention.
  • FIG. 2 is a diagram depicting a time-stamping apparatus for RTP packetization of a
  • Fig. 3 is a diagram showing a RTP packet in accordance with an embodiment of the present invention.
  • Fig. 4 is a diagram illustrating a header of a NAL unit in accordance with an embodiment of the present invention.
  • FIG. 5 is a diagram showing a SVC video screen and a hierarchy structure in accordance with an embodiment of the present invention.
  • FIG. 6 is a flowchart of a method for generating a timestamp for RTP packetization of a SVC video in accordance with an embodiment of the present invention.
  • Fig. 7 is a diagram for describing a procedure of setting TL_Group and
  • FIG. 1 is a diagram illustrating a RTP packetization system of a SVC bit-stream in accordance with an embodiment of the present invention.
  • a system for packetizing a SVC bit-stream based on a real-time transport protocol (RTP) includes a SVC encoder 11, a time-stamping apparatus 12, and a RTP packetizer 13.
  • the SVC encoder 11 stores coding information in a form of a network abstraction layer (NAL) unit, where the coding information is generated when an input video sequence based on scalable video coding (SVC).
  • NAL network abstraction layer
  • the time-stamping apparatus 12 generate a RTP timestamp with reference to a header of a NAL unit generated in the SVC encoder 11.
  • the RTP packetizer 13 generates a RTP packet by inserting a RTP timestamp generated from the time- stamping apparatus 12 in to the header of the RTP packet using the NAL unit generated in the SVC encoder 11.
  • the SVC bit-stream is constituted of an instantaneous decoding refresh (IDR) picture and at least one of group of pictures (GOP).
  • IDR instantaneous decoding refresh
  • GOP group of pictures
  • FIG. 2 is a diagram depicting a time-stamping apparatus for RTP packetization of a
  • the time stamping apparatus for packetizing a SVC video based on a real-time transport protocol includes a NAL unit classifier 21, a first time stamp calculator 22, a second time stamp calculator 23, and a controller 24.
  • the NAL unit classifier 21 classifies NAL units based on the property of a picture by checking the headers of inputted NAL units.
  • the first time stamp calculator 22 using a temporal_level (TL) among header information of NAL units which are classified as a key picture by the NAL unit classifier 21.
  • the second timestamp calculator 23 calculates a RTP timestamp value with reference to the TL among the header information of NAL units which are classified as non key picture by the NAL unit classifier 21 and an order in a TL group.
  • the controller 24 sets a RTP timestamp value for an instantaneous decoding refresh picture which is the first picture of a SVC bit-stream and controls the first and second timestamp calculators 22 and 23 to calculating a RTP timestamp value of a corresponding NAL unit.
  • the controller 24 performs another control function for inserting RTP timestamps calculated by the first and second timestamp calculators 22 and 23 to the header of a corresponding RTP packet.
  • controller 24 allocates the set RTP timestamp value when a NAL unit corresponding to the IDR picture inputs.
  • Fig. 3 is a diagram showing a RTP packet in accordance with an embodiment of the present invention.
  • the RTP packet according to the present embodiment includes a RTP header 21 and a RTP pay load 32.
  • the RTP header 31 includes a 32-bit timestamp period 301.
  • the timestamp information for a currently transmitted SVC video picture (NAL unit) is recorded in the timestamp period 301.
  • one SVC video picture includes at least one of NAL units because one SVC video picture is formed by decoding at least one of NAL units.
  • Fig. 4 is a diagram illustrating a header of a NAL unit according to an embodiment of the present invention.
  • the diagram a) shows a header structure of a base layer NAL unit
  • the diagram b) shows a header structure of a scalable layer NAL unit.
  • the header structures a) and b) store encoding information generated in SVC.
  • the header structure a) can be compatible with H.264.
  • a spatio-temporal hierarchy relation for a NAL unit can be derived from a temporal_level (TL), DID, and QL field information of the header structure b).
  • Information used for generating a timestamp is the TL information representing a hierarchy between temporal layers for temporal scalability.
  • Fig. 5 is a diagram showing a SVC video picture and a hierarchy structure used in the present invention.
  • the SVC video picture and the hierarchy structure denotes an instantaneous decoding refresh (IDR) picture that is a start part of a SVC stream and pictures in the first group among a plurality of GOPs, where the GOP stands for group of picture.
  • IDR instantaneous decoding refresh
  • One GOP includes total 16 pictures.
  • the IDR picture is marked with 0
  • the first B -picture in a GOP is marked with
  • the picture numbers 1 to 16 are matched with an order of displaying the pictures on a monitor.
  • a supportable picture resolution in the base layer 501 is QCIF
  • a supportable picture resolution in the spatial scalable layer 502 is CIF.
  • a hierarchical B -picture scheme is applied to provide temporal scalability, and a TL value is used for displaying a supportable frame rate among a TL field, a DID field, and a QL filed.
  • the TL value is displayed at the center of each picture display in a form of rectangle.
  • a frame rate up to 1.875fps (frame per second).
  • the frame rate can be supported up to the maximum 15 fps with QCIF. Since the maximum TL value is 4 in the spatial scalable layer 502, the frame rate can be supported up to the maximum 30 fps with CIF.
  • FIG. 6 is a flowchart of a method for generating a timestamp for RTP packetization of a SVC video in accordance with an embodiment of the present invention.
  • a RTP timestamp value is set for an instantaneous decoding refresh picture that is the first picture of a SVC bit-stream at step S601.
  • the timestamp value of an IDR picture is generally set as 0.
  • the timestamp value of the IDR picture may be set as a predetermined number for security purpose. Therefore, if a NAL unit of an IDR picture inputs, the set RTP timestamp value is allocated.
  • a RTP timestamp value is calculated using Eq. 1 at step S 603. That is, a RTP timestamp value is calculated using a TL value among the header information of a NAL unit if the input NAL unit is the key picture.
  • TS Kev Pk (T m ⁇ ) IDRJS + Clock :Jntxf ⁇ xG0P_Nwn
  • T MAX denotes the maximum TL value among temporal_level (TL) values of
  • a clock interval (Clock_Int) is a time interval of a timestamp value between pictures.
  • IDR_TS denotes a timestamp value for an IDR picture that is the first picture of a SVC stream
  • G0P_Num( ⁇ l) denotes an order number of a current GOP among all GOPs in a SVC stream.
  • the related standard defines that 90 KHz is used as a sampling clock used for generating a RTP timestamp value for a SVC video picture.
  • the inter-frame clock interval can be calculated through Eq. 2 in case of a video supporting a frame rate up to 30fps.
  • a RTP timestamp value is calculated using Eq. 3 at step S604. That is, if the input NAL unit is not a NAL unit of a key picture, a RTP timestamp value is calculated with reference to a TL value or an order in a TL group among the header information of the input NAL unit.
  • TS P ⁇ (T, n) IDR _TS + JnI x 2 Ttta ⁇ ⁇ (GOP _Num - 1) ⁇
  • T ( I ⁇ J ⁇ J is a TL value in a current picture
  • n is an order number of a current picture in the same TL _Group, and its range is
  • Fig. 7 is a diagram for describing a procedure of setting TL_Group and
  • TL_Group_Size in accordance with an embodiment of the present invention.
  • pictures are encoded and transmitted in an order of TL values. That is, the picture having a smaller TL value is encoded and transmitted first.
  • TL_Group denotes a group of pictures (NAL units) having the same TL value in a GOP
  • TL_Group_Size denotes the number of pictures in the same TL_Group.
  • the 16 th picture having a TL value of 0 forms an independent TL_Group, and the
  • TL_Group_Size becomes 1.
  • TL_Group_Size becomes 1.
  • a n value of the first picture in each TL_Groups becomes 0, and a n value of the second picture in each TL_Groups becomes 1.
  • the n value of the second picture in the TL_Group including the 2 nd picture, 6 th picture, 10 th picture and 14 th picture becomes 0, and the n value of the 6 th picture becomes 1.
  • the calculated RTP timestamp may be inserted into a header of a corresponding RTP packet.
  • NAL units in the same picture may exist for one picture.
  • a timestamp value is calculated for the first NAL unit of a picture, it is preferable to use the calculated timestamp value for the other NAL units in the same picture because NAL units in the same picture have the same time information.
  • the above described method according to the present invention can be embodied as a program and stored on a computer readable recording medium.
  • the computer readable recording medium is any data storage device that can store data which can be thereafter read by the computer system.
  • the computer readable recording medium includes a read-only memory (ROM), a random-access memory (RAM), a CD-ROM, a floppy disk, a hard disk and an optical magnetic disk.

Landscapes

  • Engineering & Computer Science (AREA)
  • Multimedia (AREA)
  • Signal Processing (AREA)
  • Computer Security & Cryptography (AREA)
  • Computer Networks & Wireless Communication (AREA)
  • Compression Or Coding Systems Of Tv Signals (AREA)

Abstract

Provided are a time-stamping apparatus and method for RTP packetization of a SVC coded video, and a RTP packetization system using the same. The time stamping apparatus includes: a NAL unit classifier for checking a header of an input NAL unit and classifying the input NAL units based on a picture property; a first timestamp calculator for calculating a RTP timestamp value for a NAL unit classified as a key picture by the NAL unit classifier; a second timestamp calculator for calculating a RTP timestamp value for a NAL unit classified as a non-key picture by the NAL unit classifier; and a controller for setting a RTP timestamp value for an instantaneous decoding refresh (IDR) picture and controlling the first and second timestamp calculators for calculating a RTP timestamp value of a corresponding NAL unit.

Description

Description
TIME-STAMPING APPARATUS AND METHOD FOR RTP
PACKETIZATION OF SVC CODED VIDEO, AND RTP PACK-
ETIZATION SYSTEM USING THE SAME
Technical Field
[1] The present invention relates to a time-stamping apparatus and method for real time transport protocol (RTP) packetization of a scalable video coding (SVC) coded video, and a RTP packetization system using the same; and, more particularly, to a time- stamping apparatus and method for the RTP packetization of a SVC coded video, and a RTP packetization system using the same, which set a timestamp value for an instantaneous decoding refresh (IDR) picture that is the first picture of a SVC bit stream and generate a timestamp of a network abstraction layer (NAL) unit using a picture property and a temporal_level (TL) among header information of an inputted NAL unit.
[2] This work was supported by IT R & D program of MIC/ IITA [2005-S- 103-02, "Development of Ubiquitous Content Access Technology for Convergence of Broadcasting and Communications"]. Background Art
[3] Scalable video coding (SVC) is a H.264 scalable coding technology that was developed to overcome the disadvantages of the scalability of scalable coding in MPEG-2 and MPEG-4, such as a low compression rate, the incapability of supporting integrated scalability, and high embodying complexity.
[4] The SVC encodes a plurality of video layers to one bit sequence. The layers of SVC are constituted of a base layer and scalable layers that can be stacked on the base layer consecutively. Each of the scalable layers can express the maximum bit rate, the maximum frame rate, and a resolution based on the information of lower layers.
[5] Since it is possible to support various bit rates, frame rates, and resolutions in the
SVC if a plurality of scalable layers are stacked, the SVC is a coding technology suitable to a multimedia contents service of a universal multimedia access (UMA) that can solve the diversity problems related to bandwidths, the performance of a receiving terminal, resolutions in a heterogeneous network environment.
[6] A SVC coder in a video coding layer (VCL) generates the base layer coding information and the scalable coding information of a scalable layer in a unit of a slice. Each of the generated slices is generated as a network abstraction layer (NAL) unit in a NAL layer again and stored in a SVC bit-stream.
[7] Here, a RTP packetization step is performed to transmit the SVC bit-stream through an Internet protocol (IP) network. In the RTP packetization step, RTP timestamp information must be transmitted to a receiving end by inserting the RTP timestamp information into a RTP header in order to synchronize with different types of media information.
[8] Particularly, it is essential to transmit the RTP timestamp to support lip synchronization between video and audio in a receiving end when a SVC video is serviced with an audio such as AAC.
[9] An international standard for the SVC is not completely prepared, and it is expected to completely prepare the international standard for the SVC by a year of 2007. Therefore, no method for automatically generating a timestamp and recording the timestamp when a SVC bit-stream is loaded in a RTP packet was introduced. Disclosure of Invention Technical Problem
[10] An embodiment of the present invention is directed to providing a time- stamping apparatus and method for the RTP packetization of a SVC coded video, and a RTP packetization system using the same, which set a timestamp value for an instantaneous decoding refresh (IDR) picture that is the first picture of a SVC bit stream and generate a timestamp of a network abstraction layer (NAL) unit using a picture property and a temporal_level (TL) among header information of an inputted NAL unit.
[11] Other objects and advantages of the present invention can be understood by the following description, and become apparent with reference to the embodiments of the present invention. Also, it is obvious to those skilled in the art of the present invention that the objects and advantages of the present invention can be realized by the means as claimed and combinations thereof. Technical Solution
[12] In accordance with an aspect of the present invention, there is provided a method for generating a timestamp for a real time transport protocol (RTP) packetization of a scalable video coding (SVC) video, including the steps of: a) setting a RTP timestamp value for an instantaneous decoding refresh (IDR) picture; and b) generating a RTP timestamp of a corresponding NAL unit using picture properties and a temporal_level (TL) value among header information of an input network abstraction layer (NAL) unit.
[13] In accordance with an aspect of the present invention, there is provided a time stamping apparatus for a real time transport protocol (RTP) packetization of a scalable video coding (SVC) video, including: a network abstraction layer (NAL) unit classifying unit for checking a header of an input NAL unit and classifying the input NAL units based on a picture property; a first timestamp calculating unit for cal- culating a RTP timestamp value for a NAL unit classified as a key picture by the NAL unit classifying unit; a second timestamp calculating unit for calculating a RTP timestamp value for a NAL unit classified as a non-key picture by the NAL unit classifying unit; and a controlling unit for setting a RTP timestamp value for an instantaneous decoding refresh (IDR) picture and controlling the first and second timestamp calculating unit for calculating a RTP timestamp value of a corresponding NAL unit.
[14] In accordance with an aspect of the present invention, there is provided a system for real time transport protocol (RTP) packetization of a scalable video coding (SVC) bit- stream, including: a SVC encoding unit for storing coding information, which is generated when an input video sequence is coded based on SVC, in a SVC bit-stream in a form of a network abstraction layer (NAL) unit; a RTP timestamp generating unit for generating a RTP timestamp with reference to a header of a NAL unit generated in the SVC encoding unit; and a RTP packetizer for generating a RTP packet by inserting the generated RTP timestamp in a header of a RTP packet when a RTP packet is generated using the generated NAL unit.
Advantageous Effects
[15] A time-stamping apparatus and method for the RTP packetization of a SVC coded video, and a RTP packetization system using the same according to the present invention can packetize a SVC video based on a real-time transport protocol (RTP) by setting a timestamp value for an instantaneous decoding refresh (IDR) picture that is the first picture of a SVC bit stream and generating a timestamp of a network abstraction layer (NAL) unit using a picture property and a temporal_level (TL) among header information of an inputted NAL unit although a display order of pictures is different from a coding order of the pictures or a transmit order.
[16] A time-stamping apparatus and method for the RTP packetization of a SVC coded video, and a RTP packetization system using the same according to the present invention can automatically generate a RTP timestamp value that is required for the RTP packetization in order to transmit NAL units having a SVC bit stream through an IP network such as Internet. Brief Description of the Drawings
[17] Fig. 1 is a diagram illustrating a RTP packetization system of a SVC bit-stream in accordance with an embodiment of the present invention.
[18] Fig. 2 is a diagram depicting a time-stamping apparatus for RTP packetization of a
SVC video in accordance with an embodiment of the present invention.
[19] Fig. 3 is a diagram showing a RTP packet in accordance with an embodiment of the present invention. [20] Fig. 4 is a diagram illustrating a header of a NAL unit in accordance with an embodiment of the present invention.
[21] Fig. 5 is a diagram showing a SVC video screen and a hierarchy structure in accordance with an embodiment of the present invention.
[22] Fig. 6 is a flowchart of a method for generating a timestamp for RTP packetization of a SVC video in accordance with an embodiment of the present invention.
[23] Fig. 7 is a diagram for describing a procedure of setting TL_Group and
TL_Group_Size in accordance with an embodiment of the present invention. Mode for the Invention
[24] The advantages, features and aspects of the invention will become apparent from the following description of the embodiments with reference to the accompanying drawings, which is set forth hereinafter.
[25] Fig. 1 is a diagram illustrating a RTP packetization system of a SVC bit-stream in accordance with an embodiment of the present invention.
[26] As shown in Fig. 1, a system for packetizing a SVC bit-stream based on a real-time transport protocol (RTP) according to the present embodiment includes a SVC encoder 11, a time-stamping apparatus 12, and a RTP packetizer 13. The SVC encoder 11 stores coding information in a form of a network abstraction layer (NAL) unit, where the coding information is generated when an input video sequence based on scalable video coding (SVC).
[27] The time-stamping apparatus 12 generate a RTP timestamp with reference to a header of a NAL unit generated in the SVC encoder 11. The RTP packetizer 13 generates a RTP packet by inserting a RTP timestamp generated from the time- stamping apparatus 12 in to the header of the RTP packet using the NAL unit generated in the SVC encoder 11.
[28] The SVC bit-stream is constituted of an instantaneous decoding refresh (IDR) picture and at least one of group of pictures (GOP). One GOP includes 16 pictures.
[29] Fig. 2 is a diagram depicting a time-stamping apparatus for RTP packetization of a
SVC video in accordance with an embodiment of the present invention. As shown in Fig. 2, the time stamping apparatus for packetizing a SVC video based on a real-time transport protocol (RTP) includes a NAL unit classifier 21, a first time stamp calculator 22, a second time stamp calculator 23, and a controller 24.
[30] The NAL unit classifier 21 classifies NAL units based on the property of a picture by checking the headers of inputted NAL units. The first time stamp calculator 22 using a temporal_level (TL) among header information of NAL units which are classified as a key picture by the NAL unit classifier 21.
[31] The second timestamp calculator 23 calculates a RTP timestamp value with reference to the TL among the header information of NAL units which are classified as non key picture by the NAL unit classifier 21 and an order in a TL group. The controller 24 sets a RTP timestamp value for an instantaneous decoding refresh picture which is the first picture of a SVC bit-stream and controls the first and second timestamp calculators 22 and 23 to calculating a RTP timestamp value of a corresponding NAL unit.
[32] Here, the controller 24 performs another control function for inserting RTP timestamps calculated by the first and second timestamp calculators 22 and 23 to the header of a corresponding RTP packet.
[33] Furthermore, the controller 24 allocates the set RTP timestamp value when a NAL unit corresponding to the IDR picture inputs.
[34] Fig. 3 is a diagram showing a RTP packet in accordance with an embodiment of the present invention. Referring to Fig. 3, the RTP packet according to the present embodiment includes a RTP header 21 and a RTP pay load 32.
[35] Here, the RTP header 31 includes a 32-bit timestamp period 301. The timestamp information for a currently transmitted SVC video picture (NAL unit) is recorded in the timestamp period 301.
[36] Here, one SVC video picture includes at least one of NAL units because one SVC video picture is formed by decoding at least one of NAL units.
[37] Fig. 4 is a diagram illustrating a header of a NAL unit according to an embodiment of the present invention. The diagram a) shows a header structure of a base layer NAL unit, and the diagram b) shows a header structure of a scalable layer NAL unit.
[38] As shown in Fig. 4, the header structures a) and b) store encoding information generated in SVC. Here, the header structure a) can be compatible with H.264.
[39] Also, a spatio-temporal hierarchy relation for a NAL unit can be derived from a temporal_level (TL), DID, and QL field information of the header structure b).
[40] Information used for generating a timestamp is the TL information representing a hierarchy between temporal layers for temporal scalability.
[41] Fig. 5 is a diagram showing a SVC video picture and a hierarchy structure used in the present invention. As shown in Fig. 5, the SVC video picture and the hierarchy structure denotes an instantaneous decoding refresh (IDR) picture that is a start part of a SVC stream and pictures in the first group among a plurality of GOPs, where the GOP stands for group of picture. One GOP includes total 16 pictures.
[42] Here, the IDR picture is marked with 0, the first B -picture in a GOP is marked with
1, and a key picture the last picture in the GOP is marked with 16. The picture numbers 1 to 16 are matched with an order of displaying the pictures on a monitor.
[43] A supportable picture resolution in the base layer 501 is QCIF, and a supportable picture resolution in the spatial scalable layer 502 is CIF.
[44] A hierarchical B -picture scheme is applied to provide temporal scalability, and a TL value is used for displaying a supportable frame rate among a TL field, a DID field, and a QL filed.
[45] Also, the TL value is displayed at the center of each picture display in a form of rectangle. Here, if only key pictures having TL=O are transmitted, it is possible to support a frame rate up to 1.875fps (frame per second). If a B-picture having TL=I is transmitted with the key pictures, it is possible to support a frame rate up to 3.75fps.
[46] In addition, in case of transmitting a B-picture having TL=2, it is possible to support a frame rate up to 7.5fps. In case of transmitting B-pictures having TL=3 and TL=4, it is possible to support a frame rate up to 15fps and 30fps.
[47] Since the maximum TL value is 3 in the base layer 501, the frame rate can be supported up to the maximum 15 fps with QCIF. Since the maximum TL value is 4 in the spatial scalable layer 502, the frame rate can be supported up to the maximum 30 fps with CIF.
[48] Fig. 6 is a flowchart of a method for generating a timestamp for RTP packetization of a SVC video in accordance with an embodiment of the present invention.
[49] At first, a RTP timestamp value is set for an instantaneous decoding refresh picture that is the first picture of a SVC bit-stream at step S601. The timestamp value of an IDR picture is generally set as 0. However, the timestamp value of the IDR picture may be set as a predetermined number for security purpose. Therefore, if a NAL unit of an IDR picture inputs, the set RTP timestamp value is allocated.
[50] Then, picture property information is confirmed by checking the header of the input
NAL unit at step 602.
[51] If the NAL unit is a key picture which is the first picture in a GOP based on the checking result at step S602, a RTP timestamp value is calculated using Eq. 1 at step S 603. That is, a RTP timestamp value is calculated using a TL value among the header information of a NAL unit if the input NAL unit is the key picture.
[52] MathFigure 1
[Math.l]
TSKev Pk(T) = IDRJS + Clock :Jntxf^ xG0P_Nwn
[53] In Eq. 1, T MAX denotes the maximum TL value among temporal_level (TL) values of
NAL units in a current GOP. A clock interval (Clock_Int) is a time interval of a timestamp value between pictures. IDR_TS denotes a timestamp value for an IDR picture that is the first picture of a SVC stream, and G0P_Num(≥l) denotes an order number of a current GOP among all GOPs in a SVC stream.
[54] Hereinafter, a procedure of calculating a clock interval (Clock_Int) will be described in more detail with reference to Fig. 5. [55] Since the maximum value of TL is 4, the frame rate can be supportable up to maximum 30fps in a SVC video picture and a hierarchy structure as shown in Fig. 5.
Here, the related standard defines that 90 KHz is used as a sampling clock used for generating a RTP timestamp value for a SVC video picture. [56] Therefore, the inter-frame clock interval can be calculated through Eq. 2 in case of a video supporting a frame rate up to 30fps. [57] MathFigure 2
[Math.2]
_ , , r, , 1 t j 90,000Hz 90,00OcIoOkSZs
Inter - frame _Uock Jnterva! = = 3,000 clocks/frame
Max ^FR 30frames/s
[58] According to the confirming result at step S602, if the input NAL unit is not the key picture such as normal picture, a RTP timestamp value is calculated using Eq. 3 at step S604. That is, if the input NAL unit is not a NAL unit of a key picture, a RTP timestamp value is calculated with reference to a TL value or an order in a TL group among the header information of the input NAL unit.
[59] MathFigure 3
[Math.3]
TS (T, n) = IDR _TS +
Figure imgf000009_0001
JnI x 2Ttta ■< (GOP _Num - 1)}
+ Clock _ hit x 2Tua ~τ x (2 x n + 1) [60] In Eq. 3,
T ( I <Γ<J^ J is a TL value in a current picture, n is an order number of a current picture in the same TL _Group, and its range is
Q<}i <TL_ Group _ Size
[61] Hereinafter, a procedure of setting TL_Group and TL_Group_Size will be described in more detail with reference to Fig. 7.
[62] Fig. 7 is a diagram for describing a procedure of setting TL_Group and
TL_Group_Size in accordance with an embodiment of the present invention. In general, pictures are encoded and transmitted in an order of TL values. That is, the picture having a smaller TL value is encoded and transmitted first.
[63] As shown in Fig. 7, TL_Group denotes a group of pictures (NAL units) having the same TL value in a GOP, and TL_Group_Size denotes the number of pictures in the same TL_Group. [64] The 16th picture having a TL value of 0 forms an independent TL_Group, and the
TL_Group_Size becomes 1.
[65] The 8th picture having a TL value of 1 forms an independent TL_Group, and the
TL_Group_Size becomes 1.
[66] The 4th picture and the 12th picture having a TL value of 2 form an independent
TL_Group, and the TL_Group_Size becomes 2.
[67] The 2nd picture, 6th picture, 10th picture, and 14th picture, which have a TL value of 3, form an independent TL_Group, and TL_Group_Size becomes 4.
[68] The 1st picture, 3rd picture, 5th picture, 7th picture, 9th picture, 11th picture, 13th picture, and 15th picture, which have a TL value of 4, form an independent TL_Group, and TL_Group_Size becomes 8.
[69] Here, a n value of the first picture in each TL_Groups becomes 0, and a n value of the second picture in each TL_Groups becomes 1. For example, the n value of the second picture in the TL_Group including the 2nd picture, 6th picture, 10th picture and 14th picture becomes 0, and the n value of the 6th picture becomes 1.
[70] In addition, the calculated RTP timestamp may be inserted into a header of a corresponding RTP packet.
[71] As described above, it was described that only one NAL unit exists for one picture.
However, a plurality of NAL units may exist for one picture. In addition, if a timestamp value is calculated for the first NAL unit of a picture, it is preferable to use the calculated timestamp value for the other NAL units in the same picture because NAL units in the same picture have the same time information.
[72] The above described method according to the present invention can be embodied as a program and stored on a computer readable recording medium. The computer readable recording medium is any data storage device that can store data which can be thereafter read by the computer system. The computer readable recording medium includes a read-only memory (ROM), a random-access memory (RAM), a CD-ROM, a floppy disk, a hard disk and an optical magnetic disk.
[73] The present application contains subject matter related to Korean Patent Application
Nos. 2007-0006057 and 2007-0096872, filed in the Korean Intellectual Property Office on January 19, 2007, and September 21, 2007, the entire contents of which is incorporated herein by reference.
[74] While the present invention has been described with respect to certain preferred embodiments, it will be apparent to those skilled in the art that various changes and modifications may be made without departing from the spirits and scope of the invention as defined in the following claims. Industrial Applicability [75] The present invention can be used for RTP packetization of a SVC video.

Claims

Claims
[1] A method for generating a timestamp for a real time transport protocol (RTP) packetization of a scalable video coding (SVC) video, comprising the steps of: a) setting a RTP timestamp value for an instantaneous decoding refresh (IDR) picture; and b) generating a RTP timestamp of a corresponding NAL unit using picture properties and a temporal_level (TL) value among header information of an input network abstraction layer (NAL) unit.
[2] The method of claim 1, further comprising the step of: c) controlling to insert the generated RTP timestamp into a header of a corresponding RTP packet.
[3] The method of claim 1, wherein the step b) includes the steps of: b-1) confirming a picture property by checking a header of an input NAL unit; b-2) allocating the set RTP timestamp if the input NAL unit is a NAL unit of an
IDR picture; b-3) generating a RTP timestamp using a TL value if the input NAL unit is a
NAL unit of a key picture; and b-4) generating a RTP timestamp with reference to a TL value and an order in a
TL group if the input NAL unit is not a NAL unit of a key picture.
[4] The method of claim 3, wherein in the step b-3), a RTP timestamp of a NAL unit is calculated using Equation:
TSKev Plc(T)m)=IDR_TS+Oock_Intx2TM41 xGOP Jm
,where 1 ' M' AX denotes the maximum TL value among temporal_level (TL) values of NAL units in a current GOP, Clock_Int denotes a time interval of a timestamp value between pictures, IDR_TS denotes a timestamp value for an IDR picture that is the first picture of a SVC stream, and G0P_Num(≥l) denotes an order number of a current GOP among all GOPs in a SVC stream.
[5] The method of claim 3, wherein in the step b-4), a RTP timestamp of a NAL unit using Equation:
Figure imgf000012_0001
x(GOPJumA)} + Clock __ lnt x 2hm~τ x (2 x n + 1) where
Figure imgf000013_0001
denotes a TL value in a current picture, n is an order number of a current picture in the same TL _Group, and its range is ϋ<n <TL Jjfoiψ _ Size
[6] A time stamping apparatus for a real time transport protocol (RTP) packetization of a scalable video coding (SVC) video, comprising: a network abstraction layer (NAL) unit classifying means for checking a header of an input NAL unit and classifying the input NAL units based on a picture property; a first timestamp calculating means for calculating a RTP timestamp value for a NAL unit classified as a key picture by the NAL unit classifying means; a second timestamp calculating means for calculating a RTP timestamp value for a NAL unit classified as a non-key picture by the NAL unit classifying means; and a controlling means for setting a RTP timestamp value for an instantaneous decoding refresh (IDR) picture and controlling the first and second timestamp calculating means for calculating a RTP timestamp value of a corresponding NAL unit.
[7] The time stamping apparatus of claim 6, wherein the first timestamp calculating means calculates a RTP timestamp of a corresponding NAL unit using a temporal_level (TL) value among header information of a NAL unit, and the second time stamp calculating means calculates a RTP timestamp of a corresponding NAL unit with reference to a TL value among header information of a NAL unit and an order in a TL group.
[8] The time stamping apparatus of claim 6, wherein the controlling means performs a controlling function of inserting the calculated RTP timestamps from the first and second timestamp calculating means into a header of a corresponding RTP packet.
[9] The time stamping apparatus of claim 8, wherein the controlling means allocates the set RTP timestamp value if a NAL unit of an IDR picture inputs.
[10] A system for real time transport protocol (RTP) packetization of a scalable video coding (SVC) bit-stream, comprising: a SVC encoding means for storing coding information, which is generated when an input video sequence is coded based on SVC, in a SVC bit-stream in a form of a network abstraction layer (NAL) unit; a RTP timestamp generating means for generating a RTP timestamp with reference to a header of a NAL unit generated in the SVC encoding means; and a RTP packetization means for generating a RTP packet by inserting the generated RTP timestamp in a header of a RTP packet when a RTP packet is generated using the generated NAL unit.
[11] The system of claim 10, wherein the RTP timestamp generating means includes: a NAL unit classifying means for checking a header of an input NAL unit and classifying the input NAL units based on a picture property; a first timestamp calculating means for calculating a RTP timestamp value for a NAL unit classified as a key picture by the NAL unit classifying means; a second timestamp calculating means for calculating a RTP timestamp value for a NAL unit classified as a non-key picture by the NAL unit classifying means; and a controlling means for setting a RTP timestamp value for an instantaneous decoding refresh (IDR) picture and controlling the first and second timestamp calculating means for calculating a RTP timestamp value of a corresponding NAL unit.
[12] The system of claim 11, wherein the first timestamp calculating means calculates a RTP timestamp of a corresponding NAL unit using a temporal_level (TL) value among header information of a NAL unit, and the second time stamp calculating means calculates a RTP timestamp of a corresponding NAL unit with reference to a TL value among header information of a NAL unit and an order in a TL group.
PCT/KR2007/006636 2007-01-19 2007-12-18 Time-stamping apparatus and method for rtp packetization of svc coded video, and rtp packetization system using the same WO2008088132A1 (en)

Priority Applications (1)

Application Number Priority Date Filing Date Title
US12/523,375 US20100046552A1 (en) 2007-01-19 2007-12-18 Time-stamping apparatus and method for rtp packetization of svc coded video, and rtp packetization system using the same

Applications Claiming Priority (4)

Application Number Priority Date Filing Date Title
KR10-2007-0006057 2007-01-19
KR20070006057 2007-01-19
KR10-2007-0096872 2007-09-21
KR1020070096872A KR100897525B1 (en) 2007-01-19 2007-09-21 Time-stamping apparatus and method for RTP Packetization of SVC coded video, RTP packetization system using that

Publications (1)

Publication Number Publication Date
WO2008088132A1 true WO2008088132A1 (en) 2008-07-24

Family

ID=39636105

Family Applications (1)

Application Number Title Priority Date Filing Date
PCT/KR2007/006636 WO2008088132A1 (en) 2007-01-19 2007-12-18 Time-stamping apparatus and method for rtp packetization of svc coded video, and rtp packetization system using the same

Country Status (1)

Country Link
WO (1) WO2008088132A1 (en)

Cited By (2)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
WO2010108416A1 (en) * 2009-03-24 2010-09-30 华为技术有限公司 Method, device and communication system for forwarding scalable video coding data messages
US9918112B2 (en) 2011-12-29 2018-03-13 Thomson Licensing System and method for multiplexed streaming of multimedia content

Citations (3)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
US20040098748A1 (en) * 2002-11-20 2004-05-20 Lan Bo MPEG-4 live unicast video streaming system in wireless network with end-to-end bitrate-based congestion control
US6965646B1 (en) * 2000-06-28 2005-11-15 Cisco Technology, Inc. MPEG file format optimization for streaming
EP1742476A1 (en) * 2005-07-06 2007-01-10 Thomson Licensing Scalable video coding streaming system and transmission mechanism of the same system

Patent Citations (3)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
US6965646B1 (en) * 2000-06-28 2005-11-15 Cisco Technology, Inc. MPEG file format optimization for streaming
US20040098748A1 (en) * 2002-11-20 2004-05-20 Lan Bo MPEG-4 live unicast video streaming system in wireless network with end-to-end bitrate-based congestion control
EP1742476A1 (en) * 2005-07-06 2007-01-10 Thomson Licensing Scalable video coding streaming system and transmission mechanism of the same system

Cited By (2)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
WO2010108416A1 (en) * 2009-03-24 2010-09-30 华为技术有限公司 Method, device and communication system for forwarding scalable video coding data messages
US9918112B2 (en) 2011-12-29 2018-03-13 Thomson Licensing System and method for multiplexed streaming of multimedia content

Similar Documents

Publication Publication Date Title
JP6342457B2 (en) Network streaming of encoded video data
EP3459247B1 (en) Most-interested region in an image
CN110431850B (en) Signaling important video information in network video streaming using MIME type parameters
KR101558116B1 (en) Switching between representations during network streaming of coded multimedia data
KR101645780B1 (en) Signaling attributes for network-streamed video data
KR101607225B1 (en) Network streaming of media data
US8503541B2 (en) Method and apparatus for determining timing information from a bit stream
EP3095247B1 (en) Robust live operation of dash
US20100046552A1 (en) Time-stamping apparatus and method for rtp packetization of svc coded video, and rtp packetization system using the same
EP2880836A1 (en) Replacing lost media data for network streaming
CN113287323A (en) Multi-decoder interface for streaming media data
KR20230030589A (en) Streaming of Media Data Containing an Addressable Resource Index Track with Switching Sets
WO2008088132A1 (en) Time-stamping apparatus and method for rtp packetization of svc coded video, and rtp packetization system using the same
TWI846795B (en) Multiple decoder interface for streamed media data
US11863767B2 (en) Transporting HEIF-formatted images over real-time transport protocol
US20240163461A1 (en) Transporting heif-formatted images over real-time transport protocol
CN117099375A (en) Transmitting HEIF formatted images via real-time transport protocol

Legal Events

Date Code Title Description
121 Ep: the epo has been informed by wipo that ep was designated in this application

Ref document number: 07851603

Country of ref document: EP

Kind code of ref document: A1

WWE Wipo information: entry into national phase

Ref document number: 12523375

Country of ref document: US

NENP Non-entry into the national phase

Ref country code: DE

122 Ep: pct application non-entry in european phase

Ref document number: 07851603

Country of ref document: EP

Kind code of ref document: A1