WO2003094532A1 - Flexible data partitioning and packetization for h.26l for improved packet loss resilience - Google Patents

Flexible data partitioning and packetization for H.26L for improved packet loss resilience

Info

Publication number
WO2003094532A1
Authority
WO
WIPO (PCT)
Prior art keywords
type
packet
coeff
data
partition mode
Prior art date
Application number
PCT/IB2003/001694
Other languages
French (fr)
Inventor
Yingwei Chen
Jong C. Ye
Original Assignee
Koninklijke Philips Electronics N.V.
Priority date (The priority date is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the date listed.)
Filing date
Publication date
Priority claimed from US10/253,234 (US20040057465A1)
Application filed by Koninklijke Philips Electronics N.V.
Priority to JP2004502637A (JP2005524355A)
Priority to AU2003225482A (AU2003225482A1)
Priority to KR10-2004-7017584A (KR20040106441A)
Priority to EP03747520A (EP1504611A1)
Publication of WO2003094532A1

Classifications

    • H ELECTRICITY
    • H04 ELECTRIC COMMUNICATION TECHNIQUE
    • H04N PICTORIAL COMMUNICATION, e.g. TELEVISION
    • H04N19/00 Methods or arrangements for coding, decoding, compressing or decompressing digital video signals
    • H04N19/65 Methods or arrangements for coding, decoding, compressing or decompressing digital video signals using error resilience
    • H04N19/66 Methods or arrangements for coding, decoding, compressing or decompressing digital video signals using error resilience involving data partitioning, i.e. separation of data into packets or partitions according to importance
    • H ELECTRICITY
    • H04 ELECTRIC COMMUNICATION TECHNIQUE
    • H04N PICTORIAL COMMUNICATION, e.g. TELEVISION
    • H04N7/00 Television systems
    • H04N7/24 Systems for the transmission of television signals using pulse code modulation
    • H ELECTRICITY
    • H04 ELECTRIC COMMUNICATION TECHNIQUE
    • H04N PICTORIAL COMMUNICATION, e.g. TELEVISION
    • H04N19/00 Methods or arrangements for coding, decoding, compressing or decompressing digital video signals
    • H04N19/30 Methods or arrangements for coding, decoding, compressing or decompressing digital video signals using hierarchical techniques, e.g. scalability
    • H04N19/37 Methods or arrangements for coding, decoding, compressing or decompressing digital video signals using hierarchical techniques, e.g. scalability with arrangements for assigning different transmission priorities to video input data or to video coded data
    • H ELECTRICITY
    • H04 ELECTRIC COMMUNICATION TECHNIQUE
    • H04N PICTORIAL COMMUNICATION, e.g. TELEVISION
    • H04N19/00 Methods or arrangements for coding, decoding, compressing or decompressing digital video signals
    • H04N19/85 Methods or arrangements for coding, decoding, compressing or decompressing digital video signals using pre-processing or post-processing specially adapted for video compression
    • H04N19/89 Methods or arrangements for coding, decoding, compressing or decompressing digital video signals using pre-processing or post-processing specially adapted for video compression involving methods or arrangements for detection of transmission errors at the decoder

Landscapes

  • Engineering & Computer Science (AREA)
  • Multimedia (AREA)
  • Signal Processing (AREA)
  • Compression Or Coding Systems Of Tv Signals (AREA)

Abstract

A method and system of partitioning and packetizing video data in an H.26L environment. An H.26L coding system 10 is disclosed, comprising: a video coding layer 12 (VCL) having a first partition mode and a second partition mode for partitioning video data, wherein the second partition mode separately partitions low and high frequency DCT coefficients; and a network adaptation layer 16 (NAL) for packetizing data into a first and second packet, wherein the first packet is configured to contain all low frequency DCT coefficients and the second packet is configured to contain all high frequency DCT coefficients when the second partition mode is implemented by the VCL.

Description

Flexible data partitioning and packetization for H.26L for improved packet loss resilience
The present invention relates generally to data packetization, and more specifically to a partitioning and packetization scheme for the transmission of coded video, improvements in the data partitioning syntax, and the corresponding network adaptation layer (NAL) packetization process to enable flexible data partitioning in an H.26L protocol environment.
The emergence of WLAN technologies with high bandwidth capability, from several megabits per second to tens of megabits per second, is enabling high quality video streaming over such networks. Recently, 802.11b became a popular standards-based wireless Ethernet networking technology for both business and home. With a realistic payload throughput of 6 Mbps, it is fast enough for most network applications, including coded video broadcasts.
However, many challenges exist for transmitting high quality wireless video signals, mainly due to limitations relating to bandwidth constraints and high error rates. Because wireless networks may be highly susceptible to interference from other devices operating in the same frequency band, packet errors or losses may often result. This is particularly the case in an 802.11b wireless LAN environment, which utilizes the 2.4 GHz ISM band that is shared by microwave ovens, cordless phones and/or other 802.11b networks. Another challenge for transmitting video over an 802.11b network is that the 802.11b media access (MAC) layer requires packets received with bit errors to be discarded, thereby limiting the possibility of error correction at the receiver.
However, the MAC layer and the application layer can provide unequal error protection (UEP) to certain packets to guarantee their on-time arrival. For example, at least 50% of all packets can be delivered virtually free of losses even under co-channel interference that degrades the channel throughput by 50%. The combination of scalable or layered coding and transmission with UEP can make sure that the essential parts of video get through even under channel disturbances while the non-essential parts get through only if the channel has enough throughput. For maximum benefits provided by data partitioning and UEP, a partition ratio of around 50% base vs. 50% enhancement may be desirable for optimal video quality if the enhancement layer packets are lost. An overly low partition ratio will result in under-utilization of the UEP capability of the underlying network.
Recently, the H.26L standard was introduced to achieve enhanced compression performance while providing a "network-friendly" video representation addressing "conversational" (video telephony) and "non-conversational" (storage, broadcast, or streaming) applications. The H.26L standard includes a Video Coding Layer (VCL), which provides the core high-compression representation of the video picture content, and a Network Adaptation Layer (NAL), which packages that representation for delivery over a particular type of network.
Unfortunately, the current data partitioning syntax in the H.26L video coding layer (VCL) provides little flexibility in selecting the partitioning ratio. Fixed partitioning does not fit well with the diverse unequal error protection capability provided by different networks, such as 802.11a and 802.11b. Fixed partitioning also disallows rate-distortion optimization of base layer video quality.
Current H.26L (or Joint Video Team, or JVT, or MPEG-4 Video Part 10) specifies a data partitioning syntax in the byte stream (video elementary stream) that allows three fixed partition types: Partition A, which contains header symbols of coded macroblocks; Partition B, which contains coded block patterns and DCT data for intra blocks; and Partition C, which contains coded block patterns and DCT data for inter blocks. H.26L also specifies a packetization process in the Network Adaptation Layer (NAL) that packetizes the three partitions into three packets. The three packets have different transport (such as RTP, or Real Time Transport Protocol) payload types, which signal to the application layer or the underlying network transport layer to provide differentiated service or unequal error protection. On the receiver side, the packets containing different partitions are depacketized and merged into a single bitstream (with multiple partitions) for decoding. While this system provides some level of resilience against packet losses, it has several drawbacks.
First, the fixed partitioning and NAL packetization process do not allow any rate-distortion optimization of the partitioning operation. This will result in lower video quality (when enhancement layer data are lost) compared with flexible data partitioning where the partitioning point and the corresponding base versus enhancement layer packet boundary can be changed adaptively based on picture statistics.
Second, the fixed partitioning and packetization limit the ratio of base and enhancement layers to a small range determined by the amount of header and motion vector information versus DCT data. Because the H.26L standard is designed for multiple applications, such as home cinema and video streaming, the application or the underlying network will have varying capabilities for unequal error protection. A fixed ratio for base and enhancement layer partitions will not allow optimization of the overall system performance where unequal error protection is available.
Accordingly, the need exists for improvements to provide more data partitioning flexibility within an H.26L environment, which is essential to supporting video communication applications over diverse packet-lossy networks.
The invention addresses the above-mentioned problems, as well as others, by providing modifications to the VCL data partitioning syntax and the corresponding NAL packetization process to enable flexible data partitioning.
In a first aspect, the invention provides an H.26L coding system, comprising: a video coding layer (VCL) having a first partition mode and a second partition mode for partitioning video data, wherein the second partition mode separately partitions low and high frequency DCT coefficients; and a network adaptation layer (NAL) for packetizing data into a first and second packet, wherein the first packet is configured to contain low frequency DCT coefficients and the second packet is configured to contain high frequency DCT coefficients when the second partition mode is implemented by the VCL.
In a second aspect, the invention provides a method of partitioning and packetizing video data in an H.26L environment, comprising: providing a video coding layer (VCL) having a first partition mode and a second partition mode for partitioning video data; partitioning video data into the three partitions (A, B and C) when the first partition mode is selected; and partitioning video data into a first partition (containing header information and lower frequency DCT data) and a second partition (higher frequency DCT data) when the second partition mode is selected.
In a third aspect, the invention provides a program product stored on a recordable medium for packetizing and partitioning video data in an H.26L environment, comprising: a video coding layer (VCL) having a first partition mode and a second partition mode for partitioning video data; means for partitioning video data into three partitions (A, B and C) when the first partition mode is selected; and means for partitioning video data into a first partition (containing header information and lower frequency DCT data) and a second partition (higher frequency DCT data) when the second partition mode is selected.
In a fourth aspect, the invention provides a decoding system for decoding video data in an H.26L environment, wherein the video data was packetized in one of two schemes, including: a first scheme, wherein header data is packetized into a first packet type, coded block pattern and DCT data for intra blocks are packetized into a second packet type, and coded block pattern and DCT data for inter blocks are packetized into a third packet type; a second scheme, wherein header data and low frequency DCT coefficients are packetized into the first packet type and high frequency DCT coefficients are packetized into the second packet type; and wherein the decoding system includes: a depacketizer system for determining which of the first and second scheme was used, and for depacketizing video data from the packets; and a decoder for decoding the video data.
These and other features of this invention will be more readily understood from the following detailed description of the various aspects of the invention taken in conjunction with the accompanying drawings in which:
Fig. 1 depicts an H.26L video transmission system in accordance with an embodiment of the present invention.
Fig. 2 depicts a packetization boundary indication (PBI) field in accordance with an embodiment of the present invention.
Referring to Fig. 1, a video transmission network is shown for transmitting a byte stream between an H.26L encoding system 10 and an H.26L decoding system 18. H.26L encoding system 10 includes a Video Coding Layer (VCL) 12 and a Network Adaptation Layer (NAL) 16. As is known in the art, the VCL 12 includes a unique syntax to efficiently represent the content of the video data, and the NAL 16 is defined to format that data and provide header information in a manner appropriate for conveyance by the higher-level system. The data is organized into data packets, each of which contains an integer number of bytes. These data packets are then transmitted in a manner defined by the NAL 16. Data partitioning rearranges the symbols so that all symbols of one data type (e.g., DC coefficients, macroblock headers, motion vectors) that belong to a single slice are collected in one VLC-coded bitstream that starts byte aligned. Decoding system 18 can process such a partitioned data stream by fetching symbols from the correct partition.
In accordance with the present invention, VCL 12 includes a data partitioning system 14 that can partition based on one of two modes, i.e., Mode One (M1) and Mode Two (M2). Mode One, which comprises header vs. DCT data, is syntactically defined by the existing H.26L specification, where there are eight syntax element types in the VCL as follows:
0 - TYPE_HEADER
1 - TYPE_MBHEADER
2 - TYPE_MVD
3 - TYPE_CBP
4 - TYPE_2x2DC
5 - TYPE_COEFF_Y
6 - TYPE_COEFF_C
7 - TYPE_EOS
Because partition Mode One remains unchanged from the existing H.26L specification, backward compatibility is maintained. Mode Two, which provides DCT partitioning, is added to provide enhanced partitioning flexibility. As discussed below, the NAL packetization scheme varies according to the partition mode used in the byte stream. In partition Mode Two, i.e., DCT partitioning, TYPE_COEFF_Y and TYPE_COEFF_C are each further divided into two new partitions that represent high and low frequency DCT coefficients. Namely, TYPE_COEFF_Y has been broken into TYPE_COEFF_Y_L and TYPE_COEFF_Y_H, and TYPE_COEFF_C has been broken into TYPE_COEFF_C_L and TYPE_COEFF_C_H. It should be understood that the selection of a naming convention for these new types could change without departing from the scope of the invention. Accordingly, for Mode Two, the data partitioning system provides 10 syntax element types as follows:
0 - TYPE_HEADER
1 - TYPE_MBHEADER
2 - TYPE_MVD
3 - TYPE_CBP
4 - TYPE_2x2DC
5 - TYPE_COEFF_Y_L
6 - TYPE_COEFF_C_L
7 - TYPE_COEFF_Y_H
8 - TYPE_COEFF_C_H
9 - TYPE_EOS
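The two syntax element numberings listed above can be written down as enumerations. The following is a minimal sketch only: the class names `ModeOneType` and `ModeTwoType` and the helper `mode_two_equivalents` are hypothetical names introduced for illustration and do not appear in the H.26L specification or in this description.

```python
from enum import IntEnum
from typing import List


class ModeOneType(IntEnum):
    """Eight syntax element types of partition Mode One (numbering from the text)."""
    TYPE_HEADER = 0
    TYPE_MBHEADER = 1
    TYPE_MVD = 2
    TYPE_CBP = 3
    TYPE_2x2DC = 4
    TYPE_COEFF_Y = 5
    TYPE_COEFF_C = 6
    TYPE_EOS = 7


class ModeTwoType(IntEnum):
    """Ten syntax element types of partition Mode Two (numbering from the text)."""
    TYPE_HEADER = 0
    TYPE_MBHEADER = 1
    TYPE_MVD = 2
    TYPE_CBP = 3
    TYPE_2x2DC = 4
    TYPE_COEFF_Y_L = 5
    TYPE_COEFF_C_L = 6
    TYPE_COEFF_Y_H = 7
    TYPE_COEFF_C_H = 8
    TYPE_EOS = 9


def mode_two_equivalents(t: ModeOneType) -> List[ModeTwoType]:
    """Hypothetical helper: map a Mode One type to its Mode Two type(s).

    Only the two coefficient types are split into low/high frequency
    partitions; every other type keeps its name.
    """
    if t is ModeOneType.TYPE_COEFF_Y:
        return [ModeTwoType.TYPE_COEFF_Y_L, ModeTwoType.TYPE_COEFF_Y_H]
    if t is ModeOneType.TYPE_COEFF_C:
        return [ModeTwoType.TYPE_COEFF_C_L, ModeTwoType.TYPE_COEFF_C_H]
    return [ModeTwoType[t.name]]
```

Note that TYPE_EOS moves from value 7 in Mode One to value 9 in Mode Two, so a mapping by name rather than by number is used in the helper.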
In addition, a new field, the packetization boundary indication (PBI), has been added to the end of TYPE_HEADER. An exemplary field structure for the PBI is shown in Fig. 2. As shown, the PBI field is divided into three sub-fields. The first sub-field is the packetization break point (PBP) (2 bits), which indicates at which partition the packetization should break. Namely, by changing the PBP value in the PBI field, the user can select which packet should include TYPE_CBP and TYPE_2x2DC. The second sub-field is the partition type (PT) (1 bit), which is set to 0, with 1 being reserved. The third sub-field is the DCT break point (DBP) (5 bits), which indicates the index of the first DCT run-length VLC pair in TYPE_COEFF_X_H (where X is either Y or C). Thus, the PBI, which is pre-selected, identifies the boundary between the low and high frequency DCT coefficients, TYPE_COEFF_X_L and TYPE_COEFF_X_H. Other PBI structures than that depicted in Fig. 2 could be implemented to achieve the same functionality, and such other structures fall within the scope of this invention.
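The three sub-fields total 2 + 1 + 5 = 8 bits, so the PBI fits in one byte. The sketch below packs and unpacks such a byte. The bit order (PBP in the two most significant bits, then PT, then DBP in the five least significant bits) is an assumption made for illustration, since Fig. 2 is not reproduced here; the names `PBI`, `pack_pbi` and `unpack_pbi` are likewise hypothetical.

```python
from typing import NamedTuple


class PBI(NamedTuple):
    pbp: int  # packetization break point, 2 bits
    pt: int   # partition type, 1 bit (0 in use, 1 reserved)
    dbp: int  # DCT break point, 5 bits


def pack_pbi(pbi: PBI) -> int:
    """Pack the sub-fields into one byte (assumed layout: PBP | PT | DBP)."""
    if not 0 <= pbi.pbp <= 0b11:
        raise ValueError("PBP must fit in 2 bits")
    if pbi.pt != 0:
        raise ValueError("PT value 1 is reserved; PT must be 0")
    if not 0 <= pbi.dbp <= 0b11111:
        raise ValueError("DBP must fit in 5 bits")
    return (pbi.pbp << 6) | (pbi.pt << 5) | pbi.dbp


def unpack_pbi(byte: int) -> PBI:
    """Inverse of pack_pbi under the same assumed bit layout."""
    if not 0 <= byte <= 0xFF:
        raise ValueError("PBI must be a single byte")
    return PBI(pbp=(byte >> 6) & 0b11, pt=(byte >> 5) & 0b1, dbp=byte & 0b11111)
```

For example, `pack_pbi(PBI(pbp=2, pt=0, dbp=3))` yields `0b10000011` under this layout, and `unpack_pbi` recovers the same three values.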
As noted above, the NAL packetization process creates two packets, wherein each packet includes several different types of partitioned data. In accordance with the present invention, a packetization scheme is selected based on which mode is implemented by the VCL 12. Thus, for example, partition Mode One results in packetization Scheme One being implemented, while partition Mode Two results in packetization Scheme Two being implemented. NAL 16 can determine which scheme to use based on whether the TYPE_HEADER partition ends with a PBI field or not.
If no PBI field is included, packetization Scheme One (P1) is utilized, which reflects the scheme utilized in the current H.26L specification. As expected, all header and motion vector information is contained in Packet One, all intra coded block pattern (CBP) and DCT data information is contained in Packet Two, and all inter coded block pattern and DCT information is contained in Packet Three.
If, however, the TYPE_HEADER partition ends with the PBI field, packetization operates under Scheme Two (P2). In this case, the contents of the resulting packets will depend on the values in the PBI field. The following is an example with a PBP value of 2 and a PT value of 0.
Packet One
TYPE_HEADER (with PBI field)
TYPE_MBHEADER
TYPE_MVD
TYPE_CBP
TYPE_2x2DC
TYPE_COEFF_Y_L
TYPE_COEFF_C_L
Packet Two
TYPE_COEFF_Y_H
TYPE_COEFF_C_H
TYPE_EOS
In this Scheme Two example, the low frequency DCT coefficients are packetized in Packet One, while the high frequency DCT coefficients are packetized in Packet Two. Accordingly, flexible packetization is achieved. As is evident, changing the PBP value in the PBI field will alter the contents of the packets under Scheme Two. Accordingly, it should be appreciated that different variations can be achieved under Scheme Two.
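The Scheme Two example above can be sketched as a function that groups the ten Mode Two partitions into two packets. This is an illustrative sketch only: the function name `packetize_scheme_two` and the representation of partitions as a dictionary of byte strings are assumptions, and only the PBP = 2, PT = 0 layout given in the text is implemented, because the packet contents for other PBP values are not spelled out here.

```python
from typing import Dict, List, Tuple

# Packet contents copied from the PBP = 2, PT = 0 example in the text.
PACKET_ONE_TYPES = [
    "TYPE_HEADER",      # ends with the PBI field under Scheme Two
    "TYPE_MBHEADER",
    "TYPE_MVD",
    "TYPE_CBP",
    "TYPE_2x2DC",
    "TYPE_COEFF_Y_L",   # low frequency luma coefficients
    "TYPE_COEFF_C_L",   # low frequency chroma coefficients
]
PACKET_TWO_TYPES = [
    "TYPE_COEFF_Y_H",   # high frequency luma coefficients
    "TYPE_COEFF_C_H",   # high frequency chroma coefficients
    "TYPE_EOS",
]

Packet = List[Tuple[str, bytes]]


def packetize_scheme_two(
    partitions: Dict[str, bytes], pbp: int = 2, pt: int = 0
) -> Tuple[Packet, Packet]:
    """Group Mode Two partitions into Packet One and Packet Two."""
    if (pbp, pt) != (2, 0):
        # Other break points would move partitions between the packets,
        # but their exact layouts are not given in the text.
        raise NotImplementedError("only the PBP=2, PT=0 example is sketched")
    missing = [n for n in PACKET_ONE_TYPES + PACKET_TWO_TYPES if n not in partitions]
    if missing:
        raise KeyError(f"missing partitions: {missing}")
    packet_one = [(name, partitions[name]) for name in PACKET_ONE_TYPES]
    packet_two = [(name, partitions[name]) for name in PACKET_TWO_TYPES]
    return packet_one, packet_two
```

Keeping the layout in two plain lists makes it easy to see that Packet One carries everything needed for a base-quality reconstruction, while Packet Two carries only the high frequency refinement and the end-of-stream marker.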
Also shown in Fig. 1 is H.26L decoding system 18, which includes a depacketizing system 20 and a decoder 22. Prior to decoding, the packets are depacketized into the 8 (Packetization Scheme One) or 10 (Packetization Scheme Two) partitions for decoding. The decoder 22 fetches data from the correct partition depending on which scheme, Scheme One (S1) or Scheme Two (S2), is implemented.
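To illustrate how a DCT break point could divide a block's coefficients and how a decoder could recombine them, the sketch below splits a list of run-length pairs at the DBP index and merges the two halves again. It rests on several assumptions not stated in the text: pairs are indexed from zero within a block, each pair is modelled as a `(run, level)` tuple, and a lost Packet Two is handled by decoding from the low frequency pairs alone. The function names are hypothetical.

```python
from typing import List, Optional, Sequence, Tuple

RunLevel = Tuple[int, int]  # (run of zeros, coefficient level), illustrative model


def split_at_dbp(
    pairs: Sequence[RunLevel], dbp: int
) -> Tuple[List[RunLevel], List[RunLevel]]:
    """Encoder side: pairs before index dbp go to TYPE_COEFF_X_L, the rest to _H."""
    if not 0 <= dbp <= 31:
        raise ValueError("DBP is a 5-bit value")
    return list(pairs[:dbp]), list(pairs[dbp:])


def merge_for_decoding(
    low: Sequence[RunLevel], high: Optional[Sequence[RunLevel]]
) -> List[RunLevel]:
    """Decoder side: recombine low and high frequency pairs.

    If the high frequency partition is unavailable (high is None), only the
    low frequency pairs are returned, giving a coarser reconstruction.
    """
    if high is None:
        return list(low)
    return list(low) + list(high)
```

With a break point of 2, for instance, the first two pairs of each block travel in the better protected packet, and the remaining pairs only refine the picture when the second packet arrives.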
It is understood that the systems, functions, mechanisms, methods, algorithms and modules described herein can be implemented in hardware, software, or a combination of hardware and software. They may be implemented by any type of computer system or other apparatus adapted for carrying out the methods described herein. A typical combination of hardware and software could be a general-purpose computer system with a computer program that, when loaded and executed, controls the computer system such that it carries out the methods described herein. Alternatively, a specific use computer, containing specialized hardware for carrying out one or more of the functional tasks of the invention, could be utilized.
The present invention can also be embedded in a computer program product, which comprises all the features enabling the implementation of the methods and functions described herein, and which, when loaded in a computer system, is able to carry out these methods and functions. Computer program, software program, program, program product, or software, in the present context mean any expression, in any language, code or notation, of a set of instructions intended to cause a system having an information processing capability to perform a particular function either directly or after either or both of the following: (a) conversion to another language, code or notation; and/or (b) reproduction in a different material form.
The foregoing description of the preferred embodiments of the invention has been presented for purposes of illustration and description. It is not intended to be exhaustive or to limit the invention to the precise form disclosed, and obviously many modifications and variations are possible in light of the above teachings. Such modifications and variations that are apparent to a person skilled in the art are intended to be included within the scope of this invention as defined by the accompanying claims.

Claims

1. An H.26L coding system [10], comprising: a video coding layer (VCL) [12] having a first partition mode and a second partition mode for partitioning video data, wherein the second partition mode separately partitions low and high frequency DCT coefficients; and a network adaptation layer (NAL) [16] for packetizing data into a first and second packet, wherein the first packet is configured to contain low frequency DCT coefficients and the second packet is configured to contain high frequency DCT coefficients when the second partition mode is implemented by the VCL [12].
2. The H.26L coding system [10] of claim 1, wherein the first partition mode comprises a first partition that contains header symbols of coded macroblocks; a second partition that contains coded block patterns and DCT data for intra blocks; and a third partition that contains coded block patterns and DCT data for inter blocks.
3. The H.26L coding system [10] of claim 2, wherein the second partition mode includes TYPE_HEADER, TYPE_MBHEADER, TYPE_MVD, TYPE_CBP, TYPE_2x2DC, TYPE_EOS and divides each of TYPE_COEFF_Y and TYPE_COEFF_C into a high frequency type and a low frequency type.
4. The H.26L coding system [10] of claim 2, wherein the second partition mode includes TYPE_HEADER, TYPE_MBHEADER, TYPE_MVD, TYPE_CBP, TYPE_2x2DC, TYPE_EOS, TYPE_COEFF_Y_L, TYPE_COEFF_C_L, TYPE_COEFF_Y_H, and TYPE_COEFF_C_H.
5. The H.26L coding system [10] of claim 4, wherein the NAL [16] packetizes TYPE_HEADER, TYPE_MBHEADER, and TYPE_MVD into the first packet and packetizes TYPE_CBP, TYPE_2x2DC, TYPE_COEFF_Y, TYPE_COEFF_C, and TYPE_EOS into the second packet when the first partition mode is used.
6. The H.26L coding system [10] of claim 4, wherein the NAL [16] packetizes TYPE_COEFF_Y_L and TYPE_COEFF_C_L into the first packet and TYPE_COEFF_Y_H, and TYPE_COEFF_C_H into the second packet when the second partition mode is used.
7. The H.26L coding system [10] of claim 4, wherein TYPE_HEADER includes a field having a packetization boundary indication that determines a break point between high and low frequency DCT coefficients and signals the NAL [16] to partition the high and low frequency DCT coefficients.
8. The H.26L coding system [10] of claim 7, wherein the packetization boundary indication further determines which packet should include TYPE_CBP and TYPE_2x2DC.
9. A method of partitioning and packetizing video data in an H.26L environment, comprising: packetizing header data into a first packet, coded block pattern and DCT data for intra blocks into a second packet, and coded block pattern and DCT data for inter blocks into a third packet when a first partition mode is selected; and packetizing header data and low frequency DCT coefficients into a first packet and high frequency DCT coefficients into a second packet when a second partition mode is selected.
10. The method of claim 9, further comprising: providing a video coding layer (VCL) [12] for partitioning video data; packetizing TYPE_HEADER, TYPE_MBHEADER, and TYPE_MVD into the first packet and packetizing TYPE_CBP, TYPE_2x2DC, TYPE_COEFF_Y, TYPE_COEFF_C, and TYPE_EOS into the second packet when the first mode is used; and packetizing TYPE_COEFF_Y_L and TYPE_COEFF_C_L into the first packet and TYPE_COEFF_Y_H, and TYPE_COEFF_C_H into the second packet when the second mode is used.
11. The method of claim 10, comprising the further steps of: setting a breakpoint between high and low frequency DCT coefficients; and storing the breakpoint in a boundary indication field in TYPE_HEADER.
12. The method of claim 11, wherein TYPE_COEFF_Y_L and TYPE_COEFF_C_L are packetized into the first packet and TYPE_COEFF_Y_H, and TYPE_COEFF_C_H are packetized into the second packet when the boundary indication field is included in TYPE_HEADER.
13. The method of claim 11, wherein the boundary indication field further determines which packet should include TYPE_CBP and TYPE_2x2DC.
14. A program product stored on a recordable medium for packetizing and partitioning video data in an H.26L environment, comprising: means for packetizing header data into a first packet, coded block pattern and DCT data for intra blocks into a second packet, and coded block pattern and DCT data for inter blocks into a third packet when a first partition mode is selected; and means for packetizing header data and low frequency DCT coefficients into a first packet and high frequency DCT coefficients into a second packet when a second partition mode is selected.
15. A decoding system [18] for decoding video data in an H.26L environment, wherein the video data was packetized in one of two schemes, including: a first scheme, wherein header data are packetized into a first packet type, coded block pattern and DCT data for intra blocks are packetized into a second packet type, and coded block pattern and DCT data for inter blocks are packetized into a third packet type; a second scheme, wherein header data and low frequency DCT coefficients are packetized into the first packet type and high frequency DCT coefficients are packetized into the second packet type; and wherein the decoding system includes: a depacketizer system [20] for determining which of the first and second scheme was used and for depacketizing video data from the packets; and a decoder [22] for decoding the video data.
PCT/IB2003/001694 2002-05-01 2003-04-23 Flexible data partitioning and packetization for h.26l for improved packet loss resilience WO2003094532A1 (en)

Priority Applications (4)

Application Number Priority Date Filing Date Title
JP2004502637A JP2005524355A (en) 2002-05-01 2003-04-23 H.264 for improved packet loss recovery. 26L flexible data partitioning and packetization
AU2003225482A AU2003225482A1 (en) 2002-05-01 2003-04-23 Flexible data partitioning and packetization for h.26l for improved packet loss resilience
KR10-2004-7017584A KR20040106441A (en) 2002-05-01 2003-04-23 Flexible data partitioning and packetization for H.26L for improved packet loss resilience
EP03747520A EP1504611A1 (en) 2002-05-01 2003-04-23 Flexible data partitioning and packetization for h.26l for improved packet loss resilience

Applications Claiming Priority (4)

Application Number Priority Date Filing Date Title
US37718602P 2002-05-01 2002-05-01
US60/377,186 2002-05-01
US10/253,234 2002-09-24
US10/253,234 US20040057465A1 (en) 2002-09-24 2002-09-24 Flexible data partitioning and packetization for H.26L for improved packet loss resilience

Publications (1)

Publication Number Publication Date
WO2003094532A1 (en) 2003-11-13

Family

ID=29406452

Family Applications (1)

Application Number Title Priority Date Filing Date
PCT/IB2003/001694 WO2003094532A1 (en) 2002-05-01 2003-04-23 Flexible data partitioning and packetization for h.26l for improved packet loss resilience

Country Status (6)

Country Link
EP (1) EP1504611A1 (en)
JP (1) JP2005524355A (en)
KR (1) KR20040106441A (en)
CN (1) CN1650637A (en)
AU (1) AU2003225482A1 (en)
WO (1) WO2003094532A1 (en)

Also Published As

Publication number Publication date
JP2005524355A (en) 2005-08-11
EP1504611A1 (en) 2005-02-09
CN1650637A (en) 2005-08-03
AU2003225482A1 (en) 2003-11-17
KR20040106441A (en) 2004-12-17

Legal Events

Date Code Title Description
AK Designated states

Kind code of ref document: A1

Designated state(s): AE AG AL AM AT AU AZ BA BB BG BR BY BZ CA CH CN CO CR CU CZ DE DK DM DZ EC EE ES FI GB GD GE GH GM HR HU ID IL IN IS JP KE KG KP KR KZ LC LK LR LS LT LU LV MA MD MG MK MN MW MX MZ NI NO NZ OM PH PL PT RO RU SC SD SE SG SK SL TJ TM TN TR TT TZ UA UG UZ VC VN YU ZA ZM ZW

AL Designated countries for regional patents

Kind code of ref document: A1

Designated state(s): GH GM KE LS MW MZ SD SL SZ TZ UG ZM ZW AM AZ BY KG KZ MD RU TJ TM AT BE BG CH CY CZ DE DK EE ES FI FR GB GR HU IE IT LU MC NL PT RO SE SI SK TR BF BJ CF CG CI CM GA GN GQ GW ML MR NE SN TD TG

121 Ep: the epo has been informed by wipo that ep was designated in this application
WWE Wipo information: entry into national phase

Ref document number: 2003747520

Country of ref document: EP

WWE Wipo information: entry into national phase

Ref document number: 2004502637

Country of ref document: JP

Ref document number: 20038097222

Country of ref document: CN

WWE Wipo information: entry into national phase

Ref document number: 1020047017584

Country of ref document: KR

WWP Wipo information: published in national office

Ref document number: 1020047017584

Country of ref document: KR

WWP Wipo information: published in national office

Ref document number: 2003747520

Country of ref document: EP

WWW Wipo information: withdrawn in national office

Ref document number: 2003747520

Country of ref document: EP