US20150189305A1 - Video decoder for tiles with absolute signaling - Google Patents

Video decoder for tiles with absolute signaling Download PDF

Info

Publication number
US20150189305A1
US20150189305A1 US14/656,161 US201514656161A US2015189305A1 US 20150189305 A1 US20150189305 A1 US 20150189305A1 US 201514656161 A US201514656161 A US 201514656161A US 2015189305 A1 US2015189305 A1 US 2015189305A1
Authority
US
United States
Prior art keywords
poc
lsb
picture
frame
slice
Prior art date
Legal status (The legal status is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the status listed.)
Abandoned
Application number
US14/656,161
Inventor
Sachin G. Deshpande
Christopher A. Segall
Current Assignee (The listed assignees may be inaccurate. Google has not performed a legal analysis and makes no representation or warranty as to the accuracy of the list.)
Velos Media LLC
Original Assignee
Sharp Corp
Priority date (The priority date is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the date listed.)
Filing date
Publication date
Application filed by Sharp Corp filed Critical Sharp Corp
Priority to US14/656,161 priority Critical patent/US20150189305A1/en
Publication of US20150189305A1 publication Critical patent/US20150189305A1/en
Priority to US15/388,798 priority patent/US9883181B2/en
Priority to US15/645,797 priority patent/US10250875B2/en
Assigned to VELOS MEDIA, LLC reassignment VELOS MEDIA, LLC ASSIGNMENT OF ASSIGNORS INTEREST (SEE DOCUMENT FOR DETAILS). Assignors: SHARP KABUSHIKI KAISHA
Priority to US16/274,175 priority patent/US10506227B2/en
Priority to US16/706,783 priority patent/US10911752B2/en
Priority to US17/164,757 priority patent/US11245893B2/en
Priority to US17/565,407 priority patent/US11582446B2/en
Priority to US18/109,253 priority patent/US20230199172A1/en
Abandoned legal-status Critical Current

Links

Images

Classifications

    • HELECTRICITY
    • H04ELECTRIC COMMUNICATION TECHNIQUE
    • H04NPICTORIAL COMMUNICATION, e.g. TELEVISION
    • H04N19/00Methods or arrangements for coding, decoding, compressing or decompressing digital video signals
    • H04N19/10Methods or arrangements for coding, decoding, compressing or decompressing digital video signals using adaptive coding
    • H04N19/102Methods or arrangements for coding, decoding, compressing or decompressing digital video signals using adaptive coding characterised by the element, parameter or selection affected or controlled by the adaptive coding
    • H04N19/103Selection of coding mode or of prediction mode
    • H04N19/105Selection of the reference unit for prediction within a chosen coding or prediction mode, e.g. adaptive choice of position and number of pixels used for prediction
    • HELECTRICITY
    • H04ELECTRIC COMMUNICATION TECHNIQUE
    • H04NPICTORIAL COMMUNICATION, e.g. TELEVISION
    • H04N19/00Methods or arrangements for coding, decoding, compressing or decompressing digital video signals
    • H04N19/70Methods or arrangements for coding, decoding, compressing or decompressing digital video signals characterised by syntax aspects related to video coding, e.g. related to compression standards
    • HELECTRICITY
    • H04ELECTRIC COMMUNICATION TECHNIQUE
    • H04NPICTORIAL COMMUNICATION, e.g. TELEVISION
    • H04N19/00Methods or arrangements for coding, decoding, compressing or decompressing digital video signals
    • H04N19/10Methods or arrangements for coding, decoding, compressing or decompressing digital video signals using adaptive coding
    • H04N19/134Methods or arrangements for coding, decoding, compressing or decompressing digital video signals using adaptive coding characterised by the element, parameter or criterion affecting or controlling the adaptive coding
    • H04N19/157Assigned coding mode, i.e. the coding mode being predefined or preselected to be further used for selection of another element or parameter
    • H04N19/159Prediction type, e.g. intra-frame, inter-frame or bidirectional frame prediction
    • HELECTRICITY
    • H04ELECTRIC COMMUNICATION TECHNIQUE
    • H04NPICTORIAL COMMUNICATION, e.g. TELEVISION
    • H04N19/00Methods or arrangements for coding, decoding, compressing or decompressing digital video signals
    • H04N19/10Methods or arrangements for coding, decoding, compressing or decompressing digital video signals using adaptive coding
    • H04N19/169Methods or arrangements for coding, decoding, compressing or decompressing digital video signals using adaptive coding characterised by the coding unit, i.e. the structural portion or semantic portion of the video signal being the object or the subject of the adaptive coding
    • H04N19/17Methods or arrangements for coding, decoding, compressing or decompressing digital video signals using adaptive coding characterised by the coding unit, i.e. the structural portion or semantic portion of the video signal being the object or the subject of the adaptive coding the unit being an image region, e.g. an object
    • H04N19/172Methods or arrangements for coding, decoding, compressing or decompressing digital video signals using adaptive coding characterised by the coding unit, i.e. the structural portion or semantic portion of the video signal being the object or the subject of the adaptive coding the unit being an image region, e.g. an object the region being a picture, frame or field
    • HELECTRICITY
    • H04ELECTRIC COMMUNICATION TECHNIQUE
    • H04NPICTORIAL COMMUNICATION, e.g. TELEVISION
    • H04N19/00Methods or arrangements for coding, decoding, compressing or decompressing digital video signals
    • H04N19/44Decoders specially adapted therefor, e.g. video decoders which are asymmetric with respect to the encoder
    • HELECTRICITY
    • H04ELECTRIC COMMUNICATION TECHNIQUE
    • H04NPICTORIAL COMMUNICATION, e.g. TELEVISION
    • H04N19/00Methods or arrangements for coding, decoding, compressing or decompressing digital video signals
    • H04N19/46Embedding additional information in the video signal during the compression process
    • HELECTRICITY
    • H04ELECTRIC COMMUNICATION TECHNIQUE
    • H04NPICTORIAL COMMUNICATION, e.g. TELEVISION
    • H04N19/00Methods or arrangements for coding, decoding, compressing or decompressing digital video signals
    • H04N19/50Methods or arrangements for coding, decoding, compressing or decompressing digital video signals using predictive coding
    • H04N19/503Methods or arrangements for coding, decoding, compressing or decompressing digital video signals using predictive coding involving temporal prediction
    • HELECTRICITY
    • H04ELECTRIC COMMUNICATION TECHNIQUE
    • H04NPICTORIAL COMMUNICATION, e.g. TELEVISION
    • H04N19/00Methods or arrangements for coding, decoding, compressing or decompressing digital video signals
    • H04N19/50Methods or arrangements for coding, decoding, compressing or decompressing digital video signals using predictive coding
    • H04N19/503Methods or arrangements for coding, decoding, compressing or decompressing digital video signals using predictive coding involving temporal prediction
    • H04N19/51Motion estimation or motion compensation
    • H04N19/573Motion compensation with multiple frame prediction using two or more reference frames in a given prediction direction
    • HELECTRICITY
    • H04ELECTRIC COMMUNICATION TECHNIQUE
    • H04NPICTORIAL COMMUNICATION, e.g. TELEVISION
    • H04N19/00Methods or arrangements for coding, decoding, compressing or decompressing digital video signals
    • H04N19/50Methods or arrangements for coding, decoding, compressing or decompressing digital video signals using predictive coding
    • H04N19/503Methods or arrangements for coding, decoding, compressing or decompressing digital video signals using predictive coding involving temporal prediction
    • H04N19/51Motion estimation or motion compensation
    • H04N19/577Motion compensation with bidirectional frame interpolation, i.e. using B-pictures
    • HELECTRICITY
    • H04ELECTRIC COMMUNICATION TECHNIQUE
    • H04NPICTORIAL COMMUNICATION, e.g. TELEVISION
    • H04N19/00Methods or arrangements for coding, decoding, compressing or decompressing digital video signals
    • H04N19/50Methods or arrangements for coding, decoding, compressing or decompressing digital video signals using predictive coding
    • H04N19/503Methods or arrangements for coding, decoding, compressing or decompressing digital video signals using predictive coding involving temporal prediction
    • H04N19/51Motion estimation or motion compensation
    • H04N19/58Motion compensation with long-term prediction, i.e. the reference frame for a current frame not being the temporally closest one

Definitions

  • the present invention relates to video encoding and/or decoding.
  • Digital video is typically represented as a series of images or frames, each of which contains an array of pixels.
  • Each pixel includes information, such as intensity and/or color information.
  • each pixel is represented as a set of three colors, each of which may be defined by eight bit color values.
  • Video-coding techniques typically provide higher coding efficiency at the expense of increasing complexity.
  • Increasing image quality requirements and increasing image resolution requirements for video coding techniques also increase the coding complexity.
  • Video decoders that are suitable for parallel decoding may improve the speed of the decoding process and reduce memory requirements; video encoders that are suitable for parallel encoding may improve the speed of the encoding process and reduce memory requirements.
  • H.264/MPEG-4 AVC Joint Video Team of ITU-T VCEG and ISO/IEC MPEG, “H.264: Advanced video coding for generic audiovisual services,” ITU-T Rec. H.264 and ISO/IEC 14496-10 (MPEG4—Part 10), November 2007]
  • JCT-VC [“Draft Test Model Under Consideration”, JCTVC-A205, JCT-VC Meeting, Dresden, April 2010 (JCT-VC)]
  • JCT-VC video codec (encoder/decoder) specifications that decode pictures based upon reference pictures in a video sequence for compression efficiency.
  • FIG. 1 illustrates a H.264/AVC video encoder.
  • FIG. 2 illustrates a H.264/AVC video decoder
  • FIG. 3 illustrates an exemplary slice structure
  • FIG. 4 illustrates another exemplary slice structure.
  • FIG. 5 illustrates reconstruction of an entropy slice.
  • FIG. 6 illustrates reconstruction of an portion of the entropy slice of FIG. 5 .
  • FIG. 7 illustrates reconstruction of an entropy slice with an omitted LSB count value.
  • FIG. 8 illustrates reconstruction of an entropy slice with a long term picture value.
  • FIG. 9 illustrates reconstruction of an entropy slice by selecting a first preceding frame with a long term picture value.
  • FIG. 10 illustrates reconstruction of an entropy slice by using duplicate long term picture frame having the same least significant bit count value.
  • FIGS. 11A-11B illustrates a technique for selecting a reference frame.
  • FIG. 12 illustrates another technique for selecting a reference frame.
  • FIGS. 13A-13B illustrates another technique for selecting a reference frame.
  • FIG. 14 illustrates another technique for selecting a reference frame.
  • video coder/decoder that uses encoding/decoding
  • exemplary embodiments are described in relation to an H.264/AVC encoder and an H.264/AVC decoder merely for purposes of illustration.
  • Many video coding techniques are based on a block-based hybrid video-coding approach, wherein the source-coding technique is a hybrid of inter-picture, also considered inter-frame, prediction, intra-picture, also considered intra-frame, prediction and transform coding of a prediction residual.
  • Inter-frame prediction may exploit temporal redundancies
  • intra-frame and transform coding of the prediction residual may exploit spatial redundancies.
  • FIG. 1 is a block diagram illustrating an exemplary encoder 104 for an electronic device 102 . It should be noted that one or more of the elements illustrated as included within the electronic device 102 may be implemented in hardware, and/or software. For example, the electronic device 102 includes an encoder 104 , which may be implemented in hardware and/or software.
  • the electronic device 102 may include a supplier 134 .
  • the supplier 134 may provide picture or image data (e.g., video) as a source 106 to the encoder 104 .
  • Non limiting examples of the supplier 134 include image sensors, memory, communication interfaces, network interfaces, wireless receivers, ports, video frame content, previously encoded video content, non-encoded video content, etc.
  • the source 106 may be provided to an intra-frame prediction module and reconstruction buffer 140 .
  • the source 106 may also be provided to a motion estimation and motion compensation module 166 and to a subtraction module 146 .
  • the intra-frame prediction module and reconstruction buffer 140 may generate intra mode information 148 and an intra signal 142 based on the source 106 and reconstructed data 180 .
  • the motion estimation and motion compensation module 166 may generate inter mode information 168 and an inter signal 144 based on the source 106 and a reference picture buffer 196 signal 198 .
  • the reference picture buffer 196 signal 198 may include data from one or more reference pictures stored in the reference picture buffer 196 .
  • the reference picture buffer 196 may also include an RPS index initializer module 108 .
  • the initializer module 108 may process reference pictures corresponding to the buffering and list construction of an RPS.
  • the encoder 104 may select between the intra signal 142 and the inter signal 144 in accordance with a mode.
  • the intra signal 142 may be used in order to exploit spatial characteristics within a picture in an intra coding mode.
  • the inter signal 144 may be used in order to exploit temporal characteristics between pictures in an inter coding mode. While in the intra coding mode, the intra signal 142 may be provided to the subtraction module 146 and the intra mode information 158 may be provided to an entropy coding module 160 . While in the inter coding mode, the inter signal 144 may be provided to the subtraction module 146 and the inter mode information 168 may be provided to the entropy coding module 160 .
  • Either the intra signal 142 or the inter signal 144 (depending on the mode) is subtracted from the source 106 at the subtraction module 146 in order to produce a prediction residual 148 .
  • the prediction residual 148 is provided to a transformation module 150 .
  • the transformation module 150 may compress the prediction residual 148 to produce a transformed signal 152 that is provided to a quantization module 154 .
  • the quantization module 154 quantizes the transformed signal 152 to produce transformed and quantized coefficients (TQCs) 156 .
  • the TQCs 156 are provided to an entropy coding module 160 and an inverse quantization module 170 .
  • the inverse quantization module 170 performs inverse quantization on the TQCs 156 to produce an inverse quantized signal 172 that is provided to an inverse transformation module 174 .
  • the inverse transformation module 174 decompresses the inverse quantized signal 172 to produce a decompressed signal 176 that is provided to a reconstruction module 178 .
  • the reconstruction module 178 may produce reconstructed data 180 based on the decompressed signal 176 .
  • the reconstruction module 178 may reconstruct (modified) pictures.
  • the reconstructed data 180 may be provided to a deblocking filter 182 and to the intra prediction module and reconstruction buffer 140 .
  • the deblocking filter 182 may produce a filtered signal 184 based on the reconstructed data 180 .
  • the filtered signal 184 may be provided to a sample adaptive offset (SAO) module 186 .
  • the SAO module 186 may produce SAO information 188 that is provided to the entropy coding module 160 and an SAO signal 190 that is provided to an adaptive loop filter (ALF) 192 .
  • the ALF 192 produces an ALF signal 194 that is provided to the reference picture buffer 196 .
  • the ALF signal 194 may include data from one or more pictures that may be used as reference pictures.
  • the entropy coding module 160 may code the TQCs 156 to produce a bitstream 114 . Also, the entropy coding module 160 may code the TQCs 156 using Context-Adaptive Variable Length Coding (CAVLC) or Context-Adaptive Binary Arithmetic Coding (CABAC). In particular, the entropy coding module 160 may code the TQCs 156 based on one or more of intra mode information 158 , inter mode information 168 and SAO information 188 .
  • the bitstream 114 may include coded picture data. The encoder often encodes a frame as a sequence of blocks, generally referred to as macroblocks.
  • Quantization involved in video compression such as HEVC, is a lossy compression technique achieved by compressing a range of values to a single value.
  • the quantization parameter (QP) is a predefined scaling parameter used to perform the quantization based on both the quality of reconstructed video and compression ratio.
  • the block type is defined in HEVC to represent the characteristics of a given block based on the block size and its color information. QP, resolution information and block type may be determined before entropy coding.
  • the electronic device 102 e.g., the encoder 104
  • the entropy coding module 160 may determine the block size based on a block of TQCs 156 .
  • block size may be the number of TQCs 156 along one dimension of the block of TQCs.
  • the number of TQCs 156 in the block of TQCs may be equal to block size squared.
  • block size may be determined as the square root of the number of TQCs 156 in the block of TQCs.
  • Resolution may be defined as a pixel width by a pixel height. Resolution information may include a number of pixels for the width of a picture, for the height of a picture or both.
  • Block size may be defined as the number of TQCs 156 along one dimension of a 2D block of TQCs.
  • the bitstream 114 may be transmitted to another electronic device.
  • the bitstream 114 may be provided to a communication interface, network interface, wireless transmitter, port, etc.
  • the bitstream 114 may be transmitted to another electronic device via LAN, the Internet, a cellular phone base station, etc.
  • the bitstream 114 may additionally or alternatively be stored in memory on the electronic device 102 or other electronic device.
  • FIG. 2 is a block diagram illustrating an exemplary decoder 212 on an electronic device 202 .
  • the decoder 212 may be included for an electronic device 202 .
  • the decoder 212 may be a HEVC decoder.
  • the decoder 212 and/or one or more of the elements illustrated as included in the decoder 212 may be implemented in hardware and/or software.
  • the decoder 212 may receive a bitstream 214 (e.g., one or more encoded pictures included in the bitstream 214 ) for decoding.
  • the received bitstream 214 may include received overhead information, such as a received slice header, received PPS (or picture parameter set), received buffer description information, etc.
  • the encoded pictures included in the bitstream 214 may include one or more encoded reference pictures and/or one or more other encoded pictures.
  • Received symbols (in the one or more encoded pictures included in the bitstream 214 ) may be entropy decoded by an entropy decoding module 268 , thereby producing a motion information signal 270 and quantized, scaled and/or transformed coefficients 272 .
  • the motion information signal 270 may be combined with a portion of a reference frame signal 298 from a frame memory 278 at a motion compensation module 274 , which may produce an inter-frame prediction signal 282 .
  • the quantized, descaled and/or transformed coefficients 272 may be inverse quantized, scaled and inverse transformed by an inverse module 262 , thereby producing a decoded residual signal 284 .
  • the decoded residual signal 284 may be added to a prediction signal 292 to produce a combined signal 286 .
  • the prediction signal 292 may be a signal selected from either the inter-frame prediction signal 282 or an intra-frame prediction signal 290 produced by an intra-frame prediction module 288 . In some configurations, this signal selection may be based on (e.g., controlled by) the bitstream 214 .
  • the intra-frame prediction signal 290 may be predicted from previously decoded information from the combined signal 292 (in the current frame, for example).
  • the combined signal 292 may also be filtered by a de-blocking filter 294 .
  • the resulting filtered signal 296 may be written to frame memory 278 .
  • the resulting filtered signal 296 may include a decoded picture.
  • the frame memory 778 may include a DPB (or display picture buffer) as described herein.
  • the DPB may include one or more decoded pictures that may be maintained as short or long term reference frames.
  • the frame memory 278 may also include overhead information corresponding to the decoded pictures.
  • the frame memory 278 may include slice headers, PPS information, buffer description information, etc. One or more of these pieces of information may be signaled from an encoder (e.g., encoder 104 ).
  • the frame memory 278 may provide a decoded picture 718 .
  • An input picture comprising a plurality of macroblocks may be partitioned into one or several slices.
  • the values of the samples in the area of the picture that a slice represents may be properly decoded without the use of data from other slices provided that the reference pictures used at the encoder and the decoder are the same and that de-blocking filtering does not use information across slice boundaries. Therefore, entropy decoding and macroblock reconstruction for a slice does not depend on other slices.
  • the entropy coding state may be reset at the start of each slice.
  • the data in other slices may be marked as unavailable when defining neighborhood availability for both entropy decoding and reconstruction.
  • the slices may be entropy decoded and reconstructed in parallel. No intra prediction and motion-vector prediction is preferably allowed across the boundary of a slice. In contrast, de-blocking filtering may use information across slice boundaries.
  • FIG. 3 illustrates an exemplary video picture 90 comprising eleven macroblocks in the horizontal direction and nine macroblocks in the vertical direction (nine exemplary macroblocks labeled 91 - 99 ).
  • FIG. 3 illustrates three exemplary slices: a first slice denoted “SLICE # 0 ” 89 , a second slice denoted “SLICE # 1 ” 88 and a third slice denoted “SLICE # 2 ” 87 .
  • An H.264/AVC decoder may decode and reconstruct the three slices 87 , 88 , 89 in parallel. Each of the slices may be transmitted in scan line order in a sequential manner.
  • entropy decoding 268 is initialized or reset and macroblocks in other slices are marked as unavailable for both entropy decoding and macroblock reconstruction.
  • macroblocks for example, the macroblock labeled 93 , in “SLICE # 1 ,” macroblocks (for example, macroblocks labeled 91 and 92 ) in “SLICE # 0 ” may not be used for entropy decoding or reconstruction.
  • a macroblock for example, the macroblock labeled 95 , in “SLICE # 1 ,” other macroblocks (for example, macroblocks labeled 93 and 94 ) in “SLICE # 1 ” may be used for entropy decoding or reconstruction. Therefore, entropy decoding and macroblock reconstruction proceeds serially within a slice. Unless slices are defined using a flexible macroblock ordering (FMO), macroblocks within a slice are processed in the order of a raster scan.
  • FMO flexible macroblock ordering
  • Flexible macroblock ordering defines a slice group to modify how a picture is partitioned into slices.
  • the macroblocks in a slice group are defined by a macroblock-to-slice-group map, which is signaled by the content of the picture parameter set and additional information in the slice headers.
  • the macroblock-to-slice-group map consists of a slice-group identification number for each macroblock in the picture.
  • the slice-group identification number specifies to which slice group the associated macroblock belongs.
  • Each slice group may be partitioned into one or more slices, wherein a slice is a sequence of macroblocks within the same slice group that is processed in the order of a raster scan within the set of macroblocks of a particular slice group. Entropy decoding and macroblock reconstruction proceeds serially within a slice group.
  • FIG. 4 depicts an exemplary macroblock allocation into three slice groups: a first slice group denoted “SLICE GROUP # 0 ” 86 , a second slice group denoted “SLICE GROUP # 1 ” 85 and a third slice group denoted “SLICE GROUP # 2 ” 84 .
  • These slice groups 84 , 85 , 86 may be associated with two foreground regions and a background region, respectively, in the picture 90 .
  • a picture may be partitioned into one or more slices, wherein a slice may be self-contained in the respect that values of the samples in the area of the picture that the slice represents may be correctly reconstructed without use of data from other slices, provided that the references pictures used are identical at the encoder and the decoder. All reconstructed macroblocks within a slice may be available in the neighborhood definition for reconstruction.
  • a slice may be partitioned into more than one entropy slice, wherein an entropy slice may be self-contained in the respect that the area of the picture that the entropy slice represents may be correctly entropy decoded without the use of data from other entropy slices.
  • the entropy decoding 268 may be reset at the decoding start of each entropy slice.
  • the data in other entropy slices may be marked as unavailable when defining neighborhood availability for entropy decoding.
  • a device configured for decoding pictures obtains or otherwise receives a bitstream that includes a series of pictures, including a current picture.
  • the device further obtains a reference picture set (RPS) parameter that may be used for the identification of other frames that may be used for the decoding of the current picture or for the decoding of pictures subsequent to the current picture in the order that pictures are signaled in the bitstream.
  • RPS reference picture set
  • a RPS provides an identification of a set of reference pictures associated with the current frame.
  • a RPS may identify reference pictures that are prior to the current picture in display order that may be used for inter prediction of the current picture and/or identify reference pictures that are after the current picture in display order that may be used for inter prediction of the current picture. For example, if the system receives frame 1, 3, 5 and 5 uses 3 for reference, and, an encoder uses frame 1 for the prediction of frame 7. Then, the RPS for 5 may signal to keep both frame 3 and 1 in the frame memory 278 even though frame 1 is not used for reference of frame 5. In one embodiment, the RPS for 5 may be [ ⁇ 2 ⁇ 4]. Additionally, the frame memory 278 may be referred to the display picture buffer, or equivalently DPB. For this example, the frame number corresponds to the display order, or output order, of the frames.
  • a RPS describes one or more reference pictures that should be maintained, at least for a limited time duration, in the decoded picture buffer (DPB) for subsequent use.
  • This identification of the RPS may be included in the slice header of each picture, together with a picture, and/or together with a group of pictures.
  • a list of RPS may be sent in a picture parameter set (PPS).
  • the slice header may identify one of the RPS sent in the PPS to be used for the slice.
  • a RPS for a group of pictures may be signaled in a picture parameter set (PPS). Any pictures in the DPB that are not a part of the RPS for the current frame may be marked as “unused for reference.”
  • a DPB may be used to store reconstructed (e.g., decoded) pictures at the decoder. These stored pictures may then be used, for example, in an inter-prediction technique.
  • a picture in the DPB may be associated with a picture order count (POC).
  • the POC may be a variable that is associated with each encoded picture and that has a value that increases with increasing picture position in an output order. In other words, the POC may be used by the decoder to deliver the pictures in the correct order for display.
  • the POC may also be used for identification of reference pictures during construction of a reference picture list and identification of decoded reference pictures. Furthermore, the POC may be used for identification of pictures that are lost during transmission from an encoder to a decoder.
  • each of the frames may have an associated POC 310 .
  • the POC may increment from a minus number though a large positive number. In some embodiments, the POC may only increment from zero through a larger positive number.
  • the POC is typically incremented by one for each frame, but in some cases one or more POC are skipped or otherwise omitted.
  • the POC for a set of frames in the encoder may be, 0, 1, 2, 3, 4, 5, etc.
  • the POC for the same or another set of frames in the encoder may be, 0, 1, 2, 4, 5, etc., with POC 3 being skipped or otherwise omitted.
  • the encoder may reduce the number of bits used to identify a particular POC by using a selected number of least significant bits (LSB) of the POC to identify each frame, such as 4 bits. Since the reference frames used for decoding the current frame are often temporally located proximate to the current frame, this identification technique is suitable and results in a reduction in the computational complexity of the system and an overall reduction in the bit rate of the video.
  • the number of LSB to use to identify the pictures may be signaled in the bit stream to the decoder.
  • the LSB index repeats every 16 values (2 ⁇ 4) when the selected number of LSB of the POC is 4.
  • frame 0 has a LSB having a value of 0
  • frame 1 has a LSB having a value of 1, . . .
  • frame 14 has a LSB having a value of 14
  • frame 15 has a LSB having a value of 15.
  • frame 16 again has a LSB having a value of
  • frame 17 again has a LSB having a value of 1
  • frame 20 has a LSB having a value of 4.
  • the encoder Rather than including the POC the bitstream to identify frames, the encoder preferably provides the LSB index (generally also referred to as the LSB of the POC or, equivalently, POC LSB), in the bitstream to the decoder.
  • the reference frames used for inter prediction of a current frame, or frames subsequent to the current frame may be identified with an RPS using either relative (e.g., delta) referencing (using the difference between POC values, or alternatively a deltaPOC and a currentPOC, for example) or absolute referencing (using the POC, for example).
  • frames identified with relative referencing may be called a short term reference frame
  • frames identified with an absolute referencing may be called a long term reference frame.
  • the frame identified by POC 5 310 and signaled to the decoder as LSB 5 320 in the bitstream may have an associated RPS 330 of [ ⁇ 5, ⁇ 2, ⁇ 1]. The meaning of the RPS values is described later.
  • the RPS of [ ⁇ 5, ⁇ 2, ⁇ 1] refers to frames that include the fifth previous frame 320 , second previous frame 321 , and first previous frame 322 relative to the current frame. This in turn refers to the POC values of 0, 3, and 4, respectively as illustrated in FIG. 6 for the current frame with POC value of 5.
  • the RPS refers to the difference in between the POC value of the current frame and the POC value of the previous frame.
  • the RPS can also include frames in the future. These may be indicated with positive values in the RPS (positive deltaPOC values)
  • the difference between the POC value of the current frame and POC value of the previous frame may be different than the number of frames output between the previous frame and current frame such as illustrated in FIG. 7 .
  • the RPS of [ ⁇ 5, ⁇ 2, ⁇ 1] refers to frames that include the fifth previous frame 320 , second previous frame 321 , and first previous frame 322 relative to the POC of the frame identified with POC value equal to 5.
  • the RPS may be signaled in the bitstream in any suitable manner, such as provided together with the frame or provided together with a set of frames.
  • an absolute reference generally referred to as a long term picture, in the RPS associated with a frame.
  • the decoding process such as the motion vector prediction technique, may be different depending if the reference frame is signaled using an absolute reference or a relative reference.
  • a RPS of [LT3, ⁇ 5] would refer to a reference frame having POC LSB value of 3 and a reference frame with a POC equal to the POC of the current frame minus 5. In FIG. 8 , this corresponds to the reference frame with POC equal to 3 444 and the reference frame with POC equal to 0 320 .
  • the LT3 refers to the first previous frame relative to the current frame having a POC LSB value of 3.
  • LT3 refers to the first previous frame relative to the current frame in output order having a POC LSB value of 3.
  • LT3 refers to the first previous frame relative to the current frame in transmission order having a POC LSB value of 3. While such a system is suitable for many bit streams, it is not sufficiently robust to select a frame with a LSB count value of 3 that is different than the immediately previous frame having a LSB count value of 3.
  • the encoder may desire to signal the long term picture frame 0, which likewise has a POC LSB count value of 0, but this may not be accomplished with such a first previous referencing scheme.
  • one technique is to increase the number of least significant bits used to signal the long term frame POC LSB. While such an increase in the number of least significant bits is possible, it results in substantial additional bits being added to the bitstream.
  • a more preferred technique that results in fewer additional bits being added to the bitstream is to signal a different long term picture than the first immediately preceding frame with a corresponding POC LSB value.
  • the system could indicate the RPS of the current frame having an absolute reference as [LT0
  • 2] where the 0 refers to the POC LSB value and 2 refers to which of the previous frames with POC LSB value equal to 0 to usc, which in this case would be the second previous POC LSB value of 0 (e.g., frame 0 in FIG. 9 ). If no second reference is included then the system may default to the immediately preceding frame with a POC LSB 0 [LT0] (e.g., frame 16 in FIG. 9 ).
  • the system may use a duplication technique.
  • the RPS may be structured as follows, [LT0, LT013]. The duplication of the LT0 within the same RPS signals the decoder to use a different frame having a POC LSB value of 0, which in this case would be the third previous occurrence of the POC LSB value of 0.
  • a cycle of POC LSB values denotes a set of frames that when ordered in output order do not contain the same POC LSB value and are not separated in output order by frames not in the set.
  • the duplication technique may be indicated as follows.
  • the RPS includes a signal of a long term picture having a POC LSB value 400 (e.g., [LT3]).
  • the same RPS includes another signal of a long term picture having the same POC LSB value 410 (e.g., [LT3, LT3].
  • the same RPS includes another signal of the second long term picture having the same LSB count value 410 indicating the location of the desired frame 420 [LT3, LT3
  • the signaling of the location of the desired frame may be performed in any suitable manner.
  • the location may be one or more previous cycles of the POC LSB values for the desired frame relative to the current frame, such as the third previous cycle.
  • the location may be based upon an absolute number of frames offset from the current frame.
  • the location may be one or more previous cycles of the POC LSB values relative to the first immediately preceding frame with the desired POC LSB value.
  • the location may be based upon an absolute number of frames offset relative to the first immediately preceding frame with the desired POC LSB value.
  • One exemplary implementation of such a technique may use the following syntax.
  • slice_header( ) Descriptor lightweight_slice_flag u(1) if( !lightweight_slice_flag ) ⁇ slice_type ue(v) pic_parameter_set_id ue(v) if( IdrPicFlag ) ⁇ idr_pic_id ue(v) no_output_of_prior_pics_flag u(1) ⁇ else ⁇ pic_order_cnt_lsb u(v) short_term_ref_pic_set_pps_flag u(1) if( !short_term_ref_pic_set_pps_flag ) short_term_ref_pic_set( num_short_term_ref_pic_sets ) else short_term_ref_pic_set_idx u(v) if( long_term_ref_pics_present_flag ) ⁇ num_long_term_pics
  • a treeblock may be a macroblock and LCUAddress denotes the spatial location of the treeblock within a picture.
  • the slice_type specifies the coding type of the slice as follows:
  • slice_type When nal_unit_type is equal to 5 (IDR picture), slice_type shall be equal to 2. When max_num_ref_frames is equal to 0, slice_type shall be equal to 2.
  • pic_parameter_set_id specifies the picture parameter set in use.
  • the value of pic_parameter_set_id shall be in the range of 0 to 255, inclusive.
  • idr_pic_id identifies an IDR picture, which denotes a picture that does not use previously transmitted pictures for reference.
  • the values of idr_pic_id in all the slices of an IDR picture shall remain unchanged.
  • the value of idr_pic_id in the slices of the first such IDR access unit shall differ from the idr_pic_id in the second such IDR access unit.
  • the value of idr_pic_id shall be in the range of 0 to 65535, inclusive.
  • no_output_of_prior_pics_flag specifies how the previously-decoded pictures in the decoded picture buffer are treated after decoding of an IDR picture.
  • the value of no_output_of_prior_pics_flag has no effect on the decoding process.
  • no_output_of_prior_pics_flag 1 may (but should not) be inferred by the decoder, regardless of the actual value of no_output_of_prior_pics_flag.
  • pic_order_cnt_lsb specifies the picture order count modulo MaxPicOrderCntLsb for the current picture.
  • the length of the pic_order_cnt_lsb syntax element is log 2_max_pic_order_cnt_lsb_minus4+4 bits.
  • the value of the pic_order_cnt_lsb shall be in the range of 0 to MaxPicOrderCntLsb ⁇ 1, inclusive.
  • pic_order_cnt_lsb shall be inferred to be equal to 0.
  • pic_order_cnt_lsb indicates the number of LSBs in POC LSB.
  • short_term_ref_pic_set_pps_flag 1 specifies that the short-term reference picture set of the current picture shall be created using syntax elements in the active picture parameter set, which contains syntax elements that may be shared between multiple pictures.
  • short_term_ref_pic_set_pps_flag 0 specifies that the short-term reference picture set of the current picture shall be created using syntax elements in the short_term_ref_pic_set( )syntax structure in the slice header.
  • a short-term reference picture set denotes a pictures set that only uses delta referencing.
  • short_term_ref_pic_set_idx specifies the index to the list of the short-term reference picture sets specified in the active picture parameter set that shall be used for creation of the reference picture set of the current picture.
  • the syntax element short_term_ref_pic_set_idx shall be represented by ceil(log 2(num_short_term_ref_pic_sets)) bits.
  • the value of short_term_ref_pic_set_idx shall be in the range of 0 to num_short_term_ref_pic_sets ⁇ 1, inclusive, where num_short_term_ref_pic_sets is the syntax element from the active picture parameter set.
  • the variable StRpsIdx is derived as follows.
  • num_long_term_pics specifies the number of the long-term reference pictures that are to be included in the long-term reference picture set of the current picture.
  • the value of num_long_term_pics shall be in the range of 0 to max_num_ref_frames ⁇ NumNegativePics[StRpsIdx] ⁇ NumPositivePics[StRpsIdx], inclusive.
  • the value of num_long_term_pics shall be inferred to be equal to 0.
  • the long-term reference pictures denote reference pictures that are transmitted with absolute referencing.
  • delta_poc_lsb_lt_minus1[i] is used to determine the value of the least significant bits of the picture order count value of the i-th long-term reference picture that is included in the long-term reference picture set of the current picture.
  • delta_poc_lsb_lt_minus1[i] shall be in the range of 0 to MaxPicOrderCntLsb ⁇ 1, inclusive.
  • delta_poc_lsb_lt_minus1[i] denotes POC LSB of the i-th long-term reference picture.
  • variable DeltaPocLt[i] is derived as follows.
  • DeltaPocLt[i] shall be in the range of 0 to MaxPicOrderCntLsb, inclusive.
  • deltaPOCLSBCheck(i) is a function as follows:
  • delta_poc_msb_lt_minus1[i] is together with delta_poc_lsb_lt_minus1 [i] used to determine the value of picture order count of the i-th long term reference picture that is included in the long-term reference picture set of the current reference picture.
  • variable delta_poc_msb_lt_minus1[i] is derived as follows:
  • a poc_msb_lt_minus1 or poc_msb_lt element may be sent.
  • poc_msb_lt_minus1 indicates POC value of the reference picture ⁇ 1. This may be absolute POC value.
  • poc_msb_lt indicates POC value of reference picture. Again this may be absolute POC value.
  • used_by_curr_pic_lt_flag[i] 0 specifies that the i-th long-term reference picture included in the long-term reference picture set of the current picture is not used for reference, or inter-frame prediction, by the current picture.
  • num_ref idx_active_override_flag 1 specifies that the syntax element num_ref idx_l0_active_minus1 is present for P and B slices and that the syntax element num_ref_idx_l1_active_minus1 is present for B slices.
  • num_ref_idx_active_override_flag 0 specifies that the syntax elements num_ref_idx_l0_active_minus1 and num_ref_idx_l1_active_minus1 are not present.
  • num_ref idx_l0_active_minus1 specifies the maximum reference index for reference picture list 0 that shall be used to decode the slice.
  • num_ref_idx_l0_active_minus1 shall be inferred to be equal to num_ref_idx_l0_default_active_minus1.
  • num_ref_idx_l0_active_minus1 shall be in the range of 0 to 15, inclusive.
  • MbaffFrameFlag is equal to 1
  • num_ref idx_l0_active_minus1 is the maximum index value for the decoding of frame macroblocks
  • 2*num_ref_idx_l0_active_minus1+1 is the maximum index value for the decoding of field macroblocks.
  • num_ref_idx_l0_active_minus1 shall be in the range of 0 to 31, inclusive.
  • num_ref_idx_ ⁇ l_active_minus1 specifies the maximum reference index for reference picture list 1 that shall be used to decode the slice.
  • num_ref_idx_l1_active_minus1 shall be inferred to be equal to num_ref_idx_l1_default_active_minus1.
  • num_ref_idx_l1_active_minus1 The range of num_ref_idx_l1_active_minus1 is constrained as specified in the semantics for num_ref_idx_l0_active_minus1 with l0 and list 0 replaced by l1 and list 1, respectively.
  • deltaPOCLSBCheck(int i) determines the same POC LSB is transmitted from the encoder to the decoder using absolute referencing for the current frame.
  • determining if the same POC LSB is transmitted can be accomplished by checking if the value delta_poc_lsb_lt_minus1 is equal to a value known to both the encoder and decoder. For example, delta_poc_lsb_lt_minus1 equal to 0 could denote the POC LSB is the same as the previously transmitted POC LSB.
  • delta_poc_lsb_lt_minus1 equal to 2 ⁇ N ⁇ 1, where N denotes the number of bits used to transmit POC LSB and known to the both the encoder and decoder, 0 could denote the POC LSB is the same as the previously transmitted POC LSB.
  • the value delta_poc_lsb_lt_minus1 is replaced with the syntax element delta_poc_lsb_lt, which is generally equal to delta_poc_lsb_lt_minus1 plus 1.
  • the delta_poc_lsb_lt equal to a value known to both the encoder and decoder can indicate the picture transmitted using absolute referencing has the same POC LSB as the previous picture transmitted using absolute referencing in the same RPS.
  • delta_poc_lsb_lt 0 could denote the POC LSB is the same as the previously transmitted POC LSB.
  • delta_poc_lsb_lt 2 ⁇ N, where N denotes the number of bits used to transmit POC LSB and known to the both the encoder and decoder, 0 could denote the POC LSB is the same as the previously transmitted POC LSB.
  • the decoding process may be done as follows:

Landscapes

  • Engineering & Computer Science (AREA)
  • Multimedia (AREA)
  • Signal Processing (AREA)
  • Compression Or Coding Systems Of Tv Signals (AREA)

Abstract

A system for decoding a video bitstream includes receiving a reference picture set associated with a frame including a set of reference picture identifiers. The reference picture set identifies one or more reference pictures to be used for inter-prediction of the frame based upon its associated least significant bits of a picture order count based upon the reference picture identifiers. The one or more reference pictures is a second or greater previous frame to the frame having the matching reference picture identifier.

Description

    CROSS-REFERENCE TO RELATED APPLICATIONS
  • None.
  • BACKGROUND OF THE INVENTION
  • The present invention relates to video encoding and/or decoding.
  • Digital video is typically represented as a series of images or frames, each of which contains an array of pixels. Each pixel includes information, such as intensity and/or color information. In many cases, each pixel is represented as a set of three colors, each of which may be defined by eight bit color values.
  • Video-coding techniques, for example H.264/MPEG-4 AVC (H.264/AVC), typically provide higher coding efficiency at the expense of increasing complexity. Increasing image quality requirements and increasing image resolution requirements for video coding techniques also increase the coding complexity. Video decoders that are suitable for parallel decoding may improve the speed of the decoding process and reduce memory requirements; video encoders that are suitable for parallel encoding may improve the speed of the encoding process and reduce memory requirements.
  • H.264/MPEG-4 AVC [Joint Video Team of ITU-T VCEG and ISO/IEC MPEG, “H.264: Advanced video coding for generic audiovisual services,” ITU-T Rec. H.264 and ISO/IEC 14496-10 (MPEG4—Part 10), November 2007], and similarly the JCT-VC, [“Draft Test Model Under Consideration”, JCTVC-A205, JCT-VC Meeting, Dresden, April 2010 (JCT-VC)], both of which are incorporated by reference herein in their entirety, are video codec (encoder/decoder) specifications that decode pictures based upon reference pictures in a video sequence for compression efficiency.
  • The foregoing and other objectives, features, and advantages of the invention will be more readily understood upon consideration of the following detailed description of the invention, taken in conjunction with the accompanying drawings.
  • BRIEF DESCRIPTION OF THE SEVERAL VIEWS OF THE DRAWINGS
  • FIG. 1 illustrates a H.264/AVC video encoder.
  • FIG. 2 illustrates a H.264/AVC video decoder.
  • FIG. 3 illustrates an exemplary slice structure.
  • FIG. 4 illustrates another exemplary slice structure.
  • FIG. 5 illustrates reconstruction of an entropy slice.
  • FIG. 6 illustrates reconstruction of an portion of the entropy slice of FIG. 5.
  • FIG. 7 illustrates reconstruction of an entropy slice with an omitted LSB count value.
  • FIG. 8 illustrates reconstruction of an entropy slice with a long term picture value.
  • FIG. 9 illustrates reconstruction of an entropy slice by selecting a first preceding frame with a long term picture value.
  • FIG. 10 illustrates reconstruction of an entropy slice by using duplicate long term picture frame having the same least significant bit count value.
  • FIGS. 11A-11B illustrates a technique for selecting a reference frame.
  • FIG. 12 illustrates another technique for selecting a reference frame.
  • FIGS. 13A-13B illustrates another technique for selecting a reference frame.
  • FIG. 14 illustrates another technique for selecting a reference frame.
  • DETAILED DESCRIPTION OF PREFERRED EMBODIMENT
  • While any video coder/decoder (codec) that uses encoding/decoding may be accommodated by embodiments described herein, exemplary embodiments are described in relation to an H.264/AVC encoder and an H.264/AVC decoder merely for purposes of illustration. Many video coding techniques are based on a block-based hybrid video-coding approach, wherein the source-coding technique is a hybrid of inter-picture, also considered inter-frame, prediction, intra-picture, also considered intra-frame, prediction and transform coding of a prediction residual. Inter-frame prediction may exploit temporal redundancies, and intra-frame and transform coding of the prediction residual may exploit spatial redundancies.
  • FIG. 1 is a block diagram illustrating an exemplary encoder 104 for an electronic device 102. It should be noted that one or more of the elements illustrated as included within the electronic device 102 may be implemented in hardware, and/or software. For example, the electronic device 102 includes an encoder 104, which may be implemented in hardware and/or software.
  • The electronic device 102 may include a supplier 134. The supplier 134 may provide picture or image data (e.g., video) as a source 106 to the encoder 104. Non limiting examples of the supplier 134 include image sensors, memory, communication interfaces, network interfaces, wireless receivers, ports, video frame content, previously encoded video content, non-encoded video content, etc.
  • The source 106 may be provided to an intra-frame prediction module and reconstruction buffer 140. The source 106 may also be provided to a motion estimation and motion compensation module 166 and to a subtraction module 146.
  • The intra-frame prediction module and reconstruction buffer 140 may generate intra mode information 148 and an intra signal 142 based on the source 106 and reconstructed data 180. The motion estimation and motion compensation module 166 may generate inter mode information 168 and an inter signal 144 based on the source 106 and a reference picture buffer 196 signal 198.
  • The reference picture buffer 196 signal 198 may include data from one or more reference pictures stored in the reference picture buffer 196. The reference picture buffer 196 may also include an RPS index initializer module 108. The initializer module 108 may process reference pictures corresponding to the buffering and list construction of an RPS.
  • The encoder 104 may select between the intra signal 142 and the inter signal 144 in accordance with a mode. The intra signal 142 may be used in order to exploit spatial characteristics within a picture in an intra coding mode. The inter signal 144 may be used in order to exploit temporal characteristics between pictures in an inter coding mode. While in the intra coding mode, the intra signal 142 may be provided to the subtraction module 146 and the intra mode information 158 may be provided to an entropy coding module 160. While in the inter coding mode, the inter signal 144 may be provided to the subtraction module 146 and the inter mode information 168 may be provided to the entropy coding module 160.
  • Either the intra signal 142 or the inter signal 144 (depending on the mode) is subtracted from the source 106 at the subtraction module 146 in order to produce a prediction residual 148. The prediction residual 148 is provided to a transformation module 150. The transformation module 150 may compress the prediction residual 148 to produce a transformed signal 152 that is provided to a quantization module 154. The quantization module 154 quantizes the transformed signal 152 to produce transformed and quantized coefficients (TQCs) 156.
  • The TQCs 156 are provided to an entropy coding module 160 and an inverse quantization module 170. The inverse quantization module 170 performs inverse quantization on the TQCs 156 to produce an inverse quantized signal 172 that is provided to an inverse transformation module 174. The inverse transformation module 174 decompresses the inverse quantized signal 172 to produce a decompressed signal 176 that is provided to a reconstruction module 178.
  • The reconstruction module 178 may produce reconstructed data 180 based on the decompressed signal 176. For example, the reconstruction module 178 may reconstruct (modified) pictures. The reconstructed data 180 may be provided to a deblocking filter 182 and to the intra prediction module and reconstruction buffer 140. The deblocking filter 182 may produce a filtered signal 184 based on the reconstructed data 180.
  • The filtered signal 184 may be provided to a sample adaptive offset (SAO) module 186. The SAO module 186 may produce SAO information 188 that is provided to the entropy coding module 160 and an SAO signal 190 that is provided to an adaptive loop filter (ALF) 192. The ALF 192 produces an ALF signal 194 that is provided to the reference picture buffer 196. The ALF signal 194 may include data from one or more pictures that may be used as reference pictures.
  • The entropy coding module 160 may code the TQCs 156 to produce a bitstream 114. Also, the entropy coding module 160 may code the TQCs 156 using Context-Adaptive Variable Length Coding (CAVLC) or Context-Adaptive Binary Arithmetic Coding (CABAC). In particular, the entropy coding module 160 may code the TQCs 156 based on one or more of intra mode information 158, inter mode information 168 and SAO information 188. The bitstream 114 may include coded picture data. The encoder often encodes a frame as a sequence of blocks, generally referred to as macroblocks.
  • Quantization, involved in video compression such as HEVC, is a lossy compression technique achieved by compressing a range of values to a single value. The quantization parameter (QP) is a predefined scaling parameter used to perform the quantization based on both the quality of reconstructed video and compression ratio. The block type is defined in HEVC to represent the characteristics of a given block based on the block size and its color information. QP, resolution information and block type may be determined before entropy coding. For example, the electronic device 102 (e.g., the encoder 104) may determine the QP, resolution information and block type, which may be provided to the entropy coding module 160.
  • The entropy coding module 160 may determine the block size based on a block of TQCs 156. For example, block size may be the number of TQCs 156 along one dimension of the block of TQCs. In other words, the number of TQCs 156 in the block of TQCs may be equal to block size squared. For instance, block size may be determined as the square root of the number of TQCs 156 in the block of TQCs. Resolution may be defined as a pixel width by a pixel height. Resolution information may include a number of pixels for the width of a picture, for the height of a picture or both. Block size may be defined as the number of TQCs 156 along one dimension of a 2D block of TQCs.
  • In some configurations, the bitstream 114 may be transmitted to another electronic device. For example, the bitstream 114 may be provided to a communication interface, network interface, wireless transmitter, port, etc. For instance, the bitstream 114 may be transmitted to another electronic device via LAN, the Internet, a cellular phone base station, etc. The bitstream 114 may additionally or alternatively be stored in memory on the electronic device 102 or other electronic device.
  • FIG. 2 is a block diagram illustrating an exemplary decoder 212 on an electronic device 202. The decoder 212 may be included for an electronic device 202. For example, the decoder 212 may be a HEVC decoder. The decoder 212 and/or one or more of the elements illustrated as included in the decoder 212 may be implemented in hardware and/or software. The decoder 212 may receive a bitstream 214 (e.g., one or more encoded pictures included in the bitstream 214) for decoding. In some configurations, the received bitstream 214 may include received overhead information, such as a received slice header, received PPS (or picture parameter set), received buffer description information, etc. The encoded pictures included in the bitstream 214 may include one or more encoded reference pictures and/or one or more other encoded pictures.
  • Received symbols (in the one or more encoded pictures included in the bitstream 214) may be entropy decoded by an entropy decoding module 268, thereby producing a motion information signal 270 and quantized, scaled and/or transformed coefficients 272.
  • The motion information signal 270 may be combined with a portion of a reference frame signal 298 from a frame memory 278 at a motion compensation module 274, which may produce an inter-frame prediction signal 282. The quantized, descaled and/or transformed coefficients 272 may be inverse quantized, scaled and inverse transformed by an inverse module 262, thereby producing a decoded residual signal 284. The decoded residual signal 284 may be added to a prediction signal 292 to produce a combined signal 286. The prediction signal 292 may be a signal selected from either the inter-frame prediction signal 282 or an intra-frame prediction signal 290 produced by an intra-frame prediction module 288. In some configurations, this signal selection may be based on (e.g., controlled by) the bitstream 214.
  • The intra-frame prediction signal 290 may be predicted from previously decoded information from the combined signal 292 (in the current frame, for example). The combined signal 292 may also be filtered by a de-blocking filter 294. The resulting filtered signal 296 may be written to frame memory 278. The resulting filtered signal 296 may include a decoded picture.
  • The frame memory 778 may include a DPB (or display picture buffer) as described herein. The DPB may include one or more decoded pictures that may be maintained as short or long term reference frames. The frame memory 278 may also include overhead information corresponding to the decoded pictures. For example, the frame memory 278 may include slice headers, PPS information, buffer description information, etc. One or more of these pieces of information may be signaled from an encoder (e.g., encoder 104). The frame memory 278 may provide a decoded picture 718.
  • An input picture comprising a plurality of macroblocks may be partitioned into one or several slices. The values of the samples in the area of the picture that a slice represents may be properly decoded without the use of data from other slices provided that the reference pictures used at the encoder and the decoder are the same and that de-blocking filtering does not use information across slice boundaries. Therefore, entropy decoding and macroblock reconstruction for a slice does not depend on other slices. In particular, the entropy coding state may be reset at the start of each slice. The data in other slices may be marked as unavailable when defining neighborhood availability for both entropy decoding and reconstruction. The slices may be entropy decoded and reconstructed in parallel. No intra prediction and motion-vector prediction is preferably allowed across the boundary of a slice. In contrast, de-blocking filtering may use information across slice boundaries.
  • FIG. 3 illustrates an exemplary video picture 90 comprising eleven macroblocks in the horizontal direction and nine macroblocks in the vertical direction (nine exemplary macroblocks labeled 91-99). FIG. 3 illustrates three exemplary slices: a first slice denoted “SLICE # 089, a second slice denoted “SLICE # 188 and a third slice denoted “SLICE # 287. An H.264/AVC decoder may decode and reconstruct the three slices 87, 88, 89 in parallel. Each of the slices may be transmitted in scan line order in a sequential manner. At the beginning of the decoding/reconstruction process for each slice, entropy decoding 268 is initialized or reset and macroblocks in other slices are marked as unavailable for both entropy decoding and macroblock reconstruction. Thus, for a macroblock, for example, the macroblock labeled 93, in “SLICE # 1,” macroblocks (for example, macroblocks labeled 91 and 92) in “SLICE # 0” may not be used for entropy decoding or reconstruction. Whereas, for a macroblock, for example, the macroblock labeled 95, in “SLICE # 1,” other macroblocks (for example, macroblocks labeled 93 and 94) in “SLICE # 1” may be used for entropy decoding or reconstruction. Therefore, entropy decoding and macroblock reconstruction proceeds serially within a slice. Unless slices are defined using a flexible macroblock ordering (FMO), macroblocks within a slice are processed in the order of a raster scan.
  • Flexible macroblock ordering defines a slice group to modify how a picture is partitioned into slices. The macroblocks in a slice group are defined by a macroblock-to-slice-group map, which is signaled by the content of the picture parameter set and additional information in the slice headers. The macroblock-to-slice-group map consists of a slice-group identification number for each macroblock in the picture. The slice-group identification number specifies to which slice group the associated macroblock belongs. Each slice group may be partitioned into one or more slices, wherein a slice is a sequence of macroblocks within the same slice group that is processed in the order of a raster scan within the set of macroblocks of a particular slice group. Entropy decoding and macroblock reconstruction proceeds serially within a slice group.
  • FIG. 4 depicts an exemplary macroblock allocation into three slice groups: a first slice group denoted “SLICE GROUP # 086, a second slice group denoted “SLICE GROUP # 185 and a third slice group denoted “SLICE GROUP # 284. These slice groups 84, 85, 86 may be associated with two foreground regions and a background region, respectively, in the picture 90.
  • A picture may be partitioned into one or more slices, wherein a slice may be self-contained in the respect that values of the samples in the area of the picture that the slice represents may be correctly reconstructed without use of data from other slices, provided that the references pictures used are identical at the encoder and the decoder. All reconstructed macroblocks within a slice may be available in the neighborhood definition for reconstruction.
  • A slice may be partitioned into more than one entropy slice, wherein an entropy slice may be self-contained in the respect that the area of the picture that the entropy slice represents may be correctly entropy decoded without the use of data from other entropy slices. The entropy decoding 268 may be reset at the decoding start of each entropy slice. The data in other entropy slices may be marked as unavailable when defining neighborhood availability for entropy decoding.
  • A device configured for decoding pictures obtains or otherwise receives a bitstream that includes a series of pictures, including a current picture. The device further obtains a reference picture set (RPS) parameter that may be used for the identification of other frames that may be used for the decoding of the current picture or for the decoding of pictures subsequent to the current picture in the order that pictures are signaled in the bitstream.
  • A RPS provides an identification of a set of reference pictures associated with the current frame. A RPS may identify reference pictures that are prior to the current picture in display order that may be used for inter prediction of the current picture and/or identify reference pictures that are after the current picture in display order that may be used for inter prediction of the current picture. For example, if the system receives frame 1, 3, 5 and 5 uses 3 for reference, and, an encoder uses frame 1 for the prediction of frame 7. Then, the RPS for 5 may signal to keep both frame 3 and 1 in the frame memory 278 even though frame 1 is not used for reference of frame 5. In one embodiment, the RPS for 5 may be [−2 −4]. Additionally, the frame memory 278 may be referred to the display picture buffer, or equivalently DPB. For this example, the frame number corresponds to the display order, or output order, of the frames.
  • A RPS describes one or more reference pictures that should be maintained, at least for a limited time duration, in the decoded picture buffer (DPB) for subsequent use. This identification of the RPS may be included in the slice header of each picture, together with a picture, and/or together with a group of pictures. In one embodiment, a list of RPS may be sent in a picture parameter set (PPS). Then, the slice header may identify one of the RPS sent in the PPS to be used for the slice. For example, a RPS for a group of pictures may be signaled in a picture parameter set (PPS). Any pictures in the DPB that are not a part of the RPS for the current frame may be marked as “unused for reference.”
  • A DPB may be used to store reconstructed (e.g., decoded) pictures at the decoder. These stored pictures may then be used, for example, in an inter-prediction technique. Also, a picture in the DPB may be associated with a picture order count (POC). The POC may be a variable that is associated with each encoded picture and that has a value that increases with increasing picture position in an output order. In other words, the POC may be used by the decoder to deliver the pictures in the correct order for display. The POC may also be used for identification of reference pictures during construction of a reference picture list and identification of decoded reference pictures. Furthermore, the POC may be used for identification of pictures that are lost during transmission from an encoder to a decoder.
  • Referring to FIG. 5, one example of a set of frames 300 provided from an encoder to a decoder is illustrated. Each of the frames may have an associated POC 310. As illustrated, the POC may increment from a minus number though a large positive number. In some embodiments, the POC may only increment from zero through a larger positive number. The POC is typically incremented by one for each frame, but in some cases one or more POC are skipped or otherwise omitted. For example, the POC for a set of frames in the encoder may be, 0, 1, 2, 3, 4, 5, etc. For example, the POC for the same or another set of frames in the encoder may be, 0, 1, 2, 4, 5, etc., with POC 3 being skipped or otherwise omitted.
  • As the POC becomes sufficiently large, a significant number of bits would be necessary to identify each frame using the POC. The encoder may reduce the number of bits used to identify a particular POC by using a selected number of least significant bits (LSB) of the POC to identify each frame, such as 4 bits. Since the reference frames used for decoding the current frame are often temporally located proximate to the current frame, this identification technique is suitable and results in a reduction in the computational complexity of the system and an overall reduction in the bit rate of the video. The number of LSB to use to identify the pictures may be signaled in the bit stream to the decoder.
  • As illustrated, with LSB being 4 bits, the LSB index repeats every 16 values (2̂4) when the selected number of LSB of the POC is 4. Thus, frame 0 has a LSB having a value of 0, frame 1 has a LSB having a value of 1, . . . , frame 14 has a LSB having a value of 14, frame 15 has a LSB having a value of 15. However, frame 16 again has a LSB having a value of 0, frame 17 again has a LSB having a value of 1, and frame 20 has a LSB having a value of 4. The LSB identifier (generally also referred to as the LSB of the POC or, equivalently, POC LSB) may have the characteristic of LSB=POC % 16, where % is the remainder after dividing by 16 (2̂ number of least significant bits which in this case is 4). Similarly, if the selected number of LSBs to identify a POC is N bits, the LSB identifier may have the characteristic of LSB=POC % (2̂N) where 2̂N denotes 2 raised to the power of N. Rather than including the POC the bitstream to identify frames, the encoder preferably provides the LSB index (generally also referred to as the LSB of the POC or, equivalently, POC LSB), in the bitstream to the decoder.
  • The reference frames used for inter prediction of a current frame, or frames subsequent to the current frame, may be identified with an RPS using either relative (e.g., delta) referencing (using the difference between POC values, or alternatively a deltaPOC and a currentPOC, for example) or absolute referencing (using the POC, for example). In some embodiments, frames identified with relative referencing may be called a short term reference frame, and frames identified with an absolute referencing may be called a long term reference frame. For example, the frame identified by POC 5 310 and signaled to the decoder as LSB 5 320 in the bitstream may have an associated RPS 330 of [−5, −2, −1]. The meaning of the RPS values is described later.
  • Referring to FIG. 6, illustrating a portion of FIG. 5, the RPS of [−5, −2, −1] refers to frames that include the fifth previous frame 320, second previous frame 321, and first previous frame 322 relative to the current frame. This in turn refers to the POC values of 0, 3, and 4, respectively as illustrated in FIG. 6 for the current frame with POC value of 5. Typically, the RPS refers to the difference in between the POC value of the current frame and the POC value of the previous frame. For example, the RPS of [−5, −2, −1] for a current frame having a POC value of 5, refers to frames having POC values of 5 minus 5=0; 5 minus 2=3; and 5 minus 1=4. The RPS can also include frames in the future. These may be indicated with positive values in the RPS (positive deltaPOC values)
  • In the case that the POC values are not sequential, such as in the case that one or more POC values are skipped or otherwise omitted in parts of the bitstream, the difference between the POC value of the current frame and POC value of the previous frame may be different than the number of frames output between the previous frame and current frame such as illustrated in FIG. 7. As shown in FIG. 7, the RPS of [−5, −2, −1] refers to frames that include the fifth previous frame 320, second previous frame 321, and first previous frame 322 relative to the POC of the frame identified with POC value equal to 5. The RPS may be signaled in the bitstream in any suitable manner, such as provided together with the frame or provided together with a set of frames.
  • Referring to FIG. 8, another technique for signaling the reference frames is to use an absolute reference, generally referred to as a long term picture, in the RPS associated with a frame. The decoding process, such as the motion vector prediction technique, may be different depending if the reference frame is signaled using an absolute reference or a relative reference. The absolute reference (referred to as LT for convenience) refers to a particular LSB count value associated with a reference frame, such as a previous or subsequent frame. For example, the absolute reference of LT=3 (LT3) would refer to a reference frame having a POC LSB value of 3. Accordingly, a RPS of [LT3, −5] would refer to a reference frame having POC LSB value of 3 and a reference frame with a POC equal to the POC of the current frame minus 5. In FIG. 8, this corresponds to the reference frame with POC equal to 3 444 and the reference frame with POC equal to 0 320. Typically, the LT3 refers to the first previous frame relative to the current frame having a POC LSB value of 3. In one embodiment, LT3 refers to the first previous frame relative to the current frame in output order having a POC LSB value of 3. In a second embodiment, LT3 refers to the first previous frame relative to the current frame in transmission order having a POC LSB value of 3. While such a system is suitable for many bit streams, it is not sufficiently robust to select a frame with a LSB count value of 3 that is different than the immediately previous frame having a LSB count value of 3.
  • Referring to FIG. 9, for example, if the encoder was encoding frame 31 (POC=31) and the system signals the use of the long term picture with POC LSB=0 (LT0), then this would refer A to frame 16 (POC=16) since it is the first previous frame with LSB=0. However, the encoder may desire to signal the long term picture frame 0, which likewise has a POC LSB count value of 0, but this may not be accomplished with such a first previous referencing scheme. To overcome this limitation, one technique is to increase the number of least significant bits used to signal the long term frame POC LSB. While such an increase in the number of least significant bits is possible, it results in substantial additional bits being added to the bitstream.
  • A more preferred technique that results in fewer additional bits being added to the bitstream is to signal a different long term picture than the first immediately preceding frame with a corresponding POC LSB value. For example, the system could indicate the RPS of the current frame having an absolute reference as [LT0|2] where the 0 refers to the POC LSB value and 2 refers to which of the previous frames with POC LSB value equal to 0 to usc, which in this case would be the second previous POC LSB value of 0 (e.g., frame 0 in FIG. 9). If no second reference is included then the system may default to the immediately preceding frame with a POC LSB=0 [LT0] (e.g., frame 16 in FIG. 9).
  • In many cases, the frequency of occurrence of the desire to signal a frame that is not the first immediately preceding frame with the corresponding POC LSB value using absolute referencing will be relatively infrequent. To further reduce the overall bit rate indicating which frame to use, while permitting the capability of signaling a different frame than the first immediately preceding frame with the corresponding POC LSB value using absolute referencing, the system may use a duplication technique. For example, the RPS may be structured as follows, [LT0, LT013]. The duplication of the LT0 within the same RPS signals the decoder to use a different frame having a POC LSB value of 0, which in this case would be the third previous occurrence of the POC LSB value of 0. In general, aside from the potential that a particular POC LSB value would not be included in a particular cycle of POC LSB values, the desired POC LSB value will correspond to a frame of the indicated previous occurrence. Here, a cycle of POC LSB values denotes a set of frames that when ordered in output order do not contain the same POC LSB value and are not separated in output order by frames not in the set.
  • Referring to FIG. 10, the duplication technique may be indicated as follows. The RPS includes a signal of a long term picture having a POC LSB value 400 (e.g., [LT3]). The same RPS includes another signal of a long term picture having the same POC LSB value 410 (e.g., [LT3, LT3]. The same RPS includes another signal of the second long term picture having the same LSB count value 410 indicating the location of the desired frame 420 [LT3, LT3|2].
  • The signaling of the location of the desired frame may be performed in any suitable manner. Referring to FIGS. 11A-11B for example, the location may be one or more previous cycles of the POC LSB values for the desired frame relative to the current frame, such as the third previous cycle. Referring to FIG. 12 for example, the location may be based upon an absolute number of frames offset from the current frame. Referring to FIGS. 13A-13B for example, the location may be one or more previous cycles of the POC LSB values relative to the first immediately preceding frame with the desired POC LSB value. Referring to FIG. 14 for example, the location may be based upon an absolute number of frames offset relative to the first immediately preceding frame with the desired POC LSB value.
  • One exemplary implementation of such a technique may use the following syntax.
  • slice_header( ) { Descriptor
     lightweight_slice_flag u(1)
     if( !lightweight_slice_flag ) {
      slice_type ue(v)
      pic_parameter_set_id ue(v)
      if( IdrPicFlag ) {
        idr_pic_id ue(v)
       no_output_of_prior_pics_flag u(1)
      }
      else {
        pic_order_cnt_lsb u(v)
       short_term_ref_pic_set_pps_flag u(1)
       if( !short_term_ref_pic_set_pps_flag )
        short_term_ref_pic_set( num_short_term_ref_pic_sets )
       else
        short_term_ref_pic_set_idx u(v)
       if( long_term_ref_pics_present_flag ) {
        num_long_term_pics ue(v)
        for( i = 0; i < num_long_term_pics; i++ ) {
         delta_poc_lsb_lt_minus1[ i ] ue(v)
         if(deltaPOCLSBCheck(i)==1) {
          delta_poc_msb_lt_minus1[ i ] ue(v)
          }
         used_by_curr_pic_lt_flag[ i ] u(1)
        }
       }
      }
      if( slice_type = = P || slice_type = = B ) {
        num_ref_idx_active_override_flag u(1)
        if( num_ref_idx_active_override_flag ) {
         num_ref_idx_l0_active_minus1 ue(v)
         if( slice_type = = B )
         num_ref_idx_l1_active_minus1 ue(v)
        }
       }
    ...
    }
  • When the lightweight_slice_flag is equal to 1 specifies that the value of slice header syntax elements not present shall be inferred to be equal to the value of slice header syntax elements in a proceeding slice, where a proceeding slice is defined as the slice containing treeblock with location (LCUAddress−1). The lightweight_slice_flag shall be equal to 0 when LCUAddress equal to 0. Here, a treeblock may be a macroblock and LCUAddress denotes the spatial location of the treeblock within a picture.
  • The slice_type specifies the coding type of the slice as follows:
  • slice_type Name of slice_type
    0 P (P slice)
    1 B (B slice)
    2 I (I slice)
  • When nal_unit_type is equal to 5 (IDR picture), slice_type shall be equal to 2. When max_num_ref_frames is equal to 0, slice_type shall be equal to 2.
  • pic_parameter_set_id specifies the picture parameter set in use. The value of pic_parameter_set_id shall be in the range of 0 to 255, inclusive.
  • idr_pic_id identifies an IDR picture, which denotes a picture that does not use previously transmitted pictures for reference. The values of idr_pic_id in all the slices of an IDR picture shall remain unchanged. When two consecutive access units in decoding order are both IDR access units, the value of idr_pic_id in the slices of the first such IDR access unit shall differ from the idr_pic_id in the second such IDR access unit. The value of idr_pic_id shall be in the range of 0 to 65535, inclusive.
  • no_output_of_prior_pics_flag specifies how the previously-decoded pictures in the decoded picture buffer are treated after decoding of an IDR picture. When the IDR picture is the first IDR picture in the bitstream, the value of no_output_of_prior_pics_flag has no effect on the decoding process. When the IDR picture is not the first IDR picture in the bitstream and the value of pic_width_in_luma_samples or pic_height_in_luma_samples, which denote the dimensions of the pictures, or max_dec_frame_buffering, which denotes the maximum amount of reordering required at a decoder to convert a sequence of frames in transmission order to a sequence of frames in display order, derived from the active sequence parameter set is different from the value of pic_width_in_luma_samples or pic_height_in_luma_samples or max_dec_frame_buffering derived from the sequence parameter set active for the preceding picture, no_output_of_prior_pics_flag equal to 1 may (but should not) be inferred by the decoder, regardless of the actual value of no_output_of_prior_pics_flag.
  • pic_order_cnt_lsb specifies the picture order count modulo MaxPicOrderCntLsb for the current picture. The length of the pic_order_cnt_lsb syntax element is log 2_max_pic_order_cnt_lsb_minus4+4 bits. The value of the pic_order_cnt_lsb shall be in the range of 0 to MaxPicOrderCntLsb−1, inclusive. When pic_order_cnt_lsb is not present, pic_order_cnt_lsb shall be inferred to be equal to 0. Here, pic_order_cnt_lsb indicates the number of LSBs in POC LSB.
  • short_term_ref_pic_set_pps_flag equal to 1 specifies that the short-term reference picture set of the current picture shall be created using syntax elements in the active picture parameter set, which contains syntax elements that may be shared between multiple pictures. short_term_ref_pic_set_pps_flag equal to 0 specifies that the short-term reference picture set of the current picture shall be created using syntax elements in the short_term_ref_pic_set( )syntax structure in the slice header. In some embodiments, a short-term reference picture set denotes a pictures set that only uses delta referencing.
  • short_term_ref_pic_set_idx specifies the index to the list of the short-term reference picture sets specified in the active picture parameter set that shall be used for creation of the reference picture set of the current picture. The syntax element short_term_ref_pic_set_idx shall be represented by ceil(log 2(num_short_term_ref_pic_sets)) bits. The value of short_term_ref_pic_set_idx shall be in the range of 0 to num_short_term_ref_pic_sets−1, inclusive, where num_short_term_ref_pic_sets is the syntax element from the active picture parameter set.
  • The variable StRpsIdx is derived as follows.
  • If( short_term_ref_pic_set_pps_flag )
    StRpsIdx = short_term_ref_pic_set_idx
    ELSE
    StRpsIdx = num_short_term_ref_pic_sets
  • num_long_term_pics specifies the number of the long-term reference pictures that are to be included in the long-term reference picture set of the current picture. The value of num_long_term_pics shall be in the range of 0 to max_num_ref_frames−NumNegativePics[StRpsIdx]−NumPositivePics[StRpsIdx], inclusive. When not present, the value of num_long_term_pics shall be inferred to be equal to 0. In some embodiments, the long-term reference pictures denote reference pictures that are transmitted with absolute referencing.
  • delta_poc_lsb_lt_minus1[i] is used to determine the value of the least significant bits of the picture order count value of the i-th long-term reference picture that is included in the long-term reference picture set of the current picture. delta_poc_lsb_lt_minus1[i] shall be in the range of 0 to MaxPicOrderCntLsb−1, inclusive. In some embodiments, delta_poc_lsb_lt_minus1[i] denotes POC LSB of the i-th long-term reference picture.
  • The variable DeltaPocLt[i] is derived as follows.
  • If (i= = 0)
    DeltaPocLt[ i ] = delta_poc_lsb_lt_minus1[ i ] + 1
    Else
    DeltaPocLt[ i ] = delta_poc_lsb_lt_minus1[ i ] + 1 +
    DeltaPocLt[ i − 1 ]
  • The value of DeltaPocLt[i] shall be in the range of 0 to MaxPicOrderCntLsb, inclusive.
  • deltaPOCLSBCheck(i) is a function as follows:
  • deltaPOCLSBCheck(int i)
    {
    for(m=0;m<i;m++)
    {
    if(delta_poc_lsb_lt_minus1[i]==delta_poc_lsb_lt_minus1[m])
    {
    return 1;
    }
    }
    return 0;
    }
  • delta_poc_msb_lt_minus1[i] is together with delta_poc_lsb_lt_minus1 [i] used to determine the value of picture order count of the i-th long term reference picture that is included in the long-term reference picture set of the current reference picture.
  • The variable delta_poc_msb_lt_minus1[i] is derived as follows:
  • for(n=0;n<i;n++)
    {
    deltaNumSameLSBs=0;
    if(delta_poc_lsb_lt_minus1[i]==delta_poc_lsb_lt_minus1[n])
    {
    if(deltaNumSameLSBs==0)
    {
    delta_poc_msb_lt_minus1[i]=PicOrderCntMsb[i]−1;
    deltaNumSameLSBs++;
    }
    else
    {
    delta_poc_msb_lt_minus1[i]=PicOrderCntMsb[i]−
    delta_poc_msb_lt_minus1 [n−1];
    }
    }
    }
  • In an alternative embodiment instead of sending element delta_poc_msb_lt_minus1 when the delta_poc_lsb_lt_minus1 values are same, a poc_msb_lt_minus1 or poc_msb_lt element may be sent. Here poc_msb_lt_minus1 indicates POC value of the reference picture−1. This may be absolute POC value. Similarly poc_msb_lt indicates POC value of reference picture. Again this may be absolute POC value.
  • used_by_curr_pic_lt_flag[i] equal to 0 specifies that the i-th long-term reference picture included in the long-term reference picture set of the current picture is not used for reference, or inter-frame prediction, by the current picture.
  • num_ref idx_active_override_flag equal to 1 specifies that the syntax element num_ref idx_l0_active_minus1 is present for P and B slices and that the syntax element num_ref_idx_l1_active_minus1 is present for B slices. num_ref_idx_active_override_flag equal to 0 specifies that the syntax elements num_ref_idx_l0_active_minus1 and num_ref_idx_l1_active_minus1 are not present.
  • When the current slice is a P or B slice and field_pic_flag is equal to 0 and the value of num_ref_idx_l0_default_active_minus1 in the picture parameter set exceeds 15, num_ref_idx_active_override_flag shall be equal to 1.
  • When the current slice is a B slice and field_pic_flag is equal to 0 and the value of num_ref_idx_l1_default_active_minus1 in the picture parameter set exceeds 15, num_ref_idx_active_override_flag shall be equal to 1.
  • num_ref idx_l0_active_minus1 specifies the maximum reference index for reference picture list 0 that shall be used to decode the slice.
  • When the current slice is a P or B slice and num_ref_idx_l0_active_minus1 is not present, num_ref_idx_l0_active_minus1 shall be inferred to be equal to num_ref_idx_l0_default_active_minus1.
  • The range of num_ref_idx_l0_active_minus1 is specified as follows:
  • If field_pic_flag is equal to 0, num_ref_idx_l0_active_minus1 shall be in the range of 0 to 15, inclusive. When MbaffFrameFlag is equal to 1, num_ref idx_l0_active_minus1 is the maximum index value for the decoding of frame macroblocks and 2*num_ref_idx_l0_active_minus1+1 is the maximum index value for the decoding of field macroblocks.
  • Otherwise (field_pic_flag is equal to 1), num_ref_idx_l0_active_minus1 shall be in the range of 0 to 31, inclusive.
  • num_ref_idx_μl_active_minus1 specifies the maximum reference index for reference picture list 1 that shall be used to decode the slice.
  • When the current slice is a B slice and num_ref_idx_l1_active_minus1 is not present, num_ref_idx_l1_active_minus1 shall be inferred to be equal to num_ref_idx_l1_default_active_minus1.
  • The range of num_ref_idx_l1_active_minus1 is constrained as specified in the semantics for num_ref_idx_l0_active_minus1 with l0 and list 0 replaced by l1 and list 1, respectively.
  • The operation deltaPOCLSBCheck(int i) determines the same POC LSB is transmitted from the encoder to the decoder using absolute referencing for the current frame. In an alternative embodiment, determining if the same POC LSB is transmitted can be accomplished by checking if the value delta_poc_lsb_lt_minus1 is equal to a value known to both the encoder and decoder. For example, delta_poc_lsb_lt_minus1 equal to 0 could denote the POC LSB is the same as the previously transmitted POC LSB. Alternatively, delta_poc_lsb_lt_minus1 equal to 2̂N−1, where N denotes the number of bits used to transmit POC LSB and known to the both the encoder and decoder, 0 could denote the POC LSB is the same as the previously transmitted POC LSB. In alternative embodiments, the value delta_poc_lsb_lt_minus1 is replaced with the syntax element delta_poc_lsb_lt, which is generally equal to delta_poc_lsb_lt_minus1 plus 1. In these embodiments, the delta_poc_lsb_lt equal to a value known to both the encoder and decoder can indicate the picture transmitted using absolute referencing has the same POC LSB as the previous picture transmitted using absolute referencing in the same RPS. For example, delta_poc_lsb_lt equal to 0 could denote the POC LSB is the same as the previously transmitted POC LSB. Alternatively, delta_poc_lsb_lt equal to 2̂N, where N denotes the number of bits used to transmit POC LSB and known to the both the encoder and decoder, 0 could denote the POC LSB is the same as the previously transmitted POC LSB.
  • For long term reference picture set the decoding process may be done as follows:
  • for( i = 0, j = 0, k = 0; i < num_long_term_pics; i++ ) {
    PocMSB=0;
    if(deltaPOCLSBCheck(i)==0)
    {
    for(n=0;n<i;n++)
    {
    PocMSB=0; deltaNumSameLSBs=0;
    if(delta_poc_lsb_lt_minus1[i]==delta_poc_lsb_lt_minus1[n])
    {
    if(deltaNumSameLSB==0)
    {
    PocMSB =delta_poc_msb_lt_minus1[i];
    deltaNumSameLSBs++;
    }
    else
    {
    PocMSB +=delta_poc_msb_lt_minus1 [n];
    }
    }
    }
    }
    if( used_by_curr_pic_lt_flag[ i ] )
    PocLtCurr[ j++ ] = PocMSB+(( PicOrderCntVal − DeltaPocLt[ i ]
    + MaxPicOrderCntLsb ) %
    MaxPicOrderCntLsb)
    else
    PocLtFoll[ k++ ] = PocMSB+(( PicOrderCntVal − DeltaPocLt[ i ]
    + MaxPicOrderCntLsb ) %
    MaxPicOrderCntLsb)
    }
  • The terms and expressions which have been employed in the foregoing specification are used therein as terms of description and not of limitation, and there is no intention, in the use of such terms and expressions, of excluding equivalents of the features shown and described or portions thereof, it being recognized that the scope of the invention is defined and limited only by the claims which follow.

Claims (7)

1-8. (canceled)
9. A method for decoding a video bitstream comprising:
decoding a current picture by using inter prediction based on a reference picture set; and
storing said decoded picture to be referred for future inter prediction, wherein said reference picture set is decoded by using at least:
(a) a selected number of least significant bits (LSB) of a picture order count (POC) of a reference picture; and
(b) a signal to specify whether or not subsequent data to determine a MSB of the POC for said reference picture exists.
10. The method of claim 9, wherein said subsequent data is based on a difference between the MSBs of two POCs.
11. The method of claim 9, wherein said subsequent data is a value of the LSB of the POC of an i-th reference picture that is included in said reference picture set of said current picture.
12. A method for encoding a video bitstream comprising:
encoding a current picture using inter prediction based on a reference picture set, wherein said reference picture set is encoded by using at least:
(a) one or more reference picture identifiers each of which being based on a selected number of least significant bits (LSB) of a picture order count (POC) for a reference picture; and
(b) a signal to specify whether or not subsequent data to determine a MSB of said POC exists.
13. The method of claim 12, wherein said subsequent data is based on the difference between the MSBs of two POCs.
14. The method of claim 12, wherein said subsequent data is a value of the LSB of the POC of an i-th reference picture that is included in said reference picture set of said current picture.
US14/656,161 2012-01-25 2015-03-12 Video decoder for tiles with absolute signaling Abandoned US20150189305A1 (en)

Priority Applications (8)

Application Number Priority Date Filing Date Title
US14/656,161 US20150189305A1 (en) 2012-01-25 2015-03-12 Video decoder for tiles with absolute signaling
US15/388,798 US9883181B2 (en) 2012-01-25 2016-12-22 Video decoding methods and video encoding methods
US15/645,797 US10250875B2 (en) 2012-01-25 2017-07-10 Device for decoding a video bitstream
US16/274,175 US10506227B2 (en) 2012-01-25 2019-02-12 Device for decoding a video bitstream
US16/706,783 US10911752B2 (en) 2012-01-25 2019-12-08 Device for decoding a video bitstream
US17/164,757 US11245893B2 (en) 2012-01-25 2021-02-01 Device for decoding a video bitstream
US17/565,407 US11582446B2 (en) 2012-01-25 2021-12-29 Device for decoding a video bitstream
US18/109,253 US20230199172A1 (en) 2012-01-25 2023-02-13 Device for Decoding a Video Bitstream

Applications Claiming Priority (2)

Application Number Priority Date Filing Date Title
US13/358,414 US20130188709A1 (en) 2012-01-25 2012-01-25 Video decoder for tiles with absolute signaling
US14/656,161 US20150189305A1 (en) 2012-01-25 2015-03-12 Video decoder for tiles with absolute signaling

Related Parent Applications (1)

Application Number Title Priority Date Filing Date
US13/358,414 Continuation US20130188709A1 (en) 2012-01-25 2012-01-25 Video decoder for tiles with absolute signaling

Related Child Applications (1)

Application Number Title Priority Date Filing Date
US15/388,798 Continuation US9883181B2 (en) 2012-01-25 2016-12-22 Video decoding methods and video encoding methods

Publications (1)

Publication Number Publication Date
US20150189305A1 true US20150189305A1 (en) 2015-07-02

Family

ID=48797186

Family Applications (9)

Application Number Title Priority Date Filing Date
US13/358,414 Abandoned US20130188709A1 (en) 2012-01-25 2012-01-25 Video decoder for tiles with absolute signaling
US14/656,161 Abandoned US20150189305A1 (en) 2012-01-25 2015-03-12 Video decoder for tiles with absolute signaling
US15/388,798 Active US9883181B2 (en) 2012-01-25 2016-12-22 Video decoding methods and video encoding methods
US15/645,797 Active US10250875B2 (en) 2012-01-25 2017-07-10 Device for decoding a video bitstream
US16/274,175 Active US10506227B2 (en) 2012-01-25 2019-02-12 Device for decoding a video bitstream
US16/706,783 Active US10911752B2 (en) 2012-01-25 2019-12-08 Device for decoding a video bitstream
US17/164,757 Active US11245893B2 (en) 2012-01-25 2021-02-01 Device for decoding a video bitstream
US17/565,407 Active US11582446B2 (en) 2012-01-25 2021-12-29 Device for decoding a video bitstream
US18/109,253 Pending US20230199172A1 (en) 2012-01-25 2023-02-13 Device for Decoding a Video Bitstream

Family Applications Before (1)

Application Number Title Priority Date Filing Date
US13/358,414 Abandoned US20130188709A1 (en) 2012-01-25 2012-01-25 Video decoder for tiles with absolute signaling

Family Applications After (7)

Application Number Title Priority Date Filing Date
US15/388,798 Active US9883181B2 (en) 2012-01-25 2016-12-22 Video decoding methods and video encoding methods
US15/645,797 Active US10250875B2 (en) 2012-01-25 2017-07-10 Device for decoding a video bitstream
US16/274,175 Active US10506227B2 (en) 2012-01-25 2019-02-12 Device for decoding a video bitstream
US16/706,783 Active US10911752B2 (en) 2012-01-25 2019-12-08 Device for decoding a video bitstream
US17/164,757 Active US11245893B2 (en) 2012-01-25 2021-02-01 Device for decoding a video bitstream
US17/565,407 Active US11582446B2 (en) 2012-01-25 2021-12-29 Device for decoding a video bitstream
US18/109,253 Pending US20230199172A1 (en) 2012-01-25 2023-02-13 Device for Decoding a Video Bitstream

Country Status (7)

Country Link
US (9) US20130188709A1 (en)
EP (2) EP4117288A1 (en)
JP (3) JP2015508577A (en)
CN (2) CN107959859B (en)
CA (1) CA2861255C (en)
MY (2) MY197320A (en)
WO (1) WO2013111605A1 (en)

Families Citing this family (16)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
KR102332492B1 (en) * 2011-11-11 2021-12-01 엘지전자 주식회사 Method and device for transmitting image information, and decoding method and device using same
CN104205819B (en) * 2012-02-01 2017-06-30 诺基亚技术有限公司 Method for video encoding and device
WO2013158024A1 (en) * 2012-04-16 2013-10-24 Telefonaktiebolaget L M Ericsson (Publ) Encoder, decoder and methods thereof for video encoding and decoding
EP2887663B1 (en) 2012-09-29 2017-02-22 Huawei Technologies Co., Ltd. Method, apparatus and system for encoding and decoding video
JP6209772B2 (en) 2013-01-15 2017-10-11 華為技術有限公司Huawei Technologies Co.,Ltd. Video decoder using signaling
ES2687768T3 (en) * 2013-01-16 2018-10-29 Telefonaktiebolaget Lm Ericsson (Publ) Decoder and encoder to encode a video stream
CN104754347B (en) 2013-12-26 2019-05-17 中兴通讯股份有限公司 Coding, coding/decoding method and device, the electronic equipment of video image serial number
US9866851B2 (en) * 2014-06-20 2018-01-09 Qualcomm Incorporated Full picture order count reset for multi-layer codecs
US10097836B2 (en) * 2015-09-28 2018-10-09 Samsung Electronics Co., Ltd. Method and device to mark a reference picture for video coding
US11595652B2 (en) 2019-01-28 2023-02-28 Op Solutions, Llc Explicit signaling of extended long term reference picture retention
AU2019297829B2 (en) 2018-07-01 2022-10-06 FG Innovation Company Limited Systems and methods for signaling picture order count values for pictures included in coded video
CN114501018B (en) 2018-08-17 2024-01-09 华为技术有限公司 Decoding method, device and system for reference image management
CN113597768B (en) * 2019-01-28 2024-10-15 Op方案有限责任公司 On-line and off-line selection to extend long-term reference picture preservation
CN113661714B (en) * 2019-03-11 2024-10-01 交互数字Vc控股公司 Sprite bitstream extraction and repositioning
CN114258544A (en) 2019-08-26 2022-03-29 柯尼卡美能达株式会社 Label (R)
CN116781907A (en) * 2022-03-11 2023-09-19 华为技术有限公司 Encoding and decoding method and electronic equipment

Citations (18)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
US20040151243A1 (en) * 2003-01-31 2004-08-05 Vasudev Bhaskaran Method and apparatus for DCT domain filtering for block based encoding
US20050084007A1 (en) * 2003-10-16 2005-04-21 Lightstone Michael L. Apparatus, system, and method for video encoder rate control
US7471726B2 (en) * 2003-07-15 2008-12-30 Microsoft Corporation Spatial-domain lapped transform in digital media compression
US20090213938A1 (en) * 2008-02-26 2009-08-27 Qualcomm Incorporated Video decoder error handling
US7609897B2 (en) * 2002-09-09 2009-10-27 Ricoh Company, Ltd. Image coder and image decoder capable of power-saving control in image compression and decompression
US20110122942A1 (en) * 2009-11-20 2011-05-26 Texas Instruments Incorporated Techniques for perceptual encoding of video frames
US20120027089A1 (en) * 2010-07-28 2012-02-02 Qualcomm Incorporated Coding motion vectors in video coding
US20120082210A1 (en) * 2010-10-01 2012-04-05 Qualcomm Incorporated Coding prediction modes in video coding
US8165204B2 (en) * 2008-02-29 2012-04-24 Michael Bronstein Resource allocation for frame-based controller
US20120106624A1 (en) * 2010-11-02 2012-05-03 Mediatek Inc. Method and Apparatus of Slice Boundary Filtering for High Efficiency Video Coding
US20120189053A1 (en) * 2011-01-22 2012-07-26 Qualcomm Incorporated Combined reference picture list construction for video coding
US20120233405A1 (en) * 2011-03-07 2012-09-13 Madhukar Budagavi Caching Method and System for Video Coding
US20120269275A1 (en) * 2010-10-20 2012-10-25 Nokia Corporation Method and device for video coding and decoding
US20130058405A1 (en) * 2011-09-02 2013-03-07 David Zhao Video Coding
US20130077681A1 (en) * 2011-09-23 2013-03-28 Ying Chen Reference picture signaling and decoded picture buffer management
US20130089152A1 (en) * 2011-10-05 2013-04-11 Qualcomm Incorporated Signaling picture identification for video coding
US20130114742A1 (en) * 2011-11-08 2013-05-09 Nokia Corporation Reference picture handling
US20130142257A1 (en) * 2011-12-02 2013-06-06 Qualcomm Incorporated Coding picture order count values identifying long-term reference frames

Family Cites Families (22)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
JP3672185B2 (en) * 1999-02-09 2005-07-13 ソニー株式会社 CODING SYSTEM, DEVICE USING THE SAME, AND RECORDING MEDIUM
US6687384B1 (en) 2000-03-27 2004-02-03 Sarnoff Corporation Method and apparatus for embedding data in encoded digital bitstreams
US6887384B1 (en) 2001-09-21 2005-05-03 The Regents Of The University Of California Monolithic microfluidic concentrators and mixers
JP2004007563A (en) 2002-04-19 2004-01-08 Matsushita Electric Ind Co Ltd Method for encoding moving image and method for decoding moving image
JP4434155B2 (en) * 2006-02-08 2010-03-17 ソニー株式会社 Encoding method, encoding program, and encoding apparatus
KR101270167B1 (en) * 2006-08-17 2013-05-31 삼성전자주식회사 Method and apparatus of low complexity for compressing image, method and apparatus of low complexity for reconstructing image
CA2666452C (en) 2006-10-16 2014-12-16 Nokia Corporation System and method for implementing efficient decoded buffer management in multi-view video coding
KR100837410B1 (en) * 2006-11-30 2008-06-12 삼성전자주식회사 Method and apparatus for visually lossless image data compression
EP2048886A1 (en) 2007-10-11 2009-04-15 Panasonic Corporation Coding of adaptive interpolation filter coefficients
WO2010092740A1 (en) * 2009-02-10 2010-08-19 パナソニック株式会社 Image processing apparatus, image processing method, program and integrated circuit
US8617370B2 (en) 2010-09-30 2013-12-31 Cilag Gmbh International Systems and methods of discriminating between a control sample and a test fluid using capacitance
JP5625808B2 (en) * 2010-11-26 2014-11-19 沖電気工業株式会社 Data updating apparatus and program, moving picture decoding apparatus and program, and moving picture distribution system
WO2013008247A1 (en) 2011-07-13 2013-01-17 Neon Laboratories Ltd. Process for preparation of (dl) -norepinephrine acid addition salt, a key intermediate of (r) - (-) - norepinephrine
US20130094774A1 (en) * 2011-10-13 2013-04-18 Sharp Laboratories Of America, Inc. Tracking a reference picture based on a designated picture on an electronic device
BR122021007881B1 (en) 2011-10-28 2023-03-21 Samsung Electronics Co., Ltd VIDEO DECODING METHOD, AND VIDEO CODING METHOD
KR102332492B1 (en) * 2011-11-11 2021-12-01 엘지전자 주식회사 Method and device for transmitting image information, and decoding method and device using same
WO2013162258A1 (en) * 2012-04-23 2013-10-31 삼성전자 주식회사 Multiview video encoding method and device, and multiview video decoding mathod and device
US9319679B2 (en) * 2012-06-07 2016-04-19 Qualcomm Incorporated Signaling data for long term reference pictures for video coding
US9332255B2 (en) 2012-06-28 2016-05-03 Qualcomm Incorporated Signaling long-term reference pictures for video coding
US9591303B2 (en) * 2012-06-28 2017-03-07 Qualcomm Incorporated Random access and signaling of long-term reference pictures in video coding
US9912966B2 (en) * 2014-01-03 2018-03-06 Nokia Technologies Oy Parameter set coding
JP7128580B2 (en) * 2017-07-10 2022-08-31 フラウンホーファー-ゲゼルシャフト・ツール・フェルデルング・デル・アンゲヴァンテン・フォルシュング・アインゲトラーゲネル・フェライン bitplane encoding

Patent Citations (18)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
US7609897B2 (en) * 2002-09-09 2009-10-27 Ricoh Company, Ltd. Image coder and image decoder capable of power-saving control in image compression and decompression
US20040151243A1 (en) * 2003-01-31 2004-08-05 Vasudev Bhaskaran Method and apparatus for DCT domain filtering for block based encoding
US7471726B2 (en) * 2003-07-15 2008-12-30 Microsoft Corporation Spatial-domain lapped transform in digital media compression
US20050084007A1 (en) * 2003-10-16 2005-04-21 Lightstone Michael L. Apparatus, system, and method for video encoder rate control
US20090213938A1 (en) * 2008-02-26 2009-08-27 Qualcomm Incorporated Video decoder error handling
US8165204B2 (en) * 2008-02-29 2012-04-24 Michael Bronstein Resource allocation for frame-based controller
US20110122942A1 (en) * 2009-11-20 2011-05-26 Texas Instruments Incorporated Techniques for perceptual encoding of video frames
US20120027089A1 (en) * 2010-07-28 2012-02-02 Qualcomm Incorporated Coding motion vectors in video coding
US20120082210A1 (en) * 2010-10-01 2012-04-05 Qualcomm Incorporated Coding prediction modes in video coding
US20120269275A1 (en) * 2010-10-20 2012-10-25 Nokia Corporation Method and device for video coding and decoding
US20120106624A1 (en) * 2010-11-02 2012-05-03 Mediatek Inc. Method and Apparatus of Slice Boundary Filtering for High Efficiency Video Coding
US20120189053A1 (en) * 2011-01-22 2012-07-26 Qualcomm Incorporated Combined reference picture list construction for video coding
US20120233405A1 (en) * 2011-03-07 2012-09-13 Madhukar Budagavi Caching Method and System for Video Coding
US20130058405A1 (en) * 2011-09-02 2013-03-07 David Zhao Video Coding
US20130077681A1 (en) * 2011-09-23 2013-03-28 Ying Chen Reference picture signaling and decoded picture buffer management
US20130089152A1 (en) * 2011-10-05 2013-04-11 Qualcomm Incorporated Signaling picture identification for video coding
US20130114742A1 (en) * 2011-11-08 2013-05-09 Nokia Corporation Reference picture handling
US20130142257A1 (en) * 2011-12-02 2013-06-06 Qualcomm Incorporated Coding picture order count values identifying long-term reference frames

Also Published As

Publication number Publication date
JP2015508577A (en) 2015-03-19
JP6530035B2 (en) 2019-06-12
JP6913126B2 (en) 2021-08-04
US20230199172A1 (en) 2023-06-22
JP2019154062A (en) 2019-09-12
US20190182479A1 (en) 2019-06-13
US20170310960A1 (en) 2017-10-26
EP4117288A1 (en) 2023-01-11
JP2018057002A (en) 2018-04-05
US20130188709A1 (en) 2013-07-25
US10506227B2 (en) 2019-12-10
US20220124315A1 (en) 2022-04-21
MY185225A (en) 2021-04-30
MY197320A (en) 2023-06-13
CN104067620A (en) 2014-09-24
EP2807824A4 (en) 2016-05-18
US20170104991A1 (en) 2017-04-13
CN107959859A (en) 2018-04-24
CN104067620B (en) 2018-01-26
US20200112718A1 (en) 2020-04-09
CA2861255C (en) 2021-04-20
US9883181B2 (en) 2018-01-30
EP2807824A1 (en) 2014-12-03
EP2807824B1 (en) 2022-07-20
US10911752B2 (en) 2021-02-02
WO2013111605A1 (en) 2013-08-01
US11582446B2 (en) 2023-02-14
CN107959859B (en) 2021-04-13
CA2861255A1 (en) 2013-08-01
US20210152817A1 (en) 2021-05-20
US11245893B2 (en) 2022-02-08
US10250875B2 (en) 2019-04-02

Similar Documents

Publication Publication Date Title
US11245893B2 (en) Device for decoding a video bitstream
US10230979B2 (en) Video decoder with signaling
EP2759134B1 (en) Reference picture list construction for video coding
US20170366823A1 (en) Method for decoding video bitstream
US10440389B2 (en) Method and device for transmitting image information, and decoding method and device using same
EP2920971B1 (en) Devices and methods for processing of non-idr related syntax for high efficiency video coding (hevc)
US20130272398A1 (en) Long term picture signaling
KR20130118798A (en) Method and apparatus for image decoding

Legal Events

Date Code Title Description
STCB Information on status: application discontinuation

Free format text: ABANDONED -- FAILURE TO PAY ISSUE FEE

AS Assignment

Owner name: VELOS MEDIA, LLC, TEXAS

Free format text: ASSIGNMENT OF ASSIGNORS INTEREST;ASSIGNOR:SHARP KABUSHIKI KAISHA;REEL/FRAME:046310/0357

Effective date: 20180523