WO2021058596A1 - Codeur, décodeur et flux de données pour codage de rafraîchissement de décodeur progressif et codage évolutif - Google Patents
Codeur, décodeur et flux de données pour codage de rafraîchissement de décodeur progressif et codage évolutif Download PDFInfo
- Publication number
- WO2021058596A1 WO2021058596A1 PCT/EP2020/076616 EP2020076616W WO2021058596A1 WO 2021058596 A1 WO2021058596 A1 WO 2021058596A1 EP 2020076616 W EP2020076616 W EP 2020076616W WO 2021058596 A1 WO2021058596 A1 WO 2021058596A1
- Authority
- WO
- WIPO (PCT)
- Prior art keywords
- picture
- layer
- data stream
- refreshed
- sub
- Prior art date
Links
Classifications
-
- H—ELECTRICITY
- H04—ELECTRIC COMMUNICATION TECHNIQUE
- H04N—PICTORIAL COMMUNICATION, e.g. TELEVISION
- H04N19/00—Methods or arrangements for coding, decoding, compressing or decompressing digital video signals
- H04N19/10—Methods or arrangements for coding, decoding, compressing or decompressing digital video signals using adaptive coding
- H04N19/102—Methods or arrangements for coding, decoding, compressing or decompressing digital video signals using adaptive coding characterised by the element, parameter or selection affected or controlled by the adaptive coding
- H04N19/103—Selection of coding mode or of prediction mode
- H04N19/107—Selection of coding mode or of prediction mode between spatial and temporal predictive coding, e.g. picture refresh
-
- H—ELECTRICITY
- H04—ELECTRIC COMMUNICATION TECHNIQUE
- H04N—PICTORIAL COMMUNICATION, e.g. TELEVISION
- H04N19/00—Methods or arrangements for coding, decoding, compressing or decompressing digital video signals
- H04N19/10—Methods or arrangements for coding, decoding, compressing or decompressing digital video signals using adaptive coding
- H04N19/102—Methods or arrangements for coding, decoding, compressing or decompressing digital video signals using adaptive coding characterised by the element, parameter or selection affected or controlled by the adaptive coding
- H04N19/103—Selection of coding mode or of prediction mode
-
- H—ELECTRICITY
- H04—ELECTRIC COMMUNICATION TECHNIQUE
- H04N—PICTORIAL COMMUNICATION, e.g. TELEVISION
- H04N19/00—Methods or arrangements for coding, decoding, compressing or decompressing digital video signals
- H04N19/10—Methods or arrangements for coding, decoding, compressing or decompressing digital video signals using adaptive coding
- H04N19/102—Methods or arrangements for coding, decoding, compressing or decompressing digital video signals using adaptive coding characterised by the element, parameter or selection affected or controlled by the adaptive coding
- H04N19/119—Adaptive subdivision aspects, e.g. subdivision of a picture into rectangular or non-rectangular coding blocks
-
- H—ELECTRICITY
- H04—ELECTRIC COMMUNICATION TECHNIQUE
- H04N—PICTORIAL COMMUNICATION, e.g. TELEVISION
- H04N19/00—Methods or arrangements for coding, decoding, compressing or decompressing digital video signals
- H04N19/10—Methods or arrangements for coding, decoding, compressing or decompressing digital video signals using adaptive coding
- H04N19/134—Methods or arrangements for coding, decoding, compressing or decompressing digital video signals using adaptive coding characterised by the element, parameter or criterion affecting or controlling the adaptive coding
- H04N19/157—Assigned coding mode, i.e. the coding mode being predefined or preselected to be further used for selection of another element or parameter
- H04N19/159—Prediction type, e.g. intra-frame, inter-frame or bidirectional frame prediction
-
- H—ELECTRICITY
- H04—ELECTRIC COMMUNICATION TECHNIQUE
- H04N—PICTORIAL COMMUNICATION, e.g. TELEVISION
- H04N19/00—Methods or arrangements for coding, decoding, compressing or decompressing digital video signals
- H04N19/10—Methods or arrangements for coding, decoding, compressing or decompressing digital video signals using adaptive coding
- H04N19/169—Methods or arrangements for coding, decoding, compressing or decompressing digital video signals using adaptive coding characterised by the coding unit, i.e. the structural portion or semantic portion of the video signal being the object or the subject of the adaptive coding
- H04N19/17—Methods or arrangements for coding, decoding, compressing or decompressing digital video signals using adaptive coding characterised by the coding unit, i.e. the structural portion or semantic portion of the video signal being the object or the subject of the adaptive coding the unit being an image region, e.g. an object
- H04N19/174—Methods or arrangements for coding, decoding, compressing or decompressing digital video signals using adaptive coding characterised by the coding unit, i.e. the structural portion or semantic portion of the video signal being the object or the subject of the adaptive coding the unit being an image region, e.g. an object the region being a slice, e.g. a line of blocks or a group of blocks
-
- H—ELECTRICITY
- H04—ELECTRIC COMMUNICATION TECHNIQUE
- H04N—PICTORIAL COMMUNICATION, e.g. TELEVISION
- H04N19/00—Methods or arrangements for coding, decoding, compressing or decompressing digital video signals
- H04N19/10—Methods or arrangements for coding, decoding, compressing or decompressing digital video signals using adaptive coding
- H04N19/169—Methods or arrangements for coding, decoding, compressing or decompressing digital video signals using adaptive coding characterised by the coding unit, i.e. the structural portion or semantic portion of the video signal being the object or the subject of the adaptive coding
- H04N19/17—Methods or arrangements for coding, decoding, compressing or decompressing digital video signals using adaptive coding characterised by the coding unit, i.e. the structural portion or semantic portion of the video signal being the object or the subject of the adaptive coding the unit being an image region, e.g. an object
- H04N19/176—Methods or arrangements for coding, decoding, compressing or decompressing digital video signals using adaptive coding characterised by the coding unit, i.e. the structural portion or semantic portion of the video signal being the object or the subject of the adaptive coding the unit being an image region, e.g. an object the region being a block, e.g. a macroblock
-
- H—ELECTRICITY
- H04—ELECTRIC COMMUNICATION TECHNIQUE
- H04N—PICTORIAL COMMUNICATION, e.g. TELEVISION
- H04N19/00—Methods or arrangements for coding, decoding, compressing or decompressing digital video signals
- H04N19/30—Methods or arrangements for coding, decoding, compressing or decompressing digital video signals using hierarchical techniques, e.g. scalability
-
- H—ELECTRICITY
- H04—ELECTRIC COMMUNICATION TECHNIQUE
- H04N—PICTORIAL COMMUNICATION, e.g. TELEVISION
- H04N19/00—Methods or arrangements for coding, decoding, compressing or decompressing digital video signals
- H04N19/50—Methods or arrangements for coding, decoding, compressing or decompressing digital video signals using predictive coding
- H04N19/503—Methods or arrangements for coding, decoding, compressing or decompressing digital video signals using predictive coding involving temporal prediction
-
- H—ELECTRICITY
- H04—ELECTRIC COMMUNICATION TECHNIQUE
- H04N—PICTORIAL COMMUNICATION, e.g. TELEVISION
- H04N19/00—Methods or arrangements for coding, decoding, compressing or decompressing digital video signals
- H04N19/70—Methods or arrangements for coding, decoding, compressing or decompressing digital video signals characterised by syntax aspects related to video coding, e.g. related to compression standards
Definitions
- the present invention is concerned with coding pictures, and in particular a consecutive sequence of pictures.
- Some embodiments may exploit scalable coding and/or the so-called Gradual Decoder Refresh - GDR - coding scheme for coding the pictures.
- Some embodiments may suggest Scalable Coding and Gradual Decoder Refresh improvements.
- video is encoded in such a way that the sizes of the encoded pictures vary not only due to complexity characteristics of the content but also by the prediction structure used. More concretely, typically, videos are encoded using a prediction structure that is based on some pictures being encoded as Intra slices (not dependent on other pictures) and others being encoded as B or P slices (using other pictures as references). Obviously, pictures being encoded without prediction to other pictures lead to bigger sizes than when temporal correlation is used and pictures are encoded as B or P slices.
- GDR Gradual Decoding Refresh
- Such structures are identified as Gradual Decoding Refresh (GDR) as they differ from typical coding structures in the fact that, in order to achieve a clean picture, several pictures need to be decoded and the video areas are gradually decoded and refreshed until the content can be properly shown; while in typical coding structures, only a particular form of an Intra frame (Random Access Point-RAP) is required to be present and the content can be instantaneously shown without having to decode more access units.
- GDR Gradual Decoding Refresh
- a picture may be subdivided in picture regions (e.g. tiles) which may be refreshed overtime in a so-called Refresh Period RP.
- regions e.g. tiles
- Refresh Period RP Refresh Period
- a first aspect concerns a video data stream comprising a sequence of pictures comprising at least one Gradual Decoder Refresh - GDR - coded picture and one or more subsequent pictures in a refresh period (RP).
- the video data stream further comprises a parameter set (SPS) defining a plurality of picture configurations, which subdivide a picture area (e.g. entire frame) into a first sub-area and a second sub-area among which one corresponds to a refreshed sub-area (e.g. a set of picture regions) comprising one or more refreshed picture regions (e.g. tiles) and the other one corresponds to an un-refreshed sub-area comprising one or more yet un-refreshed picture regions.
- the video data stream further comprises, for each picture within the refresh period, a picture configuration identifier (reg_conf_idx) for identifying a corresponding one picture configuration out of the plurality of picture configurations.
- a decoder for decoding from a data stream at least one picture out of a sequence of pictures comprising at least one Gradual Decoder Refresh - GDR
- the decoder is configured to read from the data stream a parameter set (SPS) defining a plurality of picture configurations, which subdivide a picture area (e.g. entire frame) into a first sub-area and a second sub-area among which one corresponds to a refreshed sub-area (e.g. a set of picture regions) comprising one or more refreshed picture regions (e.g. tiles) and the other one corresponds to an un-refreshed sub-area comprising one or more yet un-refreshed picture regions.
- SPS parameter set
- the decoder is further configured to read from the data stream, for each picture within the refresh period, a picture configuration identifier (reg_conf_idx) for identifying a corresponding one picture configuration out of the plurality of picture configurations for decoding the at least one picture.
- a picture configuration identifier (reg_conf_idx) for identifying a corresponding one picture configuration out of the plurality of picture configurations for decoding the at least one picture.
- an encoder for encoding into a data stream at least one picture out of a sequence of pictures comprising at least one Gradual Decoder Refresh - GDR
- the encoder is configured to write into the data stream a parameter set (SPS) defining a plurality of picture configurations, which subdivide a picture area (e.g. entire frame) into a first sub-area and a second sub-area among which one corresponds to a refreshed sub-area (e.g. a set of picture regions) comprising one or more refreshed picture regions (e.g. tiles) and the other one corresponds to an un-refreshed sub-area comprising one or more yet un-refreshed picture regions.
- the encoder is further configured to set in the data stream, for each picture within the refresh period, a picture configuration identifier (reg_conf_idx) for identifying a corresponding one picture configuration out of the plurality of picture configurations.
- a second aspect concerns a video data stream comprising a sequence of pictures comprising at least one Gradual Decoder Refresh - GDR - coded picture and one or more subsequent pictures in a refresh period, wherein each picture of the sequence of pictures is sequentially coded into the video data stream in units of blocks (e.g. CTUs) into which the respective picture is subdivided.
- the video data stream comprises an implicit signaling, wherein a refreshed sub- area of a respective picture is implicitly signaled in the video data stream based on a block coding order.
- the video data stream comprises, for each block, a syntax element (e.g.
- a flag indicating whether a) the block is a last block located in a first sub-area of a respective picture and lastly coded (e.g. flag: last_ctu_of_gdr_region), and/or b) the block is a first block located in a first sub-area of a respective picture and firstly coded (e.g. flag: first_ctu_of_gdr_region), and/or c) the block adjoins a border confining a first sub-area, and/or d) the block is located inside a first sub-area (e.g. flag: gdr_region_flag).
- a decoder for decoding from a data stream at least one picture out of a sequence of pictures comprising at least one Gradual Decoder Refresh - GDR - coded picture and one or more subsequent pictures in a refresh period (RP), wherein each picture of the sequence of pictures is sequentially decoded from the video data stream in units of blocks (e.g. CTUs) into which the respective picture is subdivided.
- the decoder is configured to implicitly derive from the data stream a refreshed sub-area of the at least one picture based on a block coding order. Additionally or alternatively, the decoder is configured to read from the data stream, for each block, a syntax element (e.g.
- a flag indicating whether a) the block is a last block located in a first sub-area of a respective picture and lastly coded (e.g. flag: last_ctu_of_gdr_region), and/or b) the block is a first block located in a first sub-area of a respective picture and firstly coded (e.g. flag: first_ctu_of_gdr_region), and/or c) the block adjoins a border confining a first sub-area, and/or d) the block is located inside a first sub-area (e.g. flag: gdr_region_flag).
- an encoder for encoding into a data stream at least one picture out of a sequence of pictures comprising at least one Gradual Decoder Refresh - GDR - coded picture and one or more subsequent pictures in a refresh period (RP), wherein each picture of the sequence of pictures is sequentially encoded into the video data stream in units of blocks (e.g. CTUs) into which the respective picture is subdivided.
- the encoder is configured to write into the data stream, for each block, a syntax element (e.g. a flag) indicating whether a) the block is a last block located in a first sub-area of a respective picture and lastly coded (e.g.
- the block is a first block located in a first sub-area of a respective picture and firstly coded (e.g. flag: first_ctu_of_gdr_region), and/or c) the block adjoins a border confining a first sub-area, and/or d) the block is located inside a first sub-area (e.g. flag: gdr_region_flag).
- a third aspect concerns a multi layered scalable video data stream comprising a first sequence of pictures in a first layer (e.g. base layer) and a second sequence of pictures in a second layer (e.g. an enhancement layer), wherein the second sequence of pictures in the second layer comprises at least one Gradual Decoder Refresh - GDR - picture as a start picture and one or more subsequent pictures in a refresh period.
- the multi layered scalable video data stream comprises a signalization carrying information about a possibility that a yet un-refreshed sub- area of the GDR picture of the second layer is to be inter-layer predicted from samples of the first layer.
- the multi layered scalable video data stream comprises information: that in yet un-refreshed sub-areas of the one or more subsequent pictures contained in the refresh period, motion vector prediction is disabled or motion vector prediction is realized non-temporally, or that in a yet un-refreshed sub-area of the GDR picture motion vector prediction is disabled or motion vector prediction is realized non-temporally.
- a decoder for decoding at least one picture from a multi layered scalable video data stream comprising a first sequence of pictures in a first layer (e.g. base layer) and a second sequence of pictures in a second layer (e.g. an enhancement layer), wherein the second sequence of pictures in the second layer comprises at least one Gradual Decoder Refresh - GDR - picture as a start picture and one or more subsequent pictures in a refresh period.
- the decoder is configured to read from the multi layered scalable video data stream a signalization carrying information about a possibility that a yet un-refreshed sub-area of the GDR picture of the second layer is to be inter-layer predicted from samples of the first layer.
- the decoder is further configured to, responsive to the signalization: disable motion vector prediction or to realize motion vector prediction non-temporally in yet un-refreshed sub-areas of the one or more subsequent pictures contained in the refresh period, or to disable motion vector prediction or to realize motion vector prediction non-temporally in a yet un-refreshed sub-area of the GDR picture.
- an encoder for encoding at least one picture into a multi layered scalable video data stream comprising a first sequence of pictures in a first layer (e.g. base layer) and a second sequence of pictures in a second layer (e.g. an enhancement layer), wherein the second sequence of pictures in the second layer comprises at least one Gradual Decoder Refresh - GDR - picture as a start picture and one or more subsequent pictures in a refresh period.
- a first layer e.g. base layer
- a second sequence of pictures in a second layer e.g. an enhancement layer
- the encoder is configured to write into the multi layered scalable video data stream a signalization carrying information about a possibility that a yet un-refreshed sub-area of the GDR picture of the second layer is to be inter-layer predicted from samples of the first layer, and information: that in yet un-refreshed sub-areas of the one or more subsequent pictures contained in the refresh period, motion vector prediction is disabled or motion vector prediction is realized non-temporally, or that in a yet un-refreshed sub-area of the GDR picture motion vector prediction is disabled or motion vector prediction is realized non-temporally.
- a fourth aspect concerns a multi layered scalable video data stream comprising a first sequence of pictures in a first layer (e.g. base layer) and a second sequence of pictures in a second layer (e.g. an enhancement layer), each of the first and second layers comprising a plurality of temporal sublayers.
- the scalable video data stream further comprises a signalization (e.g. vps_sub_layer_independent_flag[i][j]) indicating which temporal sublayers of the second layer (e.g. enhancement layer) are coded by inter-layer prediction.
- a decoder for decoding at least one picture from a multi layered scalable video data stream comprising a first sequence of pictures in a first layer (e.g. base layer) and a second sequence of pictures in a second layer (e.g. an enhancement layer), each of the first and second layers comprising a plurality of temporal sublayers.
- the decoder is configured to decode one or more of the temporal sublayers by using inter-layer prediction based on a signalization derived from the scalable video data stream, said signalization (e.g. vps_sub_layerjndependent_flag[i][j]) indicating which temporal sublayers of the second layer (e.g.
- enhancement layer are to be coded by inter-layer prediction.
- an encoder for encoding at least one picture into a multi layered scalable video data stream comprising a first sequence of pictures in a first layer (e.g. base layer) and a second sequence of pictures in a second layer (e.g. an enhancement layer), each of the first and second layers comprising a plurality of temporal sublayers.
- the encoder is configured to encode one or more of the temporal sublayers by using inter-layer prediction and to write a signalization into the scalable video data stream, said signalization (e.g. vps_sub_layer_independent_flag[i][j]) indicating which temporal sublayers of the second layer (e.g. enhancement layer) are coded by inter-layer prediction.
- a fifth aspect concerns a video data stream comprising at least one picture being subdivided into tiles, and a tile-reordering flag, wherein a) if the tile-reordering flag (e.g. sps_enforce_raster_scan_flag) in the data stream has a first state, it is signaled that tiles of the picture are to be coded using a first coding order which traverses the picture tile by tile, and/or b) if the tile-reordering flag in the data stream has a second state, it is signaled that tiles of the picture are to be coded using a second coding order which traverses the picture along a raster scan order.
- the tile-reordering flag e.g. sps_enforce_raster_scan_flag
- a decoder configured to decode a picture from a data stream, wherein: a) if a tile-reordering flag (e.g. sps_enforce_raster_scan_flag) in the data stream has a first state, the decoder is configured to decode tiles of the picture from the data stream using a first decoding order which traverses the picture tile by tile, and/or b) if the tile-reordering flag in the data stream has a second state, the decoder is configured to decode the tiles of the picture from the data stream using a second decoding order which traverses the picture along a raster scan order.
- a tile-reordering flag e.g. sps_enforce_raster_scan_flag
- an encoder configured to encode a picture into a data stream, wherein: a) the encoder is configured to set a tile-reordering flag (e.g. sps_enforce_raster_scan_flag) in the data stream into a first state, indicating that tiles of the picture are to be coded using a first coding order which traverses the picture tile by tile, and/or b) the encoder is configured to set the tile-reordering flag in the data stream into a second state, indicating that tiles of the picture are to be coded using a second coding order which traverses the picture along a raster scan order.
- a tile-reordering flag e.g. sps_enforce_raster_scan_flag
- Fig. 1 shows a schematic diagram of a Gradual Decoding Refresh concept according to an embodiment
- Fig. 2 shows a schematic diagram of a Gradual Decoding Refresh concept according to an embodiment
- Fig. 3 shows a schematic diagram of a Gradual Decoding Refresh concept using columns of picture regions according to an embodiment
- Fig. 4 shows a schematic diagram of a Gradual Decoding Refresh concept using rows of picture regions according to an embodiment
- Fig. 5 shows a schematic diagram of a Gradual Decoding Refresh concept using packets of picture regions according to an embodiment
- Fig. 6 shows a schematic diagram of a Gradual Decoding Refresh concept using a scalable multilayer video bitstream according to an embodiment
- Fig. 7 shows a schematic diagram of a Gradual Decoding Refresh concept using a scalable multilayer video bitstream with multiple temporal sublayers according to an embodiment.
- Method steps which are depicted by means of a block diagram and which are described with reference to said block diagram may also be executed in an order different from the depicted and/or described order. Furthermore, method steps concerning a particular feature of a device may be replaceable with said feature of said device, and the other way around.
- Figure 1 shows an example of a Gradual Decoding Refresh (GDR) coding structure which differs from conventional coding structures in the fact that, in order to achieve a clean picture, several pictures need to be decoded and the video areas are gradually decoded and refreshed until the content can be properly shown.
- GDR Gradual Decoding Refresh
- FIG. 1 shows an example of a Gradual Decoding Refresh (GDR) coding structure which differs from conventional coding structures in the fact that, in order to achieve a clean picture, several pictures need to be decoded and the video areas are gradually decoded and refreshed until the content can be properly shown.
- GDR Gradual Decoding Refresh
- Figure 1 shows a sequence 100 of pictures 1011, 1012, .... 101 n in a consecutive order.
- Each picture 1011, 1012, ..., 101 n may be divided into picture regions 102, for example into tiles.
- the tiles 102 may be intra-coded 102a or inter-coded 102b.
- Intra-coded picture regions 102a may provide for an access at which the decoder may start accessing the bitstream and start refreshing the entire picture according to the Gradual Decoding Refresh (GDR) principle.
- GDR Gradual Decoding Refresh
- Frames or pictures comprising such an Intra-Coded picture region 102a may therefore also be referred to as an Access Unit (AU).
- GDR Gradual Decoding Refresh
- a picture comprising an Intra-Coded picture region 102a may also be referred to as a GDR- picture 103.
- every second picture may be a GDR-picture 103.
- the GDR-delta is two in this example.
- a picture region that has been Intra-Coded i.e. an Intra-Coded picture region 102a
- Picture regions that have not yet been coded after an access may be referred to as non-refreshed picture regions or dirty picture regions, respectively.
- Variant A Full MCTS based
- the refresh period (RP) is the time interval that needs to be waited until all picture regions 102 are refreshed and a clean picture can be shown. There are different forms of configuring such a bitstream.
- (nine tiles x new refreshed tile at every second frame - 1) 17 frames is the refresh period (RP), i.e. the number of frames until all picture regions 102 are refreshed.
- a GDR picture 103 may be present every second picture, i.e. a picture at which a decoder can start accessing the bitstream and decode a full picture after a full RP. This can be achieved by encoding all picture regions 102 independently of each other over the time, e.g. called MCTS in HEVC and spreading the intra coded blocks 102a in time with a distance of two frames among picture regions 102 in the example above.
- Variant B Constrained inter tiles (c.f. attached Figure 2, also represented above)
- Figure 2 shows a further configuration, wherein a GDR structure can be achieved by having a GDR frame 103 every 18 frames and allowing dependencies among regions (no-MCTS) but allowing only dependencies on previously refreshed regions. In that case only one region is refreshed with intra block and further regions can reference the region refreshed in time. This allows a better efficiency as the first shown configuration, since regions are not encoded with full MCTS.
- a GDR picture 103 is present every 18 th picture while the refresh period (RP) comprises 17 frames.
- Figure 3 shows a further non-limiting example, wherein a GDR picture 103 is present every 18 th picture while 17 frames is the refresh period (RP).
- the Intra-Coded picture region 102a in this example may comprise a picture column instead of an above discussed picture tile.
- Figure 4 shows a further non-limiting example, wherein a GDR picture 103 is present every 18 th picture while 17 frames is the refresh period (RP).
- the Intra-Coded picture region 102a in this example may comprise a picture row instead of an above discussed picture tile.
- D delta GDR, distance between GDR pictures, i.e. possible decoding start.
- a further important aspect to evaluate a GDR technology is the tune-in time required to show a picture, which consists of the RP plus the time that needs to be waited until a GDR picture 103 is found.
- the table above shows the tune-in-time in average and worst-case.
- regions may not be defined in a static way, therefore reducing the signaling overhead and efficiency penalty of using to some extent independent regions, such as tiles for example.
- regions may not be defined in a static way, therefore reducing the signaling overhead and efficiency penalty of using to some extent independent regions, such as tiles for example.
- the identification of which regions are clean (refreshed) and which regions are not yet refreshed may not be unambiguous, and thus comes with some penalties, such as, intra prediction cannot be easily
- the picture regions e.g. in form of tiles
- SPS Sequence Parameter Set
- PPS Picture Parameter Set
- slice headers may point to an index to indicate which is the tile configuration that is in use for a given AU (Access Unit).
- a picture area may comprise an entire frame 101 or picture 101.
- a picture area may be divided into picture sub-areas.
- a first picture sub-area 101 r may comprise only refreshed picture regions 102r. This first picture sub-area 101 r may therefore also be referred to as a refreshed picture sub-area 101 r.
- a second picture sub-area 101 u may only comprise yet un-refreshed picture regions 102u. This second picture sub-area 101u may therefore also be referred to as a yet un-refreshed picture sub-area 101 u.
- a picture region 102 in general may comprise at least one of a picture tile ( Figures 1 and 2), a picture tile column ( Figure 3), a picture tile line ( Figure 4), a coding block (e.g. CTUs), a coding block line, a coding block row, a coding block diagonal, a sample, a row of samples, or a column of samples.
- a refreshed picture region 102r may comprise at least one of a refreshed picture tile ( Figures 1 and 2), a refreshed picture tile column ( Figure 3), a refreshed picture tile line ( Figure 4), a refreshed coding block (e.g.
- a yet unrefreshed picture region 102u may comprise at least one of a yet un-refreshed picture tile ( Figures 1 and 2), a yet un-refreshed picture tile column ( Figure 3), a yet un-refreshed picture tile line ( Figure 4), a yet un-refreshed coding block (e.g.
- a yet un-refreshed coding block line a yet un-refreshed coding block row, a yet un-refreshed coding block diagonal, a yet un-refreshed sample, a yet un-refreshed row of samples, or a yet un-refreshed column of samples.
- a video data stream may be provided, the video data stream comprising a sequence 100 of pictures 1011, 1012, .... 101 n comprising at least one Gradual Decoder Refresh - GDR - coded picture 103 and one or more subsequent pictures in a refresh period RP.
- the video data stream further comprises a parameter set (e.g. SPS or PPS) defining a plurality of picture configurations, which subdivide a picture area 101 (e.g. an entire frame) into a first sub-area 101 r (e.g. first one or more picture regions 102 comprising tiles, rows, columns, etc.) and a second sub-area 101u (e.g.
- the video data stream comprises for each picture 1011, 1012. 101 n within the refresh period RP a picture configuration identifier (e.g. region_configuration_idx) for identifying a corresponding one picture configuration out of the plurality of picture configurations.
- a picture configuration identifier e.g. region_configuration_idx
- a corresponding decoder for decoding from a data stream at least one picture out of a sequence 100 of pictures 1011, 1012, ..., 101 n comprising at least one Gradual Decoder Refresh - GDR - coded picture 103 and one or more subsequent pictures 1012, .... 101 n in a refresh period - RP, wherein the decoder is configured to read from the data stream a parameter set (e.g. PPS or SPS) defining a plurality of picture configurations, which subdivide a picture area (e.g. an entire frame) 101 into a first sub-area 101 r and a second sub-area 101u among which one corresponds to a refreshed sub-area (e.g.
- a parameter set e.g. PPS or SPS
- the decoder may further be configured to read from the data stream, for each picture 1011, 1012, 101 n within the refresh period - RP, a picture configuration identifier (region_configuration_idx) for identifying a corresponding one picture configuration out of the plurality of picture configurations for decoding the at least one picture 1011, 101 2 , 10V
- a corresponding encoder for encoding into a data stream at least one picture out of a sequence 100 of pictures 1011, 1012, ..., 101 n comprising at least one Gradual Decoder Refresh - GDR - coded picture 103 and one or more subsequent pictures 1012, ....
- the encoder is configured to write into the data stream a parameter set defining a plurality of picture configurations, which subdivide a picture area 101 into a first sub-area 101 r and a second sub- area 101u among which one corresponds to a refreshed sub-area 101 r comprising one or more refreshed picture regions 102r and the other one corresponds to an un-refreshed sub- area 101u comprising one or more yet un-refreshed picture regions 102u.
- the encoder is further configured to set in the data stream, for each picture 1011, 1012, 101 n within the refresh period - RP, a picture configuration identifier for identifying a corresponding one picture configuration out of the plurality of picture configurations for decoding the at least one picture
- the method comprising steps of reading from the data stream a parameter set defining a plurality of picture configurations, which subdivide a picture area 101 into a first sub-area 101 r and a second sub- area 101u among which one corresponds to a refreshed sub-area 101 r comprising one or more refreshed picture regions 102r and the other one corresponds to an un-refreshed sub- area 101u comprising one or more yet un-refreshed picture regions 102u.
- the method further comprises a step of reading from the data stream, for each picture 1011, 1012, .... 101 n within the refresh period - RP, a picture configuration identifier for identifying a corresponding one picture configuration out of the plurality of picture configurations for decoding the at least one picture 1011, 1012, .... 10V
- the method comprising steps of writing into the data stream a parameter set defining a plurality of picture configurations, which subdivide a picture area 101 into a first sub-area 101 r and a second sub- area 101u among which one corresponds to a refreshed sub-area 101 r comprising one or more refreshed picture regions 102r and the other one corresponds to an un-refreshed sub- area 101u comprising one or more yet un-refreshed picture regions 102u.
- the method further comprises a step of setting in the data stream, for each picture 1011, 101 2 , ..., 101 n within the refresh period - RP, a picture configuration identifier for identifying a corresponding one picture configuration out of the plurality of picture configurations.
- the sequence 100 of pictures 1011, 101.. 101 n may be coded in different ways.
- the sequence 100 of pictures 1011, 1012, .... 101 n may be coded in a manner so that intra prediction does not cross a boundary between the first and second sub-areas 101 r, 101u.
- the sequence 100 of pictures 1011, 1012, ... , 101 n may be coded in a manner so that temporal prediction of the refreshed sub-area 101 r does not reference the yet un-refreshed sub-area 101u.
- the sequence 100 of pictures 1011, 1012. 101 n may be coded in a manner so that context model derivation does not cross a boundary between the first and second sub-areas 101 r, 101u.
- the corresponding one picture configuration indicates the refreshed picture regions 102r and the yet un-refreshed picture regions 102u contained in a currently coded picture 1011, 1012, .... 101 n of the sequence 100 of pictures.
- each picture configuration out of the plurality of picture configurations may comprise a set of region indices for signaling which picture regions 102 are refreshed picture regions 102r and which picture regions 102 are un- refreshed picture regions 102u. This provides for an explicit signaling of refreshed and yet un-refreshed picture regions 102r, 102u.
- each picture configuration out of the plurality of picture configurations may comprise a set of tile indices for signaling which picture tiles 102 are refreshed picture tiles 102r and which picture tiles 102 are un-refreshed picture tiles 102u.
- the region configuration may contain a set of tile indices.
- the pictures 1011, 101 . 101 n contained in the sequence 100 of pictures may be subdivided into picture tile columns, wherein each picture configuration out of the plurality of picture configurations comprises at least one column index for signaling which picture columns are refreshed picture columns 102r and/or which picture columns are un-refreshed picture columns 102u.
- the region configuration may contain a tile column index.
- the pictures 1011, 1012, .... 101 n contained in the sequence 100 of pictures may be subdivided into picture tile rows, wherein each picture configuration out of the plurality of picture configurations comprises at least one row index for signaling which picture rows are refreshed picture rows 102r and/or which picture rows are un-refreshed picture rows 102u.
- the region configuration may contain a tile row index.
- a picture region may also be represented by a coding block, e.g. by a CTU (Coding Tree Unit).
- a coding block e.g. by a CTU (Coding Tree Unit).
- the pictures 1011, 1012, .... 101 n contained in the sequence 100 of pictures may be subdivided into rows of coding blocks (e.g. CTUs), wherein each picture configuration out of the plurality of picture configurations may comprise at least one row coding block index for signaling which rows of coding blocks are refreshed rows 102r of coding blocks and/or which rows of coding blocks are un-refreshed rows 102u of coding blocks.
- the region configuration may contain a CTU row index.
- the pictures 1011, 1012, .... 101 n contained in the sequence 100 of pictures may be subdivided into columns of coding blocks (e.g. CTUs), wherein each picture configuration out of the plurality of picture configurations may comprise at least one column coding block index for signaling which columns of coding blocks are refreshed columns102r of coding blocks and/or which columns of coding blocks are un-refreshed columns 102u of coding blocks.
- the region configuration may contain a CTU column index.
- each picture configuration out of the plurality of picture configurations may comprise at least one diagonal coding block index for signaling which diagonals of coding blocks are refreshed diagonals 102r of coding blocks and/or which diagonals of coding blocks are un-refreshed diagonals 102u of coding blocks.
- the region configuration may contain one or more indices of a CTU diagonal.
- a picture region may also be represented by samples.
- the pictures 1011, 1012, .... 101 n contained in the sequence 100 of pictures may be subdivided into rows of samples, wherein each picture configuration out of the plurality of picture configurations may comprise at least one sample row index for signaling which rows of samples are refreshed rows 102r of samples and/or which rows of samples are un-refreshed rows 102u of samples.
- the region configuration may contain one or more sample row indexes.
- the pictures 1011, 101 2 , .... 101 n contained in the sequence 100 of pictures may be subdivided into columns of samples, wherein each picture configuration out of the plurality of picture configurations may comprise at least one sample column index for signaling which columns of samples are refreshed columns 102r of samples and/or which columns of samples are un-refreshed columns 102u of samples.
- the region configuration may contain one or more sample column indexes.
- the corresponding one picture configuration may be signaled in a slice header and/or in an Access Unit Delimiter of the video data stream.
- the information about the used region configuration is included into the Access Unit Delimiter (AUD).
- a video data stream comprising a sequence 100 of pictures 1011 , 1012, .... 101 n comprising at least one Gradual Decoder Refresh - GDR - coded picture 103 and one or more subsequent pictures 1012, .... 101 n in a refresh period - RP.
- Each picture of the sequence 100 of pictures 1011 , 101 2 , ..., 101 n may be sequentially coded into the video data stream in units of blocks 102 (e.g. CTUs) into which the respective picture is subdivided.
- the video data stream may comprise an implicit signaling, wherein a refreshed sub-area 102r of a respective picture 1011, 101 2 , .... 101 n is implicitly signaled in the video data stream based on a block coding order.
- a respective decoder i.e. a decoder for decoding from a data stream at least one picture out of a sequence 100 of pictures 1011, 101 2 , .... 101 n comprising at least one Gradual Decoder Refresh - GDR - coded picture 103 and one or more subsequent pictures in a refresh period RP.
- Each picture of the sequence 100 of pictures 1011, 1012, ..., 101 n may be sequentially decoded from the video data stream in units of blocks 102 (e.g. CTUs) into which the respective picture is subdivided.
- the decoder may be configured to implicitly derive from the data stream a refreshed sub-area 102r of the at least one picture based on a block coding order.
- a respective encoder i.e. an encoder for encoding into a data stream at least one picture out of a sequence 100 of pictures 1011, 1012, ..., 101 n comprising at least one Gradual Decoder Refresh - GDR - coded picture 103 and one or more subsequent pictures in a refresh period RP.
- Each picture of the sequence 100 of pictures 1011, 1012, .... 101 n may be sequentially encoded into the video data stream in units of blocks 102 (e.g. CTUs) into which the respective picture is subdivided.
- the encoder may be configured to implicitly derive from the data stream a refreshed sub-area 101 r of the at least one picture based on a block coding order.
- Each CTU (picture region) 102 may contain a flag (which can be CABAC-coded) indicating whether it is the last CTU 102 of the GDR region 103 or not.
- This signaling affects for instance the availability of samples for intra prediction and/or CABAC reset. The benefit of such an approach is that it is more flexible not being limited to a fixed grid defined in a parameter set.
- the syntax element is for indicating a boundary between a refreshed sub-area 101 r and a yet un-refreshed sub-area 101u of a picture 101 out of the sequence 100 of pictures 1011, 101 2 , .... 101 n and/or for indicating which sub-area is a refreshed sub-area 101 r and which sub-area is a yet un-refreshed sub-area 101 u.
- each CTU 102 indicating whether it is the last CTU in the GDR region 103, it is beneficial to identify whether the last CTU 102 in a region means the last CTU 102 in terms of rows or columns. Such an indication may be done in a parameter set, e.g. in SPS.
- the flag region_horizontal_flag may indicate that the last CTU flag in CTU indicates a horizontal split. Otherwise, a vertical split.
- a video data stream, an encoder and a decoder are suggested, wherein in case that the syntax element indicates that a) the block 102 is a last block located in a refreshed sub-area 101 r of a respective picture 101 and lastly coded, the video data stream comprises a further syntax element (e.g. region_horizontal_flag) for indicating whether a1) the block 102 is a lastly coded block of one or more rows of blocks 102 of the refreshed sub-area 101 r, or a2) the block 102 is a lastly coded block of one or more columns of blocks 102 of the refreshed sub-area 101 r.
- a further syntax element e.g. region_horizontal_flag
- the region may be indicated whether the region is an intra-prediction break, i.e. neighbors of another region are not available for prediction, or CABAC, etc.
- the video data stream may comprise an intra-prediction break indication for indicating that neighboring blocks 102 of a neighboring picture region 102u are not available for prediction, e.g. if said neighboring picture region 102u is contained in a yet un-refreshed sub-area 101u.
- the refreshed sub-area 101 r may comprise one or more refreshed picture regions 102r which are arranged in a grid that is aligned with the size of the blocks 102 into which the respective picture 101 is subdivided.
- the regions it is not necessarily known which of the regions is a refreshed (clean) and not refreshed (dirty) region.
- all regions are considered to be “independent” from each other in all or some of the following aspects:
- the signaling implicitly indicates that the left-most region 101 r is a clean region and the availability of intra blocks is constrained for this region - i.e. the blocks 102 in the left-most region cannot use blocks of another (non-left-most) region for all or some of the following aspects:
- the implicit signaling may signal that a first block 102 at a predetermined position in the block coding order (e.g. a first CTU in upper left corner) is part of the refreshed-sub area 101 r.
- a predetermined position in the block coding order e.g. a first CTU in upper left corner
- some embodiments of the present invention may provide for an explicit signaling, wherein video data stream, a respective encoder and a respective decoder are suggested, wherein the video data stream may comprise, for each block 102, a syntax element indicating whether a) the block 102 is a last block located in a first sub-area 101 r of a respective picture and lastly coded (e.g. flag: last_ctu_of_gdr_region), and/or b) the block 102 is a first block located in a first sub-area 101 r of a respective picture and firstly coded (e.g.
- first_ctu_of_gdr_region flag: first_ctu_of_gdr_region
- the block 102 adjoins a border confining a first sub-area 101 r
- the block 102 is located inside a first sub-area 101 r (e.g. flag: gdr_region_flag).
- the start of the GDR region 103 may be indicated at CTU level, e.g. by using a flag (CABAC-coded)
- the CTU based region start and/or end flags may be signaled only in the first CTU column of a tile 102, if horizontal region splits are enabled, and in the first CTU row of a tile 102, if vertical region splits are enabled.
- a picture region of the respective picture 101 may be vertically subdivided into one or more slices 102, wherein the syntax element (e.g. flag: last_ctu_of_gdr_region //flag: first_ctu_of_gdr_region) is signaled for each slice 102.
- the syntax element e.g. flag: last_ctu_of_gdr_region //flag: first_ctu_of_gdr_region
- a picture region of the respective picture 101 may be horizontally subdivided into one or more rows of blocks 102, wherein the syntax element (e.g. flag: last_ctu_of_gdr_region // flag: first_ctu_of_gdr_region) is signaled i. only in the first row, or ii. in every row.
- the syntax element e.g. flag: last_ctu_of_gdr_region // flag: first_ctu_of_gdr_region
- a CTU based (CABAC-coded) flag may be signaled, indicating whether the CTU 102 is part of the GDR refresh region 103 or not.
- the CTU start and/or end indexes of the GDR refresh region 103 may be signaled in the slice header.
- picture regions may be decoupled from the usage of tiles 102 and thereby the scan order may not be affected.
- low delay transmission is desired. In order to achieve low delay transmission, all packets sent should have the same size and not only all AUs. Typically, in those low delay scenarios each AU may be split into multiple packets and in order to achieve that all packets have the same size (or very similar), each packet should have the same amount of blocks 102rthat are refreshed (belong to the clean area 101 r) and of blocks 102u that are not refreshed (belong to the dirty area 101u).
- FIG. 5 shows a non-limiting example of a sequence 100 of pictures 1011 , 101 2 , .... 101 n , wherein each picture 1011, 1012, .... 101 n may be split into multiple packets 501a, 501b, ..., 501 n.
- each packet 501a, 501b, ..., 501 n may have the same amount of blocks 102rthat are refreshed (belong to the clean area 101 r) and of blocks 102u that are not refreshed (belong to the dirty area 101u).
- the tile scan order would be in use and therefore the packets 501a, 501b, ..., 501 n could not have the same amount of blocks 102rthat are refreshed (belong to the clean area 101 r) and of blocks 102u that are not refreshed (belong to the dirty area 101u).
- tiles are used but a syntax element is added to the parameter set that enforces to follow raster scan and not tile scan.
- a syntax element is added to the parameter set that enforces to follow raster scan and not tile scan.
- sps_enfoce_raster_scan_flag In that case, raster scan would be used and not byte alignment would happen within the bitstream for CTUs starting a new tile.
- a video data stream comprising at least one picture 101 being subdivided into tiles 102, and a tile-reordering flag (e.g. sps_enforce_raster_scan_flag), wherein a) if the tile-reordering flag in the data stream has a first state, it is signaled that tiles 102 of the picture 101 are to be coded using a first coding order which traverses the picture 101 tile by tile, and/or b) if the tile-reordering flag in the data stream has a second state, it is signaled that tiles 102 of the picture 101 are to be coded using a second coding order which traverses the picture 101 along a raster scan order.
- a tile-reordering flag e.g. sps_enforce_raster_scan_flag
- a further embodiment suggests a corresponding decoder that may be configured to decode a picture 101 from a data stream, wherein: a) if a tile-reordering flag (e.g. sps_enforce_raster_scan_flag) in the data stream has a first state, the decoder is configured to decode tiles 102 of the picture 101 from the data stream using a first decoding order which traverses the picture 101 tile by tile, and/or b) if the tile-reordering flag in the data stream has a second state, the decoder is configured to decode the tiles 102 of the picture 101 from the data stream using a second decoding order which traverses the picture 101 along a raster scan order.
- a tile-reordering flag e.g. sps_enforce_raster_scan_flag
- the decoder may be configured to decode the picture 101 using a Gradual Decoding Refresh - GDR - approach, wherein the picture 101 may be part of sequence 100 of pictures 1011 , 1012, ... , 101 n which comprises at least one GDR coded picture 103 and one or more subsequent pictures, wherein the picture 101 is block-wise decoded and partitioned into multiple packets 501a, 501b, ..., 501 n, wherein two or more packets (and preferably each packet) comprise the same amount of blocks 102r that are refreshed and/or the same amount of blocks that are yet un-refreshed 102u.
- a Gradual Decoding Refresh - GDR - approach wherein the picture 101 may be part of sequence 100 of pictures 1011 , 1012, ... , 101 n which comprises at least one GDR coded picture 103 and one or more subsequent pictures, wherein the picture 101 is block-wise decoded and partitioned into multiple packets 501a, 501b, ...,
- a further embodiment suggests a corresponding encoder configured to encode a picture 101 into a data stream, wherein: a) the encoder may be configured to set a tile-reordering flag (e.g. sps_enforce_raster_scan_flag) in the data stream into a first state, indicating that tiles 102 of the picture 101 are to be coded using a first coding order which traverses the picture 101 tile by tile, and/or b) the encoder is configured to set the tile-reordering flag in the data stream into a second state, indicating that tiles 102 of the picture 101 are to be coded using a second coding order which traverses the picture 101 along a raster scan order.
- a tile-reordering flag e.g. sps_enforce_raster_scan_flag
- Figure 6 shows a non-limiting example of a GDR approach using a scalable bitstream 600 having a first layer (e.g. a base layer - BL) 601 and a second layer (e.g. an enhancement layer - EL) 602, wherein missing refreshed regions (e.g. not yet refreshed or un-refreshed picture regions) 102u in a yet un-refreshed picture sub-area 101 u in the second layer (e.g. Enhancement Layer - EL) 602 can be substituted with upsampled samples of a refresehed picture region 202r of a refreshed picture sub-area 201 r of the first layer (e.g. base layer - BL) 601.
- a first layer e.g. a base layer - BL
- a second layer e.g. an enhancement layer - EL
- the decoding process of the EL 602 would manage the status of the defined GDR regions (refreshed since GDR or not) and indicate for each region whether its initialized per layer or not. If a region is not initialized, the resampling process of a reference layer 601 for that region would be carried out and sample values would be substituted. Thereby, when decoding starts at the access unit containing the EL GDR picture 103, higher layer pictures can instantly be presented to the user, gradually being updated to the EL quality over the course of a RP.
- a multi layered scalable video data stream 600 comprising a first sequence 200 of pictures 2011, 201 2 , .... 201 n in a first layer (e.g. base layer) 601 and a second sequence 100 of pictures 1011, 1012, .... 101 n in a second layer (e.g. an enhancement layer) 602.
- first layer e.g. base layer
- second layer e.g. an enhancement layer
- the second sequence 100 of pictures 1011, 1012, ..., 101 n in the second layer 602 may comprise at least one Gradual Decoder Refresh - GDR - picture 103 as a start picture and one or more subsequent pictures in a refresh period - RP, wherein the multi layered scalable video data stream 600 may comprise a signalization carrying information about a possibility that a yet un-refreshed sub-area 101 u of the GDR picture 103 of the second layer 602 is to be inter-layer predicted from samples 202r of the first layer 601.
- the signalization may further carry information:
- motion vector prediction is disabled or motion vector prediction is realized non-temporally, or
- TMVP Temporal Motion Vector Prediction
- sub-block TMVP i.e. syntax based prediction of motion vectors
- a basic principle of this aspect suggests to provide the multi layered scalable video data stream 600, wherein said inter-layer prediction from samples 202r of the first layer 601 may comprise substituting one or more samples 102u of the yet un-refreshed sub-area 101u of the GDR picture 103 by an upsampled version of refreshed samples 202r of the first layer 601.
- a further embodiment suggests that all samples 102u of the entire yet un-refreshed sub-area 101u of the GDR picture 103 may be substituted by an upsampled version of refreshed samples 202r of the first layer 601 so that pictures 1011, 101 2 , .... 101 n from the second sequence 100 of coded pictures in the second layer 602 may be instantly presentable to a user.
- a further embodiment suggests that yet un-refreshed sub-areas 101u of the one or more subsequent pictures 101 2 , .... 101 n of the second layer 100 may be refreshed by intra-layer prediction (e.g. inside the second layer 602) using the upsampled substitute samples 202rfrom the first layer 601 which are gradually updated to refreshed samples 102r of the second layer 602.
- combined motion vector candidates in the merge list that are influenced by TMVP or sub-block TMVP candidates are forbidden so that when samples 101u are substituted by upsampled BL samples 202r, the following EL pictures do not rely on incorrect motion vectors on decoder side.
- DMVR Decoder-side Motion Vector Refinement
- At least one of the following coding concepts is disabled for coding the yet un-refreshed sub-areas 101u of the one or more subsequent pictures 1012, ..., 101 n contained in the refresh period - RP:
- ATMVP Advanced Temporal Motion Vector Prediction
- TMVP-influenced candidates e.g. motion vector candidates in the merge list which are influenced by TMVP or sub-block TMVP
- At least one of the following coding concepts is disabled for coding the yet un-refreshed sub-area 101u of the GDR picture 103:
- TMVP Temporal Motion Vector Prediction
- ATMVP Advanced Temporal Motion Vector Prediction
- TMVP-influenced candidates e.g. motion vector candidates in the merge list which are influenced by TMVP or sub-block TMVP
- DMVR Decoder Side Motion Vector Refinement
- the coded layer with GDR is coded independent of other layers and it is expressed in the bitstream that sample substitution can be carried out using an indicated other layer with adequate content for sample substitution.
- the second layer 602 may be coded independently from the first layer 601 or from any further layers, and wherein, if the second sequence 100 of pictures 1011, 101 2 , ...,
- the signalization indicates that the yet un-refreshed sub-area 101 u of the GDR picture 103 of the second layer 602 is to be interlayer predicted from samples 202r of the first layer 601 or of any predetermined (indicated) further layer with adequate content.
- the above restriction may take the form of a bitstream requirement depending on the identified refreshed and non-refreshed region.
- a decoder for decoding at least one picture from a multi layered scalable video data stream 600 comprising a first sequence 200 of pictures 2011 , 201 2 , .... 201 n in a first layer (e.g. a base layer) 601 and a second sequence 100 of pictures 1011, 101 2 , .... 101 n in a second layer (e.g. an enhancement layer) 602.
- the second sequence 100 of pictures 1011, 101 2 , ..., 101 n in the second layer 602 may comprise at least one Gradual Decoder Refresh - GDR - picture 103 as a start picture and one or more subsequent pictures 101 2 , ..., 101 n in a refresh period - RP.
- the decoder may be configured to read from the multi layered scalable video data stream 600 a signalization carrying information about a possibility that a yet un-refreshed sub-area 101 u of the GDR picture 103 of the second layer 602 is to be inter-layer predicted from samples 202r of the first layer 601 , and wherein the decoder is further configured to, responsive to the signalization: disable motion vector prediction or to realize motion vector prediction non-temporally in yet un-refreshed sub-areas 101u of the one or more subsequent pictures 1012, .... 101 n contained in the refresh period - RP, or
- an encoder for encoding at least one picture into a multi layered scalable video data stream 600 comprising a first sequence 200 of pictures 2011, 201 2 , 201 n in a first layer (e.g. base layer) 601 and a second sequence 100 of pictures 1011, 101 2 , .... 101 n in a second layer (e.g. an enhancement layer) 602, wherein the second sequence 100 of pictures 1011 , 101 2 , ..., 101 n in the second layer 602 comprises at least one Gradual Decoder Refresh - GDR - picture 103 as a start picture and one or more subsequent pictures 101 2 , ..., 101 n in a refresh period - RP.
- a first layer e.g. base layer
- a second sequence 100 of pictures 1011, 101 2 , ..., 101 n in the second layer 602 comprises at least one Gradual Decoder Refresh - GDR - picture 103 as a start picture and one or more subsequent pictures 101 2 , ..., 101 n in
- the encoder may be configured to write into the multi layered scalable video data stream 600 a signalization carrying information about a possibility that a yet un-refreshed sub-area 101u of the GDR picture 103 of the second layer 602 is to be inter-layer predicted from refreshed samples 202r of the first layer 601.
- the signalization may further carry information
- motion vector prediction is disabled or motion vector prediction is realized non- temporally, or
- a method for decoding at least one picture from a multi layered scalable video data stream 600 comprising a first sequence 200 of pictures 2011, 201 2 , ..., 201 n in a first layer (e.g. base layer) 601 and a second sequence 100 of pictures 1011, 101 2 , .... 101 n in a second layer (e.g. an enhancement layer) 602, wherein the second sequence 100 of pictures 1011 , 101 2 , ..., 101 n in the second layer 602 comprises at least one Gradual Decoder Refresh - GDR - picture 103 as a start picture and one or more subsequent pictures 101 2 , ..., 101 n in a refresh period - RP.
- a first layer e.g. base layer
- a second sequence 100 of pictures 1011, 101 2 , ..., 101 n in the second layer 602 comprises at least one Gradual Decoder Refresh - GDR - picture 103 as a start picture and one or more subsequent pictures 101 2 , ..., 101 n
- the method comprises steps of reading from the multi layered scalable video data stream 600 a signalization carrying information about a possibility that a yet un-refreshed sub-area 101u of the GDR picture 103 of the second layer 602 is to be inter-layer predicted from refreshed samples 202r of the first layer 601.
- the method may further comprise steps of executing, responsive to the signalization, at least one of the following actions: • disable motion vector prediction or realize motion vector prediction non- temporally in yet un-refreshed sub-areas 101 u of the one or more subsequent pictures 101 2 , .... 101 n contained in the refresh period - RP, or
- a method for encoding at least one picture into a multi layered scalable video data stream 600 comprising a first sequence 200 of pictures 2011, 2012, .... 201 n in a first layer (e.g. base layer) 601 and a second sequence 100 of pictures 1011, 101 2 , .... 101 n in a second layer (e.g. enhancement layer) 602, wherein the second sequence 100 of pictures 1011, 101 2 , ..., 101 n in the second layer 602 comprises at least one Gradual Decoder Refresh - GDR - picture 103 as a start picture and one or more subsequent pictures 101 2 , ..., 101 n in a refresh period (RP).
- RP refresh period
- the method comprises steps of writing into the multi layered scalable video data stream 600 a signalization carrying information about a possibility that a yet un-refreshed sub-area 101u of the GDR picture 103 of the second layer 602 is to be inter-layer predicted from samples 202r of the first layer 601.
- the signalization may further contain information that in yet un-refreshed sub-areas 101u of the one or more subsequent pictures 1012, .... 101 n contained in the refresh period - RP, motion vector prediction is disabled or motion vector prediction is realized non-temporally, or that in a yet un-refreshed sub-area 101u of the GDR picture 103 motion vector prediction is disabled or motion vector prediction is realized non-temporally.
- Scalable video has many benefits for a lot of streaming systems. For instance, by using the correlation of different versions of the content and/or different resolutions, the overall compression efficiency compared to several independent bitstreams (simulcast) with different versions and/or different resolutions is increased drastically. This can lead to big savings on storage at servers and CDNs, reducing the deployment costs of streaming services.
- the coding efficiency of transmitted scalable bitstream to the end device is lower than the corresponding one if it would be a single layer bitstream. That is, the inter-layer prediction comes along with an efficiency loss due to some signaling overhead.
- a joint optimization should be performed that reduces the required storage capacity of the several versions, i.e. the overall bitrate of all version, conditioned to not increasing drastically the size of each version compared to the single layer version. This could be achieved by evaluating the described optimization problem and marking per AU whether the AU uses inter-layer prediction or not. In the previous standards this has been done by marking those pictures as discardable with a discardable flag in lower layers. Alternatively, a slice could also mark that inter-layer is not used.
- a multi layered scalable bitstream 700 may comprise a first layer (e.g. a base layer) 701 and a second layer (e.g. an enhancement layer) 702.
- the bitstream 700 may comprise more than the exemplarily depicted two layers 701 , 702.
- Each of the first and second layers 701 , 702 may comprise two or more temporal sublayers.
- the first layer 701 may comprise a first temporal sublayer 701a and a second temporal sublayer 701b.
- the second layer 702 may comprise a first temporal sublayer 702a and a second temporal sublayer 702b.
- Some pictures contained in a layer 701, 702 may be intra coded, as exemplarily depicted with arrows 710.
- a picture 201 2 of a temporal sublayer 701b of a layer 701 may be intra-coded by referencing one or more pictures 201 1 , 201 3 of a different temporal sublayer 701 b of the same layer 701.
- a picture 101 1 of a temporal sublayer 702a of a layer 702 may be inter-coded by referencing a picture 201 1 of a temporal sublayer 701a of a different layer 701.
- a picture 101 1 of a first temporal sublayer 702a of the second layer (e.g. enhancement layer) 702 may be inter-coded by referencing a picture 201 1 of a first temporal sublayer 701 a of the first layer (e.g. base layer) 701.
- a picture contained in a certain temporal sublayer of a layer may only reference pictures contained in a temporal sublayer of a different layer but with the same temporal sublayer hierarchy.
- a picture 201 1 contained in the first temporal sublayer 702a of the second layer 702 may only reference a picture 101 1 that is also contained in the first temporal sublayer 701a but of the first layer 701. That is, the first temporal sublayers 701a, 702a have the same temporal sublayer hierarchy.
- only the pictures here: having an odd index
- 101 1 , 101 3 , 101 5 , .... 101 n contained in the first sublayer 702a of the second layer 702 may be inter-coded by referencing pictures (here: with odd index) 201 1 , 201 3 , 201 5 , .... 201 n of the first sublayer 701 a of the first layer 701.
- any temporal sublayers that may not be used for inter-coding may be dropped.
- any temporal sublayers having a higher temporal sublayer hierarchy than a predetermined temporal sublayer which is used for inter-coding may be dropped.
- the pictures 201 2 , 201 , ..., 201 n -i contained in the second temporal sublayer 701b of the first layer 701 are not referenced and, therefore, the second temporal sublayer 701b of the first layer 701 is not of interest and may be dropped.
- an efficient way of implementing the described feature would be, for example, to encode every second picture as being dependent or e.g. every fourth picture as using inter-layer dependency (note: a picture may be referred to as being dependent if coded with inter-layer dependency, and as being independent if coded without inter-layer dependency).
- the layer dependency further indicates whether temporal sublayers within a dependent layer depend on the lower layers or not.
- VPS VPS
- SPS SPS
- vps_subjayer_independent_flag[ i ][ j ] being equal to 1 specifies that the (temporal) sublayer with index j contained in the layer with index i does not use inter-layer prediction.
- vps_sub_layer_independent_flag[ i ][ j ] being equal to 0 specifies that the (temporal) sublayer with index j contained in the layer with index i may use inter-layer prediction.
- a multi layered scalable video data stream 700 comprising a first sequence 200 of pictures 2011, 2012, ..., 201 n in a first layer (e.g. base layer) 701 and a second sequence 100 of pictures 1011, 1012, .... 101 n in a second layer (e.g. an enhancement layer) 702, each of the first and second layers 701 , 702 comprising a plurality of temporal sublayers 701a, 701 b; 702a, 702b.
- the scalable video data stream 700 may comprise a signalization (e.g. vps_subjayer_independent_flag[ i ] [j ]) indicating which temporal sublayers 702a, 702b of the second layer (e.g. enhancement layer) 702 may be coded by inter-layer prediction.
- a corresponding decoder for decoding at least one picture from a multi layered scalable video data stream 700 comprising a first sequence 200 of pictures 2011, 2012, .... 201 n in a first layer (e.g. a base layer) 701 and a second sequence 100 of pictures 1011 , 1012, ..., 101 n in a second layer (e.g. an enhancement layer) 702, each of the first and second layers 701 , 702 comprising a plurality of temporal sublayers 701a, 701 b; 702a, 702b.
- the decoder may be configured to decode one or more of the temporal sublayers 701a, 701b; 702a, 702b by using inter-layer prediction based on a signalization (e.g. vps_subjayer_independent_flag [ i ] [ j ]) derived from the scalable video data stream 700, said signalization indicating which temporal sublayers 702a, 702b of the second layer 702 are to be coded by inter-layer prediction.
- a signalization e.g. vps_subjayer_independent_flag [ i ] [ j ]
- a corresponding encoder for encoding at least one picture into a multi layered scalable video data stream 700 comprising a first sequence 200 of pictures 2011, 2012, ..., 201 n in a first layer (e.g. abase layer) 701 and a second sequence 100 of pictures 1011, 1012, .... 101 n in a second layer (e.g. an enhancement layer) 702, each of the first and second layers 701 , 702 comprising a plurality of temporal sublayers 701a, 701b; 702a, 702b.
- the encoder may be configured to encode one or more of the temporal sublayers 701a, 701b; 702a, 702b by using inter-layer prediction and to write a signalization (e.g. vps_sub_layer_independent_flag [ i ] [ j ]) into the scalable video data stream 700, said signalization indicating which temporal sublayers 702a, 702b of the second layer 702 are coded by inter-layer prediction.
- a signalization e.g. vps_sub_layer_independent_flag [ i ] [ j ]
- a corresponding method for decoding at least one picture from a multi layered scalable video data stream 700 comprising a first sequence 200 of pictures 2011, 201 . . 201 n in a first layer (e.g. base layer) 701 and a second sequence 100 of pictures 1011, 1012, .... 101 n in a second layer (e.g. enhancement layer) 702, each of the first and second layers 701 , 702 comprising a plurality of temporal sublayers 701a, 701b; 702a, 702b.
- the method comprises steps of decoding one or more of the temporal sublayers 702a, 702b by using inter-layer prediction based on a signalization derived from the scalable video data stream 700, said signalization (e.g. vps_sub_layerjndependent_flag[i][j]) indicating which temporal sublayers 702a, 702b of the second layer 702 are to be coded by inter-layer prediction.
- signalization e.g. vps_sub_layerjndependent_flag[i][j]
- a corresponding method for encoding at least one picture into a multi layered scalable video data stream 700 comprising a first sequence 200 of pictures 2011, 2012, .... 201 n in a first layer (e.g. base layer) 701 and a second sequence 100 of pictures 1011, 1012, .... 101 n in a second layer (e.g. enhancement layer) 702, each of the first and second layers 701, 702 comprising a plurality of temporal sublayers 701a, 701b; 702a, 702b.
- the method comprises steps of encoding one or more of the temporal sublayers 702a, 702b by using inter-layer prediction based on a signalization derived from the scalable video data stream 700, said signalization (e.g. vps_sub_layerjndependent_flag[i][j]) indicating which temporal sublayers 702a, 702b of the second layer 702 are coded by inter-layer prediction.
- signalization e.g. vps_sub_layerjndependent_flag[i][j]
- the temporal sublayers may comprise a temporal sublayer hierarchy, e.g. first temporal sublayer, second temporal sublayer, third temporal sublayer, and so on.
- the temporal sublayer hierarchy of a layer may be indicated by means of a temporal identifier, e.g. a syntax element, contained in the above mentioned signalization in the video data stream 700.
- temporal sublayers above a predetermined temporal sublayer hierarchy which are not of interest may be dropped.
- This threshold may be indicated by a predetermined one temporal sublayer. For example, it may be signalized in the video data stream 700 that inter-layer prediction may only be used up to the second temporal sublayer of a layer.
- the second temporal sublayer forms the threshold for inter-layer prediction. Accordingly, the first and second temporal sublayers may be inter-layer predicted while any further temporal sublayers having a temporal sublayer hierarchy above the predetermined second temporal sublayer (threshold) may be dropped.
- the aforementioned temporal identifier may indicate the threshold, i.e. the temporal sublayer hierarchy up to which inter-layer prediction may be used. Stated the other way around, the temporal identifier may indicate the threshold, i.e. the temporal sublayer hierarchy from which inter-layer prediction may not be used
- the signalization in the video data stream 700 may comprise a predetermined temporal identifier (threshold) from which the temporal sublayers 702a, 702b of the second layer 702 may be coded without inter-layer prediction.
- those temporal sublayers 702a, 702b of the second layer 702 which comprise a temporal identifier having a value above the predetermined temporal identifier (threshold) are coded without inter-layer prediction.
- the predetermined temporal identifier references the first temporal sublayer 702a since only the first temporal sublayer 702a is coded by inter-layer prediction. Accordingly, all temporal sublayers above the first temporal sublayer 702a, here the second temporal sublayer 702b as well as any higher temporal sublayers (if present), are coded without inter-layer prediction.
- those temporal sublayers 702a, 702b of the second layer 702 which comprise a temporal identifier having a value up to or below the predetermined temporal identifier (threshold) are coded with inter-layer prediction.
- the predetermined temporal identifier references the first temporal sublayer 702a since only the first temporal sublayer 702a is coded by inter-layer prediction. Accordingly, if the signalization (e.g.
- vps_sub_layer_independent_flag[i][j]) may indicate that a temporal sublayer 702b of the second layer 702 does not use a temporal sublayer 701b of a lower layer 701 for inter-layer prediction, this temporal sublayer 701b of the lower layer 701 may be marked as discardable or be discarded or dropped from the multi layered scalable video data stream 700.
- an enable flag is included into the VPS to indicate whether the sub_layer_independent_flag is included into the VPS or whether per-default all sublayers are dependent on a lower layer.
- multi layered scalable video data stream 700 may further comprise a syntax element (e.g. enable_flag in VPS) for indicating whether the signalization (e.g. vps_subjayer_independent_flag[i][j]) is included in the multi layered scalable video data stream 700. If not included, then all temporal sublayers 702a, 702b of the second layer 702 may, per default, depend on a lower layer 701.
- a syntax element e.g. enable_flag in VPS
- an efficient way of implementing the described feature would be, for example, to encode every second picture as being dependent or e.g. every fourth picture as using inter-layer dependency.
- the encoder may be configured to encode a first predetermined row of consecutive pictures (e.g. every second picture) 1012, 1014, .... 10Vi of the second sequence 100 of pictures as being dependent, or to encode a second predetermined row of consecutive pictures (e.g. every fourth picture) of the second sequence 100 of pictures as using inter-layer dependency.
- a first predetermined row of consecutive pictures e.g. every second picture
- a second predetermined row of consecutive pictures e.g. every fourth picture
- aspects have been described in the context of an apparatus, it is clear that these aspects also represent a description of the corresponding method, where a block or device corresponds to a method step or a feature of a method step. Analogously, aspects described in the context of a method step also represent a description of a corresponding block or item or feature of a corresponding apparatus.
- Some or all of the method steps may be executed by (or using) a hardware apparatus, like for example, a microprocessor, a programmable computer or an electronic circuit. In some embodiments, one or more of the most important method steps may be executed by such an apparatus.
- embodiments of the invention can be implemented in hardware or in software or at least partially in hardware or at least partially in software.
- the implementation can be performed using a digital storage medium, for example a floppy disk, a DVD, a Blu-Ray, a CD, a ROM, a PROM, an EPROM, an EEPROM or a FLASH memory, having electronically readable control signals stored thereon, which cooperate (or are capable of cooperating) with a programmable computer system such that the respective method is performed. Therefore, the digital storage medium may be computer readable.
- Some embodiments according to the invention comprise a data carrier having electronically readable control signals, which are capable of cooperating with a programmable computer system, such that one of the methods described herein is performed.
- embodiments of the present invention can be implemented as a computer program product with a program code, the program code being operative for performing one of the methods when the computer program product runs on a computer.
- the program code may for example be stored on a machine readable carrier.
- inventions comprise the computer program for performing one of the methods described herein, stored on a machine readable carrier.
- an embodiment of the inventive method is, therefore, a computer program having a program code for performing one of the methods described herein, when the computer program runs on a computer.
- a further embodiment of the inventive methods is, therefore, a data carrier (or a digital storage medium, or a computer-readable medium) comprising, recorded thereon, the computer program for performing one of the methods described herein.
- the data carrier, the digital storage medium or the recorded medium are typically tangible and/or non- transitory.
- a further embodiment of the inventive method is, therefore, a data stream or a sequence of signals representing the computer program for performing one of the methods described herein.
- the data stream or the sequence of signals may for example be configured to be transferred via a data communication connection, for example via the Internet.
- a further embodiment comprises a processing means, for example a computer, or a programmable logic device, configured to or adapted to perform one of the methods described herein.
- a processing means for example a computer, or a programmable logic device, configured to or adapted to perform one of the methods described herein.
- a further embodiment comprises a computer having installed thereon the computer program for performing one of the methods described herein.
- a further embodiment according to the invention comprises an apparatus or a system configured to transfer (for example, electronically or optically) a computer program for performing one of the methods described herein to a receiver.
- the receiver may, for example, be a computer, a mobile device, a memory device or the like.
- the apparatus or system may, for example, comprise a file server for transferring the computer program to the receiver.
- a programmable logic device for example a field programmable gate array
- a field programmable gate array may cooperate with a microprocessor in order to perform one of the methods described herein.
- the methods are preferably performed by any hardware apparatus.
- the apparatus described herein may be implemented using a hardware apparatus, or using a computer, or using a combination of a hardware apparatus and a computer.
- the methods described herein may be performed using a hardware apparatus, or using a computer, or using a combination of a hardware apparatus and a computer.
Landscapes
- Engineering & Computer Science (AREA)
- Multimedia (AREA)
- Signal Processing (AREA)
- Compression Or Coding Systems Of Tv Signals (AREA)
- Compression, Expansion, Code Conversion, And Decoders (AREA)
- Compression Of Band Width Or Redundancy In Fax (AREA)
Abstract
Priority Applications (8)
Application Number | Priority Date | Filing Date | Title |
---|---|---|---|
KR1020227013922A KR20220065874A (ko) | 2019-09-24 | 2020-09-23 | 점진적 디코더 리프레시 코딩 및 스케일가능 코딩을 위한 인코더, 디코더 및 데이터 스트림 |
CA3155739A CA3155739A1 (fr) | 2019-09-24 | 2020-09-23 | Codeur, decodeur et flux de donnees pour codage de rafraichissement de decodeur progressif et codage evolutif |
JP2022518777A JP2022549452A (ja) | 2019-09-24 | 2020-09-23 | 漸進的デコーダリフレッシュ符号化及びスケーラブル符号化のためのエンコーダ、デコーダ及びデータストリーム |
US17/763,453 US12069236B2 (en) | 2019-09-24 | 2020-09-23 | Encoder, decoder and data stream for gradual decoder refresh coding and scalable coding |
CN202080081523.7A CN114731395A (zh) | 2019-09-24 | 2020-09-23 | 用于渐进解码器刷新译码和可扩展译码的编码器、解码器以及数据流 |
BR112022005650A BR112022005650A2 (pt) | 2019-09-24 | 2020-09-23 | Codificador, decodificador e fluxo de dados para codificação gradual de atualização de decodificador e codificação escalável |
AU2020355591A AU2020355591B2 (en) | 2019-09-24 | 2020-09-23 | Encoder, decoder and data stream for gradual decoder refresh coding and scalable coding |
EP20772354.5A EP4035359A1 (fr) | 2019-09-24 | 2020-09-23 | Codeur, décodeur et flux de données pour codage de rafraîchissement de décodeur progressif et codage évolutif |
Applications Claiming Priority (2)
Application Number | Priority Date | Filing Date | Title |
---|---|---|---|
EP19199348.4 | 2019-09-24 | ||
EP19199348 | 2019-09-24 |
Related Child Applications (2)
Application Number | Title | Priority Date | Filing Date |
---|---|---|---|
US17/763,453 A-371-Of-International US12069236B2 (en) | 2019-09-24 | 2020-09-23 | Encoder, decoder and data stream for gradual decoder refresh coding and scalable coding |
US18/766,977 Continuation US20240364861A1 (en) | 2019-09-24 | 2024-07-09 | Encoder, decoder and data stream for gradual decoder refresh coding and scalable coding |
Publications (1)
Publication Number | Publication Date |
---|---|
WO2021058596A1 true WO2021058596A1 (fr) | 2021-04-01 |
Family
ID=68069490
Family Applications (1)
Application Number | Title | Priority Date | Filing Date |
---|---|---|---|
PCT/EP2020/076616 WO2021058596A1 (fr) | 2019-09-24 | 2020-09-23 | Codeur, décodeur et flux de données pour codage de rafraîchissement de décodeur progressif et codage évolutif |
Country Status (9)
Country | Link |
---|---|
US (1) | US12069236B2 (fr) |
EP (1) | EP4035359A1 (fr) |
JP (1) | JP2022549452A (fr) |
KR (1) | KR20220065874A (fr) |
CN (1) | CN114731395A (fr) |
AU (1) | AU2020355591B2 (fr) |
BR (1) | BR112022005650A2 (fr) |
CA (1) | CA3155739A1 (fr) |
WO (1) | WO2021058596A1 (fr) |
Cited By (2)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
WO2023055171A1 (fr) * | 2021-10-01 | 2023-04-06 | 엘지전자 주식회사 | Procédé et dispositif de codage d'image basé sur mdmvr |
TWI847506B (zh) * | 2022-12-06 | 2024-07-01 | 瑞昱半導體股份有限公司 | 可實現分散式逐漸解碼刷新的編碼演算法 |
Families Citing this family (2)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
WO2021188451A1 (fr) | 2020-03-16 | 2021-09-23 | Bytedance Inc. | Unité d'accès à un point d'accès aléatoire dans un codage vidéo échelonnable |
US20230146398A1 (en) * | 2020-03-31 | 2023-05-11 | Nokia Technologies Oy | Merge candidate list for gradual decoding refresh |
Citations (2)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
EP1672930A2 (fr) * | 2002-04-29 | 2006-06-21 | Nokia Corporation | Indication des régions dans une image |
US20160014432A1 (en) * | 2013-02-25 | 2016-01-14 | Lg Electronics Inc. | Method for encoding video of multi-layer structure supporting scalability and method for decoding same and apparatus therefor |
Family Cites Families (4)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
US9167246B2 (en) | 2008-03-06 | 2015-10-20 | Arris Technology, Inc. | Method and apparatus for decoding an enhanced video stream |
US9998735B2 (en) * | 2013-04-01 | 2018-06-12 | Qualcomm Incorporated | Inter-layer reference picture restriction for high level syntax-only scalable video coding |
KR20150009424A (ko) * | 2013-07-15 | 2015-01-26 | 한국전자통신연구원 | 시간적 서브 레이어 정보에 기반한 계층간 예측을 이용한 영상 부, 복호화 방법 및 그 장치 |
JP2015195543A (ja) * | 2014-03-26 | 2015-11-05 | シャープ株式会社 | 画像復号装置、画像符号化装置 |
-
2020
- 2020-09-23 WO PCT/EP2020/076616 patent/WO2021058596A1/fr unknown
- 2020-09-23 JP JP2022518777A patent/JP2022549452A/ja active Pending
- 2020-09-23 CA CA3155739A patent/CA3155739A1/fr active Pending
- 2020-09-23 CN CN202080081523.7A patent/CN114731395A/zh active Pending
- 2020-09-23 BR BR112022005650A patent/BR112022005650A2/pt unknown
- 2020-09-23 AU AU2020355591A patent/AU2020355591B2/en active Active
- 2020-09-23 KR KR1020227013922A patent/KR20220065874A/ko active Search and Examination
- 2020-09-23 EP EP20772354.5A patent/EP4035359A1/fr active Pending
- 2020-09-23 US US17/763,453 patent/US12069236B2/en active Active
Patent Citations (2)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
EP1672930A2 (fr) * | 2002-04-29 | 2006-06-21 | Nokia Corporation | Indication des régions dans une image |
US20160014432A1 (en) * | 2013-02-25 | 2016-01-14 | Lg Electronics Inc. | Method for encoding video of multi-layer structure supporting scalability and method for decoding same and apparatus therefor |
Non-Patent Citations (2)
Title |
---|
HENDRY ET AL: "AHG14/AHG17: GDR - gradual decoding refresh", no. JVET-N0115; JVET-N0115, 27 March 2019 (2019-03-27), pages 1 - 5, XP030202639, Retrieved from the Internet <URL:http://phenix.int-evry.fr/jvet/doc_end_user/documents/14_Geneva/wg11/JVET-N0115-v1.zip> * |
KAZUI (FUJITSU) K ET AL: "Description of Core Experiment 2 (CE2): Gradual decoding refresh", no. m49905, 12 July 2019 (2019-07-12), XP030208551, Retrieved from the Internet <URL:http://phenix.int-evry.fr/mpeg/doc_end_user/documents/127_Gothenburg/wg11/m49905-JVET-O2022-v1-JVET-O2022-v1.zip JVET-O2022-v1.docx> [retrieved on 20190712] * |
Cited By (2)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
WO2023055171A1 (fr) * | 2021-10-01 | 2023-04-06 | 엘지전자 주식회사 | Procédé et dispositif de codage d'image basé sur mdmvr |
TWI847506B (zh) * | 2022-12-06 | 2024-07-01 | 瑞昱半導體股份有限公司 | 可實現分散式逐漸解碼刷新的編碼演算法 |
Also Published As
Publication number | Publication date |
---|---|
AU2020355591A1 (en) | 2022-04-14 |
US12069236B2 (en) | 2024-08-20 |
CN114731395A (zh) | 2022-07-08 |
JP2022549452A (ja) | 2022-11-25 |
BR112022005650A2 (pt) | 2022-07-19 |
EP4035359A1 (fr) | 2022-08-03 |
CA3155739A1 (fr) | 2021-04-01 |
KR20220065874A (ko) | 2022-05-20 |
US20220408080A1 (en) | 2022-12-22 |
AU2020355591B2 (en) | 2024-03-21 |
Similar Documents
Publication | Publication Date | Title |
---|---|---|
AU2020355591B2 (en) | Encoder, decoder and data stream for gradual decoder refresh coding and scalable coding | |
JP7126332B2 (ja) | 効率的なスケーラブル符号化概念 | |
US8184153B2 (en) | Method and apparatus for defining and reconstructing ROIs in scalable video coding | |
US20120201306A1 (en) | Method and apparatus for defining and reconstructing rois in scalable video coding | |
BRPI0616407B1 (pt) | Codificação/decodificação de vídeo escalonável h.264 com região de interesse | |
CA2951530C (fr) | Reinitialisation d'ensemble de parametres d'image pour codecs multicouche | |
TW202147853A (zh) | 時間移動向量預測、層間參考及時間子層指示的視訊寫碼技術 | |
TW202145791A (zh) | 視訊寫碼中輸出層集合概念技術 | |
US20240364861A1 (en) | Encoder, decoder and data stream for gradual decoder refresh coding and scalable coding | |
US20170324966A1 (en) | Arrangements and methods thereof for processing video |
Legal Events
Date | Code | Title | Description |
---|---|---|---|
121 | Ep: the epo has been informed by wipo that ep was designated in this application |
Ref document number: 20772354 Country of ref document: EP Kind code of ref document: A1 |
|
ENP | Entry into the national phase |
Ref document number: 2022518777 Country of ref document: JP Kind code of ref document: A |
|
ENP | Entry into the national phase |
Ref document number: 3155739 Country of ref document: CA |
|
NENP | Non-entry into the national phase |
Ref country code: DE |
|
REG | Reference to national code |
Ref country code: BR Ref legal event code: B01A Ref document number: 112022005650 Country of ref document: BR |
|
ENP | Entry into the national phase |
Ref document number: 2020355591 Country of ref document: AU Date of ref document: 20200923 Kind code of ref document: A |
|
ENP | Entry into the national phase |
Ref document number: 20227013922 Country of ref document: KR Kind code of ref document: A |
|
ENP | Entry into the national phase |
Ref document number: 2020772354 Country of ref document: EP Effective date: 20220425 |
|
ENP | Entry into the national phase |
Ref document number: 112022005650 Country of ref document: BR Kind code of ref document: A2 Effective date: 20220324 |