WO2021107621A1 - 영상/비디오 코딩을 위한 슬라이스 및 타일 구성 - Google Patents
영상/비디오 코딩을 위한 슬라이스 및 타일 구성 Download PDFInfo
- Publication number
- WO2021107621A1 WO2021107621A1 PCT/KR2020/016883 KR2020016883W WO2021107621A1 WO 2021107621 A1 WO2021107621 A1 WO 2021107621A1 KR 2020016883 W KR2020016883 W KR 2020016883W WO 2021107621 A1 WO2021107621 A1 WO 2021107621A1
- Authority
- WO
- WIPO (PCT)
- Prior art keywords
- height
- slice
- tile
- information
- slices
- Prior art date
Links
Images
Classifications
-
- H—ELECTRICITY
- H04—ELECTRIC COMMUNICATION TECHNIQUE
- H04N—PICTORIAL COMMUNICATION, e.g. TELEVISION
- H04N19/00—Methods or arrangements for coding, decoding, compressing or decompressing digital video signals
- H04N19/10—Methods or arrangements for coding, decoding, compressing or decompressing digital video signals using adaptive coding
- H04N19/102—Methods or arrangements for coding, decoding, compressing or decompressing digital video signals using adaptive coding characterised by the element, parameter or selection affected or controlled by the adaptive coding
- H04N19/119—Adaptive subdivision aspects, e.g. subdivision of a picture into rectangular or non-rectangular coding blocks
-
- H—ELECTRICITY
- H04—ELECTRIC COMMUNICATION TECHNIQUE
- H04N—PICTORIAL COMMUNICATION, e.g. TELEVISION
- H04N19/00—Methods or arrangements for coding, decoding, compressing or decompressing digital video signals
- H04N19/10—Methods or arrangements for coding, decoding, compressing or decompressing digital video signals using adaptive coding
- H04N19/169—Methods or arrangements for coding, decoding, compressing or decompressing digital video signals using adaptive coding characterised by the coding unit, i.e. the structural portion or semantic portion of the video signal being the object or the subject of the adaptive coding
- H04N19/17—Methods or arrangements for coding, decoding, compressing or decompressing digital video signals using adaptive coding characterised by the coding unit, i.e. the structural portion or semantic portion of the video signal being the object or the subject of the adaptive coding the unit being an image region, e.g. an object
- H04N19/174—Methods or arrangements for coding, decoding, compressing or decompressing digital video signals using adaptive coding characterised by the coding unit, i.e. the structural portion or semantic portion of the video signal being the object or the subject of the adaptive coding the unit being an image region, e.g. an object the region being a slice, e.g. a line of blocks or a group of blocks
-
- H—ELECTRICITY
- H04—ELECTRIC COMMUNICATION TECHNIQUE
- H04N—PICTORIAL COMMUNICATION, e.g. TELEVISION
- H04N19/00—Methods or arrangements for coding, decoding, compressing or decompressing digital video signals
- H04N19/10—Methods or arrangements for coding, decoding, compressing or decompressing digital video signals using adaptive coding
- H04N19/102—Methods or arrangements for coding, decoding, compressing or decompressing digital video signals using adaptive coding characterised by the element, parameter or selection affected or controlled by the adaptive coding
- H04N19/103—Selection of coding mode or of prediction mode
-
- H—ELECTRICITY
- H04—ELECTRIC COMMUNICATION TECHNIQUE
- H04N—PICTORIAL COMMUNICATION, e.g. TELEVISION
- H04N19/00—Methods or arrangements for coding, decoding, compressing or decompressing digital video signals
- H04N19/10—Methods or arrangements for coding, decoding, compressing or decompressing digital video signals using adaptive coding
- H04N19/102—Methods or arrangements for coding, decoding, compressing or decompressing digital video signals using adaptive coding characterised by the element, parameter or selection affected or controlled by the adaptive coding
- H04N19/132—Sampling, masking or truncation of coding units, e.g. adaptive resampling, frame skipping, frame interpolation or high-frequency transform coefficient masking
-
- H—ELECTRICITY
- H04—ELECTRIC COMMUNICATION TECHNIQUE
- H04N—PICTORIAL COMMUNICATION, e.g. TELEVISION
- H04N19/00—Methods or arrangements for coding, decoding, compressing or decompressing digital video signals
- H04N19/10—Methods or arrangements for coding, decoding, compressing or decompressing digital video signals using adaptive coding
- H04N19/134—Methods or arrangements for coding, decoding, compressing or decompressing digital video signals using adaptive coding characterised by the element, parameter or criterion affecting or controlling the adaptive coding
- H04N19/157—Assigned coding mode, i.e. the coding mode being predefined or preselected to be further used for selection of another element or parameter
- H04N19/159—Prediction type, e.g. intra-frame, inter-frame or bidirectional frame prediction
-
- H—ELECTRICITY
- H04—ELECTRIC COMMUNICATION TECHNIQUE
- H04N—PICTORIAL COMMUNICATION, e.g. TELEVISION
- H04N19/00—Methods or arrangements for coding, decoding, compressing or decompressing digital video signals
- H04N19/10—Methods or arrangements for coding, decoding, compressing or decompressing digital video signals using adaptive coding
- H04N19/169—Methods or arrangements for coding, decoding, compressing or decompressing digital video signals using adaptive coding characterised by the coding unit, i.e. the structural portion or semantic portion of the video signal being the object or the subject of the adaptive coding
- H04N19/17—Methods or arrangements for coding, decoding, compressing or decompressing digital video signals using adaptive coding characterised by the coding unit, i.e. the structural portion or semantic portion of the video signal being the object or the subject of the adaptive coding the unit being an image region, e.g. an object
- H04N19/176—Methods or arrangements for coding, decoding, compressing or decompressing digital video signals using adaptive coding characterised by the coding unit, i.e. the structural portion or semantic portion of the video signal being the object or the subject of the adaptive coding the unit being an image region, e.g. an object the region being a block, e.g. a macroblock
-
- H—ELECTRICITY
- H04—ELECTRIC COMMUNICATION TECHNIQUE
- H04N—PICTORIAL COMMUNICATION, e.g. TELEVISION
- H04N19/00—Methods or arrangements for coding, decoding, compressing or decompressing digital video signals
- H04N19/42—Methods or arrangements for coding, decoding, compressing or decompressing digital video signals characterised by implementation details or hardware specially adapted for video compression or decompression, e.g. dedicated software implementation
- H04N19/423—Methods or arrangements for coding, decoding, compressing or decompressing digital video signals characterised by implementation details or hardware specially adapted for video compression or decompression, e.g. dedicated software implementation characterised by memory arrangements
-
- H—ELECTRICITY
- H04—ELECTRIC COMMUNICATION TECHNIQUE
- H04N—PICTORIAL COMMUNICATION, e.g. TELEVISION
- H04N19/00—Methods or arrangements for coding, decoding, compressing or decompressing digital video signals
- H04N19/70—Methods or arrangements for coding, decoding, compressing or decompressing digital video signals characterised by syntax aspects related to video coding, e.g. related to compression standards
-
- H—ELECTRICITY
- H04—ELECTRIC COMMUNICATION TECHNIQUE
- H04N—PICTORIAL COMMUNICATION, e.g. TELEVISION
- H04N19/00—Methods or arrangements for coding, decoding, compressing or decompressing digital video signals
- H04N19/90—Methods or arrangements for coding, decoding, compressing or decompressing digital video signals using coding techniques not provided for in groups H04N19/10-H04N19/85, e.g. fractals
- H04N19/91—Entropy coding, e.g. variable length coding [VLC] or arithmetic coding
Definitions
- overall image/video compression efficiency may be improved.
- a brick scan may indicate a specific sequential ordering of CTUs partitioning a picture
- the CTUs may be arranged in a CTU raster scan within a brick
- the bricks in a tile may be sequentially arranged in a raster scan of the bricks of the tile.
- tiles in a picture may be sequentially aligned with a raster scan of the tiles of the picture.
- a tile is a specific tile row and a rectangular area of CTUs within a specific tile row.
- the tile column is a rectangular region of CTUs, the rectangular region has a height equal to the height of the picture, and the width may be specified by syntax elements in a picture parameter set.
- the tile row is a rectangular region of CTUs, the rectangular region has a width specified by syntax elements in a picture parameter set, and the height may be equal to the height of the picture.
- a tile scan may indicate a specific sequential ordering of CTUs partitioning a picture, wherein the CTUs may be sequentially aligned with a CTU raster scan within a tile, and tiles within a picture may be sequentially aligned with a raster scan of the tiles of the picture.
- a slice may include an integer number of bricks of a picture, and the integer number of bricks may be included in one NAL unit.
- a slice may consist of a number of complete tiles, or it may be a continuous sequence of complete bricks of one tile.
- tile group and slice can be used interchangeably.
- a tile group/tile group header may be referred to as a slice/slice header.
- a unit may represent a basic unit of image processing.
- the unit may include at least one of a specific region of a picture and information related to the region.
- One unit may include one luma block and two chroma (ex. cb, cr) blocks.
- a unit may be used interchangeably with terms such as a block or an area in some cases.
- an MxN block may include samples (or sample arrays) or a set (or arrays) of transform coefficients including M columns and N rows.
- the encoding device may encode the input video/image.
- the encoding apparatus may perform a series of procedures such as prediction, transformation, and quantization for compression and coding efficiency.
- the encoded data (encoded video/image information) may be output in the form of a bitstream.
- the decoding apparatus may decode the video/image by performing a series of procedures such as inverse quantization, inverse transformation, and prediction corresponding to the operation of the encoding apparatus.
- the encoding apparatus 200 includes an image partitioner 210, a predictor 220, a residual processor 230, an entropy encoder 240, It may be configured to include an adder 250 , a filter 260 , and a memory 270 .
- the prediction unit 220 may include an inter prediction unit 221 and an intra prediction unit 222 .
- the residual processing unit 230 may include a transformer 232 , a quantizer 233 , an inverse quantizer 234 , and an inverse transformer 235 .
- the residual processing unit 230 may further include a subtractor 231 .
- the adder 250 may be called a reconstructor or a reconstructed block generator.
- a quad tree structure may be applied first and a binary tree structure and/or a ternary structure may be applied later.
- the binary tree structure may be applied first.
- a coding procedure according to this document may be performed based on the final coding unit that is no longer divided.
- the maximum coding unit may be directly used as the final coding unit based on coding efficiency according to image characteristics, or if necessary, the coding unit may be recursively divided into coding units having a lower depth than the optimal coding unit.
- a coding unit of the size of may be used as the final coding unit.
- the coding procedure may include procedures such as prediction, transformation, and restoration, which will be described later.
- the network may include a broadcasting network and/or a communication network
- the digital storage medium may include various storage media such as USB, SD, CD, DVD, Blu-ray, HDD, and SSD.
- a transmitting unit (not shown) and/or a storing unit (not shown) for storing may be configured as internal/external elements of the encoding apparatus 200, or the transmitting unit It may be included in the entropy encoding unit 240 .
- the filtering unit 260 may improve subjective/objective image quality by applying filtering to the reconstructed signal.
- the filtering unit 260 may generate a modified reconstructed picture by applying various filtering methods to the reconstructed picture, and store the modified reconstructed picture in the memory 270 , specifically, the DPB of the memory 270 .
- the various filtering methods may include, for example, deblocking filtering, a sample adaptive offset, an adaptive loop filter, a bilateral filter, and the like.
- the filtering unit 260 may generate various types of filtering-related information and transmit it to the entropy encoding unit 240 , as will be described later in the description of each filtering method.
- the filtering-related information may be encoded by the entropy encoding unit 240 and output in the form of a bitstream.
- At least one of quantization/inverse quantization and/or transform/inverse transform may be omitted.
- the quantized transform coefficient may be referred to as a transform coefficient.
- the transform coefficients may be called coefficients or residual coefficients, or may still be called transform coefficients for uniformity of expression.
- one picture may include a plurality of slices, and one slice may include a slice header and slice data.
- one picture header may be further added to a plurality of slices (a slice header and a slice data set) in one picture.
- the picture header may include information/parameters commonly applicable to the picture.
- the slice header (slice header syntax) may include information/parameters commonly applicable to the slice.
- An adaptation parameter set (APS) or a picture parameter set (PPS) may include information/parameters commonly applicable to one or more pictures.
- a sequence parameter set (SPS) may include information/parameters commonly applicable to one or more sequences.
- a video parameter set (VPS) may include information/parameters commonly applicable to multiple layers.
- a decoding parameter set may include information/parameters commonly applicable to the entire video.
- the DPS may include information/parameters related to the concatenation of a coded video sequence (CVS).
- CVS coded video sequence
- pictures constituting the video/video may be encoded/decoded according to a series of decoding orders.
- a picture order corresponding to an output order of decoded pictures may be set differently from the decoding order, and based on this, forward prediction as well as backward prediction may be performed during inter prediction based on this.
- S400 may be performed by the entropy decoding unit 310 of the decoding apparatus described above in FIG. 3
- S410 may be performed by the prediction unit 330
- S420 may be performed by the residual processing unit 320
- S430 may be performed by the adder 340
- S440 may be performed by the filtering unit 350 .
- a modified reconstructed picture may be generated through an in-loop filtering procedure for the reconstructed picture generated through the picture reconstruction procedure, and the modified reconstructed picture may be output as a decoded picture, and also of the decoding apparatus It may be stored in the decoded picture buffer or the memory 360 and used as a reference picture in an inter prediction procedure when decoding a picture thereafter.
- the in-loop filtering procedure may be omitted, and in this case, the reconstructed picture may be output as a decoded picture, and is also stored in the decoded picture buffer or memory 360 of the decoding device and interpolated during decoding of subsequent pictures. It can be used as a reference picture in the prediction procedure.
- S500 may be performed by the prediction unit 220 of the encoding apparatus described above in FIG. 2
- S510 may be performed by the residual processing unit 230
- S520 may be performed by the entropy encoding unit 240 .
- S500 may include the inter/intra prediction procedure described in this document
- S610 may include the residual processing procedure described in this document
- S520 may include the information encoding procedure described in this document .
- 6 exemplarily shows a hierarchical structure for a coded image/video.
- a NAL unit may be generated by adding header information (NAL unit header) to a raw byte sequence payload (RBSP) generated in the VCL.
- RBSP refers to slice data, parameter set, SEI message, etc. generated in the VCL.
- the NAL unit header may include NAL unit type information specified according to RBSP data included in the corresponding NAL unit.
- VCL NAL unit and Non-VCL NAL unit may be transmitted through a network by attaching header information according to a data standard of a subsystem.
- the NAL unit may be transformed into a data form of a predetermined standard such as an H.266/VVC file format, Real-time Transport Protocol (RTP), Transport Stream (TS), and transmitted through various networks.
- RTP Real-time Transport Protocol
- TS Transport Stream
- NAL unit type specified according to the type of parameter set included in the Non-VCL NAL unit type.
- NAL unit Type for NAL unit including APS
- the video/video information encoded from the encoding device to the decoding device and signaled in the form of a bitstream includes, as well as intra-picture partitioning-related information, intra/inter prediction information, residual information, in-loop filtering information, etc. It may include information included in the slice header, information included in the APS, information included in the PPS, information included in the SPS, and/or information included in the VPS.
- high level syntax may be coded/signaled for video/image coding.
- a coded picture may consist of one or more slices.
- a parameter describing a coded picture is signaled in a picture header, and a parameter describing a slice is signaled in a slice header.
- the picture header is carried in its own NAL unit form (carried).
- the slice header is present at the beginning of a NAL unit including a payload of a slice (ie, slice data).
- a picture may be divided into subpictures, tiles and/or slices.
- Sub picture signaling may exist in a sequence parameter set (SPS), and tile and square slice signaling may exist in a picture parameter set (PPS).
- Raster-scan slice signaling may be present in the slice header.
- syntax shown in Table 4 and semantics shown in Table 5 may be used for the PPS to indicate the split structure of a picture.
- the number information and the height information may be an Exp-Golomb-coded syntax element.
- the number information may be parsed based on information about a width and a height of a slice including the tile.
- the width information of the slice including the tile may correspond to a syntax element slice_width_in_tiles_minus1[i]
- the slice height information including the tile corresponds to the syntax element slice_height_in_tiles_minus1[i]. can do.
- the i-th slice may be a rectangular slice, and the slices in the tile may also be divided into rectangular slices.
- the decoding apparatus derives the height of the nth slice as the height of the uniform slice when the remaining height of the tile is greater than or equal to the height of the uniform slice, and when the remaining height of the tile is smaller than the height of the uniform slice, the The height of the n-th slice may be derived as the remaining height. Also, when the remaining height of the tile is greater than or equal to the height of the uniform slice, the decoding apparatus derives the updated remaining height by subtracting the height of the n-th slice from the remaining height, and the updated remaining height is the uniform slice If it is greater than or equal to the height of , the height of the n+1th slice may be derived as the height of the uniform slice.
- the decoding apparatus may derive the height of the n+1th slice as the updated remaining height. That is, except for the last slice in the tile, the heights of slices from the n-th slice to the last slice may be uniformly derived.
- the height of the last slice may be less than or equal to the height of slices having the same height (the n ⁇ 1th slice to the slice immediately preceding the last slice).
- the list NumCtuInSlice[i] for i in the range from 0 to num_slices_in_pic_minus1 indicates the number of CTUs in the i-th slice, i in the range from 0 to num_slices_in_pic_minus1 and 0 to NumCtuInSlice[i] - 1
- the matrix CtbAddrInSlice[i][j] for j of the included range indicates the picture raster scan address of the j-th CTB in the i-th slice and may be derived as shown in Table 7 below.
- syntax shown in Table 8 and semantics shown in Table 9 may be used for PPS to indicate a picture division structure.
- uniform_slice_spacing_flag[i] if the value of uniform_slice_spacing_flag[i] is 1, it indicates that CTU rows are uniformly distributed throughout the tile and signaled using syntax elements uniform_slice_height_in_ctu_minus1[i]. If the value of uniform_slice_spacing_flag[i] is 0, this indicates that CTU rows may or may not be evenly distributed throughout the tile, and are signaled using the syntax elements num_slice_in_tile_minus1[i] and slice_height_in_ctu_minus1[i].
- slice_rows_height_in_ctu_minus1[i] + 1 indicates the height of the slice excluding the last slice of the tile in units of CTB when the value of uniform_slice_spacing_flag[i] is 1.
- the value of slice_rows_height_in_ctu_minus1[i] must be in the range from 0 to RowHeight [tileY] - 1 inclusive.
- tileY is an index of a tile row including the slices.
- the decoding apparatus may parse slice_rows_height_in_ctu_minus1 or num_slice_in_tile_minus1 from the PPS based on the value of uniform_slice_spacing_flag. For example, if the value of the uniform_slice_spacing_flag is 1, the decoding apparatus may parse slice_rows_height_in_ctu_minus1 from the PPS and derive the heights of the remaining slices except for the last slice in the tile based on the value of slice_rows_height_in_ctu_minus1.
- variables NumSlicesInTileMinus1[i] and SliceHeightInCtuMinus1[i+k] regarding the number information and height information of slices in a tile may be derived as follows.
- k may be in a range from 0 to NumSlicesInTileMinus1[i].
- the list NumCtuInSlice[i] for i in the range from 0 to num_slices_in_pic_minus1 indicates the number of CTUs in the i-th slice, i in the range from 0 to num_slices_in_pic_minus1 and 0 to NumCtuInSlice[i] - 1
- the matrix CtbAddrInSlice[i][j] for j of the included range indicates the picture raster scan address of the j-th CTB in the i-th slice and may be derived as shown in Table 11 below.
- FIG. 7 and 8 schematically show an example of a video/image encoding method and related components according to an embodiment of the present document.
- the video/image encoding method disclosed in FIG. 7 may be performed by the (video/image) encoding apparatus 200 illustrated in FIGS. 2 and 8 .
- S700 of FIG. 7 may be performed by the image division unit 210 of the encoding apparatus 200
- S710 may be performed by the prediction unit 220 of the encoding apparatus 200
- S720 may be performed by the residual processing unit 230 of the encoding apparatus 200
- S730 and S740 may be performed by the entropy encoding unit 240 of the encoding apparatus 200 .
- the video/image encoding method disclosed in FIG. 7 may include the embodiments described above in this document.
- the image segmentation unit 210 of the encoding apparatus may derive slices within a tile of the current picture ( S700 ).
- the image divider 210 may divide an input image (or a picture, a frame) into one or more CUs.
- the input image may include one or more pictures.
- One picture may be divided into one or more tiles, bricks, slices and/or groups of tiles.
- a slice may include one or more bricks, tiles and/or groups of tiles.
- One brick may contain one or more CTU rows in a tile.
- One tile group may include one or more tiles.
- One tile may include one or more CTUs.
- the CTU may be divided into one or more CUs.
- the image divider 210 may divide the rectangular slice into a plurality of tiles, and divide at least one of the plurality of tiles to derive a plurality of rectangular slices can do.
- the residual processing unit 230 of the encoding apparatus may generate a residual sample and residual information based on the prediction samples generated by the prediction unit 220 and the original picture (original block, original samples) (S720) .
- the residual information is information about the residual samples, and may include information about (quantized) transform coefficients for the residual samples.
- the entropy encoding unit 240 of the encoding apparatus may generate partitioning related information based on the partitioning structure derived from the image dividing unit 210 .
- the partitioning-related information may include information about the number of slices whose heights are explicitly signaled in a tile (number information) and information about the heights of slices whose heights are explicitly signaled (height information).
- the entropy encoding unit 240 may include number information regarding the number of slices whose height is explicitly signaled (provided) in the tile based on the slices derived from the image segmentation unit 210 and the height is explicit. Height information regarding the heights of slices signaled (provided) as , may be generated (S730).
- the entropy encoding unit 240 includes partitioning-related information including the number information and the height information, prediction-related information generated by the prediction unit 220, and/or residual information generated by the residual processing unit 230.
- Image information may be encoded (S740).
- Information encoded by the entropy encoding unit 240 may be output in the form of a bitstream.
- the bitstream may be transmitted to the decoding device through a network or a storage medium.
- the entropy encoding unit 240 may encode image information including syntax elements uniform_slice_spacing_flag, num_slices_in_tile_minus1, slice_rows_height_in_ctu_minus1 and/or slice_height_in_ctu_minus1 based on Tables 8 and 9 described above.
- the syntax elements num_slices_in_tile_minus1, slice_rows_height_in_ctu_minus1, and slice_height_in_ctu_minus1 may or may not be included in the image information based on the value of uniform_slice_spacing_flag.
- FIGS. 9 and 10 schematically show an example of a video/image decoding method and related components according to an embodiment of the present document.
- the entropy decoding unit 310 may parse information (number information) about the number of slices whose height is explicitly signaled among slices in a tile of the current picture from the bitstream (S900), and the number Based on the information, information (height information) regarding the height of the slices whose height is explicitly signaled may be parsed from the bitstream ( S910 ).
- the entropy decoding unit 310 may parse the number information and the height information from the picture parameter set of the bitstream based on Table 4 described above.
- the number information may be parsed based on information about the width and height of a slice including the tile. In this case, the slice including the tile and/or the slices in the tile may be rectangular slices.
- the number information and the height information may be Exp-Golomb-coded syntax elements.
- the height information may include syntax elements for each of the slices for which the height is explicitly signaled.
- the number of syntax elements may be equal to the value of the number information.
- the entropy decoding unit 310 may derive the height of the remaining slices (slices not explicitly signaled) in the tile except for the last slice to be the same as the height of the n-1 th slice. Accordingly, heights of slices from the n-th slice to the last slice in the tile may be uniform except for the last slice in the tile.
- the entropy decoding unit 310 may derive the height of the last slice in the tile based on the remaining height obtained by subtracting the heights of other slices in the tile from the height of the tile.
- the entropy decoding unit 310 may derive the number of slices in the tile ( S920 ).
- the number of slices in the tile may correspond to the number of slices from the 0th slice to the last slice in the tile.
- the entropy decoding unit 310 parses the syntax elements slice_width_in_tiles_minus1 and slice_height_in_tiles_minus1 from the picture parameter set of the bitstream based on Table 8, and based on the value of the syntax element slice_width_in_tiles_minus1 and slice_height_in_tiles_minus1 from the syntax element slice_width_in_tiles_minus1 and slice_height_in_tiles_minus1 from the picture parameter set You can parse the uniform_slice_spacing_flag.
- the functional units shown in each figure may be implemented and performed on a computer, a processor, a microprocessor, a controller, or a chip.
- information for implementation (ex. information on instructions) or an algorithm may be stored in a digital storage medium.
- a content streaming system to which embodiments of this document are applied may largely include an encoding server, a streaming server, a web server, a media storage, a user device, and a multimedia input device.
Abstract
Description
Claims (15)
- 비디오 디코딩 장치에 의하여 수행되는 비디오 디코딩 방법에 있어서,비트스트림으로부터 현재 픽처의 타일 내에서 높이가 명시적으로 시그널링되는 슬라이스들의 개수에 관한 개수 정보를 파싱하는 단계;상기 개수 정보를 기반으로 상기 비트스트림으로부터 상기 높이가 명시적으로 시그널링되는 슬라이스들의 높이에 관한 높이 정보를 파싱하는 단계;상기 개수 정보와 상기 높이 정보를 기반으로 상기 타일 내 슬라이스들의 개수를 도출하는 단계; 및상기 타일 내 슬라이스들을 기반으로 상기 현재 픽처를 디코딩하는 단계를 포함하되,상기 높이 정보는 상기 개수 정보의 값과 동일한 개수의 신택스 요소들을 포함하고,상기 개수 정보의 값이 n임을 기반으로, 상기 타일 내 0번째 슬라이스 내지 n-1번째 슬라이스의 높이들은 상기 신택스 요소들을 기반으로 도출되고,상기 타일 내 n번째 슬라이스의 높이는 상기 n-1번째 슬라이스의 높이를 기반으로 도출되고,상기 타일 내 마지막 슬라이스의 높이는 상기 타일의 높이에서 상기 타일 내 다른 슬라이스들의 높이를 뺀 나머지 높이를 기반으로 도출되는 것을 특징으로 하는, 비디오 디코딩 방법.
- 제1항에 있어서,상기 타일 내 슬라이스들의 개수는 상기 0번째 슬라이스부터 상기 마지막 슬라이스까지의 슬라이스들의 개수인 것을 특징으로 하는, 비디오 디코딩 방법.
- 제1항에 있어서,상기 n번째 슬라이스의 높이는 상기 n-1번째 슬라이스의 높이와 같게 도출되는 것을 특징으로 하는, 비디오 디코딩 방법.
- 제1항에 있어서,상기 n번째 슬라이스부터 상기 타일 내 마지막 슬라이스 바로 전 슬라이스까지의 슬라이스들의 높이는 균일한 것을 특징으로 하는, 비디오 디코딩 방법.
- 제4항에 있어서,상기 마지막 슬라이스의 높이는 상기 n-1번째 슬라이스의 높이 보다 작거나 같은 것을 특징으로 하는, 비디오 디코딩 방법.
- 제1항에 있어서,상기 타일의 높이에서 상기 0번째 슬라이스부터 상기 n-1번째 슬라이스의 높이들의 합을 뺀 나머지 높이가 균일 슬라이스의 높이보다 크거나 같은지 비교하는 단계를 더 포함하되,상기 균일 슬라이스의 높이는 상기 n-1번째 슬라이스의 높이와 같고,상기 타일의 높이에서 상기 0번째 슬라이스부터 상기 n-1번째 슬라이스의 높이들의 합을 뺀 나머지 높이가 상기 균일 슬라이스의 높이보다 크거나 같음을 기반으로 상기 균일 슬라이스의 높이를 갖는 상기 n번째 슬라이스가 도출되고,상기 타일의 높이에서 상기 0번째 슬라이스부터 상기 n-1번째 슬라이스의 높이들의 합을 뺀 나머지 높이가 상기 균일 슬라이스의 높이보다 작음을 기반으로, 상기 나머지 높이를 갖는 상기 n번째 슬라이스가 도출되는 것을 특징으로 하는, 비디오 디코딩 방법.
- 제6항에 있어서,상기 나머지 높이가 상기 균일 슬라이스의 높이보다 크거나 같음을 기반으로, 상기 타일의 높이에서 상기 0번째 슬라이스부터 상기 n-1번째 슬라이스의 높이들의 합을 뺀 나머지 높이에서 상기 n번째 슬라이스의 높이를 빼서 업데이트된 나머지 높이를 도출하고,상기 업데이트된 나머지 높이가 상기 균일 슬라이스의 높이보다 크거나 같음을 기반으로, 상기 균일 슬라이스의 높이를 갖는 n+1번째 슬라이스가 도출되고,상기 업데이트된 나머지 높이가 상기 균일 슬라이스의 높이보다 작음을 기반으로, 상기 업데이트된 나머지 높이를 갖는 n+1번째 슬라이스가 도출되는 것을 특징으로 하는, 비디오 디코딩 방법.
- 제1항에 있어서,상기 개수 정보와 상기 높이 정보는 지수-골롬 코딩된 신택스 요소(Exp-Golomb-coded syntax element)를 포함하는 것을 특징으로 하는, 비디오 디코딩 방법.
- 제1항에 있어서,상기 높이 정보는 상기 높이가 명시적으로 시그널링되는 슬라이스들 각각에 대한 신택스 요소들을 포함하고,상기 신택스 요소들의 개수는 상기 개수 정보의 값과 동일한 것을 특징으로 하는, 비디오 디코딩 방법.
- 제1항에 있어서,상기 개수 정보는 신택스 요소 num_exp_slices_in_tile을 포함하고,상기 높이 정보는 신택스 요소 exp_slice_height_in_ctu_minus1을 포함하는 것을 특징으로 하는, 비디오 디코딩 방법.
- 제1항에 있어서,상기 타일 내 슬라이스들은 직사각형 슬라이스들인 것을 특징으로 하는, 비디오 디코딩 방법.
- 제1항에 있어서,상기 개수 정보는 상기 타일을 포함하는 슬라이스의 너비(width) 및 높이(height)에 관한 정보를 기반으로 파싱되는 것을 특징으로 하는, 비디오 디코딩 방법.
- 제1항에 있어서,상기 높이 정보는 상기 0번째 슬라이스의 높이 내지 상기 n-1 번째 슬라이스의 높이 각각을 CTU(coding tree unit) 행들(rows) 단위로 나타내는 것을 특징으로 하는, 비디오 디코딩 방법.
- 비디오 인코딩 장치에 의하여 수행되는 비디오 인코딩 방법에 있어서,현재 픽처의 타일 내 슬라이스들을 도출하는 단계;상기 도출된 슬라이스들을 기반으로 인트라 예측 또는 인터 예측 중 적어도 하나를 수행하여 예측 샘플들을 생성하는 단계;상기 예측 샘플들을 기반으로 레지듀얼 정보를 생성하는 단계;상기 도출된 슬라이스들을 기반으로 상기 타일 내 높이가 명시적으로 시그널링되는 슬라이스들의 개수에 관한 개수 정보와 상기 높이가 명시적으로 시그널링되는 슬라이스들의 높이에 관한 높이 정보를 생성하는 단계; 및상기 레지듀얼 정보, 상기 개수 정보 및 상기 높이 정보를 포함하는 영상 정보를 인코딩하는 단계를 포함하되,상기 높이 정보는 상기 개수 정보의 값이 n임을 기반으로 상기 타일 내 0번째 슬라이스 내지 n-1번째 슬라이스의 높이를 각각 나타내는 n개의 신택스 요소들을 포함하고,상기 타일 내 n번째 슬라이스의 높이는 상기 n-1번째 슬라이스의 높이를 기반으로 도출되고,상기 타일 내 마지막 슬라이스의 높이는 상기 타일의 높이에서 상기 타일 내 다른 슬라이스들의 높이를 뺀 나머지 높이를 기반으로 도출되는 것을 특징으로 하는, 비디오 인코딩 방법.
- 컴퓨터 판독 가능한 디지털 저장 매체로서, 상기 디지털 저장 매체는 비디오 디코딩 장치에 의하여 비디오 디코딩 방법을 수행하도록 야기하는 정보를 포함하고, 상기 비디오 디코딩 방법은,영상 정보로부터 현재 픽처의 타일 내에서 높이가 명시적으로 시그널링되는 슬라이스들의 개수에 관한 개수 정보를 파싱하는 단계;상기 개수 정보를 기반으로 상기 영상 정보로부터 상기 높이가 명시적으로 시그널링되는 슬라이스들의 높이에 관한 높이 정보를 파싱하는 단계; 및상기 개수 정보와 상기 높이 정보를 기반으로 상기 타일 내 슬라이스들의 개수를 도출하는 단계;상기 타일 내 슬라이스들을 기반으로 상기 현재 픽처를 디코딩하는 단계를 포함하되,상기 높이 정보는 상기 개수 정보의 값과 동일한 개수의 신택스 요소들을 포함하고,상기 개수 정보의 값이 n임을 기반으로, 상기 타일 내 0번째 슬라이스 내지 n-1번째 슬라이스의 높이들은 상기 신택스 요소들을 기반으로 도출되고,상기 타일 내 n번째 슬라이스의 높이는 상기 n-1번째 슬라이스의 높이를 기반으로 도출되고,상기 타일 내 마지막 슬라이스의 높이는 상기 타일의 높이에서 상기 타일 내 다른 슬라이스들의 높이를 뺀 나머지 높이를 기반으로 도출되는 것을 특징으로 하는 것을 특징으로 하는, 디지털 저장 매체.
Priority Applications (7)
Application Number | Priority Date | Filing Date | Title |
---|---|---|---|
AU2020392150A AU2020392150B2 (en) | 2019-11-28 | 2020-11-26 | Slice and tile configuration for image/video coding |
KR1020227017316A KR20220087513A (ko) | 2019-11-28 | 2020-11-26 | 영상/비디오 코딩을 위한 슬라이스 및 타일 구성 |
CN202080091848.3A CN114930855A (zh) | 2019-11-28 | 2020-11-26 | 用于图像/视频编译的切片和拼块配置 |
CA3163333A CA3163333A1 (en) | 2019-11-28 | 2020-11-26 | Slice and tile configuration for image/video coding |
US17/780,476 US11956450B2 (en) | 2019-11-28 | 2020-11-26 | Slice and tile configuration for image/video coding |
MX2022006485A MX2022006485A (es) | 2019-11-28 | 2020-11-26 | Configuracion de segmento y mosaico para codificacion de imagenes/video. |
AU2024201923A AU2024201923A1 (en) | 2019-11-28 | 2024-03-26 | Slice and tile configuration for image/video coding |
Applications Claiming Priority (2)
Application Number | Priority Date | Filing Date | Title |
---|---|---|---|
US201962941846P | 2019-11-28 | 2019-11-28 | |
US62/941,846 | 2019-11-28 |
Publications (1)
Publication Number | Publication Date |
---|---|
WO2021107621A1 true WO2021107621A1 (ko) | 2021-06-03 |
Family
ID=76128909
Family Applications (1)
Application Number | Title | Priority Date | Filing Date |
---|---|---|---|
PCT/KR2020/016883 WO2021107621A1 (ko) | 2019-11-28 | 2020-11-26 | 영상/비디오 코딩을 위한 슬라이스 및 타일 구성 |
Country Status (7)
Country | Link |
---|---|
US (1) | US11956450B2 (ko) |
KR (1) | KR20220087513A (ko) |
CN (1) | CN114930855A (ko) |
AU (2) | AU2020392150B2 (ko) |
CA (1) | CA3163333A1 (ko) |
MX (1) | MX2022006485A (ko) |
WO (1) | WO2021107621A1 (ko) |
Citations (5)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
JP2014030187A (ja) * | 2012-07-02 | 2014-02-13 | Canon Inc | メディアファイル生成方法、メディアファイル生成プログラム |
KR20170113384A (ko) * | 2016-03-30 | 2017-10-12 | 한국전자통신연구원 | 픽쳐 분할 정보를 사용하는 비디오의 부호화 및 복호화를 위한 방법 및 장치 |
KR20180004029A (ko) * | 2016-07-01 | 2018-01-10 | 에스케이텔레콤 주식회사 | 고해상도 영상 스트리밍을 위한 영상 비트스트림 생성방법 및 장치 |
KR20180096831A (ko) * | 2013-10-25 | 2018-08-29 | 미디어텍 인크. | 전송 동기화 이벤트에 따라 압축된 픽처의 전송을 제어하는 방법 및 장치 |
KR20190033771A (ko) * | 2017-09-22 | 2019-04-01 | 삼성전자주식회사 | 영상 인코딩 장치, 영상 디코딩 장치, 영상 인코딩 방법 및 영상 디코딩 방법 |
Family Cites Families (4)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
CN116366835A (zh) | 2013-01-04 | 2023-06-30 | Ge视频压缩有限责任公司 | 高效可伸缩编码概念 |
EP2941864B1 (en) | 2013-01-07 | 2016-12-07 | Telefonaktiebolaget LM Ericsson (publ) | Limiting use of largest transform unit size for intra coding units in inter coded slices of encoded video |
CN113039799B (zh) * | 2019-08-23 | 2023-08-01 | 苹果公司 | 图像信号编码/解码方法及其设备 |
EP4088453A4 (en) * | 2020-02-14 | 2023-05-10 | Beijing Bytedance Network Technology Co., Ltd. | COLOCATED IMAGE INDICATION IN VIDEO BITSTREAMS |
-
2020
- 2020-11-26 WO PCT/KR2020/016883 patent/WO2021107621A1/ko active Application Filing
- 2020-11-26 US US17/780,476 patent/US11956450B2/en active Active
- 2020-11-26 AU AU2020392150A patent/AU2020392150B2/en active Active
- 2020-11-26 MX MX2022006485A patent/MX2022006485A/es unknown
- 2020-11-26 KR KR1020227017316A patent/KR20220087513A/ko unknown
- 2020-11-26 CA CA3163333A patent/CA3163333A1/en active Pending
- 2020-11-26 CN CN202080091848.3A patent/CN114930855A/zh active Pending
-
2024
- 2024-03-26 AU AU2024201923A patent/AU2024201923A1/en active Pending
Patent Citations (5)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
JP2014030187A (ja) * | 2012-07-02 | 2014-02-13 | Canon Inc | メディアファイル生成方法、メディアファイル生成プログラム |
KR20180096831A (ko) * | 2013-10-25 | 2018-08-29 | 미디어텍 인크. | 전송 동기화 이벤트에 따라 압축된 픽처의 전송을 제어하는 방법 및 장치 |
KR20170113384A (ko) * | 2016-03-30 | 2017-10-12 | 한국전자통신연구원 | 픽쳐 분할 정보를 사용하는 비디오의 부호화 및 복호화를 위한 방법 및 장치 |
KR20180004029A (ko) * | 2016-07-01 | 2018-01-10 | 에스케이텔레콤 주식회사 | 고해상도 영상 스트리밍을 위한 영상 비트스트림 생성방법 및 장치 |
KR20190033771A (ko) * | 2017-09-22 | 2019-04-01 | 삼성전자주식회사 | 영상 인코딩 장치, 영상 디코딩 장치, 영상 인코딩 방법 및 영상 디코딩 방법 |
Also Published As
Publication number | Publication date |
---|---|
AU2024201923A1 (en) | 2024-04-11 |
MX2022006485A (es) | 2022-10-10 |
US11956450B2 (en) | 2024-04-09 |
AU2020392150B2 (en) | 2024-03-21 |
CN114930855A (zh) | 2022-08-19 |
KR20220087513A (ko) | 2022-06-24 |
AU2020392150A1 (en) | 2022-07-07 |
US20230007278A1 (en) | 2023-01-05 |
CA3163333A1 (en) | 2021-06-03 |
Similar Documents
Publication | Publication Date | Title |
---|---|---|
WO2020197236A1 (ko) | 서브 픽처 핸들링 구조 기반 영상 또는 비디오 코딩 | |
WO2020189960A1 (ko) | 크로마 포맷에 대한 정보를 시그널링 하는 방법 및 장치 | |
WO2021034115A1 (ko) | 크로마 양자화 파라미터 오프셋 관련 정보를 코딩하는 영상 디코딩 방법 및 그 장치 | |
WO2021201515A1 (ko) | Hls를 시그널링하는 영상 부호화/복호화 방법, 장치 및 비트스트림을 저장한 컴퓨터 판독 가능한 기록 매체 | |
WO2021225338A1 (ko) | 영상 디코딩 방법 및 그 장치 | |
WO2020076066A1 (ko) | 신택스 디자인 방법 및 신택스를 이용하여 코딩을 수행하는 장치 | |
WO2021118295A1 (ko) | 루프 필터링을 제어하기 위한 영상 코딩 장치 및 방법 | |
WO2020184821A1 (ko) | Mpm 리스트를 구성하는 방법 및 장치 | |
WO2021118265A1 (ko) | 적응적 루프 필터를 적용하는 비디오 또는 영상 코딩 | |
WO2021118261A1 (ko) | 영상 정보를 시그널링하는 방법 및 장치 | |
WO2021133060A1 (ko) | 서브픽처 기반 영상 코딩 장치 및 방법 | |
WO2021145728A1 (ko) | 인루프 필터링 기반 영상 코딩 장치 및 방법 | |
WO2021118293A1 (ko) | 필터링 기반 영상 코딩 장치 및 방법 | |
WO2020197031A1 (ko) | 영상 코딩 시스템에서 다중 참조 라인 기반의 인트라 예측 방법 및 장치 | |
WO2021107622A1 (ko) | 영상/비디오 코딩 방법 및 장치 | |
WO2021137588A1 (ko) | 픽처 헤더를 포함하는 영상 정보를 코딩하는 영상 디코딩 방법 및 그 장치 | |
WO2021118263A1 (ko) | 영상 정보를 시그널링하는 방법 및 장치 | |
WO2021137589A1 (ko) | 영상 디코딩 방법 및 그 장치 | |
WO2021137591A1 (ko) | Ols dpb 파라미터 인덱스를 포함하는 영상 정보 기반 영상 디코딩 방법 및 그 장치 | |
WO2021118262A1 (ko) | 픽처 레벨 또는 슬라이스 레벨에서 적용되는 영상 정보를 시그널링하는 방법 및 장치 | |
WO2021145726A1 (ko) | 적응적 루프 필터링 기반 영상 코딩 장치 및 방법 | |
WO2021107621A1 (ko) | 영상/비디오 코딩을 위한 슬라이스 및 타일 구성 | |
WO2021107624A1 (ko) | 픽처의 분할 구조에 기반한 영상/비디오 코딩 방법 및 장치 | |
WO2021107623A1 (ko) | 영상/비디오 인코딩/디코딩 방법 및 장치 | |
WO2021107626A1 (ko) | 영상/비디오 인코딩/디코딩 시스템에서 슬라이스에 관한 정보의 시그널링 방법 및 장치 |
Legal Events
Date | Code | Title | Description |
---|---|---|---|
121 | Ep: the epo has been informed by wipo that ep was designated in this application |
Ref document number: 20893401 Country of ref document: EP Kind code of ref document: A1 |
|
ENP | Entry into the national phase |
Ref document number: 20227017316 Country of ref document: KR Kind code of ref document: A |
|
ENP | Entry into the national phase |
Ref document number: 3163333 Country of ref document: CA |
|
NENP | Non-entry into the national phase |
Ref country code: DE |
|
ENP | Entry into the national phase |
Ref document number: 2020392150 Country of ref document: AU Date of ref document: 20201126 Kind code of ref document: A |
|
122 | Ep: pct application non-entry in european phase |
Ref document number: 20893401 Country of ref document: EP Kind code of ref document: A1 |