WO2021202178A1 - Methods and devices for high-level syntax in video coding - Google Patents

Methods and devices for high-level syntax in video coding Download PDF

Info

Publication number
WO2021202178A1
WO2021202178A1 PCT/US2021/023751 US2021023751W WO2021202178A1 WO 2021202178 A1 WO2021202178 A1 WO 2021202178A1 US 2021023751 W US2021023751 W US 2021023751W WO 2021202178 A1 WO2021202178 A1 WO 2021202178A1
Authority
WO
WIPO (PCT)
Prior art keywords
sps
flag
equal
value
slice
Prior art date
Application number
PCT/US2021/023751
Other languages
French (fr)
Inventor
Hong-Jheng Jhu
Yi-Wen Chen
Xiaoyu XIU
Tsung-Chuan MA
Wei Chen
Xianglin Wang
Bing Yu
Original Assignee
Beijing Dajia Internet Information Technology Co., Ltd.
Priority date (The priority date is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the date listed.)
Filing date
Publication date
Application filed by Beijing Dajia Internet Information Technology Co., Ltd. filed Critical Beijing Dajia Internet Information Technology Co., Ltd.
Priority to EP21781444.1A priority Critical patent/EP4118835A4/en
Priority to CN202180024900.8A priority patent/CN115336280A/en
Publication of WO2021202178A1 publication Critical patent/WO2021202178A1/en
Priority to US17/953,311 priority patent/US20230031964A1/en

Links

Classifications

    • HELECTRICITY
    • H04ELECTRIC COMMUNICATION TECHNIQUE
    • H04NPICTORIAL COMMUNICATION, e.g. TELEVISION
    • H04N19/00Methods or arrangements for coding, decoding, compressing or decompressing digital video signals
    • H04N19/10Methods or arrangements for coding, decoding, compressing or decompressing digital video signals using adaptive coding
    • H04N19/102Methods or arrangements for coding, decoding, compressing or decompressing digital video signals using adaptive coding characterised by the element, parameter or selection affected or controlled by the adaptive coding
    • H04N19/119Adaptive subdivision aspects, e.g. subdivision of a picture into rectangular or non-rectangular coding blocks
    • HELECTRICITY
    • H04ELECTRIC COMMUNICATION TECHNIQUE
    • H04NPICTORIAL COMMUNICATION, e.g. TELEVISION
    • H04N19/00Methods or arrangements for coding, decoding, compressing or decompressing digital video signals
    • H04N19/10Methods or arrangements for coding, decoding, compressing or decompressing digital video signals using adaptive coding
    • H04N19/169Methods or arrangements for coding, decoding, compressing or decompressing digital video signals using adaptive coding characterised by the coding unit, i.e. the structural portion or semantic portion of the video signal being the object or the subject of the adaptive coding
    • H04N19/179Methods or arrangements for coding, decoding, compressing or decompressing digital video signals using adaptive coding characterised by the coding unit, i.e. the structural portion or semantic portion of the video signal being the object or the subject of the adaptive coding the unit being a scene or a shot
    • HELECTRICITY
    • H04ELECTRIC COMMUNICATION TECHNIQUE
    • H04NPICTORIAL COMMUNICATION, e.g. TELEVISION
    • H04N19/00Methods or arrangements for coding, decoding, compressing or decompressing digital video signals
    • H04N19/10Methods or arrangements for coding, decoding, compressing or decompressing digital video signals using adaptive coding
    • H04N19/169Methods or arrangements for coding, decoding, compressing or decompressing digital video signals using adaptive coding characterised by the coding unit, i.e. the structural portion or semantic portion of the video signal being the object or the subject of the adaptive coding
    • H04N19/186Methods or arrangements for coding, decoding, compressing or decompressing digital video signals using adaptive coding characterised by the coding unit, i.e. the structural portion or semantic portion of the video signal being the object or the subject of the adaptive coding the unit being a colour or a chrominance component
    • HELECTRICITY
    • H04ELECTRIC COMMUNICATION TECHNIQUE
    • H04NPICTORIAL COMMUNICATION, e.g. TELEVISION
    • H04N19/00Methods or arrangements for coding, decoding, compressing or decompressing digital video signals
    • H04N19/46Embedding additional information in the video signal during the compression process
    • HELECTRICITY
    • H04ELECTRIC COMMUNICATION TECHNIQUE
    • H04NPICTORIAL COMMUNICATION, e.g. TELEVISION
    • H04N19/00Methods or arrangements for coding, decoding, compressing or decompressing digital video signals
    • H04N19/70Methods or arrangements for coding, decoding, compressing or decompressing digital video signals characterised by syntax aspects related to video coding, e.g. related to compression standards

Definitions

  • video coding standards include versatile video coding (VVC), joint exploration test model (JEM), high- efficiency video coding (H.265/HEVC), advanced video coding (H.264/AVC), moving picture expert group (MPEG) coding, or the like.
  • Video coding generally utilizes prediction methods (e.g., inter-prediction, intra-prediction, or the like) that take advantage of redundancy present in video images or sequences.
  • An important goal of video coding techniques is to compress video data into a form that uses a lower bit rate while avoiding or minimizing degradations to video quality.
  • Examples of the present disclosure provide methods and apparatus for high-level syntax coding in video coding.
  • a method for decoding a video signal may include receiving arranged partition constraint syntax elements in sequence parameter set (SPS) level.
  • the arranged partition constraint syntax elements may be arranged so that intra prediction related syntax elements may be defined before inter prediction related syntax elements.
  • the decoder may also obtain a first reference picture I (0) and a second reference picture I (1) associated with a video block in a bitstream.
  • the first reference picture I (0) may be before a current picture and the second reference picture I (1) may be after the current picture in display order.
  • the decoder may also obtain first prediction samples I (0) (i, j) of the video block from a reference block in the first reference picture I (0) .
  • the i and j may represent a coordinate of one sample with the current picture.
  • the decoder may also obtain second prediction samples I (1) (i, j) of the video block from a reference block in the second reference picture I (1) .
  • the decoder may also obtain bi-prediction samples based on the arranged partition constraint syntax elements, the first prediction samples I (0) (i, j), and the second prediction samples I (1) (i, j).
  • a computing device may include one or more processors, a non-transitory computer-readable memory storing instructions executable by the one or more processors.
  • the one or more processors may be configured to receive arranged partition constraint syntax elements in SPS level.
  • the arranged partition constraint syntax elements may be arranged so that intra prediction related syntax elements may be defined before inter prediction related syntax elements.
  • the one or more processors may further be configured to obtain a first reference picture I (0) and a second reference picture I (1) associated with a video block in a bitstream.
  • the first reference picture I (0) may be before a current picture and the second reference picture I (1) may be after the current picture in display order.
  • the one or more processors may further be configured to obtain first prediction samples I (0) (i, j) of the video block from a reference block in the first reference picture I (0) .
  • the i and j may represent a coordinate of one sample with the current picture.
  • the one or more processors may further be configured to obtain second prediction samples I (1) (i, j) of the video block from a reference block in the second reference picture I (1) .
  • the one or more processors may further be configured to obtain bi-prediction samples based on the arranged partition constraint syntax elements, the first prediction samples I (0) (i, j), and the second prediction samples I (1) (i, j).
  • a non-transitory computer- readable storage medium having stored therein instructions is provided. When the instructions are executed by one or more processors of the apparatus, the instructions may cause the apparatus to receive arranged partition constraint syntax elements in SPS level.
  • the arranged partition constraint syntax elements may be arranged so that intra prediction related syntax elements may be defined before inter prediction related syntax elements.
  • the instructions may also cause the apparatus to obtain a first reference picture I (0) and a second reference picture I (1) associated with a video block in a bitstream.
  • the first reference picture I (0) may be before a current picture and the second reference picture I (1) may be after the current picture in display order.
  • the instructions may also cause the apparatus to obtain first prediction samples I (0) (i, j) of the video block from a reference block in the first reference picture I (0) .
  • the i and j may represent a coordinate of one sample with the current picture.
  • the instructions may also cause the apparatus to obtain second prediction samples I (1) (i, j) of the video block from a reference block in the second reference picture I (1) .
  • the instructions may also cause the apparatus to obtain bi-prediction samples based on the arranged partition constraint syntax elements, the first prediction samples I (0) (i, j), and the second prediction samples I (1) (i, j).
  • FIG. 1 is a block diagram of an encoder, according to an example of the present disclosure.
  • FIG. 2 is a block diagram of a decoder, according to an example of the present disclosure.
  • FIG. 3A is a diagram illustrating block partitions in a multi-type tree structure, according to an example of the present disclosure.
  • FIG. 3B is a diagram illustrating block partitions in a multi-type tree structure, according to an example of the present disclosure.
  • FIG. 3C is a diagram illustrating block partitions in a multi-type tree structure, according to an example of the present disclosure.
  • FIG. 3D is a diagram illustrating block partitions in a multi-type tree structure, according to an example of the present disclosure.
  • FIG. 3E is a diagram illustrating block partitions in a multi-type tree structure, according to an example of the present disclosure.
  • FIG. 4 is a method for decoding a video signal, according to an example of the present disclosure.
  • FIG. 5 is a method for decoding a video signal, according to an example of the presentt d diiscllosure.
  • FIG. 6 is a diagram illustrating a computing environment coupled with a user interface, according to an example of the present disclosure. DETAILED DESCRIPTION [0020]
  • first, second, third, etc. may be used herein to describe various information, the information should not be limited by these terms. These terms are only used to distinguish one category of information from another. For example, without departing from the scope of the present disclosure, first information may be termed as second information; and similarly, second information may also be termed as first information.
  • first information may be termed as second information; and similarly, second information may also be termed as first information.
  • second information may also be termed as first information.
  • the term “if” may be understood to mean “when” or “upon” or “in response to a judgment” depending on the context.
  • the first version of the HEVC standard was finalized in October 2013, which offers approximately 50% bit-rate saving or equivalent perceptual quality compared to the prior generation video coding standard H.264/MPEG AVC.
  • the HEVC standard provides significant coding improvements than its predecessor, there is evidence that superior coding efficiency can be achieved with additional coding tools over HEVC.
  • both VCEG and MPEG started the exploration work of new coding technologies for future video coding standardization.
  • JVET Joint Video Exploration Team
  • JEM Joint Exploration model
  • One reference software was maintained by the JVET by integrating several additional coding tools on top of the HEVC test model (HM).
  • VVC Versatile Video Coding
  • VTM VVC test model
  • FIG. 1 shows a general diagram of a block-based video encoder for the VVC.
  • FIG. 1 shows atypical encoder 100.
  • the encoder 100 has video input 110, motion compensation 112, motion estimation 114, intra/inter mode decision 116, block predictor 140, adder 128, transform 130, quantization 132, prediction related info 142, intra prediction 118, picture buffer 120, inverse quantization 134, inverse transform 136, adder 126, memory 124, in-loop filter 122, entropy coding 138, and bitstream 144.
  • a video frame is partitioned into a plurality of video blocks for processing. For each given video block, a prediction is formed based on either an inter prediction approach or an intra prediction approach.
  • a prediction residual representing the difference between a current video block, part of video input 110, and its predictor, part of block predictor 140, is sent to a transform 130 from adder 128.
  • Transform coefficients are then sent from the Transform 130 to a Quantization 132 for entropy reduction.
  • Quantized coefficients are then fed to an Entropy Coding 138 to generate a compressed video bitstream.
  • prediction related information 142 from an intra/inter mode decision 116 such as video block partition info, motion vectors (MVs), reference picture index, and intra prediction mode, are also fed through the Entropy Coding 138 and saved into a compressed bitstream 144.
  • Compressed bitstream 144 includes a video bitstream.
  • decoder-related circuitries are also needed in order to reconstruct pixels for the purpose of prediction.
  • a prediction residual is reconstructed through an Inverse Quantization 134 and an Inverse Transform 136.
  • This reconstructed prediction residual is combined with a Block Predictor 140 to generate un-filtered reconstructed pixels for a current video block.
  • Spatial prediction uses pixels from samples of already coded neighboring blocks (which are called reference samples) in the same video frame as the current video block to predict the current video block.
  • Temporal prediction uses reconstructed pixels from already-coded video pictures to predict the current video block. Temporal prediction reduces temporal redundancy inherent in the video signal.
  • the temporal prediction signal for a given coding unit (CU) or coding block is usually signaled by one or more MVs, which indicate the amount and the direction of motion between the current CU and its temporal reference. Further, if multiple reference pictures are supported, one reference picture index is additionally sent, which is used to identify from which reference picture in the reference picture storage, the temporal prediction signal comes from.
  • Motion estimation 114 intakes video input 110 and a signal from picture buffer 120 and output, to motion compensation 112, amotion estimation signal.
  • Motion compensation 112 intakes video input 110, a signal from picture buffer 120, and motion estimation signal from motion estimation 114 and output to intra/inter mode decision 116, a motion compensation signal.
  • an intra/inter mode decision 116 in the encoder 100 chooses the best prediction mode, for example, based on the rate- distortion optimization method.
  • the block predictor 140 is then subtracted from the current video block, and the resulting prediction residual is de-correlated using the transform 130 and the quantization 132.
  • the resulting quantized residual coefficients are inverse quantized by the inverse quantization 134 and inverse transformed by the inverse transform 136 to form the reconstructed residual, which is then added back to the prediction block to form the reconstructed signal of the CU.
  • in-loop filtering 122 such as a deblocking filter, a sample adaptive offset (SAO), and/or an adaptive in-loop filter (ALF) may be applied on the reconstructed CU before it is put in the reference picture storage of the picture buffer 120 and used to code future video blocks.
  • coding mode inter or intra
  • prediction mode information motion information
  • quantized residual coefficients are all sent to the entropy coding unit 138 to be further compressed and packed to form the bitstream.
  • FIG. 1 gives the block diagram of a generic block-based hybrid video encoding system.
  • the input video signal is processed block by block (called coding units (CUs)).
  • CUs coding units
  • VTM-1.0 a CU can be up to 128x128 pixels.
  • HEVC High Efficiency Video Coding
  • one coding tree unit (CTU) is split into CUs to adapt to varying local characteristics based on quad/binary/temary-tree.
  • each CU is always used as the basic unit for both prediction and transform without further partitions.
  • the multi-type tree structure one CTU is firstly partitioned by a quad-tree structure. Then, each quad-tree leaf node can be further partitioned by a binary and ternary tree structure.
  • FIG. 3 A shows a diagram illustrating block quaternary partition in a multi-type tree structure, in accordance with the present disclosure.
  • FIG. 3B shows a diagram illustrating block vertical binary partition in a multi-type tree structure, in accordance with the present disclosure.
  • FIG. 3C shows a diagram illustrating block horizontal binary partition in a multitype tree structure, in accordance with the present disclosure.
  • FIG. 3D shows a diagram illustrating block vertical ternary partition in a multi-type tree structure, in accordance with the present disclosure.
  • FIG. 3E shows a diagram illustrating block horizontal ternary partition in a multitype tree structure, in accordance with the present disclosure.
  • spatial prediction and/or temporal prediction may be performed.
  • Spatial prediction (or “intra prediction”) uses pixels from the samples of already coded neighboring blocks (which are called reference samples) in the same video picture/slice to predict the current video block. Spatial prediction reduces spatial redundancy inherent in the video signal.
  • Temporal prediction (also referred to as “inter prediction” or “motion compensated prediction”) uses reconstructed pixels from the already coded video pictures to predict the current video block. Temporal prediction reduces temporal redundancy inherent in the video signal.
  • Temporal prediction signal for a given CU is usually signaled by one or more motion vectors (MVs) which indicate the amount and the direction of motion between the current CU and its temporal reference.
  • MVs motion vectors
  • one reference picture index is additionally sent, which is used to identify from which reference picture in the reference picture storage the temporal prediction signal comes from.
  • the mode decision block in the encoder chooses the best prediction mode, for example, based on the rate-distortion optimization method.
  • the prediction block is then subtracted from the current video block; and the prediction residual is de-correlated using transform and quantized.
  • the quantized residual coefficients are inverse quantized and inverse transformed to form the reconstructed residual, which is then added back to the prediction block to form the reconstructed signal of the CU.
  • in-loop filtering such as deblocking filter, sample adaptive offset (SAO), and adaptive in-loop filter (ALF) may be applied on the reconstructed CU before it is put in the reference picture storage and used to code future video blocks.
  • coding mode inter or intra
  • prediction mode information motion information
  • quantized residual coefficients are all sent to the entropy coding unit to be further compressed and packed to form the bit-stream.
  • FIG. 2 shows a general block diagram of a video decoder for the VVC. Specifically, FIG. 2 shows a typical decoder 200 block diagram. Decoder 200 has bitstream 210, entropy decoding 212, inverse quantization 214, inverse transform 216, adder 218, intra/inter mode selection 220, intra prediction 222, memory 230, in-loop filter 228, motion compensation 224, picture buffer 226, prediction related info 234, and video output 232.
  • Decoder 200 is similar to the reconstruction-related section residing in the encoder 100 of FIG. 1.
  • an incoming video bitstream 210 is first decoded through an Entropy Decoding 212 to derive quantized coefficient levels and prediction-related information.
  • the quantized coefficient levels are then processed through an Inverse Quantization 214 and an Inverse Transform 216 to obtain a reconstructed prediction residual.
  • a block predictor mechanism implemented in an Intra/inter Mode Selector 220, is configured to perform either an Intra Prediction 222 or a Motion Compensation 224, based on decoded prediction information.
  • a set of unfiltered reconstructed pixels is obtained by summing up the reconstructed prediction residual from the Inverse Transform 216 and a predictive output generated by the block predictor mechanism, using a summer 218.
  • the reconstructed block may further go through an In-Loop Filter 228 before it is stored in a Picture Buffer 226, which functions as a reference picture store.
  • the reconstructed video in the Picture Buffer 226 may be sent to drive a display device, as well as used to predict future video blocks.
  • a filtering operation is performed on these reconstructed pixels to derive a final reconstructed Video Output 232.
  • FIG. 2 gives a general block diagram of a block-based video decoder.
  • the video bit- stream is first entropy decoded at entropy decoding unit.
  • the coding mode and prediction information are sent to either the spatial prediction unit (if intra coded) or the temporal prediction unit (if inter-coded) to form the prediction block.
  • the residual transform coefficients are sent to the inverse quantization unit and inverse transform unit to reconstruct the residual block.
  • the prediction block and the residual block are then added together.
  • the reconstructed block may further go through in-loop filtering before it is stored in reference picture storage.
  • the reconstructed video in reference picture storage is then sent out to drive a display device, as well as used to predict future video blocks.
  • the basic intra prediction scheme applied in the VVC is kept the same as that of the HEVC, except that several modules are further extended and/or improved, e.g., matrix weighted intra prediction (MIP) coding mode, intra sub-partition (ISP) coding mode, extended intra prediction with wide-angle intra directions, position-dependent intra prediction combination (PDPC) and 4-tap intra interpolation.
  • MIP matrix weighted intra prediction
  • ISP intra sub-partition
  • PDPC position-dependent intra prediction combination
  • 4-tap intra interpolation 4-tap intra interpolation.
  • MIP matrix weighted intra prediction
  • ISP intra sub-partition
  • PDPC position-dependent intra prediction combination
  • NAL Network Abstraction Layer
  • a coded bitstream is partitioned into NAL units which, when conveyed over lossy packet networks, should be smaller than the maximum transfer unit size.
  • Each NAL unit consists of a NAL unit header followed by the NAL unit payload.
  • VVC In VVC, a two-byte NAL unit header was introduced with the anticipation that this design is sufficient to support future extensions.
  • the syntax and the associated semantic of the NAL unit header in the current VVC draft specification are illustrated in Table 1 and Table 2, respectively. How to read Table 1 is illustrated in the appendix section of this invention, which could also be found in the VVC specification.
  • Table 1 NAL unit header syntax Table 2.
  • NAL unit type codes and NAL unit type classes [0049] VVC inherits the parameter set concept of HEVC with a few modifications and additions.
  • Parameter sets can be either part of the video bitstream or can be received by a decoder through other means (including out-of-band transmission using a reliable channel, hard coding in encoder and decoder, and so on).
  • a parameter set contains an identification, which is referenced, directly or indirectly, from the slice header, as discussed in more detail later.
  • the referencing process is known as “activation.”
  • activation occurs per picture or per sequence.
  • the concept of activation through referencing was introduced, among other reasons, because implicit activation by virtue of the position of the information in the bitstream (as common for other syntax elements of a video codec) is not available in case of out-of-band transmission.
  • the video parameter set was introduced to convey information that is applicable to multiple layers as well as sub-layers.
  • the VPS was introduced to address these shortcomings as well as to enable a clean and extensible high-level design of multilayer codecs.
  • Each layer of a given video sequence regardless of whether they have the same or different sequence parameter sets (SPS), refers to the same VPS.
  • SPS sequence parameter sets
  • Table 4 The syntax and the associated semantic of the video parameter set in the current VVC draft specification are illustrated in Table 4 and Table 5, respectively. How to read Table 4 is illustrated in the appendix section of this invention, which could also be found in the VVC specification. Table 4.
  • Video parameter set RBSP syntax for( i 1; i ⁇ TotalNumOlss; i++ ) if( NumLayersIn01s[ i ] > 1 ) ols hrd idx[ i ] ue(v)
  • a VPS RBSP shall be available to the decoding process prior to it being referenced, included in at least one AU with Temporalld equal to 0 or provided through external means.
  • VPS NAL units with a particular value of vps_video_parameter_set_id in a CVS shall have the same content.
  • vps video parameter set id provides an identifier for the VPS for reference by other syntax elements.
  • the value of vps_video_parameter_set_id shall be greater than 0.
  • vps_max_layers_minusl plus 1 specifies the maximum allowed number of layers in each CV S referring to the VPS.
  • vps_max_sublayers_minusl plus 1 specifies the maximum number of temporal sublayers that may be present in a layer in each CVS referring to the VPS.
  • the value of vps max sublayers minusl shall be in the range of 0 to 6, inclusive.
  • vps_all_layers_same_num_sublayers_flag 1 specifies that the number of temporal sublayers is the same for all the layers in each CVS referring to the VPS.
  • vps_all_layers_same_num_sublayers_flag 0 specifies that the layers in each CVS referring to the VPS may or may not have the same number of temporal sublayers.
  • the value of vps_all_layers_same_num_sublayers_flag is inferred to be equal to 1.
  • vps_all_independent_layers_flag 1 specifies that all layers in the CVS are independently coded without using inter-layer prediction
  • vps all independent layers flag 0 specifies that one or more of the layers in the CVS may use inter-layer prediction. When not present, the value of vps all independent layers flag is inferred to be equal to 1.
  • vps_layer_id[ i ] specifies the nuh layer id value of the i-th layer. For any two non-negative integer values of m and n, when m is less than n, the value of vps_layer_id[ m ] shall be less than vps_layer_id[ n ].
  • vps_independent_layer_flag[ i ] 1 specifies that the layer with index i does not use inter-layer prediction.
  • vps_independent_layer_flag[ i ] 0 specifies that the layer with index i may use inter-layer prediction and the syntax elements vps_direct_ref_lay er_flag[ i ] [ j ] for j in the range of 0 to i - 1 , inclusive, are present in VPS .
  • the value of vps_independent_layer_flag[ i ] is inferred to be equal to 1.
  • vps_direct_ref_layer_flag[ i ][ j ] 0 specifies that the layer with index j is not a direct reference layer for the layer with index i.
  • vps direct ref layer flag [ i ] [ j ] 1 specifies that the layer with index j is a direct reference layer for the layer with index i.
  • vps_independent_layer_flag[ i ] When vps_independent_layer_flag[ i ] is equal to 0, there shall be at least one value of j in the range of 0 to i ⁇ 1, inclusive, such that the value of vps_direct_ref_layer_flag[ i ][ j ] is equal to 1.
  • max_tid_ref_present_flag[ i ] 1 specifies that the syntax element max_tid_il_ref_pics_plus1[ i ] is present.
  • max_tid_ref_present_flag[ i ] 0 specifies that the syntax element max_tid_il_ref_pics_plus1[ i ] is not present.
  • max_tid_il_ref_pics_plus1[ i ] 0 specifies that inter-layer prediction is not used by non-IRAP pictures of the i-th layer.
  • max_tid_il_ref_pics_plus1[ i ] greater than 0 specifies that, for decoding pictures of the i-th layer, no picture with TemporalId greater than max_tid_il_ref_pics_plus1[ i ] ⁇ 1 is used as ILRP.
  • the value of max tid il ref pics plus1[ i ] is inferred to be equal to 7.
  • each_layer_is_an_ols_flag 1 specifies that each OLS contains only one layer and each layer itself in a CVS referring to the VPS is an OLS with the single included layer being the only output layer.
  • each_layer_is_an_ols_flag 0 that an OLS may contain more than one layer. If vps_max_layers_minus1 is equal to 0, the value of each_layer_is_an_ols_flag is inferred to be equal to 1. Otherwise, when vps_all_independent_layers_flag is equal to 0, the value of each_layer_is_an_ols_flag is inferred to be equal to 0.
  • ols_mode_idc 0 specifies that the total number of OLSs specified by the VPS is equal to vps_max_layers_minus1 + 1, the i-th OLS includes the layers with layer indices from 0 to i, inclusive, and for each OLS only the highest layer in the OLS is output.
  • ols_mode_idc 1 specifies that the total number of OLSs specified by the VPS is equal to vps_max_layers_minus1 + 1, the i-th OLS includes the layers with layer indices from 0 to i, inclusive, and for each OLS all layers in the OLS are output.
  • ols_mode_idc 2 specifies that the total number of OLSs specified by the VPS is explicitly signalled and for each OLS the output layers are explicitly signalled and other layers are the layers that are direct or indirect reference layers of the output layers of the OLS.
  • the value of ols_mode_idc shall be in the range of 0 to 2, inclusive.
  • the value 3 of ols_mode_idc is reserved for future use by ITU-T
  • num_output_layer_sets_minus1 plus 1 specifies the total number of OLSs specified by the VPS when ols_mode_idc is equal to 2.
  • ols_output_layer_flag[ i ][ j ] 0 specifies that the layer with nuh_layer_id equal to vps_layer_id[ j ] is not an output layer of the i-th OLS when ols_mode_idc is equal to 2.
  • the lowest layer in each OLS shall be an independent layer.
  • vps_independent_layer_flag[ GeneralLayerIdx[ LayerIdInOls[ i ][ 0 ] ] ] shall be equal to 1.
  • Each layer shall be included in at least one OLS specified by the VPS.
  • nuh_layer_id nuhLayerId for each layer with a particular value of nuh_layer_id nuhLayerId equal to one of vps_layer_id[ k ] for k in the range of 0 to vps_max_layers_minus1, inclusive, there shall be at least one pair of values of i and j, where i is in the range of 0 to TotalNumOlss ⁇ 1, inclusive, and j is in the range of NumLayersInOls[ i ] ⁇ 1, inclusive, such that the value of LayerIdInOls[ i ][ j ] is equal to nuhLayerId.
  • vps_num_ptls_minus1 plus 1 specifies the number of profile_tier_level( ) syntax structures in the VPS.
  • the value of vps_num_ptls_minus1 shall be less than TotalNumOlss.
  • pt_present_flag[ i ] 1 specifies that profile, tier, and general constraints information are present in the i-th profile_tier_level( ) syntax structure in the VPS.
  • pt_present_flag[ i ] equal to 0 specifies that profile, tier, and general constraints information are not present in the i-th profile_tier_level( ) syntax structure in the VPS.
  • pt_present_flag[ 0 ] The value of pt_present_flag[ 0 ] is inferred to be equal to 1.
  • pt_present_flag[ i ] When pt_present_flag[ i ] is equal to 0, the profile, tier, and general constraints information for the i-th profile_tier_level( ) syntax structure in the VPS are inferred to be the same as that for the ( i ⁇ 1 )-th profile_tier_level( ) syntax structure in the VPS.
  • ptl_max_temporal_id[ i ] specifies the TemporalId of the highest sublayer representation for which the level information is present in the i-th profile_tier_level( ) syntax structure in the VPS.
  • the value of ptl_max_temporal_id[ i ] shall be in the range of 0 to vps_max_sublayers_minus1, inclusive.
  • vps_max_sublayers_minus1 When vps_max_sublayers_minus1 is equal to 0, the value of ptl_max_temporal_id[ i ] is inferred to be equal to 0.
  • vps_max_sublayers_minus1 is greater than 0 and vps_all_layers_same_num_sublayers_flag is equal to 1, the value of ptl_max_temporal_id[ i ] is inferred to be equal to vps_max_sublayers_minus1.
  • vps_ptl_alignment_zero_bit shall be equal to 0.
  • ols_ptl_idx[ i ] specifies the index, to the list of profile_tier_level( ) syntax structures in the VPS, of the profile_tier_level( ) syntax structure that applies to the i-th OLS.
  • the value of ols_ptl_idx[ i ] shall be in the range of 0 to vps_num_ptls_minus1, inclusive.
  • vps_num_ptls_minus1 is equal to 0, the value of ols_ptl_idx[ i ] is inferred to be equal to 0.
  • vps_num_dpb_params specifies the number of dpb_parameters( ) syntax strutcures in the VPS.
  • vps_num_dpb_params shall be in the range of 0 to 16, inclusive. When not present, the value of vps_num_dpb_params is inferred to be equal to 0.
  • vps_sublayer_dpb_params_present_flag is used to control the presence of max_dec_pic_buffering_minus1[ ], max_num_reorder_pics[ ], and max_latency_increase_plus1[ ] syntax elements in the dpb_parameters( ) syntax strucures in the VPS. When not present, vps_sub_dpb_params_info_present_flag is inferred to be equal to 0.
  • dpb_max_temporal_id[ i ] specifies the TemporalId of the highest sublayer representation for which the DPB parameters may be present in the i-th dpb_parameters( ) syntax strutcure in the VPS.
  • the value of dpb_max_temporal_id[ i ] shall be in the range of 0 to vps_max_sublayers_minus1, inclusive. When vps_max_sublayers_minus1 is equal to 0, the value of dpb_max_temporal_id[ i ] is inferred to be equal to 0.
  • vps max sublayers minus1 When vps max sublayers minus1 is greater than 0 and vps all layers same num sublayers flag is equal to 1, the value of dpb_max_temporal_id[ i ] is inferred to be equal to vps_max_sublayers_minus1.
  • ols_dpb_pic_width[ i ] specifies the width, in units of luma samples, of each picture storage buffer for the i-th OLS.
  • ols_dpb_pic_height[ i ] specifies the height, in units of luma samples, of each picture storage buffer for the i-th OLS.
  • ols_dpb_params_idx[ i ] specifies the index, to the list of dpb_parameters( ) syntax structures in the VPS, of the dpb_parameters( ) syntax structure that applies to the i-th OLS when NumLayersInOls[ i ] is greater than 1.
  • the value of ols_dpb_params_idx[ i ] shall be in the range of 0 to vps_num_dpb_params ⁇ 1, inclusive.
  • ols_dpb_params_idx[ i ] When ols_dpb_params_idx[ i ] is not present, the value of ols_dpb_params_idx[ i ] is inferred to be equal to 0.
  • NumLayersInOls[ i ] When NumLayersInOls[ i ] is equal to 1, the dpb_parameters( ) syntax structure that applies to the i-th OLS is present in the SPS referred to by the layer in the i-th OLS.
  • vps_general_hrd_params_present_flag 1 specifies that the syntax structure general_hrd_parameters( ) and other HRD parameters are present in the VPS RBSP syntax structure.
  • vps_general_hrd_params_present_flag 0 specifies that the syntax structure general_hrd_parameters( ) and other HRD parameters are not present in the VPS RBSP syntax structure. When not present, the value of vps_general_hrd_params_present_flag is inferred to be equal to 0. When NumLayersInOls[ i ] is equal to 1, the general_hrd_parameters( ) syntax structure that applies to the i-th OLS is present in the SPS referred to by the layer in the i-th OLS.
  • vps_sublayer_cpb_params_present_flag 1 specifies that the i-th ols_hrd_parameters( ) syntax structure in the VPS contains HRD parameters for the sublayer representations with TemporalId in the range of 0 to hrd_max_tid[ i ], inclusive.
  • vps_sublayer_cpb_params_present_flag 0 specifies that the i-th ols_hrd_parameters( ) syntax structure in the VPS contains HRD parameters for the sublayer representation with TemporalId equal to hrd_max_tid[ i ] only.
  • vps_max_sublayers_minus1 When vps_max_sublayers_minus1 is equal to 0, the value of vps_sublayer_cpb_params_present_flag is inferred to be equal to 0.
  • vps_sublayer_cpb_params_present_flag When vps_sublayer_cpb_params_present_flag is equal to 0, the HRD parameters for the sublayer representations with TemporalId in the range of 0 to hrd_max_tid[ i ] ⁇ 1, inclusive, are inferred to be the same as that for the sublayer representation with TemporalId equal to hrd_max_tid[ i ].
  • HRD parameters starting from the fixed_pic_rate_general_flag[ i ] syntax element till the sublayer_hrd_parameters( i ) syntax structure immediately under the condition "if( general_vcl_hrd_params_present_flag )" in the ols_hrd_parameters syntax structure.
  • num_ols_hrd_params_minus1 plus 1 specifies the number of ols_hrd_parameters( ) syntax structures present in the general_hrd_parameters( ) syntax structure when vps_general_hrd_params_present_flag is equal to 1.
  • hrd_max_tid[ i ] specifies the TemporalId of the highest sublayer representation for which the HRD parameters are contained in the i-th ols_hrd_parameters( ) syntax structure.
  • the value of hrd max tid[ i ] shall be in the range of 0 to vps max sublayers minus1, inclusive. When vps_max_sublayers_minus1 is equal to 0, the value of hrd_max_tid[ i ] is inferred to be equal to 0.
  • vps_max_sublayers_minus1 When vps_max_sublayers_minus1 is greater than 0 and vps_all_layers_same_num_sublayers_flag is equal to 1, the value of hrd_max_tid[ i ] is inferred to be equal to vps_max_sublayers_minus1.
  • ols_hrd_idx[ i ] specifies the index, to the list of ols_hrd_parameters( ) syntax structures in the VPS, of the ols_hrd_parameters( ) syntax structure that applies to the i-th OLS when NumLayersInOls[ i ] is greater than 1.
  • the value of ols_hrd_idx[[ i ] shall be in the range of 0 to num_ols_hrd_params_minus1, inclusive.
  • NumLayersInOls[ i ] is equal to 1
  • the ols_hrd_parameters( ) syntax structure that applies to the i-th OLS is present in the SPS referred to by the layer in the i-th OLS. If the value of num_ols_hrd_param_minus1 + 1 is equal to TotalNumOlss, the value of ols_hrd_idx[ i ] is inferred to be equal to i.
  • vps_extension_flag 0 specifies that no vps_extension_data_flag syntax elements are present in the VPS RBSP syntax structure.
  • vps_extension_flag 1 specifies that there are vps_extension_data_flag syntax elements present in the VPS RBSP syntax structure.
  • vps_extension_data_flag may have any value.
  • SPSs contain information which applies to all slices of a coded video sequence.
  • a coded video sequence starts from an instantaneous decoding refresh (IDR) picture, or a BLA picture, or a CRA picture that is the first picture in the bitstream and includes all subsequent pictures that are not an IDR or BLA picture.
  • a bitstream consists of one or more coded video sequences.
  • the content of the SPS can be roughly subdivided into six categories: 1) a self-reference (its own ID); 2) decoder operation point related information (profile, level, picture size, number sub-layers, and so on); 3) enabling flags for certain tools within a profile, and associated coding tool parameters in case the tool is enabled; 4) information restricting the flexibility of structures and transform coefficient coding; 5) temporal scalability control; and 6) visual usability information (VUI), which includes HRD information.
  • VUI visual usability information
  • Sequence parameter set RBSP semantics An SPS RBSP shall be available to the decoding process prior to it being referenced, included in at least one AU with TemporalId equal to 0 or provided through external means. All SPS NAL units with a particular value of sps_seq_parameter_set_id in a CVS shall have the same content. sps_seq_parameter_set_id provides an identifier for the SPS for reference by other syntax elements. SPS NAL units, regardless of the nuh_layer_id values, share the same value space of sps_seq_parameter_set_id.
  • spsLayerId be the value of the nuh_layer_id of a particular SPS NAL unit
  • vclLayerId be the value of the nuh_layer_id of a particular VCL NAL unit.
  • the particular VCL NAL unit shall not refer to the particular SPS NAL unit unless spsLayerId is less than or equal to vclLayerId and the layer with nuh_layer_id equal to spsLayerId is included in at least one OLS that includes the layer with nuh_layer_id equal to vclLayerId.
  • sps_video_parameter_set_id when greater than 0, specifies the value of vps_video_parameter_set_id for the VPS referred to by the SPS.
  • sps_video_parameter_set_id when greater than 0, specifies the value of vps_video_parameter_set_id for the VPS referred to by the SPS.
  • sps_video_parameter_set_id when greater than 0, specifies the value of vps_video_parameter_set_id for the VPS referred to by the SPS.
  • sps_video_parameter_set_id when sps_video_parameter_set_id is equal to 0, the following applies: – The SPS does not refer to a VPS. – No VPS is referred to when decoding each CLVS referring to the SPS. – The value of vps_max_layers_minus1 is inferred to be equal to 0. –
  • sps_video_parameter_set_id shall be the same in all SPSs that are referred to by CLVSs in a CVS.
  • sps_max_sublayers_minus1 plus 1 specifies the maximum number of temporal sublayers that may be present in each CLVS referring to the SPS.
  • the value of sps_max_sublayers_minus1 shall be in the range of 0 to vps_max_sublayers_minus1, inclusive.
  • sps_reserved_zero_4bits shall be equal to 0 in bitstreams conforming to this version of this Specification.
  • sps_ptl_dpb_hrd_params_present_flag 1 specifies that a profile_tier_level( ) syntax structure and a dpb_parameters( ) syntax structure are present in the SPS, and a general_hrd_parameters( ) syntax structure and an ols_hrd_parameters( ) syntax structure may also be present in the SPS.
  • sps_ptl_dpb_hrd_params_present_flag 0 specifies that none of these four syntax structures is present in the SPS.
  • sps_ptl_dpb_hrd_params_present_flag shall be equal to vps_independent_layer_flag[ GeneralLayerIdx[ nuh_layer_id ] ].
  • gdr_enabled_flag 1 specifies that GDR pictures may be present in CLVSs referring to the SPS.
  • gdr_enabled_flag 0 specifies that GDR pictures are not present in CLVSs referring to the SPS.
  • chroma_format_idc specifies the chroma sampling relative to the luma sampling as specified in clause 6.2.
  • separate_colour_plane_flag 1 specifies that the three colour components of the 4:4:4 chroma format are coded separately.
  • separate_colour_plane_flag 0 specifies that the colour components are not coded separately.
  • separate_colour_plane_flag not present, it is inferred to be equal to 0.
  • separate_colour_plane_flag 1
  • the coded picture consists of three separate components, each of which consists of coded samples of one colour plane (Y, Cb, or Cr) and uses the monochrome coding syntax. In this case, each colour plane is associated with a specific colour_plane_id value.
  • the decoding process of a monochrome picture with one value of colour_plane_id does not use any data from monochrome pictures having different values of colour_plane_id for inter prediction.
  • the value of the variable ChromaArrayType is assigned as follows: – If separate_colour_plane_flag is equal to 0, ChromaArrayType is set equal to chroma_format_idc. – Otherwise (separate_colour_plane_flag is equal to 1), ChromaArrayType is set equal to 0. res_change_in_clvs_allowed_flag equal to 1 specifies that the picture spatial resolution may change within a CLVS referring to the SPS.
  • pic_width_max_in_luma_samples specifies the maximum width, in units of luma samples, of each decoded picture referring to the SPS. pic_width_max_in_luma_samples shall not be equal to 0 and shall be an integer multiple of Max( 8, MinCbSizeY ).
  • pic_width_max_in_luma_samples specifies the maximum height, in units of luma samples, of each decoded picture referring to the SPS. pic_height_max_in_luma_samples shall not be equal to 0 and shall be an integer multiple of Max( 8, MinCbSizeY ).
  • sps_conformance_window_flag 1 indicates that the conformance cropping window offset parameters follow next in the SPS.
  • sps_conformance_window_flag 0 indicates that the conformance cropping window offset parameters are not present in the SPS.
  • sps_conf_win_left_offset, sps_conf_win_right_offset, sps_conf_win_top_offset, and sps_conf_win_bottom_offset specify the cropping window that is applied to pictures with pic_width_in_luma_samples equal to pic_width_max_in_luma_samples and pic_height_in_luma_samples equal to pic_height_max_in_luma_samples.
  • sps_conformance_window_flag When sps_conformance_window_flag is equal to 0, the values of sps_conf_win_left_offset, sps_conf_win_right_offset, sps_conf_win_top_offset, and sps_conf_win_bottom_offset are inferred to be equal to 0.
  • the conformance cropping window contains the luma samples with horizontal picture coordinates from SubWidthC * sps_conf_win_left_offset to pic_width_max_in_luma_samples ⁇ ( SubWidthC * sps_conf_win_right_offset + 1 ) and vertical picture coordinates from SubHeightC * sps_conf_win_top_offset to pic_height_max_in_luma_samples ⁇ ( SubHeightC * sps_conf_win_bottom_offset + 1 ), inclusive.
  • SubWidthC * ( sps_conf_win_left_offset + sps_conf_win_right_offset ) shall be less than pic_width_max_in_luma_samples
  • SubHeightC * ( sps_conf_win_top_offset + sps_conf_win_bottom_offset ) shall be less than pic_height_max_in_luma_samples.
  • ChromaArrayType When ChromaArrayType is not equal to 0, the corresponding specified samples of the two chroma arrays are the samples having picture coordinates ( x / SubWidthC, y / SubHeightC ), where ( x, y ) are the picture coordinates of the specified luma samples.
  • sps_log2_ctu_size_minus5 plus 5 specifies the luma coding tree block size of each CTU. The value of sps_log2_ctu_size_minus5 shall be in the range of 0 to 2, inclusive.
  • CtbLog2SizeY sps_log2_ctu_size_minus5 + 5 (43)
  • CtbSizeY 1 ⁇ CtbLog2SizeY (44)
  • subpic_info_present_flag 1 specifies that subpicture information is present for the CLVS and there may be one or more than one subpicture in each picture of the CLVS.
  • subpic_info_present_flag 0 specifies that subpicture information is not present for the CLVS and there is only one subpicture in each picture of the CLVS.
  • res_change_in_clvs_allowed_flag 1
  • the value of subpic_info_present_flag shall be equal to 0.
  • a bitstream is the result of a sub-bitstream extraction process and contains only a subset of the subpictures of the input bitstream to the sub-bitstream extraction process, it might be required to set the value of subpic_info_present_flag equal to 1 in the RBSP of the SPSs.
  • sps_num_subpics_minus1 plus 1 specifies the number of subpictures in each picture in the CLVS.
  • the value of sps_num_subpics_minus1 shall be in the range of 0 to Ceil( pic_width_max_in_luma_samples ⁇ CtbSizeY ) * Ceil( pic_height_max_in_luma_samples ⁇ CtbSizeY ) ⁇ 1, inclusive.
  • the value of sps_num_subpics_minus1 is inferred to be equal to 0.
  • sps_independent_subpics_flag 1 specifies that no intra prediction, no inter prediction and no in-loop filtering operations may be performed across any subpicture boundary in the CLVS.
  • sps_independent_subpics_flag 0 specifies that inter prediction or in-loop filtering operations across the subpicture boundaries in the CLVS may be allowed.
  • the value of sps_independent_subpics_flag is inferred to be equal to 0.
  • subpic_ctu_top_left_x[ i ] specifies horizontal position of top left CTU of i-th subpicture in unit of CtbSizeY.
  • the length of the syntax element is Ceil( Log2( ( pic_width_max_in_luma_samples + CtbSizeY ⁇ 1 ) >> CtbLog2SizeY ) ) bits.
  • the value of subpic_ctu_top_left_x[ i ] is inferred to be equal to 0.
  • subpic_ctu_top_left_y[ i ] specifies vertical position of top left CTU of i-th subpicture in unit of CtbSizeY.
  • the length of the syntax element is Ceil( Log2( ( pic_height_max_in_luma_samples + CtbSizeY ⁇ 1 ) >> CtbLog2SizeY ) ) bits.
  • subpic_ctu_top_left_y[ i ] When not present, the value of subpic_ctu_top_left_y[ i ] is inferred to be equal to 0.
  • subpic_width_minus1[ i ] plus 1 specifies the width of the i-th subpicture in units of CtbSizeY.
  • the length of the syntax element is Ceil( Log2( ( pic_width_max_in_luma_samples + CtbSizeY ⁇ 1 ) >> CtbLog2SizeY ) ) bits.
  • subpic_width_minus1[ i ] When not present, the value of subpic_width_minus1[ i ] is inferred to be equal to ( ( pic_width_max_in_luma_samples + CtbSizeY ⁇ 1 ) >> CtbLog2SizeY ) ⁇ subpic_ctu_t op_left_x[ i ] ⁇ 1.
  • subpic_height_minus1[ i ] plus 1 specifies the height of the i-th subpicture in units of CtbSizeY.
  • the length of the syntax element is Ceil( Log2( ( pic_height_max_in_luma_samples + CtbSizeY ⁇ 1 ) >> CtbLog2SizeY ) ) bits.
  • subpic_height_minus1[ i ] When not present, the value of subpic_height_minus1[ i ] is inferred to be equal to ( ( pic_height_max_in_luma_samples + CtbSizeY ⁇ 1 ) >> CtbLog2SizeY ) ⁇ subpic_ctu_ top_left_y[ i ] ⁇ 1.
  • subpic_treated_as_pic_flag[ i ] equal to 1 specifies that the i-th subpicture of each coded picture in the CLVS is treated as a picture in the decoding process excluding in-loop filtering operations.
  • subpic_treated_as_pic_flag[ i ] 0 specifies that the i-th subpicture of each coded picture in the CLVS is not treated as a picture in the decoding process excluding in-loop filtering operations.
  • the value of subpic_treated_as_pic_flag[ i ] is inferred to be equal to sps_independent_subpics_flag.
  • All the SPSs referred to by the output layer and its reference layers shall have the same value of sps_num_subpics_minus1 and shall have the same values of subpic_ctu_top_left_x[ j ], subpic_ctu_top_left_y[ j ], subpic_width_minus1[ j ], subpic_height_minus1[ j ], and loop_filter_across_subpic_enabled_flag[ j ], respectively, for each value of j in the range of 0 to sps num subpics minus1, inclusive.
  • All pictures in each access unit in the output layer and its reference layers shall have the same value of SubpicIdVal[ j ] for each value of j in the range of 0 to sps_num_subpics_minus1, inclusive.
  • loop_filter_across_subpic_enabled_flag[ i ] 1 specifies that in-loop filtering operations may be performed across the boundaries of the i-th subpicture in each coded picture in the CLVS.
  • loop_filter_across_subpic_enabled_flag[ i ] equal to 0 specifies that in- loop filtering operations are not performed across the boundaries of the i-th subpicture in each coded picture in the CLVS.
  • loop_filter_across_subpic_enabled_pic_flag[ i ] is inferred to be equal to 1 ⁇ sps_independent_subpics_flag. It is a requirement of bitstream conformance that the shapes of the subpictures shall be such that each subpicture, when decoded, shall have its entire left boundary and entire top boundary consisting of picture boundaries or consisting of boundaries of previously decoded subpictures.
  • sps_subpic_id_len_minus1 plus 1 specifies the number of bits used to represent the syntax element sps_subpic_id[ i ], the syntax elements pps_subpic_id[ i ], when present, and the syntax element slice_subpic_id, when present.
  • the value of sps_subpic_id_len_minus1 shall be in the range of 0 to 15, inclusive.
  • the value of 1 ⁇ ( sps_subpic_id_len_minus1 + 1 ) shall be greater than or equal to sps_num_subpics_minus1 + 1.
  • subpic_id_mapping_explicitly_signalled_flag 1 specifies that the subpicture ID mapping is explicitly signalled, either in the SPS or in the PPSs referred to by coded pictures of the CLVS.
  • subpic_id_mapping_explicitly_signalled_flag 0 specifies that the subpicture ID mapping is not explicitly signalled for the CLVS.
  • the value of subpic_id_mapping_explicitly_signalled_flag is inferred to be equal to 0.
  • subpic_id_mapping_in_sps_flag 1 specifies that the subpicture ID mapping is signalled in the SPS when subpic_id_mapping_explicitly_signalled_flag is equal to 1.
  • subpic_id_mapping_in_sps_flag 0 specifies that subpicture ID mapping is signalled in the PPSs referred to by coded pictures of the CLVS when subpic_id_mapping_explicitly_signalled_flag is equal to 1.
  • sps_subpic_id[ i ] specifies the subpicture ID of the i-th subpicture.
  • sps_entropy_coding_sync_enabled_flag 1 specifies that a specific synchronization process for context variables is invoked before decoding the CTU that includes the first CTB of a row of CTBs in each tile in each picture referring to the SPS, and a specific storage process for context variables is invoked after decoding the CTU that includes the first CTB of a row of CTBs in each tile in each picture referring to the SPS.
  • sps_entropy_coding_sync_enabled_flag 0 specifies that no specific synchronization process for context variables is required to be invoked before decoding the CTU that includes the first CTB of a row of CTBs in each tile in each picture referring to the SPS, and no specific storage process for context variables is required to be invoked after decoding the CTU that includes the first CTB of a row of CTBs in each tile in each picture referring to the SPS.
  • sps_wpp_entry_point_offsets_present_flag 1 specifies that signalling for entry point offsets for CTU rows may be present in the slice headers of pictures referring to the SPS when sps_entropy_coding_sync_enabled_flag is equal to 1.
  • sps_wpp_entry_point_offsets_present_flag 0 specifies that signalling for entry point offsets for CTU rows are not present in the slice headers of pictures referring to the SPS. When not present, the value of sps_wpp_entry_point_offsets_present_flag is inferred to be equal to 0.
  • sps_weighted_pred_flag 1 specifies that weighted prediction may be applied to P slices referring to the SPS.
  • sps_weighted_pred_flag 0 specifies that weighted prediction is not applied to P slices referring to the SPS.
  • sps_weighted_bipred_flag 1 specifies that explicit weighted prediction may be applied to B slices referring to the SPS.
  • sps_weighted_bipred_flag 0 specifies that explicit weighted prediction is not applied to B slices referring to the SPS.
  • sps_poc_msb_flag 1 specifies that the ph_poc_msb_present_flag syntax element is present in PHs referring to the SPS.
  • sps_poc_msb_flag 0 specifies that the ph_poc_msb_present_flag syntax element is not present in PHs referring to the SPS.
  • poc_msb_len_minus1 plus 1 specifies the length, in bits, of the poc_msb_val syntax elements, when present in the PHs referring to the SPS.
  • the value of poc_msb_len_minus1 shall be in the range of 0 to 32 ⁇ log2_max_pic_order_cnt_lsb_minus4 ⁇ 5, inclusive.
  • num_extra_ph_bits_bytes specifies the number of bytes of extra bits in the PH syntax structure for coded pictures referring to the SPS.
  • num_extra_ph_bits_bytes shall be equal to 0 in bitstreams conforming to this version of this Specification. Although the value of num_extra_ph_bits_bytes is required to be equal to 0 in this version of this Specification, decoder conforming to this version of this Specification shall allow the value of num_extra_ph_bits_bytes equal to 1 or 2 to appear in the syntax. num_extra_sh_bits_bytes specifies the number of bytes of extra bits in the slice headers for coded pictures referring to the SPS. The value of num_extra_sh_bits_bytes shall be equal to 0 in bitstreams conforming to this version of this Specification.
  • sps_sublayer_dpb_params_flag is used to control the presence of max dec pic buffering minus1[ i ], max num reorder pics[ i ], and max_latency_increase_plus1[ i ] syntax elements in the dpb_parameters( ) syntax strucure in the SPS.
  • sps_sub_dpb_params_info_present_flag When not present, the value of sps_sub_dpb_params_info_present_flag is inferred to be equal to 0.
  • long_term_ref_pics_flag 0 specifies that no LTRP is used for inter prediction of any coded picture in the CLVS.
  • long_term_ref_pics_flag 1 specifies that LTRPs may be used for inter prediction of one or more coded pictures in the CLVS.
  • inter_layer_ref_pics_present_flag 0 specifies that no ILRP is used for inter prediction of any coded picture in the CLVS.
  • inter_layer_ref_pics_present_flag 1 specifies that ILRPs may be used for inter prediction of one or more coded pictures in the CLVS.
  • sps_video_parameter_set_id 0
  • the value of inter_layer_ref_pics_present_flag is inferred to be equal to 0.
  • vps_independent_layer_flag[ GeneralLayerIdx[ nuh_layer_id ] ] is equal to 1
  • the value of inter_layer_ref_pics_present_flag shall be equal to 0.
  • sps_idr_rpl_present_flag 1 specifies that reference picture list syntax elements are present in slice headers of IDR pictures.
  • sps_idr_rpl_present_flag 0 specifies that reference picture list syntax elements are not present in slice headers of IDR pictures.
  • rpl1_same_as_rpl0_flag 1 specifies that the syntax element num_ref_pic_lists_in_sps[ 1 ] and the syntax structure ref_pic_list_struct( 1, rplsIdx ) are not present and the following applies: – The value of num_ref_pic_lists_in_sps[ 1 ] is inferred to be equal to the value of num_ref_pic_lists_in_sps[ 0 ].
  • each of syntax elements in ref_pic_list_struct( 1, rplsIdx ) is inferred to be equal to the value of corresponding syntax element in ref_pic_list_struct( 0, rplsIdx ) for rplsIdx ranging from 0 to num_ref_pic_lists_in_sps[ 0 ] ⁇ 1.
  • num_ref_pic_lists_in_sps[ i ] specifies the number of the ref_pic_list_struct( listIdx, rplsIdx ) syntax structures with listIdx equal to i included in the SPS.
  • num_ref_pic_lists_in_sps[ i ] shall be in the range of 0 to 64, inclusive. NOTE 4 – For each value of listIdx (equal to 0 or 1), a decoder should allocate memory for a total number of num_ref_pic_lists_in_sps[ i ] + 1 ref_pic_list_struct( listIdx, rplsIdx ) syntax structures since there may be one ref_pic_list_struct( listIdx, rplsIdx ) syntax structure directly signalled in the slice headers of a current picture.
  • qtbtt_dual_tree_intra_flag 1 specifies that, for I slices, each CTU is split into coding units with 64 ⁇ 64 luma samples using an implicit quadtree split, and these coding units are the root of two separate coding_tree syntax structure for luma and chroma.
  • qtbtt_dual_tree_intra_flag 0 specifies separate coding_tree syntax structure is not used for I slices.
  • log2_min_luma_coding_block_size_minus2 plus 2 specifies the minimum luma coding block size.
  • MinCbLog2SizeY log2_min_luma_coding_block_size_minus2 + 2 (48)
  • MinCbSizeY 1 ⁇ MinCbLog2SizeY (49)
  • IbcBufWidthY 256 * 128 / CtbSizeY (50)
  • IbcBufWidthC IbcBufWidthY / SubWidthC (51)
  • VSize Min( 64, CtbSizeY ) (52)
  • the value of MinCbSizeY shall less than or equal to VSize.
  • CtbWidthC and CtbHeightC which specify the width and height, respectively, of the array for each chroma CTB, are derived as follows: – If chroma_format_idc is equal to 0 (monochrome) or separate_colour_plane_flag is equal to 1, CtbWidthC and CtbHeightC are both equal to 0.
  • partition_constraints_override_enabled_flag 0 specifies the absence of partition_constraints_override_flag in PHs referring to the SPS.
  • sps_log2_diff_min_qt_min_cb_intra_slice_luma specifies the default difference between the base 2 logarithm of the minimum size in luma samples of a luma leaf block resulting from quadtree splitting of a CTU and the base 2 logarithm of the minimum coding block size in luma samples for luma CUs in slices with slice_type equal to 2 (I) referring to the SPS.
  • partition_constraints_override_enabled_flag When partition_constraints_override_enabled_flag is equal to 1, the default difference can be overridden by ph_log2_diff_min_qt_min_cb_luma present in PHs referring to the SPS.
  • the value of sps_log2_diff_min_qt_min_cb_intra_slice_luma shall be in the range of 0 to CtbLog2SizeY ⁇ MinCbLog2SizeY, inclusive.
  • MinQtLog2SizeIntraY sps_log2_diff_min_qt_min_cb_intra_slice_luma + MinCbLog2SizeY (55)
  • sps_max_mtt_hierarchy_depth_intra_slice_luma specifies the default maximum hierarchy depth for coding units resulting from multi-type tree splitting of a quadtree leaf in slices with slice_type equal to 2 (I) referring to the SPS.
  • partition_constraints_override_enabled_flag When partition_constraints_override_enabled_flag is equal to 1, the default maximum hierarchy depth can be overridden by ph_max_mtt_hierarchy_depth_intra_slice_luma present in PHs referring to the SPS.
  • the value of sps_max_mtt_hierarchy_depth_intra_slice_luma shall be in the range of 0 to 2*( CtbLog2SizeY ⁇ MinCbLog2SizeY ), inclusive.
  • sps_log2_diff_max_bt_min_qt_intra_slice_luma specifies the default difference between the base 2 logarithm of the maximum size (width or height) in luma samples of a luma coding block that can be split using a binary split and the minimum size (width or height) in luma samples of a luma leaf block resulting from quadtree splitting of a CTU in slices with slice_type equal to 2 (I) referring to the SPS.
  • partition_constraints_override_enabled_flag When partition_constraints_override_enabled_flag is equal to 1, the default difference can be overridden by ph_log2_diff_max_bt_min_qt_luma present in PHs referring to the SPS.
  • the value of sps_log2_diff_max_bt_min_qt_intra_slice_luma shall be in the range of 0 to CtbLog2SizeY ⁇ MinQtLog2SizeIntraY, inclusive.
  • sps_log2_diff_max_bt_min_qt_intra_slice_luma When sps_log2_diff_max_bt_min_qt_intra_slice_luma is not present, the value of sps_log2_diff_max_bt_min_qt_intra_slice_luma is inferred to be equal to 0.
  • sps_log2_diff_max_tt_min_qt_intra_slice_luma specifies the default difference between the base 2 logarithm of the maximum size (width or height) in luma samples of a luma coding block that can be split using a ternary split and the minimum size (width or height) in luma samples of a luma leaf block resulting from quadtree splitting of a CTU in slices with slice_type equal to 2 (I) referring to the SPS.
  • partition_constraints_override_enabled_flag When partition_constraints_override_enabled_flag is equal to 1, the default difference can be overridden by ph_log2_diff_max_tt_min_qt_luma present in PHs referring to the SPS.
  • the value of sps_log2_diff_max_tt_min_qt_intra_slice_luma shall be in the range of 0 to CtbLog2SizeY ⁇ MinQtLog2SizeIntraY, inclusive.
  • sps_log2_diff_min_qt_min_cb_inter_slice specifies the default difference between the base 2 logarithm of the minimum size in luma samples of a luma leaf block resulting from quadtree splitting of a CTU and the base 2 logarithm of the minimum luma coding block size in luma samples for luma CUs in slices with slice_type equal to 0 (B) or 1 (P) referring to the SPS.
  • partition_constraints_override_enabled_flag is equal to 1, the default difference can be overridden by ph_log2_diff_min_qt_min_cb_luma present in PHs referring to the SPS.
  • sps_log2_diff_min_qt_min_cb_inter_slice shall be in the range of 0 to CtbLog2SizeY ⁇ MinCbLog2SizeY, inclusive.
  • sps_max_mtt_hierarchy_depth_inter_slice specifies the default maximum hierarchy depth for coding units resulting from multi-type tree splitting of a quadtree leaf in slices with slice_type equal to 0 (B) or 1 (P) referring to the SPS.
  • partition_constraints_override_enabled_flag When partition_constraints_override_enabled_flag is equal to 1, the default maximum hierarchy depth can be overridden by ph_max_mtt_hierarchy_depth_inter_slice present in PHs referring to the SPS.
  • the value of sps_max_mtt_hierarchy_depth_inter_slice shall be in the range of 0 to 2*( CtbLog2SizeY ⁇ MinCbLog2SizeY ), inclusive.
  • sps_log2_diff_max_bt_min_qt_inter_slice specifies the default difference between the base 2 logarithm of the maximum size (width or height) in luma samples of a luma coding block that can be split using a binary split and the minimum size (width or height) in luma samples of a luma leaf block resulting from quadtree splitting of a CTU in slices with slice_type equal to 0 (B) or 1 (P) referring to the SPS.
  • partition_constraints_override_enabled_flag When partition_constraints_override_enabled_flag is equal to 1, the default difference can be overridden by ph_log2_diff_max_bt_min_qt_luma present in PHs referring to the SPS.
  • the value of sps_log2_diff_max_bt_min_qt_inter_slice shall be in the range of 0 to CtbLog2SizeY ⁇ MinQtLog2SizeInterY, inclusive.
  • sps_log2_diff_max_tt_min_qt_inter_slice specifies the default difference between the base 2 logarithm of the maximum size (width or height) in luma samples of a luma coding block that can be split using a ternary split and the minimum size (width or height) in luma samples of a luma leaf block resulting from quadtree splitting of a CTU in slices with slice_type equal to 0 (B) or 1 (P) referring to the SPS.
  • partition_constraints_override_enabled_flag When partition_constraints_override_enabled_flag is equal to 1, the default difference can be overridden by ph_log2_diff_max_tt_min_qt_luma present in PHs referring to the SPS.
  • the value of sps_log2_diff_max_tt_min_qt_inter_slice shall be in the range of 0 to CtbLog2SizeY ⁇ MinQtLog2SizeInterY, inclusive.
  • sps_log2_diff_min_qt_min_cb_intra_slice_chroma specifies the default difference between the base 2 logarithm of the minimum size in luma samples of a chroma leaf block resulting from quadtree splitting of a chroma CTU with treeType equal to DUAL_TREE_CHROMA and the base 2 logarithm of the minimum coding block size in luma samples for chroma CUs with treeType equal to DUAL_TREE_CHROMA in slices with slice_type equal to 2 (I) referring to the SPS.
  • partition_constraints_override_enabled_flag When partition_constraints_override_enabled_flag is equal to 1, the default difference can be overridden by ph_log2_diff_min_qt_min_cb_chroma present in PHs referring to the SPS.
  • the value of sps_log2_diff_min_qt_min_cb_intra_slice_chroma shall be in the range of 0 to CtbLog2SizeY ⁇ MinCbLog2SizeY, inclusive.
  • the value of sps_log2_diff_min_qt_min_cb_intra_slice_chroma is inferred to be equal to 0.
  • MinQtLog2SizeIntraC sps_log2_diff_min_qt_min_cb_intra_slice_chroma + MinCbLog2SizeY (57)
  • sps_max_mtt_hierarchy_depth_intra_slice_chroma specifies the default maximum hierarchy depth for chroma coding units resulting from multi-type tree splitting of a chroma quadtree leaf with treeType equal to DUAL_TREE_CHROMA in slices with slice_type equal to 2 (I) referring to the SPS.
  • partition_constraints_override_enabled_flag When partition_constraints_override_enabled_flag is equal to 1, the default maximum hierarchy depth can be overridden by ph_max_mtt_hierarchy_depth_chroma present in PHs referring to the SPS.
  • the value of sps_max_mtt_hierarchy_depth_intra_slice_chroma shall be in the range of 0 to 2*( CtbLog2SizeY ⁇ MinCbLog2SizeY ), inclusive.
  • the value of sps_max_mtt_hierarchy_depth_intra_slice_chroma is inferred to be equal to 0.
  • sps_log2_diff_max_bt_min_qt_intra_slice_chroma specifies the default difference between the base 2 logarithm of the maximum size (width or height) in luma samples of a chroma coding block that can be split using a binary split and the minimum size (width or height) in luma samples of a chroma leaf block resulting from quadtree splitting of a chroma CTU with treeType equal to DUAL_TREE_CHROMA in slices with slice_type equal to 2 (I) referring to the SPS.
  • partition_constraints_override_enabled_flag When partition_constraints_override_enabled_flag is equal to 1, the default difference can be overridden by ph_log2_diff_max_bt_min_qt_chroma present in PHs referring to the SPS.
  • the value of sps_log2_diff_max_bt_min_qt_intra_slice_chroma shall be in the range of 0 to CtbLog2SizeY ⁇ MinQtLog2SizeIntraC, inclusive.
  • sps_log2_diff_max_bt_min_qt_intra_slice_chroma When sps_log2_diff_max_bt_min_qt_intra_slice_chroma is not present, the value of sps_log2_diff_max_bt_min_qt_intra_slice_chroma is inferred to be equal to 0.
  • sps_log2_diff_max_tt_min_qt_intra_slice_chroma specifies the default difference between the base 2 logarithm of the maximum size (width or height) in luma samples of a chroma coding block that can be split using a ternary split and the minimum size (width or height) in luma samples of a chroma leaf block resulting from quadtree splitting of a chroma CTU with treeType equal to DUAL_TREE_CHROMA in slices with slice_type equal to 2 (I) referring to the SPS.
  • partition_constraints_override_enabled_flag When partition_constraints_override_enabled_flag is equal to 1, the default difference can be overridden by ph_log2_diff_max_tt_min_qt_chroma present in PHs referring to the SPS.
  • the value of sps_log2_diff_max_tt_min_qt_intra_slice_chroma shall be in the range of 0 to CtbLog2SizeY ⁇ MinQtLog2SizeIntraC, inclusive.
  • sps_log2_diff_max_tt_min_qt_intra_slice_chroma When sps_log2_diff_max_tt_min_qt_intra_slice_chroma is not present, the value of sps_log2_diff_max_tt_min_qt_intra_slice_chroma is inferred to be equal to 0.
  • sps_max_luma_transform_size_64_flag 1 specifies that the maximum transform size in luma samples is equal to 64.
  • sps_max_luma_transform_size_64_flag 0 specifies that the maximum transform size in luma samples is equal to 32.
  • MinTbLog2SizeY 2 (58)
  • MaxTbLog2SizeY sps_max_luma_transform_size_64_flag ?
  • MinTbSizeY 1 ⁇ MinTbLog2SizeY
  • MaxTbSizeY 1 ⁇ MaxTbLog2SizeY
  • sps_joint_cbcr_enabled_flag 0 specifies that the joint coding of chroma residuals is disabled.
  • sps_joint_cbcr_enabled_flag 1 specifies that the joint coding of chroma residuals is enabled.
  • the value of sps_joint_cbcr_enabled_flag is inferred to be equal to 0.
  • same_qp_table_for_chroma 1 specifies that only one chroma QP mapping table is signalled and this table applies to Cb and Cr residuals and additionally to joint Cb-Cr residuals when sps_joint_cbcr_enabled_flag is equal to 1.
  • same_qp_table_for_chroma 0 specifies that chroma QP mapping tables, two for Cb and Cr, and one additional for joint Cb-Cr when sps_joint_cbcr_enabled_flag is equal to 1, are signalled in the SPS.
  • qp_table_start_minus26[ i ] plus 26 specifies the starting luma and chroma QP used to describe the i-th chroma QP mapping table.
  • the value of qp_table_start_minus26[ i ] shall be in the range of ⁇ 26 ⁇ QpBdOffset to 36 inclusive.
  • qp_table_start_minus26[ i ] is not present in the bitstream, the value of qp_table_start_minus26[ i ] is inferred to be equal to 0.
  • num_points_in_qp_table_minus1[ i ] plus 1 specifies the number of points used to describe the i-th chroma QP mapping table.
  • the value of num_points_in_qp_table_minus1[ i ] shall be in the range of 0 to 63 + QpBdOffset, inclusive.
  • num_points_in_qp_table_minus1[ 0 ] is not present in the bitstream, the value of num_points_in_qp_table_minus1[ 0 ] is inferred to be equal to 0.
  • delta_qp_in_val_minus1[ i ][ j ] specifies a delta value used to derive the input coordinate of the j-th pivot point of the i-th chroma QP mapping table.
  • delta_qp_in_val_minus1[ 0 ][ j ] is not present in the bitstream, the value of delta_qp_in_val_minus1[ 0 ][ j ] is inferred to be equal to 0.
  • delta_qp_diff_val[ i ][ j ] specifies a delta value used to derive the output coordinate of the j-th pivot point of the i-th chroma QP mapping table.
  • sps_sao_enabled_flag 1 specifies that the sample adaptive offset process is applied to the reconstructed picture after the deblocking filter process.
  • sps_sao_enabled_flag 0 specifies that the sample adaptive offset process is not applied to the reconstructed picture after the deblocking filter process.
  • sps_alf_enabled_flag 0 specifies that the adaptive loop filter is disabled.
  • sps_alf_enabled_flag 1 specifies that the adaptive loop filter is enabled.
  • sps_ccalf_enabled_flag 0 specifies that the cross-component adaptive loop filter is disabled.
  • sps_ccalf_enabled_flag 1 specifies that the cross-component adaptive loop filter may be enabled.
  • sps_transform_skip_enabled_flag equa to 1 specifies that transform_skip_flag may be present in the transform unit syntax.
  • sps_transform_skip_enabled_flag 0 specifies that transform_skip_flag is not present in the transform unit syntax.
  • log2_transform_skip_max_size_minus2 specifies the maximum block size used for transform skip, and shall be in the range of 0 to 3, inclusive.
  • the variable MaxTsSize is set equal to 1 ⁇ ( log2_transform_skip_max_size_minus2 + 2 ).
  • sps_bdpcm_enabled_flag 1 specifies that intra_bdpcm_luma_flag and intra_bdpcm_chroma_flag may be present in the coding unit syntax for intra coding units.
  • sps_bdpcm_enabled_flag 0 specifies that intra_bdpcm_luma_flag and intra_bdpcm_chroma_flag are not present in the coding unit syntax for intra coding units.
  • the value of sps_bdpcm_enabled_flag is inferred to be equal to 0.
  • sps_ref_wraparound_enabled_flag 1 specifies that horizontal wrap-around motion compensation is applied in inter prediction.
  • sps_ref_wraparound_enabled_flag 0 specifies that horizontal wrap-around motion compensation is not applied.
  • sps_temporal_mvp_enabled_flag 0 specifies that temporal motion vector predictors are not used in the CLVS.
  • sps_sbtmvp_enabled_flag 1 specifies that subblock-based temporal motion vector predictors may be used in decoding of pictures with all slices having slice_type not equal to I in the CLVS.
  • sps_sbtmvp_enabled_flag 0 specifies that subblock-based temporal motion vector predictors are not used in the CLVS.
  • sps_amvr_enabled_flag 1 specifies that adaptive motion vector difference resolution is used in motion vector coding.
  • amvr_enabled_flag 0 specifies that adaptive motion vector difference resolution is not used in motion vector coding.
  • sps_bdof_enabled_flag 0 specifies that the bi-directional optical flow inter prediction is disabled.
  • sps_bdof_enabled_flag 1 specifies that the bi-directional optical flow inter prediction is enabled.
  • sps_bdof_pic_present_flag 1 specifies that ph_disable_bdof_flag is present in PHs referring to the SPS.
  • sps_bdof_pic_present_flag 0 specifies that ph_disable_bdof_flag is not present in PHs referring to the SPS.
  • sps_bdof_pic_present_flag not present, the value of sps_bdof_pic_present_flag is inferred to be equal to 0.
  • sps_smvd_enabled_flag 1 specifies that symmetric motion vector difference may be used in motion vector decoding.
  • sps_smvd_enabled_flag 0 specifies that symmetric motion vector difference is not used in motion vector coding.
  • sps_dmvr_enabled_flag 1 specifies that decoder motion vector refinement based inter bi-prediction is enabled.
  • sps_dmvr_enabled_flag 0 specifies that decoder motion vector refinement based inter bi-prediction is disabled.
  • sps_dmvr_pic_present_flag 1 specifies that ph_disable_dmvr_flag is present in PHs referring to the SPS.
  • sps_dmvr_pic_present_flag 0 specifies that ph_disable_dmvr_flag is not present in PHs referring to the SPS.
  • sps_dmvr_pic_present_flag 1 specifies that merge mode with motion vector difference is enabled.
  • sps_mmvd_enabled_flag 0 specifies that merge mode with motion vector difference is disabled.
  • sps_isp_enabled_flag 1 specifies that intra prediction with subpartitions is enabled.
  • sps_isp_enabled_flag 0 specifies that intra prediction with subpartitions is disabled.
  • sps_mrl_enabled_flag 1 specifies that intra prediction with multiple reference lines is enabled.
  • sps_mrl_enabled_flag 0 specifies that intra prediction with multiple reference lines is disabled.
  • sps_mip_enabled_flag 1 specifies that matrix-based intra prediction is enabled.
  • sps_mip_enabled_flag 0 specifies that matrix-based intra prediction is disabled.
  • sps_cclm_enabled_flag 0 specifies that the cross-component linear model intra prediction from luma component to chroma component is disabled.
  • sps_chroma_horizontal_collocated_flag 0 specifies that prediction processes operate in a manner designed for chroma sample positions that are shifted to the right by 0.5 in units of luma samples relative to corresponding luma sample positions. When sps chroma horizontal collocated flag is not present, it is inferred to be equal to 1.
  • sps_chroma_vertical_collocated_flag 1 specifies that prediction processes operate in a manner designed for chroma sample positions that are not vertically shifted relative to corresponding luma sample positions.
  • sps_chroma_vertical_collocated_flag 0 specifies that prediction processes operate in a manner designed for chroma sample positions that are shifted downward by 0.5 in units of luma samples relative to corresponding luma sample positions. When sps_chroma_vertical_collocated_flag is not present, it is inferred to be equal to 1.
  • sps_mts_enabled_flag 1 specifies that sps_explicit_mts_intra_enabled_flag is present in the sequence parameter set RBSP syntax and sps_explicit_mts_inter_enabled_flag is present in the sequence parameter set RBSP syntax.
  • sps_mts_enabled_flag 0 specifies that sps_explicit_mts_intra_enabled_flag is not present in the sequence parameter set RBSP syntax and sps_explicit_mts_inter_enabled_flag is not present in the sequence parameter set RBSP syntax.
  • sps_explicit_mts_intra_enabled_flag 1 specifies that mts_idx may be present in intra coding unit syntax.
  • sps_explicit_mts_intra_enabled_flag 0 specifies that mts_idx is not present in intra coding unit syntax.
  • sps_explicit_mts_intra_enabled_flag 1 specifies that mts_idx may be present in inter coding unit syntax.
  • sps_explicit_mts_inter_enabled_flag 0 specifies that mts_idx is not present in inter coding unit syntax.
  • the value of sps_explicit_mts_inter_enabled_flag is inferred to be equal to 0.
  • six_minus_max_num_merge_cand specifies the maximum number of merging motion vector prediction (MVP) candidates supported in the SPS subtracted from 6.
  • MVP merging motion vector prediction
  • the value of six_minus_max_num_merge_cand shall be in the range of 0 to 5, inclusive.
  • sps_sbt_enabled_flag 0 specifies that subblock transform for inter-predicted CUs is disabled.
  • sps_sbt_enabled_flag 1 specifies that subblock transform for inter- predicteds CU is enabled.
  • sps_affine_enabled_flag specifies whether affine model based motion compensation can be used for inter prediction. If sps_affine_enabled_flag is equal to 0, the syntax shall be constrained such that no affine model based motion compensation is used in the CLVS, and inter_affine_flag and cu_affine_type_flag are not present in coding unit syntax of the CLVS. Otherwise (sps_affine_enabled_flag is equal to 1), affine model based motion compensation can be used in the CLVS.
  • five_minus_max_num_subblock_merge_cand specifies the maximum number of subblock-based merging motion vector prediction candidates supported in the SPS subtracted from 5.
  • sps_affine_type_flag specifies whether 6-parameter affine model based motion compensation can be used for inter prediction. If sps affine type flag is equal to 0, the syntax shall be constrained such that no 6-parameter affine model based motion compensation is used in the CLVS, and cu_affine_type_flag is not present in coding unit syntax in the CLVS. Otherwise (sps_affine_type_flag is equal to 1), 6-parameter affine model based motion compensation can be used in the CLVS.
  • sps_affine_type_flag When not present, the value of sps_affine_type_flag is inferred to be equal to 0.
  • sps_affine_amvr_enabled_flag 1 specifies that adaptive motion vector difference resolution is used in motion vector coding of affine inter mode.
  • sps_affine_amvr_enabled_flag 0 specifies that adaptive motion vector difference resolution is not used in motion vector coding of affine inter mode.
  • the value of sps_affine_amvr_enabled_flag is inferred to be equal to 0.
  • sps_affine_prof_enabled_flag specifies whether the prediction refinement with optical flow can be used for affine motion compensation.
  • sps_affine_prof_enabled_flag 0
  • the affine motion compensation shall not be refined with optical flow. Otherwise (sps_affine_prof_enabled_flag is equal to 1), the affine motion compensation can be refined with optical flow.
  • the value of sps_affine_prof_enabled_flag is inferred to be equal to 0.
  • sps_prof_pic_present_flag 1 specifies that ph_disable_prof_flag is present in PHs referring to the SPS.
  • sps_prof_pic_present_flag 0 specifies that ph_disable_prof_flag is not present in PHs referring to the SPS.
  • sps_prof_pic_present_flag 1 specifies that pred_mode_plt_flag may be present in the coding unit syntax.
  • sps_palette_enabled_flag 0 specifies that pred_mode_plt_flag is not present in the coding unit syntax.
  • sps_palette_enabled_flag When sps_palette_enabled_flag is not present, it is inferred to be equal to 0.
  • sps_act_enabled_flag 1 specifies that adaptive colour transform may be used and the cu_act_enabled_flag may be present in the coding unit syntax.
  • sps_act_enabled_flag 0 speifies that adaptive colour transform is not used and cu_act_enabled_flag is not present in the coding unit syntax.
  • sps_act_enabled_flag When sps_act_enabled_flag is not present, it is inferred to be equal to 0.
  • sps_bcw_enabled_flag specifies whether bi-prediction with CU weights can be used for inter prediction. If sps_bcw_enabled_flag is equal to 0, the syntax shall be constrained such that no bi-prediction with CU weights is used in the CLVS, and bcw_idx is not present in coding unit syntax of the CLVS.
  • sps_bcw_enabled_flag 1 specifies that the IBC prediction mode may be used in decoding of pictures in the CLVS.
  • sps_ibc_enabled_flag 0 specifies that the IBC prediction mode is not used in the CLVS.
  • sps_ibc_enabled_flag not present, it is inferred to be equal to 0.
  • six_minus_max_num_ibc_merge_cand when sps_ibc_enabled_flag is equal to 1, specifies the maximum number of IBC merging block vector prediction (BVP) candidates supported in the SPS subtracted from 6.
  • BVP block vector prediction
  • sps_fpel_mmvd_enabled_flag 1 specifies that merge mode with motion vector difference is using integer sample precision.
  • sps_fpel_mmvd_enabled_flag 0 specifies that merge mode with motion vector difference can use fractional sample precision.
  • sps_gpm_enabled_flag specifies whether geometric partition based motion compensation can be used for inter prediction.
  • sps_gpm_enabled_flag 0 specifies that the syntax shall be constrained such that no geometric partition based motion compensation is used in the CLVS, and merge_gpm_partition_idx, merge_gpm_idx0, and merge_gpm_idx1 are not present in coding unit syntax of the CLVS.
  • sps_gpm_enabled_flag 1 specifies that geometric partition based motion compensation can be used in the CLVS. When not present, the value of sps_gpm_enabled_flag is inferred to be equal to 0.
  • the value of MaxNumGpmMergeCand shall be in the range of 2 to MaxNumMergeCand, inclusive.
  • sps_lmcs_enabled_flag 1 specifies that luma mapping with chroma scaling is used in the CLVS.
  • sps_lmcs_enabled_flag 0 specifies that luma mapping with chroma scaling is not used in the CLVS.
  • sps_lfnst_enabled_flag 1 specifies that lfnst_idx may be present in intra coding unit syntax.
  • sps_lfnst_enabled_flag 0 specifies that lfnst_idx is not present in intra coding unit syntax.
  • sps_ladf_enabled_flag 1 1
  • sps_num_ladf_intervals_minus2 1
  • sps_ladf_lowest_interval_qp_offset 1
  • sps_ladf_qp_offset i
  • sps_ladf_delta_threshold_minus1 i
  • sps_num_ladf_intervals_minus2 plus 1 specifies the number of sps_ladf_delta_threshold_minus1[ i ] and sps_ladf_qp_offset[ i ] syntax elements that are present in the SPS.
  • sps_num_ladf_intervals_minus2 shall be in the range of 0 to 3, inclusive.
  • sps_ladf_lowest_interval_qp_offset specifies the offset used to derive the variable qP as specified in clause 8.8.3.6.1.
  • the value of sps_ladf_lowest_interval_qp_offset shall be in the range of ⁇ 63 to 63, inclusive.
  • sps_ladf_qp_offset[ i ] specifies the offset array used to derive the variable qP as specified in clause 8.8.3.6.1.
  • sps_ladf_qp_offset[ i ] shall be in the range of ⁇ 63 to 63, inclusive.
  • sps_ladf_delta_threshold_minus1[ i ] is used to compute the values of SpsLadfIntervalLowerBound[ i ], which specifies the lower bound of the i-th luma intensity level interval.
  • the value of sps_ladf_delta_threshold_minus1[ i ] shall be in the range of 0 to 2 BitDepth ⁇ 3, inclusive.
  • the value of SpsLadfIntervalLowerBound[ 0 ] is set equal to 0.
  • log2_parallel_merge_level_minus2 shall be in the range of 0 to CtbLog2SizeY ⁇ 2, inclusive.
  • sps_dep_quant_enabled_flag 0 specifies that dependent quantization is disabled for pictures referring to the SPS.
  • sps_dep_quant_enabled_flag 1 specifies that dependent quantization may be enabled for pictures referring to the SPS.
  • sps_sign_data_hiding_enabled_flag 0 specifies that sign bit hiding is disabled for pictures referring to the SPS.
  • sps_sign_data_hiding_enabled_flag 1 specifies that sign bit hiding may be enabled for pictures referring to the SPS.
  • sps_virtual_boundaries_enabled_flag 1 specifies that disabling in-loop filtering across virtual boundaries may be applied in the coded pictures in the CLVS.
  • sps_virtual_boundaries_enabled_flag 0 specifies that disabling in-loop filtering across virtual boundaries is not applied in the coded pictures in the CLVS.
  • In-loop filtering operations include the deblocking filter, sample adaptive offset filter, and adaptive loop filter operations.
  • sps_virtual_boundaries_present_flag 1 specifies that information of virtual boundaries is signalled in the SPS.
  • sps_virtual_boundaries_present_flag 0 specifies that information of virtual boundaries is not signalled in the SPS.
  • the in-loop filtering operations are disabled across the virtual boundaries in pictures referring to the SPS.
  • In-loop filtering operations include the deblocking filter, sample adaptive offset filter, and adaptive loop filter operations. It is a requirement of bitstream conformance that when the value of res_change_in_clvs_allowed_flag is equal to 1, the value of sps_virtual_boundaries_present_flag shall be equal to 0.
  • sps_num_ver_virtual_boundaries specifies the number of sps_virtual_boundaries_pos_x[ i ] syntax elements that are present in the SPS. When sps_num_ver_virtual_boundaries is not present, it is inferred to be equal to 0.
  • sps_virtual_boundaries_pos_x[ i ] specifies the location of the i-th vertical virtual boundary in units of luma samples divided by 8. The value of sps_virtual_boundaries_pos_x[ i ] shall be in the range of 1 to Ceil( pic_width_in_luma_samples ⁇ 8 ) ⁇ 1, inclusive.
  • sps_num_hor_virtual_boundaries specifies the number of sps_virtual_boundaries_pos_y[ i ] syntax elements that are present in the SPS. When sps_num_hor_virtual_boundaries is not present, it is inferred to be equal to 0. When sps_virtual_boundaries_enabled_flag is equal to 1 and sps_virtual_boundaries_present_flag is equal to 1, the sum of sps_num_ver_virtual_boundaries and sps_num_hor_virtual_boundaries shall be greater than 0.
  • sps_virtual_boundaries_pos_y[ i ] specifies the location of the i-th horizontal virtual boundary in units of luma samples divided by 8.
  • the value of sps_virtual_boundaries_pos_y[ i ] shall be in the range of 1 to Ceil( pic_height_in_luma_samples ⁇ 8 ) ⁇ 1, inclusive.
  • sps_general_hrd_params_present_flag 1 specifies that the syntax structure general_hrd_parameters( ) is present in the SPS RBSP syntax structure.
  • sps_general_hrd_params_present_flag 0 specifies that the syntax structure general_hrd_parameters( ) is not present in the SPS RBSP syntax structure.
  • sps_sublayer_cpb_params_present_flag 1 specifies that the syntax structure old_hrd_parameters( ) in the SPS RBSP includes HRD parameters for sublayer representations with TemporalId in the range of 0 to sps_max_sublayers_minus1, inclusive.
  • sps_sublayer_cpb_params_present_flag 0 specifies that the syntax structure ols_hrd_parameters( ) in the SPS RBSP includes HRD parameters for the sublayer representation with TemporalId equal to sps_max_sublayers_minus1 only.
  • sps max sublayers minus1 the value of sps_sublayer_cpb_params_present_flag is inferred to be equal to 0.
  • the HRD parameters for the sublayer representations with TemporalId in the range of 0 to sps_max_sublayers_minus1 ⁇ 1, inclusive are inferred to be the same as that for the sublayer representation with TemporalId equal to sps_max_sublayers_minus1.
  • These include the HRD parameters starting from the fixed_pic_rate_general_flag[ i ] syntax element till the sublayer_hrd_parameters( i ) syntax structure immediately under the condition "if( general_vcl_hrd_params_present_flag )" in the ols_hrd_parameters syntax structure.
  • field_seq_flag 1 indicates that the CLVS conveys pictures that represent fields.
  • field_seq_flag 0 indicates that the CLVS conveys pictures that represent frames.
  • general_frame_only_constraint_flag 1
  • the value of field_seq_flag shall be equal to 0.
  • field_seq_flag a frame-field information SEI message shall be present for every coded picture in the CLVS.
  • vui_parameters_present_flag 1 specifies that the syntax structure vui_parameters( ) is present in the SPS RBSP syntax structure.
  • vui_parameters_present_flag 0 specifies that the syntax structure vui_parameters( ) is not present in the SPS RBSP syntax structure.
  • sps_extension_flag 0 specifies that no sps_extension_data_flag syntax elements are present in the SPS RBSP syntax structure.
  • sps_extension_flag 1 specifies that there are sps_extension_data_flag syntax elements present in the SPS RBSP syntax structure. sps_extension_data_flag may have any value. Its presence and value do not affect decoder conformance to profiles specified in this version of this Specification. Decoders conforming to this version of this Specification shall ignore all sps_extension_data_flag syntax elements. [0052] VVC’s picture parameter set (PPS) contains such information which could change from picture to picture.
  • the PPS includes information roughly comparable what was part of the PPS in HEVC, including: 1) a self-reference; 2) initial picture control information such as initial quantization parameter (QP), a number of flags indicating the use of, or presence of, certain tools or control information in the slice header; and 3) tiling information.
  • initial picture control information such as initial quantization parameter (QP), a number of flags indicating the use of, or presence of, certain tools or control information in the slice header; and 3) tiling information.
  • QP initial quantization parameter
  • Table 8 How to read the Table 8 is illustrated in the appendix section of this invention which could also be found in the VVC specification. Table 8.
  • Picture parameter set RBSP semantics A PPS RBSP shall be available to the decoding process prior to it being referenced, included in at least one AU with TemporalId less than or equal to the TemporalId of the PPS NAL unit or provided through external means. All PPS NAL units with a particular value of pps_pic_parameter_set_id within a PU shall have the same content. pps_pic_parameter_set_id identifies the PPS for reference by other syntax elements. The value of pps_pic_parameter_set_id shall be in the range of 0 to 63, inclusive.
  • ppsLayerId be the value of the nuh_layer_id of a particular PPS NAL unit
  • vclLayerId be the value of the nuh_layer_id of a particular VCL NAL unit.
  • the particular VCL NAL unit shall not refer to the particular PPS NAL unit unless ppsLayerId is less than or equal to vclLayerId and the layer with nuh_layer_id equal to ppsLayerId is included in at least one OLS that includes the layer with nuh_layer_id equal to vclLayerId.
  • pps_seq_parameter_set_id specifies the value of sps_seq_parameter_set_id for the SPS.
  • the value of pps_seq_parameter_set_id shall be in the range of 0 to 15, inclusive.
  • the value of pps_seq_parameter_set_id shall be the same in all PPSs that are referred to by coded pictures in a CLVS.
  • mixed_nalu_types_in_pic_flag 1 specifies that each picture referring to the PPS has more than one VCL NAL unit, the VCL NAL units do not have the same value of nal_unit_type, and the picture is not an IRAP picture.
  • mixed_nalu_types_in_pic_flag 0 specifies that each picture referring to the PPS has one or more VCL NAL units and the VCL NAL units of each picture refering to the PPS have the same value of nal_unit_type.
  • no_mixed_nalu_types_in_pic_constraint_flag 1
  • the value of mixed_nalu_types_in_pic_flag shall be equal to 0.
  • nal_unit_type value nalUnitTypeA in the range of IDR_W_RADL to CRA_NUT, inclusive, in a picture picA that also contains one or more slices with another value of nal_unit_type (i.e., the value of mixed_nalu_types_in_pic_flag for the picture picA is equal to 1)
  • the slice shall belong to a subpicture subpicA for which the value of the corresponding subpic_treated_as_pic_flag[ i ] is equal to 1.
  • — The slice shall not belong to a subpicture of picA containing VCL NAL units with nal_unit_type not equal to nalUnitTypeA.
  • nalUnitTypeA is equal to CRA, for all the following PUs following the current picture in the CLVS in decoding order and in output order, neither RefPicList[ 0 ] nor RefPicList[ 1 ] of a slice in subpicA in those PUs shall include any picture preceding picA in decoding order in an active entry.
  • nalUnitTypeA is equal to IDR_W_RADL or IDR_N_LP
  • neither RefPicList[ 0 ] nor RefPicList[ 1 ] of a slice in subpicA in those PUs shall include any picture preceding picA in decoding order in an active entry.
  • NOTE 1– mixed_nalu_types_in_pic_flag is 1 indicates that pictures referring to the PPS contain slices with different NAL unit types, e.g., coded pictures originating from a subpicture bitstream merging operation for which encoders have to ensure matching bitstream structure and further alignment of parameters of the original bitstreams.
  • pic_width_in_luma_samples specifies the width of each decoded picture referring to the PPS in units of luma samples.
  • pic_width_in_luma_samples shall not be equal to 0, shall be an integer multiple of Max( 8, MinCbSizeY ), and shall be less than or equal to pic_width_max_in_luma_samples.
  • pic_height_in_luma_samples specifies the height of each decoded picture referring to the PPS in units of luma samples.
  • pic_height_in_luma_samples shall not be equal to 0 and shall be an integer multiple of Max( 8, MinCbSizeY ), and shall be less than or equal to pic_height_max_in_luma_samples.
  • pic_height_in_luma_samples shall be equal to pic_height_max_in_luma_samples.
  • pps_conformance_window_flag 0 indicates that the conformance cropping window offset parameters are not present in the PPS.
  • pps_conf_win_left_offset, pps_conf_win_right_offset, pps_conf_win_top_offset, and pps_conf_win_bottom_offset specify the samples of the pictures in the CLVS that are output from the decoding process, in terms of a rectangular region specified in picture coordinates for output.
  • pps_conformance_window_flag When pps_conformance_window_flag is equal to 0, the values of pps_conf_win_left_offset, pps_conf_win_right_offset, pps_conf_win_top_offset, and pps_conf_win_bottom_offset are inferred to be equal to 0.
  • the conformance cropping window contains the luma samples with horizontal picture coordinates from SubWidthC * pps_conf_win_left_offset to pic_width_in_luma_samples ⁇ ( SubWidthC * pps_conf_win_right_offset + 1 ) and vertical picture coordinates from SubHeightC * pps_conf_win_top_offset to pic_height_in_luma_samples ⁇ ( SubHeightC * pps_conf_win_bottom_offset + 1 ), inclusive.
  • SubWidthC * ( pps_conf_win_left_offset + pps_conf_win_right_offset ) shall be less than pic_width_in_luma_samples
  • SubHeightC * ( pps_conf_win_top_offset + pps_conf_win_bottom_offset ) shall be less than pic_height_in_luma_samples.
  • ChromaArrayType is not equal to 0
  • the corresponding specified samples of the two chroma arrays are the samples having picture coordinates ( x / SubWidthC, y / SubHeightC ), where ( x, y ) are the picture coordinates of the specified luma samples.
  • ppsA and ppsB be any two PPSs referring to the same SPS. It is a requirement of bitstream conformance that, when ppsA and ppsB have the same the values of pic_width_in_luma_samples and pic_height_in_luma_samples, respectively, ppsA and ppsB shall have the same values of pps_conf_win_left_offset, pps_conf_win_right_offset, pps_conf_win_top_offset, and pps_conf_win_bottom_offset, respectively.
  • pic_width_in_luma_samples is equal to pic_width_max_in_luma_samples and pic_height_in_luma_samples is equal to pic_height_max_in_luma_samples
  • bitstream conformance that pps_conf_win_left_offset, pps_conf_win_right_offset, pps_conf_win_top_offset, and pps_conf_win_bottom_offset, are equal to sps_conf_win_left_offset, sps_conf_win_right_offset, sps_conf_win_top_offset, and sps_conf_win_bottom_offset, respectively.
  • scaling_window_explicit_signalling_flag 1 specifies that the scaling window offset parameters are present in the PPS.
  • scaling_window_explicit_signalling_flag 0 specifies that the scaling window offset parameters are not present in the PPS.
  • the value of scaling window explicit signalling flag shall be equal to 0.
  • scaling_win_left_offset, scaling_win_right_offset, scaling_win_top_offset, and scaling_win_bottom_offset specify the offsets that are applied to the picture size for scaling ratio calculation.
  • scaling_win_left_offset, scaling_win_right_offset, scaling_win_top_offset, and scaling_win_bottom_offset are inferred to be equal to pps_conf_win_left_offset, pps_conf_win_right_offset, pps_conf_win_top_offset, and pps_conf_win_bottom_offset, respectively.
  • SubWidthC * ( scaling_win_left_offset + scaling_win_right_offset ) shall be less than pic_width_in_luma_samples
  • SubHeightC * ( scaling_win_top_offset + scaling_win_bottom_offset ) shall be less than pic_height_in_luma_samples.
  • refPicOutputWidthL and refPicOutputHeightL be the PicOutputWidthL and PicOutputHeightL, respectively, of a reference picture of a current picture referring to this PPS.
  • – PicOutputWidthL * 2 shall be greater than or equal to refPicWidthInLumaSamples.
  • PicOutputHeightL * 2 shall be greater than or equal to refPicHeightInLumaSamples.
  • PicOutputWidthL shall be less than or equal to refPicWidthInLumaSamples * 8.
  • PicOutputHeightL shall be less than or equal to refPicHeightInLumaSamples * 8.
  • PicOutputWidthL * pic_width_max_in_luma_samples shall be greater than or equal to refPicOutputWidthL * (pic_width_in_luma_samples ⁇ Max( 8, MinCbSizeY )).
  • PicOutputHeightL * pic_height_max_in_luma_samples shall be greater than or equal to refPicOutputHeightL * (pic_height_in_luma_samples ⁇ Max( 8, MinCbSizeY )).
  • output_flag_present_flag equal to 1 indicates that the pic_output_flag syntax element is present in PHs referring to the PPS.
  • output_flag_present_flag 0 indicates that the pic_output_flag syntax element is not present in PHs referring to the PPS.
  • subpic_id_mapping_in_pps_flag 1 specifies that the subpicture ID mapping is signalled in the PPS.
  • subpic_id_mapping_in_pps_flag 0 specifies that the subpicture ID mapping is not signalled in the PPS. If subpic_id_mapping_explicitly_signalled_flag is 0 or subpic_id_mapping_in_sps_flag is equal to 1, the value of subpic_id_mapping_in_pps_flag shall be equal to 0.
  • subpic_id_mapping_explicitly_signalled_flag 1 and subpic_id_mapping_in_sps_flag is equal to 0
  • the value of subpic_id_mapping_in_pps_flag shall be equal to 1.
  • pps_num_subpics_minus1 shall be equal to sps_num_subpics_minus1.
  • pps_subpic_id_len_minus1 shall be equal to sps_subpic_id_len_minus1.
  • pps subpic id[ i ] specifies the subpicture ID of the i-th subpicture.
  • the length of the pps_subpic_id[ i ] syntax element is pps_subpic_id_len_minus1 + 1 bits.
  • the nal_unit_type for all coded slice NAL units of the subpicture in the current picture with subpicture index i shall be equal to a particular value in the range of IDR_W_RADL to CRA_NUT, inclusive.
  • no_pic_partition_flag 1 specifies that no picture partitioning is applied to each picture referring to the PPS.
  • no_pic_partition_flag 0 specifies each picture referring to the PPS may be partitioned into more than one tile or slice. It is a requirement of bitstream conformance that the value of no_pic_partition_flag shall be the same for all PPSs that are referred to by coded pictures within a CLVS. It is a requirement of bitstream conformance that the value of no_pic_partition_flag shall not be equal to 1 when the value of sps_num_subpics_minus1 + 1 is greater than 1. pps_log2_ctu_size_minus5 plus 5 specifies the luma coding tree block size of each CTU.
  • pps_log2_ctu_size_minus5 shall be equal to sps_log2_ctu_size_minus5.
  • num_exp_tile_columns_minus1 plus 1 specifies the number of explicitly provided tile column widths.
  • the value of num_exp_tile_columns_minus1 shall be in the range of 0 to PicWidthInCtbsY ⁇ 1, inclusive.
  • no_pic_partition_flag is equal to 1
  • the value of num_exp_tile_columns_minus1 is inferred to be equal to 0.
  • num_exp_tile_rows_minus1 plus 1 specifies the number of explicitly provided tile row heights.
  • num_exp_tile_rows_minus1 shall be in the range of 0 to PicHeightInCtbsY ⁇ 1, inclusive.
  • no_pic_partition_flag is equal to 1
  • the value of num_tile_rows_minus1 is inferred to be equal to 0.
  • tile_column_width_minus1[ i ] plus 1 specifies the width of the i-th tile column in units of CTBs for i in the range of 0 to num_exp_tile_columns_minus1 ⁇ 1, inclusive.
  • tile_column_width_minus1[ num_exp_tile_columns_minus1 ] is used to derive the width of the tile columns with index greater than or equal to num_exp_tile_columns_minus1 as specified in clause 6.5.1.
  • the value of tile_column_width_minus1[ i ] shall be in the range of 0 to PicWidthInCtbsY ⁇ 1, inclusive. When not present, the value of tile_column_width_minus1[ 0 ] is inferred to be equal to PicWidthInCtbsY ⁇ 1.
  • tile_row_height_minus1[ i ] plus 1 specifies the height of the i-th tile row in units of CTBs for i in the range of 0 to num_exp_tile_rows_minus1 ⁇ 1, inclusive.
  • tile_row_height_minus1[ num_exp_tile_rows_minus1 ] is used to derive the height of the tile rows with index greater than or equal to num_exp_tile_rows_minus1 as specified in clause 6.5.1.
  • the value of tile_row_height_minus1[ i ] shall be in the range of 0 to PicHeightInCtbsY ⁇ 1, inclusive.
  • tile_row_height_minus1[ 0 ] is inferred to be equal to PicHeightInCtbsY ⁇ 1.
  • rect_slice_flag 0 specifies that tiles within each slice are in raster scan order and the slice information is not signalled in PPS.
  • rect_slice_flag 1 specifies that tiles within each slice cover a rectangular region of the picture and the slice information is signalled in the PPS.
  • rect_slice_flag is inferred to be equal to 1.
  • subpic_info_present_flag is equal to 1
  • the value of rect_slice_flag shall be equal to 1.
  • single_slice_per_subpic_flag 1 specifies that each subpicture consists of one and only one rectangular slice.
  • single_slice_per_subpic_flag 0 specifies that each subpicture may consist of one or more rectangular slices.
  • the value of single_slice_per_subpic_flag is inferred to be equal to 0. lices in each picture referring to the PPS.
  • the value of num_slices_in_pic_minus1 shall be in the range of 0 to MaxSlicesPerPicture ⁇ 1, inclusive, where MaxSlicesPerPicture is specified in Annex A.
  • tile_idx_delta_present_flag 0 specifies that tile_idx_delta[ i ] syntax elements are not present in the PPS and all pictures referring to the PPS are partitioned into rectangular slice rows and rectangular slice columns in slice raster order.
  • tile_idx_delta_present_flag 1 specifies that tile_idx_delta[ i ] syntax elements may be present in the PPS and all rectangular slices in pictures referring to the PPS are specified in the order indicated by the values of the tile_idx_delta[ i ] in increasing values of i.
  • the value of tile_idx_delta_present_flag is inferred to be equal to 0.
  • slice_width_in_tiles_minus1[ i ] plus 1 specifies the width of the i-th rectangular slice in units of tile columns.
  • the value of slice_width_in_tiles_minus1[ i ] shall be in the range of 0 to NumTileColumns ⁇ 1, inclusive.
  • slice_width_in_tiles_minus1[ i ] is inferred to be equal to 0.
  • slice_height_in_tiles_minus1[ i ] plus 1 specifies the height of the i-th rectangular slice in units of tile rows when num_exp_slices_in_tile[ i ] is equal to 0.
  • the value of slice_height_in_tiles_minus1[ i ] shall be in the range of 0 to NumTileRows ⁇ 1, inclusive.
  • num_exp_slices_in_tile[ i ] specifies the number of explicitly provided slice heights for the slices in the tile containing the i-th slice (i.e., the tile with tile index equal to SliceTopLeftTileIdx[ i ]).
  • num_exp_slices_in_tile[ i ] shall be in the range of 0 to RowHeight[ SliceTopLeftTileIdx[ i ] / NumTileColumns ] ⁇ 1, inclusive. When not present, the value of num_exp_slices_in_tile[ i ] is inferred to be equal to 0. NOTE 3 – If num_exp_slices_in_tile[ i ] is equal to 0, the tile containing the i-th slice is not split into multiple slices.
  • exp_slice_height_in_ctus_minus1[ i ][ j ] plus 1 specifies the height of the j-th rectangular slice in the tile containing the i-th slice in units of CTU rows.
  • the value of exp_slice_height_in_ctus_minus1[ i ][ j ] shall be in the range of 0 to RowHeight[ SliceTopLeftTileIdx[ i ] / NumTileColumns ] ⁇ 1, inclusive.
  • tile_idx_delta[ i ] specifies the difference between the tile index of the tile containing the first CTU in the ( i + 1 )-th rectangular slice and the tile index of the tile containing the first CTU in the i-th rectangular slice.
  • the value of tile_idx_delta[ i ] shall be in the range of ⁇ NumTilesInPic + 1 to NumTilesInPic ⁇ 1, inclusive. When not present, the value of tile_idx_delta[ i ] is inferred to be equal to 0. When present, the value of tile_idx_delta[ i ] shall not be equal to 0.
  • loop_filter_across_tiles_enabled_flag 1 specifies that in-loop filtering operations may be performed across tile boundaries in pictures referring to the PPS.
  • loop_filter_across_tiles_enabled_flag 0 specifies that in-loop filtering operations are not performed across tile boundaries in pictures referring to the PPS.
  • the in-loop filtering operations include the deblocking filter, sample adaptive offset filter, and adaptive loop filter operations. When not present, the value of loop_filter_across_tiles_enabled_flag is inferred to be equal to 1.
  • loop_filter_across_slices_enabled_flag 1 specifies that in-loop filtering operations may be performed across slice boundaries in pictures referring to the PPS.
  • loop_filter_across_slice_enabled_flag 0 specifies that in-loop filtering operations are not performed across slice boundaries in pictures referring to the PPS.
  • the in-loop filtering operations include the deblocking filter, sample adaptive offset filter, and adaptive loop filter operations. When not present, the value of loop_filter_across_slices_enabled_flag is inferred to be equal to 0.
  • cabac_init_present_flag 1 specifies that cabac_init_flag is present in slice headers referring to the PPS.
  • cabac_init_present_flag 0 specifies that cabac_init_flag is not present in slice headers referring to the PPS.
  • num_ref_idx_default_active_minus1[ i ] plus 1 when i is equal to 0, specifies the inferred value of the variable NumRefIdxActive[ 0 ] for P or B slices with num_ref_idx_active_override_flag equal to 0, and, when i is equal to 1, specifies the inferred value of NumRefIdxActive[ 1 ] for B slices with num_ref_idx_active_override_flag equal to 0.
  • rpl1_idx_present_flag 0 specifies that rpl_sps_flag[ 1 ] and rpl_idx[ 1 ] are not present in the PH syntax structures or the slice headers for pictures referring to the PPS.
  • rpl1_idx_present_flag 1 specifies that rpl_sps_flag[ 1 ] and rpl_idx[ 1 ] may be present in the PH syntax structures or the slice headers for pictures referring to the PPS.
  • init_qp_minus26 plus 26 specifies the initial value of SliceQp Y for each slice referring to the PPS.
  • the initial value of SliceQp Y is modified at the picture level when a non-zero value of ph_qp_delta is decoded or at the slice level when a non-zero value of slice_qp_delta is decoded.
  • the value of init_qp_minus26 shall be in the range of ⁇ ( 26 + QpBdOffset ) to +37, inclusive.
  • cu_qp_delta_enabled_flag 1 specifies that the ph_cu_qp_delta_subdiv_intra_slice and ph_cu_qp_delta_subdiv_inter_slice syntax elements are present in PHs referring to the PPS and cu_qp_delta_abs may be present in the transform unit syntax.
  • cu_qp_delta_enabled_flag 0 specifies that the ph_cu_qp_delta_subdiv_intra_slice and ph_cu_qp_delta_subdiv_inter_slice syntax elements are not present in PHs referring to the PPS and cu_qp_delta_abs is not present in the transform unit syntax.
  • pps_chroma_tool_offsets_present_flag 1 specifies that chroma tool offsets related syntax elements are present in the PPS RBSP syntax structure.
  • pps_chroma_tool_offsets_present_flag 0 specifies that chroma tool offsets related syntax elements are not present in in the PPS RBSP syntax structure.
  • ChromaArrayType is equal to 0
  • the value of pps_chroma_tool_offsets_present_flag shall be equal to 0.
  • pps_cb_qp_offset and pps_cr_qp_offset specify the offsets to the luma quantization parameter Qp′ Y used for deriving Qp′ Cb and Qp′ Cr , respectively.
  • the values of pps_cb_qp_offset and pps_cr_qp_offset shall be in the range of ⁇ 12 to +12, inclusive.
  • pps_cb_qp_offset and pps_cr_qp_offset are not used in the decoding process and decoders shall ignore their value.
  • the values of pps_cb_qp_offset and pps_cr_qp_offset are inferred to be equalt to 0.
  • pps_joint_cbcr_qp_offset_present_flag 1 specifies that pps_joint_cbcr_qp_offset_value and joint_cbcr_qp_offset_list[ i ] are present in the PPS RBSP syntax structure.
  • pps_joint_cbcr_qp_offset_present_flag 0 specifies that pps_joint_cbcr_qp_offset_value and joint_cbcr_qp_offset_list[ i ] are not present in the PPS RBSP syntax structure.
  • ChromaArrayType is equal to 0 or sps_joint_cbcr_enabled_flag is equal to 0, the value of pps_joint_cbcr_qp_offset_present_flag shall be equal to 0.
  • the value of pps_joint_cbcr_qp_offset_present_flag is inferred to be equal to 0.
  • pps_joint_cbcr_qp_offset_value specifies the offset to the luma quantization parameter Qp′Y used for deriving Qp′CbCr.
  • the value of pps_joint_cbcr_qp_offset_value shall be in the range of ⁇ 12 to +12, inclusive.
  • pps_joint_cbcr_qp_offset_present_flag 0
  • pps_joint_cbcr_qp_offset_value 0
  • pps_joint_cbcr_qp_offset_value 0
  • pps_joint_cbcr_qp_offset_value 0
  • pps_joint_cbcr_qp_offset_present_flag 1 specifies that the slice_cb_qp_offset and slice_cr_qp_offset syntax elements are present in the associated slice headers.
  • pps_slice_chroma_qp_offsets_present_flag 0 specifies that the slice_cb_qp_offset and slice_cr_qp_offset syntax elements are not present in the associated slice headers.
  • pps_slice_chroma_qp_offsets_present_flag When not present, the value of pps_slice_chroma_qp_offsets_present_flag is inferred to be equal to 0.
  • pps_cu_chroma_qp_offset_list_enabled_flag 1 specifies that the ph_cu_chroma_qp_offset_subdiv_intra_slice and ph_cu_chroma_qp_offset_subdiv_inter_slice syntax elements are present in PHs referring to the PPS and cu chroma qp offset flag may be present in the transform unit syntax and the palette coding syntax.
  • pps_cu_chroma_qp_offset_list_enabled_flag 0 specifies that the ph_cu_chroma_qp_offset_subdiv_intra_slice and ph_cu_chroma_qp_offset_subdiv_inter_slice syntax elements are not present in PHs referring to the PPS and the cu_chroma_qp_offset_flag is not present in the transform unit syntax and the palette coding syntax.
  • the value of pps_cu_chroma_qp_offset_list_enabled_flag is inferred to be equal to 0.
  • chroma_qp_offset_list_len_minus1 plus 1 specifies the number of cb_qp_offset_list[ i ], cr_qp_offset_list[ i ], and joint_cbcr_qp_offset_list[ i ], syntax elements that are present in the PPS RBSP syntax structure.
  • the value of chroma_qp_offset_list_len_minus1 shall be in the range of 0 to 5, inclusive.
  • cb_qp_offset_list[ i ], cr_qp_offset_list[ i ], and joint_cbcr_qp_offset_list[ i ], specify offsets used in the derivation of Qp′Cb, Qp′Cr, and Qp′CbCr, respectively.
  • the values of cb_qp_offset_list[ i ], cr_qp_offset_list[ i ], and joint_cbcr_qp_offset_list[ i ] shall be in the range of ⁇ 12 to +12, inclusive.
  • pps_joint_cbcr_qp_offset_present_flag 0 specifies that weighted prediction is not applied to P slices referring to the PPS.
  • pps_weighted_pred_flag 1 specifies that weighted prediction is applied to P slices referring to the PPS.
  • pps_weighted_bipred_flag 0 specifies that explicit weighted prediction is not applied to B slices referring to the PPS.
  • pps_weighted_bipred_flag 1 specifies that explicit weighted prediction is applied to B slices referring to the PPS.
  • deblocking_filter_control_present_flag 1 specifies the presence of deblocking filter control syntax elements in the PPS.
  • deblocking_filter_control_present_flag 0 specifies the absence of deblocking filter control syntax elements in the PPS.
  • deblocking_filter_override_enabled_flag 1 specifies the presence of ph_deblocking_filter_override_flag in the PHs referring to the PPS or slice_deblocking_filter_override_flag in the slice headers referring to the PPS.
  • deblocking_filter_override_enabled_flag 0 specifies the absence of ph_deblocking_filter_override_flag in PHs referring to the PPS or slice_deblocking_filter_override_flag in slice headers referring to the PPS.
  • the value of deblocking_filter_override_enabled_flag is inferred to be equal to 0.
  • pps_deblocking_filter_disabled_flag 1 specifies that the operation of deblocking filter is not applied for slices referring to the PPS in which slice_deblocking_filter_disabled_flag is not present.
  • pps_deblocking_filter_disabled_flag 0 specifies that the operation of the deblocking filter is applied for slices referring to the PPS in which slice_deblocking_filter_disabled_flag is not present.
  • the value of pps_deblocking_filter_disabled_flag is inferred to be equal to 0.
  • pps_beta_offset_div2 and pps_tc_offset_div2 specify the default deblocking parameter offsets for ⁇ and tC (divided by 2) that are applied to the luma component for slices referring to the PPS, unless the default deblocking parameter offsets are overridden by the deblocking parameter offsets present in the picture headers or the slice headers of the slices referring to the PPS.
  • the values of pps beta offset div2 and pps tc offset div2 shall both be in the range of ⁇ 12 to 12, inclusive. When not present, the values of pps_beta_offset_div2 and pps_tc_offset_div2 are both inferred to be equal to 0.
  • pps_cb_beta_offset_div2 and pps_cb_tc_offset_div2 specify the default deblocking parameter offsets for ⁇ and tC (divided by 2) that are applied to the Cb component for slices referring to the PPS, unless the default deblocking parameter offsets are overridden by the deblocking parameter offsets present in the picture headers or the slice headers of the slices referring to the PPS.
  • the values of pps_cb_beta_offset_div2 and pps_cb_tc_offset_div2 shall both be in the range of ⁇ 12 to 12, inclusive.
  • pps_cb_beta_offset_div2 and pps_cb_tc_offset_div2 are both inferred to be equal to 0.
  • pps_cr_beta_offset_div2 and pps_cr_tc_offset_div2 specify the default deblocking parameter offsets for ⁇ and tC (divided by 2) that are applied to the Cr component for slices referring to the PPS, unless the default deblocking parameter offsets are overridden by the deblocking parameter offsets present in the picture headers or the slice headers of the slices referring to the PPS.
  • pps_cr_beta_offset_div2 and pps_cr_tc_offset_div2 shall both be in the range of ⁇ 12 to 12, inclusive.
  • the values of pps_cr_beta_offset_div2 and pps_cr_tc_offset_div2 are both inferred to be equal to 0.
  • rpl_info_in_ph_flag 1 specifies that reference picture list information is present in the PH syntax structure and not present in slice headers referring to the PPS that do not contain a PH syntax structure.
  • rpl_info_in_ph_flag 0 specifies that reference picture list information is not present in the PH syntax structure and may be present in slice headers referring to the PPS that do not contain a PH syntax structure.
  • dbf_info_in_ph_flag 1 specifies that deblocking filter information is present in the PH syntax structure and not present in slice headers referring to the PPS that do not contain a PH syntax structure.
  • dbf_info_in_ph_flag 0 specifies that deblocking filter information is not present in the PH syntax structure and may be present in slice headers referring to the PPS that do not contain a PH syntax structure.
  • dbf_info_in_ph_flag When not present, the value of dbf_info_in_ph_flag is inferred to be equal to 0.
  • sao_info_in_ph_flag 1 specifies that SAO filter information is present in the PH syntax structure and not present in slice headers referring to the PPS that do not contain a PH syntax structure.
  • sao_info_in_ph_flag 0 specifies that SAO filter information is not present in the PH syntax structure and may be present in slice headers referring to the PPS that do not contain a PH syntax structure.
  • alf_info_in_ph_flag 1 specifies that ALF information is present in the PH syntax structure and not present in slice headers referring to the PPS that do not contain a PH syntax structure.
  • alf_info_in_ph_flag 0 specifies that ALF information is not present in the PH syntax structure and may be present in slice headers referring to the PPS that do not contain a PH syntax structure.
  • wp_info_in_ph_flag 1 specifies that weighted prediction information may be present in the PH syntax structure and not present in slice headers referring to the PPS that do not contain a PH syntax structure.
  • wp_info_in_ph_flag 0 specifies that weighted prediction information is not present in the PH syntax structure and may be present in slice headers referring to the PPS that do not contain a PH syntax structure.
  • the value of wp_info_in_ph_flag is inferred to be equal to 0.
  • qp_delta_info_in_ph_flag 1 specifies that QP delta information is present in the PH syntax structure and not present in slice headers referring to the PPS that do not contain a PH syntax structure.
  • qp_delta_info_in_ph_flag 0 specifies that QP delta information is not present in the PH syntax structure and may be present in slice headers referring to the PPS that do not contain a PH syntax structure.
  • pps_ref_wraparound_enabled_flag 1 specifies that horizontal wrap-around motion compensation is applied in inter prediction.
  • pps_ref_wraparound_enabled_flag 0 specifies that horizontal wrap-around motion compensation is not applied.
  • pps_ref_wraparound_offset shall be in the range of 0 to ( pic_width_in_luma_samples / MinCbSizeY ) ⁇ ( CtbSizeY / MinCbSizeY ) ⁇ 2, inclusive.
  • the variable PpsRefWraparoundOffset is set equal to pps_ref_wraparound_offset+( CtbSizeY / MinCbSizeY ) + 2.
  • picture_header_extension_present_flag 0 specifies that no PH extension syntax elements are present in PHs referring to the PPS.
  • picture_header_extension_present_flag 1 specifies that PH extension syntax elements are present in PHs referring to the PPS.
  • picture_header_extension_present_flag shall be equal to 0 in bitstreams conforming to this version of this Specification.
  • slice_header_extension_present_flag 0 specifies that no slice header extension syntax elements are present in the slice headers for coded pictures referring to the PPS.
  • slice_header_extension_present_flag 1 specifies that slice header extension syntax elements are present in the slice headers for coded pictures referring to the PPS.
  • slice_header_extension_present_flag shall be equal to 0 in bitstreams conforming to this version of this Specification.
  • pps_extension_flag 0 specifies that no pps_extension_data_flag syntax elements are present in the PPS RBSP syntax structure.
  • pps_extension_flag 1 specifies that there are pps_extension_data_flag syntax elements present in the PPS RBSP syntax structure.
  • pps_extension_data_flag may have any value. Its presence and value do not affect decoder conformance to profiles specified in this version of this Specification. Decoders conforming to this version of this Specification shall ignore all pps_extension_data_flag syntax elements.
  • the slice header contains information that can change from slice to slice, as well as such picture-related information that is relatively small or relevant only for a certain slice or picture types.
  • the size of the slice header may be noticeably bigger than the PPS, particular when there are tile or wavefront entry point offsets in the slice header and RPS, prediction weights, or reference picture list modifications are explicitly signaled.
  • the syntax and the associated semantic of the sequence parameter set in the current VVC draft specification are illustrated in Table 10 and Table 11, respectively. How to read Table 10 is illustrated in the appendix section of this invention, which could also be found in the VVC specification. Table 10. Picture header structure syntax
  • the PH syntax structure contains information that is common for all slices of the coded picture associated with the PH syntax structure.
  • gdr_or_irap_pic_flag 1 specifies that the current picture is a GDR or IRAP picture.
  • gdr_or_irap_pic_flag 0 specifies that the current picture may or may not be a GDR or IRAP picture.
  • gdr_pic_flag 1 specifies the picture associated with the PH is a GDR picture.
  • gdr_pic_flag 0 specifies that the picture associated with the PH is not a GDR picture.
  • the value of gdr_pic_flag is inferred to be equal to 0.
  • gdr_enabled_flag When gdr_enabled_flag is equal to 0, the value of gdr_pic_flag shall be equal to 0.
  • ph_inter_slice_allowed_flag 0 specifies that all coded slices of the picture have slice_type equal to 2.
  • ph_inter_slice_allowed_flag 1 specifies that there may or may not be one or more coded slices in the picture that have slice_type equal to 0 or 1.
  • ph_intra_slice_allowed_flag 0 specifies that all coded slices of the picture have slice_type equal to 0 or 1.
  • ph_intra_slice_allowed_flag 1 specifies that there may or may not be one or more coded slices in the picture that have slice_type equal to 2.When not present, the value of ph_intra_slice_allowed_flag is inferred to be equal to 1.
  • the encoder is expected to set the values of both ph_inter_slice_allowed_flag and ph_intra_slice_allowed_flag equal to 1.
  • non_reference_picture_flag 1 specifies the picture associated with the PH is never used as a reference picture.
  • non_reference_picture_flag 0 specifies the picture associated with the PH may or may not be used as a reference picture.
  • ph_pic_parameter_set_id specifies the value of pps_pic_parameter_set_id for the PPS in use.
  • the value of ph_pic_parameter_set_id shall be in the range of 0 to 63, inclusive. It is a requirement of bitstream conformance that the value of TemporalId of the PH shall be greater than or equal to the value of TemporalId of the PPS that has pps_pic_parameter_set_id equal to ph_pic_parameter_set_id.
  • ph_pic_order_cnt_lsb specifies the picture order count modulo MaxPicOrderCntLsb for the current picture.
  • the length of the ph_pic_order_cnt_lsb syntax element is log2_max_pic_order_cnt_lsb_minus4 + 4 bits.
  • the value of the ph_pic_order_cnt_lsb shall be in the range of 0 to MaxPicOrderCntLsb ⁇ 1, inclusive.
  • no_output_of_prior_pics_flag affects the output of previously-decoded pictures in the DPB after the decoding of a CLVSS picture that is not the first picture in the bitstream as specified in Annex C.
  • recovery_poc_cnt specifies the recovery point of decoded pictures in output order. If the current picture is a GDR picture that is associated with the PH, and there is a picture picA that follows the current GDR picture in decoding order in the CLVS that has PicOrderCntVal equal to the PicOrderCntVal of the current GDR picture plus the value of recovery_poc_cnt, the picture picA is referred to as the recovery point picture. Otherwise, the first picture in output order that has PicOrderCntVal greater than the PicOrderCntVal of the current picture plus the value of recovery_poc_cnt is referred to as the recovery point picture. The recovery point picture shall not precede the current GDR picture in decoding order.
  • the value of recovery_poc_cnt shall be in the range of 0 to MaxPicOrderCntLsb ⁇ 1, inclusive.
  • gdr_enabled_flag is equal to 1 and PicOrderCntVal of the current picture is greater than or equal to RpPicOrderCntVal of the associated GDR picture
  • the current and subsequent decoded pictures in output order are exact match to the corresponding pictures produced by starting the decoding process from the previous IRAP picture, when present, preceding the associated GDR picture in decoding order.
  • ph_extra_bit[ i ] may be equal to 1 or 0. Decoders conforming to this version of this Specification shall ignore the value of ph_extra_bit[ i ]. Its value does not affect decoder conformance to profiles specified in this version of specification.
  • ph_poc_msb_present_flag 1 specifies that the syntax element poc_msb_val is present in the PH.
  • ph_poc_msb_present_flag 0 specifies that the syntax element poc_msb_val is not present in the PH.
  • vps_independent_layer_flag[ GeneralLayerIdx[ nuh_layer_id ] ] is equal to 0 and there is a picture in the current AU in a reference layer of the current layer
  • the value of ph_poc_msb_present_flag shall be equal to 0.
  • poc_msb_val specifies the POC MSB value of the current picture.
  • the length of the syntax element poc_msb_val is poc_msb_len_minus1 + 1 bits.
  • ph_alf_enabled_flag 1 specifies that adaptive loop filter is enabled for all slices associated with the PH and may be applied to Y, Cb, or Cr colour component in the slices.
  • ph_alf_enabled_flag 0 specifies that adaptive loop filter may be disabled for one, or more, or all slices associated with the PH. When not present, ph_alf_enabled_flag is inferred to be equal to 0.
  • ph num alf aps ids luma specifies the number of ALF APSs that the slices associated with the PH refers to.
  • ph_alf_aps_id_luma[ i ] specifies the adaptation_parameter_set_id of the i-th ALF APS that the luma component of the slices associated with the PH refers to.
  • alf_luma_filter_signal_flag of the APS NAL unit having aps_params_type equal to ALF_APS and adaptation_parameter_set_id equal to ph_alf_aps_id_luma[ i ] shall be equal to 1.
  • the TemporalId of the APS NAL unit having aps_params_type equal to ALF_APS and adaptation_parameter_set_id equal to ph_alf_aps_id_luma[ i ] shall be less than or equal to the TemporalId of the picture associated with the PH.
  • ph_alf_chroma_idc 0 specifies that the adaptive loop filter is not applied to Cb and Cr colour components.
  • ph_alf_chroma_idc 1 indicates that the adaptive loop filter is applied to the Cb colour component.
  • ph_alf_chroma_idc 2 indicates that the adaptive loop filter is applied to the Cr colour component.
  • ph_alf_chroma_idc indicates that the adaptive loop filter is applied to Cb and Cr colour components.
  • ph_alf_aps_id_chroma specifies the adaptation_parameter_set_id of the ALF APS that the chroma component of the slices associated with the PH refers to.
  • the value of alf_chroma_filter_signal_flag of the APS NAL unit having aps_params_type equal to ALF_APS and adaptation_parameter_set_id equal to ph_alf_aps_id_chroma shall be equal to 1.
  • the TemporalId of the APS NAL unit having aps_params_type equal to ALF_APS and adaptation_parameter_set_id equal to ph_alf_aps_id_chroma shall be less than or equal to the TemporalId of the picture associated with the PH.
  • ph_cc_alf_cb_enabled_flag 1 specifies that cross-component filter for Cb colour component is enabled for all slices associated with the PH and may be applied to Cb colour component in the slices.
  • ph_cc_alf_cb_enabled_flag 0 specifies that cross- component filter for Cb colour component may be disabled for one, or more, or all slices associated with the PH.
  • ph_cc_alf_cb_enabled_flag is inferred to be equal to 0.
  • ph_cc_alf_cb_aps_id specifies the adaptation_parameter_set_id of the ALF APS that the Cb colour component of the slices associated with the PH refers to.
  • the value of alf_cc_cb_filter_signal_flag of the APS NAL unit having aps_params_type equal to ALF_APS and adaptation_parameter_set_id equal to ph_cc_alf_cb_aps_id shall be equal to 1.
  • the TemporalId of the APS NAL unit having aps_params_type equal to ALF_APS and adaptation_parameter_set_id equal to ph_cc_alf_cb_aps_id shall be less than or equal to the TemporalId of the picture associated with the PH.
  • ph_cc_alf_cr_enabled_flag 1 specifies that cross-compoent filter for Cr colour component is enabled for all slices associated with the PH and may be applied to Cr colour component in the slices.
  • ph_cc_alf_cr_enabled_flag 0 specifies that cross- component filter for Cr colour component may be disabled for one, or more, or all slices associated with the PH.
  • ph_cc_alf_cr_enabled_flag is inferred to be equal to 0.
  • ph_cc_alf_cr_aps_id specifies the adaptation_parameter_set_id of the ALF APS that the Cr colour component of the slices associated with the PH refers to.
  • the value of alf_cc_cr_filter_signal_flag of the APS NAL unit having aps_params_type equal to ALF_APS and adaptation_parameter_set_id equal to ph_cc_alf_cr_aps_id shall be equal to 1.
  • the TemporalId of the APS NAL unit having aps_params_type equal to ALF_APS and adaptation_parameter_set_id equal to ph_cc_alf_cr_aps_id shall be less than or equal to the TemporalId of the picture associated with the PH.
  • ph_lmcs_enabled_flag 1 specifies that luma mapping with chroma scaling is enabled for all slices associated with the PH.
  • ph_lmcs_enabled_flag 0 specifies that luma mapping with chroma scaling may be disabled for one, or more, or all slices associated with the PH.
  • ph_lmcs_enabled_flag When not present, the value of ph_lmcs_enabled_flag is inferred to be equal to 0.
  • ph_lmcs_aps_id specifies the adaptation_parameter_set_id of the LMCS APS that the slices associated with the PH refers to.
  • the TemporalId of the APS NAL unit having aps_params_type equal to LMCS_APS and adaptation_parameter_set_id equal to ph_lmcs_aps_id shall be less than or equal to the TemporalId of the picture associated with PH.
  • ph_chroma_residual_scale_flag 1 specifies that chroma residual scaling is enabled for the all slices associated with the PH.
  • ph_chroma_residual_scale_flag 0 specifies that chroma residual scaling may be disabled for one, or more, or all slices associated with the PH.
  • ph_chroma_residual_scale_flag not present, it is inferred to be equal to 0.
  • ph_scaling_list_present_flag 1 specifies that the scaling list data used for the slices associated with the PH is derived based on the scaling list data contained in the referenced scaling list APS.
  • ph_scaling_list_present_flag 0 specifies that the scaling list data used for the slices associated with the PH is set to be equal to 16. When not present, the value of ph_scaling_list_present_flag is inferred to be equal to 0.
  • ph_scaling_list_aps_id specifies the adaptation_parameter_set_id of the scaling list APS.
  • the TemporalId of the APS NAL unit having aps_params_type equal to SCALING_APS and adaptation_parameter_set_id equal to ph_scaling_list_aps_id shall be less than or equal to the TemporalId of the picture associated with PH.
  • ph_virtual_boundaries_present_flag 1 specifies that information of virtual boundaries is signalled in the PH.
  • ph_virtual_boundaries_present_flag 0 specifies that information of virtual boundaries is not signalled in the PH.
  • the in-loop filtering operations are disabled across the virtual boundaries in the picture.
  • the in-loop filtering operations include the deblocking filter, sample adaptive offset filter, and adaptive loop filter operations.
  • the value of ph_virtual_boundaries_present_flag is inferred to be equal to 0.
  • ph_num_ver_virtual_boundaries specifies the number of ph_virtual_boundaries_pos_x[ i ] syntax elements that are present in the PH.
  • the value of ph_virtual_boundaries_pos_x[ i ] shall be in the range of 1 to Ceil( pic_width_in_luma_samples ⁇ 8 ) ⁇ 1, inclusive.
  • the distance between any two vertical virtual boundaries shall be greater than or equal to CtbSizeY luma samples.
  • ph_num_hor_virtual_boundaries specifies the number of ph_virtual_boundaries_pos_y[ i ] syntax elements that are present in the PH. When ph_num_hor_virtual_boundaries is not present, it is inferred to be equal to 0.
  • sps_num_hor_virtual_boundaries : ph_num_hor_virtual_boundaries (85)
  • sps_virtual_boundaries_enabled_flag is equal to 1
  • ph_virtual_boundaries_present_flag is equal to 1
  • the sum of ph_num_ver_virtual_boundaries and ph_num_hor_virtual_boundaries shall be greater than 0.
  • ph_virtual_boundaries_pos_y[ i ] specifies the location of the i-th horizontal virtual boundary in units of luma samples divided by 8.
  • the value of ph_virtual_boundaries_pos_y[ i ] shall be in the range of 1 to Ceil( pic height in luma samples ⁇ 8 ) ⁇ 1, inclusive.
  • pic_output_flag affects the decoded picture output and removal processes as specified in Annex C. When pic_output_flag is not present, it is inferred to be equal to 1.
  • partition_constraints_override_flag 1 specifies that partition constraint parameters are present in the PH.
  • partition_constraints_override_flag 0 specifies that partition constraint parameters are not present in the PH.
  • partition_constraints_override_flag When not present, the value of partition_constraints_override_flag is inferred to be equal to 0.
  • ph_log2_diff_min_qt_min_cb_intra_slice_luma specifies the difference between the base 2 logarithm of the minimum size in luma samples of a luma leaf block resulting from quadtree splitting of a CTU and the base 2 logarithm of the minimum coding block size in luma samples for luma CUs in the slices with slice_type equal to 2 (I) associated with the PH.
  • ph_log2_diff_min_qt_min_cb_intra_slice_luma shall be in the range of 0 to CtbLog2SizeY ⁇ MinCbLog2SizeY, inclusive.
  • the value of ph_log2_diff_min_qt_min_cb_luma is inferred to be equal to sps_log2_diff_min_qt_min_cb_intra_slice_luma.
  • ph_max_mtt_hierarchy_depth_intra_slice_luma specifies the maximum hierarchy depth for coding units resulting from multi-type tree splitting of a quadtree leaf in slices with slice_type equal to 2 (I) associated with the PH.
  • the value of ph_max_mtt_hierarchy_depth_intra_slice_luma shall be in the range of 0 to 2*( CtbLog2SizeY ⁇ MinCbLog2SizeY ), inclusive.
  • ph_max_mtt_hierarchy_depth_intra_slice_luma is inferred to be equal to sps_max_mtt_hierarchy_depth_intra_slice_luma.
  • ph_log2_diff_max_bt_min_qt_intra_slice_luma specifies the difference between the base 2 logarithm of the maximum size (width or height) in luma samples of a luma coding block that can be split using a binary split and the minimum size (width or height) in luma samples of a luma leaf block resulting from quadtree splitting of a CTU in slices with slice_type equal to 2 (I) associated with the PH.
  • the value of ph_log2_diff_max_bt_min_qt_intra_slice_luma shall be in the range of 0 to CtbLog2SizeY ⁇ MinQtLog2SizeIntraY, inclusive.
  • ph_log2_diff_max_bt_min_qt_intra_slice_luma is inferred to be equal to sps_log2_diff_max_bt_min_qt_intra_slice_luma.
  • ph_log2_diff_max_tt_min_qt_intra_slice_luma specifies the difference between the base 2 logarithm of the maximum size (width or height) in luma samples of a luma coding block that can be split using a ternary split and the minimum size (width or height) in luma samples of a luma leaf block resulting from quadtree splitting of a CTU in slices with slice_type equal to 2 (I) associated with the PH.
  • ph_log2_diff_max_tt_min_qt_intra_slice_luma shall be in the range of 0 to CtbLog2SizeY ⁇ MinQtLog2SizeIntraY, inclusive.
  • the value of ph_log2_diff_max_tt_min_qt_intra_slice_luma is inferred to be equal to sps_log2_diff_max_tt_min_qt_intra_slice_luma.
  • ph_log2_diff_min_qt_min_cb_intra_slice_chroma specifies the difference between the base 2 logarithm of the minimum size in luma samples of a chroma leaf block resulting from quadtree splitting of a chroma CTU with treeType equal to DUAL_TREE_CHROMA and the base 2 logarithm of the minimum coding block size in luma samples for chroma CUs with treeType equal to DUAL_TREE_CHROMA in slices with slice_type equal to 2 (I) associated with the PH.
  • ph_log2_diff_min_qt_min_cb_intra_slice_chroma shall be in the range of 0 to CtbLog2SizeY ⁇ MinCbLog2SizeY, inclusive.
  • the value of ph_log2_diff_min_qt_min_cb_intra_slice_chroma is inferred to be equal to sps_log2_diff_min_qt_min_cb_intra_slice_chroma.
  • ph_max_mtt_hierarchy_depth_intra_slice_chroma specifies the maximum hierarchy depth for chroma coding units resulting from multi-type tree splitting of a chroma quadtree leaf with treeType equal to DUAL_TREE_CHROMA in slices with slice_type equal to 2 (I) associated with the PH.
  • the value of ph_max_mtt_hierarchy_depth_intra_slice_chroma shall be in the range of 0 to 2*( CtbLog2SizeY ⁇ MinCbLog2SizeY ), inclusive.
  • ph_max_mtt_hierarchy_depth_intra_slice_chroma is inferred to be equal to sps_max_mtt_hierarchy_depth_intra_slice_chroma.
  • ph_log2_diff_max_bt_min_qt_intra_slice_chroma specifies the difference between the base 2 logarithm of the maximum size (width or height) in luma samples of a chroma coding block that can be split using a binary split and the minimum size (width or height) in luma samples of a chroma leaf block resulting from quadtree splitting of a chroma CTU with treeType equal to DUAL_TREE_CHROMA in slices with slice_type equal to 2 (I) associated with the PH.
  • ph_log2_diff_max_bt_min_qt_intra_slice_chroma shall be in the range of 0 to CtbLog2SizeY ⁇ MinQtLog2SizeIntraC, inclusive.
  • the value of ph_log2_diff_max_bt_min_qt_intra_slice_chroma is inferred to be equal to sps_log2_diff_max_bt_min_qt_intra_slice_chroma.
  • ph_log2_diff_max_tt_min_qt_intra_slice_chroma specifies the difference between the base 2 logarithm of the maximum size (width or height) in luma samples of a chroma coding block that can be split using a ternary split and the minimum size (width or height) in luma samples of a chroma leaf block resulting from quadtree splitting of a chroma CTU with treeType equal to DUAL_TREE_CHROMA in slices with slice_type equal to 2 (I) associated with the PH.
  • ph_log2_diff_max_tt_min_qt_intra_slice_chroma shall be in the range of 0 to CtbLog2SizeY ⁇ MinQtLog2SizeIntraC, inclusive.
  • ph_log2_diff_max_tt_min_qt_intra_slice_chroma is inferred to be equal to sps_log2_diff_max_tt_min_qt_intra_slice_chroma
  • ph_cu_qp_delta_subdiv_intra_slice specifies the maximum cbSubdiv value of coding units in intra slice that convey cu_qp_delta_abs and cu_qp_delta_sign_flag.
  • ph_cu_qp_delta_subdiv_intra_slice shall be in the range of 0 to 2 * ( CtbLog2SizeY ⁇ MinQtLog2SizeIntraY + ph_max_mtt_hierarchy_depth_intra_slice_ luma ), inclusive.
  • the value of ph_cu_qp_delta_subdiv_intra_slice is inferred to be equal to 0.
  • ph_cu_chroma_qp_offset_subdiv_intra_slice specifies the maximum cbSubdiv value of coding units in intra slice that convey cu_chroma_qp_offset_flag.
  • ph_cu_chroma_qp_offset_subdiv_intra_slice shall be in the range of 0 to 2 * ( CtbLog2SizeY ⁇ MinQtLog2SizeIntraY + ph_max_mtt_hierarchy_depth_intra_slice_ luma ), inclusive.
  • the value of ph_cu_chroma_qp_offset_subdiv_intra_slice is inferred to be equal to 0.
  • ph_log2_diff_min_qt_min_cb_inter_slice specifies the difference between the base 2 logarithm of the minimum size in luma samples of a luma leaf block resulting from quadtree splitting of a CTU and the base 2 logarithm of the minimum luma coding block size in luma samples for luma CUs in the slices with slice_type equal to 0 (B) or 1 (P) associated with the PH.
  • the value of ph_log2_diff_min_qt_min_cb_inter_slice shall be in the range of 0 to CtbLog2SizeY ⁇ MinCbLog2SizeY, inclusive.
  • ph_log2_diff_min_qt_min_cb_luma is inferred to be equal to sps_log2_diff_min_qt_min_cb_inter_slice.
  • ph_max_mtt_hierarchy_depth_inter_slice specifies the maximum hierarchy depth for coding units resulting from multi-type tree splitting of a quadtree leaf in slices with slice_type equal to 0 (B) or 1 (P) associated with the PH.
  • the value of ph_max_mtt_hierarchy_depth_inter_slice shall be in the range of 0 to 2*( CtbLog2SizeY ⁇ MinCbLog2SizeY ), inclusive.
  • ph_max_mtt_hierarchy_depth_inter_slice When not present, the value of ph_max_mtt_hierarchy_depth_inter_slice is inferred to be equal to sps_max_mtt_hierarchy_depth_inter_slice.
  • ph_log2_diff_max_bt_min_qt_inter_slice specifies the difference between the base 2 logarithm of the maximum size (width or height) in luma samples of a luma coding block that can be split using a binary split and the minimum size (width or height) in luma samples of a luma leaf block resulting from quadtree splitting of a CTU in the slices with slice_type equal to 0 (B) or 1 (P) associated with the PH.
  • ph_log2_diff_max_bt_min_qt_inter_slice shall be in the range of 0 to CtbLog2SizeY ⁇ MinQtLog2SizeInterY, inclusive.
  • the value of ph_log2_diff_max_bt_min_qt_inter_slice is inferred to be equal to sps_log2_diff_max_bt_min_qt_inter_slice.
  • ph_log2_diff_max_tt_min_qt_inter_slice specifies the difference between the base 2 logarithm of the maximum size (width or height) in luma samples of a luma coding block that can be split using a ternary split and the minimum size (width or height) in luma samples of a luma leaf block resulting from quadtree splitting of a CTU in slices with slice_type equal to 0 (B) or 1 (P) associated with the PH.
  • the value of ph_log2_diff_max_tt_min_qt_inter_slice shall be in the range of 0 to CtbLog2SizeY ⁇ MinQtLog2SizeInterY, inclusive.
  • ph_log2_diff_max_tt_min_qt_inter_slice is inferred to be equal to sps_log2_diff_max_tt_min_qt_inter_slice.
  • ph_cu_qp_delta_subdiv_inter_slice specifies the maximum cbSubdiv value of coding units that in inter slice convey cu_qp_delta_abs and cu_qp_delta_sign_flag.
  • ph_cu_qp_delta_subdiv_inter_slice shall be in the range of 0 to 2 * ( CtbLog2SizeY ⁇ MinQtLog2SizeInterY + ph_max_mtt_hierarchy_depth_inter_slice ) , inclusive.
  • the value of ph_cu_qp_delta_subdiv_inter_slice is inferred to be equal to 0.
  • ph_cu_chroma_qp_offset_subdiv_inter_slice specifies the maximum cbSubdiv value of coding units in inter slice that convey cu_chroma_qp_offset_flag.
  • ph_cu_chroma_qp_offset_subdiv_inter_slice shall be in the range of 0 to 2 * ( CtbLog2SizeY ⁇ MinQtLog2SizeInterY + ph_max_mtt_hierarchy_depth_inter_slice ) , inclusive.
  • the value of ph_cu_chroma_qp_offset_subdiv_inter_slice is inferred to be equal to 0.
  • ph_temporal_mvp_enabled_flag specifies whether temporal motion vector predictors can be used for inter prediction for slices associated with the PH.
  • ph_temporal_mvp_enabled_flag 0
  • the syntax elements of the slices associated with the PH shall be constrained such that no temporal motion vector predictor is used in decoding of the slices. Otherwise (ph_temporal_mvp_enabled_flag is equal to 1), temporal motion vector predictors may be used in decoding of the slices associated with the PH.
  • the value of ph_temporal_mvp_enabled_flag is inferred to be equal to 0.
  • the value of ph_temporal_mvp_enabled_flag shall be equal to 0.
  • ph_collocated_from_l0_flag 1 specifies that the collocated picture used for temporal motion vector prediction is derived from reference picture list 0.
  • ph_collocated_from_l0_flag 0 specifies that the collocated picture used for temporal motion vector prediction is derived from reference picture list 1.
  • ph_collocated_ref_idx specifies the reference index of the collocated picture used for temporal motion vector prediction.
  • ph_collocated_from_l0_flag When ph_collocated_from_l0_flag is equal to 1, ph_collocated_ref_idx refers to an entry in reference picture list 0, and the value of ph_collocated_ref_idx shall be in the range of 0 to num_ref_entries[ 0 ][ RplsIdx[ 0 ] ] ⁇ 1, inclusive.
  • ph_collocated_from_l0_flag When ph_collocated_from_l0_flag is equal to 0, ph_collocated_ref_idx refers to an entry in reference picture list 1, and the value of ph_collocated_ref_idx shall be in the range of 0 to num_ref_entries[ 1 ][ RplsIdx[ 1 ] ] ⁇ 1, inclusive. When not present, the value of ph_collocated_ref_idx is inferred to be equal to 0.
  • mvd_l1_zero_flag 0 indicates that the mvd coding( x0, y0, 1 ) syntax structure is parsed.
  • ph_fpel_mmvd_enabled_flag 1 specifies that merge mode with motion vector difference uses integer sample precision in the slices associated with the PH.
  • ph_fpel_mmvd_enabled_flag 0 specifies that merge mode with motion vector difference can use fractional sample precision in the slices associated with the PH.
  • the value of ph_fpel_mmvd_enabled_flag is inferred to be 0.
  • ph_disable_bdof_flag 1 specifies that bi-directional optical flow inter prediction based inter bi-prediction is disabled in the slices associated with the PH.
  • ph_disable_bdof_flag 0 specifies that bi-directional optical flow inter prediction based inter bi-prediction may or may not be enabled in the slices associated with the PH.
  • ph_disable_bdof_flag not present, the following applies: – If sps_bdof_enabled_flag is equal to 1, the value of ph_disable_bdof_flag is inferred to be equal to 0. – Otherwise (sps_bdof_enabled_flag is equal to 0), the value of ph_disable_bdof_flag is inferred to be equal to 1.
  • ph_disable_dmvr_flag 1 specifies that decoder motion vector refinement based inter bi-prediction is disabled in the slices associated with the PH.
  • ph_disable_dmvr_flag 0 specifies that decoder motion vector refinement based inter bi-prediction may or may not be enabled in the slices associated with the PH.
  • ph_disable_dmvr_flag is not present, the following applies: – If sps_dmvr_enabled_flag is equal to 1, the value of ph_disable_dmvr_flag is inferred to be equal to 0.
  • ph_disable_dmvr_flag 1 specifies that prediction refinement with optical flow is disabled in the slices associated with the PH.
  • ph_disable_prof_flag 0 specifies that prediction refinement with optical flow may or may not be enabled in the slices associated with the PH.
  • ph_disable_prof_flag When ph_disable_prof_flag is not present, the following applies: – If sps_affine_prof_enabled_flag is equal to 1, the value of ph_disable_prof_flag is inferred to be equal to 0. – Otherwise (sps_affine_prof_enabled_flag is equal to 0), the value of ph_disable_prof_flag is inferred to be equal to 1.
  • ph_qp_delta specifies the initial value of QpY to be used for the coding blocks in the picture until modified by the value of CuQpDeltaVal in the coding unit layer.
  • qp_delta_info_in_ph_flag 1
  • the value of SliceQpY shall be in the range of ⁇ QpBdOffset to +63, inclusive.
  • ph_joint_cbcr_sign_flag specifies whether, in transform units with tu_joint_cbcr_residual_flag[ x0 ][ y0 ] equal to 1, the collocated residual samples of both chroma components have inverted signs.
  • ph_sao_chroma_enabled_flag When ph_sao_chroma_enabled_flag is not present, it is inferred to be equal to 0.
  • ph_dep_quant_enabled_flag 0 specifies that dependent quantization is disabled for the current picture.
  • ph_dep_quant_enabled_flag 1 specifies that dependent quantization is enabled for the current picture.
  • pic_sign_data_hiding_enabled_flag 0 specifies that sign bit hiding is disabled for the current picture.
  • pic_sign_data_hiding_enabled_flag 1 specifies that sign bit hiding is enabled for the current picture.
  • pic_sign_data_hiding_enabled_flag not present, it is inferred to be equal to 0.
  • ph_deblocking_filter_override_flag 1 specifies that deblocking parameters are present in the PH.
  • ph_deblocking_filter_override_flag 0 specifies that deblocking parameters are not present in the PH.
  • ph_deblocking_filter_override_flag When not present, the value of ph_deblocking_filter_override_flag is inferred to be equal to 0.
  • ph_deblocking_filter_disabled_flag 1 specifies that the operation of the deblocking filter is not applied for the slices associated with the PH.
  • ph_deblocking_filter_disabled_flag 0 specifies that the operation of the deblocking filter is applied for the slices associated with the PH.
  • ph_deblocking_filter_disabled_flag When ph_deblocking_filter_disabled_flag is not present, it is inferred to be equal to pps_deblocking_filter_disabled_flag.
  • ph_beta_offset_div2 and ph_tc_offset_div2 specify the deblocking parameter offsets for ⁇ and tC (divided by 2) that are applied to the luma component for the slices associated with the PH.
  • the values of ph_beta_offset_div2 and ph_tc_offset_div2 shall both be in the range of ⁇ 12 to 12, inclusive.
  • the values of ph_beta_offset_div2 and ph_tc_offset_div2 are inferred to be equal to pps_beta_offset_div2 and pps_tc_offset_div2, respectively.
  • ph_cb_beta_offset_div2 and ph_cb_tc_offset_div2 specify the deblocking parameter offsets for ⁇ and tC (divided by 2) that are applied to the Cb component for the slices associated with the PH.
  • the values of ph_cb_beta_offset_div2 and ph_cb_tc_offset_div2 shall both be in the range of ⁇ 12 to 12, inclusive. When not present, the values of
  • the partition constraint syntax elements are grouped by prediction type, with intra prediction related first, followed by inter prediction related.
  • the order of the partition constraint syntax elements in SPS is consistent with the order of the partition constraint syntax elements in the picture header.
  • An example of the decoding process on VVC Draft is illustrated in Table 12 below. The changes to the VVC Draft are shown using bold and italicized font. The strikethrough text is removed from the process. Table 12. Proposed sequence parameter set RBSP syntax [0061]
  • FIG.4 shows a method for decoding a video signal in accordance with the present disclosure. The method may be, for example, applied to a decoder.
  • the decoder may receive arranged partition constraint syntax elements in SPS level.
  • the decoder may obtain a first reference picture I (0) and a second reference picture I (1) associated with a video block in a bitstream.
  • the first reference picture I (0) is before a current picture and the second reference picture I (1) is after the current picture in display order.
  • the decoder may obtain first prediction samples I (0) (i, j) of the video block from a reference block in the first reference picture I (0) .
  • the i and j represent a coordinate of one sample with the current picture.
  • the decoder may obtain second prediction samples I (1) (i, j) of the video block from a reference block in the second reference picture I (1) .
  • the decoder may obtain bi-prediction samples based on the arranged partition constraint syntax elements, the first prediction samples I (0) (i, j) , and the second prediction samples I (1) (i, j).
  • FIG.5 shows a method for decoding a video signal in accordance with the present disclosure. The method may be, for example, applied to a decoder.
  • the decoder may receive a bitstream that includes VPS, SPS, PPS, picture header, and slice header for coded video data.
  • the decoder may decode the VPS.
  • the decoder may decode the SPS and obtain an arranged partition constraint syntax elements in SPS level.
  • the decoder may decode the PPS.
  • the decoder may decode the picture header.
  • the decoder may decode the slice header.
  • the decoder may decode the video data based on VPS, SPS, PPS, picture header and slice header.
  • the partition constraint syntax elements for dual-tree chroma in SPS should be signaled together under dual-tree chroma cases.
  • Table 13 An example of the decoding process on VVC Draft is illustrated in Table 13 below. The changes to the VVC Draft are shown using bold and italicized font. The strikethrough text is removed from the process. Table 13. Proposed sequence parameter set RBSP syntax [0070] If also considering defining intra prediction related syntaxes prior to those related to inter prediction, according to the method of the disclosure, another example of the decoding process on VVC Draft is illustrated in table 14 below. The changes to the VVC Draft are shown using bold and italicized font The strikethrough text is removed from the process Table 14.
  • Proposed sequence parameter set RBSP syntax [0071] Conditionally Signaling Inter-Prediction Related Syntax Elements [0072] As mentioned in the earlier description, according to the earlier description, according to the earlier description, according to the earlier description, according to the earlier description, according to the earlier description, according to the earlier description, according to the earlier description, according to the earlier description, it is proposed to add a flag in VVC syntax at a certain coding level to indicate if inter prediction is used or not in a sequence, picture and/or slice. In case inter prediction is not used, inter prediction related syntaxes are not signaled at the corresponding coding level, e.g., sequence, picture and/or slice level. [0073] In one example, according to the method of the disclosure, a flag is added in SPS to indicate if inter prediction is used in coding the current video sequence.
  • inter prediction related syntax elements are not signaled in SPS.
  • An example of the decoding process on VVC Draft is illustrated in Table 15 below. The changes to the VVC Draft are shown using bold and italicized font. Table 15. Proposed sequence parameter set RBSP syntax
  • sps_inter_slice_used_flag 0 specifies that all coded slices of the video sequence have slice_type equal to 2.
  • sps_inter_slice_used_flag 1 specifies that there may or may not be one or more coded slices in the video sequence that have slice_type equal to 0 or 1.
  • FIG.6 shows a computing environment 610 coupled with a user interface 660.
  • the computing environment 610 can be part of a data processing server.
  • the computing environment 610 includes processor 620, memory 640, and I/O interface 650.
  • the processor 620 typically controls overall operations of the computing environment 610, such as the operations associated with the display, data acquisition, data communications, and image processing.
  • the processor 620 may include one or more processors to execute instructions to perform all or some of the steps in the above-described methods.
  • the processor 620 may include one or more modules that facilitate the interaction between the processor 620 and other components.
  • the processor may be a Central Processing Unit (CPU), a microprocessor, a single chip machine, a GPU, or the like.
  • the memory 640 is configured to store various types of data to support the operation of the computing environment 610. Memory 640 may include predetermine software 642.
  • the memory 640 may be implemented by using any type of volatile or non-volatile memory devices, or a combination thereof, such as a static random access memory (SRAM), an electrically erasable programmable read-only memory (EEPROM), an erasable programmable read-only memory (EPROM), a programmable read-only memory (PROM), a read-only memory (ROM), a magnetic memory, a flash memory, a magnetic or optical disk.
  • SRAM static random access memory
  • EEPROM electrically erasable programmable read-only memory
  • EPROM erasable programmable read-only memory
  • PROM programmable read-only memory
  • ROM read-only memory
  • magnetic memory a magnetic memory
  • flash memory a flash memory
  • magnetic or optical disk a magnetic or optical disk.
  • the I/O interface 650 provides an interface between the processor 620 and peripheral interface modules, such as a keyboard, a click wheel, buttons, and the like.
  • the buttons may include but are not limited to, a home button, a start scan button, and a stop scan button.
  • the I/O interface 650 can be coupled with an encoder and decoder.
  • non-transitory computer-readable storage medium comprising a plurality of programs, such as comprised in the memory 640, executable by the processor 620 in the computing environment 610, for performing the above- described methods.
  • the non-transitory computer-readable storage medium may be a ROM, a RAM, a CD-ROM, a magnetic tape, a floppy disc, an optical data storage device or the like.
  • the non-transitory computer-readable storage medium has stored therein a plurality of programs for execution by a computing device having one or more processors, where the plurality of programs when executed by the one or more processors, cause the computing device to perform the above-described method for motion prediction.
  • the computing environment 610 may be implemented with one or more application-specific integrated circuits (ASICs), digital signal processors (DSPs), digital signal processing devices (DSPDs), programmable logic devices (PLDs), field- programmable gate arrays (FPGAs), graphical processing units (GPUs), controllers, micro controllers, microprocessors, or other electronic components, for performing the above methods.
  • ASICs application-specific integrated circuits
  • DSPs digital signal processors
  • DSPDs digital signal processing devices
  • PLDs programmable logic devices
  • FPGAs field- programmable gate arrays
  • GPUs graphical processing units
  • controllers micro controllers, microprocessors, or other electronic components, for performing the above methods.

Abstract

Methods, apparatuses, and non-transitory computer-readable storage medium are provided for decoding video signals. A decoder receives arranged partition constraint syntax elements in sequence parameter set (SPS) level. The arranged partition constraint syntax elements are arranged so that intra prediction related syntax elements are defined before inter prediction related syntax elements. The decoder obtains a first reference picture I (0) and a second reference picture I (1) associated with a video block in a bitstream. The decoder obtains first prediction samples I (0)(i, j) of the video block from a reference block in the first reference picture I (0). The decoder obtains second prediction samples I (1) (i, j) of the video block from a reference block in the second reference picture I (1) . The decoder obtains bi-prediction samples based on the arranged partition constraint syntax elements, the first prediction samples I (0)(i, j), and the second prediction samples I (1) (i, j).

Description

METHODS AND DEVICES FOR HIGH-LEVEL SYNTAX IN VIDEO CODING CROSS-REFERENCE TO RELATED APPLICATION [0001] This application is based upon and claims priority to Provisional Applications No. 63/003,229 filed on March 31, 2020, the entire contents thereof are incorporated herein by reference in their entireties for all purposes. TECHNICAL FIELD [0002] This disclosure is related to video coding and compression. More specifically, this application relates to methods and apparatuses for high-level syntax in video bitstream applicable to one or more video coding standards. BACKGROUND [0003] Various video coding techniques may be used to compress video data. Video coding is performed according to one or more video coding standards. For example, video coding standards include versatile video coding (VVC), joint exploration test model (JEM), high- efficiency video coding (H.265/HEVC), advanced video coding (H.264/AVC), moving picture expert group (MPEG) coding, or the like. Video coding generally utilizes prediction methods (e.g., inter-prediction, intra-prediction, or the like) that take advantage of redundancy present in video images or sequences. An important goal of video coding techniques is to compress video data into a form that uses a lower bit rate while avoiding or minimizing degradations to video quality. SUMMARY [0004] Examples of the present disclosure provide methods and apparatus for high-level syntax coding in video coding. [0005] According to a first aspect of the present disclosure, a method for decoding a video signal is provided. The method may include receiving arranged partition constraint syntax elements in sequence parameter set (SPS) level. The arranged partition constraint syntax elements may be arranged so that intra prediction related syntax elements may be defined before inter prediction related syntax elements. The decoder may also obtain a first reference picture I(0) and a second reference picture I(1) associated with a video block in a bitstream. The first reference picture I(0) may be before a current picture and the second reference picture I(1) may be after the current picture in display order. The decoder may also obtain first prediction samples I(0)(i, j) of the video block from a reference block in the first reference picture I(0). The i and j may represent a coordinate of one sample with the current picture. The decoder may also obtain second prediction samples I(1)(i, j) of the video block from a reference block in the second reference picture I(1). The decoder may also obtain bi-prediction samples based on the arranged partition constraint syntax elements, the first prediction samples I(0)(i, j), and the second prediction samples I(1)(i, j). [0006] According to a second aspect of the present disclosure, a computing device is provided. The computing device may include one or more processors, a non-transitory computer-readable memory storing instructions executable by the one or more processors. The one or more processors may be configured to receive arranged partition constraint syntax elements in SPS level. The arranged partition constraint syntax elements may be arranged so that intra prediction related syntax elements may be defined before inter prediction related syntax elements. The one or more processors may further be configured to obtain a first reference picture I(0) and a second reference picture I(1) associated with a video block in a bitstream. The first reference picture I(0) may be before a current picture and the second reference picture I(1) may be after the current picture in display order. The one or more processors may further be configured to obtain first prediction samples I(0)(i, j) of the video block from a reference block in the first reference picture I(0). The i and j may represent a coordinate of one sample with the current picture. The one or more processors may further be configured to obtain second prediction samples I(1)(i, j) of the video block from a reference block in the second reference picture I(1) . The one or more processors may further be configured to obtain bi-prediction samples based on the arranged partition constraint syntax elements, the first prediction samples I(0)(i, j), and the second prediction samples I(1)(i, j). [0007] According to a third aspect of the present disclosure, a non-transitory computer- readable storage medium having stored therein instructions is provided. When the instructions are executed by one or more processors of the apparatus, the instructions may cause the apparatus to receive arranged partition constraint syntax elements in SPS level. The arranged partition constraint syntax elements may be arranged so that intra prediction related syntax elements may be defined before inter prediction related syntax elements. The instructions may also cause the apparatus to obtain a first reference picture I(0) and a second reference picture I(1) associated with a video block in a bitstream. The first reference picture I(0) may be before a current picture and the second reference picture I(1) may be after the current picture in display order. The instructions may also cause the apparatus to obtain first prediction samples I(0)(i, j) of the video block from a reference block in the first reference picture I(0). The i and j may represent a coordinate of one sample with the current picture. The instructions may also cause the apparatus to obtain second prediction samples I(1)(i, j) of the video block from a reference block in the second reference picture I(1). The instructions may also cause the apparatus to obtain bi-prediction samples based on the arranged partition constraint syntax elements, the first prediction samples I(0)(i, j), and the second prediction samples I(1)(i, j). [0008] It is to be understood that the above general descriptions and detailed descriptions below are only exemplary and explanatory and not intended to limit the present disclosure. BRIEF DESCRIPTION OF THE DRAWINGS [0009] The accompanying drawings, which are incorporated in and constitute a part of this specification, illustrate examples consistent with the present disclosure and, together with the description, serve to explain the principles of the disclosure. [0010] FIG. 1 is a block diagram of an encoder, according to an example of the present disclosure. [0011] FIG. 2 is a block diagram of a decoder, according to an example of the present disclosure. [0012] FIG. 3A is a diagram illustrating block partitions in a multi-type tree structure, according to an example of the present disclosure. [0013] FIG. 3B is a diagram illustrating block partitions in a multi-type tree structure, according to an example of the present disclosure. [0014] FIG. 3C is a diagram illustrating block partitions in a multi-type tree structure, according to an example of the present disclosure. [0015] FIG. 3D is a diagram illustrating block partitions in a multi-type tree structure, according to an example of the present disclosure. [0016] FIG. 3E is a diagram illustrating block partitions in a multi-type tree structure, according to an example of the present disclosure. [0017] FIG. 4 is a method for decoding a video signal, according to an example of the present disclosure. [0018] FIG. 5 is a method for decoding a video signal, according to an example of the presentt d diiscllosure. [0019] FIG. 6 is a diagram illustrating a computing environment coupled with a user interface, according to an example of the present disclosure. DETAILED DESCRIPTION [0020] Reference will now be made in detail to exemplary embodiments, examples of which are illustrated in the accompanying drawings. The following description refers to the accompanying drawings in which the same numbers in different drawings represent the same or similar elements unless otherwise represented. The implementations set forth in the following description of exemplary embodiments do not represent all implementations consistent with the present disclosure. Instead, they are merely examples of apparatuses and methods consistent with aspects related to the present disclosure, as recited in the appended claims. [0021] The terminology used in the present disclosure is for the purpose of describing particular embodiments only and is not intended to limit the present disclosure. As used in the present disclosure and the appended claims, the singular forms “a,” “an,” and “the” are intended to include the plural forms as well, unless the context clearly indicates otherwise. It shall also be understood that the term “and/or” used herein is intended to signify and include any or all possible combinations of one or more of the associated listed items. [0022] It shall be understood that, although the terms “first,” “second,” “third,” etc., may be used herein to describe various information, the information should not be limited by these terms. These terms are only used to distinguish one category of information from another. For example, without departing from the scope of the present disclosure, first information may be termed as second information; and similarly, second information may also be termed as first information. As used herein, the term “if” may be understood to mean “when” or “upon” or “in response to a judgment” depending on the context. [0023] The first version of the HEVC standard was finalized in October 2013, which offers approximately 50% bit-rate saving or equivalent perceptual quality compared to the prior generation video coding standard H.264/MPEG AVC. Although the HEVC standard provides significant coding improvements than its predecessor, there is evidence that superior coding efficiency can be achieved with additional coding tools over HEVC. Based on that, both VCEG and MPEG started the exploration work of new coding technologies for future video coding standardization. one Joint Video Exploration Team (JVET) was formed in Oct.2015 by ITU-T VECG and ISO/IEC MPEG to begin a significant study of advanced technologies that could enable substantial enhancement of coding efficiency. One reference software called the joint exploration model (JEM) was maintained by the JVET by integrating several additional coding tools on top of the HEVC test model (HM).
[0024] In Oct. 2017, the j oint call for proposals (CfP) on video compression with capability beyond HEVC was issued by ITU-T and ISO/IEC. In Apr. 2018, 23 CfP responses were received and evaluated at the 10-th JVET meeting, which demonstrated compression efficiency gain over the HEVC around 40%. Based on such evaluation results, the JVET launched a new project to develop the new generation video coding standard that is named Versatile Video Coding (VVC). In the same month, one reference software codebase, called VVC test model (VTM), was established for demonstrating a reference implementation of the VVC standard. [0025] Like HEVC, the VVC is built upon the block-based hybrid video coding framework.
[0026] FIG. 1 shows a general diagram of a block-based video encoder for the VVC. Specifically, FIG. 1 shows atypical encoder 100. The encoder 100 has video input 110, motion compensation 112, motion estimation 114, intra/inter mode decision 116, block predictor 140, adder 128, transform 130, quantization 132, prediction related info 142, intra prediction 118, picture buffer 120, inverse quantization 134, inverse transform 136, adder 126, memory 124, in-loop filter 122, entropy coding 138, and bitstream 144.
[0027] In the encoder 100, a video frame is partitioned into a plurality of video blocks for processing. For each given video block, a prediction is formed based on either an inter prediction approach or an intra prediction approach.
[0028] A prediction residual, representing the difference between a current video block, part of video input 110, and its predictor, part of block predictor 140, is sent to a transform 130 from adder 128. Transform coefficients are then sent from the Transform 130 to a Quantization 132 for entropy reduction. Quantized coefficients are then fed to an Entropy Coding 138 to generate a compressed video bitstream. As shown in FIG. 1, prediction related information 142 from an intra/inter mode decision 116, such as video block partition info, motion vectors (MVs), reference picture index, and intra prediction mode, are also fed through the Entropy Coding 138 and saved into a compressed bitstream 144. Compressed bitstream 144 includes a video bitstream.
[0029] In the encoder 100, decoder-related circuitries are also needed in order to reconstruct pixels for the purpose of prediction. First, a prediction residual is reconstructed through an Inverse Quantization 134 and an Inverse Transform 136. This reconstructed prediction residual is combined with a Block Predictor 140 to generate un-filtered reconstructed pixels for a current video block.
[0030] Spatial prediction (or “intra prediction”) uses pixels from samples of already coded neighboring blocks (which are called reference samples) in the same video frame as the current video block to predict the current video block.
[0031] Temporal prediction (also referred to as “inter prediction”) uses reconstructed pixels from already-coded video pictures to predict the current video block. Temporal prediction reduces temporal redundancy inherent in the video signal. The temporal prediction signal for a given coding unit (CU) or coding block is usually signaled by one or more MVs, which indicate the amount and the direction of motion between the current CU and its temporal reference. Further, if multiple reference pictures are supported, one reference picture index is additionally sent, which is used to identify from which reference picture in the reference picture storage, the temporal prediction signal comes from.
[0032] Motion estimation 114 intakes video input 110 and a signal from picture buffer 120 and output, to motion compensation 112, amotion estimation signal. Motion compensation 112 intakes video input 110, a signal from picture buffer 120, and motion estimation signal from motion estimation 114 and output to intra/inter mode decision 116, a motion compensation signal.
[0033] After spatial and/or temporal prediction is performed, an intra/inter mode decision 116 in the encoder 100 chooses the best prediction mode, for example, based on the rate- distortion optimization method. The block predictor 140 is then subtracted from the current video block, and the resulting prediction residual is de-correlated using the transform 130 and the quantization 132. The resulting quantized residual coefficients are inverse quantized by the inverse quantization 134 and inverse transformed by the inverse transform 136 to form the reconstructed residual, which is then added back to the prediction block to form the reconstructed signal of the CU. Further in-loop filtering 122, such as a deblocking filter, a sample adaptive offset (SAO), and/or an adaptive in-loop filter (ALF) may be applied on the reconstructed CU before it is put in the reference picture storage of the picture buffer 120 and used to code future video blocks. To form the output video bitstream 144, coding mode (inter or intra), prediction mode information, motion information, and quantized residual coefficients are all sent to the entropy coding unit 138 to be further compressed and packed to form the bitstream.
[0034] FIG. 1 gives the block diagram of a generic block-based hybrid video encoding system. The input video signal is processed block by block (called coding units (CUs)). In VTM-1.0, a CU can be up to 128x128 pixels. However, different from the HEVC, which partitions blocks only based on quad-trees, in the VVC, one coding tree unit (CTU) is split into CUs to adapt to varying local characteristics based on quad/binary/temary-tree. Additionally, the concept of multiple partition unit type in the HEVC is removed, i.e., the separation of CU, a prediction unit (PU), and a transform unit (TU) does not exist in the VVC anymore; instead, each CU is always used as the basic unit for both prediction and transform without further partitions. In the multi-type tree structure, one CTU is firstly partitioned by a quad-tree structure. Then, each quad-tree leaf node can be further partitioned by a binary and ternary tree structure.
[0035] As shown in FIG. 3A, 3B, 3C, 3D, and 3E, there are five splitting types, quaternary partitioning, horizontal binary partitioning, vertical binary partitioning, horizontal ternary partitioning, and vertical ternary partitioning.
[0036] FIG. 3 A shows a diagram illustrating block quaternary partition in a multi-type tree structure, in accordance with the present disclosure.
[0037] FIG. 3B shows a diagram illustrating block vertical binary partition in a multi-type tree structure, in accordance with the present disclosure.
[0038] FIG. 3C shows a diagram illustrating block horizontal binary partition in a multitype tree structure, in accordance with the present disclosure.
[0039] FIG. 3D shows a diagram illustrating block vertical ternary partition in a multi-type tree structure, in accordance with the present disclosure.
[0040] FIG. 3E shows a diagram illustrating block horizontal ternary partition in a multitype tree structure, in accordance with the present disclosure.
[0041] In FIG. 1, spatial prediction and/or temporal prediction may be performed. Spatial prediction (or “intra prediction”) uses pixels from the samples of already coded neighboring blocks (which are called reference samples) in the same video picture/slice to predict the current video block. Spatial prediction reduces spatial redundancy inherent in the video signal. Temporal prediction (also referred to as “inter prediction” or “motion compensated prediction”) uses reconstructed pixels from the already coded video pictures to predict the current video block. Temporal prediction reduces temporal redundancy inherent in the video signal. Temporal prediction signal for a given CU is usually signaled by one or more motion vectors (MVs) which indicate the amount and the direction of motion between the current CU and its temporal reference. Also, if multiple reference pictures are supported, one reference picture index is additionally sent, which is used to identify from which reference picture in the reference picture storage the temporal prediction signal comes from. After spatial and/or temporal prediction, the mode decision block in the encoder chooses the best prediction mode, for example, based on the rate-distortion optimization method. The prediction block is then subtracted from the current video block; and the prediction residual is de-correlated using transform and quantized. The quantized residual coefficients are inverse quantized and inverse transformed to form the reconstructed residual, which is then added back to the prediction block to form the reconstructed signal of the CU. Further in-loop filtering, such as deblocking filter, sample adaptive offset (SAO), and adaptive in-loop filter (ALF) may be applied on the reconstructed CU before it is put in the reference picture storage and used to code future video blocks. To form the output video bit-stream, coding mode (inter or intra), prediction mode information, motion information, and quantized residual coefficients are all sent to the entropy coding unit to be further compressed and packed to form the bit-stream.
[0042] FIG. 2 shows a general block diagram of a video decoder for the VVC. Specifically, FIG. 2 shows a typical decoder 200 block diagram. Decoder 200 has bitstream 210, entropy decoding 212, inverse quantization 214, inverse transform 216, adder 218, intra/inter mode selection 220, intra prediction 222, memory 230, in-loop filter 228, motion compensation 224, picture buffer 226, prediction related info 234, and video output 232.
[0043] Decoder 200 is similar to the reconstruction-related section residing in the encoder 100 of FIG. 1. In the decoder 200, an incoming video bitstream 210 is first decoded through an Entropy Decoding 212 to derive quantized coefficient levels and prediction-related information. The quantized coefficient levels are then processed through an Inverse Quantization 214 and an Inverse Transform 216 to obtain a reconstructed prediction residual. A block predictor mechanism, implemented in an Intra/inter Mode Selector 220, is configured to perform either an Intra Prediction 222 or a Motion Compensation 224, based on decoded prediction information. A set of unfiltered reconstructed pixels is obtained by summing up the reconstructed prediction residual from the Inverse Transform 216 and a predictive output generated by the block predictor mechanism, using a summer 218.
[0044] The reconstructed block may further go through an In-Loop Filter 228 before it is stored in a Picture Buffer 226, which functions as a reference picture store. The reconstructed video in the Picture Buffer 226 may be sent to drive a display device, as well as used to predict future video blocks. In situations where the In-Loop Filter 228 is turned on, a filtering operation is performed on these reconstructed pixels to derive a final reconstructed Video Output 232.
[0045] FIG. 2 gives a general block diagram of a block-based video decoder. The video bit- stream is first entropy decoded at entropy decoding unit. The coding mode and prediction information are sent to either the spatial prediction unit (if intra coded) or the temporal prediction unit (if inter-coded) to form the prediction block. The residual transform coefficients are sent to the inverse quantization unit and inverse transform unit to reconstruct the residual block. The prediction block and the residual block are then added together. The reconstructed block may further go through in-loop filtering before it is stored in reference picture storage. The reconstructed video in reference picture storage is then sent out to drive a display device, as well as used to predict future video blocks. [0046] In general, the basic intra prediction scheme applied in the VVC is kept the same as that of the HEVC, except that several modules are further extended and/or improved, e.g., matrix weighted intra prediction (MIP) coding mode, intra sub-partition (ISP) coding mode, extended intra prediction with wide-angle intra directions, position-dependent intra prediction combination (PDPC) and 4-tap intra interpolation. The main focus of the disclosure is to improve the existing high-level syntax design in the VVC standard. The related background knowledge is elaborated in the following sections. [0047] Like HEVC, VVC uses a Network Abstraction Layer (NAL) unit-based bitstream structure. A coded bitstream is partitioned into NAL units which, when conveyed over lossy packet networks, should be smaller than the maximum transfer unit size. Each NAL unit consists of a NAL unit header followed by the NAL unit payload. There are two conceptual classes of NAL units. Video coding layer (VCL) NAL units containing coded sample data, e.g., coded slice NAL units, whereas non-VCL NAL units that contain metadata typically belonging to more than one coded picture, or where the association with a single coded picture would be meaningless, such as parameter set NAL units, or where the information is not needed by the decoding process, such as SEI NAL units. [0048] In VVC, a two-byte NAL unit header was introduced with the anticipation that this design is sufficient to support future extensions. The syntax and the associated semantic of the NAL unit header in the current VVC draft specification are illustrated in Table 1 and Table 2, respectively. How to read Table 1 is illustrated in the appendix section of this invention, which could also be found in the VVC specification. Table 1. NAL unit header syntax
Figure imgf000012_0001
Table 2. NAL unit header semantics
Figure imgf000012_0003
Table 3. NAL unit type codes and NAL unit type classes
Figure imgf000012_0002
Figure imgf000013_0001
Figure imgf000014_0002
[0049] VVC inherits the parameter set concept of HEVC with a few modifications and additions. Parameter sets can be either part of the video bitstream or can be received by a decoder through other means (including out-of-band transmission using a reliable channel, hard coding in encoder and decoder, and so on). A parameter set contains an identification, which is referenced, directly or indirectly, from the slice header, as discussed in more detail later. The referencing process is known as “activation.” Depending on the parameter set type, the activation occurs per picture or per sequence. The concept of activation through referencing was introduced, among other reasons, because implicit activation by virtue of the position of the information in the bitstream (as common for other syntax elements of a video codec) is not available in case of out-of-band transmission. [0050] The video parameter set (VPS) was introduced to convey information that is applicable to multiple layers as well as sub-layers. The VPS was introduced to address these shortcomings as well as to enable a clean and extensible high-level design of multilayer codecs. Each layer of a given video sequence, regardless of whether they have the same or different sequence parameter sets (SPS), refers to the same VPS. The syntax and the associated semantic of the video parameter set in the current VVC draft specification are illustrated in Table 4 and Table 5, respectively. How to read Table 4 is illustrated in the appendix section of this invention, which could also be found in the VVC specification. Table 4. Video parameter set RBSP syntax
Figure imgf000014_0001
Figure imgf000015_0001
Figure imgf000016_0001
for( i = 1; i < TotalNumOlss; i++ ) if( NumLayersIn01s[ i ] > 1 ) ols hrd idx[ i ] ue(v)
} vps extension flag u(l) if( vps extension flag ) while( more rbsp data( ) ) vps extension data flag u(l) rbsp trailing bits( )
Figure imgf000017_0001
}
Table 5. Video parameter set RBSP semantics
A VPS RBSP shall be available to the decoding process prior to it being referenced, included in at least one AU with Temporalld equal to 0 or provided through external means.
All VPS NAL units with a particular value of vps_video_parameter_set_id in a CVS shall have the same content. vps video parameter set id provides an identifier for the VPS for reference by other syntax elements. The value of vps_video_parameter_set_id shall be greater than 0. vps_max_layers_minusl plus 1 specifies the maximum allowed number of layers in each CV S referring to the VPS. vps_max_sublayers_minusl plus 1 specifies the maximum number of temporal sublayers that may be present in a layer in each CVS referring to the VPS. The value of vps max sublayers minusl shall be in the range of 0 to 6, inclusive. vps_all_layers_same_num_sublayers_flag equal to 1 specifies that the number of temporal sublayers is the same for all the layers in each CVS referring to the VPS. vps_all_layers_same_num_sublayers_flag equal to 0 specifies that the layers in each CVS referring to the VPS may or may not have the same number of temporal sublayers. When not present, the value of vps_all_layers_same_num_sublayers_flag is inferred to be equal to 1. vps_all_independent_layers_flag equal to 1 specifies that all layers in the CVS are independently coded without using inter-layer prediction, vps all independent layers flag equal to 0 specifies that one or more of the layers in the CVS may use inter-layer prediction. When not present, the value of vps all independent layers flag is inferred to be equal to 1. vps_layer_id[ i ] specifies the nuh layer id value of the i-th layer. For any two non-negative integer values of m and n, when m is less than n, the value of vps_layer_id[ m ] shall be less than vps_layer_id[ n ]. vps_independent_layer_flag[ i ] equal to 1 specifies that the layer with index i does not use inter-layer prediction. vps_independent_layer_flag[ i ] equal to 0 specifies that the layer with index i may use inter-layer prediction and the syntax elements vps_direct_ref_lay er_flag[ i ] [ j ] for j in the range of 0 to i - 1 , inclusive, are present in VPS . When not present, the value of vps_independent_layer_flag[ i ] is inferred to be equal to 1. vps_direct_ref_layer_flag[ i ][ j ] equal to 0 specifies that the layer with index j is not a direct reference layer for the layer with index i. vps direct ref layer flag [ i ] [ j ] equal to 1 specifies that the layer with index j is a direct reference layer for the layer with index i. When vps_direct_ref_layer_flag[ i ][ j ] is not present for i and j in the range of 0 to vps_max_layers_minus1, inclusive, it is inferred to be equal to 0. When vps_independent_layer_flag[ i ] is equal to 0, there shall be at least one value of j in the range of 0 to i − 1, inclusive, such that the value of vps_direct_ref_layer_flag[ i ][ j ] is equal to 1. The variables NumDirectRefLayers[ i ], DirectRefLayerIdx[ i ][ d ], NumRefLayers[ i ], RefLayerIdx[ i ][ r ], and LayerUsedAsRefLayerFlag[ j ] are derived as follows: for( i = 0; i <= vps_max_layers_minus1; i++ ) { for( j = 0; j <= vps_max_layers_minus1; j++ ) { dependencyFlag[ i ][ j ] = vps_direct_ref_layer_flag[ i ][ j ] for( k = 0; k < i; k++ ) if( vps_direct_ref_layer_flag[ i ][ k ] && dependencyFlag[ k ][ j ] ) dependencyFlag[ i ][ j ] = 1 } LayerUsedAsRefLayerFlag[ i ] = 0 } for( i = 0; i <= vps_max_layers_minus1; i++ ) { for( j = 0, d = 0, r = 0; j <= vps_max_layers_minus1; j++ ) { (37) if( vps_direct_ref_layer_flag[ i ][ j ] ) { DirectRefLayerIdx[ i ][ d++ ] = j LayerUsedAsRefLayerFlag[ j ] = 1 } if( dependencyFlag[ i ][ j ] ) RefLayerIdx[ i ][ r++ ] = j } NumDirectRefLayers[ i ] = d NumRefLayers[ i ] = r } The variable GeneralLayerIdx[ i ], specifying the layer index of the layer with nuh_layer_id equal to vps_layer_id[ i ], is derived as follows: for( i = 0; i <= vps_max_layers_minus1; i++ ) (38) GeneralLayerIdx[ vps_layer_id[ i ] ] = i For any two different values of i and j, both in the range of 0 to vps_max_layers_minus1, inclusive, when dependencyFlag[ i ][ j ] equal to 1, it is a requirement of bitstream conformance that the values of chroma_format_idc and bit_depth_minus8 that apply to the i-th layer shall be equal to the values of chroma_format_idc and bit_depth_minus8, respectively, that apply to the j-th layer. max_tid_ref_present_flag[ i ] equal to 1 specifies that the syntax element max_tid_il_ref_pics_plus1[ i ] is present. max_tid_ref_present_flag[ i ] equal to 0 specifies that the syntax element max_tid_il_ref_pics_plus1[ i ] is not present. max_tid_il_ref_pics_plus1[ i ] equal to 0 specifies that inter-layer prediction is not used by non-IRAP pictures of the i-th layer. max_tid_il_ref_pics_plus1[ i ] greater than 0 specifies that, for decoding pictures of the i-th layer, no picture with TemporalId greater than max_tid_il_ref_pics_plus1[ i ] − 1 is used as ILRP. When not present, the value of max tid il ref pics plus1[ i ] is inferred to be equal to 7. each_layer_is_an_ols_flag equal to 1 specifies that each OLS contains only one layer and each layer itself in a CVS referring to the VPS is an OLS with the single included layer being the only output layer. each_layer_is_an_ols_flag equal to 0 that an OLS may contain more than one layer. If vps_max_layers_minus1 is equal to 0, the value of each_layer_is_an_ols_flag is inferred to be equal to 1. Otherwise, when vps_all_independent_layers_flag is equal to 0, the value of each_layer_is_an_ols_flag is inferred to be equal to 0. ols_mode_idc equal to 0 specifies that the total number of OLSs specified by the VPS is equal to vps_max_layers_minus1 + 1, the i-th OLS includes the layers with layer indices from 0 to i, inclusive, and for each OLS only the highest layer in the OLS is output. ols_mode_idc equal to 1 specifies that the total number of OLSs specified by the VPS is equal to vps_max_layers_minus1 + 1, the i-th OLS includes the layers with layer indices from 0 to i, inclusive, and for each OLS all layers in the OLS are output. ols_mode_idc equal to 2 specifies that the total number of OLSs specified by the VPS is explicitly signalled and for each OLS the output layers are explicitly signalled and other layers are the layers that are direct or indirect reference layers of the output layers of the OLS. The value of ols_mode_idc shall be in the range of 0 to 2, inclusive. The value 3 of ols_mode_idc is reserved for future use by ITU-T | ISO/IEC. When vps_all_independent_layers_flag is equal to 1 and each_layer_is_an_ols_flag is equal to 0, the value of ols_mode_idc is inferred to be equal to 2. num_output_layer_sets_minus1 plus 1 specifies the total number of OLSs specified by the VPS when ols_mode_idc is equal to 2. The variable TotalNumOlss, specifying the total number of OLSs specified by the VPS, is derived as follows: if( vps_max_layers_minus1 = = 0 ) TotalNumOlss = 1 else if( each_layer_is_an_ols_flag | | ols_mode_idc = = 0 | | ols_mode_idc = = 1 ) TotalNumOlss = vps_max_layers_minus1 + 1 (39) else if( ols_mode_idc = = 2 ) TotalNumOlss = num_output_layer_sets_minus1 + 1 ols_output_layer_flag[ i ][ j ] equal to 1 specifies that the layer with nuh_layer_id equal to vps_layer_id[ j ] is an output layer of the i-th OLS when ols_mode_idc is equal to 2. ols_output_layer_flag[ i ][ j ] equal to 0 specifies that the layer with nuh_layer_id equal to vps_layer_id[ j ] is not an output layer of the i-th OLS when ols_mode_idc is equal to 2. The variable NumOutputLayersInOls[ i ], specifying the number of output layers in the i-th OLS, the variable NumSubLayersInLayerInOLS[ i ][ j ], specifying the number of sublayers in the j-th layer in the i-th OLS, the variable OutputLayerIdInOls[ i ][ j ], specifying the nuh_layer_id value of the j-th output layer in the i-th OLS, and the variable LayerUsedAsOutputLayerFlag[ k ], specifying whether the k-th layer is used as an output layer in at least one OLS, are derived as follows: NumOutputLayersInOls[ 0 ] = 1 OutputLayerIdInOls[ 0 ][ 0 ] = vps_layer_id[ 0 ] NumSubLayersInLayerInOLS[ 0 ][ 0 ] = vps_max_sub_layers_minus1 + 1 LayerUsedAsOutputLayerFlag[ 0 ] = 1 for( i = 1, i <= vps_max_layers_minus1; i++ ) { if( each_layer_is_an_ols_flag | | ols_mode_idc < 2 ) LayerUsedAsOutputLayerFlag[ i ] = 1 else /*( !each_layer_is_an_ols_flag && ols_mode_idc = = 2 ) */ LayerUsedAsOutputLayerFlag[ i ] = 0 } for( i = 1; i < TotalNumOlss; i++ ) if( each_layer_is_an_ols_flag | | ols_mode_idc = = 0 ) { NumOutputLayersInOls[ i ] = 1 OutputLayerIdInOls[ i ][ 0 ] = vps_layer_id[ i ] for( j = 0; j < i && ( ols_mode_idc = = 0 ); j++ ) NumSubLayersInLayerInOLS[ i ][ j ] = max_tid_il_ref_pics_plus1[ i ] NumSubLayersInLayerInOLS[ i ][ i ] = vps_max_sub_layers_minus1 + 1 } else if( ols_mode_idc = = 1 ) { NumOutputLayersInOls[ i ] = i + 1 for( j = 0; j < NumOutputLayersInOls[ i ]; j++ ) { OutputLayerIdInOls[ i ][ j ] = vps_layer_id[ j ] NumSubLayersInLayerInOLS[ i ][ j ] = vps_max_sub_layers_minus1 + 1 } } else if( ols_mode_idc = = 2 ) { for( j = 0; j <= vps_max_layers_minus1; j++ ) { layerIncludedInOlsFlag[ i ][ j ] = 0 NumSubLayersInLayerInOLS[ i ][ j ] = 0 } for( k = 0, j = 0; k <= vps_max_layers_minus1; k++ ) (40) if( ols_output_layer_flag[ i ][ k ] ) { layerIncludedInOlsFlag[ i ][ k ] = 1 LayerUsedAsOutputLayerFlag[ k ] = 1 OutputLayerIdx[ i ][ j ] = k OutputLayerIdInOls[ i ][ j++ ] = vps_layer_id[ k ] NumSubLayersInLayerInOLS[ i ][ j ] = vps_max_sub_layers_minus1 + 1 } NumOutputLayersInOls[ i ] = j for( j = 0; j < NumOutputLayersInOls[ i ]; j++ ) { idx = OutputLayerIdx[ i ][ j ] for( k = 0; k < NumRefLayers[ idx ]; k++ ) { layerIncludedInOlsFlag[ i ][ RefLayerIdx[ idx ][ k ] ] = 1 if( NumSubLayersInLayerInOLS[ i ][ RefLayerIdx[ idx ][ k ] ] < max_tid_il_ref_pics_plus1[ OutputLayerIdInOls[ i ][ j ] ] ) NumSubLayersInLayerInOLS[ i ][ RefLayerIdx[ idx ][ k ] ] = max_tid_il_ref_pics_plus1[ OutputLayerIdInOls[ i ][ j ] ] } } } For each value of i in the range of 0 to vps_max_layers_minus1, inclusive, the values of LayerUsedAsRefLayerFlag[ i ] and LayerUsedAsOutputLayerFlag[ i ] shall not be both equal to 0. In other words, there shall be no layer that is neither an output layer of at least one OLS nor a direct reference layer of any other layer. For each OLS, there shall be at least one layer that is an output layer. In other words, for any value of i in the range of 0 to TotalNumOlss − 1, inclusive, the value of NumOutputLayersInOls[ i ] shall be greater than or equal to 1. The variable NumLayersInOls[ i ], specifying the number of layers in the i-th OLS, and the variable LayerIdInOls[ i ][ j ], specifying the nuh_layer_id value of the j-th layer in the i-th OLS, are derived as follows: NumLayersInOls[ 0 ] = 1 LayerIdInOls[ 0 ][ 0 ] = vps_layer_id[ 0 ] for( i = 1; i < TotalNumOlss; i++ ) { if( each_layer_is_an_ols_flag ) { NumLayersInOls[ i ] = 1 LayerIdInOls[ i ][ 0 ] = vps_layer_id[ i ] (41) } else if( ols_mode_idc = = 0 | | ols_mode_idc = = 1 ) { NumLayersInOls[ i ] = i + 1 for( j = 0; j < NumLayersInOls[ i ]; j++ ) LayerIdInOls[ i ][ j ] = vps_layer_id[ j ] } else if( ols_mode_idc = = 2 ) { for( k = 0, j = 0; k <= vps_max_layers_minus1; k++ ) if( layerIncludedInOlsFlag[ i ][ k ] ) LayerIdInOls[ i ][ j++ ] = vps_layer_id[ k ] NumLayersInOls[ i ] = j } } NOTE 1 – The 0-th OLS contains only the lowest layer (i.e., the layer with nuh_layer_id equal to vps_layer_id[ 0 ]) and for the 0-th OLS the only included layer is output. The variable OlsLayerIdx[ i ][ j ], specifying the OLS layer index of the layer with nuh_layer_id equal to LayerIdInOls[ i ][ j ], is derived as follows: for( i = 0; i < TotalNumOlss; i++ ) for j = 0; j < NumLayersInOls[ i ]; j++ ) (42) OlsLayerIdx[ i ][ LayerIdInOls[ i ][ j ] ] = j The lowest layer in each OLS shall be an independent layer. In other words, for each i in the range of 0 to TotalNumOlss − 1, inclusive, the value of vps_independent_layer_flag[ GeneralLayerIdx[ LayerIdInOls[ i ][ 0 ] ] ] shall be equal to 1. Each layer shall be included in at least one OLS specified by the VPS. In other words, for each layer with a particular value of nuh_layer_id nuhLayerId equal to one of vps_layer_id[ k ] for k in the range of 0 to vps_max_layers_minus1, inclusive, there shall be at least one pair of values of i and j, where i is in the range of 0 to TotalNumOlss − 1, inclusive, and j is in the range of NumLayersInOls[ i ] − 1, inclusive, such that the value of LayerIdInOls[ i ][ j ] is equal to nuhLayerId. vps_num_ptls_minus1 plus 1 specifies the number of profile_tier_level( ) syntax structures in the VPS. The value of vps_num_ptls_minus1 shall be less than TotalNumOlss. pt_present_flag[ i ] equal to 1 specifies that profile, tier, and general constraints information are present in the i-th profile_tier_level( ) syntax structure in the VPS. pt_present_flag[ i ] equal to 0 specifies that profile, tier, and general constraints information are not present in the i-th profile_tier_level( ) syntax structure in the VPS. The value of pt_present_flag[ 0 ] is inferred to be equal to 1. When pt_present_flag[ i ] is equal to 0, the profile, tier, and general constraints information for the i-th profile_tier_level( ) syntax structure in the VPS are inferred to be the same as that for the ( i − 1 )-th profile_tier_level( ) syntax structure in the VPS. ptl_max_temporal_id[ i ] specifies the TemporalId of the highest sublayer representation for which the level information is present in the i-th profile_tier_level( ) syntax structure in the VPS. The value of ptl_max_temporal_id[ i ] shall be in the range of 0 to vps_max_sublayers_minus1, inclusive. When vps_max_sublayers_minus1 is equal to 0, the value of ptl_max_temporal_id[ i ] is inferred to be equal to 0. When vps_max_sublayers_minus1 is greater than 0 and vps_all_layers_same_num_sublayers_flag is equal to 1, the value of ptl_max_temporal_id[ i ] is inferred to be equal to vps_max_sublayers_minus1. vps_ptl_alignment_zero_bit shall be equal to 0. ols_ptl_idx[ i ] specifies the index, to the list of profile_tier_level( ) syntax structures in the VPS, of the profile_tier_level( ) syntax structure that applies to the i-th OLS. When present, the value of ols_ptl_idx[ i ] shall be in the range of 0 to vps_num_ptls_minus1, inclusive. When vps_num_ptls_minus1 is equal to 0, the value of ols_ptl_idx[ i ] is inferred to be equal to 0. When NumLayersInOls[ i ] is equal to 1, the profile_tier_level( ) syntax structure that applies to the i-th OLS is also present in the SPS referred to by the layer in the i-th OLS. It is a requirement of bitstream conformance that, when NumLayersInOls[ i ] is equal to 1, the profile_tier_level( ) syntax structures signalled in the VPS and in the SPS for the i-th OLS shall be identical. vps_num_dpb_params specifies the number of dpb_parameters( ) syntax strutcures in the VPS. The value of vps_num_dpb_params shall be in the range of 0 to 16, inclusive. When not present, the value of vps_num_dpb_params is inferred to be equal to 0. vps_sublayer_dpb_params_present_flag is used to control the presence of max_dec_pic_buffering_minus1[ ], max_num_reorder_pics[ ], and max_latency_increase_plus1[ ] syntax elements in the dpb_parameters( ) syntax strucures in the VPS. When not present, vps_sub_dpb_params_info_present_flag is inferred to be equal to 0. dpb_max_temporal_id[ i ] specifies the TemporalId of the highest sublayer representation for which the DPB parameters may be present in the i-th dpb_parameters( ) syntax strutcure in the VPS. The value of dpb_max_temporal_id[ i ] shall be in the range of 0 to vps_max_sublayers_minus1, inclusive. When vps_max_sublayers_minus1 is equal to 0, the value of dpb_max_temporal_id[ i ] is inferred to be equal to 0. When vps max sublayers minus1 is greater than 0 and vps all layers same num sublayers flag is equal to 1, the value of dpb_max_temporal_id[ i ] is inferred to be equal to vps_max_sublayers_minus1. ols_dpb_pic_width[ i ] specifies the width, in units of luma samples, of each picture storage buffer for the i-th OLS. ols_dpb_pic_height[ i ] specifies the height, in units of luma samples, of each picture storage buffer for the i-th OLS. ols_dpb_params_idx[ i ] specifies the index, to the list of dpb_parameters( ) syntax structures in the VPS, of the dpb_parameters( ) syntax structure that applies to the i-th OLS when NumLayersInOls[ i ] is greater than 1. When present, the value of ols_dpb_params_idx[ i ] shall be in the range of 0 to vps_num_dpb_params − 1, inclusive. When ols_dpb_params_idx[ i ] is not present, the value of ols_dpb_params_idx[ i ] is inferred to be equal to 0. When NumLayersInOls[ i ] is equal to 1, the dpb_parameters( ) syntax structure that applies to the i-th OLS is present in the SPS referred to by the layer in the i-th OLS. vps_general_hrd_params_present_flag equal to 1 specifies that the syntax structure general_hrd_parameters( ) and other HRD parameters are present in the VPS RBSP syntax structure. vps_general_hrd_params_present_flag equal to 0 specifies that the syntax structure general_hrd_parameters( ) and other HRD parameters are not present in the VPS RBSP syntax structure. When not present, the value of vps_general_hrd_params_present_flag is inferred to be equal to 0. When NumLayersInOls[ i ] is equal to 1, the general_hrd_parameters( ) syntax structure that applies to the i-th OLS is present in the SPS referred to by the layer in the i-th OLS. vps_sublayer_cpb_params_present_flag equal to 1 specifies that the i-th ols_hrd_parameters( ) syntax structure in the VPS contains HRD parameters for the sublayer representations with TemporalId in the range of 0 to hrd_max_tid[ i ], inclusive. vps_sublayer_cpb_params_present_flag equal to 0 specifies that the i-th ols_hrd_parameters( ) syntax structure in the VPS contains HRD parameters for the sublayer representation with TemporalId equal to hrd_max_tid[ i ] only. When vps_max_sublayers_minus1 is equal to 0, the value of vps_sublayer_cpb_params_present_flag is inferred to be equal to 0. When vps_sublayer_cpb_params_present_flag is equal to 0, the HRD parameters for the sublayer representations with TemporalId in the range of 0 to hrd_max_tid[ i ] − 1, inclusive, are inferred to be the same as that for the sublayer representation with TemporalId equal to hrd_max_tid[ i ]. These include the HRD parameters starting from the fixed_pic_rate_general_flag[ i ] syntax element till the sublayer_hrd_parameters( i ) syntax structure immediately under the condition "if( general_vcl_hrd_params_present_flag )" in the ols_hrd_parameters syntax structure. num_ols_hrd_params_minus1 plus 1 specifies the number of ols_hrd_parameters( ) syntax structures present in the general_hrd_parameters( ) syntax structure when vps_general_hrd_params_present_flag is equal to 1. The value of num_ols_hrd_params_minus1 shall be in the range of 0 to TotalNumOlss − 1, inclusive. hrd_max_tid[ i ] specifies the TemporalId of the highest sublayer representation for which the HRD parameters are contained in the i-th ols_hrd_parameters( ) syntax structure. The value of hrd max tid[ i ] shall be in the range of 0 to vps max sublayers minus1, inclusive. When vps_max_sublayers_minus1 is equal to 0, the value of hrd_max_tid[ i ] is inferred to be equal to 0. When vps_max_sublayers_minus1 is greater than 0 and vps_all_layers_same_num_sublayers_flag is equal to 1, the value of hrd_max_tid[ i ] is inferred to be equal to vps_max_sublayers_minus1. ols_hrd_idx[ i ] specifies the index, to the list of ols_hrd_parameters( ) syntax structures in the VPS, of the ols_hrd_parameters( ) syntax structure that applies to the i-th OLS when NumLayersInOls[ i ] is greater than 1. The value of ols_hrd_idx[[ i ] shall be in the range of 0 to num_ols_hrd_params_minus1, inclusive. When NumLayersInOls[ i ] is equal to 1, the ols_hrd_parameters( ) syntax structure that applies to the i-th OLS is present in the SPS referred to by the layer in the i-th OLS. If the value of num_ols_hrd_param_minus1 + 1 is equal to TotalNumOlss, the value of ols_hrd_idx[ i ] is inferred to be equal to i. Otherwise, when NumLayersInOls[ i ] is greater than 1 and num_ols_hrd_params_minus1 is equal to 0, the value of ols_hrd_idx[[ i ] is inferred to be equal to 0. vps_extension_flag equal to 0 specifies that no vps_extension_data_flag syntax elements are present in the VPS RBSP syntax structure. vps_extension_flag equal to 1 specifies that there are vps_extension_data_flag syntax elements present in the VPS RBSP syntax structure. vps_extension_data_flag may have any value. Its presence and value do not affect decoder conformance to profiles specified in this version of this Specification. Decoders conforming to this version of this Specification shall ignore all vps_extension_data_flag syntax elements. [0051] In VVC, SPSs contain information which applies to all slices of a coded video sequence. A coded video sequence starts from an instantaneous decoding refresh (IDR) picture, or a BLA picture, or a CRA picture that is the first picture in the bitstream and includes all subsequent pictures that are not an IDR or BLA picture. A bitstream consists of one or more coded video sequences. The content of the SPS can be roughly subdivided into six categories: 1) a self-reference (its own ID); 2) decoder operation point related information (profile, level, picture size, number sub-layers, and so on); 3) enabling flags for certain tools within a profile, and associated coding tool parameters in case the tool is enabled; 4) information restricting the flexibility of structures and transform coefficient coding; 5) temporal scalability control; and 6) visual usability information (VUI), which includes HRD information. The syntax and the associated semantic of the sequence parameter set in the current VVC draft specification are illustrated in Table 6 and Table 7, respectively. How to read Table 6 is illustrated in the appendix section of this invention, which could also be found in the VVC specification. Table 6. Sequence parameter set RBSP syntax
Figure imgf000024_0001
Figure imgf000025_0001
Figure imgf000026_0001
Figure imgf000027_0001
Figure imgf000028_0001
Figure imgf000029_0001
Figure imgf000030_0001
Table 7. Sequence parameter set RBSP semantics An SPS RBSP shall be available to the decoding process prior to it being referenced, included in at least one AU with TemporalId equal to 0 or provided through external means. All SPS NAL units with a particular value of sps_seq_parameter_set_id in a CVS shall have the same content. sps_seq_parameter_set_id provides an identifier for the SPS for reference by other syntax elements. SPS NAL units, regardless of the nuh_layer_id values, share the same value space of sps_seq_parameter_set_id. Let spsLayerId be the value of the nuh_layer_id of a particular SPS NAL unit, and vclLayerId be the value of the nuh_layer_id of a particular VCL NAL unit. The particular VCL NAL unit shall not refer to the particular SPS NAL unit unless spsLayerId is less than or equal to vclLayerId and the layer with nuh_layer_id equal to spsLayerId is included in at least one OLS that includes the layer with nuh_layer_id equal to vclLayerId. sps_video_parameter_set_id, when greater than 0, specifies the value of vps_video_parameter_set_id for the VPS referred to by the SPS. When sps_video_parameter_set_id is equal to 0, the following applies: – The SPS does not refer to a VPS. – No VPS is referred to when decoding each CLVS referring to the SPS. – The value of vps_max_layers_minus1 is inferred to be equal to 0. – The CVS shall contain only one layer (i.e., all VCL NAL unit in the CVS shall have the same value of nuh_layer_id). – The value of GeneralLayerIdx[ nuh_layer_id ] is inferred to be equal to 0. – The value of vps_independent_layer_flag[ GeneralLayerIdx[ nuh_layer_id ] ] is inferred to be equal to 1. When vps_independent_layer_flag[ GeneralLayerIdx[ nuh_layer_id ] ] is equal to 1, the SPS referred to by a CLVS with a particluar nuh_layer_id value nuhLayerId shall have nuh_layer_id equal to nuhLayerId. The value of sps_video_parameter_set_id shall be the same in all SPSs that are referred to by CLVSs in a CVS. sps_max_sublayers_minus1 plus 1 specifies the maximum number of temporal sublayers that may be present in each CLVS referring to the SPS. The value of sps_max_sublayers_minus1 shall be in the range of 0 to vps_max_sublayers_minus1, inclusive. sps_reserved_zero_4bits shall be equal to 0 in bitstreams conforming to this version of this Specification. Other values for sps_reserved_zero_4bits are reserved for future use by ITU- T | ISO/IEC. sps_ptl_dpb_hrd_params_present_flag equal to 1 specifies that a profile_tier_level( ) syntax structure and a dpb_parameters( ) syntax structure are present in the SPS, and a general_hrd_parameters( ) syntax structure and an ols_hrd_parameters( ) syntax structure may also be present in the SPS. sps_ptl_dpb_hrd_params_present_flag equal to 0 specifies that none of these four syntax structures is present in the SPS. The value of sps_ptl_dpb_hrd_params_present_flag shall be equal to vps_independent_layer_flag[ GeneralLayerIdx[ nuh_layer_id ] ]. gdr_enabled_flag equal to 1 specifies that GDR pictures may be present in CLVSs referring to the SPS. gdr_enabled_flag equal to 0 specifies that GDR pictures are not present in CLVSs referring to the SPS. chroma_format_idc specifies the chroma sampling relative to the luma sampling as specified in clause 6.2. separate_colour_plane_flag equal to 1 specifies that the three colour components of the 4:4:4 chroma format are coded separately. separate_colour_plane_flag equal to 0 specifies that the colour components are not coded separately. When separate_colour_plane_flag is not present, it is inferred to be equal to 0. When separate_colour_plane_flag is equal to 1, the coded picture consists of three separate components, each of which consists of coded samples of one colour plane (Y, Cb, or Cr) and uses the monochrome coding syntax. In this case, each colour plane is associated with a specific colour_plane_id value. NOTE 1 – There is no dependency in decoding processes between the colour planes having different colour_plane_id values. For example, the decoding process of a monochrome picture with one value of colour_plane_id does not use any data from monochrome pictures having different values of colour_plane_id for inter prediction. Depending on the value of separate_colour_plane_flag, the value of the variable ChromaArrayType is assigned as follows: – If separate_colour_plane_flag is equal to 0, ChromaArrayType is set equal to chroma_format_idc. – Otherwise (separate_colour_plane_flag is equal to 1), ChromaArrayType is set equal to 0. res_change_in_clvs_allowed_flag equal to 1 specifies that the picture spatial resolution may change within a CLVS referring to the SPS. res_change_in_clvs_allowed_flag equal to 0 specifies that the picture spatial resolution does not change within any CLVS referring to the SPS. pic_width_max_in_luma_samples specifies the maximum width, in units of luma samples, of each decoded picture referring to the SPS. pic_width_max_in_luma_samples shall not be equal to 0 and shall be an integer multiple of Max( 8, MinCbSizeY ). It is a requirement of bitstream conformance that, for any OLS with OLS index i that contains one or more layers that refers to the SPS, the value of pic_width_max_in_luma_samples shall be less than or equal to the value of ols_dpb_pic_width[ i ]. pic_height_max_in_luma_samples specifies the maximum height, in units of luma samples, of each decoded picture referring to the SPS. pic_height_max_in_luma_samples shall not be equal to 0 and shall be an integer multiple of Max( 8, MinCbSizeY ). It is a requirement of bitstream conformance that, for any OLS with OLS index i that contains one or more layers that refers to the SPS, the value of pic_height_max_in_luma_samples shall be less than or equal to the value of ols_dpb_pic_height[ i ]. sps_conformance_window_flag equal to 1 indicates that the conformance cropping window offset parameters follow next in the SPS. sps_conformance_window_flag equal to 0 indicates that the conformance cropping window offset parameters are not present in the SPS. sps_conf_win_left_offset, sps_conf_win_right_offset, sps_conf_win_top_offset, and sps_conf_win_bottom_offset specify the cropping window that is applied to pictures with pic_width_in_luma_samples equal to pic_width_max_in_luma_samples and pic_height_in_luma_samples equal to pic_height_max_in_luma_samples. When sps_conformance_window_flag is equal to 0, the values of sps_conf_win_left_offset, sps_conf_win_right_offset, sps_conf_win_top_offset, and sps_conf_win_bottom_offset are inferred to be equal to 0. The conformance cropping window contains the luma samples with horizontal picture coordinates from SubWidthC * sps_conf_win_left_offset to pic_width_max_in_luma_samples − ( SubWidthC * sps_conf_win_right_offset + 1 ) and vertical picture coordinates from SubHeightC * sps_conf_win_top_offset to pic_height_max_in_luma_samples − ( SubHeightC * sps_conf_win_bottom_offset + 1 ), inclusive. The value of SubWidthC * ( sps_conf_win_left_offset + sps_conf_win_right_offset ) shall be less than pic_width_max_in_luma_samples, and the value of SubHeightC * ( sps_conf_win_top_offset + sps_conf_win_bottom_offset ) shall be less than pic_height_max_in_luma_samples. When ChromaArrayType is not equal to 0, the corresponding specified samples of the two chroma arrays are the samples having picture coordinates ( x / SubWidthC, y / SubHeightC ), where ( x, y ) are the picture coordinates of the specified luma samples. NOTE 2 – The conformance cropping window offset parameters are only applied at the output. All internal decoding processes are applied to the uncropped picture size. sps_log2_ctu_size_minus5 plus 5 specifies the luma coding tree block size of each CTU. The value of sps_log2_ctu_size_minus5 shall be in the range of 0 to 2, inclusive. The value 3 for sps_log2_ctu_size_minus5 is reserved for future use by ITU-T | ISO/IEC. The variables CtbLog2SizeY and CtbSizeY are derived as follows: CtbLog2SizeY = sps_log2_ctu_size_minus5 + 5 (43) CtbSizeY = 1 << CtbLog2SizeY (44) subpic_info_present_flag equal to 1 specifies that subpicture information is present for the CLVS and there may be one or more than one subpicture in each picture of the CLVS. subpic_info_present_flag equal to 0 specifies that subpicture information is not present for the CLVS and there is only one subpicture in each picture of the CLVS. When res_change_in_clvs_allowed_flag is equal to 1, the value of subpic_info_present_flag shall be equal to 0. NOTE 3 – When a bitstream is the result of a sub-bitstream extraction process and contains only a subset of the subpictures of the input bitstream to the sub-bitstream extraction process, it might be required to set the value of subpic_info_present_flag equal to 1 in the RBSP of the SPSs. sps_num_subpics_minus1 plus 1 specifies the number of subpictures in each picture in the CLVS. The value of sps_num_subpics_minus1 shall be in the range of 0 to Ceil( pic_width_max_in_luma_samples ÷ CtbSizeY ) * Ceil( pic_height_max_in_luma_samples ÷ CtbSizeY ) − 1, inclusive. When not present, the value of sps_num_subpics_minus1 is inferred to be equal to 0. sps_independent_subpics_flag equal to 1 specifies that no intra prediction, no inter prediction and no in-loop filtering operations may be performed across any subpicture boundary in the CLVS. sps_independent_subpics_flag equal to 0 specifies that inter prediction or in-loop filtering operations across the subpicture boundaries in the CLVS may be allowed. When not present, the value of sps_independent_subpics_flag is inferred to be equal to 0. subpic_ctu_top_left_x[ i ] specifies horizontal position of top left CTU of i-th subpicture in unit of CtbSizeY. The length of the syntax element is Ceil( Log2( ( pic_width_max_in_luma_samples + CtbSizeY − 1 ) >> CtbLog2SizeY ) ) bits. When not present, the value of subpic_ctu_top_left_x[ i ] is inferred to be equal to 0. subpic_ctu_top_left_y[ i ] specifies vertical position of top left CTU of i-th subpicture in unit of CtbSizeY. The length of the syntax element is Ceil( Log2( ( pic_height_max_in_luma_samples + CtbSizeY − 1 ) >> CtbLog2SizeY ) ) bits. When not present, the value of subpic_ctu_top_left_y[ i ] is inferred to be equal to 0. subpic_width_minus1[ i ] plus 1 specifies the width of the i-th subpicture in units of CtbSizeY. The length of the syntax element is Ceil( Log2( ( pic_width_max_in_luma_samples + CtbSizeY − 1 ) >> CtbLog2SizeY ) ) bits. When not present, the value of subpic_width_minus1[ i ] is inferred to be equal to ( ( pic_width_max_in_luma_samples + CtbSizeY− 1 ) >> CtbLog2SizeY ) − subpic_ctu_t op_left_x[ i ] − 1. subpic_height_minus1[ i ] plus 1 specifies the height of the i-th subpicture in units of CtbSizeY. The length of the syntax element is Ceil( Log2( ( pic_height_max_in_luma_samples + CtbSizeY − 1 ) >> CtbLog2SizeY ) ) bits. When not present, the value of subpic_height_minus1[ i ] is inferred to be equal to ( ( pic_height_max_in_luma_samples + CtbSizeY− 1 ) >> CtbLog2SizeY ) − subpic_ctu_ top_left_y[ i ] − 1. subpic_treated_as_pic_flag[ i ] equal to 1 specifies that the i-th subpicture of each coded picture in the CLVS is treated as a picture in the decoding process excluding in-loop filtering operations. subpic_treated_as_pic_flag[ i ] equal to 0 specifies that the i-th subpicture of each coded picture in the CLVS is not treated as a picture in the decoding process excluding in-loop filtering operations. When not present, the value of subpic_treated_as_pic_flag[ i ] is inferred to be equal to sps_independent_subpics_flag. When subpic_treated_as_pic_flag[ i ] is equal to 1, it is a requirement of bitstream conformance that all of the following conditions are true for each output layer and its reference layers in an OLS that includes the layer containing the i-th subpicture as an output layer: – All pictures in the output layer and its reference layers shall have the same value of pic_width_in_luma_samples and the same value of pic_height_in_luma_samples. – All the SPSs referred to by the output layer and its reference layers shall have the same value of sps_num_subpics_minus1 and shall have the same values of subpic_ctu_top_left_x[ j ], subpic_ctu_top_left_y[ j ], subpic_width_minus1[ j ], subpic_height_minus1[ j ], and loop_filter_across_subpic_enabled_flag[ j ], respectively, for each value of j in the range of 0 to sps num subpics minus1, inclusive. – All pictures in each access unit in the output layer and its reference layers shall have the same value of SubpicIdVal[ j ] for each value of j in the range of 0 to sps_num_subpics_minus1, inclusive. loop_filter_across_subpic_enabled_flag[ i ] equal to 1 specifies that in-loop filtering operations may be performed across the boundaries of the i-th subpicture in each coded picture in the CLVS. loop_filter_across_subpic_enabled_flag[ i ] equal to 0 specifies that in- loop filtering operations are not performed across the boundaries of the i-th subpicture in each coded picture in the CLVS. When not present, the value of loop_filter_across_subpic_enabled_pic_flag[ i ] is inferred to be equal to 1 − sps_independent_subpics_flag. It is a requirement of bitstream conformance that the shapes of the subpictures shall be such that each subpicture, when decoded, shall have its entire left boundary and entire top boundary consisting of picture boundaries or consisting of boundaries of previously decoded subpictures. sps_subpic_id_len_minus1 plus 1 specifies the number of bits used to represent the syntax element sps_subpic_id[ i ], the syntax elements pps_subpic_id[ i ], when present, and the syntax element slice_subpic_id, when present. The value of sps_subpic_id_len_minus1 shall be in the range of 0 to 15, inclusive. The value of 1 << ( sps_subpic_id_len_minus1 + 1 ) shall be greater than or equal to sps_num_subpics_minus1 + 1. subpic_id_mapping_explicitly_signalled_flag equal to 1 specifies that the subpicture ID mapping is explicitly signalled, either in the SPS or in the PPSs referred to by coded pictures of the CLVS. subpic_id_mapping_explicitly_signalled_flag equal to 0 specifies that the subpicture ID mapping is not explicitly signalled for the CLVS. When not present, the value of subpic_id_mapping_explicitly_signalled_flag is inferred to be equal to 0. subpic_id_mapping_in_sps_flag equal to 1 specifies that the subpicture ID mapping is signalled in the SPS when subpic_id_mapping_explicitly_signalled_flag is equal to 1. subpic_id_mapping_in_sps_flag equal to 0 specifies that subpicture ID mapping is signalled in the PPSs referred to by coded pictures of the CLVS when subpic_id_mapping_explicitly_signalled_flag is equal to 1. sps_subpic_id[ i ] specifies the subpicture ID of the i-th subpicture. The length of the sps_subpic_id[ i ] syntax element is sps_subpic_id_len_minus1 + 1 bits. bit_depth_minus8 specifies the bit depth of the samples of the luma and chroma arrays, BitDepth, and the value of the luma and chroma quantization parameter range offset, QpBdOffset, as follows: BitDepth = 8 + bit_depth_minus8 (45) QpBdOffset = 6 * bit_depth_minus8 (46) bit_depth_minus8 shall be in the range of 0 to 8, inclusive. sps_entropy_coding_sync_enabled_flag equal to 1 specifies that a specific synchronization process for context variables is invoked before decoding the CTU that includes the first CTB of a row of CTBs in each tile in each picture referring to the SPS, and a specific storage process for context variables is invoked after decoding the CTU that includes the first CTB of a row of CTBs in each tile in each picture referring to the SPS. sps_entropy_coding_sync_enabled_flag equal to 0 specifies that no specific synchronization process for context variables is required to be invoked before decoding the CTU that includes the first CTB of a row of CTBs in each tile in each picture referring to the SPS, and no specific storage process for context variables is required to be invoked after decoding the CTU that includes the first CTB of a row of CTBs in each tile in each picture referring to the SPS. sps_wpp_entry_point_offsets_present_flag equal to 1 specifies that signalling for entry point offsets for CTU rows may be present in the slice headers of pictures referring to the SPS when sps_entropy_coding_sync_enabled_flag is equal to 1. sps_wpp_entry_point_offsets_present_flag equal to 0 specifies that signalling for entry point offsets for CTU rows are not present in the slice headers of pictures referring to the SPS. When not present, the value of sps_wpp_entry_point_offsets_present_flag is inferred to be equal to 0. sps_weighted_pred_flag equal to 1 specifies that weighted prediction may be applied to P slices referring to the SPS. sps_weighted_pred_flag equal to 0 specifies that weighted prediction is not applied to P slices referring to the SPS. sps_weighted_bipred_flag equal to 1 specifies that explicit weighted prediction may be applied to B slices referring to the SPS. sps_weighted_bipred_flag equal to 0 specifies that explicit weighted prediction is not applied to B slices referring to the SPS. log2_max_pic_order_cnt_lsb_minus4 specifies the value of the variable MaxPicOrderCntLsb that is used in the decoding process for picture order count as follows: MaxPicOrderCntLsb = 2( log2_max_pic_order_cnt_lsb_minus4 + 4 ) (47) The value of log2_max_pic_order_cnt_lsb_minus4 shall be in the range of 0 to 12, inclusive. sps_poc_msb_flag equal to 1 specifies that the ph_poc_msb_present_flag syntax element is present in PHs referring to the SPS. sps_poc_msb_flag equal to 0 specifies that the ph_poc_msb_present_flag syntax element is not present in PHs referring to the SPS. poc_msb_len_minus1 plus 1 specifies the length, in bits, of the poc_msb_val syntax elements, when present in the PHs referring to the SPS. The value of poc_msb_len_minus1 shall be in the range of 0 to 32 − log2_max_pic_order_cnt_lsb_minus4 − 5, inclusive. num_extra_ph_bits_bytes specifies the number of bytes of extra bits in the PH syntax structure for coded pictures referring to the SPS. The value of num_extra_ph_bits_bytes shall be equal to 0 in bitstreams conforming to this version of this Specification. Although the value of num_extra_ph_bits_bytes is required to be equal to 0 in this version of this Specification, decoder conforming to this version of this Specification shall allow the value of num_extra_ph_bits_bytes equal to 1 or 2 to appear in the syntax. num_extra_sh_bits_bytes specifies the number of bytes of extra bits in the slice headers for coded pictures referring to the SPS. The value of num_extra_sh_bits_bytes shall be equal to 0 in bitstreams conforming to this version of this Specification. Although the value of num_extra_sh_bits_bytes is required to be equal to 0 in this version of this Specification, decoder conforming to this version of this Specification shall allow the value of num_extra_sh_bits_bytes equal to 1 or 2 to appear in the syntax. sps_sublayer_dpb_params_flag is used to control the presence of max dec pic buffering minus1[ i ], max num reorder pics[ i ], and max_latency_increase_plus1[ i ] syntax elements in the dpb_parameters( ) syntax strucure in the SPS. When not present, the value of sps_sub_dpb_params_info_present_flag is inferred to be equal to 0. long_term_ref_pics_flag equal to 0 specifies that no LTRP is used for inter prediction of any coded picture in the CLVS. long_term_ref_pics_flag equal to 1 specifies that LTRPs may be used for inter prediction of one or more coded pictures in the CLVS. inter_layer_ref_pics_present_flag equal to 0 specifies that no ILRP is used for inter prediction of any coded picture in the CLVS. inter_layer_ref_pics_present_flag equal to 1 specifies that ILRPs may be used for inter prediction of one or more coded pictures in the CLVS. When sps_video_parameter_set_id is equal to 0, the value of inter_layer_ref_pics_present_flag is inferred to be equal to 0. When vps_independent_layer_flag[ GeneralLayerIdx[ nuh_layer_id ] ] is equal to 1, the value of inter_layer_ref_pics_present_flag shall be equal to 0. sps_idr_rpl_present_flag equal to 1 specifies that reference picture list syntax elements are present in slice headers of IDR pictures. sps_idr_rpl_present_flag equal to 0 specifies that reference picture list syntax elements are not present in slice headers of IDR pictures. rpl1_same_as_rpl0_flag equal to 1 specifies that the syntax element num_ref_pic_lists_in_sps[ 1 ] and the syntax structure ref_pic_list_struct( 1, rplsIdx ) are not present and the following applies: – The value of num_ref_pic_lists_in_sps[ 1 ] is inferred to be equal to the value of num_ref_pic_lists_in_sps[ 0 ]. – The value of each of syntax elements in ref_pic_list_struct( 1, rplsIdx ) is inferred to be equal to the value of corresponding syntax element in ref_pic_list_struct( 0, rplsIdx ) for rplsIdx ranging from 0 to num_ref_pic_lists_in_sps[ 0 ] − 1. num_ref_pic_lists_in_sps[ i ] specifies the number of the ref_pic_list_struct( listIdx, rplsIdx ) syntax structures with listIdx equal to i included in the SPS. The value of num_ref_pic_lists_in_sps[ i ] shall be in the range of 0 to 64, inclusive. NOTE 4 – For each value of listIdx (equal to 0 or 1), a decoder should allocate memory for a total number of num_ref_pic_lists_in_sps[ i ] + 1 ref_pic_list_struct( listIdx, rplsIdx ) syntax structures since there may be one ref_pic_list_struct( listIdx, rplsIdx ) syntax structure directly signalled in the slice headers of a current picture. qtbtt_dual_tree_intra_flag equal to 1 specifies that, for I slices, each CTU is split into coding units with 64×64 luma samples using an implicit quadtree split, and these coding units are the root of two separate coding_tree syntax structure for luma and chroma. qtbtt_dual_tree_intra_flag equal to 0 specifies separate coding_tree syntax structure is not used for I slices. When qtbtt_dual_tree_intra_flag is not present, it is inferred to be equal to 0. log2_min_luma_coding_block_size_minus2 plus 2 specifies the minimum luma coding block size. The value range of log2_min_luma_coding_block_size_minus2 shall be in the range of 0 to Min( 4, sps_log2_ctu_size_minus5 + 3 ), inclusive. The variables MinCbLog2SizeY, MinCbSizeY, IbcBufWidthY, IbcBufWidthC and Vsize are derived as follows: MinCbLog2SizeY = log2_min_luma_coding_block_size_minus2 + 2 (48) MinCbSizeY = 1 << MinCbLog2SizeY (49) IbcBufWidthY = 256 * 128 / CtbSizeY (50) IbcBufWidthC = IbcBufWidthY / SubWidthC (51) VSize = Min( 64, CtbSizeY ) (52) The value of MinCbSizeY shall less than or equal to VSize. The variables CtbWidthC and CtbHeightC, which specify the width and height, respectively, of the array for each chroma CTB, are derived as follows: – If chroma_format_idc is equal to 0 (monochrome) or separate_colour_plane_flag is equal to 1, CtbWidthC and CtbHeightC are both equal to 0. – Otherwise, CtbWidthC and CtbHeightC are derived as follows: CtbWidthC = CtbSizeY / SubWidthC (53) CtbHeightC = CtbSizeY / SubHeightC (54) For log2BlockWidth ranging from 0 to 4 and for log2BlockHeight ranging from 0 to 4, inclusive, the up-right diagonal scan order array initialization process as specified in clause 6.5.2 is invoked with 1 << log2BlockWidth and 1 << log2BlockHeight as inputs, and the output is assigned to DiagScanOrder[ log2BlockWidth ][ log2BlockHeight ]. For log2BlockWidth ranging from 0 to 6 and for log2BlockHeight ranging from 0 to 6, inclusive, the horizontal and vertical traverse scan order array initialization process as specified in clause 6.5.3 is invoked with 1 << log2BlockWidth and 1 << log2BlockHeight as inputs, and the output is assigned to HorTravScanOrder[ log2BlockWidth ][ log2BlockHeight ] and VerTravScanOrder[ log2BlockWidth ][ log2BlockHeight ]. partition_constraints_override_enabled_flag equal to 1 specifies the presence of partition_constraints_override_flag in PHs referring to the SPS. partition_constraints_override_enabled_flag equal to 0 specifies the absence of partition_constraints_override_flag in PHs referring to the SPS. sps_log2_diff_min_qt_min_cb_intra_slice_luma specifies the default difference between the base 2 logarithm of the minimum size in luma samples of a luma leaf block resulting from quadtree splitting of a CTU and the base 2 logarithm of the minimum coding block size in luma samples for luma CUs in slices with slice_type equal to 2 (I) referring to the SPS. When partition_constraints_override_enabled_flag is equal to 1, the default difference can be overridden by ph_log2_diff_min_qt_min_cb_luma present in PHs referring to the SPS. The value of sps_log2_diff_min_qt_min_cb_intra_slice_luma shall be in the range of 0 to CtbLog2SizeY − MinCbLog2SizeY, inclusive. The base 2 logarithm of the minimum size in luma samples of a luma leaf block resulting from quadtree splitting of a CTU is derived as follows: MinQtLog2SizeIntraY = sps_log2_diff_min_qt_min_cb_intra_slice_luma + MinCbLog2SizeY (55) sps_max_mtt_hierarchy_depth_intra_slice_luma specifies the default maximum hierarchy depth for coding units resulting from multi-type tree splitting of a quadtree leaf in slices with slice_type equal to 2 (I) referring to the SPS. When partition_constraints_override_enabled_flag is equal to 1, the default maximum hierarchy depth can be overridden by ph_max_mtt_hierarchy_depth_intra_slice_luma present in PHs referring to the SPS. The value of sps_max_mtt_hierarchy_depth_intra_slice_luma shall be in the range of 0 to 2*( CtbLog2SizeY − MinCbLog2SizeY ), inclusive. sps_log2_diff_max_bt_min_qt_intra_slice_luma specifies the default difference between the base 2 logarithm of the maximum size (width or height) in luma samples of a luma coding block that can be split using a binary split and the minimum size (width or height) in luma samples of a luma leaf block resulting from quadtree splitting of a CTU in slices with slice_type equal to 2 (I) referring to the SPS. When partition_constraints_override_enabled_flag is equal to 1, the default difference can be overridden by ph_log2_diff_max_bt_min_qt_luma present in PHs referring to the SPS. The value of sps_log2_diff_max_bt_min_qt_intra_slice_luma shall be in the range of 0 to CtbLog2SizeY − MinQtLog2SizeIntraY, inclusive. When sps_log2_diff_max_bt_min_qt_intra_slice_luma is not present, the value of sps_log2_diff_max_bt_min_qt_intra_slice_luma is inferred to be equal to 0. sps_log2_diff_max_tt_min_qt_intra_slice_luma specifies the default difference between the base 2 logarithm of the maximum size (width or height) in luma samples of a luma coding block that can be split using a ternary split and the minimum size (width or height) in luma samples of a luma leaf block resulting from quadtree splitting of a CTU in slices with slice_type equal to 2 (I) referring to the SPS. When partition_constraints_override_enabled_flag is equal to 1, the default difference can be overridden by ph_log2_diff_max_tt_min_qt_luma present in PHs referring to the SPS. The value of sps_log2_diff_max_tt_min_qt_intra_slice_luma shall be in the range of 0 to CtbLog2SizeY − MinQtLog2SizeIntraY, inclusive. When sps_log2_diff_max_tt_min_qt_intra_slice_luma is not present, the value of sps_log2_diff_max_tt_min_qt_intra_slice_luma is inferred to be equal to 0. sps_log2_diff_min_qt_min_cb_inter_slice specifies the default difference between the base 2 logarithm of the minimum size in luma samples of a luma leaf block resulting from quadtree splitting of a CTU and the base 2 logarithm of the minimum luma coding block size in luma samples for luma CUs in slices with slice_type equal to 0 (B) or 1 (P) referring to the SPS. When partition_constraints_override_enabled_flag is equal to 1, the default difference can be overridden by ph_log2_diff_min_qt_min_cb_luma present in PHs referring to the SPS. The value of sps_log2_diff_min_qt_min_cb_inter_slice shall be in the range of 0 to CtbLog2SizeY − MinCbLog2SizeY, inclusive. The base 2 logarithm of the minimum size in luma samples of a luma leaf block resulting from quadtree splitting of a CTU is derived as follows: MinQtLog2SizeInterY = sps_log2_diff_min_qt_min_cb_inter_slice + MinCbLog2SizeY (56) sps_max_mtt_hierarchy_depth_inter_slice specifies the default maximum hierarchy depth for coding units resulting from multi-type tree splitting of a quadtree leaf in slices with slice_type equal to 0 (B) or 1 (P) referring to the SPS. When partition_constraints_override_enabled_flag is equal to 1, the default maximum hierarchy depth can be overridden by ph_max_mtt_hierarchy_depth_inter_slice present in PHs referring to the SPS. The value of sps_max_mtt_hierarchy_depth_inter_slice shall be in the range of 0 to 2*( CtbLog2SizeY − MinCbLog2SizeY ), inclusive. sps_log2_diff_max_bt_min_qt_inter_slice specifies the default difference between the base 2 logarithm of the maximum size (width or height) in luma samples of a luma coding block that can be split using a binary split and the minimum size (width or height) in luma samples of a luma leaf block resulting from quadtree splitting of a CTU in slices with slice_type equal to 0 (B) or 1 (P) referring to the SPS. When partition_constraints_override_enabled_flag is equal to 1, the default difference can be overridden by ph_log2_diff_max_bt_min_qt_luma present in PHs referring to the SPS. The value of sps_log2_diff_max_bt_min_qt_inter_slice shall be in the range of 0 to CtbLog2SizeY − MinQtLog2SizeInterY, inclusive. When sps_log2_diff_max_bt_min_qt_inter_slice is not present, the value of sps_log2_diff_max_bt_min_qt_inter_slice is inferred to be equal to 0. sps_log2_diff_max_tt_min_qt_inter_slice specifies the default difference between the base 2 logarithm of the maximum size (width or height) in luma samples of a luma coding block that can be split using a ternary split and the minimum size (width or height) in luma samples of a luma leaf block resulting from quadtree splitting of a CTU in slices with slice_type equal to 0 (B) or 1 (P) referring to the SPS. When partition_constraints_override_enabled_flag is equal to 1, the default difference can be overridden by ph_log2_diff_max_tt_min_qt_luma present in PHs referring to the SPS. The value of sps_log2_diff_max_tt_min_qt_inter_slice shall be in the range of 0 to CtbLog2SizeY − MinQtLog2SizeInterY, inclusive. When sps_log2_diff_max_tt_min_qt_inter_slice is not present, the value of sps_log2_diff_max_tt_min_qt_inter_slice is inferred to be equal to 0. sps_log2_diff_min_qt_min_cb_intra_slice_chroma specifies the default difference between the base 2 logarithm of the minimum size in luma samples of a chroma leaf block resulting from quadtree splitting of a chroma CTU with treeType equal to DUAL_TREE_CHROMA and the base 2 logarithm of the minimum coding block size in luma samples for chroma CUs with treeType equal to DUAL_TREE_CHROMA in slices with slice_type equal to 2 (I) referring to the SPS. When partition_constraints_override_enabled_flag is equal to 1, the default difference can be overridden by ph_log2_diff_min_qt_min_cb_chroma present in PHs referring to the SPS. The value of sps_log2_diff_min_qt_min_cb_intra_slice_chroma shall be in the range of 0 to CtbLog2SizeY − MinCbLog2SizeY, inclusive. When not present, the value of sps_log2_diff_min_qt_min_cb_intra_slice_chroma is inferred to be equal to 0. The base 2 logarithm of the minimum size in luma samples of a chroma leaf block resulting from quadtree splitting of a CTU with treeType equal to DUAL_TREE_CHROMA is derived as follows: MinQtLog2SizeIntraC = sps_log2_diff_min_qt_min_cb_intra_slice_chroma + MinCbLog2SizeY (57) sps_max_mtt_hierarchy_depth_intra_slice_chroma specifies the default maximum hierarchy depth for chroma coding units resulting from multi-type tree splitting of a chroma quadtree leaf with treeType equal to DUAL_TREE_CHROMA in slices with slice_type equal to 2 (I) referring to the SPS. When partition_constraints_override_enabled_flag is equal to 1, the default maximum hierarchy depth can be overridden by ph_max_mtt_hierarchy_depth_chroma present in PHs referring to the SPS. The value of sps_max_mtt_hierarchy_depth_intra_slice_chroma shall be in the range of 0 to 2*( CtbLog2SizeY − MinCbLog2SizeY ), inclusive. When not present, the value of sps_max_mtt_hierarchy_depth_intra_slice_chroma is inferred to be equal to 0. sps_log2_diff_max_bt_min_qt_intra_slice_chroma specifies the default difference between the base 2 logarithm of the maximum size (width or height) in luma samples of a chroma coding block that can be split using a binary split and the minimum size (width or height) in luma samples of a chroma leaf block resulting from quadtree splitting of a chroma CTU with treeType equal to DUAL_TREE_CHROMA in slices with slice_type equal to 2 (I) referring to the SPS. When partition_constraints_override_enabled_flag is equal to 1, the default difference can be overridden by ph_log2_diff_max_bt_min_qt_chroma present in PHs referring to the SPS. The value of sps_log2_diff_max_bt_min_qt_intra_slice_chroma shall be in the range of 0 to CtbLog2SizeY − MinQtLog2SizeIntraC, inclusive. When sps_log2_diff_max_bt_min_qt_intra_slice_chroma is not present, the value of sps_log2_diff_max_bt_min_qt_intra_slice_chroma is inferred to be equal to 0. sps_log2_diff_max_tt_min_qt_intra_slice_chroma specifies the default difference between the base 2 logarithm of the maximum size (width or height) in luma samples of a chroma coding block that can be split using a ternary split and the minimum size (width or height) in luma samples of a chroma leaf block resulting from quadtree splitting of a chroma CTU with treeType equal to DUAL_TREE_CHROMA in slices with slice_type equal to 2 (I) referring to the SPS. When partition_constraints_override_enabled_flag is equal to 1, the default difference can be overridden by ph_log2_diff_max_tt_min_qt_chroma present in PHs referring to the SPS. The value of sps_log2_diff_max_tt_min_qt_intra_slice_chroma shall be in the range of 0 to CtbLog2SizeY − MinQtLog2SizeIntraC, inclusive. When sps_log2_diff_max_tt_min_qt_intra_slice_chroma is not present, the value of sps_log2_diff_max_tt_min_qt_intra_slice_chroma is inferred to be equal to 0. sps_max_luma_transform_size_64_flag equal to 1 specifies that the maximum transform size in luma samples is equal to 64. sps_max_luma_transform_size_64_flag equal to 0 specifies that the maximum transform size in luma samples is equal to 32. When CtbSizeY is less than 64, the value of sps_max_luma_transform_size_64_flag shall be equal to 0. The variables MinTbLog2SizeY, MaxTbLog2SizeY, MinTbSizeY, and MaxTbSizeY are derived as follows: MinTbLog2SizeY = 2 (58) MaxTbLog2SizeY = sps_max_luma_transform_size_64_flag ? 6 : 5 (59) MinTbSizeY = 1 << MinTbLog2SizeY (60) MaxTbSizeY = 1 << MaxTbLog2SizeY (61) sps_joint_cbcr_enabled_flag equal to 0 specifies that the joint coding of chroma residuals is disabled. sps_joint_cbcr_enabled_flag equal to 1 specifies that the joint coding of chroma residuals is enabled. When not present, the value of sps_joint_cbcr_enabled_flag is inferred to be equal to 0. same_qp_table_for_chroma equal to 1 specifies that only one chroma QP mapping table is signalled and this table applies to Cb and Cr residuals and additionally to joint Cb-Cr residuals when sps_joint_cbcr_enabled_flag is equal to 1. same_qp_table_for_chroma equal to 0 specifies that chroma QP mapping tables, two for Cb and Cr, and one additional for joint Cb-Cr when sps_joint_cbcr_enabled_flag is equal to 1, are signalled in the SPS. When same_qp_table_for_chroma is not present in the bitstream, the value of same_qp_table_for_chroma is inferred to be equal to 1. qp_table_start_minus26[ i ] plus 26 specifies the starting luma and chroma QP used to describe the i-th chroma QP mapping table. The value of qp_table_start_minus26[ i ] shall be in the range of −26 − QpBdOffset to 36 inclusive. When qp_table_start_minus26[ i ] is not present in the bitstream, the value of qp_table_start_minus26[ i ] is inferred to be equal to 0. num_points_in_qp_table_minus1[ i ] plus 1 specifies the number of points used to describe the i-th chroma QP mapping table. The value of num_points_in_qp_table_minus1[ i ] shall be in the range of 0 to 63 + QpBdOffset, inclusive. When num_points_in_qp_table_minus1[ 0 ] is not present in the bitstream, the value of num_points_in_qp_table_minus1[ 0 ] is inferred to be equal to 0. delta_qp_in_val_minus1[ i ][ j ] specifies a delta value used to derive the input coordinate of the j-th pivot point of the i-th chroma QP mapping table. When delta_qp_in_val_minus1[ 0 ][ j ] is not present in the bitstream, the value of delta_qp_in_val_minus1[ 0 ][ j ] is inferred to be equal to 0. delta_qp_diff_val[ i ][ j ] specifies a delta value used to derive the output coordinate of the j-th pivot point of the i-th chroma QP mapping table. The i-th chroma QP mapping table ChromaQpTable[ i ] for i = 0..numQpTables − 1 is derived as follows: qpInVal[ i ][ 0 ] = qp_table_start_minus26[ i ] + 26 qpOutVal[ i ][ 0 ] = qpInVal[ i ][ 0 ] for( j = 0; j <= num_points_in_qp_table_minus1[ i ]; j++ ) { qpInVal[ i ][ j + 1 ] = qpInVal[ i ][ j ] + delta_qp_in_val_minus1[ i ][ j ] + 1 qpOutVal[ i ][ j + 1 ] = qpOutVal[ i ][ j ] + ( delta_qp_in_val_minus1[ i ][ j ] ^ delta_qp_diff_val[ i ][ j ] ) } ChromaQpTable[ i ][ qpInVal[ i ][ 0 ] ] = qpOutVal[ i ][ 0 ] for( k = qpInVal[ i ][ 0 ] − 1; k >= −QpBdOffset; k − − ) ChromaQpTable[ i ][ k ] = Clip3( −QpBdOffset, 63, ChromaQpTable[ i ][ k + 1 ] − 1 ) (62) for( j = 0; j <= num_points_in_qp_table_minus1[ i ]; j++ ) { sh = ( delta_qp_in_val_minus1[ i ][j ] + 1 ) >> 1 for( k = qpInVal[ i ][ j ] + 1, m = 1; k <= qpInval[ i ][ j + 1 ]; k++, m++ ) ChromaQpTable[ i ][ k ] = ChromaQpTable[ i ][ qpInVal[ i ][ j ] ] + ( ( qpOutVal[ i ][j + 1] − qpOutVal[ i ][j ] ) * m + sh ) / ( delta_qp_in_val_minus1[ i ][ j ] + 1 ) } for( k = qpInVal[ i ][ num_points_in_qp_table_minus1[ i ] + 1 ] + 1; k <= 63; k++ ) ChromaQpTable[ i ][ k ] = Clip3( −QpBdOffset, 63, ChromaQpTable[ i ][ k − 1 ] + 1 ) When same_qp_table_for_chroma is equal to 1, ChromaQpTable[ 1 ][ k ] and ChromaQpTable[ 2 ][ k ] are set equal to ChromaQpTable[ 0 ][ k ] for k in the range of −QpBdOffset to 63, inclusive. It is a requirement of bitstream conformance that the values of qpInVal[ i ][ j ] and qpOutVal[ i ][ j ] shall be in the range of −QpBdOffset to 63, inclusive for i in the range of 0 to numQpTables − 1, inclusive, and j in the range of 0 to num_points_in_qp_table_minus1[ i ] + 1, inclusive. sps_sao_enabled_flag equal to 1 specifies that the sample adaptive offset process is applied to the reconstructed picture after the deblocking filter process. sps_sao_enabled_flag equal to 0 specifies that the sample adaptive offset process is not applied to the reconstructed picture after the deblocking filter process. sps_alf_enabled_flag equal to 0 specifies that the adaptive loop filter is disabled. sps_alf_enabled_flag equal to 1 specifies that the adaptive loop filter is enabled. sps_ccalf_enabled_flag equal to 0 specifies that the cross-component adaptive loop filter is disabled. sps_ccalf_enabled_flag equal to 1 specifies that the cross-component adaptive loop filter may be enabled. sps_transform_skip_enabled_flag equa to 1 specifies that transform_skip_flag may be present in the transform unit syntax. sps_transform_skip_enabled_flag equal to 0 specifies that transform_skip_flag is not present in the transform unit syntax. log2_transform_skip_max_size_minus2 specifies the maximum block size used for transform skip, and shall be in the range of 0 to 3, inclusive. The variable MaxTsSize is set equal to 1 << ( log2_transform_skip_max_size_minus2 + 2 ). sps_bdpcm_enabled_flag equal to 1 specifies that intra_bdpcm_luma_flag and intra_bdpcm_chroma_flag may be present in the coding unit syntax for intra coding units. sps_bdpcm_enabled_flag equal to 0 specifies that intra_bdpcm_luma_flag and intra_bdpcm_chroma_flag are not present in the coding unit syntax for intra coding units. When not present, the value of sps_bdpcm_enabled_flag is inferred to be equal to 0. sps_ref_wraparound_enabled_flag equal to 1 specifies that horizontal wrap-around motion compensation is applied in inter prediction. sps_ref_wraparound_enabled_flag equal to 0 specifies that horizontal wrap-around motion compensation is not applied. When the value of ( CtbSizeY / MinCbSizeY + 1) is greater than ( pic_width_in_luma_samples / MinCbSizeY − 1 ), where pic_width_in_luma_samples is the value of pic_width_in_luma_samples in any PPS that refers to the SPS, the value of sps_ref_wraparound_enabled_flag shall be equal to 0. sps_temporal_mvp_enabled_flag equal to 1 specifies that temporal motion vector predictors may be used in the CLVS. sps_temporal_mvp_enabled_flag equal to 0 specifies that temporal motion vector predictors are not used in the CLVS. sps_sbtmvp_enabled_flag equal to 1 specifies that subblock-based temporal motion vector predictors may be used in decoding of pictures with all slices having slice_type not equal to I in the CLVS. sps_sbtmvp_enabled_flag equal to 0 specifies that subblock-based temporal motion vector predictors are not used in the CLVS. When sps_sbtmvp_enabled_flag is not present, it is inferred to be equal to 0. sps_amvr_enabled_flag equal to 1 specifies that adaptive motion vector difference resolution is used in motion vector coding. amvr_enabled_flag equal to 0 specifies that adaptive motion vector difference resolution is not used in motion vector coding. sps_bdof_enabled_flag equal to 0 specifies that the bi-directional optical flow inter prediction is disabled. sps_bdof_enabled_flag equal to 1 specifies that the bi-directional optical flow inter prediction is enabled. sps_bdof_pic_present_flag equal to 1 specifies that ph_disable_bdof_flag is present in PHs referring to the SPS. sps_bdof_pic_present_flag equal to 0 specifies that ph_disable_bdof_flag is not present in PHs referring to the SPS. When sps_bdof_pic_present_flag is not present, the value of sps_bdof_pic_present_flag is inferred to be equal to 0. sps_smvd_enabled_flag equal to 1 specifies that symmetric motion vector difference may be used in motion vector decoding. sps_smvd_enabled_flag equal to 0 specifies that symmetric motion vector difference is not used in motion vector coding. sps_dmvr_enabled_flag equal to 1 specifies that decoder motion vector refinement based inter bi-prediction is enabled. sps_dmvr_enabled_flag equal to 0 specifies that decoder motion vector refinement based inter bi-prediction is disabled. sps_dmvr_pic_present_flag equal to 1 specifies that ph_disable_dmvr_flag is present in PHs referring to the SPS. sps_dmvr_pic_present_flag equal to 0 specifies that ph_disable_dmvr_flag is not present in PHs referring to the SPS. When sps_dmvr_pic_present_flag is not present, the value of sps_dmvr_pic_present_flag is inferred to be equal to 0. sps_mmvd_enabled_flag equal to 1 specifies that merge mode with motion vector difference is enabled. sps_mmvd_enabled_flag equal to 0 specifies that merge mode with motion vector difference is disabled. sps_isp_enabled_flag equal to 1 specifies that intra prediction with subpartitions is enabled. sps_isp_enabled_flag equal to 0 specifies that intra prediction with subpartitions is disabled. sps_mrl_enabled_flag equal to 1 specifies that intra prediction with multiple reference lines is enabled. sps_mrl_enabled_flag equal to 0 specifies that intra prediction with multiple reference lines is disabled. sps_mip_enabled_flag equal to 1 specifies that matrix-based intra prediction is enabled. sps_mip_enabled_flag equal to 0 specifies that matrix-based intra prediction is disabled. sps_cclm_enabled_flag equal to 0 specifies that the cross-component linear model intra prediction from luma component to chroma component is disabled. sps_cclm_enabled_flag equal to 1 specifies that the cross-component linear model intra prediction from luma component to chroma componenent is enabled. When sps_cclm_enabled_flag is not present, it is inferred to be equal to 0. sps_chroma_horizontal_collocated_flag equal to 1 specifies that prediction processes operate in a manner designed for chroma sample positions that are not horizontally shifted relative to corresponding luma sample positions. sps_chroma_horizontal_collocated_flag equal to 0 specifies that prediction processes operate in a manner designed for chroma sample positions that are shifted to the right by 0.5 in units of luma samples relative to corresponding luma sample positions. When sps chroma horizontal collocated flag is not present, it is inferred to be equal to 1. sps_chroma_vertical_collocated_flag equal to 1 specifies that prediction processes operate in a manner designed for chroma sample positions that are not vertically shifted relative to corresponding luma sample positions. sps_chroma_vertical_collocated_flag equal to 0 specifies that prediction processes operate in a manner designed for chroma sample positions that are shifted downward by 0.5 in units of luma samples relative to corresponding luma sample positions. When sps_chroma_vertical_collocated_flag is not present, it is inferred to be equal to 1. sps_mts_enabled_flag equal to 1 specifies that sps_explicit_mts_intra_enabled_flag is present in the sequence parameter set RBSP syntax and sps_explicit_mts_inter_enabled_flag is present in the sequence parameter set RBSP syntax. sps_mts_enabled_flag equal to 0 specifies that sps_explicit_mts_intra_enabled_flag is not present in the sequence parameter set RBSP syntax and sps_explicit_mts_inter_enabled_flag is not present in the sequence parameter set RBSP syntax. sps_explicit_mts_intra_enabled_flag equal to 1 specifies that mts_idx may be present in intra coding unit syntax. sps_explicit_mts_intra_enabled_flag equal to 0 specifies that mts_idx is not present in intra coding unit syntax. When not present, the value of sps_explicit_mts_intra_enabled_flag is inferred to be equal to 0. sps_explicit_mts_inter_enabled_flag equal to 1 specifies that mts_idx may be present in inter coding unit syntax. sps_explicit_mts_inter_enabled_flag equal to 0 specifies that mts_idx is not present in inter coding unit syntax. When not present, the value of sps_explicit_mts_inter_enabled_flag is inferred to be equal to 0. six_minus_max_num_merge_cand specifies the maximum number of merging motion vector prediction (MVP) candidates supported in the SPS subtracted from 6. The value of six_minus_max_num_merge_cand shall be in the range of 0 to 5, inclusive. The maximum number of merging MVP candidates, MaxNumMergeCand, is derived as follows: MaxNumMergeCand = 6 − six_minus_max_num_merge_cand (63) sps_sbt_enabled_flag equal to 0 specifies that subblock transform for inter-predicted CUs is disabled. sps_sbt_enabled_flag equal to 1 specifies that subblock transform for inter- predicteds CU is enabled. sps_affine_enabled_flag specifies whether affine model based motion compensation can be used for inter prediction. If sps_affine_enabled_flag is equal to 0, the syntax shall be constrained such that no affine model based motion compensation is used in the CLVS, and inter_affine_flag and cu_affine_type_flag are not present in coding unit syntax of the CLVS. Otherwise (sps_affine_enabled_flag is equal to 1), affine model based motion compensation can be used in the CLVS. five_minus_max_num_subblock_merge_cand specifies the maximum number of subblock-based merging motion vector prediction candidates supported in the SPS subtracted from 5. The value of five_minus_max_num_subblock_merge_cand shall be in the range of 0 to 5, inclusive. sps_affine_type_flag specifies whether 6-parameter affine model based motion compensation can be used for inter prediction. If sps affine type flag is equal to 0, the syntax shall be constrained such that no 6-parameter affine model based motion compensation is used in the CLVS, and cu_affine_type_flag is not present in coding unit syntax in the CLVS. Otherwise (sps_affine_type_flag is equal to 1), 6-parameter affine model based motion compensation can be used in the CLVS. When not present, the value of sps_affine_type_flag is inferred to be equal to 0. sps_affine_amvr_enabled_flag equal to 1 specifies that adaptive motion vector difference resolution is used in motion vector coding of affine inter mode. sps_affine_amvr_enabled_flag equal to 0 specifies that adaptive motion vector difference resolution is not used in motion vector coding of affine inter mode. When not present, the value of sps_affine_amvr_enabled_flag is inferred to be equal to 0. sps_affine_prof_enabled_flag specifies whether the prediction refinement with optical flow can be used for affine motion compensation. If sps_affine_prof_enabled_flag is equal to 0, the affine motion compensation shall not be refined with optical flow. Otherwise (sps_affine_prof_enabled_flag is equal to 1), the affine motion compensation can be refined with optical flow. When not present, the value of sps_affine_prof_enabled_flag is inferred to be equal to 0. sps_prof_pic_present_flag equal to 1 specifies that ph_disable_prof_flag is present in PHs referring to the SPS. sps_prof_pic_present_flag equal to 0 specifies that ph_disable_prof_flag is not present in PHs referring to the SPS. When sps_prof_pic_present_flag is not present, the value of sps_prof_pic_present_flag is inferred to be equal to 0. sps_palette_enabled_flag equal to 1 specifies that pred_mode_plt_flag may be present in the coding unit syntax. sps_palette_enabled_flag equal to 0 specifies that pred_mode_plt_flag is not present in the coding unit syntax. When sps_palette_enabled_flag is not present, it is inferred to be equal to 0. sps_act_enabled_flag equal to 1 specifies that adaptive colour transform may be used and the cu_act_enabled_flag may be present in the coding unit syntax. sps_act_enabled_flag equal to 0 speifies that adaptive colour transform is not used and cu_act_enabled_flag is not present in the coding unit syntax. When sps_act_enabled_flag is not present, it is inferred to be equal to 0. min_qp_prime_ts_minus4 specifies the minimum allowed quantization parameter for transform skip mode as follows: QpPrimeTsMin = 4 + min_qp_prime_ts_minus4 (64) The value of min_qp_prime_ts_minus4 shall be in the range of 0 to 48, inclusive. sps_bcw_enabled_flag specifies whether bi-prediction with CU weights can be used for inter prediction. If sps_bcw_enabled_flag is equal to 0, the syntax shall be constrained such that no bi-prediction with CU weights is used in the CLVS, and bcw_idx is not present in coding unit syntax of the CLVS. Otherwise (sps_bcw_enabled_flag is equal to 1), bi- prediction with CU weights can be used in the CLVS. sps_ibc_enabled_flag equal to 1 specifies that the IBC prediction mode may be used in decoding of pictures in the CLVS. sps_ibc_enabled_flag equal to 0 specifies that the IBC prediction mode is not used in the CLVS. When sps_ibc_enabled_flag is not present, it is inferred to be equal to 0. six_minus_max_num_ibc_merge_cand, when sps_ibc_enabled_flag is equal to 1, specifies the maximum number of IBC merging block vector prediction (BVP) candidates supported in the SPS subtracted from 6. The value of six_minus_max_num_ibc_merge_cand shall be in the range of 0 to 5, inclusive. The maximum number of IBC merging BVP candidates, MaxNumIbcMergeCand, is derived as follows: if( sps_ibc_enabled_flag ) MaxNumIbcMergeCand = 6 − six_minus_max_num_ibc_merge_cand (65) else MaxNumIbcMergeCand = 0 sps_ciip_enabled_flag specifies that ciip_flag may be present in the coding unit syntax for inter coding units. sps_ciip_enabled_flag equal to 0 specifies that ciip_flag is not present in the coding unit syntax for inter coding units. sps_fpel_mmvd_enabled_flag equal to 1 specifies that merge mode with motion vector difference is using integer sample precision. sps_fpel_mmvd_enabled_flag equal to 0 specifies that merge mode with motion vector difference can use fractional sample precision. sps_gpm_enabled_flag specifies whether geometric partition based motion compensation can be used for inter prediction. sps_gpm_enabled_flag equal to 0 specifies that the syntax shall be constrained such that no geometric partition based motion compensation is used in the CLVS, and merge_gpm_partition_idx, merge_gpm_idx0, and merge_gpm_idx1 are not present in coding unit syntax of the CLVS. sps_gpm_enabled_flag equal to 1 specifies that geometric partition based motion compensation can be used in the CLVS. When not present, the value of sps_gpm_enabled_flag is inferred to be equal to 0. max_num_merge_cand_minus_max_num_gpm_cand specifies the maximum number of geometric partitioning merge mode candidates supported in the SPS subtracted from MaxNumMergeCand. The maximum number of geometric partitioning merge mode candidates, MaxNumGpmMergeCand, is derived as follows: if( sps_gpm_enabled_flag && MaxNumMergeCand >= 3 ) MaxNumGpmMergeCand = MaxNumMergeCand − max_num_merge_cand_minus_max_num_gpm_cand (66) else if( sps_gpm_enabled_flag && MaxNumMergeCand = = 2 ) MaxNumGpmMergeCand = 2 else MaxNumGpmMergeCand = 0 The value of MaxNumGpmMergeCand shall be in the range of 2 to MaxNumMergeCand, inclusive. sps_lmcs_enabled_flag equal to 1 specifies that luma mapping with chroma scaling is used in the CLVS. sps_lmcs_enabled_flag equal to 0 specifies that luma mapping with chroma scaling is not used in the CLVS. sps_lfnst_enabled_flag equal to 1 specifies that lfnst_idx may be present in intra coding unit syntax. sps_lfnst_enabled_flag equal to 0 specifies that lfnst_idx is not present in intra coding unit syntax. sps_ladf_enabled_flag equal to 1, specifies that sps_num_ladf_intervals_minus2, sps_ladf_lowest_interval_qp_offset, sps_ladf_qp_offset[ i ], and sps_ladf_delta_threshold_minus1[ i ] are present in the SPS. sps_num_ladf_intervals_minus2 plus 1 specifies the number of sps_ladf_delta_threshold_minus1[ i ] and sps_ladf_qp_offset[ i ] syntax elements that are present in the SPS. The value of sps_num_ladf_intervals_minus2 shall be in the range of 0 to 3, inclusive. sps_ladf_lowest_interval_qp_offset specifies the offset used to derive the variable qP as specified in clause 8.8.3.6.1. The value of sps_ladf_lowest_interval_qp_offset shall be in the range of −63 to 63, inclusive. sps_ladf_qp_offset[ i ] specifies the offset array used to derive the variable qP as specified in clause 8.8.3.6.1. The value of sps_ladf_qp_offset[ i ] shall be in the range of −63 to 63, inclusive. sps_ladf_delta_threshold_minus1[ i ] is used to compute the values of SpsLadfIntervalLowerBound[ i ], which specifies the lower bound of the i-th luma intensity level interval. The value of sps_ladf_delta_threshold_minus1[ i ] shall be in the range of 0 to 2BitDepth − 3, inclusive. The value of SpsLadfIntervalLowerBound[ 0 ] is set equal to 0. For each value of i in the range of 0 to sps_num_ladf_intervals_minus2, inclusive, the variable SpsLadfIntervalLowerBound[ i + 1 ] is derived as follows: SpsLadfIntervalLowerBound[ i + 1 ] = SpsLadfIntervalLowerBound[ i ] (67) + sps_ladf_delta_threshold_minus1[ i ] + 1 log2_parallel_merge_level_minus2 plus 2 specifies the value of the variable Log2ParMrgLevel, which is used in the derivation process for spatial merging candidates as specified in clause 8.5.2.3, the derivation process for motion vectors and reference indices in subblock merge mode as specified in clause 8.5.5.2, and to control the invocation of the updating process for the history-based motion vector predictor list in clause 8.5.2.1. The value of log2_parallel_merge_level_minus2 shall be in the range of 0 to CtbLog2SizeY − 2, inclusive. The variable Log2ParMrgLevel is derived as follows: Log2ParMrgLevel = log2_parallel_merge_level_minus2 + 2 (68) sps_scaling_list_enabled_flag equal to 1 specifies that a scaling list is used for the scaling process for transform coefficients. sps_scaling_list_enabled_flag equal to 0 specifies that scaling list is not used for the scaling process for transform coefficients. sps_dep_quant_enabled_flag equal to 0 specifies that dependent quantization is disabled for pictures referring to the SPS. sps_dep_quant_enabled_flag equal to 1 specifies that dependent quantization may be enabled for pictures referring to the SPS. sps_sign_data_hiding_enabled_flag equal to 0 specifies that sign bit hiding is disabled for pictures referring to the SPS. sps_sign_data_hiding_enabled_flag equal to 1 specifies that sign bit hiding may be enabled for pictures referring to the SPS. When sps_sign_data_hiding_enabled_flag is not present, it is inferred to be equal to 0. sps_virtual_boundaries_enabled_flag equal to 1 specifies that disabling in-loop filtering across virtual boundaries may be applied in the coded pictures in the CLVS. sps_virtual_boundaries_enabled_flag equal to 0 specifies that disabling in-loop filtering across virtual boundaries is not applied in the coded pictures in the CLVS. In-loop filtering operations include the deblocking filter, sample adaptive offset filter, and adaptive loop filter operations. sps_virtual_boundaries_present_flag equal to 1 specifies that information of virtual boundaries is signalled in the SPS. sps_virtual_boundaries_present_flag equal to 0 specifies that information of virtual boundaries is not signalled in the SPS. When there is one or more than one virtual boundaries signalled in the SPS, the in-loop filtering operations are disabled across the virtual boundaries in pictures referring to the SPS. In-loop filtering operations include the deblocking filter, sample adaptive offset filter, and adaptive loop filter operations. It is a requirement of bitstream conformance that when the value of res_change_in_clvs_allowed_flag is equal to 1, the value of sps_virtual_boundaries_present_flag shall be equal to 0. sps_num_ver_virtual_boundaries specifies the number of sps_virtual_boundaries_pos_x[ i ] syntax elements that are present in the SPS. When sps_num_ver_virtual_boundaries is not present, it is inferred to be equal to 0. sps_virtual_boundaries_pos_x[ i ] specifies the location of the i-th vertical virtual boundary in units of luma samples divided by 8. The value of sps_virtual_boundaries_pos_x[ i ] shall be in the range of 1 to Ceil( pic_width_in_luma_samples ÷ 8 ) − 1, inclusive. sps_num_hor_virtual_boundaries specifies the number of sps_virtual_boundaries_pos_y[ i ] syntax elements that are present in the SPS. When sps_num_hor_virtual_boundaries is not present, it is inferred to be equal to 0. When sps_virtual_boundaries_enabled_flag is equal to 1 and sps_virtual_boundaries_present_flag is equal to 1, the sum of sps_num_ver_virtual_boundaries and sps_num_hor_virtual_boundaries shall be greater than 0. sps_virtual_boundaries_pos_y[ i ] specifies the location of the i-th horizontal virtual boundary in units of luma samples divided by 8. The value of sps_virtual_boundaries_pos_y[ i ] shall be in the range of 1 to Ceil( pic_height_in_luma_samples ÷ 8 ) − 1, inclusive. sps_general_hrd_params_present_flag equal to 1 specifies that the syntax structure general_hrd_parameters( ) is present in the SPS RBSP syntax structure. sps_general_hrd_params_present_flag equal to 0 specifies that the syntax structure general_hrd_parameters( ) is not present in the SPS RBSP syntax structure. sps_sublayer_cpb_params_present_flag equal to 1 specifies that the syntax structure old_hrd_parameters( ) in the SPS RBSP includes HRD parameters for sublayer representations with TemporalId in the range of 0 to sps_max_sublayers_minus1, inclusive. sps_sublayer_cpb_params_present_flag equal to 0 specifies that the syntax structure ols_hrd_parameters( ) in the SPS RBSP includes HRD parameters for the sublayer representation with TemporalId equal to sps_max_sublayers_minus1 only. When sps max sublayers minus1 is equal to 0, the value of sps_sublayer_cpb_params_present_flag is inferred to be equal to 0. When sps_sublayer_cpb_params_present_flag is equal to 0, the HRD parameters for the sublayer representations with TemporalId in the range of 0 to sps_max_sublayers_minus1 − 1, inclusive, are inferred to be the same as that for the sublayer representation with TemporalId equal to sps_max_sublayers_minus1. These include the HRD parameters starting from the fixed_pic_rate_general_flag[ i ] syntax element till the sublayer_hrd_parameters( i ) syntax structure immediately under the condition "if( general_vcl_hrd_params_present_flag )" in the ols_hrd_parameters syntax structure. field_seq_flag equal to 1 indicates that the CLVS conveys pictures that represent fields. field_seq_flag equal to 0 indicates that the CLVS conveys pictures that represent frames. When general_frame_only_constraint_flag is equal to 1, the value of field_seq_flag shall be equal to 0. When field_seq_flag is equal to 1, a frame-field information SEI message shall be present for every coded picture in the CLVS. NOTE 5 – The specified decoding process does not treat pictures that represent fields or frames differently. A sequence of pictures that represent fields would therefore be coded with the picture dimensions of an individual field. For example, pictures that represent 1080i fields would commonly have cropped output dimensions of 1920x540, while the sequence picture rate would commonly express the rate of the source fields (typically between 50 and 60 Hz), instead of the source frame rate (typically between 25 and 30 Hz). vui_parameters_present_flag equal to 1 specifies that the syntax structure vui_parameters( ) is present in the SPS RBSP syntax structure. vui_parameters_present_flag equal to 0 specifies that the syntax structure vui_parameters( ) is not present in the SPS RBSP syntax structure. sps_extension_flag equal to 0 specifies that no sps_extension_data_flag syntax elements are present in the SPS RBSP syntax structure. sps_extension_flag equal to 1 specifies that there are sps_extension_data_flag syntax elements present in the SPS RBSP syntax structure. sps_extension_data_flag may have any value. Its presence and value do not affect decoder conformance to profiles specified in this version of this Specification. Decoders conforming to this version of this Specification shall ignore all sps_extension_data_flag syntax elements. [0052] VVC’s picture parameter set (PPS) contains such information which could change from picture to picture. The PPS includes information roughly comparable what was part of the PPS in HEVC, including: 1) a self-reference; 2) initial picture control information such as initial quantization parameter (QP), a number of flags indicating the use of, or presence of, certain tools or control information in the slice header; and 3) tiling information. The syntax and the associated semantic of the sequence parameter set in current VVC draft specification is illustrated in Table 8 and Table 9, respectively. How to read the Table 8 is illustrated in the appendix section of this invention which could also be found in the VVC specification. Table 8. Picture parameter set RBSP syntax pic_parameter_set_rbsp( ) { Descriptor
Figure imgf000051_0001
Figure imgf000052_0001
Figure imgf000053_0001
picture_header_extension_present_flag u(1) slice_header_extension_present_flag u(1) pps_extension_flag u(1) if( pps_extension_flag ) while( more_rbsp_data( ) ) pps_extension_data_flag u(1) rbsp_trailing_bits( )
Figure imgf000054_0001
} Table 9. Picture parameter set RBSP semantics A PPS RBSP shall be available to the decoding process prior to it being referenced, included in at least one AU with TemporalId less than or equal to the TemporalId of the PPS NAL unit or provided through external means. All PPS NAL units with a particular value of pps_pic_parameter_set_id within a PU shall have the same content. pps_pic_parameter_set_id identifies the PPS for reference by other syntax elements. The value of pps_pic_parameter_set_id shall be in the range of 0 to 63, inclusive. PPS NAL units, regardless of the nuh_layer_id values, share the same value space of pps_pic_parameter_set_id. Let ppsLayerId be the value of the nuh_layer_id of a particular PPS NAL unit, and vclLayerId be the value of the nuh_layer_id of a particular VCL NAL unit. The particular VCL NAL unit shall not refer to the particular PPS NAL unit unless ppsLayerId is less than or equal to vclLayerId and the layer with nuh_layer_id equal to ppsLayerId is included in at least one OLS that includes the layer with nuh_layer_id equal to vclLayerId. pps_seq_parameter_set_id specifies the value of sps_seq_parameter_set_id for the SPS. The value of pps_seq_parameter_set_id shall be in the range of 0 to 15, inclusive. The value of pps_seq_parameter_set_id shall be the same in all PPSs that are referred to by coded pictures in a CLVS. mixed_nalu_types_in_pic_flag equal to 1 specifies that each picture referring to the PPS has more than one VCL NAL unit, the VCL NAL units do not have the same value of nal_unit_type, and the picture is not an IRAP picture. mixed_nalu_types_in_pic_flag equal to 0 specifies that each picture referring to the PPS has one or more VCL NAL units and the VCL NAL units of each picture refering to the PPS have the same value of nal_unit_type. When no_mixed_nalu_types_in_pic_constraint_flag is equal to 1, the value of mixed_nalu_types_in_pic_flag shall be equal to 0. For each slice with a nal_unit_type value nalUnitTypeA in the range of IDR_W_RADL to CRA_NUT, inclusive, in a picture picA that also contains one or more slices with another value of nal_unit_type (i.e., the value of mixed_nalu_types_in_pic_flag for the picture picA is equal to 1), the following applies: – The slice shall belong to a subpicture subpicA for which the value of the corresponding subpic_treated_as_pic_flag[ i ] is equal to 1. – The slice shall not belong to a subpicture of picA containing VCL NAL units with nal_unit_type not equal to nalUnitTypeA. – If nalUnitTypeA is equal to CRA, for all the following PUs following the current picture in the CLVS in decoding order and in output order, neither RefPicList[ 0 ] nor RefPicList[ 1 ] of a slice in subpicA in those PUs shall include any picture preceding picA in decoding order in an active entry. – Otherwise (i.e., nalUnitTypeA is equal to IDR_W_RADL or IDR_N_LP), for all the PUs in the CLVS following the current picture in decoding order, neither RefPicList[ 0 ] nor RefPicList[ 1 ] of a slice in subpicA in those PUs shall include any picture preceding picA in decoding order in an active entry. NOTE 1– mixed_nalu_types_in_pic_flag equal to 1 indicates that pictures referring to the PPS contain slices with different NAL unit types, e.g., coded pictures originating from a subpicture bitstream merging operation for which encoders have to ensure matching bitstream structure and further alignment of parameters of the original bitstreams. One example of such alignments is as follows: When the value of sps_idr_rpl_present_flag is equal to 0 and mixed_nalu_types_in_pic_flag is equal to 1, a picture referring to the PPS cannot have slices with nal_unit_type equal to IDR_W_RADL or IDR_N_LP. pic_width_in_luma_samples specifies the width of each decoded picture referring to the PPS in units of luma samples. pic_width_in_luma_samples shall not be equal to 0, shall be an integer multiple of Max( 8, MinCbSizeY ), and shall be less than or equal to pic_width_max_in_luma_samples. When res_change_in_clvs_allowed_flag equal to 0, the value of pic_width_in_luma_samples shall be equal to pic_width_max_in_luma_samples. pic_height_in_luma_samples specifies the height of each decoded picture referring to the PPS in units of luma samples. pic_height_in_luma_samples shall not be equal to 0 and shall be an integer multiple of Max( 8, MinCbSizeY ), and shall be less than or equal to pic_height_max_in_luma_samples. When res_change_in_clvs_allowed_flag equal to 0, the value of pic_height_in_luma_samples shall be equal to pic_height_max_in_luma_samples. The variables PicWidthInCtbsY, PicHeightInCtbsY, PicSizeInCtbsY, PicWidthInMinCbsY, PicHeightInMinCbsY, PicSizeInMinCbsY, PicSizeInSamplesY, PicWidthInSamplesC and PicHeightInSamplesC are derived as follows: PicWidthInCtbsY = Ceil( pic_width_in_luma_samples ÷ CtbSizeY ) (69) PicHeightInCtbsY = Ceil( pic_height_in_luma_samples ÷ CtbSizeY ) (70) PicSizeInCtbsY = PicWidthInCtbsY * PicHeightInCtbsY (71) PicWidthInMinCbsY = pic_width_in_luma_samples / MinCbSizeY (72) PicHeightInMinCbsY = pic_height_in_luma_samples / MinCbSizeY (73) PicSizeInMinCbsY = PicWidthInMinCbsY * PicHeightInMinCbsY (74) PicSizeInSamplesY = pic_width_in_luma_samples * pic_height_in_luma_samples (75) PicWidthInSamplesC = pic_width_in_luma_samples / SubWidthC (76) PicHeightInSamplesC = pic_height_in_luma_samples / SubHeightC (77) pps_conformance_window_flag equal to 1 indicates that the conformance cropping window offset parameters follow next in the PPS. pps_conformance_window_flag equal to 0 indicates that the conformance cropping window offset parameters are not present in the PPS. pps_conf_win_left_offset, pps_conf_win_right_offset, pps_conf_win_top_offset, and pps_conf_win_bottom_offset specify the samples of the pictures in the CLVS that are output from the decoding process, in terms of a rectangular region specified in picture coordinates for output. When pps_conformance_window_flag is equal to 0, the values of pps_conf_win_left_offset, pps_conf_win_right_offset, pps_conf_win_top_offset, and pps_conf_win_bottom_offset are inferred to be equal to 0. The conformance cropping window contains the luma samples with horizontal picture coordinates from SubWidthC * pps_conf_win_left_offset to pic_width_in_luma_samples − ( SubWidthC * pps_conf_win_right_offset + 1 ) and vertical picture coordinates from SubHeightC * pps_conf_win_top_offset to pic_height_in_luma_samples − ( SubHeightC * pps_conf_win_bottom_offset + 1 ), inclusive. The value of SubWidthC * ( pps_conf_win_left_offset + pps_conf_win_right_offset ) shall be less than pic_width_in_luma_samples, and the value of SubHeightC * ( pps_conf_win_top_offset + pps_conf_win_bottom_offset ) shall be less than pic_height_in_luma_samples. When ChromaArrayType is not equal to 0, the corresponding specified samples of the two chroma arrays are the samples having picture coordinates ( x / SubWidthC, y / SubHeightC ), where ( x, y ) are the picture coordinates of the specified luma samples. NOTE 2 – The conformance cropping window offset parameters are only applied at the output. All internal decoding processes are applied to the uncropped picture size. Let ppsA and ppsB be any two PPSs referring to the same SPS. It is a requirement of bitstream conformance that, when ppsA and ppsB have the same the values of pic_width_in_luma_samples and pic_height_in_luma_samples, respectively, ppsA and ppsB shall have the same values of pps_conf_win_left_offset, pps_conf_win_right_offset, pps_conf_win_top_offset, and pps_conf_win_bottom_offset, respectively. When pic_width_in_luma_samples is equal to pic_width_max_in_luma_samples and pic_height_in_luma_samples is equal to pic_height_max_in_luma_samples, it is a requirement of bitstream conformance that pps_conf_win_left_offset, pps_conf_win_right_offset, pps_conf_win_top_offset, and pps_conf_win_bottom_offset, are equal to sps_conf_win_left_offset, sps_conf_win_right_offset, sps_conf_win_top_offset, and sps_conf_win_bottom_offset, respectively. scaling_window_explicit_signalling_flag equal to 1 specifies that the scaling window offset parameters are present in the PPS. scaling_window_explicit_signalling_flag equal to 0 specifies that the scaling window offset parameters are not present in the PPS. When res_change_in_clvs_allowed_flag is equal to 0, the value of scaling window explicit signalling flag shall be equal to 0. scaling_win_left_offset, scaling_win_right_offset, scaling_win_top_offset, and scaling_win_bottom_offset specify the offsets that are applied to the picture size for scaling ratio calculation. When not present, the values of scaling_win_left_offset, scaling_win_right_offset, scaling_win_top_offset, and scaling_win_bottom_offset are inferred to be equal to pps_conf_win_left_offset, pps_conf_win_right_offset, pps_conf_win_top_offset, and pps_conf_win_bottom_offset, respectively. The value of SubWidthC * ( scaling_win_left_offset + scaling_win_right_offset ) shall be less than pic_width_in_luma_samples, and the value of SubHeightC * ( scaling_win_top_offset + scaling_win_bottom_offset ) shall be less than pic_height_in_luma_samples. The variables PicOutputWidthL and PicOutputHeightL are derived as follows: PicOutputWidthL = pic_width_in_luma_samples − (78) SubWidthC * ( scaling_win_right_offset + scaling_win_left_offset ) PicOutputHeightL = pic_height_in_luma_samples − (79) SubWidthC * ( scaling_win_bottom_offset + scaling_win_top_offset ) Let refPicOutputWidthL and refPicOutputHeightL be the PicOutputWidthL and PicOutputHeightL, respectively, of a reference picture of a current picture referring to this PPS. Is a requirement of bitstream conformance that all of the following conditions are satisfied: – PicOutputWidthL * 2 shall be greater than or equal to refPicWidthInLumaSamples. – PicOutputHeightL * 2 shall be greater than or equal to refPicHeightInLumaSamples. – PicOutputWidthL shall be less than or equal to refPicWidthInLumaSamples * 8. – PicOutputHeightL shall be less than or equal to refPicHeightInLumaSamples * 8. – PicOutputWidthL * pic_width_max_in_luma_samples shall be greater than or equal to refPicOutputWidthL * (pic_width_in_luma_samples − Max( 8, MinCbSizeY )). – PicOutputHeightL * pic_height_max_in_luma_samples shall be greater than or equal to refPicOutputHeightL * (pic_height_in_luma_samples − Max( 8, MinCbSizeY )). output_flag_present_flag equal to 1 indicates that the pic_output_flag syntax element is present in PHs referring to the PPS. output_flag_present_flag equal to 0 indicates that the pic_output_flag syntax element is not present in PHs referring to the PPS. subpic_id_mapping_in_pps_flag equal to 1 specifies that the subpicture ID mapping is signalled in the PPS. subpic_id_mapping_in_pps_flag equal to 0 specifies that the subpicture ID mapping is not signalled in the PPS. If subpic_id_mapping_explicitly_signalled_flag is 0 or subpic_id_mapping_in_sps_flag is equal to 1, the value of subpic_id_mapping_in_pps_flag shall be equal to 0. Otherwise (subpic_id_mapping_explicitly_signalled_flag is equal to 1 and subpic_id_mapping_in_sps_flag is equal to 0), the value of subpic_id_mapping_in_pps_flag shall be equal to 1. pps_num_subpics_minus1 shall be equal to sps_num_subpics_minus1. pps_subpic_id_len_minus1 shall be equal to sps_subpic_id_len_minus1. pps subpic id[ i ] specifies the subpicture ID of the i-th subpicture. The length of the pps_subpic_id[ i ] syntax element is pps_subpic_id_len_minus1 + 1 bits. The variable SubpicIdVal[ i ], for each value of i in the range of 0 to sps_num_subpics_minus1, inclusive, is derived as follows: for( i = 0; i <= sps_num_subpics_minus1; i++ ) if( subpic_id_mapping_explicitly_signalled_flag ) SubpicIdVal[ i ] = subpic_id_mapping_in_pps_flag ? pps_subpic_id[ i ] : sps_subpic_id[ i ] (80) else SubpicIdVal[ i ] = i It is a requirement of bitstream conformance that both of the following constraints apply: – For any two differenty values of i and j in the range of 0 to sps_num_subpics_minus1, inclusive, SubpicIdVal[ i ] shall not be equal to SubpicIdVal[ j ]. – When the current picture is not the first picture of the CLVS, for each value of i in the range of 0 to sps_num_subpics_minus1, inclusive, if the value of SubpicIdVal[ i ] is not equal to the value of SubpicIdVal[ i ] of the previous picture in decoding order in the same layer, the nal_unit_type for all coded slice NAL units of the subpicture in the current picture with subpicture index i shall be equal to a particular value in the range of IDR_W_RADL to CRA_NUT, inclusive. no_pic_partition_flag equal to 1 specifies that no picture partitioning is applied to each picture referring to the PPS. no_pic_partition_flag equal to 0 specifies each picture referring to the PPS may be partitioned into more than one tile or slice. It is a requirement of bitstream conformance that the value of no_pic_partition_flag shall be the same for all PPSs that are referred to by coded pictures within a CLVS. It is a requirement of bitstream conformance that the value of no_pic_partition_flag shall not be equal to 1 when the value of sps_num_subpics_minus1 + 1 is greater than 1. pps_log2_ctu_size_minus5 plus 5 specifies the luma coding tree block size of each CTU. pps_log2_ctu_size_minus5 shall be equal to sps_log2_ctu_size_minus5. num_exp_tile_columns_minus1 plus 1 specifies the number of explicitly provided tile column widths. The value of num_exp_tile_columns_minus1 shall be in the range of 0 to PicWidthInCtbsY − 1, inclusive. When no_pic_partition_flag is equal to 1, the value of num_exp_tile_columns_minus1 is inferred to be equal to 0. num_exp_tile_rows_minus1 plus 1 specifies the number of explicitly provided tile row heights. The value of num_exp_tile_rows_minus1 shall be in the range of 0 to PicHeightInCtbsY − 1, inclusive. When no_pic_partition_flag is equal to 1, the value of num_tile_rows_minus1 is inferred to be equal to 0. tile_column_width_minus1[ i ] plus 1 specifies the width of the i-th tile column in units of CTBs for i in the range of 0 to num_exp_tile_columns_minus1 − 1, inclusive. tile_column_width_minus1[ num_exp_tile_columns_minus1 ] is used to derive the width of the tile columns with index greater than or equal to num_exp_tile_columns_minus1 as specified in clause 6.5.1. The value of tile_column_width_minus1[ i ] shall be in the range of 0 to PicWidthInCtbsY − 1, inclusive. When not present, the value of tile_column_width_minus1[ 0 ] is inferred to be equal to PicWidthInCtbsY − 1. tile_row_height_minus1[ i ] plus 1 specifies the height of the i-th tile row in units of CTBs for i in the range of 0 to num_exp_tile_rows_minus1 − 1, inclusive. tile_row_height_minus1[ num_exp_tile_rows_minus1 ] is used to derive the height of the tile rows with index greater than or equal to num_exp_tile_rows_minus1 as specified in clause 6.5.1. The value of tile_row_height_minus1[ i ] shall be in the range of 0 to PicHeightInCtbsY − 1, inclusive. When not present, the value of tile_row_height_minus1[ 0 ] is inferred to be equal to PicHeightInCtbsY − 1. rect_slice_flag equal to 0 specifies that tiles within each slice are in raster scan order and the slice information is not signalled in PPS. rect_slice_flag equal to 1 specifies that tiles within each slice cover a rectangular region of the picture and the slice information is signalled in the PPS. When not present, rect_slice_flag is inferred to be equal to 1. When subpic_info_present_flag is equal to 1, the value of rect_slice_flag shall be equal to 1. single_slice_per_subpic_flag equal to 1 specifies that each subpicture consists of one and only one rectangular slice. single_slice_per_subpic_flag equal to 0 specifies that each subpicture may consist of one or more rectangular slices. When not present, the value of single_slice_per_subpic_flag is inferred to be equal to 0. lices in each picture referring to the PPS. The value of num_slices_in_pic_minus1 shall be in the range of 0 to MaxSlicesPerPicture − 1, inclusive, where MaxSlicesPerPicture is specified in Annex A. When no_pic_partition_flag is equal to 1, the value of num_slices_in_pic_minus1 is inferred to be equal to 0. When single_slice_per_subpic_flag is equal to 1, the value of num_slices_in_pic_minus1 is inferred to be equal to sps_num_subpics_minus1. tile_idx_delta_present_flag equal to 0 specifies that tile_idx_delta[ i ] syntax elements are not present in the PPS and all pictures referring to the PPS are partitioned into rectangular slice rows and rectangular slice columns in slice raster order. tile_idx_delta_present_flag equal to 1 specifies that tile_idx_delta[ i ] syntax elements may be present in the PPS and all rectangular slices in pictures referring to the PPS are specified in the order indicated by the values of the tile_idx_delta[ i ] in increasing values of i. When not present, the value of tile_idx_delta_present_flag is inferred to be equal to 0. slice_width_in_tiles_minus1[ i ] plus 1 specifies the width of the i-th rectangular slice in units of tile columns. The value of slice_width_in_tiles_minus1[ i ] shall be in the range of 0 to NumTileColumns − 1, inclusive. When i is less than num_slices_in_pic_minus1 and NumTileColumns is equal to 1, the value of slice_width_in_tiles_minus1[ i ] is inferred to be equal to 0. slice_height_in_tiles_minus1[ i ] plus 1 specifies the height of the i-th rectangular slice in units of tile rows when num_exp_slices_in_tile[ i ] is equal to 0. The value of slice_height_in_tiles_minus1[ i ] shall be in the range of 0 to NumTileRows − 1, inclusive. When i is less than num_slices_in_pic_minus1 and slice_height_in_tiles_minus1[ i ] is not present, it is inferred to be equal to NumTileRows = = 1 ? 0 : slice_height_in_tiles_minus1[ i − 1 ]. num_exp_slices_in_tile[ i ] specifies the number of explicitly provided slice heights for the slices in the tile containing the i-th slice (i.e., the tile with tile index equal to SliceTopLeftTileIdx[ i ]). The value of num_exp_slices_in_tile[ i ] shall be in the range of 0 to RowHeight[ SliceTopLeftTileIdx[ i ] / NumTileColumns ] − 1, inclusive. When not present, the value of num_exp_slices_in_tile[ i ] is inferred to be equal to 0. NOTE 3 – If num_exp_slices_in_tile[ i ] is equal to 0, the tile containing the i-th slice is not split into multiple slices. Otherwise (num_exp_slices_in_tile[ i ] is greater than 0), the tile containing the i-th slice may or may not be split into multiple slices. exp_slice_height_in_ctus_minus1[ i ][ j ] plus 1 specifies the height of the j-th rectangular slice in the tile containing the i-th slice in units of CTU rows. The value of exp_slice_height_in_ctus_minus1[ i ][ j ] shall be in the range of 0 to RowHeight[ SliceTopLeftTileIdx[ i ] / NumTileColumns ] − 1, inclusive. tile_idx_delta[ i ] specifies the difference between the tile index of the tile containing the first CTU in the ( i + 1 )-th rectangular slice and the tile index of the tile containing the first CTU in the i-th rectangular slice. The value of tile_idx_delta[ i ] shall be in the range of −NumTilesInPic + 1 to NumTilesInPic − 1, inclusive. When not present, the value of tile_idx_delta[ i ] is inferred to be equal to 0. When present, the value of tile_idx_delta[ i ] shall not be equal to 0. loop_filter_across_tiles_enabled_flag equal to 1 specifies that in-loop filtering operations may be performed across tile boundaries in pictures referring to the PPS. loop_filter_across_tiles_enabled_flag equal to 0 specifies that in-loop filtering operations are not performed across tile boundaries in pictures referring to the PPS. The in-loop filtering operations include the deblocking filter, sample adaptive offset filter, and adaptive loop filter operations. When not present, the value of loop_filter_across_tiles_enabled_flag is inferred to be equal to 1. loop_filter_across_slices_enabled_flag equal to 1 specifies that in-loop filtering operations may be performed across slice boundaries in pictures referring to the PPS. loop_filter_across_slice_enabled_flag equal to 0 specifies that in-loop filtering operations are not performed across slice boundaries in pictures referring to the PPS. The in-loop filtering operations include the deblocking filter, sample adaptive offset filter, and adaptive loop filter operations. When not present, the value of loop_filter_across_slices_enabled_flag is inferred to be equal to 0. cabac_init_present_flag equal to 1 specifies that cabac_init_flag is present in slice headers referring to the PPS. cabac_init_present_flag equal to 0 specifies that cabac_init_flag is not present in slice headers referring to the PPS. num_ref_idx_default_active_minus1[ i ] plus 1, when i is equal to 0, specifies the inferred value of the variable NumRefIdxActive[ 0 ] for P or B slices with num_ref_idx_active_override_flag equal to 0, and, when i is equal to 1, specifies the inferred value of NumRefIdxActive[ 1 ] for B slices with num_ref_idx_active_override_flag equal to 0. The value of num_ref_idx_default_active_minus1[ i ] shall be in the range of 0 to 14, inclusive. rpl1_idx_present_flag equal to 0 specifies that rpl_sps_flag[ 1 ] and rpl_idx[ 1 ] are not present in the PH syntax structures or the slice headers for pictures referring to the PPS. rpl1_idx_present_flag equal to 1 specifies that rpl_sps_flag[ 1 ] and rpl_idx[ 1 ] may be present in the PH syntax structures or the slice headers for pictures referring to the PPS. init_qp_minus26 plus 26 specifies the initial value of SliceQpY for each slice referring to the PPS. The initial value of SliceQpY is modified at the picture level when a non-zero value of ph_qp_delta is decoded or at the slice level when a non-zero value of slice_qp_delta is decoded. The value of init_qp_minus26 shall be in the range of −( 26 + QpBdOffset ) to +37, inclusive. cu_qp_delta_enabled_flag equal to 1 specifies that the ph_cu_qp_delta_subdiv_intra_slice and ph_cu_qp_delta_subdiv_inter_slice syntax elements are present in PHs referring to the PPS and cu_qp_delta_abs may be present in the transform unit syntax. cu_qp_delta_enabled_flag equal to 0 specifies that the ph_cu_qp_delta_subdiv_intra_slice and ph_cu_qp_delta_subdiv_inter_slice syntax elements are not present in PHs referring to the PPS and cu_qp_delta_abs is not present in the transform unit syntax. pps_chroma_tool_offsets_present_flag equal to 1 specifies that chroma tool offsets related syntax elements are present in the PPS RBSP syntax structure. pps_chroma_tool_offsets_present_flag equal to 0 specifies that chroma tool offsets related syntax elements are not present in in the PPS RBSP syntax structure. When ChromaArrayType is equal to 0, the value of pps_chroma_tool_offsets_present_flag shall be equal to 0. pps_cb_qp_offset and pps_cr_qp_offset specify the offsets to the luma quantization parameter Qp′Y used for deriving Qp′Cb and Qp′Cr, respectively. The values of pps_cb_qp_offset and pps_cr_qp_offset shall be in the range of −12 to +12, inclusive. When ChromaArrayType is equal to 0, pps_cb_qp_offset and pps_cr_qp_offset are not used in the decoding process and decoders shall ignore their value. When not present, the values of pps_cb_qp_offset and pps_cr_qp_offset are inferred to be equalt to 0. pps_joint_cbcr_qp_offset_present_flag equal to 1 specifies that pps_joint_cbcr_qp_offset_value and joint_cbcr_qp_offset_list[ i ] are present in the PPS RBSP syntax structure. pps_joint_cbcr_qp_offset_present_flag equal to 0 specifies that pps_joint_cbcr_qp_offset_value and joint_cbcr_qp_offset_list[ i ] are not present in the PPS RBSP syntax structure. When ChromaArrayType is equal to 0 or sps_joint_cbcr_enabled_flag is equal to 0, the value of pps_joint_cbcr_qp_offset_present_flag shall be equal to 0. When not present, the value of pps_joint_cbcr_qp_offset_present_flag is inferred to be equal to 0. pps_joint_cbcr_qp_offset_value specifies the offset to the luma quantization parameter Qp′Y used for deriving Qp′CbCr. The value of pps_joint_cbcr_qp_offset_value shall be in the range of −12 to +12, inclusive. When ChromaArrayType is equal to 0 or sps_joint_cbcr_enabled_flag is equal to 0, pps_joint_cbcr_qp_offset_value is not used in the decoding process and decoders shall ignore its value. When pps_joint_cbcr_qp_offset_present_flag is equal to 0, pps_joint_cbcr_qp_offset_value is not present and is inferred to be equal to 0. pps_slice_chroma_qp_offsets_present_flag equal to 1 specifies that the slice_cb_qp_offset and slice_cr_qp_offset syntax elements are present in the associated slice headers. pps_slice_chroma_qp_offsets_present_flag equal to 0 specifies that the slice_cb_qp_offset and slice_cr_qp_offset syntax elements are not present in the associated slice headers. When not present, the value of pps_slice_chroma_qp_offsets_present_flag is inferred to be equal to 0. pps_cu_chroma_qp_offset_list_enabled_flag equal to 1 specifies that the ph_cu_chroma_qp_offset_subdiv_intra_slice and ph_cu_chroma_qp_offset_subdiv_inter_slice syntax elements are present in PHs referring to the PPS and cu chroma qp offset flag may be present in the transform unit syntax and the palette coding syntax. pps_cu_chroma_qp_offset_list_enabled_flag equal to 0 specifies that the ph_cu_chroma_qp_offset_subdiv_intra_slice and ph_cu_chroma_qp_offset_subdiv_inter_slice syntax elements are not present in PHs referring to the PPS and the cu_chroma_qp_offset_flag is not present in the transform unit syntax and the palette coding syntax. When not present, the value of pps_cu_chroma_qp_offset_list_enabled_flag is inferred to be equal to 0. chroma_qp_offset_list_len_minus1 plus 1 specifies the number of cb_qp_offset_list[ i ], cr_qp_offset_list[ i ], and joint_cbcr_qp_offset_list[ i ], syntax elements that are present in the PPS RBSP syntax structure. The value of chroma_qp_offset_list_len_minus1 shall be in the range of 0 to 5, inclusive. cb_qp_offset_list[ i ], cr_qp_offset_list[ i ], and joint_cbcr_qp_offset_list[ i ], specify offsets used in the derivation of Qp′Cb, Qp′Cr, and Qp′CbCr, respectively. The values of cb_qp_offset_list[ i ], cr_qp_offset_list[ i ], and joint_cbcr_qp_offset_list[ i ] shall be in the range of −12 to +12, inclusive. When pps_joint_cbcr_qp_offset_present_flag is equal to 0, joint_cbcr_qp_offset_list[ i ] is not present and it is inferred to be equal to 0. pps_weighted_pred_flag equal to 0 specifies that weighted prediction is not applied to P slices referring to the PPS. pps_weighted_pred_flag equal to 1 specifies that weighted prediction is applied to P slices referring to the PPS. When sps_weighted_pred_flag is equal to 0, the value of pps_weighted_pred_flag shall be equal to 0. pps_weighted_bipred_flag equal to 0 specifies that explicit weighted prediction is not applied to B slices referring to the PPS. pps_weighted_bipred_flag equal to 1 specifies that explicit weighted prediction is applied to B slices referring to the PPS. When sps_weighted_bipred_flag is equal to 0, the value of pps_weighted_bipred_flag shall be equal to 0. deblocking_filter_control_present_flag equal to 1 specifies the presence of deblocking filter control syntax elements in the PPS. deblocking_filter_control_present_flag equal to 0 specifies the absence of deblocking filter control syntax elements in the PPS. deblocking_filter_override_enabled_flag equal to 1 specifies the presence of ph_deblocking_filter_override_flag in the PHs referring to the PPS or slice_deblocking_filter_override_flag in the slice headers referring to the PPS. deblocking_filter_override_enabled_flag equal to 0 specifies the absence of ph_deblocking_filter_override_flag in PHs referring to the PPS or slice_deblocking_filter_override_flag in slice headers referring to the PPS. When not present, the value of deblocking_filter_override_enabled_flag is inferred to be equal to 0. pps_deblocking_filter_disabled_flag equal to 1 specifies that the operation of deblocking filter is not applied for slices referring to the PPS in which slice_deblocking_filter_disabled_flag is not present. pps_deblocking_filter_disabled_flag equal to 0 specifies that the operation of the deblocking filter is applied for slices referring to the PPS in which slice_deblocking_filter_disabled_flag is not present. When not present, the value of pps_deblocking_filter_disabled_flag is inferred to be equal to 0. pps_beta_offset_div2 and pps_tc_offset_div2 specify the default deblocking parameter offsets for β and tC (divided by 2) that are applied to the luma component for slices referring to the PPS, unless the default deblocking parameter offsets are overridden by the deblocking parameter offsets present in the picture headers or the slice headers of the slices referring to the PPS. The values of pps beta offset div2 and pps tc offset div2 shall both be in the range of −12 to 12, inclusive. When not present, the values of pps_beta_offset_div2 and pps_tc_offset_div2 are both inferred to be equal to 0. pps_cb_beta_offset_div2 and pps_cb_tc_offset_div2 specify the default deblocking parameter offsets for β and tC (divided by 2) that are applied to the Cb component for slices referring to the PPS, unless the default deblocking parameter offsets are overridden by the deblocking parameter offsets present in the picture headers or the slice headers of the slices referring to the PPS. The values of pps_cb_beta_offset_div2 and pps_cb_tc_offset_div2 shall both be in the range of −12 to 12, inclusive. When not present, the values of pps_cb_beta_offset_div2 and pps_cb_tc_offset_div2 are both inferred to be equal to 0. pps_cr_beta_offset_div2 and pps_cr_tc_offset_div2 specify the default deblocking parameter offsets for β and tC (divided by 2) that are applied to the Cr component for slices referring to the PPS, unless the default deblocking parameter offsets are overridden by the deblocking parameter offsets present in the picture headers or the slice headers of the slices referring to the PPS. The values of pps_cr_beta_offset_div2 and pps_cr_tc_offset_div2 shall both be in the range of −12 to 12, inclusive. When not present, the values of pps_cr_beta_offset_div2 and pps_cr_tc_offset_div2 are both inferred to be equal to 0. rpl_info_in_ph_flag equal to 1 specifies that reference picture list information is present in the PH syntax structure and not present in slice headers referring to the PPS that do not contain a PH syntax structure. rpl_info_in_ph_flag equal to 0 specifies that reference picture list information is not present in the PH syntax structure and may be present in slice headers referring to the PPS that do not contain a PH syntax structure. dbf_info_in_ph_flag equal to 1 specifies that deblocking filter information is present in the PH syntax structure and not present in slice headers referring to the PPS that do not contain a PH syntax structure. dbf_info_in_ph_flag equal to 0 specifies that deblocking filter information is not present in the PH syntax structure and may be present in slice headers referring to the PPS that do not contain a PH syntax structure. When not present, the value of dbf_info_in_ph_flag is inferred to be equal to 0. sao_info_in_ph_flag equal to 1 specifies that SAO filter information is present in the PH syntax structure and not present in slice headers referring to the PPS that do not contain a PH syntax structure. sao_info_in_ph_flag equal to 0 specifies that SAO filter information is not present in the PH syntax structure and may be present in slice headers referring to the PPS that do not contain a PH syntax structure. alf_info_in_ph_flag equal to 1 specifies that ALF information is present in the PH syntax structure and not present in slice headers referring to the PPS that do not contain a PH syntax structure. alf_info_in_ph_flag equal to 0 specifies that ALF information is not present in the PH syntax structure and may be present in slice headers referring to the PPS that do not contain a PH syntax structure. wp_info_in_ph_flag equal to 1 specifies that weighted prediction information may be present in the PH syntax structure and not present in slice headers referring to the PPS that do not contain a PH syntax structure. wp_info_in_ph_flag equal to 0 specifies that weighted prediction information is not present in the PH syntax structure and may be present in slice headers referring to the PPS that do not contain a PH syntax structure. When not present, the value of wp_info_in_ph_flag is inferred to be equal to 0. qp_delta_info_in_ph_flag equal to 1 specifies that QP delta information is present in the PH syntax structure and not present in slice headers referring to the PPS that do not contain a PH syntax structure. qp_delta_info_in_ph_flag equal to 0 specifies that QP delta information is not present in the PH syntax structure and may be present in slice headers referring to the PPS that do not contain a PH syntax structure. pps_ref_wraparound_enabled_flag equal to 1 specifies that horizontal wrap-around motion compensation is applied in inter prediction. pps_ref_wraparound_enabled_flag equal to 0 specifies that horizontal wrap-around motion compensation is not applied. When the value of CtbSizeY / MinCbSizeY + 1 is greater than pic_width_in_luma_samples / MinCbSizeY − 1, the value of pps_ref_wraparound_enabled_flag shall be equal to 0. When sps_ref_wraparound_enabled_flag is equal to 0, the value of pps_ref_wraparound_enabled_flag shall be equal to 0. pps_ref_wraparound_offset plus ( CtbSizeY / MinCbSizeY ) + 2 specifies the offset used for computing the horizontal wrap-around position in units of MinCbSizeY luma samples. The value of pps_ref_wraparound_offset shall be in the range of 0 to ( pic_width_in_luma_samples / MinCbSizeY ) − ( CtbSizeY / MinCbSizeY ) − 2, inclusive. The variable PpsRefWraparoundOffset is set equal to pps_ref_wraparound_offset+( CtbSizeY / MinCbSizeY ) + 2. picture_header_extension_present_flag equal to 0 specifies that no PH extension syntax elements are present in PHs referring to the PPS. picture_header_extension_present_flag equal to 1 specifies that PH extension syntax elements are present in PHs referring to the PPS. picture_header_extension_present_flag shall be equal to 0 in bitstreams conforming to this version of this Specification. slice_header_extension_present_flag equal to 0 specifies that no slice header extension syntax elements are present in the slice headers for coded pictures referring to the PPS. slice_header_extension_present_flag equal to 1 specifies that slice header extension syntax elements are present in the slice headers for coded pictures referring to the PPS. slice_header_extension_present_flag shall be equal to 0 in bitstreams conforming to this version of this Specification. pps_extension_flag equal to 0 specifies that no pps_extension_data_flag syntax elements are present in the PPS RBSP syntax structure. pps_extension_flag equal to 1 specifies that there are pps_extension_data_flag syntax elements present in the PPS RBSP syntax structure. pps_extension_data_flag may have any value. Its presence and value do not affect decoder conformance to profiles specified in this version of this Specification. Decoders conforming to this version of this Specification shall ignore all pps_extension_data_flag syntax elements. [0053] The slice header contains information that can change from slice to slice, as well as such picture-related information that is relatively small or relevant only for a certain slice or picture types. The size of the slice header may be noticeably bigger than the PPS, particular when there are tile or wavefront entry point offsets in the slice header and RPS, prediction weights, or reference picture list modifications are explicitly signaled. The syntax and the associated semantic of the sequence parameter set in the current VVC draft specification are illustrated in Table 10 and Table 11, respectively. How to read Table 10 is illustrated in the appendix section of this invention, which could also be found in the VVC specification. Table 10. Picture header structure syntax
Figure imgf000065_0001
Figure imgf000066_0001
Figure imgf000067_0001
Figure imgf000068_0001
Figure imgf000069_0001
Table 11. Picture header structure semantics The PH syntax structure contains information that is common for all slices of the coded picture associated with the PH syntax structure. gdr_or_irap_pic_flag equal to 1 specifies that the current picture is a GDR or IRAP picture. gdr_or_irap_pic_flag equal to 0 specifies that the current picture may or may not be a GDR or IRAP picture. gdr_pic_flag equal to 1 specifies the picture associated with the PH is a GDR picture. gdr_pic_flag equal to 0 specifies that the picture associated with the PH is not a GDR picture. When not present, the value of gdr_pic_flag is inferred to be equal to 0. When gdr_enabled_flag is equal to 0, the value of gdr_pic_flag shall be equal to 0. ph_inter_slice_allowed_flag equal to 0 specifies that all coded slices of the picture have slice_type equal to 2. ph_inter_slice_allowed_flag equal to 1 specifies that there may or may not be one or more coded slices in the picture that have slice_type equal to 0 or 1. ph_intra_slice_allowed_flag equal to 0 specifies that all coded slices of the picture have slice_type equal to 0 or 1. ph_intra_slice_allowed_flag equal to 1 specifies that there may or may not be one or more coded slices in the picture that have slice_type equal to 2.When not present, the value of ph_intra_slice_allowed_flag is inferred to be equal to 1. NOTE 1 – For bitstreams that are suppposed to work subpicure based bitstream merging without the need of changing PH NAL units, the encoder is expected to set the values of both ph_inter_slice_allowed_flag and ph_intra_slice_allowed_flag equal to 1. non_reference_picture_flag equal to 1 specifies the picture associated with the PH is never used as a reference picture. non_reference_picture_flag equal to 0 specifies the picture associated with the PH may or may not be used as a reference picture. ph_pic_parameter_set_id specifies the value of pps_pic_parameter_set_id for the PPS in use. The value of ph_pic_parameter_set_id shall be in the range of 0 to 63, inclusive. It is a requirement of bitstream conformance that the value of TemporalId of the PH shall be greater than or equal to the value of TemporalId of the PPS that has pps_pic_parameter_set_id equal to ph_pic_parameter_set_id. ph_pic_order_cnt_lsb specifies the picture order count modulo MaxPicOrderCntLsb for the current picture. The length of the ph_pic_order_cnt_lsb syntax element is log2_max_pic_order_cnt_lsb_minus4 + 4 bits. The value of the ph_pic_order_cnt_lsb shall be in the range of 0 to MaxPicOrderCntLsb − 1, inclusive. no_output_of_prior_pics_flag affects the output of previously-decoded pictures in the DPB after the decoding of a CLVSS picture that is not the first picture in the bitstream as specified in Annex C. recovery_poc_cnt specifies the recovery point of decoded pictures in output order. If the current picture is a GDR picture that is associated with the PH, and there is a picture picA that follows the current GDR picture in decoding order in the CLVS that has PicOrderCntVal equal to the PicOrderCntVal of the current GDR picture plus the value of recovery_poc_cnt, the picture picA is referred to as the recovery point picture. Otherwise, the first picture in output order that has PicOrderCntVal greater than the PicOrderCntVal of the current picture plus the value of recovery_poc_cnt is referred to as the recovery point picture. The recovery point picture shall not precede the current GDR picture in decoding order. The value of recovery_poc_cnt shall be in the range of 0 to MaxPicOrderCntLsb − 1, inclusive. When the current picture is a GDR picture, the variable RpPicOrderCntVal is derived as follows: RpPicOrderCntVal = PicOrderCntVal + recovery_poc_cnt (81) NOTE 2 – When gdr_enabled_flag is equal to 1 and PicOrderCntVal of the current picture is greater than or equal to RpPicOrderCntVal of the associated GDR picture, the current and subsequent decoded pictures in output order are exact match to the corresponding pictures produced by starting the decoding process from the previous IRAP picture, when present, preceding the associated GDR picture in decoding order. ph_extra_bit[ i ] may be equal to 1 or 0. Decoders conforming to this version of this Specification shall ignore the value of ph_extra_bit[ i ]. Its value does not affect decoder conformance to profiles specified in this version of specification. ph_poc_msb_present_flag equal to 1 specifies that the syntax element poc_msb_val is present in the PH. ph_poc_msb_present_flag equal to 0 specifies that the syntax element poc_msb_val is not present in the PH. When vps_independent_layer_flag[ GeneralLayerIdx[ nuh_layer_id ] ] is equal to 0 and there is a picture in the current AU in a reference layer of the current layer, the value of ph_poc_msb_present_flag shall be equal to 0. poc_msb_val specifies the POC MSB value of the current picture. The length of the syntax element poc_msb_val is poc_msb_len_minus1 + 1 bits. ph_alf_enabled_flag equal to 1 specifies that adaptive loop filter is enabled for all slices associated with the PH and may be applied to Y, Cb, or Cr colour component in the slices. ph_alf_enabled_flag equal to 0 specifies that adaptive loop filter may be disabled for one, or more, or all slices associated with the PH. When not present, ph_alf_enabled_flag is inferred to be equal to 0. ph num alf aps ids luma specifies the number of ALF APSs that the slices associated with the PH refers to. ph_alf_aps_id_luma[ i ] specifies the adaptation_parameter_set_id of the i-th ALF APS that the luma component of the slices associated with the PH refers to. The value of alf_luma_filter_signal_flag of the APS NAL unit having aps_params_type equal to ALF_APS and adaptation_parameter_set_id equal to ph_alf_aps_id_luma[ i ] shall be equal to 1. The TemporalId of the APS NAL unit having aps_params_type equal to ALF_APS and adaptation_parameter_set_id equal to ph_alf_aps_id_luma[ i ] shall be less than or equal to the TemporalId of the picture associated with the PH. ph_alf_chroma_idc equal to 0 specifies that the adaptive loop filter is not applied to Cb and Cr colour components. ph_alf_chroma_idc equal to 1 indicates that the adaptive loop filter is applied to the Cb colour component. ph_alf_chroma_idc equal to 2 indicates that the adaptive loop filter is applied to the Cr colour component. ph_alf_chroma_idc equal to 3 indicates that the adaptive loop filter is applied to Cb and Cr colour components. When ph_alf_chroma_idc is not present, it is inferred to be equal to 0. ph_alf_aps_id_chroma specifies the adaptation_parameter_set_id of the ALF APS that the chroma component of the slices associated with the PH refers to. The value of alf_chroma_filter_signal_flag of the APS NAL unit having aps_params_type equal to ALF_APS and adaptation_parameter_set_id equal to ph_alf_aps_id_chroma shall be equal to 1. The TemporalId of the APS NAL unit having aps_params_type equal to ALF_APS and adaptation_parameter_set_id equal to ph_alf_aps_id_chroma shall be less than or equal to the TemporalId of the picture associated with the PH. ph_cc_alf_cb_enabled_flag equal to 1 specifies that cross-component filter for Cb colour component is enabled for all slices associated with the PH and may be applied to Cb colour component in the slices. ph_cc_alf_cb_enabled_flag equal to 0 specifies that cross- component filter for Cb colour component may be disabled for one, or more, or all slices associated with the PH. When not present, ph_cc_alf_cb_enabled_flag is inferred to be equal to 0. ph_cc_alf_cb_aps_id specifies the adaptation_parameter_set_id of the ALF APS that the Cb colour component of the slices associated with the PH refers to. The value of alf_cc_cb_filter_signal_flag of the APS NAL unit having aps_params_type equal to ALF_APS and adaptation_parameter_set_id equal to ph_cc_alf_cb_aps_id shall be equal to 1. The TemporalId of the APS NAL unit having aps_params_type equal to ALF_APS and adaptation_parameter_set_id equal to ph_cc_alf_cb_aps_id shall be less than or equal to the TemporalId of the picture associated with the PH. ph_cc_alf_cr_enabled_flag equal to 1 specifies that cross-compoent filter for Cr colour component is enabled for all slices associated with the PH and may be applied to Cr colour component in the slices. ph_cc_alf_cr_enabled_flag equal to 0 specifies that cross- component filter for Cr colour component may be disabled for one, or more, or all slices associated with the PH. When not present, ph_cc_alf_cr_enabled_flag is inferred to be equal to 0. ph_cc_alf_cr_aps_id specifies the adaptation_parameter_set_id of the ALF APS that the Cr colour component of the slices associated with the PH refers to. The value of alf_cc_cr_filter_signal_flag of the APS NAL unit having aps_params_type equal to ALF_APS and adaptation_parameter_set_id equal to ph_cc_alf_cr_aps_id shall be equal to 1. The TemporalId of the APS NAL unit having aps_params_type equal to ALF_APS and adaptation_parameter_set_id equal to ph_cc_alf_cr_aps_id shall be less than or equal to the TemporalId of the picture associated with the PH. ph_lmcs_enabled_flag equal to 1 specifies that luma mapping with chroma scaling is enabled for all slices associated with the PH. ph_lmcs_enabled_flag equal to 0 specifies that luma mapping with chroma scaling may be disabled for one, or more, or all slices associated with the PH. When not present, the value of ph_lmcs_enabled_flag is inferred to be equal to 0. ph_lmcs_aps_id specifies the adaptation_parameter_set_id of the LMCS APS that the slices associated with the PH refers to. The TemporalId of the APS NAL unit having aps_params_type equal to LMCS_APS and adaptation_parameter_set_id equal to ph_lmcs_aps_id shall be less than or equal to the TemporalId of the picture associated with PH. ph_chroma_residual_scale_flag equal to 1 specifies that chroma residual scaling is enabled for the all slices associated with the PH. ph_chroma_residual_scale_flag equal to 0 specifies that chroma residual scaling may be disabled for one, or more, or all slices associated with the PH. When ph_chroma_residual_scale_flag is not present, it is inferred to be equal to 0. ph_scaling_list_present_flag equal to 1 specifies that the scaling list data used for the slices associated with the PH is derived based on the scaling list data contained in the referenced scaling list APS. ph_scaling_list_present_flag equal to 0 specifies that the scaling list data used for the slices associated with the PH is set to be equal to 16. When not present, the value of ph_scaling_list_present_flag is inferred to be equal to 0. ph_scaling_list_aps_id specifies the adaptation_parameter_set_id of the scaling list APS. The TemporalId of the APS NAL unit having aps_params_type equal to SCALING_APS and adaptation_parameter_set_id equal to ph_scaling_list_aps_id shall be less than or equal to the TemporalId of the picture associated with PH. ph_virtual_boundaries_present_flag equal to 1 specifies that information of virtual boundaries is signalled in the PH. ph_virtual_boundaries_present_flag equal to 0 specifies that information of virtual boundaries is not signalled in the PH. When there is one or more than one virtual boundaries signalled in the PH, the in-loop filtering operations are disabled across the virtual boundaries in the picture. The in-loop filtering operations include the deblocking filter, sample adaptive offset filter, and adaptive loop filter operations. When not present, the value of ph_virtual_boundaries_present_flag is inferred to be equal to 0. It is a requirement of bitstream conformance that, when subpic_info_present_flag is equal to 1, the value of ph_virtual_boundaries_present_flag shall be equal to 0. The variable VirtualBoundariesPresentFlag is derived as follows: VirtualBoundariesPresentFlag = 0 if( sps_virtual_boundaries_enabled_flag ) VirtualBoundariesPresentFlag = sps_virtual_boundaries_present_flag | | ph_virtual_boundaries_present_flag (82) ph_num_ver_virtual_boundaries specifies the number of ph_virtual_boundaries_pos_x[ i ] syntax elements that are present in the PH. When ph_num_ver_virtual_boundaries is not present, it is inferred to be equal to 0. The variable NumVerVirtualBoundaries is derived as follows: NumVerVirtualBoundaries = 0 if( sps_virtual_boundaries_enabled_flag ) NumVerVirtualBoundaries = sps_virtual_boundaries_present_flag ? sps_num_ver_virtual_boundaries : ph_num_ver_virtual_boundaries (83) ph_virtual_boundaries_pos_x[ i ] specifies the location of the i-th vertical virtual boundary in units of luma samples divided by 8. The value of ph_virtual_boundaries_pos_x[ i ] shall be in the range of 1 to Ceil( pic_width_in_luma_samples ÷ 8 ) − 1, inclusive. The list VirtualBoundariesPosX[ i ] for i ranging from 0 to NumVerVirtualBoundaries − 1, inclusive, in units of luma samples, specifying the locations of the vertical virtual boundaries, is derived as follows: for( i = 0; i < NumVerVirtualBoundaries; i++) VirtualBoundariesPosX[ i ] = ( sps_virtual_boundaries_present_flag ? sps_virtual_boundaries_pos_x[ i ] : ph_virtual_boundaries_pos_x[ i ] ) * 8 (84) The distance between any two vertical virtual boundaries shall be greater than or equal to CtbSizeY luma samples. ph_num_hor_virtual_boundaries specifies the number of ph_virtual_boundaries_pos_y[ i ] syntax elements that are present in the PH. When ph_num_hor_virtual_boundaries is not present, it is inferred to be equal to 0. The parameter NumHorVirtualBoundaries is derived as follows: NumHorVirtualBoundaries = 0 if( sps_virtual_boundaries_enabled_flag ) NumHorVirtualBoundaries = sps_virtual_boundaries_present_flag ? sps_num_hor_virtual_boundaries : ph_num_hor_virtual_boundaries (85) When sps_virtual_boundaries_enabled_flag is equal to 1 and ph_virtual_boundaries_present_flag is equal to 1, the sum of ph_num_ver_virtual_boundaries and ph_num_hor_virtual_boundaries shall be greater than 0. ph_virtual_boundaries_pos_y[ i ] specifies the location of the i-th horizontal virtual boundary in units of luma samples divided by 8. The value of ph_virtual_boundaries_pos_y[ i ] shall be in the range of 1 to Ceil( pic height in luma samples ÷ 8 ) − 1, inclusive. The list VirtualBoundariesPosY[ i ] for i ranging from 0 to NumHorVirtualBoundaries − 1, inclusive, in units of luma samples, specifying the locations of the horizontal virtual boundaries, is derived as follows: for( i = 0; i < NumHorVirtualBoundaries; i++) VirtualBoundariesPosY[ i ] = ( sps_virtual_boundaries_present_flag ? sps_virtual_boundaries_pos_y[ i ] : ph_virtual_boundaries_pos_y[ i ] ) * 8 (86) The distance between any two horizontal virtual boundaries shall be greater than or equal to CtbSizeY luma samples. pic_output_flag affects the decoded picture output and removal processes as specified in Annex C. When pic_output_flag is not present, it is inferred to be equal to 1. partition_constraints_override_flag equal to 1 specifies that partition constraint parameters are present in the PH. partition_constraints_override_flag equal to 0 specifies that partition constraint parameters are not present in the PH. When not present, the value of partition_constraints_override_flag is inferred to be equal to 0. ph_log2_diff_min_qt_min_cb_intra_slice_luma specifies the difference between the base 2 logarithm of the minimum size in luma samples of a luma leaf block resulting from quadtree splitting of a CTU and the base 2 logarithm of the minimum coding block size in luma samples for luma CUs in the slices with slice_type equal to 2 (I) associated with the PH. The value of ph_log2_diff_min_qt_min_cb_intra_slice_luma shall be in the range of 0 to CtbLog2SizeY − MinCbLog2SizeY, inclusive. When not present, the value of ph_log2_diff_min_qt_min_cb_luma is inferred to be equal to sps_log2_diff_min_qt_min_cb_intra_slice_luma. ph_max_mtt_hierarchy_depth_intra_slice_luma specifies the maximum hierarchy depth for coding units resulting from multi-type tree splitting of a quadtree leaf in slices with slice_type equal to 2 (I) associated with the PH. The value of ph_max_mtt_hierarchy_depth_intra_slice_luma shall be in the range of 0 to 2*( CtbLog2SizeY − MinCbLog2SizeY ), inclusive. When not present, the value of ph_max_mtt_hierarchy_depth_intra_slice_luma is inferred to be equal to sps_max_mtt_hierarchy_depth_intra_slice_luma. ph_log2_diff_max_bt_min_qt_intra_slice_luma specifies the difference between the base 2 logarithm of the maximum size (width or height) in luma samples of a luma coding block that can be split using a binary split and the minimum size (width or height) in luma samples of a luma leaf block resulting from quadtree splitting of a CTU in slices with slice_type equal to 2 (I) associated with the PH. The value of ph_log2_diff_max_bt_min_qt_intra_slice_luma shall be in the range of 0 to CtbLog2SizeY − MinQtLog2SizeIntraY, inclusive. When not present, the value of ph_log2_diff_max_bt_min_qt_intra_slice_luma is inferred to be equal to sps_log2_diff_max_bt_min_qt_intra_slice_luma. ph_log2_diff_max_tt_min_qt_intra_slice_luma specifies the difference between the base 2 logarithm of the maximum size (width or height) in luma samples of a luma coding block that can be split using a ternary split and the minimum size (width or height) in luma samples of a luma leaf block resulting from quadtree splitting of a CTU in slices with slice_type equal to 2 (I) associated with the PH. The value of ph_log2_diff_max_tt_min_qt_intra_slice_luma shall be in the range of 0 to CtbLog2SizeY − MinQtLog2SizeIntraY, inclusive. When not present, the value of ph_log2_diff_max_tt_min_qt_intra_slice_luma is inferred to be equal to sps_log2_diff_max_tt_min_qt_intra_slice_luma. ph_log2_diff_min_qt_min_cb_intra_slice_chroma specifies the difference between the base 2 logarithm of the minimum size in luma samples of a chroma leaf block resulting from quadtree splitting of a chroma CTU with treeType equal to DUAL_TREE_CHROMA and the base 2 logarithm of the minimum coding block size in luma samples for chroma CUs with treeType equal to DUAL_TREE_CHROMA in slices with slice_type equal to 2 (I) associated with the PH. The value of ph_log2_diff_min_qt_min_cb_intra_slice_chroma shall be in the range of 0 to CtbLog2SizeY − MinCbLog2SizeY, inclusive. When not present, the value of ph_log2_diff_min_qt_min_cb_intra_slice_chroma is inferred to be equal to sps_log2_diff_min_qt_min_cb_intra_slice_chroma. ph_max_mtt_hierarchy_depth_intra_slice_chroma specifies the maximum hierarchy depth for chroma coding units resulting from multi-type tree splitting of a chroma quadtree leaf with treeType equal to DUAL_TREE_CHROMA in slices with slice_type equal to 2 (I) associated with the PH. The value of ph_max_mtt_hierarchy_depth_intra_slice_chroma shall be in the range of 0 to 2*( CtbLog2SizeY − MinCbLog2SizeY ), inclusive. When not present, the value of ph_max_mtt_hierarchy_depth_intra_slice_chroma is inferred to be equal to sps_max_mtt_hierarchy_depth_intra_slice_chroma. ph_log2_diff_max_bt_min_qt_intra_slice_chroma specifies the difference between the base 2 logarithm of the maximum size (width or height) in luma samples of a chroma coding block that can be split using a binary split and the minimum size (width or height) in luma samples of a chroma leaf block resulting from quadtree splitting of a chroma CTU with treeType equal to DUAL_TREE_CHROMA in slices with slice_type equal to 2 (I) associated with the PH. The value of ph_log2_diff_max_bt_min_qt_intra_slice_chroma shall be in the range of 0 to CtbLog2SizeY − MinQtLog2SizeIntraC, inclusive. When not present, the value of ph_log2_diff_max_bt_min_qt_intra_slice_chroma is inferred to be equal to sps_log2_diff_max_bt_min_qt_intra_slice_chroma. ph_log2_diff_max_tt_min_qt_intra_slice_chroma specifies the difference between the base 2 logarithm of the maximum size (width or height) in luma samples of a chroma coding block that can be split using a ternary split and the minimum size (width or height) in luma samples of a chroma leaf block resulting from quadtree splitting of a chroma CTU with treeType equal to DUAL_TREE_CHROMA in slices with slice_type equal to 2 (I) associated with the PH. The value of ph_log2_diff_max_tt_min_qt_intra_slice_chroma shall be in the range of 0 to CtbLog2SizeY − MinQtLog2SizeIntraC, inclusive. When not present, the value of ph_log2_diff_max_tt_min_qt_intra_slice_chroma is inferred to be equal to sps_log2_diff_max_tt_min_qt_intra_slice_chroma ph_cu_qp_delta_subdiv_intra_slice specifies the maximum cbSubdiv value of coding units in intra slice that convey cu_qp_delta_abs and cu_qp_delta_sign_flag. The value of ph_cu_qp_delta_subdiv_intra_slice shall be in the range of 0 to 2 * ( CtbLog2SizeY − MinQtLog2SizeIntraY + ph_max_mtt_hierarchy_depth_intra_slice_ luma ), inclusive. When not present, the value of ph_cu_qp_delta_subdiv_intra_slice is inferred to be equal to 0. ph_cu_chroma_qp_offset_subdiv_intra_slice specifies the maximum cbSubdiv value of coding units in intra slice that convey cu_chroma_qp_offset_flag. The value of ph_cu_chroma_qp_offset_subdiv_intra_slice shall be in the range of 0 to 2 * ( CtbLog2SizeY − MinQtLog2SizeIntraY + ph_max_mtt_hierarchy_depth_intra_slice_ luma ), inclusive. When not present, the value of ph_cu_chroma_qp_offset_subdiv_intra_slice is inferred to be equal to 0. ph_log2_diff_min_qt_min_cb_inter_slice specifies the difference between the base 2 logarithm of the minimum size in luma samples of a luma leaf block resulting from quadtree splitting of a CTU and the base 2 logarithm of the minimum luma coding block size in luma samples for luma CUs in the slices with slice_type equal to 0 (B) or 1 (P) associated with the PH. The value of ph_log2_diff_min_qt_min_cb_inter_slice shall be in the range of 0 to CtbLog2SizeY − MinCbLog2SizeY, inclusive. When not present, the value of ph_log2_diff_min_qt_min_cb_luma is inferred to be equal to sps_log2_diff_min_qt_min_cb_inter_slice. ph_max_mtt_hierarchy_depth_inter_slice specifies the maximum hierarchy depth for coding units resulting from multi-type tree splitting of a quadtree leaf in slices with slice_type equal to 0 (B) or 1 (P) associated with the PH. The value of ph_max_mtt_hierarchy_depth_inter_slice shall be in the range of 0 to 2*( CtbLog2SizeY − MinCbLog2SizeY ), inclusive. When not present, the value of ph_max_mtt_hierarchy_depth_inter_slice is inferred to be equal to sps_max_mtt_hierarchy_depth_inter_slice. ph_log2_diff_max_bt_min_qt_inter_slice specifies the difference between the base 2 logarithm of the maximum size (width or height) in luma samples of a luma coding block that can be split using a binary split and the minimum size (width or height) in luma samples of a luma leaf block resulting from quadtree splitting of a CTU in the slices with slice_type equal to 0 (B) or 1 (P) associated with the PH. The value of ph_log2_diff_max_bt_min_qt_inter_slice shall be in the range of 0 to CtbLog2SizeY − MinQtLog2SizeInterY, inclusive. When not present, the value of ph_log2_diff_max_bt_min_qt_inter_slice is inferred to be equal to sps_log2_diff_max_bt_min_qt_inter_slice. ph_log2_diff_max_tt_min_qt_inter_slice specifies the difference between the base 2 logarithm of the maximum size (width or height) in luma samples of a luma coding block that can be split using a ternary split and the minimum size (width or height) in luma samples of a luma leaf block resulting from quadtree splitting of a CTU in slices with slice_type equal to 0 (B) or 1 (P) associated with the PH. The value of ph_log2_diff_max_tt_min_qt_inter_slice shall be in the range of 0 to CtbLog2SizeY − MinQtLog2SizeInterY, inclusive. When not present, the value of ph_log2_diff_max_tt_min_qt_inter_slice is inferred to be equal to sps_log2_diff_max_tt_min_qt_inter_slice. ph_cu_qp_delta_subdiv_inter_slice specifies the maximum cbSubdiv value of coding units that in inter slice convey cu_qp_delta_abs and cu_qp_delta_sign_flag. The value of ph_cu_qp_delta_subdiv_inter_slice shall be in the range of 0 to 2 * ( CtbLog2SizeY − MinQtLog2SizeInterY + ph_max_mtt_hierarchy_depth_inter_slice ) , inclusive. When not present, the value of ph_cu_qp_delta_subdiv_inter_slice is inferred to be equal to 0. ph_cu_chroma_qp_offset_subdiv_inter_slice specifies the maximum cbSubdiv value of coding units in inter slice that convey cu_chroma_qp_offset_flag. The value of ph_cu_chroma_qp_offset_subdiv_inter_slice shall be in the range of 0 to 2 * ( CtbLog2SizeY − MinQtLog2SizeInterY + ph_max_mtt_hierarchy_depth_inter_slice ) , inclusive. When not present, the value of ph_cu_chroma_qp_offset_subdiv_inter_slice is inferred to be equal to 0. ph_temporal_mvp_enabled_flag specifies whether temporal motion vector predictors can be used for inter prediction for slices associated with the PH. If ph_temporal_mvp_enabled_flag is equal to 0, the syntax elements of the slices associated with the PH shall be constrained such that no temporal motion vector predictor is used in decoding of the slices. Otherwise (ph_temporal_mvp_enabled_flag is equal to 1), temporal motion vector predictors may be used in decoding of the slices associated with the PH. When not present, the value of ph_temporal_mvp_enabled_flag is inferred to be equal to 0. When no reference picture in the DPB has the same spatial resolution as the current picture, the value of ph_temporal_mvp_enabled_flag shall be equal to 0. The maximum number of subblock-based merging MVP candidates, MaxNumSubblockMergeCand, is derived as follows: if( sps_affine_enabled_flag ) MaxNumSubblockMergeCand = 5 − five_minus_max_num_subblock_merge_cand (87) else MaxNumSubblockMergeCand = sps_sbtmvp_enabled_flag && ph_temporal_mvp_enable_flag The value of MaxNumSubblockMergeCand shall be in the range of 0 to 5, inclusive. ph_collocated_from_l0_flag equal to 1 specifies that the collocated picture used for temporal motion vector prediction is derived from reference picture list 0. ph_collocated_from_l0_flag equal to 0 specifies that the collocated picture used for temporal motion vector prediction is derived from reference picture list 1. ph_collocated_ref_idx specifies the reference index of the collocated picture used for temporal motion vector prediction. When ph_collocated_from_l0_flag is equal to 1, ph_collocated_ref_idx refers to an entry in reference picture list 0, and the value of ph_collocated_ref_idx shall be in the range of 0 to num_ref_entries[ 0 ][ RplsIdx[ 0 ] ] − 1, inclusive. When ph_collocated_from_l0_flag is equal to 0, ph_collocated_ref_idx refers to an entry in reference picture list 1, and the value of ph_collocated_ref_idx shall be in the range of 0 to num_ref_entries[ 1 ][ RplsIdx[ 1 ] ] − 1, inclusive. When not present, the value of ph_collocated_ref_idx is inferred to be equal to 0. mvd_l1_zero_flag equal to 1 indicates that the mvd_coding( x0, y0, 1 ) syntax structure is not parsed and MvdL1[ x0 ][ y0 ][ compIdx ] and MvdCpL1[ x0 ][ y0 ][ cpIdx ][ compIdx ] are set equal to 0 for compIdx = 0..1 and cpIdx = 0..2. mvd_l1_zero_flag equal to 0 indicates that the mvd coding( x0, y0, 1 ) syntax structure is parsed. ph_fpel_mmvd_enabled_flag equal to 1 specifies that merge mode with motion vector difference uses integer sample precision in the slices associated with the PH. ph_fpel_mmvd_enabled_flag equal to 0 specifies that merge mode with motion vector difference can use fractional sample precision in the slices associated with the PH. When not present, the value of ph_fpel_mmvd_enabled_flag is inferred to be 0. ph_disable_bdof_flag equal to 1 specifies that bi-directional optical flow inter prediction based inter bi-prediction is disabled in the slices associated with the PH. ph_disable_bdof_flag equal to 0 specifies that bi-directional optical flow inter prediction based inter bi-prediction may or may not be enabled in the slices associated with the PH. When ph_disable_bdof_flag is not present, the following applies: – If sps_bdof_enabled_flag is equal to 1, the value of ph_disable_bdof_flag is inferred to be equal to 0. – Otherwise (sps_bdof_enabled_flag is equal to 0), the value of ph_disable_bdof_flag is inferred to be equal to 1. ph_disable_dmvr_flag equal to 1 specifies that decoder motion vector refinement based inter bi-prediction is disabled in the slices associated with the PH. ph_disable_dmvr_flag equal to 0 specifies that decoder motion vector refinement based inter bi-prediction may or may not be enabled in the slices associated with the PH. When ph_disable_dmvr_flag is not present, the following applies: – If sps_dmvr_enabled_flag is equal to 1, the value of ph_disable_dmvr_flag is inferred to be equal to 0. – Otherwise (sps_dmvr_enabled_flag is equal to 0), the value of ph_disable_dmvr_flag is inferred to be equal to 1. ph_disable_prof_flag equal to 1 specifies that prediction refinement with optical flow is disabled in the slices associated with the PH. ph_disable_prof_flag equal to 0 specifies that prediction refinement with optical flow may or may not be enabled in the slices associated with the PH. When ph_disable_prof_flag is not present, the following applies: – If sps_affine_prof_enabled_flag is equal to 1, the value of ph_disable_prof_flag is inferred to be equal to 0. – Otherwise (sps_affine_prof_enabled_flag is equal to 0), the value of ph_disable_prof_flag is inferred to be equal to 1. ph_qp_delta specifies the initial value of QpY to be used for the coding blocks in the picture until modified by the value of CuQpDeltaVal in the coding unit layer. When qp_delta_info_in_ph_flag is equal to 1, the initial value of the QpY quantization parameter for all slices of the picture, SliceQpY, is derived as follows: SliceQpY = 26 + init_qp_minus26 + ph_qp_delta (88) The value of SliceQpY shall be in the range of −QpBdOffset to +63, inclusive. ph_joint_cbcr_sign_flag specifies whether, in transform units with tu_joint_cbcr_residual_flag[ x0 ][ y0 ] equal to 1, the collocated residual samples of both chroma components have inverted signs. When tu_joint_cbcr_residual_flag[ x0 ][ y0 ] equal to 1 for a transform unit, ph_joint_cbcr_sign_flag equal to 0 specifies that the sign of each residual sample of the Cr (or Cb) component is identical to the sign of the collocated Cb (or Cr) residual sample and ph_joint_cbcr_sign_flag equal to 1 specifies that the sign of each residual sample of the Cr (or Cb) component is given by the inverted sign of the collocated Cb (or Cr) residual sample. ph_sao_luma_enabled_flag equal to 1 specifies that SAO is enabled for the luma component in all slices associated with the PH; ph_sao_luma_enabled_flag equal to 0 specifies that SAO for the luma component may be disabled for one, or more, or all slices associated with the PH. When ph_sao_luma_enabled_flag is not present, it is inferred to be equal to 0. ph_sao_chroma_enabled_flag equal to 1 specifies that SAO is enabled for the chroma component in all slices associated with the PH; ph_sao_chroma_enabled_flag equal to 0 specifies that SAO for chroma component may be disabled for one, or more, or all slices associated with the PH. When ph_sao_chroma_enabled_flag is not present, it is inferred to be equal to 0. ph_dep_quant_enabled_flag equal to 0 specifies that dependent quantization is disabled for the current picture. ph_dep_quant_enabled_flag equal to 1 specifies that dependent quantization is enabled for the current picture. When ph_dep_quant_enabled_flag is not present, it is inferred to be equal to 0. pic_sign_data_hiding_enabled_flag equal to 0 specifies that sign bit hiding is disabled for the current picture. pic_sign_data_hiding_enabled_flag equal to 1 specifies that sign bit hiding is enabled for the current picture. When pic_sign_data_hiding_enabled_flag is not present, it is inferred to be equal to 0. ph_deblocking_filter_override_flag equal to 1 specifies that deblocking parameters are present in the PH. ph_deblocking_filter_override_flag equal to 0 specifies that deblocking parameters are not present in the PH. When not present, the value of ph_deblocking_filter_override_flag is inferred to be equal to 0. ph_deblocking_filter_disabled_flag equal to 1 specifies that the operation of the deblocking filter is not applied for the slices associated with the PH. ph_deblocking_filter_disabled_flag equal to 0 specifies that the operation of the deblocking filter is applied for the slices associated with the PH. When ph_deblocking_filter_disabled_flag is not present, it is inferred to be equal to pps_deblocking_filter_disabled_flag. ph_beta_offset_div2 and ph_tc_offset_div2 specify the deblocking parameter offsets for β and tC (divided by 2) that are applied to the luma component for the slices associated with the PH. The values of ph_beta_offset_div2 and ph_tc_offset_div2 shall both be in the range of −12 to 12, inclusive. When not present, the values of ph_beta_offset_div2 and ph_tc_offset_div2 are inferred to be equal to pps_beta_offset_div2 and pps_tc_offset_div2, respectively. ph_cb_beta_offset_div2 and ph_cb_tc_offset_div2 specify the deblocking parameter offsets for β and tC (divided by 2) that are applied to the Cb component for the slices associated with the PH. The values of ph_cb_beta_offset_div2 and ph_cb_tc_offset_div2 shall both be in the range of −12 to 12, inclusive. When not present, the values of
Figure imgf000080_0001
[0054] Improvements to Syntax Elements
[0055] In current VVC, when there are similar syntax elements for intra and inter prediction, respectively, in some places, the syntax elements related to inter prediction are defined prior to those related to intra prediction. Such an order may not be preferable, given the fact that intra prediction is allowed in all picture/slice types while inter prediction is not. It would be beneficial from a standardization point of view to always define intra prediction related syntaxes prior to those for inter prediction.
[0056] It is also observed that in the current VVC, some syntax elements that are highly correlated to each other are defined at different places in a spread manner. It would also be beneficial from a standardization point of view to group some syntaxes together.
[0057] Proposed Methods
[0058] In this disclosure, to address the issues as pointed out in the “problem statement” section, methods are provided to simplify and/or further improve the existing design of the high-level syntax. It is noted that the invented methods could be applied independently or jointly.
[0059] Grouping the Partition Constraint Syntax Elements by Prediction Type
[0060] In this disclosure, it is proposed to rearrange the syntax elements so that the intra prediction related syntax elements are defined before those related to inter prediction.
According to the disclosure, the partition constraint syntax elements are grouped by prediction type, with intra prediction related first, followed by inter prediction related. In one embodiment, the order of the partition constraint syntax elements in SPS is consistent with the order of the partition constraint syntax elements in the picture header. An example of the decoding process on VVC Draft is illustrated in Table 12 below. The changes to the VVC Draft are shown using bold and italicized font. The strikethrough text is removed from the process. Table 12. Proposed sequence parameter set RBSP syntax
Figure imgf000081_0001
Figure imgf000082_0001
[0061] FIG.4 shows a method for decoding a video signal in accordance with the present disclosure. The method may be, for example, applied to a decoder. [0062] In step 410, the decoder may receive arranged partition constraint syntax elements in SPS level. The arranged partition constraint syntax elements are arranged so that intra prediction related syntax elements are defined before inter prediction related syntax elements. [0063] In step 412, the decoder may obtain a first reference picture I(0) and a second reference picture I(1) associated with a video block in a bitstream. The first reference picture I(0) is before a current picture and the second reference picture I(1) is after the current picture in display order. [0064] In step 414, the decoder may obtain first prediction samples I(0)(i, j) of the video block from a reference block in the first reference picture I(0). The i and j represent a coordinate of one sample with the current picture. [0065] In step 416, the decoder may obtain second prediction samples I(1)(i, j) of the video block from a reference block in the second reference picture I(1). [0066] In step 418, the decoder may obtain bi-prediction samples based on the arranged partition constraint syntax elements, the first prediction samples I(0)(i, j) , and the second prediction samples I(1)(i, j). [0067] FIG.5 shows a method for decoding a video signal in accordance with the present disclosure. The method may be, for example, applied to a decoder. In step 510, the decoder may receive a bitstream that includes VPS, SPS, PPS, picture header, and slice header for coded video data. In step 512, the decoder may decode the VPS. In step 514, the decoder may decode the SPS and obtain an arranged partition constraint syntax elements in SPS level. In step 516, the decoder may decode the PPS. In step 518, the decoder may decode the picture header. In step 520, the decoder may decode the slice header. In step 522, the decoder may decode the video data based on VPS, SPS, PPS, picture header and slice header. [0068] Grouping the Dual-Tree Chroma Syntax Elements [0069] In this disclosure, it is proposed to group the syntax elements related to dual-tree chroma type. In one embodiment, the partition constraint syntax elements for dual-tree chroma in SPS should be signaled together under dual-tree chroma cases. An example of the decoding process on VVC Draft is illustrated in Table 13 below. The changes to the VVC Draft are shown using bold and italicized font. The strikethrough text is removed from the process. Table 13. Proposed sequence parameter set RBSP syntax
Figure imgf000083_0001
[0070] If also considering defining intra prediction related syntaxes prior to those related to inter prediction, according to the method of the disclosure, another example of the decoding process on VVC Draft is illustrated in table 14 below. The changes to the VVC Draft are shown using bold and italicized font The strikethrough text is removed from the process Table 14. Proposed sequence parameter set RBSP syntax
Figure imgf000084_0001
Figure imgf000085_0001
[0071] Conditionally Signaling Inter-Prediction Related Syntax Elements [0072] As mentioned in the earlier description, according to the current VVC, intra prediction is allowed in all picture/slice types while inter prediction is not. According to this disclosure, it is proposed to add a flag in VVC syntax at a certain coding level to indicate if inter prediction is used or not in a sequence, picture and/or slice. In case inter prediction is not used, inter prediction related syntaxes are not signaled at the corresponding coding level, e.g., sequence, picture and/or slice level. [0073] In one example, according to the method of the disclosure, a flag is added in SPS to indicate if inter prediction is used in coding the current video sequence. In case it is not used, inter prediction related syntax elements are not signaled in SPS. An example of the decoding process on VVC Draft is illustrated in Table 15 below. The changes to the VVC Draft are shown using bold and italicized font. Table 15. Proposed sequence parameter set RBSP syntax
Figure imgf000085_0002
Figure imgf000086_0001
[0074] 7.4.3.3 Sequence parameter set RBSP semantics [0075] sps_inter_slice_used_flag equal to 0 specifies that all coded slices of the video sequence have slice_type equal to 2. sps_inter_slice_used_flag equal to 1 specifies that there may or may not be one or more coded slices in the video sequence that have slice_type equal to 0 or 1. [0076] The above methods may be implemented using an apparatus that includes one or more circuitries, which include application specific integrated circuits (ASICs), digital signal processors (DSPs), digital signal processing devices (DSPDs), programmable logic devices (PLDs), field programmable gate arrays (FPGAs), controllers, microcontrollers, microprocessors, or other electronic components. The apparatus may use the circuitries in combination with the other hardware or software components for performing the above- described methods. Each module, sub-module, unit, or sub-unit disclosed above may be implemented at least partially using one or more circuitries. [0077] FIG.6 shows a computing environment 610 coupled with a user interface 660. The computing environment 610 can be part of a data processing server. The computing environment 610 includes processor 620, memory 640, and I/O interface 650. [0078] The processor 620 typically controls overall operations of the computing environment 610, such as the operations associated with the display, data acquisition, data communications, and image processing. The processor 620 may include one or more processors to execute instructions to perform all or some of the steps in the above-described methods. Moreover, the processor 620 may include one or more modules that facilitate the interaction between the processor 620 and other components. The processor may be a Central Processing Unit (CPU), a microprocessor, a single chip machine, a GPU, or the like. [0079] The memory 640 is configured to store various types of data to support the operation of the computing environment 610. Memory 640 may include predetermine software 642. Examples of such data include instructions for any applications or methods operated on the computing environment 610, video datasets, image data, etc. The memory 640 may be implemented by using any type of volatile or non-volatile memory devices, or a combination thereof, such as a static random access memory (SRAM), an electrically erasable programmable read-only memory (EEPROM), an erasable programmable read-only memory (EPROM), a programmable read-only memory (PROM), a read-only memory (ROM), a magnetic memory, a flash memory, a magnetic or optical disk.
[0080] The I/O interface 650 provides an interface between the processor 620 and peripheral interface modules, such as a keyboard, a click wheel, buttons, and the like. The buttons may include but are not limited to, a home button, a start scan button, and a stop scan button. The I/O interface 650 can be coupled with an encoder and decoder.
[0081] In some embodiments, there is also provided a non-transitory computer-readable storage medium comprising a plurality of programs, such as comprised in the memory 640, executable by the processor 620 in the computing environment 610, for performing the above- described methods. For example, the non-transitory computer-readable storage medium may be a ROM, a RAM, a CD-ROM, a magnetic tape, a floppy disc, an optical data storage device or the like.
[0082] The non-transitory computer-readable storage medium has stored therein a plurality of programs for execution by a computing device having one or more processors, where the plurality of programs when executed by the one or more processors, cause the computing device to perform the above-described method for motion prediction.
[0083] In some embodiments, the computing environment 610 may be implemented with one or more application-specific integrated circuits (ASICs), digital signal processors (DSPs), digital signal processing devices (DSPDs), programmable logic devices (PLDs), field- programmable gate arrays (FPGAs), graphical processing units (GPUs), controllers, micro controllers, microprocessors, or other electronic components, for performing the above methods.
[0084] Other examples of the disclosure will be apparent to those skilled in the art from consideration of the specification and practice of the disclosure disclosed here. This application is intended to cover any variations, uses, or adaptations of the disclosure following the general principles thereof and including such departures from the present disclosure as come within known or customary practice in the art. It is intended that the specification and examples be considered as exemplary only. [0085] It will be appreciated that the present disclosure is not limited to the exact examples described above and illustrated in the accompanying drawings, and that various modifications and changes can be made without departing from the scope thereof.

Claims

CLAIMS What is claimed is: 1. A method for decoding a video signal, comprising: receiving, by a decoder, arranged partition constraint syntax elements in sequence parameter set (SPS) level, wherein the arranged partition constraint syntax elements are arranged so that intra prediction related syntax elements are defined before inter prediction related syntax elements; obtaining, at the decoder, a first reference picture I(0) and a second reference picture I(1) associated with a video block in a bitstream, wherein the first reference picture I(0) is before a current picture and the second reference picture I(1) is after the current picture in display order; obtaining, at the decoder, first prediction samples I(0)(i, j) of the video block from a reference block in the first reference picture I(1), wherein i and j represent a coordinate of one sample with the current picture; obtaining, at the decoder, second prediction samples I(1)(i, j) of the video block from a reference block in the second reference picture I(1); and obtaining, at the decoder, bi-prediction samples based on the arranged partition constraint syntax elements, the first prediction samples I(0)(i, j), and the second prediction samples I(1)(i, j).
2. The method of claim 1, wherein the arranged partition constraint syntax elements are arranged in a consistent order to that of partition constraint syntax elements in a picture header.
3. The method of claim 1, wherein receiving, by the decoder, the arranged partition constraint syntax elements in the SPS level comprises: receiving, by the decoder, the arranged partition constraint syntax elements in the SPS level, wherein the arranged partition constraint syntax elements are arranged by: determining that a qtbtt_dual_tree_intra_flag flag is true; setting, when the qtbtt_dual_tree_intra_flag flag is true, an sps_log2_diff_min_qt_min_cb_intra_slice_chroma value; and setting an sps log2 diff min qt min cb inter slice value.
4. The method of claim 1, wherein receiving, by the decoder, the arranged partition constraint syntax elements in the SPS level comprises: receiving, by the decoder, the arranged partition constraint syntax elements in the SPS level, wherein the arranged partition constraint syntax elements comprise partition constraint syntax elements for dual-tree chroma in the SPS level and are signaled together under dual tree chroma cases.
5. The method of claim 1, wherein receiving, by the decoder, the arranged partition constraint syntax elements in the SPS level comprises: receiving, by the decoder, the arranged partition constraint syntax elements in the SPS level, wherein the arranged partition constraint syntax elements are arranged by: setting an log2_min_luma_coding_block_size_minus2 value; determining that a sps_max_mtt_hierarchy_depth_intra_slice_luma value is not 0; setting a sps_log2_diff_max_bt_min_qt_intra_slice_luma value; setting a sps_log2_diff_min_qt_min_cb_inter_slice value; determining that a ChromaArrayType value is not equal to 0; signaling a qtbtt_dual_tree_intra_flag flag; determining that a qtbtt dual tree intra flag flag is true; and setting, when the qtbtt_dual_tree_intra_flag flag is true, a sps_log2_diff_min_qt_min_cb_intra_slice_chroma value.
6. The method of claim 1, wherein receiving, by the decoder, the arranged partition constraint syntax elements in the SPS level comprises: receiving, by the decoder, the arranged partition constraint syntax elements in the SPS level, wherein the arranged partition constraint syntax elements are arranged by: setting an log2_min_luma_coding_block_size_minus2 value; determining that a sps_max_mtt_hierarchy_depth_intra_slice_luma value is not 0; setting a sps_log2_diff_max_bt_min_qt_intra_slice_luma value; determining that a ChromaArrayType value is not equal to 0; signaling a qtbtt_dual_tree_intra_flag flag; determining that a qtbtt dual tree intra flag flag is true; setting, when the qtbtt_dual_tree_intra_flag flag is true, a sps_log2_diff_min_qt_min_cb_intra_slice_chroma value; and setting a sps_log2_diff_min_qt_min_cb_inter_slice value.
7. The method of claim 1, wherein receiving, by the decoder, the arranged partition constraint syntax elements in the SPS level comprises: receiving, by the decoder, the arranged partition constraint syntax elements in the SPS level, wherein the arranged partition constraint syntax elements comprise a versatile video coding (VvC) syntax flag that indicates that inter prediction is used.
8. The method of claim 7, wherein the VVC syntax flag is signaled at the SPS level and indicates that inter prediction is used in coding a current video sequence.
9. The method of claim 7, wherein receiving, by the decoder, the arranged partition constraint syntax elements in the SPS level comprises: receiving, by the decoder, the arranged partition constraint syntax elements in the SPS level, wherein the arranged partition constraint syntax elements are arranged by: signaling a sps_inter_slice_used_flag flag; determining that a ChromaArrayType value is not equal to 0; signaling a qtbtt_dual_tree_intra_flag flag; determining that a sps_max_mtt_hierarchy_depth_intra_slice_luma value is not 0; setting a sps_log2_diff_max_bt_min_qt_intra_slice_luma value; determining that the sps_inter_slice_used_flag flag is not true; and setting, when the sps_inter_slice_used_flag flag is not true, a sps_log2_diff_min_qt_min_cb_inter_slice value.
10. A computing device, comprising: one or more processors; and a non-transitory computer-readable storage medium storing instructions executable by the one or more processors, wherein the one or more processors are configured to: receive arranged partition constraint syntax elements in sequence parameter set (SPS) level, wherein the arranged partition constraint syntax elements are arranged so that intra prediction related syntax elements are defined before inter prediction related syntax elements; obtain a first reference picture I(0) and a second reference picture I(1) associated with a video block in a bitstream, wherein the first reference picture I(0) is before a current picture and the second reference picture I(1) is after the current picture in display order; obtain first prediction samples I(0)(i, j) of the video block from a reference block in the first reference picture I(1), wherein i and j represent a coordinate of one sample with the current picture; obtain second prediction samples I(1)(i, j) of the video block from a reference block in the second reference picture I(1); and obtain bi-prediction samples based on the arranged partition constraint syntax elements, the first prediction samples I(0)(i, j), and the second prediction samples I(1)(i, j).
11. The computing device of claim 10, wherein the arranged partition constraint syntax elements are arranged in a consistent order to that of partition constraint syntax elements in a picture header.
12. The computing device of claim 10, wherein the one or more processors configured to receive the arranged partition constraint syntax elements in the SPS level are further configured to: receive the arranged partition constraint syntax elements in the SPS level, wherein the arranged partition constraint syntax elements are arranged by: determine that a qtbtt_dual_tree_intra_flag flag is true; set, when the qtbtt_dual_tree_intra_flag flag is true, an sps_log2_diff_min_qt_min_cb_intra_slice_chroma value; and set an sps_log2_diff_min_qt_min_cb_inter_slice value.
13. The computing device of claim 10, wherein the one or more processors configured to receive the arranged partition constraint syntax elements in the SPS level are further configured to: receive the arranged partition constraint syntax elements in the SPS level, wherein the arranged partition constraint syntax elements comprise partition constraint syntax elements for dual-tree chroma in the SPS level and are signaled together under dual-tree chroma cases
14. The computing device of claim 10, wherein the one or more processors configured to receive the arranged partition constraint syntax elements in the SPS level are further configured to: receive the arranged partition constraint syntax elements in the SPS level, wherein the arranged partition constraint syntax elements are arranged by: set an log2_min_luma_coding_block_size_minus2 value; determine that a sps_max_mtt_hierarchy_depth_intra_slice_luma value is not
0; set a sps_log2_diff_max_bt_min_qt_intra_slice_luma value; set a sps_log2_diff_min_qt_min_cb_inter_slice value; determine that a ChromaArrayType value is not equal to 0; signal a qtbtt dual tree intra flag flag; determine that a qtbtt dual tree intra flag flag is true; and set, when the qtbtt dual tree intra flag flag is true, a sps_log2_diff_min_qt_min_cb_intra_slice_chroma value.
15. The computing device of claim 10, wherein the one or more processors configured to receive the arranged partition constraint syntax elements in the SPS level are further configured to: receive the arranged partition constraint syntax elements in the SPS level, wherein the arranged partition constraint syntax elements are arranged by: set an log2_min_luma_coding_block_size_minus2 value; determine that a sps_max_mtt_hierarchy_depth_intra_slice_luma value is not
0; set a sps_log2_diff_max_bt_min_qt_intra_slice_luma value; determine that a ChromaArrayType value is not equal to 0; signal a qtbtt_dual_tree_intra_flag flag; determine that a qtbtt_dual_tree_intra_flag flag is true; set, when the qtbtt_dual_tree_intra_flag flag is true, a sps_log2_diff_min_qt_min_cb_intra_slice_chroma value; and set a sps_log2_diff_min_qt_min_cb_inter_slice value.
16. The computing device of claim 10, wherein the one or more processors configured to receive the arranged partition constraint syntax elements in the SPS level are further configured to: receive the arranged partition constraint syntax elements in the SPS level, wherein the arranged partition constraint syntax elements comprise a versatile video coding (VVC) syntax flag that indicates that inter prediction is used.
17. The computing device of claim 16, wherein the VVC syntax flag is signaled at the SPS level and indicates that inter prediction is used in coding a current video sequence.
18. The computing device of claim 16, wherein the one or more processors configured to receive the arranged partition constraint syntax elements in the SPS level are further configured to: receive the arranged partition constraint syntax elements in the SPS level, wherein the arranged partition constraint syntax elements are arranged by: signal a sps_inter_slice_used_flag flag; determine that a ChromaArrayType value is not equal to 0; signal a qtbtt_dual_tree_intra_flag flag; determine that a sps_max_mtt_hierarchy_depth_intra_slice_luma value is not 0; set a sps_log2_diff_max_bt_min_qt_intra_slice_luma value; determine that the sps_inter_slice_used_flag flag is not set; and set a sps_log2_diff_min_qt_min_cb_inter_slice value.
19. A non-transitory computer-readable storage medium storing a plurality of programs for execution by a computing device having one or more processors, wherein the plurality of programs, when executed by the one or more processors, cause the computing device to perform acts comprising: receiving, by a decoder, arranged partition constraint syntax elements in sequence parameter set (SPS) level, wherein the arranged partition constraint syntax elements are arranged so that intra prediction related syntax elements are defined before inter prediction related syntax elements; obtaining, at the decoder, a first reference picture I(0) and a second reference picture I(1) associated with a video block in a bitstream, wherein the first reference picture I(0) is before a current picture and the second reference picture I(1) is after the current picture in display order; obtaining, at the decoder, first prediction samples I(0)(i, j) of the video block from a reference block in the first reference picture I(0), wherein i and j represent a coordinate of one sample with the current picture; obtaining, at the decoder, second prediction samples I(1)(i, j) of the video block from a reference block in the second reference picture I(1); and obtaining, at the decoder, bi-prediction samples based on the arranged partition constraint syntax elements, the first prediction samples I(0)(i, j), and the second prediction samples I(1)(i, j).
20. The non-transitory computer-readable storage medium of claim 19, wherein the arranged partition constraint syntax elements are arranged in a consistent order to that of partition constraint syntax elements in a picture header.
21. The non-transitory computer-readable storage medium of claim 19, wherein the plurality of programs further cause the computing device to perform: receiving, by the decoder, the arranged partition constraint syntax elements in the SPS level, wherein the arranged partition constraint syntax elements are arranged by: determining that a qtbtt_dual_tree_intra_flag flag is true; setting, when the qtbtt_dual_tree_intra_flag flag is true, an sps_log2_diff_min_qt_min_cb_intra_slice_chroma value; and setting an sps_log2_diff_min_qt_min_cb_inter_slice value.
22. The non-transitory computer-readable storage medium of claim 19, wherein the plurality of programs further cause the computing device to perform: receiving, by the decoder, the arranged partition constraint syntax elements in the SPS level, wherein the arranged partition constraint syntax elements comprise partition constraint syntax elements for dual-tree chroma in the SPS level and are signaled together under dual- tree chroma cases.
23. The non-transitory computer-readable storage medium of claim 19, wherein the plurality of programs further cause the computing device to perform: receiving, by the decoder, the arranged partition constraint syntax elements in the SPS level, wherein the arranged partition constraint syntax elements are arranged by: setting an log2_min_luma_coding_block_size_minus2 value; determining that a sps_max_mtt_hierarchy_depth_intra_slice_luma value is not 0; setting a sps_log2_diff_max_bt_min_qt_intra_slice_luma value; setting a sps_log2_diff_min_qt_min_cb_inter_slice value; determining that a ChromaArrayType value is not equal to 0; signaling a qtbtt_dual_tree_intra_flag flag; determining that a qtbtt_dual_tree_intra_flag flag is true; and setting, when the qtbtt_dual_tree_intra_flag flag is true, a sps_log2_diff_min_qt_min_cb_intra_slice_chroma value.
24. The non-transitory computer-readable storage medium of claim 19, wherein the plurality of programs further cause the computing device to perform: receiving, by the decoder, the arranged partition constraint syntax elements in the SPS level, wherein the arranged partition constraint syntax elements are arranged by: setting an log2_min_luma_coding_block_size_minus2 value; determining that a sps_max_mtt_hierarchy_depth_intra_slice_luma value is not 0; setting a sps_log2_diff_max_bt_min_qt_intra_slice_luma value; determining that a ChromaArrayType value is not equa l to 0 ; signaling a qtbtt_dual_tree_intra_flag flag; determining that a qtbtt_dual_tree_intra_flag flag is true; setting, when the qtbtt_dual_tree_intra_flag flag is true, a sps_log2_diff_min_qt_min_cb_intra_slice_chroma value; and setting a sps_log2_diff_min_qt_min_cb_inter_slice value.
25. The non-transitory computer-readable storage medium of claim 19, wherein the plurality of programs further cause the computing device to perform: receiving, by the decoder, the arranged partition constraint syntax elements in the SPS level, wherein the arranged partition constraint syntax elements comprise a versatile video coding (VVC) syntax flag that indicates that inter prediction is used.
26. The non-transitory computer-readable storage medium of claim 25, wherein the VVC syntax flag is signaled at the SPS level and indicates that inter prediction is used in coding a current video sequence.
27. The non-transitory computer-readable storage medium of claim 25, wherein the plurality of programs further cause the computing device to perform: receiving, by the decoder, the arranged partition constraint syntax elements in the SPS level, wherein the arranged partition constraint syntax elements are arranged by: signaling a sps_inter_slice_used_flag flag; determining that a ChromaArrayType value is not equal to 0; signaling a qtbtt_dual_tree_intra_flag flag; determining that a sps_max_mtt_hierarchy_depth_intra_slice_luma value is not 0; setting a sps_log2_diff_max_bt_min_qt_intra_slice_luma value; determining that the sps_inter_slice_used_flag flag is not true; and setting, when the sps_inter_slice_used_flag flag is not true, a sps_log2_diff_min_qt_min_cb_inter_slice value.
PCT/US2021/023751 2020-03-31 2021-03-23 Methods and devices for high-level syntax in video coding WO2021202178A1 (en)

Priority Applications (3)

Application Number Priority Date Filing Date Title
EP21781444.1A EP4118835A4 (en) 2020-03-31 2021-03-23 Methods and devices for high-level syntax in video coding
CN202180024900.8A CN115336280A (en) 2020-03-31 2021-03-23 Method and apparatus for high level syntax in video coding and decoding
US17/953,311 US20230031964A1 (en) 2020-03-31 2022-09-26 Methods and devices for high-level syntax in video coding

Applications Claiming Priority (2)

Application Number Priority Date Filing Date Title
US202063003229P 2020-03-31 2020-03-31
US63/003,229 2020-03-31

Related Child Applications (1)

Application Number Title Priority Date Filing Date
US17/953,311 Continuation US20230031964A1 (en) 2020-03-31 2022-09-26 Methods and devices for high-level syntax in video coding

Publications (1)

Publication Number Publication Date
WO2021202178A1 true WO2021202178A1 (en) 2021-10-07

Family

ID=77930141

Family Applications (1)

Application Number Title Priority Date Filing Date
PCT/US2021/023751 WO2021202178A1 (en) 2020-03-31 2021-03-23 Methods and devices for high-level syntax in video coding

Country Status (4)

Country Link
US (1) US20230031964A1 (en)
EP (1) EP4118835A4 (en)
CN (1) CN115336280A (en)
WO (1) WO2021202178A1 (en)

Cited By (2)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
US20210314623A1 (en) * 2020-04-02 2021-10-07 Qualcomm Incorporated General constraint information syntax in video coding
WO2023132992A1 (en) * 2022-01-10 2023-07-13 Innopeak Technology, Inc. Signaling general constraints information for video coding

Citations (4)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
WO2019001785A1 (en) * 2017-06-30 2019-01-03 Huawei Technologies Co., Ltd. Overlapped search space for bi-predictive motion vector refinement
US20190166381A1 (en) * 2011-03-03 2019-05-30 Sun Patent Trust Moving picture coding method, moving picture decoding method, moving picture coding apparatus, moving picture decoding apparatus, and moving picture coding and decoding apparatus
US20190327488A1 (en) * 2010-03-17 2019-10-24 Ntt Docomo, Inc. Moving image prediction encoding/decoding system
WO2020060843A1 (en) * 2018-09-19 2020-03-26 Interdigital Vc Holdings, Inc. Generalized bi-prediction index coding

Patent Citations (4)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
US20190327488A1 (en) * 2010-03-17 2019-10-24 Ntt Docomo, Inc. Moving image prediction encoding/decoding system
US20190166381A1 (en) * 2011-03-03 2019-05-30 Sun Patent Trust Moving picture coding method, moving picture decoding method, moving picture coding apparatus, moving picture decoding apparatus, and moving picture coding and decoding apparatus
WO2019001785A1 (en) * 2017-06-30 2019-01-03 Huawei Technologies Co., Ltd. Overlapped search space for bi-predictive motion vector refinement
WO2020060843A1 (en) * 2018-09-19 2020-03-26 Interdigital Vc Holdings, Inc. Generalized bi-prediction index coding

Non-Patent Citations (2)

* Cited by examiner, † Cited by third party
Title
BENJAMIN BROSS , JIANLE CHEN , SHAN LIU , YE-KUI WANG: "Versatile Video Coding (Draft 8)", 17. JVET MEETING; 20200107 - 20200117; BRUSSELS; (THE JOINT VIDEO EXPLORATION TEAM OF ISO/IEC JTC1/SC29/WG11 AND ITU-T SG.16 ), no. JVET-Q2001-vE, 12 March 2020 (2020-03-12), pages 1 - 510, XP030285390 *
See also references of EP4118835A4 *

Cited By (3)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
US20210314623A1 (en) * 2020-04-02 2021-10-07 Qualcomm Incorporated General constraint information syntax in video coding
US11563980B2 (en) * 2020-04-02 2023-01-24 Qualcomm Incorporated General constraint information syntax in video coding
WO2023132992A1 (en) * 2022-01-10 2023-07-13 Innopeak Technology, Inc. Signaling general constraints information for video coding

Also Published As

Publication number Publication date
EP4118835A1 (en) 2023-01-18
EP4118835A4 (en) 2023-05-10
CN115336280A (en) 2022-11-11
US20230031964A1 (en) 2023-02-02

Similar Documents

Publication Publication Date Title
EP3158739B1 (en) Block adaptive color-space conversion coding
US20240048699A1 (en) Interplay between subpictures and in-loop filtering
KR20220143843A (en) Constraints on high level syntax elements
JP7407299B2 (en) High-level bitstream syntax for quantization parameters
JP7454681B2 (en) Video coding and decoding constraints
US20230008778A1 (en) Interplay between picture header and slice header of a video bitstream
KR20220066897A (en) Syntax for subpicture signaling in video bitstreams
US20230031964A1 (en) Methods and devices for high-level syntax in video coding
CN115428444A (en) Constraints on adaptive parameter set syntax elements
US20230199223A1 (en) High-level syntax for video coding
JP2024003125A (en) General constraint information in video coding
KR20220163969A (en) Signaling for Inter Prediction in Higher Level Syntax
US20230103542A1 (en) Methods and devices for high-level syntax in video coding
KR20230016655A (en) Palette mode with local dual tree mode type definition
KR20230015416A (en) Signaling of general constraint flags
JP7482220B2 (en) Syntax constraints on subpicture parameter set signaling
US20240137571A1 (en) General Constraint Information In Video Coding
KR20220114555A (en) Signaling of subpicture level parameters in the video bitstream

Legal Events

Date Code Title Description
121 Ep: the epo has been informed by wipo that ep was designated in this application

Ref document number: 21781444

Country of ref document: EP

Kind code of ref document: A1

ENP Entry into the national phase

Ref document number: 2021781444

Country of ref document: EP

Effective date: 20221011

NENP Non-entry into the national phase

Ref country code: DE