WO2020084604A1 - Procédés rapides de décision d'arbre de partition - Google Patents

Procédés rapides de décision d'arbre de partition Download PDF

Info

Publication number
WO2020084604A1
WO2020084604A1 PCT/IB2019/059219 IB2019059219W WO2020084604A1 WO 2020084604 A1 WO2020084604 A1 WO 2020084604A1 IB 2019059219 W IB2019059219 W IB 2019059219W WO 2020084604 A1 WO2020084604 A1 WO 2020084604A1
Authority
WO
WIPO (PCT)
Prior art keywords
eqt
current block
block
video
partitioning
Prior art date
Application number
PCT/IB2019/059219
Other languages
English (en)
Inventor
Li Zhang
Kai Zhang
Hongbin Liu
Yue Wang
Original Assignee
Beijing Bytedance Network Technology Co., Ltd.
Bytedance Inc.
Priority date (The priority date is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the date listed.)
Filing date
Publication date
Application filed by Beijing Bytedance Network Technology Co., Ltd., Bytedance Inc. filed Critical Beijing Bytedance Network Technology Co., Ltd.
Publication of WO2020084604A1 publication Critical patent/WO2020084604A1/fr

Links

Classifications

    • HELECTRICITY
    • H04ELECTRIC COMMUNICATION TECHNIQUE
    • H04NPICTORIAL COMMUNICATION, e.g. TELEVISION
    • H04N19/00Methods or arrangements for coding, decoding, compressing or decompressing digital video signals
    • H04N19/10Methods or arrangements for coding, decoding, compressing or decompressing digital video signals using adaptive coding
    • H04N19/169Methods or arrangements for coding, decoding, compressing or decompressing digital video signals using adaptive coding characterised by the coding unit, i.e. the structural portion or semantic portion of the video signal being the object or the subject of the adaptive coding
    • H04N19/184Methods or arrangements for coding, decoding, compressing or decompressing digital video signals using adaptive coding characterised by the coding unit, i.e. the structural portion or semantic portion of the video signal being the object or the subject of the adaptive coding the unit being bits, e.g. of the compressed video stream
    • HELECTRICITY
    • H04ELECTRIC COMMUNICATION TECHNIQUE
    • H04NPICTORIAL COMMUNICATION, e.g. TELEVISION
    • H04N19/00Methods or arrangements for coding, decoding, compressing or decompressing digital video signals
    • H04N19/90Methods or arrangements for coding, decoding, compressing or decompressing digital video signals using coding techniques not provided for in groups H04N19/10-H04N19/85, e.g. fractals
    • H04N19/96Tree coding, e.g. quad-tree coding
    • HELECTRICITY
    • H04ELECTRIC COMMUNICATION TECHNIQUE
    • H04NPICTORIAL COMMUNICATION, e.g. TELEVISION
    • H04N19/00Methods or arrangements for coding, decoding, compressing or decompressing digital video signals
    • H04N19/10Methods or arrangements for coding, decoding, compressing or decompressing digital video signals using adaptive coding
    • H04N19/102Methods or arrangements for coding, decoding, compressing or decompressing digital video signals using adaptive coding characterised by the element, parameter or selection affected or controlled by the adaptive coding
    • H04N19/119Adaptive subdivision aspects, e.g. subdivision of a picture into rectangular or non-rectangular coding blocks
    • HELECTRICITY
    • H04ELECTRIC COMMUNICATION TECHNIQUE
    • H04NPICTORIAL COMMUNICATION, e.g. TELEVISION
    • H04N19/00Methods or arrangements for coding, decoding, compressing or decompressing digital video signals
    • H04N19/10Methods or arrangements for coding, decoding, compressing or decompressing digital video signals using adaptive coding
    • H04N19/102Methods or arrangements for coding, decoding, compressing or decompressing digital video signals using adaptive coding characterised by the element, parameter or selection affected or controlled by the adaptive coding
    • H04N19/124Quantisation
    • HELECTRICITY
    • H04ELECTRIC COMMUNICATION TECHNIQUE
    • H04NPICTORIAL COMMUNICATION, e.g. TELEVISION
    • H04N19/00Methods or arrangements for coding, decoding, compressing or decompressing digital video signals
    • H04N19/10Methods or arrangements for coding, decoding, compressing or decompressing digital video signals using adaptive coding
    • H04N19/134Methods or arrangements for coding, decoding, compressing or decompressing digital video signals using adaptive coding characterised by the element, parameter or criterion affecting or controlling the adaptive coding
    • H04N19/146Data rate or code amount at the encoder output
    • H04N19/147Data rate or code amount at the encoder output according to rate distortion criteria
    • HELECTRICITY
    • H04ELECTRIC COMMUNICATION TECHNIQUE
    • H04NPICTORIAL COMMUNICATION, e.g. TELEVISION
    • H04N19/00Methods or arrangements for coding, decoding, compressing or decompressing digital video signals
    • H04N19/10Methods or arrangements for coding, decoding, compressing or decompressing digital video signals using adaptive coding
    • H04N19/169Methods or arrangements for coding, decoding, compressing or decompressing digital video signals using adaptive coding characterised by the coding unit, i.e. the structural portion or semantic portion of the video signal being the object or the subject of the adaptive coding
    • H04N19/17Methods or arrangements for coding, decoding, compressing or decompressing digital video signals using adaptive coding characterised by the coding unit, i.e. the structural portion or semantic portion of the video signal being the object or the subject of the adaptive coding the unit being an image region, e.g. an object
    • H04N19/176Methods or arrangements for coding, decoding, compressing or decompressing digital video signals using adaptive coding characterised by the coding unit, i.e. the structural portion or semantic portion of the video signal being the object or the subject of the adaptive coding the unit being an image region, e.g. an object the region being a block, e.g. a macroblock
    • HELECTRICITY
    • H04ELECTRIC COMMUNICATION TECHNIQUE
    • H04NPICTORIAL COMMUNICATION, e.g. TELEVISION
    • H04N19/00Methods or arrangements for coding, decoding, compressing or decompressing digital video signals
    • H04N19/10Methods or arrangements for coding, decoding, compressing or decompressing digital video signals using adaptive coding
    • H04N19/169Methods or arrangements for coding, decoding, compressing or decompressing digital video signals using adaptive coding characterised by the coding unit, i.e. the structural portion or semantic portion of the video signal being the object or the subject of the adaptive coding
    • H04N19/186Methods or arrangements for coding, decoding, compressing or decompressing digital video signals using adaptive coding characterised by the coding unit, i.e. the structural portion or semantic portion of the video signal being the object or the subject of the adaptive coding the unit being a colour or a chrominance component
    • HELECTRICITY
    • H04ELECTRIC COMMUNICATION TECHNIQUE
    • H04NPICTORIAL COMMUNICATION, e.g. TELEVISION
    • H04N19/00Methods or arrangements for coding, decoding, compressing or decompressing digital video signals
    • H04N19/70Methods or arrangements for coding, decoding, compressing or decompressing digital video signals characterised by syntax aspects related to video coding, e.g. related to compression standards

Definitions

  • the present document relates to video and image coding technology.
  • Some disclosed embodiments relate to coding and decoding of images and video pictures using a rule based extended quadtree partitioning process.
  • certain aspects of the rule are pre-defined, allowing encoder and decoder embodiments to generate partition tree and performing decoding using fewer computational resources that traditional image and video coding techniques.
  • a method for video process includes determining, for a conversion between a current block of a video and a bitstream representation of the video, whether an extended quadtree (EQT) partitioning process is applicable to the current block based on a rule, and performing the conversion based on the determining.
  • the EQT partitioning process includes partitioning a given block into exactly four sub-blocks, at least one of which has a size different from half of width of the given block times half of a height of the given block, and the rule specifies a maximum depth of the EQT partitioning process based on an attribute associated with the current block.
  • a method of visual media processing includes performing, using a rule for using an extended quadtree (EQT) partitioning process, a conversion between a current block of visual media data and a corresponding bitstream representation of the block, wherein the EQT partitioning process includes partitioning a given block into exactly four sub blocks, at least one of which has a size different from half of width of the given block times half of a height of the given block, and wherein the rule specifies that in case that the rule is used for partitioning the current block, then each subblock is further split into a binary tree (BT) partitioning or another EQT partitioning, and both BT and the another EQT partitioning have depths that meet a pre-defined relationship.
  • EQT extended quadtree
  • a method of visual media processing includes performing, using a rule for using an extended quadtree (EQT) partitioning process, a conversion between a current block of visual media data and a corresponding bitstream representation of the block, wherein the EQT partitioning process includes partitioning a given block into exactly four sub-blocks, at least one of which has a size different from half of width of the given block times half of a height of the given block, and wherein the rule allows the EQT partitioning process for the current block based on a width or a height of the current block.
  • EQT extended quadtree
  • another method of visual media processing includes performing, using a rule for using an extended quadtree (EQT) partitioning process, a conversion between a current block of visual media data and a corresponding bitstream representation of the block, wherein the EQT partitioning process includes partitioning a given block into exactly four sub-blocks, at least one of which has a size different from half of width of the given block times half of a height of the given block, and wherein the rule allows the EQT partitioning process for the current block based on a position of the current block.
  • EQT extended quadtree
  • another method of visual media processing includes performing, using a rule for using an extended quadtree (EQT) partitioning process, a conversion between a current block of visual media data and a corresponding bitstream representation of the block, wherein the EQT partitioning process includes partitioning a given block into exactly four sub-blocks, at least one of which has a size different from half of width of the given block times half of a height of the given block, and wherein the rule allows a maximum depth of the EQT partitioning process to depend on a distance between a current picture of the current block and a reference picture for the current block or a quantization parameter of the current block or a temporal layer id of the current picture.
  • EQT extended quadtree
  • FIG. 1 shows a block diagram for an example implementation of video encoding and decoding.
  • FIG. 2 shows an example of macroblock (MB) partitioning according to the
  • AVC H.264/Audio Video Codec
  • FIG. 3 shows an example of modes for splitting a coding block (CB) into prediction blocks (PBs) subject to certain size constraints. E.g., intra-pictures are allowed to use only M x M and M/2 x M/2 sizes.
  • FIG. 4 shows an example of subdivisions of a coding tree block (CTB) into CBs and transform blocks (TBs).
  • CB coding tree block
  • TB transform blocks
  • solid lines indicate CB boundaries
  • dashed lines indicate TB boundaries.
  • Feft side is CTB with partitioning, and right side the corresponding quadtree.
  • FIG. 5 is an example illustration of a Quad Tree Binary Tree (QTBT) structure.
  • QTBT Quad Tree Binary Tree
  • FIG. 6 shows various examples of block partitioning.
  • FIG. 7A - 7K show examples of block partitioning.
  • FIG. 8A - 8D show examples of block partitioning.
  • FIG. 9A-9B shows an example of generalized triple tree partitioning (GTT).
  • FIG. 10 shows an example of syntax and semantics for versatile boundary partitioning.
  • FIG. 11 A - 11B shows an example of allowed EQT patterns that may be further split into EQT or BT.
  • FIG. 12 shows an example of binarization of partitioning.
  • FIG. 13A and 13B show examples of horizontal and vertical EQTs.
  • FIG. 14 shows an example hardware platform for implementing some disclosed methods.
  • FIG. 15 shows another example hardware platform for implementing some disclosed methods.
  • FIG. 16 is a flowchart of an example method of visual media processing.
  • FIG. 17 shows an example of 7 specific positions used in EQT partition early termination.
  • FIG. 18 is a block diagram of an example video processing system in which disclosed techniques may be implemented.
  • FIG. 19 is a flowchart representation of a method for video processing in accordance with the present disclosure.
  • the present document provides several techniques that can be embodied into digital video or image, collectively called visual media, encoders and decoders. Section headings are used in the present document for clarity of understanding and do not limit scope of the techniques and embodiments disclosed in each section only to that section.
  • This document is related to image/video coding, especially on the partition structure, i.e., how to split one Coding Tree Unit (CTU) into multiple Coding Units (CUs) and how to accelerate encoders to select the best partition structure. It may be applied to the existing video coding standard like HEVC, or the standard (Versatile Video Coding) to be finalized. It may be also applicable to future video coding standards or video codec.
  • CTU Coding Tree Unit
  • CUs Coding Units
  • Video coding standards have evolved primarily through the development of the well- known ITU-T and ISO/IEC standards.
  • the ITU-T produced H.261 and H.263, ISO/IEC produced MPEG-l and MPEG-4 Visual, and the two organizations jointly produced the
  • H.262/MPEG-2 Video and H.264/MPEG-4 Advanced Video Coding (A VC) and H.265/HEVC standards Since H.262, the video coding standards are based on the hybrid video coding structure wherein temporal prediction plus transform coding are utilized.
  • An example of a typical HEVC encoder framework is depicted in FIG. 1. [0035] 2.1 Partition tree structure in H.264/AVC
  • the core of the coding layer in previous standards was the macroblock, containing a 16x16 block of luma samples and, in the usual case of 4:2:0 color sampling, two corresponding 8x8 blocks of chroma samples.
  • An intra-coded block uses spatial prediction to exploit spatial correlation among pixels.
  • Two partitions are defined: 16x16 and 4x4.
  • An inter-coded block uses temporal prediction, instead of spatial prediction, by estimating motion among pictures.
  • Motion can be estimated independently for either 16x16 macroblock or any of its sub-macroblock partitions: 16x8, 8x16, 8x8, 8x4, 4x8, 4x4 (see FIG. 5). Only one motion vector (MV) per sub-macroblock partition is allowed.
  • FIG. 2 shows an example of MB partitions in H.264/AVC.
  • a CTU is split into CUs by using a quadtree structure denoted as coding tree to adapt to various local characteristics.
  • the decision whether to code a picture area using inter-picture (temporal) or intra-picture (spatial) prediction is made at the CU level.
  • Each CU can be further split into one, two or four PUs according to the PU splitting type. Inside one PU, the same prediction process is applied, and the relevant information is transmitted to the decoder on a PU basis.
  • a CU can be partitioned into transform units (TUs) according to another quadtree structure similar to the coding tree for the CU.
  • TUs transform units
  • Coding tree units and coding tree block (CTB) structure The analogous structure in HEVC is the coding tree unit (CTU), which has a size selected by the encoder and can be larger than a traditional macroblock.
  • the CTU consists of a luma CTB and the corresponding chroma CTBs and syntax elements.
  • HEVC then supports a partitioning of the CTBs into smaller blocks using a tree structure and quadtree-like signaling.
  • Coding units and coding blocks (CBs):
  • the quadtree syntax of the CTU specifies the size and positions of its luma and chroma CBs. The root of the quadtree is associated with the CTU. Hence, the size of the luma CTB is the largest supported size for a luma CB.
  • the splitting of a CTU into luma and chroma CBs is signaled jointly.
  • a CTB may contain only one CU or may be split to form multiple CUs, and each CU has an associated partitioning into prediction units (PUs) and a tree of transform units (TUs).
  • PUs prediction units
  • TUs tree of transform units
  • Prediction units and prediction blocks The decision whether to code a picture area using inter picture or intra picture prediction is made at the CU level.
  • a PU partitioning structure has its root at the CU level.
  • the luma and chroma CBs can then be further split in size and predicted from luma and chroma prediction blocks (PBs).
  • HEVC supports variable PB sizes from 64x64 down to 4x4 samples.
  • the prediction residual is coded using block transforms.
  • a TU tree structure has its root at the CU level.
  • the luma CB residual may be identical to the luma transform block (TB) or may be further split into smaller luma TBs. The same applies to the chroma TBs.
  • Integer basis functions similar to those of a discrete cosine transform (DCT) are defined for the square TB sizes 4x4, 8x8, 16x16, and 32x32.
  • DCT discrete cosine transform
  • an integer transform derived from a form of discrete sine transform (DST) is alternatively specified.
  • FIG. 3 shows an example of modes for splitting a coding block (CB) into prediction blocks (PBs) subject to certain size constraints. E.g., intra-pictures are allowed to use only M x M and M/2 x M/2 sizes.
  • FIG. 4 shows an example of subdivisions of a coding tree block (CTB) into CBs and transform blocks (TBs).
  • CB coding tree block
  • TB transform blocks
  • solid lines indicate CB boundaries
  • dashed lines indicate TB boundaries.
  • Feft side is CTB with partitioning, and right side the corresponding quadtree.
  • JVET Joint Video Exploration Team
  • the QTBT structure removes the concepts of multiple partition types, i.e. it removes the separation of the CU, PU and TU concepts, and supports more flexibility for CU partition shapes.
  • a CU can have either a square or rectangular shape.
  • a coding tree unit (CTU) is first partitioned by a quadtree structure.
  • the quadtree leaf nodes are further partitioned by a binary tree structure.
  • the binary tree leaf nodes are called coding units (CUs), and that segmentation is used for prediction and transform processing without any further partitioning.
  • CUs coding units
  • a CU sometimes consists of coding blocks (CBs) of different colour components, e.g. one CU contains one luma CB and two chroma CBs in the case of P and B slices of the 4:2:0 chroma format and sometimes consists of a CB of a single component, e.g., one CU contains only one luma CB or just two chroma CBs in the case of I slices.
  • CBs coding blocks
  • - CTU size the root node size of a quadtree, the same concept as in HEVC
  • the CTU size is set as 128x128 luma samples with two corresponding 64x64 blocks of chroma samples
  • the MinQTSize is set as 16x16
  • the MaxBTSize is set as 64x64
  • the MinBTSize (for both width and height) is set as 4x4
  • the MaxBTDepth is set as 4.
  • the quadtree partitioning is applied to the CTU first to generate quadtree leaf nodes.
  • the quadtree leaf nodes may have a size from 16x16 (e.g., the MinQTSize) to 128x128 (e.g., the CTU size).
  • the quadtree leaf node is 128x128, it will not be further split by the binary tree since the size exceeds the MaxBTSize (e.g., 64x64). Otherwise, the quadtree leaf node could be further partitioned by the binary tree. Therefore, the quadtree leaf node is also the root node for the binary tree and it has the binary tree depth as 0. When the binary tree depth reaches MaxBTDepth (e.g., 4), no further splitting is considered. When the binary tree node has width equal to MinBTSize (e.g., 4), no further horizontal splitting is considered. Similarly, when the binary tree node has height equal to MinBTSize, no further vertical splitting is considered. The leaf nodes of the binary tree are further processed by prediction and transform processing without any further partitioning. In the JEM, the maximum CTU size is 256x256 luma samples.
  • each splitting (i.e., non-leaf) node of the binary tree for example as shown in FIG. 5, one flag is signalled to indicate which splitting type (i.e., horizontal or vertical) is used, where 0 indicates horizontal splitting and 1 indicates vertical splitting.
  • the quadtree splitting there is no need to indicate the splitting type since quadtree splitting always splits a block both horizontally and vertically to produce 4 sub-blocks with an equal size.
  • the QTBT scheme supports the ability for the luma and chroma to have a separate QTBT structure.
  • the luma and chroma CTBs in one CTU share the same QTBT structure.
  • the luma CTB is partitioned into luma CUs by a QTBT structure
  • the chroma CTBs are partitioned into chroma CUs by another QTBT structure. This means that a CU in an I slice consists of a coding block of the luma component or coding blocks of two chroma components, and a CU in a P or B slice consists of coding blocks of all three colour components.
  • inter prediction for small blocks is restricted to reduce the memory access of motion compensation, such that bi-prediction is not supported for 4x8 and 8x4 blocks, and inter prediction is not supported for 4x4 blocks. In the QTBT of the JEM, these restrictions are removed.
  • FIG. 6 shows example of block partitioning patterns (a) quad-tree partitioning (b) vertical binary-tree partitioning (c) horizontal binary-tree partitioning (d) vertical center-side triple-tree partitioning (e) horizontal center-side triple-tree partitioning
  • a CTU is firstly partitioned by region tree (RT).
  • a RT leaf may be further split with prediction tree (PT).
  • PT prediction tree
  • a PT leaf may also be further split with PT until max PT depth is reached.
  • a PT leaf is the basic coding unit. It is still called CU for convenience.
  • a CU cannot be further split.
  • Prediction and transform are both applied on CU in the same way as JEM.
  • the whole partition structure is named‘multiple-type-tree’.
  • a extended quad tree (EQT) partitioning structure corresponding to a block partitioning process including an extended quad tree partitioning process for the block of video data, wherein the extended quad partitioning structure represents partitioning the block of video data into final sub-blocks, and when the extended quad tree partitioning process decides to apply extended quad tree partition to one given block, said one given block is always split into four sub-block; decoding the final sub-blocks based on the video bitstream; and decoding the block of video data based on the final sub-blocks decoded according to the EQT structure derived.
  • EQT is presented in the above-captioned patent application, incorporated by reference herein.
  • the EQT partitioning process can be applied to a given block recursively to generate EQT leaf nodes.
  • EQT when EQT is applied to a certain block, for each of the sub-block due to EQT, it may further be split into BT and/or QT and/or TT and/or EQT and/or other kinds of partition trees.
  • EQT and QT may share the same depth increment process and same restrictions of leaf node sizes.
  • the partitioning of one node could be implicitly terminated when the size of the node reaches a minimum allowed quad tree leaf node size or EQT depth associated with the node reaches a maximum allowed quad tree depth.
  • EQT and QT may share different depth increment process and/or restrictions of leaf node sizes.
  • the partitioning of one node by EQT is implicitly terminated when the size of the node reaches a minimum allowed EQT leaf node size or EQT depth associated with the node reaches a maximum allowed EQT depth.
  • the EQT depth and/or the minimum allowed EQT leaf node sizes may be signaled in sequences parameter set (SPS), and/or picture parameter set (PPS), and/or slice header, and/or CTU, and/or regions, and/or tiles, and/or CUs.
  • one block may be split equally into four partitions, such as M/4 x N or M x N/4 (examples are depicted in FIG. 7 A and FIG. 7B or split equally into four partitions and the partition size is dependent on the maximum and minimum values of M and N.
  • one 4x32 block may be split into four 4x8 sub-blocks while a 32x4 block may be split into four 8x4 sub-blocks.
  • one block may be split unequally into four partitions, such as two partitions are with size equal to (M*w0/w) x (N*h0/h) and the other two are with (M*(w-w0)/w) x (N*(h-h0)/h).
  • wO and w may be equal to 1 and 2, respectively that is the width is reduced by half while the height could use other ratios instead of 2: 1 to get the sub-blocks. Examples for this case are depicted in FIG. 7C and FIG. 7E.
  • hO and h may be equal to 1 and 2, respectively, that is the height is reduced by half while the width could use other ratios instead of 2: 1. Examples for this case are depicted in FIG. 7D and FIG. 7F.
  • FIG. 7G and 7H show two alternative examples of quad tree partitioning.
  • FIG. 71 shows a more general case of quad tree partitioning with different shapes of partitions
  • FIG. 7J and 7K show general examples of FIGs. 7A and 7B.
  • FIG. 7C shows a sub-block width fixed to be M/2, height equal to N/4 or 3N/4, smaller for top two partitions.
  • FIG. 7D shows a sub-block height fixed to be N/2, width equal to M/4 or 3M/4, smaller for left two partitions
  • FIG. 7E shows a sub-block width fixed to be M/2, height equal to 3N/4 or N/4, smaller for bottom two partitions.
  • FIG. 7F shows a sub-block height fixed to be N/2, width equal to 3M/4 or M/4, smaller for right two partitions.
  • the following example dimensions are shown FIG. 7G M x N/4 and M/2 x N/2;
  • FIG. 7H N x M/4 and N/2 x M/2
  • FIG. 71 MlxNl, (M- Ml)xNl, Mlx(N-Nl) and (M-Ml)x(N-Nl);
  • FIG. 7G M x N/4 and M/2 x N/2
  • FIG. 7H N x M/4 and N/2 x M/2
  • FIG. 71 MlxNl, (M- Ml)xNl, Mlx(N-Nl) and (M-Ml)x(N-Nl);
  • FIG. 7G M x N/4 and M/2
  • a flexible tree (FT) partitioning structure corresponding to a block partitioning process including an FT partitioning process for the block of video data, wherein the FT partitioning structure represents partitioning the block of video data into final sub-blocks, and when FT partitioning process decides to apply FT partition to one given block, said one given block is split into K sub-blocks wherein K could be larger than 4; decoding the final sub-blocks based on the video bitstream; and decoding the block of video data based on the final sub-blocks decoded according to the FT structure derived.
  • FT flexible tree
  • the FT partitioning process can be applied to a given block recursively to generate FT tree leaf nodes.
  • the partitioning of one node is implicitly terminated when the node reaches a minimum allowed FT leaf node size or FT depth associated with the node reaches a maximum allowed FT depth.
  • FT when FT is applied to a certain block, for each of the sub block due to FT, it may further be split into BT, and/or QT, and/or EQT, and/or TT, and/or other kinds of partition trees.
  • the FT depth or the minimum allowed FT leaf node sizes or the minimum allowed partition size for FT may be signaled in sequences parameter set (SPS), and/or picture parameter set (PPS), and/or slice header, and/or CTU, and/or regions, and/or tiles, and/or CUs.
  • SPS sequences parameter set
  • PPS picture parameter set
  • slice header and/or CTU, and/or regions, and/or tiles, and/or CUs.
  • all of the sub-blocks due to FT partitions may be with the same size; alternatively, the sizes of different sub-blocks may be different.
  • K is equal to 6 or 8.
  • a generalized TT (GTT) partition pattern may be defined as splitting for both horizontal and vertical. Examples are shown in FIG. 9A and 9B.
  • the proposed methods may be applied under certain conditions. In other words, when the condition(s) are not satisfied, there is no need to signal the partition types.
  • the proposed methods may be used to replace the existing partition tree types. Alternatively, furthermore, the proposed methods may be only used as a replacement under certain conditions.
  • the condition may include the picture and/or slice types; and/or block sizes; and/or the coded modes; and/or whether one block is located at picture/slice/tile boundary.
  • the proposed EQT may be treated in the same way as QT. In this case, when it is indicated that the partition tree type is QT, more flags/indications of the detailed quad-tree partition patterns may be further signaled. Alternatively, EQT may be treated as additional partition patterns.
  • the signaling of partitioning methods of EQT or FT or GTT may be conditional, e.g., one or some EQP/FT/GTT partitioning methods may not be used in some cases, and the bits corresponding to these partitioning methods are not signaled.
  • VVC Versatile Video Coding
  • the boundary partition method can use regular block partition syntax to keep the continuity CAB AC engine as well as matching the picture boundary
  • the versatile boundary partition obtains the following rules (both encoder and decoder):
  • forced QT is used to perform the FBP in the current forced partition level
  • partitions of EQT and QT/BT/TT There may be some redundancy between partitions of EQT and QT/BT/TT. For example, for a block with size of MxN, it may be split into vertical BT three times (firstly split to two M/2 *N partitions, then for each M/2*N partition, further apply vertical BT split) to get four M/4*N partitions. Also, to get four M/4xN partitions, the block could choose directly using EQT as FIG. 7B. There still exists a problem how to signal EQT efficiently.
  • Embodiments may include image or video encoder and decoders.
  • Example 1 When EQT is applied to a certain block, for each of the sub-block due to EQT, it may further be split into BT and/or EQT, and BT and EQT may share the same maximum depth value denoted by DBTM CIX (e.g., MaxBTDepth in section 2.3.1)
  • DBTM CIX e.g., MaxBTDepth in section 2.3.1
  • FIGs. 7A-7K only two kinds of EQT depicted in FIGs. 7A-7K may be allowed.
  • the two allowed EQT patterns are depicted in FIG. 1 1 A and 11B which shows an example of allowed EQT patterns which may be further split to EQT or BT.
  • one allowed EQT pattern may include a top partition that is full width and one-fourth height, followed by two side-by-side partitions of half width and half height of the block, followed by a bottom partition of full width and one-fourth the height of the block (e.g., FIG. 11 A).
  • Another allowed partition includes a left portion of full height and one-fourth width, followed by two partitions with half width and half height vertically stacked over each other, followed by a right partition that is full height and one- fourth width (e.g., FIG. 1 1B). It will be appreciated that, in one aspect, each partition has equal area.
  • BT when BT is applied to a certain block, for each of the sub-blocks due to BT, it may further be split into BT and/or EQT, and BT and EQT may share the same maximum depth value.
  • EQT and BT may use different depth increment process. For example, when each block may be assigned with a depth value denoted by DBT (DBI may start from 0). If one block (with depth value equal to DBT ) is split with EQT, each of the sub-block’s depth value is set to DBT+2.
  • the maximum depth value allowed for EQT may be set to the sum of maximum depth value allowed for QT and maximum depth value allowed for BT.
  • maximumly allowed root node size for EQT may be signaled from the encoder to the decoder in VPS/SPS/PPS/picture header/slice header/tile group header/tile/CTU.
  • Example 3 One flag is firstly signaled to indicate whether it is BT or EQT before signaling the direction of BT or EQT (e.g., horizontal or vertical), and one more flag may be further signaled to indicate it uses horizontal or vertical splitting direction.
  • FIG. 12 the binarization of partitioning is shown in FIG. 12.
  • Table 1 shows example bin value for each bin index. It should be noted that it is equivalent to exchange all the “0” and“1” in the table.
  • direction of BT or EQT is defined to be parallel or perpendicular to the current split direction.
  • one flag may be firstly signaled to indicate whether QT or EQT or non-(EQT and QT) is used. If non-(EQT and QT) is selected, BT splitting information may be further signaled.
  • Example 4 The flag to indicate whether EQT or BT is used may be context coded, and the context is dependent on the depth information of both current block’s and its neighboring blocks
  • the neighboring blocks may be defined as the above and left blocks relative to the current block.
  • both the quad-tree depth and BT/EQT depth may be utilized in the coding of the flag.
  • One variable Dctx can be derived for each block based on its depth information, e.g., it is set to (2*QT depth + BT/EQT depth). In some embodiments, (2*QT depth + BT/EQT depth) may be further quantized before being used for context selection.
  • the context index is defined as: ( (Dctx of above block > Dctx of current block)? 1 : 0 ) + ( (Dctx of left block > Dctx of current block)? 1 : 0 ). In some embodiments, when a neighboring block is not available, its associated Dctx is set to 0.
  • Example 5 In some embodiments, whether to and how to apply EQT split may depend on the width and height (denoted as W and H) of the block to be split.
  • Tl and/or T2 can be signaled from the encoder to the decoder in VPS/SPS/PPS/picture header/slice header/tile group header/tile/CTU.
  • Tl and/or T2 can be signaled from the encoder to the decoder in VPS/SPS/PPS/picture header/slice header/tile group header/tile/CTU.
  • Tl and/or T2 can be signaled from the encoder to the decoder in VPS/SPS/PPS/picture header/slice header/tile group header/tile/CTU.
  • Tl and/or T2 can be signaled from the encoder to the decoder in VPS/SPS/PPS/picture header/slice header/tile group header/tile/CTU.
  • T can be signaled from the encoder to the decoder in VPS/SPS/PPS/picture header/slice header/tile group header/tile/CTU.
  • T can be signaled from the encoder to the decoder in VPS/SPS/PPS/picture header/slice header/tile group header/tile/CTU.
  • T can be signaled from the encoder to the decoder in VPS/SPS/PPS/picture header/slice header/tile group header/tile/CTU.
  • T can be signaled from the encoder to the decoder in VPS/SPS/PPS/picture header/slice header/tile group header/tile/CTU.
  • T can be signaled from the encoder to the decoder in VPS/SPS/PPS/picture header/slice header/tile group header/tile/CTU.
  • T can be signaled from the encoder to the decoder in VPS/SPS/PPS/picture header/slice header/tile group header/tile/CTU.
  • T can be signaled from the encoder to the decoder in VPS/SPS/PPS/picture header/slice header/tile group header/tile/CTU.
  • T can be signaled from the encoder to the decoder in VPS/SPS/PPS/picture header/slice header/tile group header/tile/CTU.
  • EQT splitting is not allowed.
  • whether to and how to apply EQT split may depend on the width and/or height (denoted as W and H) of the sub-block due to EQT splitting of one block.
  • W and H width and/or height
  • Example 6 In some embodiments, whether to and how to apply EQT split may depend on the position of the block to be split.
  • whether to and how to apply EQT split may depend on whether the current block to be split is at the border of the picture or not.
  • (x,y) is the coordinate of the top-left position of the current block
  • (PW, PH) are the width and height of the picture
  • (W, H) are the width and height of a block with the current QT depth and BT/EQT depth. Then the current block is at the bottom border when y+H>PH; the current block is at the right border when x+W>PW; and the current block is at the bottom-right corner border when y+H>PH and x+W >PW.
  • no EQT splits are allowed when the current block is at the bottom border.
  • no EQT splits are allowed when the current block is at the right border.
  • horizontal EQT as shown in FIG. 11 A is not allowed when the current block is at the bottom border.
  • horizontal EQT as shown in FIG. 11 A is not allowed when the current block is at the right border.
  • horizontal EQT as shown in FIG. 11 A is not allowed when the current block is at the bottom-right corner border.
  • vertical EQT as shown in FIG. 11B is not allowed when the current block is at the bottom border.
  • vertical EQT as shown in FIG. 11B is not allowed when the current block is at the right border.
  • vertical EQT as shown in FIG. 11B is not allowed when the current block is at the bottom-right corner border.
  • horizontal EQT and horizontal BT may be allowed when the current block is at the bottom border.
  • vertical EQT and vertical BT may be allowed when the current block is at the right border.
  • Example 7 When one or some kinds of EQT is not allowed, the followings may apply. [00149] In one example, the parsing procedure depends on whether one or some kinds of EQT is not allowed. The corresponding syntax elements related to EQT are not signaled if one or some kinds of EQT is not allowed.
  • the parsing procedure does not depend on whether one or some kinds of EQT is not allowed.
  • the corresponding syntax elements related to EQT are signaled no matter one or some kinds of EQT is allowed or not.
  • a conformance encoder does not signal one or some kinds of EQT if they are not allowed.
  • a conformance decoder can interpret EQT as some other kinds of split such as QT, BT or not split when it parses out an EQT split but that kind of EQT is not allowed.
  • Example 8 The maximum EQT depth may depend on an attribute associated with the current block and/or the current picture. In one example, the maximum EQT depth may depend on the distance between current picture and the reference picture, e.g., Picture Order Count (POC) difference.
  • POC Picture Order Count
  • the maximum EQT depth may depend on the temporal layer identifier of the current picture.
  • the maximum EQT depth may depend on whether the current picture is referenced by other pictures or not.
  • the maximum EQT depth may depend on the quantization parameter(s).
  • Rate-distortion optimization is a method of improving video quality in video compression. The method optimizes the amount of distortion (loss of video quality) against the amount of data required to encode the video. Rate-distortion cost estimation is useful for many H.264/advanced video coding (A VC) applications including rate-distortion optimization (RDO) for mode-decision and rate-control.
  • a VC H.264/advanced video coding
  • RDO rate-distortion optimization
  • at the certain BT/EQT depth when the best mode of current block and its neighboring blocks are both skip mode, there is no need to further check the rate-distortion cost calculation for further splitting.
  • Example 10 In some embodiments, the average EQT depth for EQT-split blocks of previously coded pictures/slices/tiles is recorded. When coding the current video unit, there is no need to further check the rate-distortion cost calculation for larger EQT depth as compared to the recorded average depth.
  • the average EQT value may be recorded for each temporal layer. In this case, for each video data to be coded, it only utilizes the recorded average value for the same temporal layer.
  • Example 11 In some embodiments, the average size for EQT-split blocks of previously coded pictures/slices/tiles is recorded when coding the current video unit, there is no need to further check the rate-distortion cost calculation for smaller block sizes compared to the recorded block size.
  • the average EQT block size may be recorded for each temporal layer. In this case, for each video data to be coded, it only utilizes the recorded average value for the same temporal layer.
  • only the average EQT block size for the first temporal layer is recorded. In this case, for each video data to be coded, it always utilizes the recorded average value for the first temporal layer.
  • Example 12 In some embodiments, whether to check an EQT splitting that has not been checked may depend on the depth of QT/BT/EQT splitting that have already been checked for the current block.
  • whether to skip checking of the EQT splitting partitions or not may depend on the average QT/BT splitting depth of current block
  • the average splitting depth is estimated after checking BT horizontal splitting
  • the average splitting depth is estimated after checking BT vertical splitting
  • the average splitting depth is estimated after checking BT horizontal and vertical splitting
  • block depths are collected at some specific positions coordinating with EQT splitting characteristic and then averaged.
  • One example of 7 specific positions is depicted in FIG. 17.
  • all horizontal lines represent quarter splitting of the block in horizontal direction and vertical lines depict quarter splitting of blocks in vertical direction.
  • a threshold is calculated as a function of the estimated average splitting depth and the current splitting depth.
  • a threshold table is applied to find the threshold.
  • the estimated average splitting depth and the current splitting depth is used as the key to fetch a corresponding threshold stored in the table.
  • the above method is applicable to certain slice/picture type, such as I slices or I pictures.
  • FIG. 13A and 13B show example of quadtree partitioning for horizotnal EQT parti tionss and vertical EQT partitions.
  • FIG. 14 is a block diagram illustrating an example of the architecture for a computer system or other control device 1400 that can be utilized to implement various portions of the presently disclosed technology.
  • the computer system 1400 includes one or more processors 1405 and memory 1410 connected via an interconnect 1425.
  • the interconnect 1425 may represent any one or more separate physical buses, point to point connections, or both, connected by appropriate bridges, adapters, or controllers.
  • the interconnect 1425 therefore, may include, for example, a system bus, a Peripheral Component Interconnect (PCI) bus, a PCI bus, a PCI bus, a PCI bus, a PCI bus, a PCI bus, a PCI bus, a PCI bus, a PCI bus, a PCI bus, a PCI bus, a PCI bus, a PCI bus, a PCI bus, a PCI bus, a PCI bus, a PCI bus, a PCI bus, a PCI bus, a PCI
  • ISA industry standard architecture
  • SCSI small computer system interface
  • USB universal serial bus
  • IIC I2C
  • the processor(s) 1405 may include central processing units (CPUs) to control the overall operation of, for example, the host computer. In certain embodiments, the processor(s) 1405 accomplish this by executing software or firmware stored in memory 1410.
  • the processor(s) 1405 may be, or may include, one or more programmable general-purpose or special-purpose microprocessors, digital signal processors (DSPs), programmable controllers, application specific integrated circuits (ASICs), programmable logic devices (PLDs), or the like, or a combination of such devices.
  • the memory 1410 can be or include the main memory of the computer system.
  • the memory 1410 represents any suitable form of random access memory (RAM), read-only memory (ROM), flash memory, or the like, or a combination of such devices.
  • RAM random access memory
  • ROM read-only memory
  • flash memory or the like, or a combination of such devices.
  • the memory 1410 may contain, among other things, a set of machine instructions which, when executed by processor 1405, causes the processor 1405 to perform operations to implement embodiments of the presently disclosed technology.
  • the network adapter 1415 provides the computer system 1400 with the ability to communicate with remote devices, such as the storage clients, and/or other storage servers, and may be, for example, an Ethernet adapter or Fiber Channel adapter.
  • FIG. 15 shows a block diagram of an example embodiment of a device 1500 that can be utilized to implement various portions of the presently disclosed technology.
  • the mobile device 1500 can be a laptop, a smartphone, a tablet, a camcorder, or other types of devices that are capable of processing videos.
  • the mobile device 1500 includes a processor or controller 1501 to process data, and memory 1502 in communication with the processor 1501 to store and/or buffer data.
  • the processor 1501 can include a central processing unit (CPU) or a microcontroller unit (MCU).
  • the processor 1501 can include a field-programmable gate-array (FPGA).
  • FPGA field-programmable gate-array
  • the mobile device 1500 includes or is in communication with a graphics processing unit (GPU), video processing unit (VPU) and/or wireless communications unit for various visual and/or communications data processing functions of the smartphone device.
  • the memory 1502 can include and store processor-executable code, which when executed by the processor 1501, configures the mobile device 1500 to perform various operations, e.g., such as receiving information, commands, and/or data, processing information and data, and transmitting or providing processed information/data to another device, such as an actuator or external display.
  • the memory 1502 can store information and data, such as instructions, software, values, images, and other data processed or referenced by the processor 1501.
  • the mobile device 1500 includes an input/output (I/O) unit 1503 to interface the processor 1501 and/or memory 1502 to other modules, units or devices.
  • I/O unit 1503 can interface the processor 1501 and memory 1502 with to utilize various types of wireless interfaces compatible with typical data communication standards, e.g., such as between the one or more computers in the cloud and the user device.
  • the mobile device 1500 can interface with other devices using a wired connection via the I/O unit 1503.
  • the mobile device 1500 can also interface with other external interfaces, such as data storage, and/or visual or audio display devices 1504, to retrieve and transfer data and information that can be processed by the processor, stored in the memory, or exhibited on an output unit of a display device 1504 or an external device.
  • the display device 1504 can display a video frame in accordance with the disclosed technology.
  • FIG. 16 is a flowchart of a method 1600 of visual media processing.
  • the method 1600 includes performing (1602), using a rule for using an extended quadtree (EQT) partitioning process, a conversion between a current block of visual media data and a corresponding bitstream representation of the block, wherein the EQT partitioning process includes partitioning a given block into exactly four sub-blocks, at least one of which has a size different from half of width of the given block times half of a height of the given block, and wherein the rule specifies that in case that the rule is used for partitioning the current block, then each subblock is further split into a binary tree (BT) partitioning or another EQT partitioning, and both BT and the another EQT partitioning have depths that meet a pre-defined relationship.
  • EQT extended quadtree
  • Another method of visual media processing includes performing, using a rule for using an extended quadtree (EQT) partitioning process, a conversion between a current block of visual media data and a corresponding bitstream representation of the block, wherein the EQT partitioning process includes partitioning a given block into exactly four sub-blocks, at least one of which has a size different from half of width of the given block times half of a height of the given block, and wherein the rule allows the EQT partitioning process for the current block based on a width or a height of the current block.
  • EQT extended quadtree
  • Another method of visual media processing includes performing, using a rule for using an extended quadtree (EQT) partitioning process, a conversion between a current block of visual media data and a corresponding bitstream representation of the block, wherein the EQT partitioning process includes partitioning a given block into exactly four sub-blocks, at least one of which has a size different from half of width of the given block times half of a height of the given block, and wherein the rule allows the EQT partitioning process for the current block based on a position of the current block.
  • EQT extended quadtree
  • Another method of visual media processing includes performing, using a rule for using an extended quadtree (EQT) partitioning process, a conversion between a current block of visual media data and a corresponding bitstream representation of the block, wherein the EQT partitioning process includes partitioning a given block into exactly four sub-blocks, at least one of which has a size different from half of width of the given block times half of a height of the given block, and wherein the rule allows a maximum depth of the EQT partitioning process to depend on a distance between a current picture of the current block and a reference picture for the current block or a quantization parameter of the current block or a temporal layer id of the current picture.
  • EQT extended quadtree
  • the bitstream representation of a current block of video may include bits of a bitstream (compressed representation of a video) that may be non contiguous and may depend on header information, as is known in the art of video compression.
  • a current block may include samples representative of one or more of luma and chroma components, or rotational variations thereof (e.g, YCrCb or YUV, and so on).
  • a method of visual media processing comprising: performing, using a rule for using an extended quadtree (EQT) partitioning process, a conversion between a current block of visual media data and a corresponding bitstream representation of the block, wherein the EQT partitioning process includes partitioning a given block into exactly four sub-blocks, at least one of which has a size different from half of width of the given block times half of a height of the given block; and wherein the rule specifies that in case that the rule is used for partitioning the current block, then each subblock is further split into a binary tree (BT) partitioning or another EQT partitioning, and both BT and the another EQT partitioning have depths that meet a pre defined relationship.
  • EQT extended quadtree
  • M and N are integers and wherein the two possible partitions include a first partitioning comprising an M x N/4 top portion, followed by two side-by-side M/2 x N/2 middle portions, followed by an M x N/4 bottom portions, or a second partitioning comprising an M/4 x N left portion, two M/2 x N/2 middle portions and one M/4 x N right portion.
  • bitstream representation is configured to include a first field indicative of partitioning of the current block between EQT partitioning or BT partitioning and a second field indicative of a splitting direction for the current block between horizontal and vertical directions.
  • a method of visual media processing comprising: performing, using a rule for using an extended quadtree (EQT) partitioning process, a conversion between a current block of visual media data and a corresponding bitstream representation of the block, wherein the EQT partitioning process includes partitioning a given block into exactly four sub-blocks, at least one of which has a size different from half of width of the given block times half of a height of the given block; and wherein the rule allows the EQT partitioning process for the current block based on a width or a height of the current block.
  • EQT extended quadtree
  • a method of visual media processing comprising: performing, using a rule for using an extended quadtree (EQT) partitioning process, a conversion between a current block of visual media data and a corresponding bitstream representation of the block, wherein the EQT partitioning process includes partitioning a given block into exactly four sub-blocks, at least one of which has a size different from half of width of the given block times half of a height of the given block; and wherein the rule allows the EQT partitioning process for the current block based on a position of the current block.
  • EQT extended quadtree
  • a method of visual media processing comprising: performing, using a rule for using an extended quadtree (EQT) partitioning process, a conversion between a current block of visual media data and a corresponding bitstream representation of the block, wherein the EQT partitioning process includes partitioning a given block into exactly four sub-blocks, at least one of which has a size different from half of width of the given block times half of a height of the given block; and wherein the rule specifies (1) a maximum depth of the EQT partitioning process to depend on a distance between a current picture of the current block and a reference picture for the current block or a quantization parameter of the current block or a temporal layer id of the current picture, or (2) using splitting depth of the current block of visual media data in deciding to check for additional EQT partitioning.
  • EQT extended quadtree
  • a video processing apparatus comprising a processor configured to implement a method recited in any one or more of clauses 1 to 50.
  • a computer readable media which includes a program comprising code for a processor to carry out a method recited in any one or more of clauses 1 to 50.
  • the partitioning techniques may be specified using parameter sets (picture or video parameter sets) or pre-specified based on a rule. Accordingly, the number of bits needed to signal partitioning of blocks may be reduced. Similarly, the partitioning decision may also be simplified due to the various rules specified in this document, thereby allowing for lower complexity implementations of encoders or decoders.
  • the position dependency of the partitioning rule may be based on a video region in which the current block is present (e.g., clause 26).
  • the video region may include the current block or a larger portion such as a tile, or a slice or a picture in which the current block is present.
  • FIG. 18 is a block diagram showing an example video processing system 1800 in which various techniques disclosed herein may be implemented. V arious implementations may include some or all of the components of the system 1800.
  • the system 1800 may include input 1802 for receiving video content.
  • the video content may be received in a raw or uncompressed format, e.g., 8 or 10 bit multi-component pixel values, or may be in a compressed or encoded format.
  • the input 1802 may represent a network interface, a peripheral bus interface, or a storage interface. Examples of network interface include wired interfaces such as Ethernet, passive optical network (PON), etc. and wireless interfaces such as Wi-Fi or cellular interfaces.
  • the system 1800 may include a coding component 1804 that may implement the various coding or encoding methods described in the present document.
  • the coding component 1804 may reduce the average bitrate of video from the input 1802 to the output of the coding component 1804 to produce a coded representation of the video.
  • the coding techniques are therefore sometimes called video compression or video transcoding techniques.
  • the output of the coding component 1804 may be either stored, or transmitted via a communication connected, as represented by the component 1806.
  • the stored or communicated bitstream (or coded) representation of the video received at the input 1802 may be used by the component 1808 for generating pixel values or displayable video that is sent to a display interface 1810.
  • the process of generating user-viewable video from the bitstream representation is sometimes called video decompression.
  • certain video processing operations are referred to as “coding” operations or tools, it will be appreciated that the coding tools or operations are used at an encoder and corresponding decoding tools or operations that reverse the results of the coding will be performed by
  • Examples of a peripheral bus interface or a display interface may include universal serial bus (USB) or high definition multimedia interface (HDMI) or Displayport, and so on.
  • Examples of storage interfaces include SATA (serial advanced technology attachment), PCI,
  • IDE interface and the like.
  • the techniques described in the present document may be embodied in various electronic devices such as mobile phones, laptops, smartphones or other devices that are capable of performing digital data processing and/or video display.
  • FIG. 19 is a flowchart representation of a method 1900 for video processing in accordance with the present disclosure.
  • the method 1900 includes, at operation 1902, determining, for a conversion between a current block of a video and a bitstream representation of the video, whether an extended quadtree (EQT) partitioning process is applicable to the current block based on a rule.
  • the EQT partitioning process includes partitioning a given block into exactly four sub-blocks, at least one of which has a size different from half of width of the given block times half of a height of the given block.
  • the rule specifies a maximum depth of the EQT partitioning process based on an attribute associated with the current block.
  • the method 1900 includes, at operation 1904, performing the conversion based on the determining.
  • the conversion includes generating the current block from the bitstream representation.
  • the conversion also includes generating the bitstream representation from the current block.
  • the attribute comprises a distance between a current picture of the current block and a reference picture for the current block. In some embodiments, the attribute comprises a difference of a Picture Order Count (POC). In some embodiments, the attribute comprises a temporal layer identifier of a current picture of the current block. In some embodiments, the attribute comprises whether a current picture of the current block is referenced by other pictures of the video. In some embodiments, the attribute comprises a quantization parameter of the current block.
  • POC Picture Order Count
  • the rule further specifies, at a predefined coding depth of the current block, to disable a subsequent partitioning process in case that the current block and neighboring blocks are coded using a skip mode. In some embodiments, the rule further specifies to disable a subsequent partitioning process in case that the current block and neighboring blocks are coded using a merge mode. In some embodiments, the rule further specifies to disable a subsequent partitioning process in case a parent block of the current block is coded using a skip mode.
  • the rule further specifies to disable the EQT partitioning process in case a coding depth of the current block is larger than an average coding depth of previously coded blocks.
  • the average coding depth of the previously coded blocks can be calculated based on a binary tree partitioning process, a quadtree partitioning process, or the EQT partitioning process.
  • a threshold representing a relationship between the coding depth of the current block and the average coding depth of the previously coded blocks is compared against a table of threshold values to determine whether the EQT partitioning process is disabled.
  • the average coding depth is calculated over previously encoded pictures, slices, or tiles.
  • the average coding depth is calculated for a temporal layer in which the current block is positioned. In some embodiments, the average coding depth is calculated only for a first temporal layer of the video. In some embodiments, the average coding depth is calculated based on a binary tree horizontal splitting. In some embodiments, the average coding depth is calculated based on a binary tree vertical splitting. In some embodiments, the average coding depth is determined at a number of specific positions in the current block. In some embodiments, the number of specific positions is equal to 7.
  • the rule further specifies to disable the EQT partitioning process in case a size of the currently block is smaller than an average size of previously coded blocks.
  • the average size is calculated for blocks of previously encoded pictures, slices, or tiles.
  • the average size is calculated for blocks of a temporal layer in which the current block is positioned.
  • the average size is calculated only for blocks of a first temporal layer of the video.
  • Some embodiments of the disclosed technology include making a decision or determination to enable a video processing tool or mode.
  • the encoder when the video processing tool or mode is enabled, the encoder will use or implement the tool or mode in the processing of a block of video, but may not necessarily modify the resulting bitstream based on the usage of the tool or mode. That is, a conversion from the block of video to the bitstream representation of the video will use the video processing tool or mode when it is enabled based on the decision or determination.
  • the decoder when the video processing tool or mode is enabled, the decoder will process the bitstream with the knowledge that the bitstream has been modified based on the video processing tool or mode. That is, a conversion from the bitstream representation of the video to the block of video will be performed using the video processing tool or mode that was enabled based on the decision or determination.
  • Some embodiments of the disclosed technology include making a decision or determination to disable a video processing tool or mode.
  • the encoder will not use the tool or mode in the conversion of the block of video to the bitstream representation of the video.
  • the decoder will process the bitstream with the knowledge that the bitstream has not been modified using the video processing tool or mode that was enabled based on the decision or determination.
  • the disclosed and other embodiments, modules and the functional operations described in this document can be implemented in digital electronic circuitry, or in computer software, firmware, or hardware, including the structures disclosed in this document and their structural equivalents, or in combinations of one or more of them.
  • the disclosed and other embodiments can be implemented as one or more computer program products, i.e., one or more modules of computer program instructions encoded on a computer readable medium for execution by, or to control the operation of, data processing apparatus.
  • the computer readable medium can be a machine -readable storage device, a machine -readable storage substrate, a memory device, a composition of matter effecting a machine-readable propagated signal, or a combination of one or more them.
  • data processing apparatus encompasses all apparatus, devices, and machines for processing data, including by way of example a programmable processor, a computer, or multiple processors or computers.
  • the apparatus can include, in addition to hardware, code that creates an execution environment for the computer program in question, e.g., code that constitutes processor firmware, a protocol stack, a database management system, an operating system, or a combination of one or more of them.
  • a propagated signal is an artificially generated signal, e.g., a machine-generated electrical, optical, or electromagnetic signal, that is generated to encode information for transmission to suitable receiver apparatus.
  • a computer program (also known as a program, software, software application, script, or code) can be written in any form of programming language, including compiled or interpreted languages, and it can be deployed in any form, including as a stand-alone program or as a module, component, subroutine, or other unit suitable for use in a computing environment.
  • a computer program does not necessarily correspond to a file in a file system.
  • a program can be stored in a portion of a file that holds other programs or data (e.g., one or more scripts stored in a markup language document), in a single file dedicated to the program in question, or in multiple coordinated files (e.g., files that store one or more modules, sub programs, or portions of code).
  • a computer program can be deployed to be executed on one computer or on multiple computers that are located at one site or distributed across multiple sites and interconnected by a communication network.
  • processors suitable for the execution of a computer program include, by way of example, both general and special purpose microprocessors, and any one or more processors of any kind of digital computer.
  • a processor will receive instructions and data from a read only memory or a random-access memory or both.
  • the essential elements of a computer are a processor for performing instructions and one or more memory devices for storing instructions and data.
  • a computer will also include, or be operatively coupled to receive data from or transfer data to, or both, one or more mass storage devices for storing data, e.g., magnetic, magneto optical disks, or optical disks.
  • mass storage devices for storing data, e.g., magnetic, magneto optical disks, or optical disks.
  • a computer need not have such devices.
  • Computer readable media suitable for storing computer program instructions and data include all forms of non-volatile memory, media and memory devices, including by way of example semiconductor memory devices, e.g., EPROM, EEPROM, and flash memory devices; magnetic disks, e.g., internal hard disks or removable disks; magneto optical disks; and CD ROM and DVD-ROM disks.
  • semiconductor memory devices e.g., EPROM, EEPROM, and flash memory devices
  • magnetic disks e.g., internal hard disks or removable disks
  • magneto optical disks e.g., CD ROM and DVD-ROM disks.
  • the processor and the memory can be supplemented by, or incorporated in, special purpose logic circuitry.

Landscapes

  • Engineering & Computer Science (AREA)
  • Multimedia (AREA)
  • Signal Processing (AREA)
  • Compression Or Coding Systems Of Tv Signals (AREA)

Abstract

Un procédé de traitement d'une vidéo comprend les étapes consistant à : déterminer, pour une conversion entre un bloc actuel d'une vidéo et une représentation en flux binaire de la vidéo, si un processus de partitionnement d'un arbre quaternaire étendu (EQT) est applicable au bloc actuel sur la base d'une règle ; et effectuer la conversion sur la base de la détermination. Le processus de partitionnement d'EQT suppose le partitionnement d'un bloc donné en exactement quatre sous-blocs. Au moins un des sous-blocs a une taille non égale à la moitié de la largeur du bloc donné multipliée par la moitié de la hauteur du bloc donné. La règle spécifie une profondeur maximale du processus de partitionnement d'EQT sur la base d'un attribut associé au bloc actuel.
PCT/IB2019/059219 2018-10-26 2019-10-28 Procédés rapides de décision d'arbre de partition WO2020084604A1 (fr)

Applications Claiming Priority (4)

Application Number Priority Date Filing Date Title
CN2018111990 2018-10-26
CNPCT/CN2018/111990 2018-10-26
CN2018119316 2018-12-05
CNPCT/CN2018/119316 2018-12-05

Publications (1)

Publication Number Publication Date
WO2020084604A1 true WO2020084604A1 (fr) 2020-04-30

Family

ID=68470574

Family Applications (1)

Application Number Title Priority Date Filing Date
PCT/IB2019/059219 WO2020084604A1 (fr) 2018-10-26 2019-10-28 Procédés rapides de décision d'arbre de partition

Country Status (2)

Country Link
CN (1) CN111107368B (fr)
WO (1) WO2020084604A1 (fr)

Families Citing this family (3)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN112396016B (zh) * 2020-11-26 2021-07-23 武汉宏数信息技术有限责任公司 一种基于大数据技术的人脸识别系统
WO2022213966A1 (fr) * 2021-04-05 2022-10-13 Beijing Bytedance Network Technology Co., Ltd. Contraintes de partitionnement basées sur le voisinage
CN113518220B (zh) * 2021-04-15 2023-07-25 中山大学 基于导向滤波和边缘检测的帧内划分方法、装置及介质

Citations (3)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN104519362B (zh) * 2014-12-23 2017-10-17 电子科技大学 一种邻帧预测深度相似性的视频编码方法
WO2018088805A1 (fr) * 2016-11-08 2018-05-17 주식회사 케이티 Procédé et appareil de traitement de signal vidéo
EP3383045A1 (fr) * 2017-03-27 2018-10-03 Thomson Licensing Priorisation de multiples divisions pour un codage rapide

Family Cites Families (9)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
KR20120045369A (ko) * 2010-10-29 2012-05-09 에스케이 텔레콤주식회사 쿼드 트리 기반으로 블록 필터 정보를 부호화/복호화하는 영상 부호화/복호화 장치 및 방법
EP3024240A4 (fr) * 2013-07-18 2017-03-22 Samsung Electronics Co., Ltd. Procédé de prédiction intra-scène d'image de profondeur pour appareil et procédé de décodage et de codage vidéo inter-couches
CN104935940B (zh) * 2014-03-17 2018-02-16 寰发股份有限公司 基于深度的块分割的信号传递方法
WO2015192314A1 (fr) * 2014-06-17 2015-12-23 Mediatek Singapore Pte. Ltd. Procédé simplifié pour le partitionnement de blocs en fonction de la profondeur
CN112954368B (zh) * 2015-03-13 2024-05-24 Lg电子株式会社 处理视频信号的方法及其设备
WO2017219342A1 (fr) * 2016-06-24 2017-12-28 Mediatek Inc. Procédés de signalisation de paramètre de quantification pour structure d'arbre binaire et arbre quaternaire (qtbt)
US10779004B2 (en) * 2016-10-12 2020-09-15 Mediatek Inc. Methods and apparatuses of constrained multi-type-tree block partition for video coding
US20180139444A1 (en) * 2016-11-16 2018-05-17 Mediatek Inc. Method and Apparatus of Video Coding Using Flexible Quadtree and Binary Tree Block Partitions
US20180242024A1 (en) * 2017-02-21 2018-08-23 Mediatek Inc. Methods and Apparatuses of Candidate Set Determination for Quad-tree Plus Binary-tree Splitting Blocks

Patent Citations (4)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN104519362B (zh) * 2014-12-23 2017-10-17 电子科技大学 一种邻帧预测深度相似性的视频编码方法
WO2018088805A1 (fr) * 2016-11-08 2018-05-17 주식회사 케이티 Procédé et appareil de traitement de signal vidéo
US20190313129A1 (en) * 2016-11-08 2019-10-10 Kt Corporation Method and apparatus for processing video signal
EP3383045A1 (fr) * 2017-03-27 2018-10-03 Thomson Licensing Priorisation de multiples divisions pour un codage rapide

Non-Patent Citations (2)

* Cited by examiner, † Cited by third party
Title
MISRA K ET AL: "Description of SDR and HDR video coding technology proposal by Sharp and Foxconn", 10. JVET MEETING; 10-4-2018 - 20-4-2018; SAN DIEGO; (THE JOINT VIDEO EXPLORATION TEAM OF ISO/IEC JTC1/SC29/WG11 AND ITU-T SG.16 ); URL: HTTP://PHENIX.INT-EVRY.FR/JVET/,, no. JVET-J0026-v9, 13 April 2018 (2018-04-13), XP030151194 *
WU F ET AL: "Description of SDR video coding technology proposal by University of Science and Technology of China, Peking University, Harbin Institute of Technology, and Wuhan University (IEEE 1857.10 Study Group)", 10. JVET MEETING; 10-4-2018 - 20-4-2018; SAN DIEGO; (THE JOINT VIDEO EXPLORATION TEAM OF ISO/IEC JTC1/SC29/WG11 AND ITU-T SG.16 ); URL: HTTP://PHENIX.INT-EVRY.FR/JVET/,, no. JVET-J0032-v4, 12 April 2018 (2018-04-12), XP030151203 *

Also Published As

Publication number Publication date
CN111107368B (zh) 2024-05-14
CN111107368A (zh) 2020-05-05

Similar Documents

Publication Publication Date Title
US11570482B2 (en) Restriction of extended quadtree
US11388476B2 (en) Block partitioning method and apparatus
US11632575B2 (en) Binarization of partitioning related to extended quadtree
CN113519164A (zh) 对分割结构的限制
US11228761B2 (en) Border handling for extended quadtree partitions
CN110662050B (zh) 处理视频数据的方法、装置及存储介质
WO2020084604A1 (fr) Procédés rapides de décision d'arbre de partition

Legal Events

Date Code Title Description
121 Ep: the epo has been informed by wipo that ep was designated in this application

Ref document number: 19798731

Country of ref document: EP

Kind code of ref document: A1

NENP Non-entry into the national phase

Ref country code: DE

32PN Ep: public notification in the ep bulletin as address of the adressee cannot be established

Free format text: NOTING OF LOSS OF RIGHTS PURSUANT TO RULE 112(1) EPC (EPO FORM 1205A DATED 18/08/2021)

122 Ep: pct application non-entry in european phase

Ref document number: 19798731

Country of ref document: EP

Kind code of ref document: A1