WO2021138605A1 - Methods and apparatuses for video coding using triangle partition - Google Patents

Methods and apparatuses for video coding using triangle partition Download PDF

Info

Publication number
WO2021138605A1
WO2021138605A1 PCT/US2020/067735 US2020067735W WO2021138605A1 WO 2021138605 A1 WO2021138605 A1 WO 2021138605A1 US 2020067735 W US2020067735 W US 2020067735W WO 2021138605 A1 WO2021138605 A1 WO 2021138605A1
Authority
WO
WIPO (PCT)
Prior art keywords
list
prediction
uni
merge
candidate
Prior art date
Application number
PCT/US2020/067735
Other languages
English (en)
French (fr)
Inventor
Xianglin Wang
Yi-Wen Chen
Xiaoyu XIU
Tsung-Chuan MA
Hong-Jheng Jhu
Bing Yu
Original Assignee
Beijing Dajia Internet Information Technology Co., Ltd.
Priority date (The priority date is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the date listed.)
Filing date
Publication date
Application filed by Beijing Dajia Internet Information Technology Co., Ltd. filed Critical Beijing Dajia Internet Information Technology Co., Ltd.
Priority to CN202080089982.XA priority Critical patent/CN114982230A/zh
Publication of WO2021138605A1 publication Critical patent/WO2021138605A1/en

Links

Classifications

    • HELECTRICITY
    • H04ELECTRIC COMMUNICATION TECHNIQUE
    • H04NPICTORIAL COMMUNICATION, e.g. TELEVISION
    • H04N19/00Methods or arrangements for coding, decoding, compressing or decompressing digital video signals
    • H04N19/90Methods or arrangements for coding, decoding, compressing or decompressing digital video signals using coding techniques not provided for in groups H04N19/10-H04N19/85, e.g. fractals
    • H04N19/96Tree coding, e.g. quad-tree coding
    • HELECTRICITY
    • H04ELECTRIC COMMUNICATION TECHNIQUE
    • H04NPICTORIAL COMMUNICATION, e.g. TELEVISION
    • H04N19/00Methods or arrangements for coding, decoding, compressing or decompressing digital video signals
    • H04N19/50Methods or arrangements for coding, decoding, compressing or decompressing digital video signals using predictive coding
    • H04N19/503Methods or arrangements for coding, decoding, compressing or decompressing digital video signals using predictive coding involving temporal prediction
    • H04N19/51Motion estimation or motion compensation
    • H04N19/513Processing of motion vectors
    • HELECTRICITY
    • H04ELECTRIC COMMUNICATION TECHNIQUE
    • H04NPICTORIAL COMMUNICATION, e.g. TELEVISION
    • H04N19/00Methods or arrangements for coding, decoding, compressing or decompressing digital video signals
    • H04N19/50Methods or arrangements for coding, decoding, compressing or decompressing digital video signals using predictive coding
    • H04N19/503Methods or arrangements for coding, decoding, compressing or decompressing digital video signals using predictive coding involving temporal prediction
    • H04N19/51Motion estimation or motion compensation
    • H04N19/573Motion compensation with multiple frame prediction using two or more reference frames in a given prediction direction

Definitions

  • the present application generally relates to video coding and compression, and in particular but not limited to, methods and apparatus for motion compensated prediction using triangular prediction unit (i.e., a special case of geometric partition prediction unit) in video coding.
  • triangular prediction unit i.e., a special case of geometric partition prediction unit
  • an apparatus for video coding with geometric partition including:
  • FIG. 5 is a schematic diagram illustrating multi-type tree splitting modes in accordance with some implementations of the present disclosure.
  • FIG. 10 is a schematic diagram illustrating candidate positions for a temporal merge candidate in accordance with some implementations of the present disclosure.
  • FIG. 11 A to FIG. 1 IB are schematic diagrams illustrating examples of uni -prediction motion vector (MV) selection for triangle prediction mode in accordance with some implementations of the present disclosure.
  • MV motion vector
  • FIG. 19 is a block diagram illustrating an exemplary apparatus for video coding in accordance with some implementations of the present disclosure.
  • FIG. 20 is a flowchart illustrating an exemplary process of video coding for motion compensated prediction using geometric prediction unit in accordance with some implementations of the present disclosure.
  • module may include memory (shared, dedicated, or group) that stores code or instructions that may be executed by one or more processors.
  • a module may include one or more circuits with or without stored code or instructions.
  • the module or circuit may include one or more components that are directly or indirectly connected. These components may or may not be physically attached to, or located adjacent to, one another.
  • a unit or module may be implemented purely by software, purely by hardware, or by a combination of hardware and software. In a pure software implementation, for example, the unit or module may include functionally related code blocks or software components, that are directly or indirectly linked together, so as to perform a particular function.
  • intra prediction is usually based on unfiltered reconstructed pixels, while inter prediction is based on filtered reconstructed pixels if these filter options are turned on by the encoder 100.
  • FIG. 3 An example of block partitioning by using the QTBT scheme, and the corresponding tree representation are illustrated in FIG. 3.
  • the solid lines indicate quadtree splitting and dotted lines indicate binary tree splitting.
  • the coding tree unit (CTU) 300 is first partitioned by a quadtree structure, and three of the four quadtree leaf nodes 302, 304, 306, 308 are further partitioned by either a quadtree structure or a binary tree structure.
  • the quadtree leaf node 306 is further partitioned by quadtree splitting;
  • the quadtree leaf node 304 is further partitioned into two leaf nodes 304a, 304b by binary tree splitting; and the quadtree leaf node 302 is also further partitioned by binary tree splitting.
  • the luma and chroma CTBs in one CTU share the same QTBT structure.
  • the luma CTB is partitioned into CUs by a QTBT structure
  • the chroma CTBs are partitioned into chroma CUs by another QTBT structure.
  • a CU in an I slice consists of a coding block of the luma component or coding blocks of two chroma components
  • a CU in a P or B slice consists of coding blocks of all three colour components.
  • VVC Versatile Video Coding
  • VTM1 VVC Test Model 1
  • a CU Under the affine prediction mode, a CU may be split into multiple 4x4 PUs for prediction. Motion vectors may be derived for each 4x4 PU and motion compensation may be performed accordingly on the 4x4 PU.
  • a CU Under the ATMVP mode, a CU may be split into one or multiple 8x8 PUs for prediction. Motion vectors are derived for each 8x8 PU and motion compensation may be performed accordingly on the 8x8 PU.
  • Under the triangle prediction mode a CU may be split into two triangular shape prediction units. Motion vectors are derived for each PU and motion compensation is performed accordingly. The triangle prediction mode is supported for inter prediction. More details of the triangle prediction mode are illustrated below.
  • FIG. 7 is a schematic diagram illustrating positions of the neighboring blocks in accordance with some implementations of the present disclosure.
  • Both the left block and the top block have a triangle prediction flag of 0;
  • the scaled motion vector for the temporal merge candidate is scaled from the motion vector of the co-located PU col PU using the POC distances, tb and td, where tb is defined to be the POC difference between the reference picture of the current picture curr ref and the current picture curr_pic and td is defined to be the POC difference between the reference picture of the co-located picture col ref and the co-located picture col_pic.
  • the reference picture index of the temporal merge candidate is set equal to zero.
  • a practical realization of the scaling process is described in the HEVC draft specification. For a B-slice, two motion vectors, one for reference picture List 0 and the other for reference picture List 1, are obtained and combined to make the bi-predictive merge candidate.
  • a motion vector pruning process may be performed to make sure that the new motion vector to be added is different from those motion vectors already in the uni-directional prediction merge list.
  • Such motion vector pruning process may also be performed in a partial manner for lower complexity, e.g., checking the new motion vector to be added only against some but not all motion vectors already in the uni-directional prediction merge list.
  • no motion vector pruning i.e., motion vector comparison operation is performed in the process.
  • prediction List 0 motion vectors of the candidates in the first merge list are checked and selected into the uni-directional prediction merge list first, followed by prediction List 1 motion vectors of those candidates; and if the uni-directional prediction merge list is still not full, uni prediction zero motion vectors may be added.
  • prediction List 0 motion vectors of the candidates in the first merge list are used as the uni-directional prediction merge candidates, indexed according to the same index order as they are in the first merge list. That is, upon determining that the POC of the current picture is greater than each one of the POCs of the reference pictures, the reference indices are arranged according to a same order of List 0 motion vectors of the candidates in the first merge list.
  • prediction List 0 motion vectors of the candidates in the first merge list are used as the uni-directional prediction merge candidates, indexed according to the same index order as they are in the first merge list. Otherwise, if the current picture uses backward prediction, List 0 and List 1 motion vectors of each candidate in the first merge list are used as the uni-directional prediction merge candidates, indexed based on an interleaving manner as described above, i.e., List 0 motion vector of the first candidate in the first merge list followed by List 1 motion vector of the first candidate, and then List 0 motion vector of the second candidate followed by list 1 motion vector of the second candidate, and so on.
  • the motion vector plus certain motion offset is indexed as the uni-directional prediction merge candidate following the motion vector of the candidate.
  • uni-prediction merge candidates may be selected for triangle prediction according to the following rules:
  • a candidate’s List x (with x being 0 or 1) motion vector is preferred and selected first if it is available, and an example where x is equal to 0 is shown in FIG. 16B. Similar to other examples explained earlier, in the case where a uni-prediction motion vector is not available for a certain candidate in the first merge list, the corresponding uni-prediction motion vector of the same candidate from the other reference list is selected. As a result, for each candidate in the first merge list, a uni-prediction motion vector can be obtained and used for triangle and/or geometric prediction mode. The obtained uni-prediction motion vector shares the same index value as the merge candidate in the first merge list.
  • up to 2N uni-prediction motion vectors may be used for each of the two triangular partitions.
  • the two merge index values for the two partitions under triangle prediction mode do not have to be different from each other. In other words, they may take the same value.
  • the index values are signaled directly without adjustment before signaling. More specifically, unlike what is defined in the current VVC, the second index value is signaled to the decoder directly without performing any adjustment to the value prior to signaling.
  • 1 IB is used in determining from which prediction list a motion vector of a merge candidate is selected for triangle mode prediction. In other words, if indexl is an even value, List 0 motion vector of the candidate indicated by indexl is selected, and if indexl is an odd value, List 1 motion vector of the candidate indicated by indexl is selected. For index2, if it is an even value, List 1 motion vector is selected, and if it is an odd value, List 0 motion vector is selected. In the case where a motion vector corresponding to certain prediction list does not exist, certain default motion vector may be used instead, e.g. a zero motion vector, or the corresponding motion vector from the other prediction list, etc. It is also possible that the pattern shown in FIG. 1 IB is used for indexl and that shown in FIG.
  • a list-0 zero vector and a list-1 zero vector may be separately added into the uni-directional prediction merge list until the list is full.
  • motion vector pruning operations are performed to see if the motion vector is the same as any of the motion vectors already selected in that uni-directional prediction merge list. If the pruning operations conclude that the motion vector is different from those motion vectors it is compared against, this motion vector is added into the uni-directional prediction merge list. Otherwise, the motion vector is not selected or added into that list. If the total number of motion vector pruning operations performed reaches N, then no more pruning operations are performed in selecting remaining motion vectors into that list.
  • motion vector pruning operations are performed only among the first K candidates from the first merge list, where K is a positive integer.
  • K is a positive integer.
  • motion vector pruning operations would be performed while checking the first K motion vectors in the order indicated by those arrows. Starting from the (K+l)th candidate, no motion vector pruning operations will be performed in selecting the remaining uni-prediction motion vectors into that uni-directional prediction merge list.
  • the sensor component 1914 may further include an acceleration sensor, a gyroscope sensor, a magnetic sensor, a pressure sensor, or a temperature sensor.
  • the communication component 1916 is configured to facilitate wired or wireless communication between the apparatus 1900 and other devices.
  • the apparatus 1900 may access a wireless network based on a communication standard, such as WiFi, 4G, or a combination thereof.
  • the communication component 1916 receives a broadcast signal or broadcast related information from an external broadcast management system via a broadcast channel.
  • the communication component 1916 may further include a Near Field Communication (NFC) module for promoting short-range communication.
  • the NFC module may be implemented based on Radio Frequency Identification (RFID) technology, infrared data association (IrDA) technology, Ultra-Wide Band (UWB) technology, Bluetooth (BT) technology and other technology.
  • RFID Radio Frequency Identification
  • IrDA infrared data association
  • UWB Ultra-Wide Band
  • Bluetooth Bluetooth
  • step 2008 the processor 1920 constructs a uni-directional prediction merge list based on the uni-prediction MVs.

Landscapes

  • Engineering & Computer Science (AREA)
  • Multimedia (AREA)
  • Signal Processing (AREA)
  • Compression Or Coding Systems Of Tv Signals (AREA)
PCT/US2020/067735 2019-12-31 2020-12-31 Methods and apparatuses for video coding using triangle partition WO2021138605A1 (en)

Priority Applications (1)

Application Number Priority Date Filing Date Title
CN202080089982.XA CN114982230A (zh) 2019-12-31 2020-12-31 用于使用三角形分区的视频编解码的方法和装置

Applications Claiming Priority (2)

Application Number Priority Date Filing Date Title
US201962956112P 2019-12-31 2019-12-31
US62/956,112 2019-12-31

Publications (1)

Publication Number Publication Date
WO2021138605A1 true WO2021138605A1 (en) 2021-07-08

Family

ID=76686910

Family Applications (1)

Application Number Title Priority Date Filing Date
PCT/US2020/067735 WO2021138605A1 (en) 2019-12-31 2020-12-31 Methods and apparatuses for video coding using triangle partition

Country Status (2)

Country Link
CN (1) CN114982230A (zh)
WO (1) WO2021138605A1 (zh)

Citations (3)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
WO2013056656A1 (en) * 2011-10-19 2013-04-25 Mediatek Inc. Method and apparatus for derivation of motion vector predictor candidate set
US20130294513A1 (en) * 2012-05-07 2013-11-07 Qualcomm Incorporated Inter layer merge list construction for video coding
US20140169472A1 (en) * 2012-12-19 2014-06-19 Mikhail Fludkov Motion estimation engine for video encoding

Family Cites Families (11)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
JP2013042225A (ja) * 2011-08-11 2013-02-28 Jvc Kenwood Corp 動画像復号装置、動画像復号方法及び動画像復号プログラム
US20130177083A1 (en) * 2012-01-05 2013-07-11 Qualcomm Incorporated Motion vector candidate index signaling in video coding
US9426463B2 (en) * 2012-02-08 2016-08-23 Qualcomm Incorporated Restriction of prediction units in B slices to uni-directional inter prediction
US9451277B2 (en) * 2012-02-08 2016-09-20 Qualcomm Incorporated Restriction of prediction units in B slices to uni-directional inter prediction
US20130343459A1 (en) * 2012-06-22 2013-12-26 Nokia Corporation Method and apparatus for video coding
US9479778B2 (en) * 2012-08-13 2016-10-25 Qualcomm Incorporated Device and method for coding video information using base layer motion vector candidate
KR101908249B1 (ko) * 2014-11-18 2018-10-15 미디어텍 인크. 단방향 예측 및 병합 후보로부터의 모션 벡터에 기초한 양방향 예측 비디오 코딩 방법
EP3357245A4 (en) * 2015-11-05 2019-03-13 MediaTek Inc. METHOD AND DEVICE OF INTERPRESSATION USING AN AVERAGE MOTION VECTOR FOR VIDEO CODING
US20180242024A1 (en) * 2017-02-21 2018-08-23 Mediatek Inc. Methods and Apparatuses of Candidate Set Determination for Quad-tree Plus Binary-tree Splitting Blocks
US11394992B2 (en) * 2018-03-14 2022-07-19 Hfi Innovation Inc. Methods and apparatuses of generating average candidates in video coding systems
CN110312130B (zh) * 2019-06-25 2021-10-15 浙江大华技术股份有限公司 基于三角模式的帧间预测、视频编码方法及设备

Patent Citations (3)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
WO2013056656A1 (en) * 2011-10-19 2013-04-25 Mediatek Inc. Method and apparatus for derivation of motion vector predictor candidate set
US20130294513A1 (en) * 2012-05-07 2013-11-07 Qualcomm Incorporated Inter layer merge list construction for video coding
US20140169472A1 (en) * 2012-12-19 2014-06-19 Mikhail Fludkov Motion estimation engine for video encoding

Non-Patent Citations (2)

* Cited by examiner, † Cited by third party
Title
A. ROBERT (TECHNICOLOR), T. POIRIER (TECHNICOLOR), F. LELEANNEC (TECHNICOLOR), F. GALPIN (TECHNICOLOR): "CE4-related: CE4-4.5 pruning reduction in TPM using regular merge candidates", 14. JVET MEETING; 20190319 - 20190327; GENEVA; (THE JOINT VIDEO EXPLORATION TEAM OF ISO/IEC JTC1/SC29/WG11 AND ITU-T SG.16 ), 15 March 2019 (2019-03-15), XP030203251 *
X. WANG (KWAI), Y.-W. CHEN (KWAI), X. XIU (KWAI), T.-C. MA (KWAI INC.): "CE4-related: An improved method for triangle merge list construction", 14. JVET MEETING; 20190319 - 20190327; GENEVA; (THE JOINT VIDEO EXPLORATION TEAM OF ISO/IEC JTC1/SC29/WG11 AND ITU-T SG.16 ), no. JVET-N0340_r1, 19 March 2019 (2019-03-19), pages 1 - 3, XP030203697 *

Also Published As

Publication number Publication date
CN114982230A (zh) 2022-08-30

Similar Documents

Publication Publication Date Title
US11134261B2 (en) Methods and apparatus of video coding for triangle prediction
US20220021894A1 (en) Methods and apparatuses for signaling of merge modes in video coding
US20240323354A1 (en) Methods and apparatuses for video coding with triangle prediction
US20240314352A1 (en) Methods and apparatus of video coding for triangle prediction
EP4277277A2 (en) Methods and apparatus of video coding for triangle prediction
US20230089782A1 (en) Methods and apparatuses for video coding using geometric partition
US20220239902A1 (en) Methods and apparatuses for video coding using triangle partition
WO2021138605A1 (en) Methods and apparatuses for video coding using triangle partition
CN113994672B (zh) 用于利用三角形预测进行视频编解码的方法和装置
WO2020236991A1 (en) Methods and apparatuses for video coding using triangle partition
WO2021003126A1 (en) Methods and apparatuses for video coding using triangle partition

Legal Events

Date Code Title Description
121 Ep: the epo has been informed by wipo that ep was designated in this application

Ref document number: 20911289

Country of ref document: EP

Kind code of ref document: A1

NENP Non-entry into the national phase

Ref country code: DE

122 Ep: pct application non-entry in european phase

Ref document number: 20911289

Country of ref document: EP

Kind code of ref document: A1