JP2024510433A - ビデオ圧縮のための時間的構造ベースの条件付き畳み込みニューラルネットワーク - Google Patents

ビデオ圧縮のための時間的構造ベースの条件付き畳み込みニューラルネットワーク Download PDF

Info

Publication number
JP2024510433A
JP2024510433A JP2023554294A JP2023554294A JP2024510433A JP 2024510433 A JP2024510433 A JP 2024510433A JP 2023554294 A JP2023554294 A JP 2023554294A JP 2023554294 A JP2023554294 A JP 2023554294A JP 2024510433 A JP2024510433 A JP 2024510433A
Authority
JP
Japan
Prior art keywords
conditional
block
video
encoding
bitstream
Prior art date
Legal status (The legal status is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the status listed.)
Pending
Application number
JP2023554294A
Other languages
English (en)
Japanese (ja)
Other versions
JP2024510433A5 (https=
JPWO2022197772A5 (https=
Inventor
ラカペ、ファビアン
ビゲイン、ジョーン
フェルトマン、サイモン
プシュパラジャ、アクシェイ
Original Assignee
ヴィド スケール インコーポレイテッド
Priority date (The priority date is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the date listed.)
Filing date
Publication date
Application filed by ヴィド スケール インコーポレイテッド filed Critical ヴィド スケール インコーポレイテッド
Publication of JP2024510433A publication Critical patent/JP2024510433A/ja
Publication of JP2024510433A5 publication Critical patent/JP2024510433A5/ja
Publication of JPWO2022197772A5 publication Critical patent/JPWO2022197772A5/ja
Pending legal-status Critical Current

Links

Classifications

    • HELECTRICITY
    • H04ELECTRIC COMMUNICATION TECHNIQUE
    • H04NPICTORIAL COMMUNICATION, e.g. TELEVISION
    • H04N19/00Methods or arrangements for coding, decoding, compressing or decompressing digital video signals
    • H04N19/50Methods or arrangements for coding, decoding, compressing or decompressing digital video signals using predictive coding
    • H04N19/503Methods or arrangements for coding, decoding, compressing or decompressing digital video signals using predictive coding involving temporal prediction
    • H04N19/51Motion estimation or motion compensation
    • H04N19/537Motion estimation other than block-based
    • HELECTRICITY
    • H04ELECTRIC COMMUNICATION TECHNIQUE
    • H04NPICTORIAL COMMUNICATION, e.g. TELEVISION
    • H04N19/00Methods or arrangements for coding, decoding, compressing or decompressing digital video signals
    • H04N19/10Methods or arrangements for coding, decoding, compressing or decompressing digital video signals using adaptive coding
    • H04N19/169Methods or arrangements for coding, decoding, compressing or decompressing digital video signals using adaptive coding characterised by the coding unit, i.e. the structural portion or semantic portion of the video signal being the object or the subject of the adaptive coding
    • H04N19/184Methods or arrangements for coding, decoding, compressing or decompressing digital video signals using adaptive coding characterised by the coding unit, i.e. the structural portion or semantic portion of the video signal being the object or the subject of the adaptive coding the unit being bits, e.g. of the compressed video stream
    • HELECTRICITY
    • H04ELECTRIC COMMUNICATION TECHNIQUE
    • H04NPICTORIAL COMMUNICATION, e.g. TELEVISION
    • H04N19/00Methods or arrangements for coding, decoding, compressing or decompressing digital video signals
    • H04N19/50Methods or arrangements for coding, decoding, compressing or decompressing digital video signals using predictive coding
    • H04N19/503Methods or arrangements for coding, decoding, compressing or decompressing digital video signals using predictive coding involving temporal prediction
    • H04N19/51Motion estimation or motion compensation
    • H04N19/573Motion compensation with multiple frame prediction using two or more reference frames in a given prediction direction
    • HELECTRICITY
    • H04ELECTRIC COMMUNICATION TECHNIQUE
    • H04NPICTORIAL COMMUNICATION, e.g. TELEVISION
    • H04N19/00Methods or arrangements for coding, decoding, compressing or decompressing digital video signals
    • H04N19/50Methods or arrangements for coding, decoding, compressing or decompressing digital video signals using predictive coding
    • H04N19/503Methods or arrangements for coding, decoding, compressing or decompressing digital video signals using predictive coding involving temporal prediction
    • H04N19/51Motion estimation or motion compensation
    • H04N19/58Motion compensation with long-term prediction, i.e. the reference frame for a current frame not being the temporally closest one
    • HELECTRICITY
    • H04ELECTRIC COMMUNICATION TECHNIQUE
    • H04NPICTORIAL COMMUNICATION, e.g. TELEVISION
    • H04N19/00Methods or arrangements for coding, decoding, compressing or decompressing digital video signals
    • H04N19/80Details of filtering operations specially adapted for video compression, e.g. for pixel interpolation
    • H04N19/82Details of filtering operations specially adapted for video compression, e.g. for pixel interpolation involving filtering within a prediction loop
    • HELECTRICITY
    • H04ELECTRIC COMMUNICATION TECHNIQUE
    • H04NPICTORIAL COMMUNICATION, e.g. TELEVISION
    • H04N19/00Methods or arrangements for coding, decoding, compressing or decompressing digital video signals
    • H04N19/90Methods or arrangements for coding, decoding, compressing or decompressing digital video signals using coding techniques not provided for in groups H04N19/10-H04N19/85, e.g. fractals
    • HELECTRICITY
    • H04ELECTRIC COMMUNICATION TECHNIQUE
    • H04NPICTORIAL COMMUNICATION, e.g. TELEVISION
    • H04N19/00Methods or arrangements for coding, decoding, compressing or decompressing digital video signals
    • H04N19/10Methods or arrangements for coding, decoding, compressing or decompressing digital video signals using adaptive coding
    • H04N19/102Methods or arrangements for coding, decoding, compressing or decompressing digital video signals using adaptive coding characterised by the element, parameter or selection affected or controlled by the adaptive coding
    • H04N19/117Filters, e.g. for pre-processing or post-processing
    • HELECTRICITY
    • H04ELECTRIC COMMUNICATION TECHNIQUE
    • H04NPICTORIAL COMMUNICATION, e.g. TELEVISION
    • H04N19/00Methods or arrangements for coding, decoding, compressing or decompressing digital video signals
    • H04N19/10Methods or arrangements for coding, decoding, compressing or decompressing digital video signals using adaptive coding
    • H04N19/102Methods or arrangements for coding, decoding, compressing or decompressing digital video signals using adaptive coding characterised by the element, parameter or selection affected or controlled by the adaptive coding
    • H04N19/132Sampling, masking or truncation of coding units, e.g. adaptive resampling, frame skipping, frame interpolation or high-frequency transform coefficient masking
    • HELECTRICITY
    • H04ELECTRIC COMMUNICATION TECHNIQUE
    • H04NPICTORIAL COMMUNICATION, e.g. TELEVISION
    • H04N19/00Methods or arrangements for coding, decoding, compressing or decompressing digital video signals
    • H04N19/10Methods or arrangements for coding, decoding, compressing or decompressing digital video signals using adaptive coding
    • H04N19/169Methods or arrangements for coding, decoding, compressing or decompressing digital video signals using adaptive coding characterised by the coding unit, i.e. the structural portion or semantic portion of the video signal being the object or the subject of the adaptive coding
    • H04N19/17Methods or arrangements for coding, decoding, compressing or decompressing digital video signals using adaptive coding characterised by the coding unit, i.e. the structural portion or semantic portion of the video signal being the object or the subject of the adaptive coding the unit being an image region, e.g. an object
    • HELECTRICITY
    • H04ELECTRIC COMMUNICATION TECHNIQUE
    • H04NPICTORIAL COMMUNICATION, e.g. TELEVISION
    • H04N19/00Methods or arrangements for coding, decoding, compressing or decompressing digital video signals
    • H04N19/10Methods or arrangements for coding, decoding, compressing or decompressing digital video signals using adaptive coding
    • H04N19/169Methods or arrangements for coding, decoding, compressing or decompressing digital video signals using adaptive coding characterised by the coding unit, i.e. the structural portion or semantic portion of the video signal being the object or the subject of the adaptive coding
    • H04N19/17Methods or arrangements for coding, decoding, compressing or decompressing digital video signals using adaptive coding characterised by the coding unit, i.e. the structural portion or semantic portion of the video signal being the object or the subject of the adaptive coding the unit being an image region, e.g. an object
    • H04N19/172Methods or arrangements for coding, decoding, compressing or decompressing digital video signals using adaptive coding characterised by the coding unit, i.e. the structural portion or semantic portion of the video signal being the object or the subject of the adaptive coding the unit being an image region, e.g. an object the region being a picture, frame or field
    • HELECTRICITY
    • H04ELECTRIC COMMUNICATION TECHNIQUE
    • H04NPICTORIAL COMMUNICATION, e.g. TELEVISION
    • H04N19/00Methods or arrangements for coding, decoding, compressing or decompressing digital video signals
    • H04N19/10Methods or arrangements for coding, decoding, compressing or decompressing digital video signals using adaptive coding
    • H04N19/169Methods or arrangements for coding, decoding, compressing or decompressing digital video signals using adaptive coding characterised by the coding unit, i.e. the structural portion or semantic portion of the video signal being the object or the subject of the adaptive coding
    • H04N19/177Methods or arrangements for coding, decoding, compressing or decompressing digital video signals using adaptive coding characterised by the coding unit, i.e. the structural portion or semantic portion of the video signal being the object or the subject of the adaptive coding the unit being a group of pictures [GOP]
    • HELECTRICITY
    • H04ELECTRIC COMMUNICATION TECHNIQUE
    • H04NPICTORIAL COMMUNICATION, e.g. TELEVISION
    • H04N19/00Methods or arrangements for coding, decoding, compressing or decompressing digital video signals
    • H04N19/10Methods or arrangements for coding, decoding, compressing or decompressing digital video signals using adaptive coding
    • H04N19/169Methods or arrangements for coding, decoding, compressing or decompressing digital video signals using adaptive coding characterised by the coding unit, i.e. the structural portion or semantic portion of the video signal being the object or the subject of the adaptive coding
    • H04N19/186Methods or arrangements for coding, decoding, compressing or decompressing digital video signals using adaptive coding characterised by the coding unit, i.e. the structural portion or semantic portion of the video signal being the object or the subject of the adaptive coding the unit being a colour or a chrominance component
    • HELECTRICITY
    • H04ELECTRIC COMMUNICATION TECHNIQUE
    • H04NPICTORIAL COMMUNICATION, e.g. TELEVISION
    • H04N19/00Methods or arrangements for coding, decoding, compressing or decompressing digital video signals
    • H04N19/50Methods or arrangements for coding, decoding, compressing or decompressing digital video signals using predictive coding
    • H04N19/503Methods or arrangements for coding, decoding, compressing or decompressing digital video signals using predictive coding involving temporal prediction
    • H04N19/51Motion estimation or motion compensation
    • H04N19/577Motion compensation with bidirectional frame interpolation, i.e. using B-pictures
    • HELECTRICITY
    • H04ELECTRIC COMMUNICATION TECHNIQUE
    • H04NPICTORIAL COMMUNICATION, e.g. TELEVISION
    • H04N19/00Methods or arrangements for coding, decoding, compressing or decompressing digital video signals
    • H04N19/60Methods or arrangements for coding, decoding, compressing or decompressing digital video signals using transform coding
    • H04N19/61Methods or arrangements for coding, decoding, compressing or decompressing digital video signals using transform coding in combination with predictive coding
    • HELECTRICITY
    • H04ELECTRIC COMMUNICATION TECHNIQUE
    • H04NPICTORIAL COMMUNICATION, e.g. TELEVISION
    • H04N19/00Methods or arrangements for coding, decoding, compressing or decompressing digital video signals
    • H04N19/70Methods or arrangements for coding, decoding, compressing or decompressing digital video signals characterised by syntax aspects related to video coding, e.g. related to compression standards

Landscapes

  • Engineering & Computer Science (AREA)
  • Multimedia (AREA)
  • Signal Processing (AREA)
  • Compression Or Coding Systems Of Tv Signals (AREA)
JP2023554294A 2021-03-18 2022-03-16 ビデオ圧縮のための時間的構造ベースの条件付き畳み込みニューラルネットワーク Pending JP2024510433A (ja)

Applications Claiming Priority (3)

Application Number Priority Date Filing Date Title
US202163162791P 2021-03-18 2021-03-18
US63/162,791 2021-03-18
PCT/US2022/020504 WO2022197772A1 (en) 2021-03-18 2022-03-16 Temporal structure-based conditional convolutional neural networks for video compression

Publications (3)

Publication Number Publication Date
JP2024510433A true JP2024510433A (ja) 2024-03-07
JP2024510433A5 JP2024510433A5 (https=) 2025-04-18
JPWO2022197772A5 JPWO2022197772A5 (https=) 2025-04-18

Family

ID=81328100

Family Applications (1)

Application Number Title Priority Date Filing Date
JP2023554294A Pending JP2024510433A (ja) 2021-03-18 2022-03-16 ビデオ圧縮のための時間的構造ベースの条件付き畳み込みニューラルネットワーク

Country Status (6)

Country Link
US (1) US20240187640A1 (https=)
EP (1) EP4309366A1 (https=)
JP (1) JP2024510433A (https=)
CN (1) CN116998154A (https=)
MX (1) MX2023010960A (https=)
WO (1) WO2022197772A1 (https=)

Cited By (2)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
JP2024511587A (ja) * 2021-04-01 2024-03-14 ホアウェイ・テクノロジーズ・カンパニー・リミテッド ニューラルネットワークベースのピクチャ処理における補助情報の独立した配置
JP2024513693A (ja) * 2021-04-01 2024-03-27 ホアウェイ・テクノロジーズ・カンパニー・リミテッド ピクチャデータ処理ニューラルネットワークに入力される補助情報の構成可能な位置

Families Citing this family (2)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
EP4480174A4 (en) * 2022-02-17 2026-02-11 Op Solutions Llc VIDEO CODING SYSTEMS AND METHODS FOR DEVICES USING AN AUTOCODER
FR3153177A1 (fr) * 2023-09-14 2025-03-21 Orange Procédé et dispositif de codage et décodage d’images.

Citations (1)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
WO2020016857A1 (en) * 2018-07-20 2020-01-23 Beijing Bytedance Network Technology Co., Ltd. Motion prediction based on updated motion vectors

Family Cites Families (3)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
US9485515B2 (en) * 2013-08-23 2016-11-01 Google Inc. Video coding using reference motion vectors
US10887597B2 (en) * 2015-06-09 2021-01-05 Qualcomm Incorporated Systems and methods of determining illumination compensation parameters for video coding
CN113348472B (zh) * 2019-01-23 2026-04-14 谷歌有限责任公司 具有软内核选择的卷积神经网络

Patent Citations (1)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
WO2020016857A1 (en) * 2018-07-20 2020-01-23 Beijing Bytedance Network Technology Co., Ltd. Motion prediction based on updated motion vectors

Non-Patent Citations (1)

* Cited by examiner, † Cited by third party
Title
ZEQIANG LI, ET AL.: "AHG11: Updated information on inter-predictioncoding tool with deep neural network", JOINT VIDEO EXPERTS TEAM (JVET) OF ITU-T SG 16 WP 3 AND ISO/IEC JTC 1/SC 29, vol. [JVET-U0087-v2] (version 2), JPN6025051228, 5 January 2021 (2021-01-05), ISSN: 0005754066 *

Cited By (5)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
JP2024511587A (ja) * 2021-04-01 2024-03-14 ホアウェイ・テクノロジーズ・カンパニー・リミテッド ニューラルネットワークベースのピクチャ処理における補助情報の独立した配置
JP2024513693A (ja) * 2021-04-01 2024-03-27 ホアウェイ・テクノロジーズ・カンパニー・リミテッド ピクチャデータ処理ニューラルネットワークに入力される補助情報の構成可能な位置
JP7641402B2 (ja) 2021-04-01 2025-03-06 ホアウェイ・テクノロジーズ・カンパニー・リミテッド ピクチャデータ処理ニューラルネットワークに入力される補助情報の構成可能な位置
JP7720402B2 (ja) 2021-04-01 2025-08-07 ホアウェイ・テクノロジーズ・カンパニー・リミテッド ニューラルネットワークベースのピクチャ処理における補助情報の独立した配置
US12586255B2 (en) 2021-04-01 2026-03-24 Huawei Technologies Co., Ltd. Configurable positions for auxiliary information input into a picture data processing neural network

Also Published As

Publication number Publication date
MX2023010960A (es) 2023-09-27
US20240187640A1 (en) 2024-06-06
CN116998154A (zh) 2023-11-03
WO2022197772A1 (en) 2022-09-22
EP4309366A1 (en) 2024-01-24

Similar Documents

Publication Publication Date Title
JP2023543985A (ja) 多用途ビデオコーディングのためのテンプレートマッチング予測
US12537979B2 (en) Method and an apparatus for encoding/decoding images and videos using artificial neural network based tools
US20230396801A1 (en) Learned video compression framework for multiple machine tasks
CN116134822A (zh) 用于更新基于深度神经网络的图像或视频解码器的方法和装置
JP2024510433A (ja) ビデオ圧縮のための時間的構造ベースの条件付き畳み込みニューラルネットワーク
KR20230025879A (ko) 신경 네트워크 기반 인트라 예측 모드에 대한 변환 프로세스의 적응
CN114450965B (zh) 基于长范围端对端深度学习的视频压缩
US12587669B2 (en) Motion flow coding for deep learning based YUV video compression
JP2024537625A (ja) デコーダ側イントラモード導出における角度離散化の改善
JP2025535086A (ja) 暗黙的ニューラル表現の学習された辞書を使用する画像及びビデオ圧縮
KR20220088888A (ko) 인트라 예측을 위한 신경망의 반복 트레이닝
CN113574887A (zh) 基于低位移秩的深度神经网络压缩
KR20210069715A (ko) 비디오 인코딩 및 디코딩의 아핀 모드 시그널링
CN112806011B (zh) 改进的虚拟时间仿射候选
JP2024513657A (ja) ビデオエンコード及びデコードのためのテンプレートマッチング予測
WO2023146634A1 (en) Block-based compression and latent space intra prediction
JP2024513873A (ja) 切り替え可能な補間フィルタを用いる幾何学的分割
CN114930819B (zh) 三角形合并模式中的子块合并候选
EP3815373A1 (en) Virtual temporal affine candidates
US20260087680A1 (en) Reinforcement learning-based rate control for end-to-end neural network based video compression
CN114097235B (zh) 用于仿射和sbtmvp运动矢量预测模式的hmvc
KR20260022311A (ko) 회귀-기반 아핀 양방향-예측 가중치들
EP4655943A1 (en) Multi-residual autoencoder for image and video compression

Legal Events

Date Code Title Description
RD02 Notification of acceptance of power of attorney

Free format text: JAPANESE INTERMEDIATE CODE: A7422

Effective date: 20231016

A711 Notification of change in applicant

Free format text: JAPANESE INTERMEDIATE CODE: A711

Effective date: 20241115

A621 Written request for application examination

Free format text: JAPANESE INTERMEDIATE CODE: A621

Effective date: 20250317

A521 Request for written amendment filed

Free format text: JAPANESE INTERMEDIATE CODE: A523

Effective date: 20250409

A977 Report on retrieval

Free format text: JAPANESE INTERMEDIATE CODE: A971007

Effective date: 20251106

A131 Notification of reasons for refusal

Free format text: JAPANESE INTERMEDIATE CODE: A131

Effective date: 20251216

A521 Request for written amendment filed

Free format text: JAPANESE INTERMEDIATE CODE: A523

Effective date: 20260310