JP2024510433A - Temporal structure-based conditional convolutional neural networks for video compression - Google Patents
- Publication number
- JP2024510433A (application JP2023554294A)
- Authority
- JP
- Japan
- Prior art keywords
- conditional
- block
- video
- encoding
- bitstream
- Prior art date
- Legal status (The legal status is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the status listed.)
- Pending
Classifications
-
- H—ELECTRICITY
- H04—ELECTRIC COMMUNICATION TECHNIQUE
- H04N—PICTORIAL COMMUNICATION, e.g. TELEVISION
- H04N19/00—Methods or arrangements for coding, decoding, compressing or decompressing digital video signals
- H04N19/50—Methods or arrangements for coding, decoding, compressing or decompressing digital video signals using predictive coding
- H04N19/503—Methods or arrangements for coding, decoding, compressing or decompressing digital video signals using predictive coding involving temporal prediction
- H04N19/51—Motion estimation or motion compensation
- H04N19/537—Motion estimation other than block-based
-
- H—ELECTRICITY
- H04—ELECTRIC COMMUNICATION TECHNIQUE
- H04N—PICTORIAL COMMUNICATION, e.g. TELEVISION
- H04N19/00—Methods or arrangements for coding, decoding, compressing or decompressing digital video signals
- H04N19/10—Methods or arrangements for coding, decoding, compressing or decompressing digital video signals using adaptive coding
- H04N19/169—Methods or arrangements for coding, decoding, compressing or decompressing digital video signals using adaptive coding characterised by the coding unit, i.e. the structural portion or semantic portion of the video signal being the object or the subject of the adaptive coding
- H04N19/184—Methods or arrangements for coding, decoding, compressing or decompressing digital video signals using adaptive coding characterised by the coding unit, i.e. the structural portion or semantic portion of the video signal being the object or the subject of the adaptive coding the unit being bits, e.g. of the compressed video stream
-
- H—ELECTRICITY
- H04—ELECTRIC COMMUNICATION TECHNIQUE
- H04N—PICTORIAL COMMUNICATION, e.g. TELEVISION
- H04N19/00—Methods or arrangements for coding, decoding, compressing or decompressing digital video signals
- H04N19/50—Methods or arrangements for coding, decoding, compressing or decompressing digital video signals using predictive coding
- H04N19/503—Methods or arrangements for coding, decoding, compressing or decompressing digital video signals using predictive coding involving temporal prediction
- H04N19/51—Motion estimation or motion compensation
- H04N19/573—Motion compensation with multiple frame prediction using two or more reference frames in a given prediction direction
-
- H—ELECTRICITY
- H04—ELECTRIC COMMUNICATION TECHNIQUE
- H04N—PICTORIAL COMMUNICATION, e.g. TELEVISION
- H04N19/00—Methods or arrangements for coding, decoding, compressing or decompressing digital video signals
- H04N19/50—Methods or arrangements for coding, decoding, compressing or decompressing digital video signals using predictive coding
- H04N19/503—Methods or arrangements for coding, decoding, compressing or decompressing digital video signals using predictive coding involving temporal prediction
- H04N19/51—Motion estimation or motion compensation
- H04N19/58—Motion compensation with long-term prediction, i.e. the reference frame for a current frame not being the temporally closest one
-
- H—ELECTRICITY
- H04—ELECTRIC COMMUNICATION TECHNIQUE
- H04N—PICTORIAL COMMUNICATION, e.g. TELEVISION
- H04N19/00—Methods or arrangements for coding, decoding, compressing or decompressing digital video signals
- H04N19/80—Details of filtering operations specially adapted for video compression, e.g. for pixel interpolation
- H04N19/82—Details of filtering operations specially adapted for video compression, e.g. for pixel interpolation involving filtering within a prediction loop
-
- H—ELECTRICITY
- H04—ELECTRIC COMMUNICATION TECHNIQUE
- H04N—PICTORIAL COMMUNICATION, e.g. TELEVISION
- H04N19/00—Methods or arrangements for coding, decoding, compressing or decompressing digital video signals
- H04N19/90—Methods or arrangements for coding, decoding, compressing or decompressing digital video signals using coding techniques not provided for in groups H04N19/10-H04N19/85, e.g. fractals
-
- H—ELECTRICITY
- H04—ELECTRIC COMMUNICATION TECHNIQUE
- H04N—PICTORIAL COMMUNICATION, e.g. TELEVISION
- H04N19/00—Methods or arrangements for coding, decoding, compressing or decompressing digital video signals
- H04N19/10—Methods or arrangements for coding, decoding, compressing or decompressing digital video signals using adaptive coding
- H04N19/102—Methods or arrangements for coding, decoding, compressing or decompressing digital video signals using adaptive coding characterised by the element, parameter or selection affected or controlled by the adaptive coding
- H04N19/117—Filters, e.g. for pre-processing or post-processing
-
- H—ELECTRICITY
- H04—ELECTRIC COMMUNICATION TECHNIQUE
- H04N—PICTORIAL COMMUNICATION, e.g. TELEVISION
- H04N19/00—Methods or arrangements for coding, decoding, compressing or decompressing digital video signals
- H04N19/10—Methods or arrangements for coding, decoding, compressing or decompressing digital video signals using adaptive coding
- H04N19/102—Methods or arrangements for coding, decoding, compressing or decompressing digital video signals using adaptive coding characterised by the element, parameter or selection affected or controlled by the adaptive coding
- H04N19/132—Sampling, masking or truncation of coding units, e.g. adaptive resampling, frame skipping, frame interpolation or high-frequency transform coefficient masking
-
- H—ELECTRICITY
- H04—ELECTRIC COMMUNICATION TECHNIQUE
- H04N—PICTORIAL COMMUNICATION, e.g. TELEVISION
- H04N19/00—Methods or arrangements for coding, decoding, compressing or decompressing digital video signals
- H04N19/10—Methods or arrangements for coding, decoding, compressing or decompressing digital video signals using adaptive coding
- H04N19/169—Methods or arrangements for coding, decoding, compressing or decompressing digital video signals using adaptive coding characterised by the coding unit, i.e. the structural portion or semantic portion of the video signal being the object or the subject of the adaptive coding
- H04N19/17—Methods or arrangements for coding, decoding, compressing or decompressing digital video signals using adaptive coding characterised by the coding unit, i.e. the structural portion or semantic portion of the video signal being the object or the subject of the adaptive coding the unit being an image region, e.g. an object
-
- H—ELECTRICITY
- H04—ELECTRIC COMMUNICATION TECHNIQUE
- H04N—PICTORIAL COMMUNICATION, e.g. TELEVISION
- H04N19/00—Methods or arrangements for coding, decoding, compressing or decompressing digital video signals
- H04N19/10—Methods or arrangements for coding, decoding, compressing or decompressing digital video signals using adaptive coding
- H04N19/169—Methods or arrangements for coding, decoding, compressing or decompressing digital video signals using adaptive coding characterised by the coding unit, i.e. the structural portion or semantic portion of the video signal being the object or the subject of the adaptive coding
- H04N19/17—Methods or arrangements for coding, decoding, compressing or decompressing digital video signals using adaptive coding characterised by the coding unit, i.e. the structural portion or semantic portion of the video signal being the object or the subject of the adaptive coding the unit being an image region, e.g. an object
- H04N19/172—Methods or arrangements for coding, decoding, compressing or decompressing digital video signals using adaptive coding characterised by the coding unit, i.e. the structural portion or semantic portion of the video signal being the object or the subject of the adaptive coding the unit being an image region, e.g. an object the region being a picture, frame or field
-
- H—ELECTRICITY
- H04—ELECTRIC COMMUNICATION TECHNIQUE
- H04N—PICTORIAL COMMUNICATION, e.g. TELEVISION
- H04N19/00—Methods or arrangements for coding, decoding, compressing or decompressing digital video signals
- H04N19/10—Methods or arrangements for coding, decoding, compressing or decompressing digital video signals using adaptive coding
- H04N19/169—Methods or arrangements for coding, decoding, compressing or decompressing digital video signals using adaptive coding characterised by the coding unit, i.e. the structural portion or semantic portion of the video signal being the object or the subject of the adaptive coding
- H04N19/177—Methods or arrangements for coding, decoding, compressing or decompressing digital video signals using adaptive coding characterised by the coding unit, i.e. the structural portion or semantic portion of the video signal being the object or the subject of the adaptive coding the unit being a group of pictures [GOP]
-
- H—ELECTRICITY
- H04—ELECTRIC COMMUNICATION TECHNIQUE
- H04N—PICTORIAL COMMUNICATION, e.g. TELEVISION
- H04N19/00—Methods or arrangements for coding, decoding, compressing or decompressing digital video signals
- H04N19/10—Methods or arrangements for coding, decoding, compressing or decompressing digital video signals using adaptive coding
- H04N19/169—Methods or arrangements for coding, decoding, compressing or decompressing digital video signals using adaptive coding characterised by the coding unit, i.e. the structural portion or semantic portion of the video signal being the object or the subject of the adaptive coding
- H04N19/186—Methods or arrangements for coding, decoding, compressing or decompressing digital video signals using adaptive coding characterised by the coding unit, i.e. the structural portion or semantic portion of the video signal being the object or the subject of the adaptive coding the unit being a colour or a chrominance component
-
- H—ELECTRICITY
- H04—ELECTRIC COMMUNICATION TECHNIQUE
- H04N—PICTORIAL COMMUNICATION, e.g. TELEVISION
- H04N19/00—Methods or arrangements for coding, decoding, compressing or decompressing digital video signals
- H04N19/50—Methods or arrangements for coding, decoding, compressing or decompressing digital video signals using predictive coding
- H04N19/503—Methods or arrangements for coding, decoding, compressing or decompressing digital video signals using predictive coding involving temporal prediction
- H04N19/51—Motion estimation or motion compensation
- H04N19/577—Motion compensation with bidirectional frame interpolation, i.e. using B-pictures
-
- H—ELECTRICITY
- H04—ELECTRIC COMMUNICATION TECHNIQUE
- H04N—PICTORIAL COMMUNICATION, e.g. TELEVISION
- H04N19/00—Methods or arrangements for coding, decoding, compressing or decompressing digital video signals
- H04N19/60—Methods or arrangements for coding, decoding, compressing or decompressing digital video signals using transform coding
- H04N19/61—Methods or arrangements for coding, decoding, compressing or decompressing digital video signals using transform coding in combination with predictive coding
-
- H—ELECTRICITY
- H04—ELECTRIC COMMUNICATION TECHNIQUE
- H04N—PICTORIAL COMMUNICATION, e.g. TELEVISION
- H04N19/00—Methods or arrangements for coding, decoding, compressing or decompressing digital video signals
- H04N19/70—Methods or arrangements for coding, decoding, compressing or decompressing digital video signals characterised by syntax aspects related to video coding, e.g. related to compression standards
Landscapes
- Engineering & Computer Science (AREA)
- Multimedia (AREA)
- Signal Processing (AREA)
- Compression Or Coding Systems Of Tv Signals (AREA)
Applications Claiming Priority (3)
| Application Number | Priority Date | Filing Date | Title |
|---|---|---|---|
| US202163162791P | 2021-03-18 | 2021-03-18 | |
| US63/162,791 | 2021-03-18 | | |
| PCT/US2022/020504 WO2022197772A1 (en) | 2021-03-18 | 2022-03-16 | Temporal structure-based conditional convolutional neural networks for video compression |
Publications (2)
| Publication Number | Publication Date |
|---|---|
| JP2024510433A (ja) | 2024-03-07 |
| JPWO2022197772A5 | 2025-04-18 |
Family
ID=81328100
Family Applications (1)
| Application Number | Priority Date | Filing Date | Title |
|---|---|---|---|
| JP2023554294A (pending; published as JP2024510433A) | 2021-03-18 | 2022-03-16 | Temporal structure-based conditional convolutional neural networks for video compression |
Country Status (6)
| Country | Link |
|---|---|
| US (1) | US20240187640A1 |
| EP (1) | EP4309366A1 |
| JP (1) | JP2024510433A |
| CN (1) | CN116998154A |
| MX (1) | MX2023010960A |
| WO (1) | WO2022197772A1 |
Families Citing this family (2)
| Publication number | Priority date | Publication date | Assignee | Title |
|---|---|---|---|---|
| EP4480174A1 (en) * | 2022-02-17 | 2024-12-25 | OP Solutions, LLC | Systems and methods for video coding for machines using an autoencoder |
| FR3153177A1 (fr) * | 2023-09-14 | 2025-03-21 | Orange | Method and device for encoding and decoding images. |
Family Cites Families (4)
| Publication number | Priority date | Publication date | Assignee | Title |
|---|---|---|---|---|
| US9485515B2 (en) * | 2013-08-23 | 2016-11-01 | Google Inc. | Video coding using reference motion vectors |
| US10887597B2 (en) * | 2015-06-09 | 2021-01-05 | Qualcomm Incorporated | Systems and methods of determining illumination compensation parameters for video coding |
| WO2020016859A2 (en) * | 2018-07-20 | 2020-01-23 | Beijing Bytedance Network Technology Co., Ltd. | Motion prediction based on updated motion vectors |
| EP3899806B1 (en) * | 2019-01-23 | 2025-12-10 | Google LLC | Convolutional neural networks with soft kernel selection |
2022
- 2022-03-16: JP application JP2023554294A, published as JP2024510433A (active, pending)
- 2022-03-16: WO application PCT/US2022/020504, published as WO2022197772A1 (not active, ceased)
- 2022-03-16: CN application CN202280022116.8A, published as CN116998154A (active, pending)
- 2022-03-16: EP application EP22714698.2A, published as EP4309366A1 (active, pending)
- 2022-03-16: MX application MX2023010960A, published as MX2023010960A (status unknown)
- 2022-03-16: US application US18/281,844, published as US20240187640A1 (active, pending)
Cited By (4)
| Publication number | Priority date | Publication date | Assignee | Title |
|---|---|---|---|---|
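| JP2024511587A (ja) * | 2021-04-01 | 2024-03-14 | Huawei Technologies Co., Ltd. | Independent positioning of auxiliary information in neural network based picture processing |
| JP2024513693A (ja) * | 2021-04-01 | 2024-03-27 | Huawei Technologies Co., Ltd. | Configurable positions for auxiliary information input into a picture data processing neural network |
| JP7641402B2 (ja) | 2021-04-01 | 2025-03-06 | Huawei Technologies Co., Ltd. | Configurable positions for auxiliary information input into a picture data processing neural network |
| JP7720402B2 (ja) | 2021-04-01 | 2025-08-07 | Huawei Technologies Co., Ltd. | Independent positioning of auxiliary information in neural network based picture processing |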
Also Published As
| Publication number | Publication date |
|---|---|
| US20240187640A1 (en) | 2024-06-06 |
| WO2022197772A1 (en) | 2022-09-22 |
| MX2023010960A (es) | 2023-09-27 |
| EP4309366A1 (en) | 2024-01-24 |
| CN116998154A (zh) | 2023-11-03 |
Similar Documents
| Publication | Publication Date | Title |
|---|---|---|
| JP2023543985A (ja) | | Template matching prediction for versatile video coding |
| US20240380929A1 (en) | | A method and an apparatus for encoding/decoding images and videos using artificial neural network based tools |
| KR20210134034A (ko) | | Method and apparatus for video encoding and decoding using subblock-based local illumination compensation |
| US20230396801A1 (en) | | Learned video compression framework for multiple machine tasks |
| JP2024510433A (ja) | | Temporal structure-based conditional convolutional neural networks for video compression |
| KR20230025879A (ko) | | Adaptation of the transform process to neural network-based intra prediction mode |
| CN113574887A (zh) | | Deep neural network compression based on low displacement rank |
| KR20210069715A (ko) | | Affine mode signaling in video encoding and decoding |
| JP2025535086A (ja) | | Image and video compression using a learned dictionary of implicit neural representations |
| KR20220088888A (ko) | | Iterative training of neural networks for intra prediction |
| CN112806011B (zh) | | Improved virtual temporal affine candidates |
| JP2024537625A (ja) | | Improved angle discretization in decoder-side intra mode derivation |
| US20240155148A1 (en) | | Motion flow coding for deep learning based yuv video compression |
| KR20220035108A (ko) | | Method and apparatus for video encoding and decoding using matrix-based intra prediction |
| WO2023146634A1 (en) | | Block-based compression and latent space intra prediction |
| JP2024513873A (ja) | | Geometric partitioning with switchable interpolation filter |
| CN114930819B (zh) | | Subblock merge candidates in triangle merge mode |
| EP3815373A1 (en) | | Virtual temporal affine candidates |
| JP2024513657A (ja) | | Template matching prediction for video encoding and decoding |
| JP2025533482A (ja) | | Reinforcement learning-based rate control for end-to-end neural network-based video compression |
| CN114097235B (zh) | | HMVC for affine and SbTMVP motion vector prediction modes |
| CN114450965B (en) | | Video compression based on long-range end-to-end depth learning |
| EP4655943A1 (en) | | Multi-residual autoencoder for image and video compression |
| WO2025162700A1 (en) | 2025-08-07 | Multi-definition implicit neural representation video encoding |
Legal Events
| Date | Code | Title | Description |
|---|---|---|---|
| | RD02 | Notification of acceptance of power of attorney | Free format text: JAPANESE INTERMEDIATE CODE: A7422; Effective date: 20231016 |
| | A711 | Notification of change in applicant | Free format text: JAPANESE INTERMEDIATE CODE: A711; Effective date: 20241115 |
| | A621 | Written request for application examination | Free format text: JAPANESE INTERMEDIATE CODE: A621; Effective date: 20250317 |
| | A521 | Request for written amendment filed | Free format text: JAPANESE INTERMEDIATE CODE: A523; Effective date: 20250409 |
| | A977 | Report on retrieval | Free format text: JAPANESE INTERMEDIATE CODE: A971007; Effective date: 20251106 |