CN116998154A - Temporal structure-based conditional convolutional neural networks for video compression - Google Patents
Temporal structure-based conditional convolutional neural networks for video compression
- Publication number
- CN116998154A
- Authority
- CN
- China
- Prior art keywords
- conditional
- block
- video
- bitstream
- convolution
- Prior art date
- Legal status (The legal status is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the status listed.)
- Pending
Classifications
-
- H—ELECTRICITY
- H04—ELECTRIC COMMUNICATION TECHNIQUE
- H04N—PICTORIAL COMMUNICATION, e.g. TELEVISION
- H04N19/00—Methods or arrangements for coding, decoding, compressing or decompressing digital video signals
- H04N19/50—Methods or arrangements for coding, decoding, compressing or decompressing digital video signals using predictive coding
- H04N19/503—Methods or arrangements for coding, decoding, compressing or decompressing digital video signals using predictive coding involving temporal prediction
- H04N19/51—Motion estimation or motion compensation
- H04N19/537—Motion estimation other than block-based
-
- H—ELECTRICITY
- H04—ELECTRIC COMMUNICATION TECHNIQUE
- H04N—PICTORIAL COMMUNICATION, e.g. TELEVISION
- H04N19/00—Methods or arrangements for coding, decoding, compressing or decompressing digital video signals
- H04N19/10—Methods or arrangements for coding, decoding, compressing or decompressing digital video signals using adaptive coding
- H04N19/169—Methods or arrangements for coding, decoding, compressing or decompressing digital video signals using adaptive coding characterised by the coding unit, i.e. the structural portion or semantic portion of the video signal being the object or the subject of the adaptive coding
- H04N19/184—Methods or arrangements for coding, decoding, compressing or decompressing digital video signals using adaptive coding characterised by the coding unit, i.e. the structural portion or semantic portion of the video signal being the object or the subject of the adaptive coding the unit being bits, e.g. of the compressed video stream
-
- H—ELECTRICITY
- H04—ELECTRIC COMMUNICATION TECHNIQUE
- H04N—PICTORIAL COMMUNICATION, e.g. TELEVISION
- H04N19/00—Methods or arrangements for coding, decoding, compressing or decompressing digital video signals
- H04N19/50—Methods or arrangements for coding, decoding, compressing or decompressing digital video signals using predictive coding
- H04N19/503—Methods or arrangements for coding, decoding, compressing or decompressing digital video signals using predictive coding involving temporal prediction
- H04N19/51—Motion estimation or motion compensation
- H04N19/573—Motion compensation with multiple frame prediction using two or more reference frames in a given prediction direction
-
- H—ELECTRICITY
- H04—ELECTRIC COMMUNICATION TECHNIQUE
- H04N—PICTORIAL COMMUNICATION, e.g. TELEVISION
- H04N19/00—Methods or arrangements for coding, decoding, compressing or decompressing digital video signals
- H04N19/50—Methods or arrangements for coding, decoding, compressing or decompressing digital video signals using predictive coding
- H04N19/503—Methods or arrangements for coding, decoding, compressing or decompressing digital video signals using predictive coding involving temporal prediction
- H04N19/51—Motion estimation or motion compensation
- H04N19/58—Motion compensation with long-term prediction, i.e. the reference frame for a current frame not being the temporally closest one
-
- H—ELECTRICITY
- H04—ELECTRIC COMMUNICATION TECHNIQUE
- H04N—PICTORIAL COMMUNICATION, e.g. TELEVISION
- H04N19/00—Methods or arrangements for coding, decoding, compressing or decompressing digital video signals
- H04N19/80—Details of filtering operations specially adapted for video compression, e.g. for pixel interpolation
- H04N19/82—Details of filtering operations specially adapted for video compression, e.g. for pixel interpolation involving filtering within a prediction loop
-
- H—ELECTRICITY
- H04—ELECTRIC COMMUNICATION TECHNIQUE
- H04N—PICTORIAL COMMUNICATION, e.g. TELEVISION
- H04N19/00—Methods or arrangements for coding, decoding, compressing or decompressing digital video signals
- H04N19/90—Methods or arrangements for coding, decoding, compressing or decompressing digital video signals using coding techniques not provided for in groups H04N19/10-H04N19/85, e.g. fractals
-
- H—ELECTRICITY
- H04—ELECTRIC COMMUNICATION TECHNIQUE
- H04N—PICTORIAL COMMUNICATION, e.g. TELEVISION
- H04N19/00—Methods or arrangements for coding, decoding, compressing or decompressing digital video signals
- H04N19/10—Methods or arrangements for coding, decoding, compressing or decompressing digital video signals using adaptive coding
- H04N19/102—Methods or arrangements for coding, decoding, compressing or decompressing digital video signals using adaptive coding characterised by the element, parameter or selection affected or controlled by the adaptive coding
- H04N19/117—Filters, e.g. for pre-processing or post-processing
-
- H—ELECTRICITY
- H04—ELECTRIC COMMUNICATION TECHNIQUE
- H04N—PICTORIAL COMMUNICATION, e.g. TELEVISION
- H04N19/00—Methods or arrangements for coding, decoding, compressing or decompressing digital video signals
- H04N19/10—Methods or arrangements for coding, decoding, compressing or decompressing digital video signals using adaptive coding
- H04N19/102—Methods or arrangements for coding, decoding, compressing or decompressing digital video signals using adaptive coding characterised by the element, parameter or selection affected or controlled by the adaptive coding
- H04N19/132—Sampling, masking or truncation of coding units, e.g. adaptive resampling, frame skipping, frame interpolation or high-frequency transform coefficient masking
-
- H—ELECTRICITY
- H04—ELECTRIC COMMUNICATION TECHNIQUE
- H04N—PICTORIAL COMMUNICATION, e.g. TELEVISION
- H04N19/00—Methods or arrangements for coding, decoding, compressing or decompressing digital video signals
- H04N19/10—Methods or arrangements for coding, decoding, compressing or decompressing digital video signals using adaptive coding
- H04N19/169—Methods or arrangements for coding, decoding, compressing or decompressing digital video signals using adaptive coding characterised by the coding unit, i.e. the structural portion or semantic portion of the video signal being the object or the subject of the adaptive coding
- H04N19/17—Methods or arrangements for coding, decoding, compressing or decompressing digital video signals using adaptive coding characterised by the coding unit, i.e. the structural portion or semantic portion of the video signal being the object or the subject of the adaptive coding the unit being an image region, e.g. an object
-
- H—ELECTRICITY
- H04—ELECTRIC COMMUNICATION TECHNIQUE
- H04N—PICTORIAL COMMUNICATION, e.g. TELEVISION
- H04N19/00—Methods or arrangements for coding, decoding, compressing or decompressing digital video signals
- H04N19/10—Methods or arrangements for coding, decoding, compressing or decompressing digital video signals using adaptive coding
- H04N19/169—Methods or arrangements for coding, decoding, compressing or decompressing digital video signals using adaptive coding characterised by the coding unit, i.e. the structural portion or semantic portion of the video signal being the object or the subject of the adaptive coding
- H04N19/17—Methods or arrangements for coding, decoding, compressing or decompressing digital video signals using adaptive coding characterised by the coding unit, i.e. the structural portion or semantic portion of the video signal being the object or the subject of the adaptive coding the unit being an image region, e.g. an object
- H04N19/172—Methods or arrangements for coding, decoding, compressing or decompressing digital video signals using adaptive coding characterised by the coding unit, i.e. the structural portion or semantic portion of the video signal being the object or the subject of the adaptive coding the unit being an image region, e.g. an object the region being a picture, frame or field
-
- H—ELECTRICITY
- H04—ELECTRIC COMMUNICATION TECHNIQUE
- H04N—PICTORIAL COMMUNICATION, e.g. TELEVISION
- H04N19/00—Methods or arrangements for coding, decoding, compressing or decompressing digital video signals
- H04N19/10—Methods or arrangements for coding, decoding, compressing or decompressing digital video signals using adaptive coding
- H04N19/169—Methods or arrangements for coding, decoding, compressing or decompressing digital video signals using adaptive coding characterised by the coding unit, i.e. the structural portion or semantic portion of the video signal being the object or the subject of the adaptive coding
- H04N19/177—Methods or arrangements for coding, decoding, compressing or decompressing digital video signals using adaptive coding characterised by the coding unit, i.e. the structural portion or semantic portion of the video signal being the object or the subject of the adaptive coding the unit being a group of pictures [GOP]
-
- H—ELECTRICITY
- H04—ELECTRIC COMMUNICATION TECHNIQUE
- H04N—PICTORIAL COMMUNICATION, e.g. TELEVISION
- H04N19/00—Methods or arrangements for coding, decoding, compressing or decompressing digital video signals
- H04N19/10—Methods or arrangements for coding, decoding, compressing or decompressing digital video signals using adaptive coding
- H04N19/169—Methods or arrangements for coding, decoding, compressing or decompressing digital video signals using adaptive coding characterised by the coding unit, i.e. the structural portion or semantic portion of the video signal being the object or the subject of the adaptive coding
- H04N19/186—Methods or arrangements for coding, decoding, compressing or decompressing digital video signals using adaptive coding characterised by the coding unit, i.e. the structural portion or semantic portion of the video signal being the object or the subject of the adaptive coding the unit being a colour or a chrominance component
-
- H—ELECTRICITY
- H04—ELECTRIC COMMUNICATION TECHNIQUE
- H04N—PICTORIAL COMMUNICATION, e.g. TELEVISION
- H04N19/00—Methods or arrangements for coding, decoding, compressing or decompressing digital video signals
- H04N19/50—Methods or arrangements for coding, decoding, compressing or decompressing digital video signals using predictive coding
- H04N19/503—Methods or arrangements for coding, decoding, compressing or decompressing digital video signals using predictive coding involving temporal prediction
- H04N19/51—Motion estimation or motion compensation
- H04N19/577—Motion compensation with bidirectional frame interpolation, i.e. using B-pictures
-
- H—ELECTRICITY
- H04—ELECTRIC COMMUNICATION TECHNIQUE
- H04N—PICTORIAL COMMUNICATION, e.g. TELEVISION
- H04N19/00—Methods or arrangements for coding, decoding, compressing or decompressing digital video signals
- H04N19/60—Methods or arrangements for coding, decoding, compressing or decompressing digital video signals using transform coding
- H04N19/61—Methods or arrangements for coding, decoding, compressing or decompressing digital video signals using transform coding in combination with predictive coding
-
- H—ELECTRICITY
- H04—ELECTRIC COMMUNICATION TECHNIQUE
- H04N—PICTORIAL COMMUNICATION, e.g. TELEVISION
- H04N19/00—Methods or arrangements for coding, decoding, compressing or decompressing digital video signals
- H04N19/70—Methods or arrangements for coding, decoding, compressing or decompressing digital video signals characterised by syntax aspects related to video coding, e.g. related to compression standards
Landscapes
- Engineering & Computer Science (AREA)
- Multimedia (AREA)
- Signal Processing (AREA)
- Compression Or Coding Systems Of Tv Signals (AREA)
Applications Claiming Priority (3)
| Application Number | Priority Date | Filing Date | Title |
|---|---|---|---|
| US202163162791P | 2021-03-18 | 2021-03-18 | |
| US63/162,791 | 2021-03-18 | | |
| PCT/US2022/020504 WO2022197772A1 (en) | 2021-03-18 | 2022-03-16 | Temporal structure-based conditional convolutional neural networks for video compression |
Publications (1)
| Publication Number | Publication Date |
|---|---|
| CN116998154A (zh) | 2023-11-03 |
Family
ID=81328100
Family Applications (1)
| Application Number | Priority Date | Filing Date | Title |
|---|---|---|---|
| CN202280022116.8A Pending CN116998154A (zh) | 2021-03-18 | 2022-03-16 | Temporal structure-based conditional convolutional neural networks for video compression |
Country Status (6)
| Country | Link |
|---|---|
| US (1) | US20240187640A1 |
| EP (1) | EP4309366A1 |
| JP (1) | JP2024510433A |
| CN (1) | CN116998154A |
| MX (1) | MX2023010960A |
| WO (1) | WO2022197772A1 |
Families Citing this family (4)
| Publication number | Priority date | Publication date | Assignee | Title |
|---|---|---|---|---|
| CN117441333A (zh) * | 2021-04-01 | 2024-01-23 | Huawei Technologies Co., Ltd. | Configurable positions for auxiliary information input into a picture data processing neural network |
| EP4272437A1 (en) * | 2021-04-01 | 2023-11-08 | Huawei Technologies Co., Ltd. | Independent positioning of auxiliary information in neural network based picture processing |
| EP4480174A1 (en) * | 2022-02-17 | 2024-12-25 | OP Solutions, LLC | Systems and methods for video coding for machines using an autoencoder |
| FR3153177A1 (fr) * | 2023-09-14 | 2025-03-21 | Orange | Method and device for encoding and decoding images |
Family Cites Families (4)
| Publication number | Priority date | Publication date | Assignee | Title |
|---|---|---|---|---|
| US9485515B2 (en) * | 2013-08-23 | 2016-11-01 | Google Inc. | Video coding using reference motion vectors |
| US10887597B2 (en) * | 2015-06-09 | 2021-01-05 | Qualcomm Incorporated | Systems and methods of determining illumination compensation parameters for video coding |
| WO2020016859A2 (en) * | 2018-07-20 | 2020-01-23 | Beijing Bytedance Network Technology Co., Ltd. | Motion prediction based on updated motion vectors |
| EP3899806B1 (en) * | 2019-01-23 | 2025-12-10 | Google LLC | Convolutional neural networks with soft kernel selection |
-
2022
- 2022-03-16 JP JP2023554294A patent/JP2024510433A/ja active Pending
- 2022-03-16 WO PCT/US2022/020504 patent/WO2022197772A1/en not_active Ceased
- 2022-03-16 CN CN202280022116.8A patent/CN116998154A/zh active Pending
- 2022-03-16 EP EP22714698.2A patent/EP4309366A1/en active Pending
- 2022-03-16 MX MX2023010960A patent/MX2023010960A/es unknown
- 2022-03-16 US US18/281,844 patent/US20240187640A1/en active Pending
Also Published As
| Publication number | Publication date |
|---|---|
| US20240187640A1 (en) | 2024-06-06 |
| JP2024510433A (ja) | 2024-03-07 |
| WO2022197772A1 (en) | 2022-09-22 |
| MX2023010960A (es) | 2023-09-27 |
| EP4309366A1 (en) | 2024-01-24 |
Similar Documents
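| Publication | Title | Publication Date |
|---|---|---|
| US20240380929A1 (en) | A method and an apparatus for encoding/decoding images and videos using artificial neural network based tools | |
| US20230396801A1 (en) | Learned video compression framework for multiple machine tasks | |
| KR20210083353A (ko) | Simplification of coding modes based on neighboring-sample-dependent parametric models | |
| CN113170113B (zh) | Triangle and multi-hypothesis combination for video encoding and decoding | |
| TWI879800B (zh) | Method, apparatus and device for video encoding or decoding, and related non-transitory computer-readable medium and computer program product | |
| US20240187640A1 (en) | Temporal structure-based conditional convolutional neural networks for video compression | |
| CN113330747B (zh) | Method and apparatus for video encoding and decoding using bi-directional optical flow adapted to weighted prediction | |
| CN112740674B (zh) | Method and apparatus for video encoding and decoding using bi-prediction | |
| US12200197B2 (en) | Virtual temporal affine candidates | |
| KR20210069715A (ko) | Affine mode signaling in video encoding and decoding | |
| CN112806011B (zh) | Improved virtual temporal affine candidates | |
| JP2024537625A (ja) | Improved angle discretization in decoder-side intra mode derivation | |
| CN114270829B (zh) | Local illumination compensation flag inheritance | |
| CN116171577A (zh) | Deep prediction refinement | |
| US20240155148A1 (en) | Motion flow coding for deep learning based YUV video compression | |
| CN114930819B (zh) | Sub-block merge candidates in triangle merge mode | |
| CN117280684A (zh) | Geometric partitioning with switchable interpolation filter | |
| KR20220123666A (ko) | Estimation of weighted-prediction parameters | |
| JP2025533482A (ja) | Reinforcement learning-based rate control for end-to-end neural network-based video compression | |
| CN114424535B (zh) | Prediction for video encoding and decoding using external reference | |
| WO2025149357A1 (en) | MMVD offsets table for DMVR candidates | |
| CN118975232A (zh) | Motion information parameter propagation based on intra prediction direction | |
| CN117501692A (zh) | Template matching prediction for video encoding and decoding | |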
Legal Events
| Date | Code | Title | Description |
|---|---|---|---|
| | PB01 | Publication | |
| | SE01 | Entry into force of request for substantive examination | |
| | TA01 | Transfer of patent application right | Effective date of registration: 2024-08-27. Address after: Delaware, USA. Applicant after: Interactive Digital VC Holdings. Country or region after: U.S.A. Address before: Wilmington, Delaware, USA. Applicant before: VID SCALE, Inc. Country or region before: U.S.A. |