CN112075082B - 用于基于cabac的神经网络实现方式的视频编码与解码的方法及设备 - Google Patents
用于基于cabac的神经网络实现方式的视频编码与解码的方法及设备 Download PDFInfo
- Publication number
- CN112075082B CN112075082B CN201980028681.3A CN201980028681A CN112075082B CN 112075082 B CN112075082 B CN 112075082B CN 201980028681 A CN201980028681 A CN 201980028681A CN 112075082 B CN112075082 B CN 112075082B
- Authority
- CN
- China
- Prior art keywords
- syntax element
- bin
- current block
- encoded
- context
- Prior art date
- Legal status (The legal status is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the status listed.)
- Active
Links
Classifications
-
- H—ELECTRICITY
- H04—ELECTRIC COMMUNICATION TECHNIQUE
- H04N—PICTORIAL COMMUNICATION, e.g. TELEVISION
- H04N19/00—Methods or arrangements for coding, decoding, compressing or decompressing digital video signals
- H04N19/10—Methods or arrangements for coding, decoding, compressing or decompressing digital video signals using adaptive coding
- H04N19/102—Methods or arrangements for coding, decoding, compressing or decompressing digital video signals using adaptive coding characterised by the element, parameter or selection affected or controlled by the adaptive coding
- H04N19/13—Adaptive entropy coding, e.g. adaptive variable length coding [AVLC] or context adaptive binary arithmetic coding [CABAC]
-
- G—PHYSICS
- G06—COMPUTING OR CALCULATING; COUNTING
- G06N—COMPUTING ARRANGEMENTS BASED ON SPECIFIC COMPUTATIONAL MODELS
- G06N3/00—Computing arrangements based on biological models
- G06N3/02—Neural networks
-
- H—ELECTRICITY
- H04—ELECTRIC COMMUNICATION TECHNIQUE
- H04N—PICTORIAL COMMUNICATION, e.g. TELEVISION
- H04N19/00—Methods or arrangements for coding, decoding, compressing or decompressing digital video signals
- H04N19/10—Methods or arrangements for coding, decoding, compressing or decompressing digital video signals using adaptive coding
- H04N19/102—Methods or arrangements for coding, decoding, compressing or decompressing digital video signals using adaptive coding characterised by the element, parameter or selection affected or controlled by the adaptive coding
- H04N19/103—Selection of coding mode or of prediction mode
- H04N19/105—Selection of the reference unit for prediction within a chosen coding or prediction mode, e.g. adaptive choice of position and number of pixels used for prediction
-
- H—ELECTRICITY
- H04—ELECTRIC COMMUNICATION TECHNIQUE
- H04N—PICTORIAL COMMUNICATION, e.g. TELEVISION
- H04N19/00—Methods or arrangements for coding, decoding, compressing or decompressing digital video signals
- H04N19/10—Methods or arrangements for coding, decoding, compressing or decompressing digital video signals using adaptive coding
- H04N19/102—Methods or arrangements for coding, decoding, compressing or decompressing digital video signals using adaptive coding characterised by the element, parameter or selection affected or controlled by the adaptive coding
- H04N19/103—Selection of coding mode or of prediction mode
- H04N19/11—Selection of coding mode or of prediction mode among a plurality of spatial predictive coding modes
-
- H—ELECTRICITY
- H04—ELECTRIC COMMUNICATION TECHNIQUE
- H04N—PICTORIAL COMMUNICATION, e.g. TELEVISION
- H04N19/00—Methods or arrangements for coding, decoding, compressing or decompressing digital video signals
- H04N19/10—Methods or arrangements for coding, decoding, compressing or decompressing digital video signals using adaptive coding
- H04N19/102—Methods or arrangements for coding, decoding, compressing or decompressing digital video signals using adaptive coding characterised by the element, parameter or selection affected or controlled by the adaptive coding
- H04N19/132—Sampling, masking or truncation of coding units, e.g. adaptive resampling, frame skipping, frame interpolation or high-frequency transform coefficient masking
-
- H—ELECTRICITY
- H04—ELECTRIC COMMUNICATION TECHNIQUE
- H04N—PICTORIAL COMMUNICATION, e.g. TELEVISION
- H04N19/00—Methods or arrangements for coding, decoding, compressing or decompressing digital video signals
- H04N19/10—Methods or arrangements for coding, decoding, compressing or decompressing digital video signals using adaptive coding
- H04N19/169—Methods or arrangements for coding, decoding, compressing or decompressing digital video signals using adaptive coding characterised by the coding unit, i.e. the structural portion or semantic portion of the video signal being the object or the subject of the adaptive coding
- H04N19/17—Methods or arrangements for coding, decoding, compressing or decompressing digital video signals using adaptive coding characterised by the coding unit, i.e. the structural portion or semantic portion of the video signal being the object or the subject of the adaptive coding the unit being an image region, e.g. an object
- H04N19/176—Methods or arrangements for coding, decoding, compressing or decompressing digital video signals using adaptive coding characterised by the coding unit, i.e. the structural portion or semantic portion of the video signal being the object or the subject of the adaptive coding the unit being an image region, e.g. an object the region being a block, e.g. a macroblock
-
- H—ELECTRICITY
- H04—ELECTRIC COMMUNICATION TECHNIQUE
- H04N—PICTORIAL COMMUNICATION, e.g. TELEVISION
- H04N19/00—Methods or arrangements for coding, decoding, compressing or decompressing digital video signals
- H04N19/46—Embedding additional information in the video signal during the compression process
-
- H—ELECTRICITY
- H04—ELECTRIC COMMUNICATION TECHNIQUE
- H04N—PICTORIAL COMMUNICATION, e.g. TELEVISION
- H04N19/00—Methods or arrangements for coding, decoding, compressing or decompressing digital video signals
- H04N19/70—Methods or arrangements for coding, decoding, compressing or decompressing digital video signals characterised by syntax aspects related to video coding, e.g. related to compression standards
-
- H—ELECTRICITY
- H04—ELECTRIC COMMUNICATION TECHNIQUE
- H04N—PICTORIAL COMMUNICATION, e.g. TELEVISION
- H04N19/00—Methods or arrangements for coding, decoding, compressing or decompressing digital video signals
- H04N19/90—Methods or arrangements for coding, decoding, compressing or decompressing digital video signals using coding techniques not provided for in groups H04N19/10-H04N19/85, e.g. fractals
- H04N19/91—Entropy coding, e.g. variable length coding [VLC] or arithmetic coding
-
- G—PHYSICS
- G06—COMPUTING OR CALCULATING; COUNTING
- G06T—IMAGE DATA PROCESSING OR GENERATION, IN GENERAL
- G06T9/00—Image coding
- G06T9/002—Image coding using neural networks
Landscapes
- Engineering & Computer Science (AREA)
- Multimedia (AREA)
- Signal Processing (AREA)
- Physics & Mathematics (AREA)
- Theoretical Computer Science (AREA)
- Data Mining & Analysis (AREA)
- General Health & Medical Sciences (AREA)
- Biomedical Technology (AREA)
- Biophysics (AREA)
- Computational Linguistics (AREA)
- Life Sciences & Earth Sciences (AREA)
- Evolutionary Computation (AREA)
- Artificial Intelligence (AREA)
- Molecular Biology (AREA)
- Computing Systems (AREA)
- General Engineering & Computer Science (AREA)
- General Physics & Mathematics (AREA)
- Mathematical Physics (AREA)
- Software Systems (AREA)
- Health & Medical Sciences (AREA)
- Compression Or Coding Systems Of Tv Signals (AREA)
Applications Claiming Priority (3)
| Application Number | Priority Date | Filing Date | Title |
|---|---|---|---|
| EP18305537.5 | 2018-04-27 | ||
| EP18305537.5A EP3562162A1 (en) | 2018-04-27 | 2018-04-27 | Method and apparatus for video encoding and decoding based on neural network implementation of cabac |
| PCT/US2019/028859 WO2019209913A1 (en) | 2018-04-27 | 2019-04-24 | Method and apparatus for video encoding and decoding based on neural network implementation of cabac |
Publications (2)
| Publication Number | Publication Date |
|---|---|
| CN112075082A CN112075082A (zh) | 2020-12-11 |
| CN112075082B true CN112075082B (zh) | 2024-03-26 |
Family
ID=62143088
Family Applications (1)
| Application Number | Title | Priority Date | Filing Date |
|---|---|---|---|
| CN201980028681.3A Active CN112075082B (zh) | 2018-04-27 | 2019-04-24 | 用于基于cabac的神经网络实现方式的视频编码与解码的方法及设备 |
Country Status (5)
| Country | Link |
|---|---|
| US (2) | US11323716B2 (https=) |
| EP (2) | EP3562162A1 (https=) |
| JP (1) | JP7421492B2 (https=) |
| CN (1) | CN112075082B (https=) |
| WO (1) | WO2019209913A1 (https=) |
Families Citing this family (24)
| Publication number | Priority date | Publication date | Assignee | Title |
|---|---|---|---|---|
| EP3562162A1 (en) * | 2018-04-27 | 2019-10-30 | InterDigital VC Holdings, Inc. | Method and apparatus for video encoding and decoding based on neural network implementation of cabac |
| US10652581B1 (en) * | 2019-02-27 | 2020-05-12 | Google Llc | Entropy coding in image and video compression using machine learning |
| CN112235583B (zh) * | 2019-07-15 | 2021-12-24 | 华为技术有限公司 | 基于小波变换的图像编解码方法及装置 |
| US12323596B2 (en) | 2019-11-08 | 2025-06-03 | Interdigital Madison Patent Holdings, Sas | Deep intra prediction of an image block |
| US11671110B2 (en) * | 2019-11-22 | 2023-06-06 | Tencent America LLC | Method and apparatus for neural network model compression/decompression |
| US11395007B2 (en) | 2019-12-12 | 2022-07-19 | Tencent America LLC | Method for signaling dependent and independent picture header |
| CN111431540B (zh) * | 2020-04-01 | 2021-10-08 | 西安交通大学 | 一种基于神经网络模型的fpga配置文件算术压缩与解压方法 |
| EP4136756A1 (en) * | 2020-04-14 | 2023-02-22 | Fraunhofer-Gesellschaft zur Förderung der angewandten Forschung e.V. | Decoder for decoding weight parameters of a neural network, encoder, methods and encoded representation using probability estimation parameters |
| US12363310B2 (en) * | 2020-06-16 | 2025-07-15 | Nokia Technologies Oy | Guided probability model for compressed representation of neural networks |
| CN114339262B (zh) * | 2020-09-30 | 2023-02-14 | 华为技术有限公司 | 熵编/解码方法及装置 |
| FR3114933B1 (fr) * | 2020-10-06 | 2023-10-13 | Fond B Com | Procédé et dispositif électronique de décodage d’un flux de données, et programme d’ordinateur associé |
| CN114501031B (zh) * | 2020-11-13 | 2023-06-02 | 华为技术有限公司 | 一种压缩编码、解压缩方法以及装置 |
| WO2022116165A1 (zh) * | 2020-12-04 | 2022-06-09 | 深圳市大疆创新科技有限公司 | 视频编码方法、解码方法、编码器、解码器以及ai加速器 |
| CN114915782B (zh) * | 2021-02-10 | 2025-09-12 | 华为技术有限公司 | 一种编码方法、解码方法及设备 |
| US11425368B1 (en) * | 2021-02-17 | 2022-08-23 | Adobe Inc. | Lossless image compression using block based prediction and optimized context adaptive entropy coding |
| FR3120174A1 (fr) * | 2021-02-19 | 2022-08-26 | Orange | Prédiction pondérée d’image, codage et décodage d’image utilisant une telle prédiction pondérée |
| CN115118972B (zh) * | 2021-03-17 | 2025-09-02 | 华为技术有限公司 | 视频图像的编解码方法及相关设备 |
| CN117099370A (zh) * | 2021-03-26 | 2023-11-21 | 杜比实验室特许公司 | 使用神经网络对图像和视频编码中的潜在特征进行多分布熵建模 |
| CN117321989A (zh) * | 2021-04-01 | 2023-12-29 | 华为技术有限公司 | 基于神经网络的图像处理中的辅助信息的独立定位 |
| US12041252B2 (en) | 2021-06-07 | 2024-07-16 | Sony Interactive Entertainment Inc. | Multi-threaded CABAC decoding |
| CN115695812A (zh) | 2021-07-30 | 2023-02-03 | 中兴通讯股份有限公司 | 视频编码、视频解码方法、装置、电子设备和存储介质 |
| CN114615507B (zh) * | 2022-05-11 | 2022-09-13 | 哈尔滨工业大学(深圳)(哈尔滨工业大学深圳科技创新研究院) | 一种图像编码方法、解码方法及相关装置 |
| CN116519695B (zh) * | 2022-12-06 | 2025-09-09 | 贵州民族大学 | 一种光学元件表面缺陷检测方法及相关设备 |
| CN120958811A (zh) * | 2023-03-22 | 2025-11-14 | 抖音视界有限公司 | 用于视觉数据处理的方法、装置和介质 |
Citations (5)
| Publication number | Priority date | Publication date | Assignee | Title |
|---|---|---|---|---|
| WO2002061678A2 (en) * | 2001-01-31 | 2002-08-08 | Prediction Dynamics Limited | Feature selection for neural networks |
| CN1857001A (zh) * | 2003-05-20 | 2006-11-01 | Amt先进多媒体科技公司 | 混合视频压缩方法 |
| US8775341B1 (en) * | 2010-10-26 | 2014-07-08 | Michael Lamport Commons | Intelligent control with hierarchical stacked neural networks |
| CN107736027A (zh) * | 2015-06-12 | 2018-02-23 | 松下知识产权经营株式会社 | 图像编码方法、图像解码方法、图像编码装置及图像解码装置 |
| US9941900B1 (en) * | 2017-10-03 | 2018-04-10 | Dropbox, Inc. | Techniques for general-purpose lossless data compression using a recurrent neural network |
Family Cites Families (15)
| Publication number | Priority date | Publication date | Assignee | Title |
|---|---|---|---|---|
| JP3075049B2 (ja) * | 1993-11-30 | 2000-08-07 | 三菱電機株式会社 | 画像符号化装置 |
| AU2002351389A1 (en) * | 2001-12-17 | 2003-06-30 | Microsoft Corporation | Skip macroblock coding |
| CN1190755C (zh) * | 2002-11-08 | 2005-02-23 | 北京工业大学 | 基于感知器的彩色图像无损压缩方法 |
| US20070233477A1 (en) * | 2006-03-30 | 2007-10-04 | Infima Ltd. | Lossless Data Compression Using Adaptive Context Modeling |
| JP2009111691A (ja) | 2007-10-30 | 2009-05-21 | Hitachi Ltd | 画像符号化装置及び符号化方法、画像復号化装置及び復号化方法 |
| JP5676744B2 (ja) * | 2010-04-13 | 2015-02-25 | フラウンホーファー−ゲゼルシャフト・ツール・フェルデルング・デル・アンゲヴァンテン・フォルシュング・アインゲトラーゲネル・フェライン | エントロピー符号化 |
| US9264706B2 (en) | 2012-04-11 | 2016-02-16 | Qualcomm Incorporated | Bypass bins for reference index coding in video coding |
| KR20150090206A (ko) | 2013-01-30 | 2015-08-05 | 인텔 코포레이션 | 차세대 비디오를 위한 코딩을 위한 콘텐츠 적응적 파라메트릭 변환 |
| US10979718B2 (en) * | 2017-09-01 | 2021-04-13 | Apple Inc. | Machine learning video processing systems and methods |
| US11166014B2 (en) * | 2017-12-14 | 2021-11-02 | Electronics And Telecommunications Research Institute | Image encoding and decoding method and device using prediction network |
| KR102435595B1 (ko) * | 2017-12-15 | 2022-08-25 | 한국전자통신연구원 | 분산 처리 환경에서의 학습 파라미터의 압축 및 전송을 제공하는 방법 및 장치 |
| US10841577B2 (en) * | 2018-02-08 | 2020-11-17 | Electronics And Telecommunications Research Institute | Method and apparatus for video encoding and video decoding based on neural network |
| EP3777141B1 (en) * | 2018-03-29 | 2025-09-17 | Fraunhofer-Gesellschaft zur Förderung der angewandten Forschung e.V. | Intra-prediction mode concept for block-wise picture coding |
| EP3562162A1 (en) * | 2018-04-27 | 2019-10-30 | InterDigital VC Holdings, Inc. | Method and apparatus for video encoding and decoding based on neural network implementation of cabac |
| US10652581B1 (en) * | 2019-02-27 | 2020-05-12 | Google Llc | Entropy coding in image and video compression using machine learning |
-
2018
- 2018-04-27 EP EP18305537.5A patent/EP3562162A1/en not_active Withdrawn
-
2019
- 2019-04-24 JP JP2020550073A patent/JP7421492B2/ja active Active
- 2019-04-24 EP EP19732146.6A patent/EP3785442A1/en active Pending
- 2019-04-24 WO PCT/US2019/028859 patent/WO2019209913A1/en not_active Ceased
- 2019-04-24 CN CN201980028681.3A patent/CN112075082B/zh active Active
- 2019-04-24 US US17/050,730 patent/US11323716B2/en active Active
-
2022
- 2022-04-22 US US17/727,253 patent/US20220264095A1/en not_active Abandoned
Patent Citations (5)
| Publication number | Priority date | Publication date | Assignee | Title |
|---|---|---|---|---|
| WO2002061678A2 (en) * | 2001-01-31 | 2002-08-08 | Prediction Dynamics Limited | Feature selection for neural networks |
| CN1857001A (zh) * | 2003-05-20 | 2006-11-01 | Amt先进多媒体科技公司 | 混合视频压缩方法 |
| US8775341B1 (en) * | 2010-10-26 | 2014-07-08 | Michael Lamport Commons | Intelligent control with hierarchical stacked neural networks |
| CN107736027A (zh) * | 2015-06-12 | 2018-02-23 | 松下知识产权经营株式会社 | 图像编码方法、图像解码方法、图像编码装置及图像解码装置 |
| US9941900B1 (en) * | 2017-10-03 | 2018-04-10 | Dropbox, Inc. | Techniques for general-purpose lossless data compression using a recurrent neural network |
Non-Patent Citations (4)
| Title |
|---|
| Description of Core Experiment 3: Intra Prediction and Mode Coding;Geert Van der Auwera;Joint Video Experts Team (JVET);全文 * |
| Description of SDR, HDR, and 360° video coding technology proposal by Fraunhofer HHI;M. Albrecht ET AL;Joint Video Experts Team (JVET);参见第2.1.3、2.1.9.2部分 * |
| Intra prediction modes based on neural networks;Jonathan Pfaff ET AL;Joint Video Experts Team (JVET);全文 * |
| JVET AHG report: Tool evaluation (AHG1).Joint Video Exploration Team (JVET).2017,全文. * |
Also Published As
| Publication number | Publication date |
|---|---|
| JP2021520087A (ja) | 2021-08-12 |
| US20220264095A1 (en) | 2022-08-18 |
| JP7421492B2 (ja) | 2024-01-24 |
| WO2019209913A1 (en) | 2019-10-31 |
| EP3562162A1 (en) | 2019-10-30 |
| EP3785442A1 (en) | 2021-03-03 |
| US20210120247A1 (en) | 2021-04-22 |
| CN112075082A (zh) | 2020-12-11 |
| US11323716B2 (en) | 2022-05-03 |
Similar Documents
| Publication | Publication Date | Title |
|---|---|---|
| CN112075082B (zh) | 用于基于cabac的神经网络实现方式的视频编码与解码的方法及设备 | |
| US12512854B2 (en) | Methods and apparatus for unified significance map coding | |
| CN103959775B (zh) | 一种视频数据编解码的方法及设备 | |
| CN103563381B (zh) | 对视频数据进行上下文自适应译码 | |
| TWI856996B (zh) | 用於係數位準之逃逸寫碼 | |
| US11483562B2 (en) | Method and apparatus for video encoding and decoding based on context switching | |
| CN109997361A (zh) | 用于视频译码的低复杂度符号预测 | |
| US10791341B2 (en) | Binary arithmetic coding with progressive modification of adaptation parameters | |
| US11695962B2 (en) | Encoding and decoding methods and corresponding devices | |
| CN112534815A (zh) | 使用阈值用于系数译码的常规译码二进制数缩减 | |
| CN117915092A (zh) | 数据编码和解码方法、数据编码和解码设备及存储介质 | |
| KR20190120337A (ko) | 픽처 인코딩 및 디코딩을 위한 방법 및 디바이스 | |
| KR102380579B1 (ko) | 비디오 데이터에 관련된 신택스 엘리먼트를 나타내는 이진 심볼들의 시퀀스의 컨텍스트-적응적 이진 산술 코딩을 위한 방법 및 디바이스 | |
| CN119135922A (zh) | 视频解码方法、视频编码方法、装置及设备 | |
| US12355996B2 (en) | Encoding and decoding methods and corresponding devices | |
| CN120476593A (zh) | 帧内模板匹配预测方法、视频编解码方法、装置和系统 |
Legal Events
| Date | Code | Title | Description |
|---|---|---|---|
| PB01 | Publication | ||
| PB01 | Publication | ||
| SE01 | Entry into force of request for substantive examination | ||
| SE01 | Entry into force of request for substantive examination | ||
| GR01 | Patent grant | ||
| GR01 | Patent grant |