CN112075082B - 用于基于cabac的神经网络实现方式的视频编码与解码的方法及设备 - Google Patents

用于基于cabac的神经网络实现方式的视频编码与解码的方法及设备 Download PDF

Info

Publication number
CN112075082B
CN112075082B CN201980028681.3A CN201980028681A CN112075082B CN 112075082 B CN112075082 B CN 112075082B CN 201980028681 A CN201980028681 A CN 201980028681A CN 112075082 B CN112075082 B CN 112075082B
Authority
CN
China
Prior art keywords
syntax element
bin
current block
encoded
context
Prior art date
Legal status (The legal status is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the status listed.)
Active
Application number
CN201980028681.3A
Other languages
English (en)
Chinese (zh)
Other versions
CN112075082A (zh
Inventor
F.加尔平
F.拉卡普
K.纳瑟
P.博德斯
Current Assignee (The listed assignees may be inaccurate. Google has not performed a legal analysis and makes no representation or warranty as to the accuracy of the list.)
InterDigital VC Holdings Inc
Original Assignee
InterDigital VC Holdings Inc
Priority date (The priority date is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the date listed.)
Filing date
Publication date
Application filed by InterDigital VC Holdings Inc filed Critical InterDigital VC Holdings Inc
Publication of CN112075082A publication Critical patent/CN112075082A/zh
Application granted granted Critical
Publication of CN112075082B publication Critical patent/CN112075082B/zh
Active legal-status Critical Current
Anticipated expiration legal-status Critical

Links

Classifications

    • HELECTRICITY
    • H04ELECTRIC COMMUNICATION TECHNIQUE
    • H04NPICTORIAL COMMUNICATION, e.g. TELEVISION
    • H04N19/00Methods or arrangements for coding, decoding, compressing or decompressing digital video signals
    • H04N19/10Methods or arrangements for coding, decoding, compressing or decompressing digital video signals using adaptive coding
    • H04N19/102Methods or arrangements for coding, decoding, compressing or decompressing digital video signals using adaptive coding characterised by the element, parameter or selection affected or controlled by the adaptive coding
    • H04N19/13Adaptive entropy coding, e.g. adaptive variable length coding [AVLC] or context adaptive binary arithmetic coding [CABAC]
    • GPHYSICS
    • G06COMPUTING OR CALCULATING; COUNTING
    • G06NCOMPUTING ARRANGEMENTS BASED ON SPECIFIC COMPUTATIONAL MODELS
    • G06N3/00Computing arrangements based on biological models
    • G06N3/02Neural networks
    • HELECTRICITY
    • H04ELECTRIC COMMUNICATION TECHNIQUE
    • H04NPICTORIAL COMMUNICATION, e.g. TELEVISION
    • H04N19/00Methods or arrangements for coding, decoding, compressing or decompressing digital video signals
    • H04N19/10Methods or arrangements for coding, decoding, compressing or decompressing digital video signals using adaptive coding
    • H04N19/102Methods or arrangements for coding, decoding, compressing or decompressing digital video signals using adaptive coding characterised by the element, parameter or selection affected or controlled by the adaptive coding
    • H04N19/103Selection of coding mode or of prediction mode
    • H04N19/105Selection of the reference unit for prediction within a chosen coding or prediction mode, e.g. adaptive choice of position and number of pixels used for prediction
    • HELECTRICITY
    • H04ELECTRIC COMMUNICATION TECHNIQUE
    • H04NPICTORIAL COMMUNICATION, e.g. TELEVISION
    • H04N19/00Methods or arrangements for coding, decoding, compressing or decompressing digital video signals
    • H04N19/10Methods or arrangements for coding, decoding, compressing or decompressing digital video signals using adaptive coding
    • H04N19/102Methods or arrangements for coding, decoding, compressing or decompressing digital video signals using adaptive coding characterised by the element, parameter or selection affected or controlled by the adaptive coding
    • H04N19/103Selection of coding mode or of prediction mode
    • H04N19/11Selection of coding mode or of prediction mode among a plurality of spatial predictive coding modes
    • HELECTRICITY
    • H04ELECTRIC COMMUNICATION TECHNIQUE
    • H04NPICTORIAL COMMUNICATION, e.g. TELEVISION
    • H04N19/00Methods or arrangements for coding, decoding, compressing or decompressing digital video signals
    • H04N19/10Methods or arrangements for coding, decoding, compressing or decompressing digital video signals using adaptive coding
    • H04N19/102Methods or arrangements for coding, decoding, compressing or decompressing digital video signals using adaptive coding characterised by the element, parameter or selection affected or controlled by the adaptive coding
    • H04N19/132Sampling, masking or truncation of coding units, e.g. adaptive resampling, frame skipping, frame interpolation or high-frequency transform coefficient masking
    • HELECTRICITY
    • H04ELECTRIC COMMUNICATION TECHNIQUE
    • H04NPICTORIAL COMMUNICATION, e.g. TELEVISION
    • H04N19/00Methods or arrangements for coding, decoding, compressing or decompressing digital video signals
    • H04N19/10Methods or arrangements for coding, decoding, compressing or decompressing digital video signals using adaptive coding
    • H04N19/169Methods or arrangements for coding, decoding, compressing or decompressing digital video signals using adaptive coding characterised by the coding unit, i.e. the structural portion or semantic portion of the video signal being the object or the subject of the adaptive coding
    • H04N19/17Methods or arrangements for coding, decoding, compressing or decompressing digital video signals using adaptive coding characterised by the coding unit, i.e. the structural portion or semantic portion of the video signal being the object or the subject of the adaptive coding the unit being an image region, e.g. an object
    • H04N19/176Methods or arrangements for coding, decoding, compressing or decompressing digital video signals using adaptive coding characterised by the coding unit, i.e. the structural portion or semantic portion of the video signal being the object or the subject of the adaptive coding the unit being an image region, e.g. an object the region being a block, e.g. a macroblock
    • HELECTRICITY
    • H04ELECTRIC COMMUNICATION TECHNIQUE
    • H04NPICTORIAL COMMUNICATION, e.g. TELEVISION
    • H04N19/00Methods or arrangements for coding, decoding, compressing or decompressing digital video signals
    • H04N19/46Embedding additional information in the video signal during the compression process
    • HELECTRICITY
    • H04ELECTRIC COMMUNICATION TECHNIQUE
    • H04NPICTORIAL COMMUNICATION, e.g. TELEVISION
    • H04N19/00Methods or arrangements for coding, decoding, compressing or decompressing digital video signals
    • H04N19/70Methods or arrangements for coding, decoding, compressing or decompressing digital video signals characterised by syntax aspects related to video coding, e.g. related to compression standards
    • HELECTRICITY
    • H04ELECTRIC COMMUNICATION TECHNIQUE
    • H04NPICTORIAL COMMUNICATION, e.g. TELEVISION
    • H04N19/00Methods or arrangements for coding, decoding, compressing or decompressing digital video signals
    • H04N19/90Methods or arrangements for coding, decoding, compressing or decompressing digital video signals using coding techniques not provided for in groups H04N19/10-H04N19/85, e.g. fractals
    • H04N19/91Entropy coding, e.g. variable length coding [VLC] or arithmetic coding
    • GPHYSICS
    • G06COMPUTING OR CALCULATING; COUNTING
    • G06TIMAGE DATA PROCESSING OR GENERATION, IN GENERAL
    • G06T9/00Image coding
    • G06T9/002Image coding using neural networks

Landscapes

  • Engineering & Computer Science (AREA)
  • Multimedia (AREA)
  • Signal Processing (AREA)
  • Physics & Mathematics (AREA)
  • Theoretical Computer Science (AREA)
  • Data Mining & Analysis (AREA)
  • General Health & Medical Sciences (AREA)
  • Biomedical Technology (AREA)
  • Biophysics (AREA)
  • Computational Linguistics (AREA)
  • Life Sciences & Earth Sciences (AREA)
  • Evolutionary Computation (AREA)
  • Artificial Intelligence (AREA)
  • Molecular Biology (AREA)
  • Computing Systems (AREA)
  • General Engineering & Computer Science (AREA)
  • General Physics & Mathematics (AREA)
  • Mathematical Physics (AREA)
  • Software Systems (AREA)
  • Health & Medical Sciences (AREA)
  • Compression Or Coding Systems Of Tv Signals (AREA)
CN201980028681.3A 2018-04-27 2019-04-24 用于基于cabac的神经网络实现方式的视频编码与解码的方法及设备 Active CN112075082B (zh)

Applications Claiming Priority (3)

Application Number Priority Date Filing Date Title
EP18305537.5 2018-04-27
EP18305537.5A EP3562162A1 (en) 2018-04-27 2018-04-27 Method and apparatus for video encoding and decoding based on neural network implementation of cabac
PCT/US2019/028859 WO2019209913A1 (en) 2018-04-27 2019-04-24 Method and apparatus for video encoding and decoding based on neural network implementation of cabac

Publications (2)

Publication Number Publication Date
CN112075082A CN112075082A (zh) 2020-12-11
CN112075082B true CN112075082B (zh) 2024-03-26

Family

ID=62143088

Family Applications (1)

Application Number Title Priority Date Filing Date
CN201980028681.3A Active CN112075082B (zh) 2018-04-27 2019-04-24 用于基于cabac的神经网络实现方式的视频编码与解码的方法及设备

Country Status (5)

Country Link
US (2) US11323716B2 (https=)
EP (2) EP3562162A1 (https=)
JP (1) JP7421492B2 (https=)
CN (1) CN112075082B (https=)
WO (1) WO2019209913A1 (https=)

Families Citing this family (24)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
EP3562162A1 (en) * 2018-04-27 2019-10-30 InterDigital VC Holdings, Inc. Method and apparatus for video encoding and decoding based on neural network implementation of cabac
US10652581B1 (en) * 2019-02-27 2020-05-12 Google Llc Entropy coding in image and video compression using machine learning
CN112235583B (zh) * 2019-07-15 2021-12-24 华为技术有限公司 基于小波变换的图像编解码方法及装置
US12323596B2 (en) 2019-11-08 2025-06-03 Interdigital Madison Patent Holdings, Sas Deep intra prediction of an image block
US11671110B2 (en) * 2019-11-22 2023-06-06 Tencent America LLC Method and apparatus for neural network model compression/decompression
US11395007B2 (en) 2019-12-12 2022-07-19 Tencent America LLC Method for signaling dependent and independent picture header
CN111431540B (zh) * 2020-04-01 2021-10-08 西安交通大学 一种基于神经网络模型的fpga配置文件算术压缩与解压方法
EP4136756A1 (en) * 2020-04-14 2023-02-22 Fraunhofer-Gesellschaft zur Förderung der angewandten Forschung e.V. Decoder for decoding weight parameters of a neural network, encoder, methods and encoded representation using probability estimation parameters
US12363310B2 (en) * 2020-06-16 2025-07-15 Nokia Technologies Oy Guided probability model for compressed representation of neural networks
CN114339262B (zh) * 2020-09-30 2023-02-14 华为技术有限公司 熵编/解码方法及装置
FR3114933B1 (fr) * 2020-10-06 2023-10-13 Fond B Com Procédé et dispositif électronique de décodage d’un flux de données, et programme d’ordinateur associé
CN114501031B (zh) * 2020-11-13 2023-06-02 华为技术有限公司 一种压缩编码、解压缩方法以及装置
WO2022116165A1 (zh) * 2020-12-04 2022-06-09 深圳市大疆创新科技有限公司 视频编码方法、解码方法、编码器、解码器以及ai加速器
CN114915782B (zh) * 2021-02-10 2025-09-12 华为技术有限公司 一种编码方法、解码方法及设备
US11425368B1 (en) * 2021-02-17 2022-08-23 Adobe Inc. Lossless image compression using block based prediction and optimized context adaptive entropy coding
FR3120174A1 (fr) * 2021-02-19 2022-08-26 Orange Prédiction pondérée d’image, codage et décodage d’image utilisant une telle prédiction pondérée
CN115118972B (zh) * 2021-03-17 2025-09-02 华为技术有限公司 视频图像的编解码方法及相关设备
CN117099370A (zh) * 2021-03-26 2023-11-21 杜比实验室特许公司 使用神经网络对图像和视频编码中的潜在特征进行多分布熵建模
CN117321989A (zh) * 2021-04-01 2023-12-29 华为技术有限公司 基于神经网络的图像处理中的辅助信息的独立定位
US12041252B2 (en) 2021-06-07 2024-07-16 Sony Interactive Entertainment Inc. Multi-threaded CABAC decoding
CN115695812A (zh) 2021-07-30 2023-02-03 中兴通讯股份有限公司 视频编码、视频解码方法、装置、电子设备和存储介质
CN114615507B (zh) * 2022-05-11 2022-09-13 哈尔滨工业大学(深圳)(哈尔滨工业大学深圳科技创新研究院) 一种图像编码方法、解码方法及相关装置
CN116519695B (zh) * 2022-12-06 2025-09-09 贵州民族大学 一种光学元件表面缺陷检测方法及相关设备
CN120958811A (zh) * 2023-03-22 2025-11-14 抖音视界有限公司 用于视觉数据处理的方法、装置和介质

Citations (5)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
WO2002061678A2 (en) * 2001-01-31 2002-08-08 Prediction Dynamics Limited Feature selection for neural networks
CN1857001A (zh) * 2003-05-20 2006-11-01 Amt先进多媒体科技公司 混合视频压缩方法
US8775341B1 (en) * 2010-10-26 2014-07-08 Michael Lamport Commons Intelligent control with hierarchical stacked neural networks
CN107736027A (zh) * 2015-06-12 2018-02-23 松下知识产权经营株式会社 图像编码方法、图像解码方法、图像编码装置及图像解码装置
US9941900B1 (en) * 2017-10-03 2018-04-10 Dropbox, Inc. Techniques for general-purpose lossless data compression using a recurrent neural network

Family Cites Families (15)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
JP3075049B2 (ja) * 1993-11-30 2000-08-07 三菱電機株式会社 画像符号化装置
AU2002351389A1 (en) * 2001-12-17 2003-06-30 Microsoft Corporation Skip macroblock coding
CN1190755C (zh) * 2002-11-08 2005-02-23 北京工业大学 基于感知器的彩色图像无损压缩方法
US20070233477A1 (en) * 2006-03-30 2007-10-04 Infima Ltd. Lossless Data Compression Using Adaptive Context Modeling
JP2009111691A (ja) 2007-10-30 2009-05-21 Hitachi Ltd 画像符号化装置及び符号化方法、画像復号化装置及び復号化方法
JP5676744B2 (ja) * 2010-04-13 2015-02-25 フラウンホーファー−ゲゼルシャフト・ツール・フェルデルング・デル・アンゲヴァンテン・フォルシュング・アインゲトラーゲネル・フェライン エントロピー符号化
US9264706B2 (en) 2012-04-11 2016-02-16 Qualcomm Incorporated Bypass bins for reference index coding in video coding
KR20150090206A (ko) 2013-01-30 2015-08-05 인텔 코포레이션 차세대 비디오를 위한 코딩을 위한 콘텐츠 적응적 파라메트릭 변환
US10979718B2 (en) * 2017-09-01 2021-04-13 Apple Inc. Machine learning video processing systems and methods
US11166014B2 (en) * 2017-12-14 2021-11-02 Electronics And Telecommunications Research Institute Image encoding and decoding method and device using prediction network
KR102435595B1 (ko) * 2017-12-15 2022-08-25 한국전자통신연구원 분산 처리 환경에서의 학습 파라미터의 압축 및 전송을 제공하는 방법 및 장치
US10841577B2 (en) * 2018-02-08 2020-11-17 Electronics And Telecommunications Research Institute Method and apparatus for video encoding and video decoding based on neural network
EP3777141B1 (en) * 2018-03-29 2025-09-17 Fraunhofer-Gesellschaft zur Förderung der angewandten Forschung e.V. Intra-prediction mode concept for block-wise picture coding
EP3562162A1 (en) * 2018-04-27 2019-10-30 InterDigital VC Holdings, Inc. Method and apparatus for video encoding and decoding based on neural network implementation of cabac
US10652581B1 (en) * 2019-02-27 2020-05-12 Google Llc Entropy coding in image and video compression using machine learning

Patent Citations (5)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
WO2002061678A2 (en) * 2001-01-31 2002-08-08 Prediction Dynamics Limited Feature selection for neural networks
CN1857001A (zh) * 2003-05-20 2006-11-01 Amt先进多媒体科技公司 混合视频压缩方法
US8775341B1 (en) * 2010-10-26 2014-07-08 Michael Lamport Commons Intelligent control with hierarchical stacked neural networks
CN107736027A (zh) * 2015-06-12 2018-02-23 松下知识产权经营株式会社 图像编码方法、图像解码方法、图像编码装置及图像解码装置
US9941900B1 (en) * 2017-10-03 2018-04-10 Dropbox, Inc. Techniques for general-purpose lossless data compression using a recurrent neural network

Non-Patent Citations (4)

* Cited by examiner, † Cited by third party
Title
Description of Core Experiment 3: Intra Prediction and Mode Coding;Geert Van der Auwera;Joint Video Experts Team (JVET);全文 *
Description of SDR, HDR, and 360° video coding technology proposal by Fraunhofer HHI;M. Albrecht ET AL;Joint Video Experts Team (JVET);参见第2.1.3、2.1.9.2部分 *
Intra prediction modes based on neural networks;Jonathan Pfaff ET AL;Joint Video Experts Team (JVET);全文 *
JVET AHG report: Tool evaluation (AHG1).Joint Video Exploration Team (JVET).2017,全文. *

Also Published As

Publication number Publication date
JP2021520087A (ja) 2021-08-12
US20220264095A1 (en) 2022-08-18
JP7421492B2 (ja) 2024-01-24
WO2019209913A1 (en) 2019-10-31
EP3562162A1 (en) 2019-10-30
EP3785442A1 (en) 2021-03-03
US20210120247A1 (en) 2021-04-22
CN112075082A (zh) 2020-12-11
US11323716B2 (en) 2022-05-03

Similar Documents

Publication Publication Date Title
CN112075082B (zh) 用于基于cabac的神经网络实现方式的视频编码与解码的方法及设备
US12512854B2 (en) Methods and apparatus for unified significance map coding
CN103959775B (zh) 一种视频数据编解码的方法及设备
CN103563381B (zh) 对视频数据进行上下文自适应译码
TWI856996B (zh) 用於係數位準之逃逸寫碼
US11483562B2 (en) Method and apparatus for video encoding and decoding based on context switching
CN109997361A (zh) 用于视频译码的低复杂度符号预测
US10791341B2 (en) Binary arithmetic coding with progressive modification of adaptation parameters
US11695962B2 (en) Encoding and decoding methods and corresponding devices
CN112534815A (zh) 使用阈值用于系数译码的常规译码二进制数缩减
CN117915092A (zh) 数据编码和解码方法、数据编码和解码设备及存储介质
KR20190120337A (ko) 픽처 인코딩 및 디코딩을 위한 방법 및 디바이스
KR102380579B1 (ko) 비디오 데이터에 관련된 신택스 엘리먼트를 나타내는 이진 심볼들의 시퀀스의 컨텍스트-적응적 이진 산술 코딩을 위한 방법 및 디바이스
CN119135922A (zh) 视频解码方法、视频编码方法、装置及设备
US12355996B2 (en) Encoding and decoding methods and corresponding devices
CN120476593A (zh) 帧内模板匹配预测方法、视频编解码方法、装置和系统

Legal Events

Date Code Title Description
PB01 Publication
PB01 Publication
SE01 Entry into force of request for substantive examination
SE01 Entry into force of request for substantive examination
GR01 Patent grant
GR01 Patent grant