CN116325729B - 用于视频译码的方法和装置 - Google Patents

用于视频译码的方法和装置

Info

Publication number
CN116325729B
CN116325729B CN202180065157.0A CN202180065157A CN116325729B CN 116325729 B CN116325729 B CN 116325729B CN 202180065157 A CN202180065157 A CN 202180065157A CN 116325729 B CN116325729 B CN 116325729B
Authority
CN
China
Prior art keywords
alpha
block
video
syntax elements
cnn
Prior art date
Legal status (The legal status is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the status listed.)
Active
Application number
CN202180065157.0A
Other languages
English (en)
Chinese (zh)
Other versions
CN116325729A (zh
Inventor
王洪涛
陈建乐
M·卡切夫维茨
Current Assignee (The listed assignees may be inaccurate. Google has not performed a legal analysis and makes no representation or warranty as to the accuracy of the list.)
Qualcomm Inc
Original Assignee
Qualcomm Inc
Priority date (The priority date is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the date listed.)
Filing date
Publication date
Application filed by Qualcomm Inc filed Critical Qualcomm Inc
Publication of CN116325729A publication Critical patent/CN116325729A/zh
Application granted granted Critical
Publication of CN116325729B publication Critical patent/CN116325729B/zh
Active legal-status Critical Current
Anticipated expiration legal-status Critical

Links

Classifications

    • HELECTRICITY
    • H04ELECTRIC COMMUNICATION TECHNIQUE
    • H04NPICTORIAL COMMUNICATION, e.g. TELEVISION
    • H04N19/00Methods or arrangements for coding, decoding, compressing or decompressing digital video signals
    • H04N19/42Methods or arrangements for coding, decoding, compressing or decompressing digital video signals characterised by implementation details or hardware specially adapted for video compression or decompression, e.g. dedicated software implementation
    • H04N19/436Methods or arrangements for coding, decoding, compressing or decompressing digital video signals characterised by implementation details or hardware specially adapted for video compression or decompression, e.g. dedicated software implementation using parallelised computational arrangements
    • GPHYSICS
    • G06COMPUTING OR CALCULATING; COUNTING
    • G06NCOMPUTING ARRANGEMENTS BASED ON SPECIFIC COMPUTATIONAL MODELS
    • G06N3/00Computing arrangements based on biological models
    • G06N3/02Neural networks
    • G06N3/04Architecture, e.g. interconnection topology
    • GPHYSICS
    • G06COMPUTING OR CALCULATING; COUNTING
    • G06NCOMPUTING ARRANGEMENTS BASED ON SPECIFIC COMPUTATIONAL MODELS
    • G06N3/00Computing arrangements based on biological models
    • G06N3/02Neural networks
    • G06N3/04Architecture, e.g. interconnection topology
    • G06N3/045Combinations of networks
    • GPHYSICS
    • G06COMPUTING OR CALCULATING; COUNTING
    • G06NCOMPUTING ARRANGEMENTS BASED ON SPECIFIC COMPUTATIONAL MODELS
    • G06N3/00Computing arrangements based on biological models
    • G06N3/02Neural networks
    • G06N3/04Architecture, e.g. interconnection topology
    • G06N3/0464Convolutional networks [CNN, ConvNet]
    • GPHYSICS
    • G06COMPUTING OR CALCULATING; COUNTING
    • G06NCOMPUTING ARRANGEMENTS BASED ON SPECIFIC COMPUTATIONAL MODELS
    • G06N3/00Computing arrangements based on biological models
    • G06N3/02Neural networks
    • G06N3/04Architecture, e.g. interconnection topology
    • G06N3/048Activation functions
    • GPHYSICS
    • G06COMPUTING OR CALCULATING; COUNTING
    • G06TIMAGE DATA PROCESSING OR GENERATION, IN GENERAL
    • G06T9/00Image coding
    • G06T9/002Image coding using neural networks
    • HELECTRICITY
    • H04ELECTRIC COMMUNICATION TECHNIQUE
    • H04NPICTORIAL COMMUNICATION, e.g. TELEVISION
    • H04N19/00Methods or arrangements for coding, decoding, compressing or decompressing digital video signals
    • H04N19/10Methods or arrangements for coding, decoding, compressing or decompressing digital video signals using adaptive coding
    • H04N19/102Methods or arrangements for coding, decoding, compressing or decompressing digital video signals using adaptive coding characterised by the element, parameter or selection affected or controlled by the adaptive coding
    • H04N19/117Filters, e.g. for pre-processing or post-processing
    • HELECTRICITY
    • H04ELECTRIC COMMUNICATION TECHNIQUE
    • H04NPICTORIAL COMMUNICATION, e.g. TELEVISION
    • H04N19/00Methods or arrangements for coding, decoding, compressing or decompressing digital video signals
    • H04N19/10Methods or arrangements for coding, decoding, compressing or decompressing digital video signals using adaptive coding
    • H04N19/169Methods or arrangements for coding, decoding, compressing or decompressing digital video signals using adaptive coding characterised by the coding unit, i.e. the structural portion or semantic portion of the video signal being the object or the subject of the adaptive coding
    • H04N19/17Methods or arrangements for coding, decoding, compressing or decompressing digital video signals using adaptive coding characterised by the coding unit, i.e. the structural portion or semantic portion of the video signal being the object or the subject of the adaptive coding the unit being an image region, e.g. an object
    • H04N19/176Methods or arrangements for coding, decoding, compressing or decompressing digital video signals using adaptive coding characterised by the coding unit, i.e. the structural portion or semantic portion of the video signal being the object or the subject of the adaptive coding the unit being an image region, e.g. an object the region being a block, e.g. a macroblock
    • HELECTRICITY
    • H04ELECTRIC COMMUNICATION TECHNIQUE
    • H04NPICTORIAL COMMUNICATION, e.g. TELEVISION
    • H04N19/00Methods or arrangements for coding, decoding, compressing or decompressing digital video signals
    • H04N19/10Methods or arrangements for coding, decoding, compressing or decompressing digital video signals using adaptive coding
    • H04N19/169Methods or arrangements for coding, decoding, compressing or decompressing digital video signals using adaptive coding characterised by the coding unit, i.e. the structural portion or semantic portion of the video signal being the object or the subject of the adaptive coding
    • H04N19/184Methods or arrangements for coding, decoding, compressing or decompressing digital video signals using adaptive coding characterised by the coding unit, i.e. the structural portion or semantic portion of the video signal being the object or the subject of the adaptive coding the unit being bits, e.g. of the compressed video stream
    • HELECTRICITY
    • H04ELECTRIC COMMUNICATION TECHNIQUE
    • H04NPICTORIAL COMMUNICATION, e.g. TELEVISION
    • H04N19/00Methods or arrangements for coding, decoding, compressing or decompressing digital video signals
    • H04N19/70Methods or arrangements for coding, decoding, compressing or decompressing digital video signals characterised by syntax aspects related to video coding, e.g. related to compression standards
    • HELECTRICITY
    • H04ELECTRIC COMMUNICATION TECHNIQUE
    • H04NPICTORIAL COMMUNICATION, e.g. TELEVISION
    • H04N19/00Methods or arrangements for coding, decoding, compressing or decompressing digital video signals
    • H04N19/80Details of filtering operations specially adapted for video compression, e.g. for pixel interpolation
    • H04N19/82Details of filtering operations specially adapted for video compression, e.g. for pixel interpolation involving filtering within a prediction loop

Landscapes

  • Engineering & Computer Science (AREA)
  • Multimedia (AREA)
  • Theoretical Computer Science (AREA)
  • Signal Processing (AREA)
  • Physics & Mathematics (AREA)
  • Computing Systems (AREA)
  • Evolutionary Computation (AREA)
  • Artificial Intelligence (AREA)
  • General Physics & Mathematics (AREA)
  • General Health & Medical Sciences (AREA)
  • Data Mining & Analysis (AREA)
  • Molecular Biology (AREA)
  • Computational Linguistics (AREA)
  • General Engineering & Computer Science (AREA)
  • Biophysics (AREA)
  • Mathematical Physics (AREA)
  • Software Systems (AREA)
  • Biomedical Technology (AREA)
  • Life Sciences & Earth Sciences (AREA)
  • Health & Medical Sciences (AREA)
  • Compression Or Coding Systems Of Tv Signals (AREA)
  • Picture Signal Circuits (AREA)
CN202180065157.0A 2020-09-30 2021-09-30 用于视频译码的方法和装置 Active CN116325729B (zh)

Applications Claiming Priority (5)

Application Number Priority Date Filing Date Title
US202063085936P 2020-09-30 2020-09-30
US63/085,936 2020-09-30
US17/489,459 2021-09-29
US17/489,459 US11647212B2 (en) 2020-09-30 2021-09-29 Activation function design in neural network-based filtering process for video coding
PCT/US2021/052950 WO2022072684A1 (en) 2020-09-30 2021-09-30 Activation function design in neural network-based filtering process for video coding

Publications (2)

Publication Number Publication Date
CN116325729A CN116325729A (zh) 2023-06-23
CN116325729B true CN116325729B (zh) 2026-03-27

Family

ID=80821961

Family Applications (1)

Application Number Title Priority Date Filing Date
CN202180065157.0A Active CN116325729B (zh) 2020-09-30 2021-09-30 用于视频译码的方法和装置

Country Status (7)

Country Link
US (2) US11647212B2 (https=)
EP (1) EP4222954A1 (https=)
JP (1) JP7787883B2 (https=)
KR (1) KR20230078658A (https=)
CN (1) CN116325729B (https=)
PH (1) PH12023550209A1 (https=)
WO (1) WO2022072684A1 (https=)

Families Citing this family (9)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
US11647212B2 (en) 2020-09-30 2023-05-09 Qualcomm Incorporated Activation function design in neural network-based filtering process for video coding
US20220321919A1 (en) * 2021-03-23 2022-10-06 Sharp Kabushiki Kaisha Systems and methods for signaling neural network-based in-loop filter parameter information in video coding
US12167047B2 (en) * 2022-01-13 2024-12-10 Tencent America LLC Neural network-based deblocking filters
US12556718B2 (en) * 2022-12-29 2026-02-17 Samsung Electronics Co., Ltd. Electronic device and method with image encoding and decoding
CN116805971B (zh) * 2023-04-11 2024-07-12 腾讯科技(深圳)有限公司 图像编解码方法、装置、设备
US12542932B2 (en) * 2023-04-12 2026-02-03 Qualcomm Incorporated Neural network-based in loop filter architectures with separable convolution and multi-scale enhancement for video coding
CN119211544A (zh) * 2023-06-26 2024-12-27 腾讯科技(深圳)有限公司 基于神经网络的图像滤波及编解码方法、装置、设备、存储介质
CN120345242A (zh) * 2023-07-05 2025-07-18 Lg电子株式会社 图像编码/解码方法、存储比特流的记录介质和发送比特流的方法
WO2025217290A1 (en) * 2024-04-10 2025-10-16 Qualcomm Incorporated Improvements of resnet based in-loop filter architecture for video coding

Citations (1)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
WO2020188273A1 (en) * 2019-03-20 2020-09-24 V-Nova International Limited Low complexity enhancement video coding

Family Cites Families (14)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
US11113800B2 (en) * 2017-01-18 2021-09-07 Nvidia Corporation Filtering image data using a neural network
EP3451293A1 (en) * 2017-08-28 2019-03-06 Thomson Licensing Method and apparatus for filtering with multi-branch deep learning
US10284432B1 (en) * 2018-07-03 2019-05-07 Kabushiki Kaisha Ubitus Method for enhancing quality of media transmitted via network
US11025907B2 (en) * 2019-02-28 2021-06-01 Google Llc Receptive-field-conforming convolution models for video coding
US10999606B2 (en) * 2019-01-08 2021-05-04 Intel Corporation Method and system of neural network loop filtering for video coding
US12282840B2 (en) 2019-01-11 2025-04-22 Samsung Electronics Co., Ltd. Method and apparatus with neural network layer contraction
KR102646695B1 (ko) 2019-01-15 2024-03-12 포틀랜드 스테이트 유니버시티 비디오 프레임 보간을 위한 특징 피라미드 워핑
EP3706046A1 (en) * 2019-03-04 2020-09-09 InterDigital VC Holdings, Inc. Method and device for picture encoding and decoding
EP3938962B1 (en) * 2019-03-15 2025-11-26 Dolby International AB Method and apparatus for updating a neural network
KR20200114436A (ko) 2019-03-28 2020-10-07 국방과학연구소 스케일러블 영상 부호화를 수행하는 장치 및 방법
US10909728B1 (en) * 2019-05-01 2021-02-02 Amazon Technologies, Inc. Learned lossy image compression codec
US11216917B2 (en) 2019-05-03 2022-01-04 Amazon Technologies, Inc. Video enhancement using a neural network
US10944996B2 (en) 2019-08-19 2021-03-09 Intel Corporation Visual quality optimized video compression
US11647212B2 (en) 2020-09-30 2023-05-09 Qualcomm Incorporated Activation function design in neural network-based filtering process for video coding

Patent Citations (1)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
WO2020188273A1 (en) * 2019-03-20 2020-09-24 V-Nova International Limited Low complexity enhancement video coding

Also Published As

Publication number Publication date
EP4222954A1 (en) 2023-08-09
KR20230078658A (ko) 2023-06-02
JP2023543762A (ja) 2023-10-18
US11778213B2 (en) 2023-10-03
US20230012661A1 (en) 2023-01-19
PH12023550209A1 (en) 2024-06-24
WO2022072684A1 (en) 2022-04-07
US20220103845A1 (en) 2022-03-31
CN116325729A (zh) 2023-06-23
US11647212B2 (en) 2023-05-09
JP7787883B2 (ja) 2025-12-17

Similar Documents

Publication Publication Date Title
JP7795528B2 (ja) ビデオコーディング中にフィルタ処理するための複数のニューラルネットワークモデル
CN116235494B (zh) 用于视频译码的滤波过程的方法和装置
CN113853784B (zh) 用于视频译码的多个自适应环路滤波器集合的方法和装置
CN116325729B (zh) 用于视频译码的方法和装置
CN114258675B (zh) 用于视频编码的跨分量自适应环路滤波
CN114080802B (zh) 用于视频译码中的变换跳过模式和调色板模式的最小允许量化参数
CN114128286B (zh) 视频编解码中的环绕运动补偿
CN116508321A (zh) 视频译码期间基于联合分量神经网络的滤波
CN114223202B (zh) 低频不可分离变换(lfnst)信令
CN113940069A (zh) 用于视频译码中的低频不可分离变换的变换和最后有效系数位置信令
CN114080805B (zh) 用于视频译码的自适应环路滤波的非线性扩展
KR20230038709A (ko) 다중 적응형 루프 필터 세트들
CN118921465A (zh) 视频译码中的系数域块差分脉冲译码调制
CN113170162B (zh) 用于视频译码的共享候选列表和并行候选列表推导
CN114503590B (zh) 用信号发送针对变换跳过中的残差值的译码方案以进行视频译码
CN115866275B (zh) 用于视频编解码的变换单元设计
CN116235498B (zh) 去块滤波器参数信令
CN116210222B (zh) 约束用于以不同比特深度对视频数据进行译码的自适应环路滤波的操作比特深度
CN114731403B (zh) 基于量化参数的残差编解码选择和低层级信令
CN116235495A (zh) 用于视频译码中的跨分量线性模型(cclm)模式的固定比特深度处理
CN116250233A (zh) 具有最坏情况复杂度处理的扩展低频不可分离变换(lfnst)设计
CN119729016A (zh) 视频译码中的dc帧内模式预测
CN120419172A (zh) 自适应环路滤波器分类器
CN121420548A (zh) 用于随机存取视频译码的常规和神经网络编解码器
CN119487855A (zh) 视频数据的复杂性降低的多模式神经网络滤波

Legal Events

Date Code Title Description
PB01 Publication
PB01 Publication
SE01 Entry into force of request for substantive examination
SE01 Entry into force of request for substantive examination
GR01 Patent grant
GR01 Patent grant