JP7815247B2 - ニューラルネットワークベースのビデオコーディングのためのフロントエンドアーキテクチャ - Google Patents

ニューラルネットワークベースのビデオコーディングのためのフロントエンドアーキテクチャ

Info

Publication number
JP7815247B2
JP7815247B2 JP2023532549A JP2023532549A JP7815247B2 JP 7815247 B2 JP7815247 B2 JP 7815247B2 JP 2023532549 A JP2023532549 A JP 2023532549A JP 2023532549 A JP2023532549 A JP 2023532549A JP 7815247 B2 JP7815247 B2 JP 7815247B2
Authority
JP
Japan
Prior art keywords
frame
channel
layer
convolutional layer
network
Prior art date
Legal status (The legal status is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the status listed.)
Active
Application number
JP2023532549A
Other languages
English (en)
Japanese (ja)
Other versions
JP2023553369A5 (https=
JP2023553369A (ja
Inventor
エギルメス、ヒルミ・エネス
シン、アンキテシュ・クマー
コバン、ムハンメド・ゼイド
カルチェビチ、マルタ
Current Assignee (The listed assignees may be inaccurate. Google has not performed a legal analysis and makes no representation or warranty as to the accuracy of the list.)
Qualcomm Inc
Original Assignee
Qualcomm Inc
Priority date (The priority date is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the date listed.)
Filing date
Publication date
Priority claimed from US17/643,383 external-priority patent/US12231666B2/en
Application filed by Qualcomm Inc filed Critical Qualcomm Inc
Publication of JP2023553369A publication Critical patent/JP2023553369A/ja
Publication of JP2023553369A5 publication Critical patent/JP2023553369A5/ja
Application granted granted Critical
Publication of JP7815247B2 publication Critical patent/JP7815247B2/ja
Active legal-status Critical Current
Anticipated expiration legal-status Critical

Links

Classifications

    • GPHYSICS
    • G06COMPUTING OR CALCULATING; COUNTING
    • G06NCOMPUTING ARRANGEMENTS BASED ON SPECIFIC COMPUTATIONAL MODELS
    • G06N3/00Computing arrangements based on biological models
    • G06N3/02Neural networks
    • G06N3/04Architecture, e.g. interconnection topology
    • G06N3/0464Convolutional networks [CNN, ConvNet]
    • GPHYSICS
    • G06COMPUTING OR CALCULATING; COUNTING
    • G06NCOMPUTING ARRANGEMENTS BASED ON SPECIFIC COMPUTATIONAL MODELS
    • G06N3/00Computing arrangements based on biological models
    • G06N3/02Neural networks
    • G06N3/04Architecture, e.g. interconnection topology
    • G06N3/045Combinations of networks
    • G06N3/0455Auto-encoder networks; Encoder-decoder networks
    • GPHYSICS
    • G06COMPUTING OR CALCULATING; COUNTING
    • G06TIMAGE DATA PROCESSING OR GENERATION, IN GENERAL
    • G06T9/00Image coding
    • G06T9/002Image coding using neural networks
    • HELECTRICITY
    • H04ELECTRIC COMMUNICATION TECHNIQUE
    • H04NPICTORIAL COMMUNICATION, e.g. TELEVISION
    • H04N19/00Methods or arrangements for coding, decoding, compressing or decompressing digital video signals
    • H04N19/10Methods or arrangements for coding, decoding, compressing or decompressing digital video signals using adaptive coding
    • H04N19/102Methods or arrangements for coding, decoding, compressing or decompressing digital video signals using adaptive coding characterised by the element, parameter or selection affected or controlled by the adaptive coding
    • HELECTRICITY
    • H04ELECTRIC COMMUNICATION TECHNIQUE
    • H04NPICTORIAL COMMUNICATION, e.g. TELEVISION
    • H04N19/00Methods or arrangements for coding, decoding, compressing or decompressing digital video signals
    • H04N19/10Methods or arrangements for coding, decoding, compressing or decompressing digital video signals using adaptive coding
    • H04N19/102Methods or arrangements for coding, decoding, compressing or decompressing digital video signals using adaptive coding characterised by the element, parameter or selection affected or controlled by the adaptive coding
    • H04N19/124Quantisation
    • HELECTRICITY
    • H04ELECTRIC COMMUNICATION TECHNIQUE
    • H04NPICTORIAL COMMUNICATION, e.g. TELEVISION
    • H04N19/00Methods or arrangements for coding, decoding, compressing or decompressing digital video signals
    • H04N19/10Methods or arrangements for coding, decoding, compressing or decompressing digital video signals using adaptive coding
    • H04N19/102Methods or arrangements for coding, decoding, compressing or decompressing digital video signals using adaptive coding characterised by the element, parameter or selection affected or controlled by the adaptive coding
    • H04N19/13Adaptive entropy coding, e.g. adaptive variable length coding [AVLC] or context adaptive binary arithmetic coding [CABAC]
    • HELECTRICITY
    • H04ELECTRIC COMMUNICATION TECHNIQUE
    • H04NPICTORIAL COMMUNICATION, e.g. TELEVISION
    • H04N19/00Methods or arrangements for coding, decoding, compressing or decompressing digital video signals
    • H04N19/10Methods or arrangements for coding, decoding, compressing or decompressing digital video signals using adaptive coding
    • H04N19/169Methods or arrangements for coding, decoding, compressing or decompressing digital video signals using adaptive coding characterised by the coding unit, i.e. the structural portion or semantic portion of the video signal being the object or the subject of the adaptive coding
    • H04N19/17Methods or arrangements for coding, decoding, compressing or decompressing digital video signals using adaptive coding characterised by the coding unit, i.e. the structural portion or semantic portion of the video signal being the object or the subject of the adaptive coding the unit being an image region, e.g. an object
    • H04N19/172Methods or arrangements for coding, decoding, compressing or decompressing digital video signals using adaptive coding characterised by the coding unit, i.e. the structural portion or semantic portion of the video signal being the object or the subject of the adaptive coding the unit being an image region, e.g. an object the region being a picture, frame or field
    • HELECTRICITY
    • H04ELECTRIC COMMUNICATION TECHNIQUE
    • H04NPICTORIAL COMMUNICATION, e.g. TELEVISION
    • H04N19/00Methods or arrangements for coding, decoding, compressing or decompressing digital video signals
    • H04N19/10Methods or arrangements for coding, decoding, compressing or decompressing digital video signals using adaptive coding
    • H04N19/169Methods or arrangements for coding, decoding, compressing or decompressing digital video signals using adaptive coding characterised by the coding unit, i.e. the structural portion or semantic portion of the video signal being the object or the subject of the adaptive coding
    • H04N19/186Methods or arrangements for coding, decoding, compressing or decompressing digital video signals using adaptive coding characterised by the coding unit, i.e. the structural portion or semantic portion of the video signal being the object or the subject of the adaptive coding the unit being a colour or a chrominance component
    • HELECTRICITY
    • H04ELECTRIC COMMUNICATION TECHNIQUE
    • H04NPICTORIAL COMMUNICATION, e.g. TELEVISION
    • H04N19/00Methods or arrangements for coding, decoding, compressing or decompressing digital video signals
    • H04N19/10Methods or arrangements for coding, decoding, compressing or decompressing digital video signals using adaptive coding
    • H04N19/189Methods or arrangements for coding, decoding, compressing or decompressing digital video signals using adaptive coding characterised by the adaptation method, adaptation tool or adaptation type used for the adaptive coding
    • H04N19/192Methods or arrangements for coding, decoding, compressing or decompressing digital video signals using adaptive coding characterised by the adaptation method, adaptation tool or adaptation type used for the adaptive coding the adaptation method, adaptation tool or adaptation type being iterative or recursive
    • HELECTRICITY
    • H04ELECTRIC COMMUNICATION TECHNIQUE
    • H04NPICTORIAL COMMUNICATION, e.g. TELEVISION
    • H04N19/00Methods or arrangements for coding, decoding, compressing or decompressing digital video signals
    • H04N19/42Methods or arrangements for coding, decoding, compressing or decompressing digital video signals characterised by implementation details or hardware specially adapted for video compression or decompression, e.g. dedicated software implementation
    • H04N19/439Methods or arrangements for coding, decoding, compressing or decompressing digital video signals characterised by implementation details or hardware specially adapted for video compression or decompression, e.g. dedicated software implementation using cascaded computational arrangements for performing a single operation, e.g. filtering
    • HELECTRICITY
    • H04ELECTRIC COMMUNICATION TECHNIQUE
    • H04NPICTORIAL COMMUNICATION, e.g. TELEVISION
    • H04N19/00Methods or arrangements for coding, decoding, compressing or decompressing digital video signals
    • H04N19/85Methods or arrangements for coding, decoding, compressing or decompressing digital video signals using pre-processing or post-processing specially adapted for video compression
    • H04N19/88Methods or arrangements for coding, decoding, compressing or decompressing digital video signals using pre-processing or post-processing specially adapted for video compression involving rearrangement of data among different coding units, e.g. shuffling, interleaving, scrambling or permutation of pixel data or permutation of transform coefficient data among different blocks
    • GPHYSICS
    • G06COMPUTING OR CALCULATING; COUNTING
    • G06TIMAGE DATA PROCESSING OR GENERATION, IN GENERAL
    • G06T2207/00Indexing scheme for image analysis or image enhancement
    • G06T2207/20Special algorithmic details
    • G06T2207/20084Artificial neural networks [ANN]
    • HELECTRICITY
    • H04ELECTRIC COMMUNICATION TECHNIQUE
    • H04NPICTORIAL COMMUNICATION, e.g. TELEVISION
    • H04N19/00Methods or arrangements for coding, decoding, compressing or decompressing digital video signals
    • H04N19/10Methods or arrangements for coding, decoding, compressing or decompressing digital video signals using adaptive coding
    • H04N19/102Methods or arrangements for coding, decoding, compressing or decompressing digital video signals using adaptive coding characterised by the element, parameter or selection affected or controlled by the adaptive coding
    • H04N19/117Filters, e.g. for pre-processing or post-processing
    • HELECTRICITY
    • H04ELECTRIC COMMUNICATION TECHNIQUE
    • H04NPICTORIAL COMMUNICATION, e.g. TELEVISION
    • H04N19/00Methods or arrangements for coding, decoding, compressing or decompressing digital video signals
    • H04N19/10Methods or arrangements for coding, decoding, compressing or decompressing digital video signals using adaptive coding
    • H04N19/102Methods or arrangements for coding, decoding, compressing or decompressing digital video signals using adaptive coding characterised by the element, parameter or selection affected or controlled by the adaptive coding
    • H04N19/132Sampling, masking or truncation of coding units, e.g. adaptive resampling, frame skipping, frame interpolation or high-frequency transform coefficient masking
    • HELECTRICITY
    • H04ELECTRIC COMMUNICATION TECHNIQUE
    • H04NPICTORIAL COMMUNICATION, e.g. TELEVISION
    • H04N19/00Methods or arrangements for coding, decoding, compressing or decompressing digital video signals
    • H04N19/10Methods or arrangements for coding, decoding, compressing or decompressing digital video signals using adaptive coding
    • H04N19/169Methods or arrangements for coding, decoding, compressing or decompressing digital video signals using adaptive coding characterised by the coding unit, i.e. the structural portion or semantic portion of the video signal being the object or the subject of the adaptive coding
    • H04N19/17Methods or arrangements for coding, decoding, compressing or decompressing digital video signals using adaptive coding characterised by the coding unit, i.e. the structural portion or semantic portion of the video signal being the object or the subject of the adaptive coding the unit being an image region, e.g. an object
    • H04N19/176Methods or arrangements for coding, decoding, compressing or decompressing digital video signals using adaptive coding characterised by the coding unit, i.e. the structural portion or semantic portion of the video signal being the object or the subject of the adaptive coding the unit being an image region, e.g. an object the region being a block, e.g. a macroblock

Landscapes

  • Engineering & Computer Science (AREA)
  • Multimedia (AREA)
  • Signal Processing (AREA)
  • Theoretical Computer Science (AREA)
  • Physics & Mathematics (AREA)
  • Artificial Intelligence (AREA)
  • Evolutionary Computation (AREA)
  • General Physics & Mathematics (AREA)
  • Computing Systems (AREA)
  • Biomedical Technology (AREA)
  • Life Sciences & Earth Sciences (AREA)
  • Biophysics (AREA)
  • Computational Linguistics (AREA)
  • Data Mining & Analysis (AREA)
  • General Health & Medical Sciences (AREA)
  • Molecular Biology (AREA)
  • Health & Medical Sciences (AREA)
  • General Engineering & Computer Science (AREA)
  • Mathematical Physics (AREA)
  • Software Systems (AREA)
  • Compression Or Coding Systems Of Tv Signals (AREA)
JP2023532549A 2020-12-10 2021-12-09 ニューラルネットワークベースのビデオコーディングのためのフロントエンドアーキテクチャ Active JP7815247B2 (ja)

Applications Claiming Priority (7)

Application Number Priority Date Filing Date Title
US202063124016P 2020-12-10 2020-12-10
US63/124,016 2020-12-10
US202063131802P 2020-12-30 2020-12-30
US63/131,802 2020-12-30
US17/643,383 2021-12-08
US17/643,383 US12231666B2 (en) 2020-12-10 2021-12-08 Front-end architecture for neural network based video coding
PCT/US2021/072824 WO2022126120A1 (en) 2020-12-10 2021-12-09 A front-end architecture for neural network based video coding

Publications (3)

Publication Number Publication Date
JP2023553369A JP2023553369A (ja) 2023-12-21
JP2023553369A5 JP2023553369A5 (https=) 2024-11-20
JP7815247B2 true JP7815247B2 (ja) 2026-02-17

Family

ID=79283114

Family Applications (1)

Application Number Title Priority Date Filing Date
JP2023532549A Active JP7815247B2 (ja) 2020-12-10 2021-12-09 ニューラルネットワークベースのビデオコーディングのためのフロントエンドアーキテクチャ

Country Status (5)

Country Link
EP (1) EP4260561A1 (https=)
JP (1) JP7815247B2 (https=)
KR (1) KR20230117346A (https=)
TW (1) TWI883294B (https=)
WO (1) WO2022126120A1 (https=)

Families Citing this family (7)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
EP4494341A1 (en) * 2022-07-01 2025-01-22 Huawei Technologies Co., Ltd. Parallel processing of image regions with neural networks ? decoding, post filtering, and rdoq
WO2024208149A1 (en) * 2023-04-01 2024-10-10 Douyin Vision Co., Ltd. Method, apparatus, and medium for visual data processing
US20240357118A1 (en) * 2023-04-11 2024-10-24 Alibaba Innovation Private Limited Methods and non-transitory computer readable storage medium for spatial resampling towards machine vision
CN121190582A (zh) * 2023-04-18 2025-12-23 华为技术有限公司 图像解压缩方法和装置
CN119450037A (zh) * 2023-07-28 2025-02-14 腾讯科技(深圳)有限公司 滤波及编解码方法、装置及电子设备
CN121646789A (zh) * 2024-06-29 2026-03-10 京东方科技集团股份有限公司 图像处理装置、图像处理方法和显示装置
CN119743644B (zh) * 2024-12-18 2026-01-13 北京潞晨科技有限公司 视频生成方法、装置、电子设备、存储介质及产品

Citations (3)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
JP2019525544A (ja) 2016-06-24 2019-09-05 韓国科学技術院Korea Advanced Institute Of Science And Technology Cnn基盤インループフィルタを含む符号化方法と装置及び復号化方法と装置
WO2020101257A1 (en) 2018-11-12 2020-05-22 Samsung Electronics Co., Ltd. Display apparatus and method of controlling the same
JP2020191630A (ja) 2019-05-22 2020-11-26 富士通株式会社 画像コーディング装置、確率モデル生成装置及び画像デコーディング装置

Family Cites Families (4)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN109426858B (zh) * 2017-08-29 2021-04-06 京东方科技集团股份有限公司 神经网络、训练方法、图像处理方法及图像处理装置
CN108184129B (zh) * 2017-12-11 2020-01-10 北京大学 一种视频编解码方法、装置及用于图像滤波的神经网络
US20200053388A1 (en) * 2018-08-10 2020-02-13 Disney Enterprises, Inc. Machine learning based video compression
CN111861877A (zh) * 2019-04-25 2020-10-30 华为技术有限公司 视频超分变率的方法和装置

Patent Citations (3)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
JP2019525544A (ja) 2016-06-24 2019-09-05 韓国科学技術院Korea Advanced Institute Of Science And Technology Cnn基盤インループフィルタを含む符号化方法と装置及び復号化方法と装置
WO2020101257A1 (en) 2018-11-12 2020-05-22 Samsung Electronics Co., Ltd. Display apparatus and method of controlling the same
JP2020191630A (ja) 2019-05-22 2020-11-26 富士通株式会社 画像コーディング装置、確率モデル生成装置及び画像デコーディング装置

Non-Patent Citations (1)

* Cited by examiner, † Cited by third party
Title
Ankitesh K. Singh. Singh, et al.,DNNVC: A study of handling YUV420 input format for DNN-based video coding [online],Joint Video Exploration Team (JVET) of ITU-T SG16 WP 3 and ISO/IEC JTC 1/SC 29 20th Meeting, by teleconference, 7-16 Oct. 2020,[JVET-T0123],2020年10月14日,pp.1-8,[retrieved on 2025-09-17],Retrieved from <https://jvet-experts.org/doc_end_user/documents/20_Teleconference/wg11/JVET-T0123-v3.zip><JVET-T0123.docx>

Also Published As

Publication number Publication date
TWI883294B (zh) 2025-05-11
WO2022126120A1 (en) 2022-06-16
KR20230117346A (ko) 2023-08-08
EP4260561A1 (en) 2023-10-18
JP2023553369A (ja) 2023-12-21
TW202243476A (zh) 2022-11-01

Similar Documents

Publication Publication Date Title
CN116114247B (zh) 用于处理视频数据的装置和方法
JP7628550B2 (ja) 再帰ベースの機械学習システムを使用したビデオ圧縮
JP7815247B2 (ja) ニューラルネットワークベースのビデオコーディングのためのフロントエンドアーキテクチャ
US12231666B2 (en) Front-end architecture for neural network based video coding
US12003734B2 (en) Machine learning based flow determination for video coding
CN117980916A (zh) 用于媒体的变换译码的基于变换器的架构
KR102831175B1 (ko) 머신 러닝 향상을 갖는 비디오 코딩을 위한 비트-레이트 추정
US11399198B1 (en) Learned B-frame compression
US12177473B2 (en) Video coding using optical flow and residual predictors
US12394100B2 (en) Video coding using camera motion compensation and object motion compensation
US20240214578A1 (en) Regularizing neural networks with data quantization using exponential family priors
JP7840973B2 (ja) ビデオコーディングのための機械学習ベースのフロー決定
CN116547965A (zh) 用于基于神经网络的视频译码的前端架构
CN116965032A (zh) 用于视频译码的基于机器学习的流确定

Legal Events

Date Code Title Description
A521 Request for written amendment filed

Free format text: JAPANESE INTERMEDIATE CODE: A523

Effective date: 20241111

A621 Written request for application examination

Free format text: JAPANESE INTERMEDIATE CODE: A621

Effective date: 20241111

A977 Report on retrieval

Free format text: JAPANESE INTERMEDIATE CODE: A971007

Effective date: 20250918

A131 Notification of reasons for refusal

Free format text: JAPANESE INTERMEDIATE CODE: A131

Effective date: 20250930

A521 Request for written amendment filed

Free format text: JAPANESE INTERMEDIATE CODE: A523

Effective date: 20251217

TRDD Decision of grant or rejection written
A01 Written decision to grant a patent or to grant a registration (utility model)

Free format text: JAPANESE INTERMEDIATE CODE: A01

Effective date: 20260113

A61 First payment of annual fees (during grant procedure)

Free format text: JAPANESE INTERMEDIATE CODE: A61

Effective date: 20260204

R150 Certificate of patent or registration of utility model

Ref document number: 7815247

Country of ref document: JP

Free format text: JAPANESE INTERMEDIATE CODE: R150