JP7640552B2 - ニューラルネットワークのパラメータを符号化するための概念 - Google Patents

ニューラルネットワークのパラメータを符号化するための概念 Download PDF

Info

Publication number
JP7640552B2
JP7640552B2 JP2022538077A JP2022538077A JP7640552B2 JP 7640552 B2 JP7640552 B2 JP 7640552B2 JP 2022538077 A JP2022538077 A JP 2022538077A JP 2022538077 A JP2022538077 A JP 2022538077A JP 7640552 B2 JP7640552 B2 JP 7640552B2
Authority
JP
Japan
Prior art keywords
neural network
network parameters
quantization
current
reconstruction
Prior art date
Legal status (The legal status is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the status listed.)
Active
Application number
JP2022538077A
Other languages
English (en)
Japanese (ja)
Other versions
JP2023507502A (ja
Inventor
ポール ハーセ
ハイナー キルヒホッファー
ハイコ シュヴァルツ
デトレフ マルペ
トーマス ウィーガント
Original Assignee
フラウンホッファー-ゲゼルシャフト ツァ フェルダールング デァ アンゲヴァンテン フォアシュンク エー.ファオ
Priority date (The priority date is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the date listed.)
Filing date
Publication date
Application filed by フラウンホッファー-ゲゼルシャフト ツァ フェルダールング デァ アンゲヴァンテン フォアシュンク エー.ファオ filed Critical フラウンホッファー-ゲゼルシャフト ツァ フェルダールング デァ アンゲヴァンテン フォアシュンク エー.ファオ
Publication of JP2023507502A publication Critical patent/JP2023507502A/ja
Priority to JP2024179366A priority Critical patent/JP7783376B2/ja
Application granted granted Critical
Publication of JP7640552B2 publication Critical patent/JP7640552B2/ja
Active legal-status Critical Current
Anticipated expiration legal-status Critical

Links

Images

Classifications

    • GPHYSICS
    • G06COMPUTING OR CALCULATING; COUNTING
    • G06NCOMPUTING ARRANGEMENTS BASED ON SPECIFIC COMPUTATIONAL MODELS
    • G06N3/00Computing arrangements based on biological models
    • G06N3/02Neural networks
    • G06N3/06Physical realisation, i.e. hardware implementation of neural networks, neurons or parts of neurons
    • G06N3/063Physical realisation, i.e. hardware implementation of neural networks, neurons or parts of neurons using electronic means
    • HELECTRICITY
    • H04ELECTRIC COMMUNICATION TECHNIQUE
    • H04LTRANSMISSION OF DIGITAL INFORMATION, e.g. TELEGRAPHIC COMMUNICATION
    • H04L47/00Traffic control in data switching networks
    • H04L47/10Flow control; Congestion control
    • H04L47/24Traffic characterised by specific attributes, e.g. priority or QoS
    • H04L47/2483Traffic characterised by specific attributes, e.g. priority or QoS involving identification of individual flows
    • GPHYSICS
    • G06COMPUTING OR CALCULATING; COUNTING
    • G06NCOMPUTING ARRANGEMENTS BASED ON SPECIFIC COMPUTATIONAL MODELS
    • G06N3/00Computing arrangements based on biological models
    • G06N3/02Neural networks
    • G06N3/04Architecture, e.g. interconnection topology
    • GPHYSICS
    • G06COMPUTING OR CALCULATING; COUNTING
    • G06NCOMPUTING ARRANGEMENTS BASED ON SPECIFIC COMPUTATIONAL MODELS
    • G06N3/00Computing arrangements based on biological models
    • G06N3/02Neural networks
    • G06N3/04Architecture, e.g. interconnection topology
    • G06N3/0495Quantised networks; Sparse networks; Compressed networks
    • GPHYSICS
    • G06COMPUTING OR CALCULATING; COUNTING
    • G06NCOMPUTING ARRANGEMENTS BASED ON SPECIFIC COMPUTATIONAL MODELS
    • G06N3/00Computing arrangements based on biological models
    • G06N3/02Neural networks
    • G06N3/08Learning methods
    • G06N3/084Backpropagation, e.g. using gradient descent
    • HELECTRICITY
    • H04ELECTRIC COMMUNICATION TECHNIQUE
    • H04NPICTORIAL COMMUNICATION, e.g. TELEVISION
    • H04N19/00Methods or arrangements for coding, decoding, compressing or decompressing digital video signals
    • H04N19/10Methods or arrangements for coding, decoding, compressing or decompressing digital video signals using adaptive coding
    • H04N19/102Methods or arrangements for coding, decoding, compressing or decompressing digital video signals using adaptive coding characterised by the element, parameter or selection affected or controlled by the adaptive coding
    • H04N19/124Quantisation
    • HELECTRICITY
    • H04ELECTRIC COMMUNICATION TECHNIQUE
    • H04NPICTORIAL COMMUNICATION, e.g. TELEVISION
    • H04N19/00Methods or arrangements for coding, decoding, compressing or decompressing digital video signals
    • H04N19/10Methods or arrangements for coding, decoding, compressing or decompressing digital video signals using adaptive coding
    • H04N19/102Methods or arrangements for coding, decoding, compressing or decompressing digital video signals using adaptive coding characterised by the element, parameter or selection affected or controlled by the adaptive coding
    • H04N19/13Adaptive entropy coding, e.g. adaptive variable length coding [AVLC] or context adaptive binary arithmetic coding [CABAC]
    • HELECTRICITY
    • H04ELECTRIC COMMUNICATION TECHNIQUE
    • H04NPICTORIAL COMMUNICATION, e.g. TELEVISION
    • H04N19/00Methods or arrangements for coding, decoding, compressing or decompressing digital video signals
    • H04N19/70Methods or arrangements for coding, decoding, compressing or decompressing digital video signals characterised by syntax aspects related to video coding, e.g. related to compression standards

Landscapes

  • Engineering & Computer Science (AREA)
  • Physics & Mathematics (AREA)
  • Theoretical Computer Science (AREA)
  • Biomedical Technology (AREA)
  • Health & Medical Sciences (AREA)
  • Life Sciences & Earth Sciences (AREA)
  • Biophysics (AREA)
  • General Health & Medical Sciences (AREA)
  • Software Systems (AREA)
  • Computational Linguistics (AREA)
  • Data Mining & Analysis (AREA)
  • Evolutionary Computation (AREA)
  • Artificial Intelligence (AREA)
  • Molecular Biology (AREA)
  • Computing Systems (AREA)
  • General Engineering & Computer Science (AREA)
  • General Physics & Mathematics (AREA)
  • Mathematical Physics (AREA)
  • Signal Processing (AREA)
  • Multimedia (AREA)
  • Neurology (AREA)
  • Computer Networks & Wireless Communication (AREA)
  • Compression Or Coding Systems Of Tv Signals (AREA)
  • Compression, Expansion, Code Conversion, And Decoders (AREA)
  • Measurement Of The Respiration, Hearing Ability, Form, And Blood Characteristics Of Living Organisms (AREA)
  • Executing Machine-Instructions (AREA)
  • Apparatus For Radiation Diagnosis (AREA)
JP2022538077A 2019-12-20 2020-12-21 ニューラルネットワークのパラメータを符号化するための概念 Active JP7640552B2 (ja)

Priority Applications (1)

Application Number Priority Date Filing Date Title
JP2024179366A JP7783376B2 (ja) 2019-12-20 2024-10-11 ニューラルネットワークのパラメータを符号化するための概念

Applications Claiming Priority (3)

Application Number Priority Date Filing Date Title
EP19218862 2019-12-20
EP19218862.1 2019-12-20
PCT/EP2020/087489 WO2021123438A1 (en) 2019-12-20 2020-12-21 Concepts for coding neural networks parameters

Related Child Applications (1)

Application Number Title Priority Date Filing Date
JP2024179366A Division JP7783376B2 (ja) 2019-12-20 2024-10-11 ニューラルネットワークのパラメータを符号化するための概念

Publications (2)

Publication Number Publication Date
JP2023507502A JP2023507502A (ja) 2023-02-22
JP7640552B2 true JP7640552B2 (ja) 2025-03-05

Family

ID=69104239

Family Applications (2)

Application Number Title Priority Date Filing Date
JP2022538077A Active JP7640552B2 (ja) 2019-12-20 2020-12-21 ニューラルネットワークのパラメータを符号化するための概念
JP2024179366A Active JP7783376B2 (ja) 2019-12-20 2024-10-11 ニューラルネットワークのパラメータを符号化するための概念

Family Applications After (1)

Application Number Title Priority Date Filing Date
JP2024179366A Active JP7783376B2 (ja) 2019-12-20 2024-10-11 ニューラルネットワークのパラメータを符号化するための概念

Country Status (6)

Country Link
US (2) US20220393986A1 (https=)
EP (1) EP4078454A1 (https=)
JP (2) JP7640552B2 (https=)
KR (1) KR20220127261A (https=)
CN (1) CN115087988A (https=)
WO (1) WO2021123438A1 (https=)

Families Citing this family (11)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
US11037330B2 (en) * 2017-04-08 2021-06-15 Intel Corporation Low rank matrix compression
CN115699020A (zh) * 2020-06-05 2023-02-03 华为技术有限公司 用于神经网络的量化
US12363310B2 (en) * 2020-06-16 2025-07-15 Nokia Technologies Oy Guided probability model for compressed representation of neural networks
US20230229894A1 (en) * 2020-06-25 2023-07-20 Intellectual Discovery Co., Ltd. Method and apparatus for compression and training of neural network
WO2022139438A1 (ko) * 2020-12-22 2022-06-30 인텔렉추얼디스커버리 주식회사 딥러닝 기반 이미지 코딩 방법 및 장치
JP7325015B2 (ja) * 2021-03-24 2023-08-14 パナソニックIpマネジメント株式会社 量子化方法、量子化装置、及び、プログラム
US11909975B2 (en) * 2021-06-18 2024-02-20 Tencent America LLC Dependent scalar quantization with substitution in neural image compression
CN118140458A (zh) * 2021-10-13 2024-06-04 谷歌有限责任公司 量化的机器学习配置信息
KR20240132484A (ko) * 2022-01-09 2024-09-03 프라운호퍼 게젤샤프트 쭈르 푀르데룽 데어 안겐반텐 포르슝 에. 베. 신경 네트워크 파라미터를 인코딩 및 디코딩하는 컨셉
WO2024013109A1 (en) * 2022-07-11 2024-01-18 Fraunhofer-Gesellschaft zur Förderung der angewandten Forschung e.V. Encoder, decoder and methods for coding a data structure
KR20250047001A (ko) * 2023-09-27 2025-04-03 삼성전자주식회사 심층 신경망 모델을 포함하는 전자 장치 및 그 동작 방법

Citations (1)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
WO2019185769A1 (en) 2018-03-29 2019-10-03 Fraunhofer-Gesellschaft zur Förderung der angewandten Forschung e.V. Dependent quantization

Family Cites Families (6)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
AU2006201490B2 (en) * 2005-04-19 2008-05-22 Samsung Electronics Co., Ltd. Method and apparatus for adaptively selecting context model for entropy coding
US7894523B2 (en) * 2005-09-05 2011-02-22 Lg Electronics Inc. Method for modeling coding information of a video signal for compressing/decompressing coding information
US9584802B2 (en) * 2012-04-13 2017-02-28 Texas Instruments Incorporated Reducing context coded and bypass coded bins to improve context adaptive binary arithmetic coding (CABAC) throughput
CA2807908A1 (en) * 2012-06-30 2013-12-30 Research In Motion Limited Position-based context selection for greater-than-one flag decoding and encoding
US11451840B2 (en) * 2018-06-18 2022-09-20 Qualcomm Incorporated Trellis coded quantization coefficient coding
TW202601464A (zh) 2019-10-01 2026-01-01 弗勞恩霍夫爾協會 用於編/解碼神經網路參數之設備與方法、及相關資料串流與電腦程式

Patent Citations (1)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
WO2019185769A1 (en) 2018-03-29 2019-10-03 Fraunhofer-Gesellschaft zur Förderung der angewandten Forschung e.V. Dependent quantization

Non-Patent Citations (2)

* Cited by examiner, † Cited by third party
Title
Marta Karczewicz et al.,CE8-related: Sign context modelling and level mapping for TS residual coding,Joint Video Experts Team (JVET),2019年03月21日,[JVET-N0455] (version 3)
Simon Wiedemann et al.,DeepCABAC: A universal compression algorithm for deep neural networks,arXiv,2019年07月27日,pp.1-18

Also Published As

Publication number Publication date
JP2025016517A (ja) 2025-02-04
JP2023507502A (ja) 2023-02-22
CN115087988A (zh) 2022-09-20
US20250343764A1 (en) 2025-11-06
EP4078454A1 (en) 2022-10-26
KR20220127261A (ko) 2022-09-19
WO2021123438A1 (en) 2021-06-24
JP7783376B2 (ja) 2025-12-09
US20220393986A1 (en) 2022-12-08

Similar Documents

Publication Publication Date Title
JP7640552B2 (ja) ニューラルネットワークのパラメータを符号化するための概念
Kirchhoffer et al. Overview of the neural network compression and representation (NNR) standard
US20250278604A1 (en) Methods and apparatuses for compressing parameters of neural networks
JP2025513886A (ja) ニューラルネットワークの復号化パラメータを提供するためのデコーダ、エンコーダ、方法、及び並べ替えを使用するコンピュータプログラム
JP2025186542A (ja) 更新モデルを使用して、ニューラルネットワークパラメーターを復号化する装置、方法及びコンピュータープログラム、並びにニューラルネットワークパラメーターを符号化する装置、方法及びコンピュータープログラム
US20240364362A1 (en) Concepts for encoding and decoding neural network parameters
JP2026067854A (ja) ニューラルネットワークのパラメータを符号化するための概念
Farkash et al. Transform trellis coding of images at low bit rates

Legal Events

Date Code Title Description
A521 Request for written amendment filed

Free format text: JAPANESE INTERMEDIATE CODE: A523

Effective date: 20221019

A621 Written request for application examination

Free format text: JAPANESE INTERMEDIATE CODE: A621

Effective date: 20221019

A977 Report on retrieval

Free format text: JAPANESE INTERMEDIATE CODE: A971007

Effective date: 20231130

A131 Notification of reasons for refusal

Free format text: JAPANESE INTERMEDIATE CODE: A131

Effective date: 20231212

A521 Request for written amendment filed

Free format text: JAPANESE INTERMEDIATE CODE: A523

Effective date: 20240312

A02 Decision of refusal

Free format text: JAPANESE INTERMEDIATE CODE: A02

Effective date: 20240611

A521 Request for written amendment filed

Free format text: JAPANESE INTERMEDIATE CODE: A523

Effective date: 20241011

A521 Request for written amendment filed

Free format text: JAPANESE INTERMEDIATE CODE: A821

Effective date: 20241011

A911 Transfer to examiner for re-examination before appeal (zenchi)

Free format text: JAPANESE INTERMEDIATE CODE: A911

Effective date: 20241031

TRDD Decision of grant or rejection written
A01 Written decision to grant a patent or to grant a registration (utility model)

Free format text: JAPANESE INTERMEDIATE CODE: A01

Effective date: 20250121

A61 First payment of annual fees (during grant procedure)

Free format text: JAPANESE INTERMEDIATE CODE: A61

Effective date: 20250220

R150 Certificate of patent or registration of utility model

Ref document number: 7640552

Country of ref document: JP

Free format text: JAPANESE INTERMEDIATE CODE: R150