CN116940946A - Dynamic feature size adaptation in splitable deep neural networks - Google Patents

Dynamic feature size adaptation in splitable deep neural networks

Info

Publication number
CN116940946A
Authority
CN
China
Prior art keywords
neural network
dnn
compression
compression factor
dnn model
Prior art date
Legal status (The legal status is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the status listed.)
Pending
Application number
CN202280013234.2A
Other languages
English (en)
Chinese (zh)
Inventor
S·K·库马拉斯瓦米
Q·K·N·董
A·奥泽罗夫
P·方丹
F·施尼茨勒
A·兰伯特
吉斯伦·佩尔蒂埃
Current Assignee (The listed assignees may be inaccurate. Google has not performed a legal analysis and makes no representation or warranty as to the accuracy of the list.)
InterDigital CE Patent Holdings SAS
Original Assignee
InterDigital CE Patent Holdings SAS
Priority date (The priority date is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the date listed.)
Filing date
Publication date
Application filed by InterDigital CE Patent Holdings SAS
Publication of CN116940946A

Classifications

    • G PHYSICS
    • G06 COMPUTING OR CALCULATING; COUNTING
    • G06N COMPUTING ARRANGEMENTS BASED ON SPECIFIC COMPUTATIONAL MODELS
    • G06N3/00 Computing arrangements based on biological models
    • G06N3/02 Neural networks
    • G06N3/06 Physical realisation, i.e. hardware implementation of neural networks, neurons or parts of neurons
    • G06N3/063 Physical realisation, i.e. hardware implementation of neural networks, neurons or parts of neurons using electronic means
    • G PHYSICS
    • G06 COMPUTING OR CALCULATING; COUNTING
    • G06N COMPUTING ARRANGEMENTS BASED ON SPECIFIC COMPUTATIONAL MODELS
    • G06N3/00 Computing arrangements based on biological models
    • G06N3/02 Neural networks
    • G06N3/04 Architecture, e.g. interconnection topology
    • G06N3/045 Combinations of networks
    • G PHYSICS
    • G06 COMPUTING OR CALCULATING; COUNTING
    • G06N COMPUTING ARRANGEMENTS BASED ON SPECIFIC COMPUTATIONAL MODELS
    • G06N3/00 Computing arrangements based on biological models
    • G06N3/02 Neural networks
    • G06N3/04 Architecture, e.g. interconnection topology
    • G06N3/045 Combinations of networks
    • G06N3/0455 Auto-encoder networks; Encoder-decoder networks
    • G PHYSICS
    • G06 COMPUTING OR CALCULATING; COUNTING
    • G06N COMPUTING ARRANGEMENTS BASED ON SPECIFIC COMPUTATIONAL MODELS
    • G06N3/00 Computing arrangements based on biological models
    • G06N3/02 Neural networks
    • G06N3/04 Architecture, e.g. interconnection topology
    • G06N3/0464 Convolutional networks [CNN, ConvNet]
    • G PHYSICS
    • G06 COMPUTING OR CALCULATING; COUNTING
    • G06N COMPUTING ARRANGEMENTS BASED ON SPECIFIC COMPUTATIONAL MODELS
    • G06N3/00 Computing arrangements based on biological models
    • G06N3/02 Neural networks
    • G06N3/04 Architecture, e.g. interconnection topology
    • G06N3/0495 Quantised networks; Sparse networks; Compressed networks
    • G PHYSICS
    • G06 COMPUTING OR CALCULATING; COUNTING
    • G06N COMPUTING ARRANGEMENTS BASED ON SPECIFIC COMPUTATIONAL MODELS
    • G06N3/00 Computing arrangements based on biological models
    • G06N3/02 Neural networks
    • G06N3/08 Learning methods
    • G06N3/082 Learning methods modifying the architecture, e.g. adding, deleting or silencing nodes or connections
    • G PHYSICS
    • G06 COMPUTING OR CALCULATING; COUNTING
    • G06N COMPUTING ARRANGEMENTS BASED ON SPECIFIC COMPUTATIONAL MODELS
    • G06N3/00 Computing arrangements based on biological models
    • G06N3/02 Neural networks
    • G06N3/08 Learning methods
    • G06N3/084 Backpropagation, e.g. using gradient descent
    • G PHYSICS
    • G06 COMPUTING OR CALCULATING; COUNTING
    • G06N COMPUTING ARRANGEMENTS BASED ON SPECIFIC COMPUTATIONAL MODELS
    • G06N3/00 Computing arrangements based on biological models
    • G06N3/02 Neural networks
    • G06N3/08 Learning methods
    • G06N3/088 Non-supervised learning, e.g. competitive learning
    • G PHYSICS
    • G06 COMPUTING OR CALCULATING; COUNTING
    • G06N COMPUTING ARRANGEMENTS BASED ON SPECIFIC COMPUTATIONAL MODELS
    • G06N3/00 Computing arrangements based on biological models
    • G06N3/02 Neural networks
    • G06N3/08 Learning methods
    • G06N3/09 Supervised learning
    • G PHYSICS
    • G06 COMPUTING OR CALCULATING; COUNTING
    • G06N COMPUTING ARRANGEMENTS BASED ON SPECIFIC COMPUTATIONAL MODELS
    • G06N3/00 Computing arrangements based on biological models
    • G06N3/02 Neural networks
    • G06N3/08 Learning methods
    • G06N3/096 Transfer learning
    • G PHYSICS
    • G06 COMPUTING OR CALCULATING; COUNTING
    • G06N COMPUTING ARRANGEMENTS BASED ON SPECIFIC COMPUTATIONAL MODELS
    • G06N3/00 Computing arrangements based on biological models
    • G06N3/02 Neural networks
    • G06N3/08 Learning methods
    • G06N3/098 Distributed learning, e.g. federated learning
    • H ELECTRICITY
    • H04 ELECTRIC COMMUNICATION TECHNIQUE
    • H04L TRANSMISSION OF DIGITAL INFORMATION, e.g. TELEGRAPHIC COMMUNICATION
    • H04L67/00 Network arrangements or protocols for supporting network services or applications
    • H04L67/01 Protocols
    • H04L67/10 Protocols in which an application is distributed across nodes in the network
    • H04L67/1001 Protocols in which an application is distributed across nodes in the network for accessing one among a plurality of replicated servers
    • H04L67/1004 Server selection for load balancing
    • H04L67/1008 Server selection for load balancing based on parameters of servers, e.g. available memory or workload
    • G PHYSICS
    • G06 COMPUTING OR CALCULATING; COUNTING
    • G06N COMPUTING ARRANGEMENTS BASED ON SPECIFIC COMPUTATIONAL MODELS
    • G06N3/00 Computing arrangements based on biological models
    • G06N3/02 Neural networks
    • G06N3/04 Architecture, e.g. interconnection topology
    • G06N3/048 Activation functions
    • H ELECTRICITY
    • H03 ELECTRONIC CIRCUITRY
    • H03M CODING; DECODING; CODE CONVERSION IN GENERAL
    • H03M7/00 Conversion of a code where information is represented by a given sequence or number of digits to a code where the same, similar or subset of information is represented by a different sequence or number of digits
    • H03M7/30 Compression; Expansion; Suppression of unnecessary data, e.g. redundancy reduction
    • H03M7/60 General implementation details not specific to a particular type of compression
    • H03M7/6041 Compression optimized for errors
    • H ELECTRICITY
    • H03 ELECTRONIC CIRCUITRY
    • H03M CODING; DECODING; CODE CONVERSION IN GENERAL
    • H03M7/00 Conversion of a code where information is represented by a given sequence or number of digits to a code where the same, similar or subset of information is represented by a different sequence or number of digits
    • H03M7/30 Compression; Expansion; Suppression of unnecessary data, e.g. redundancy reduction
    • H03M7/60 General implementation details not specific to a particular type of compression
    • H03M7/6064 Selection of Compressor
    • H03M7/6076 Selection between compressors of the same type

Landscapes

  • Engineering & Computer Science (AREA)
  • Physics & Mathematics (AREA)
  • Theoretical Computer Science (AREA)
  • Health & Medical Sciences (AREA)
  • Life Sciences & Earth Sciences (AREA)
  • Biomedical Technology (AREA)
  • Biophysics (AREA)
  • General Engineering & Computer Science (AREA)
  • General Health & Medical Sciences (AREA)
  • Mathematical Physics (AREA)
  • Data Mining & Analysis (AREA)
  • Molecular Biology (AREA)
  • Computing Systems (AREA)
  • Computational Linguistics (AREA)
  • General Physics & Mathematics (AREA)
  • Evolutionary Computation (AREA)
  • Software Systems (AREA)
  • Artificial Intelligence (AREA)
  • Neurology (AREA)
  • Computer Hardware Design (AREA)
  • Computer Networks & Wireless Communication (AREA)
  • Signal Processing (AREA)
  • Mobile Radio Communication Systems (AREA)
CN202280013234.2A 2021-02-05 2022-02-03 Dynamic feature size adaptation in splitable deep neural networks Pending CN116940946A (zh)

Applications Claiming Priority (3)

Application Number Priority Date Filing Date Title
EP21305156 2021-02-05
EP21305156.8 2021-02-05
PCT/EP2022/052633 WO2022167547A1 (en) 2021-02-05 2022-02-03 Dynamic feature size adaptation in splitable deep neural networks

Publications (1)

Publication Number Publication Date
CN116940946A (zh) 2023-10-24

Family

ID=74661327

Family Applications (1)

Application Number Title Priority Date Filing Date
CN202280013234.2A Pending CN116940946A (zh) 2021-02-05 2022-02-03 Dynamic feature size adaptation in splitable deep neural networks

Country Status (5)

Country Link
US (1) US20240311621A1
EP (1) EP4288907A1
JP (1) JP2024509670A
CN (1) CN116940946A
WO (1) WO2022167547A1

Cited By (2)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN118843159A (zh) * 2024-09-23 2024-10-25 四川科锐得电力通信技术有限公司 Method and system for transmitting power transmission line data in no-signal areas based on a wireless bridge
WO2026016173A1 (en) * 2024-07-19 2026-01-22 Apple Inc. Performance monitoring of chained ai model in wireless communications

Families Citing this family (14)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
US20230422117A1 (en) * 2022-06-09 2023-12-28 Qualcomm Incorporated User equipment machine learning service continuity
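CN117632463A (zh) * 2022-08-24 2024-03-01 华为技术有限公司 Method for splitting a computing task and related apparatus
CN115499658B (zh) * 2022-09-20 2024-05-07 支付宝(杭州)信息技术有限公司 Data transmission method and apparatus for a virtual world
CN118473556A (zh) * 2023-02-09 2024-08-09 索尼集团公司 Electronic device and method for split learning, and computer-readable storage medium
WO2024168748A1 (zh) * 2023-02-16 2024-08-22 富士通株式会社 Model sending and receiving methods and apparatus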
US12526439B2 (en) 2023-04-22 2026-01-13 Qualcomm Incorporated Rate adaptation for video coding for machines
CN120958815A (zh) * 2023-04-22 2025-11-14 高通股份有限公司 Rate adaptation for video coding for machines
CN121336391A (zh) * 2023-06-13 2026-01-13 华为技术有限公司 Communication method and communication apparatus
IL325805A (en) * 2023-07-18 2026-03-01 Interdigital Vc Holdings Inc Tensor information for intermediate data
WO2025019540A1 (en) * 2023-07-19 2025-01-23 Interdigital Vc Holdings, Inc. Multi-layer split points output information
WO2025047742A1 (ja) * 2023-08-30 2025-03-06 京セラ株式会社 Communication control method and user equipment
US12587970B2 (en) * 2023-09-05 2026-03-24 Qualcomm Incorporated Decibel compression point information reporting
EP4651458A1 (en) * 2024-05-13 2025-11-19 InterDigital CE Patent Holdings, SAS Methods, apparatuses and systems related to transport partial results data with intermediate data
CN118741441B (zh) * 2024-07-18 2025-02-28 北京物资学院 Method and apparatus for terminal selection of a large language model in a wireless cellular network

Family Cites Families (5)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
KR102285064B1 (ko) * 2017-10-30 2021-08-04 한국전자통신연구원 Method and apparatus for image and neural network compression using latent variables
WO2019134802A1 (en) * 2018-01-03 2019-07-11 Signify Holding B.V. System and methods to share machine learning functionality between cloud and an iot network
WO2019193660A1 (ja) * 2018-04-03 2019-10-10 株式会社ウフル Machine-learned model switching system, edge device, machine-learned model switching method, and program
JP7056345B2 (ja) * 2018-04-18 2022-04-19 日本電信電話株式会社 Data analysis system, method, and program
US11700518B2 (en) * 2019-05-31 2023-07-11 Huawei Technologies Co., Ltd. Methods and systems for relaying feature-driven communications

Also Published As

Publication number Publication date
US20240311621A1 (en) 2024-09-19
JP2024509670A (ja) 2024-03-05
WO2022167547A1 (en) 2022-08-11
EP4288907A1 (en) 2023-12-13

Similar Documents

Publication Publication Date Title
CN116940946A (zh) Dynamic feature size adaptation in splitable deep neural networks
US11388644B2 (en) Apparatus and method for load balancing in wireless communication system
CN108496384A (zh) Communication mode control method and device
CN117729641A (zh) Device, method, and apparatus for beam reporting
CN119096480A (zh) Apparatus for beam management
US20250031151A1 (en) Uplink transmit power control
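CN111418162B (zh) UE-specific beam mapping using reference weight vectors
CN115865570B (zh) Method and base station for enhancing multi-user MIMO through channel state information reporting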
US12519519B2 (en) Providing a first radio beam and a second radio beam
CN117692281A (zh) Determining covariance using a lossy compression method
CN120513469A (zh) Flight path update triggering
US20250056211A1 (en) Capability information transmission
EP4348967A1 (en) Managing ue capabilities in cellular communication system
WO2025060349A1 (en) Methods, devices, and computer readable medium for artificial intelligence (ai) service
WO2025092059A1 (en) Method, apparatus, and system for artificial intelligence (ai) model splitting
WO2024213187A1 (en) Indication of network side additional condition
US20250184797A1 (en) Method for enhanced measurement and reporting
US20250031068A1 (en) Mechanism for functionality based life cycle management
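WO2025130456A1 (zh) Communication method and apparatus
CN121463098A (zh) Feedback report configuration method and apparatus, network-side device, and terminal
WO2025157040A1 (zh) Communication method, apparatus, and system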
WO2024223044A1 (en) Updating machine learning model
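WO2026061059A1 (zh) Method and apparatus for model registration and access
WO2025203527A1 (ja) Control device, control system, and control method
TW202608102A (zh) Generic UE data collection for wireless networks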

Legal Events

Date Code Title Description
PB01 Publication
SE01 Entry into force of request for substantive examination