JP2025505291A - スパースニューラルネットワークのための適応テンソル畳み込みカーネル - Google Patents

スパースニューラルネットワークのための適応テンソル畳み込みカーネル Download PDF

Info

Publication number
JP2025505291A
JP2025505291A JP2024548397A JP2024548397A JP2025505291A JP 2025505291 A JP2025505291 A JP 2025505291A JP 2024548397 A JP2024548397 A JP 2024548397A JP 2024548397 A JP2024548397 A JP 2024548397A JP 2025505291 A JP2025505291 A JP 2025505291A
Authority
JP
Japan
Prior art keywords
filters
array
ifm
tensor
pes
Prior art date
Legal status (The legal status is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the status listed.)
Pending
Application number
JP2024548397A
Other languages
English (en)
Japanese (ja)
Other versions
JP2025505291A5 (https=
JPWO2023155748A5 (https=
Inventor
チャン シアオチエン
イェン エンシュイ
シアオ チーピン
Original Assignee
モフェット インターナショナル カンパニー,リミティド
Priority date (The priority date is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the date listed.)
Filing date
Publication date
Application filed by モフェット インターナショナル カンパニー,リミティド filed Critical モフェット インターナショナル カンパニー,リミティド
Publication of JP2025505291A publication Critical patent/JP2025505291A/ja
Publication of JP2025505291A5 publication Critical patent/JP2025505291A5/ja
Publication of JPWO2023155748A5 publication Critical patent/JPWO2023155748A5/ja
Pending legal-status Critical Current

Links

Classifications

    • GPHYSICS
    • G06COMPUTING OR CALCULATING; COUNTING
    • G06NCOMPUTING ARRANGEMENTS BASED ON SPECIFIC COMPUTATIONAL MODELS
    • G06N3/00Computing arrangements based on biological models
    • G06N3/02Neural networks
    • G06N3/04Architecture, e.g. interconnection topology
    • G06N3/0464Convolutional networks [CNN, ConvNet]
    • GPHYSICS
    • G06COMPUTING OR CALCULATING; COUNTING
    • G06NCOMPUTING ARRANGEMENTS BASED ON SPECIFIC COMPUTATIONAL MODELS
    • G06N3/00Computing arrangements based on biological models
    • G06N3/02Neural networks
    • G06N3/04Architecture, e.g. interconnection topology
    • G06N3/0495Quantised networks; Sparse networks; Compressed networks
    • GPHYSICS
    • G06COMPUTING OR CALCULATING; COUNTING
    • G06NCOMPUTING ARRANGEMENTS BASED ON SPECIFIC COMPUTATIONAL MODELS
    • G06N3/00Computing arrangements based on biological models
    • G06N3/02Neural networks
    • G06N3/08Learning methods
    • G06N3/082Learning methods modifying the architecture, e.g. adding, deleting or silencing nodes or connections
    • GPHYSICS
    • G06COMPUTING OR CALCULATING; COUNTING
    • G06NCOMPUTING ARRANGEMENTS BASED ON SPECIFIC COMPUTATIONAL MODELS
    • G06N3/00Computing arrangements based on biological models
    • G06N3/02Neural networks
    • G06N3/08Learning methods

Landscapes

  • Engineering & Computer Science (AREA)
  • Theoretical Computer Science (AREA)
  • Physics & Mathematics (AREA)
  • General Health & Medical Sciences (AREA)
  • Computing Systems (AREA)
  • Biomedical Technology (AREA)
  • Biophysics (AREA)
  • Computational Linguistics (AREA)
  • Data Mining & Analysis (AREA)
  • Evolutionary Computation (AREA)
  • Life Sciences & Earth Sciences (AREA)
  • Molecular Biology (AREA)
  • Artificial Intelligence (AREA)
  • General Engineering & Computer Science (AREA)
  • General Physics & Mathematics (AREA)
  • Mathematical Physics (AREA)
  • Software Systems (AREA)
  • Health & Medical Sciences (AREA)
  • Complex Calculations (AREA)
  • Image Processing (AREA)
JP2024548397A 2022-02-16 2023-02-13 スパースニューラルネットワークのための適応テンソル畳み込みカーネル Pending JP2025505291A (ja)

Applications Claiming Priority (3)

Application Number Priority Date Filing Date Title
US17/673,490 US20230259758A1 (en) 2022-02-16 2022-02-16 Adaptive tensor compute kernel for sparse neural network
US17/673,490 2022-02-16
PCT/CN2023/075661 WO2023155748A1 (en) 2022-02-16 2023-02-13 Adaptive tensor compute kernel for sparse neural network

Publications (3)

Publication Number Publication Date
JP2025505291A true JP2025505291A (ja) 2025-02-21
JP2025505291A5 JP2025505291A5 (https=) 2026-02-20
JPWO2023155748A5 JPWO2023155748A5 (https=) 2026-02-20

Family

ID=87558678

Family Applications (1)

Application Number Title Priority Date Filing Date
JP2024548397A Pending JP2025505291A (ja) 2022-02-16 2023-02-13 スパースニューラルネットワークのための適応テンソル畳み込みカーネル

Country Status (7)

Country Link
US (1) US20230259758A1 (https=)
EP (1) EP4479887A4 (https=)
JP (1) JP2025505291A (https=)
KR (1) KR20240149907A (https=)
CN (1) CN118715527A (https=)
TW (1) TWI857493B (https=)
WO (1) WO2023155748A1 (https=)

Families Citing this family (10)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN116261736B (zh) * 2020-06-12 2024-08-16 墨芯国际有限公司 用于双稀疏卷积处理和并行化的方法和系统
CN112925644B (zh) * 2021-02-26 2024-08-13 北京小米松果电子有限公司 深度学习算子优化方法、装置、设备及存储介质
CN116662330A (zh) * 2022-02-21 2023-08-29 中兴通讯股份有限公司 数据处理方法、转发芯片、存储介质及程序产品
US12567122B1 (en) 2022-04-19 2026-03-03 Nvidia Corporation Application programming interface to modify tensor dimensions
US20230140173A1 (en) * 2022-08-19 2023-05-04 Arnab Raha Deep neural network (dnn) accelerators with heterogeneous tiling
TWI873681B (zh) * 2023-06-14 2025-02-21 緯創資通股份有限公司 物件檢測方法、機器學習方法及電子裝置
CN117707791B (zh) * 2024-02-02 2024-05-14 北京壁仞科技开发有限公司 用于进行注意力运算的方法、设备和存储介质
CN118152713B (zh) * 2024-05-10 2024-08-06 北京壁仞科技开发有限公司 数据处理方法、装置、电子设备和计算机可读存储介质
TWI884041B (zh) * 2024-07-19 2025-05-11 國立清華大學 基於混合精度演算法和記憶體內運算加速器之軟硬體協同運作方法及其系統及非暫態電腦可讀儲存媒體
CN121233886B (zh) * 2025-12-01 2026-03-20 上海壁仞科技股份有限公司 卷积计算方法、电子设备、存储介质及程序产品

Family Cites Families (15)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
US10169298B1 (en) * 2017-05-11 2019-01-01 NovuMind Limited Native tensor processor, using outer product unit
US11475351B2 (en) * 2017-11-15 2022-10-18 Uatc, Llc Systems and methods for object detection, tracking, and motion prediction
US11443176B2 (en) * 2018-05-17 2022-09-13 International Business Machines Corporation Acceleration of convolutional neural networks on analog arrays
CN112888459B (zh) * 2018-06-01 2023-05-23 格里尔公司 卷积神经网络系统及数据分类方法
US11429850B2 (en) * 2018-07-19 2022-08-30 Xilinx, Inc. Performing consecutive mac operations on a set of data using different kernels in a MAC circuit
US11481934B2 (en) * 2018-10-10 2022-10-25 New York University System, method, and computer-accessible medium for generating magnetic resonance imaging-based anatomically guided positron emission tomography reconstruction images with a convolutional neural network
EP3654247B1 (en) * 2018-11-15 2025-01-01 IMEC vzw Convolution engine for neural networks
US10878173B2 (en) * 2018-11-29 2020-12-29 Adobe Inc. Object recognition and tagging based on fusion deep learning models
US11604958B2 (en) * 2019-03-13 2023-03-14 Samsung Electronics Co., Ltd. Method and apparatus for processing computation of zero value in processing of layers in neural network
WO2021071930A1 (en) * 2019-10-07 2021-04-15 Google Llc Redistributing tensor elements between machine learning computing units
US12554962B2 (en) * 2019-12-24 2026-02-17 Intel Corporation Configurable processor element arrays for implementing convolutional neural networks
CN115456161A (zh) * 2020-03-27 2022-12-09 华为技术有限公司 一种数据处理方法和数据处理系统
KR102914873B1 (ko) * 2020-12-14 2026-01-16 삼성전자 주식회사 채널 수에 기초하여 컨볼루션 연산을 수행하는 npu 장치 및 이의 동작 방법
KR102602584B1 (ko) * 2021-04-14 2023-11-16 한국전자통신연구원 인공 지능 반도체 프로세서 및 인공 지능 반도체 프로세서의 동작 방법
US20230195419A1 (en) * 2021-12-17 2023-06-22 Arm Limited System and Method for Accelerating Neural Networks

Also Published As

Publication number Publication date
CN118715527A (zh) 2024-09-27
EP4479887A4 (en) 2026-01-07
EP4479887A1 (en) 2024-12-25
TW202343310A (zh) 2023-11-01
WO2023155748A1 (en) 2023-08-24
KR20240149907A (ko) 2024-10-15
TWI857493B (zh) 2024-10-01
US20230259758A1 (en) 2023-08-17

Similar Documents

Publication Publication Date Title
JP2025505291A (ja) スパースニューラルネットワークのための適応テンソル畳み込みカーネル
Lu et al. SpWA: An efficient sparse winograd convolutional neural networks accelerator on FPGAs
CN115485695B (zh) 用于分层权重稀疏卷积处理的方法和系统
CN111831254B (zh) 图像处理加速方法、图像处理模型存储方法及对应装置
US11645529B2 (en) Sparsifying neural network models
JP6715900B2 (ja) ニューラルネットワークのパラメータを適応させるための方法および装置
WO2022002157A1 (en) Method and system for balanced-weight sparse convolution processing
JP2020509501A (ja) 行列乗算アクセラレータ(mma)を用いる基本計算原始関数の実装
JP2024502225A (ja) ワークロードが平準化された活性化スパース性を用いた畳込みのための方法およびシステム
US20240273163A1 (en) Accelerator for sparse matrix multiplication in neural networks
CN118410214B (zh) 一种基于稀疏矩阵的气象数据处理方法、设备及介质
KR102372869B1 (ko) 인공 신경망을 위한 행렬 연산기 및 행렬 연산 방법
US12530170B2 (en) Vector operation acceleration with convolution computation unit
US12333415B2 (en) Neural network accelerators
US11748251B2 (en) Storing tensors in memory based on depth
CN119416850B (zh) 一种适配硬件张量指令及内存的神经网络推理优化方法
CN114662647A (zh) 处理用于神经网络的层的数据
CN116171437A (zh) 通过智能内存池进行嵌入的方法和系统
US20240126617A1 (en) Deep fusion of kernel execution
CN119850874A (zh) 三维网格的编码方法、三维网格的解码方法和相关装置
Díaz A design methodology for automatic generation of processor arrays based on the polytope model.

Legal Events

Date Code Title Description
A521 Request for written amendment filed

Free format text: JAPANESE INTERMEDIATE CODE: A523

Effective date: 20260212

A621 Written request for application examination

Free format text: JAPANESE INTERMEDIATE CODE: A621

Effective date: 20260212

A871 Explanation of circumstances concerning accelerated examination

Free format text: JAPANESE INTERMEDIATE CODE: A871

Effective date: 20260212