JP2025505291A - スパースニューラルネットワークのための適応テンソル畳み込みカーネル - Google Patents
スパースニューラルネットワークのための適応テンソル畳み込みカーネル Download PDFInfo
- Publication number
- JP2025505291A JP2025505291A JP2024548397A JP2024548397A JP2025505291A JP 2025505291 A JP2025505291 A JP 2025505291A JP 2024548397 A JP2024548397 A JP 2024548397A JP 2024548397 A JP2024548397 A JP 2024548397A JP 2025505291 A JP2025505291 A JP 2025505291A
- Authority
- JP
- Japan
- Prior art keywords
- filters
- array
- ifm
- tensor
- pes
- Prior art date
- Legal status (The legal status is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the status listed.)
- Pending
Links
Classifications
-
- G—PHYSICS
- G06—COMPUTING OR CALCULATING; COUNTING
- G06N—COMPUTING ARRANGEMENTS BASED ON SPECIFIC COMPUTATIONAL MODELS
- G06N3/00—Computing arrangements based on biological models
- G06N3/02—Neural networks
- G06N3/04—Architecture, e.g. interconnection topology
- G06N3/0464—Convolutional networks [CNN, ConvNet]
-
- G—PHYSICS
- G06—COMPUTING OR CALCULATING; COUNTING
- G06N—COMPUTING ARRANGEMENTS BASED ON SPECIFIC COMPUTATIONAL MODELS
- G06N3/00—Computing arrangements based on biological models
- G06N3/02—Neural networks
- G06N3/04—Architecture, e.g. interconnection topology
- G06N3/0495—Quantised networks; Sparse networks; Compressed networks
-
- G—PHYSICS
- G06—COMPUTING OR CALCULATING; COUNTING
- G06N—COMPUTING ARRANGEMENTS BASED ON SPECIFIC COMPUTATIONAL MODELS
- G06N3/00—Computing arrangements based on biological models
- G06N3/02—Neural networks
- G06N3/08—Learning methods
- G06N3/082—Learning methods modifying the architecture, e.g. adding, deleting or silencing nodes or connections
-
- G—PHYSICS
- G06—COMPUTING OR CALCULATING; COUNTING
- G06N—COMPUTING ARRANGEMENTS BASED ON SPECIFIC COMPUTATIONAL MODELS
- G06N3/00—Computing arrangements based on biological models
- G06N3/02—Neural networks
- G06N3/08—Learning methods
Landscapes
- Engineering & Computer Science (AREA)
- Theoretical Computer Science (AREA)
- Physics & Mathematics (AREA)
- General Health & Medical Sciences (AREA)
- Computing Systems (AREA)
- Biomedical Technology (AREA)
- Biophysics (AREA)
- Computational Linguistics (AREA)
- Data Mining & Analysis (AREA)
- Evolutionary Computation (AREA)
- Life Sciences & Earth Sciences (AREA)
- Molecular Biology (AREA)
- Artificial Intelligence (AREA)
- General Engineering & Computer Science (AREA)
- General Physics & Mathematics (AREA)
- Mathematical Physics (AREA)
- Software Systems (AREA)
- Health & Medical Sciences (AREA)
- Complex Calculations (AREA)
- Image Processing (AREA)
Applications Claiming Priority (3)
| Application Number | Priority Date | Filing Date | Title |
|---|---|---|---|
| US17/673,490 US20230259758A1 (en) | 2022-02-16 | 2022-02-16 | Adaptive tensor compute kernel for sparse neural network |
| US17/673,490 | 2022-02-16 | ||
| PCT/CN2023/075661 WO2023155748A1 (en) | 2022-02-16 | 2023-02-13 | Adaptive tensor compute kernel for sparse neural network |
Publications (3)
| Publication Number | Publication Date |
|---|---|
| JP2025505291A true JP2025505291A (ja) | 2025-02-21 |
| JP2025505291A5 JP2025505291A5 (https=) | 2026-02-20 |
| JPWO2023155748A5 JPWO2023155748A5 (https=) | 2026-02-20 |
Family
ID=87558678
Family Applications (1)
| Application Number | Title | Priority Date | Filing Date |
|---|---|---|---|
| JP2024548397A Pending JP2025505291A (ja) | 2022-02-16 | 2023-02-13 | スパースニューラルネットワークのための適応テンソル畳み込みカーネル |
Country Status (7)
| Country | Link |
|---|---|
| US (1) | US20230259758A1 (https=) |
| EP (1) | EP4479887A4 (https=) |
| JP (1) | JP2025505291A (https=) |
| KR (1) | KR20240149907A (https=) |
| CN (1) | CN118715527A (https=) |
| TW (1) | TWI857493B (https=) |
| WO (1) | WO2023155748A1 (https=) |
Families Citing this family (10)
| Publication number | Priority date | Publication date | Assignee | Title |
|---|---|---|---|---|
| CN116261736B (zh) * | 2020-06-12 | 2024-08-16 | 墨芯国际有限公司 | 用于双稀疏卷积处理和并行化的方法和系统 |
| CN112925644B (zh) * | 2021-02-26 | 2024-08-13 | 北京小米松果电子有限公司 | 深度学习算子优化方法、装置、设备及存储介质 |
| CN116662330A (zh) * | 2022-02-21 | 2023-08-29 | 中兴通讯股份有限公司 | 数据处理方法、转发芯片、存储介质及程序产品 |
| US12567122B1 (en) | 2022-04-19 | 2026-03-03 | Nvidia Corporation | Application programming interface to modify tensor dimensions |
| US20230140173A1 (en) * | 2022-08-19 | 2023-05-04 | Arnab Raha | Deep neural network (dnn) accelerators with heterogeneous tiling |
| TWI873681B (zh) * | 2023-06-14 | 2025-02-21 | 緯創資通股份有限公司 | 物件檢測方法、機器學習方法及電子裝置 |
| CN117707791B (zh) * | 2024-02-02 | 2024-05-14 | 北京壁仞科技开发有限公司 | 用于进行注意力运算的方法、设备和存储介质 |
| CN118152713B (zh) * | 2024-05-10 | 2024-08-06 | 北京壁仞科技开发有限公司 | 数据处理方法、装置、电子设备和计算机可读存储介质 |
| TWI884041B (zh) * | 2024-07-19 | 2025-05-11 | 國立清華大學 | 基於混合精度演算法和記憶體內運算加速器之軟硬體協同運作方法及其系統及非暫態電腦可讀儲存媒體 |
| CN121233886B (zh) * | 2025-12-01 | 2026-03-20 | 上海壁仞科技股份有限公司 | 卷积计算方法、电子设备、存储介质及程序产品 |
Family Cites Families (15)
| Publication number | Priority date | Publication date | Assignee | Title |
|---|---|---|---|---|
| US10169298B1 (en) * | 2017-05-11 | 2019-01-01 | NovuMind Limited | Native tensor processor, using outer product unit |
| US11475351B2 (en) * | 2017-11-15 | 2022-10-18 | Uatc, Llc | Systems and methods for object detection, tracking, and motion prediction |
| US11443176B2 (en) * | 2018-05-17 | 2022-09-13 | International Business Machines Corporation | Acceleration of convolutional neural networks on analog arrays |
| CN112888459B (zh) * | 2018-06-01 | 2023-05-23 | 格里尔公司 | 卷积神经网络系统及数据分类方法 |
| US11429850B2 (en) * | 2018-07-19 | 2022-08-30 | Xilinx, Inc. | Performing consecutive mac operations on a set of data using different kernels in a MAC circuit |
| US11481934B2 (en) * | 2018-10-10 | 2022-10-25 | New York University | System, method, and computer-accessible medium for generating magnetic resonance imaging-based anatomically guided positron emission tomography reconstruction images with a convolutional neural network |
| EP3654247B1 (en) * | 2018-11-15 | 2025-01-01 | IMEC vzw | Convolution engine for neural networks |
| US10878173B2 (en) * | 2018-11-29 | 2020-12-29 | Adobe Inc. | Object recognition and tagging based on fusion deep learning models |
| US11604958B2 (en) * | 2019-03-13 | 2023-03-14 | Samsung Electronics Co., Ltd. | Method and apparatus for processing computation of zero value in processing of layers in neural network |
| WO2021071930A1 (en) * | 2019-10-07 | 2021-04-15 | Google Llc | Redistributing tensor elements between machine learning computing units |
| US12554962B2 (en) * | 2019-12-24 | 2026-02-17 | Intel Corporation | Configurable processor element arrays for implementing convolutional neural networks |
| CN115456161A (zh) * | 2020-03-27 | 2022-12-09 | 华为技术有限公司 | 一种数据处理方法和数据处理系统 |
| KR102914873B1 (ko) * | 2020-12-14 | 2026-01-16 | 삼성전자 주식회사 | 채널 수에 기초하여 컨볼루션 연산을 수행하는 npu 장치 및 이의 동작 방법 |
| KR102602584B1 (ko) * | 2021-04-14 | 2023-11-16 | 한국전자통신연구원 | 인공 지능 반도체 프로세서 및 인공 지능 반도체 프로세서의 동작 방법 |
| US20230195419A1 (en) * | 2021-12-17 | 2023-06-22 | Arm Limited | System and Method for Accelerating Neural Networks |
-
2022
- 2022-02-16 US US17/673,490 patent/US20230259758A1/en active Pending
-
2023
- 2023-02-13 EP EP23755750.9A patent/EP4479887A4/en active Pending
- 2023-02-13 KR KR1020247028942A patent/KR20240149907A/ko active Pending
- 2023-02-13 CN CN202380021998.0A patent/CN118715527A/zh active Pending
- 2023-02-13 JP JP2024548397A patent/JP2025505291A/ja active Pending
- 2023-02-13 WO PCT/CN2023/075661 patent/WO2023155748A1/en not_active Ceased
- 2023-02-16 TW TW112105472A patent/TWI857493B/zh active
Also Published As
| Publication number | Publication date |
|---|---|
| CN118715527A (zh) | 2024-09-27 |
| EP4479887A4 (en) | 2026-01-07 |
| EP4479887A1 (en) | 2024-12-25 |
| TW202343310A (zh) | 2023-11-01 |
| WO2023155748A1 (en) | 2023-08-24 |
| KR20240149907A (ko) | 2024-10-15 |
| TWI857493B (zh) | 2024-10-01 |
| US20230259758A1 (en) | 2023-08-17 |
Similar Documents
| Publication | Publication Date | Title |
|---|---|---|
| JP2025505291A (ja) | スパースニューラルネットワークのための適応テンソル畳み込みカーネル | |
| Lu et al. | SpWA: An efficient sparse winograd convolutional neural networks accelerator on FPGAs | |
| CN115485695B (zh) | 用于分层权重稀疏卷积处理的方法和系统 | |
| CN111831254B (zh) | 图像处理加速方法、图像处理模型存储方法及对应装置 | |
| US11645529B2 (en) | Sparsifying neural network models | |
| JP6715900B2 (ja) | ニューラルネットワークのパラメータを適応させるための方法および装置 | |
| WO2022002157A1 (en) | Method and system for balanced-weight sparse convolution processing | |
| JP2020509501A (ja) | 行列乗算アクセラレータ(mma)を用いる基本計算原始関数の実装 | |
| JP2024502225A (ja) | ワークロードが平準化された活性化スパース性を用いた畳込みのための方法およびシステム | |
| US20240273163A1 (en) | Accelerator for sparse matrix multiplication in neural networks | |
| CN118410214B (zh) | 一种基于稀疏矩阵的气象数据处理方法、设备及介质 | |
| KR102372869B1 (ko) | 인공 신경망을 위한 행렬 연산기 및 행렬 연산 방법 | |
| US12530170B2 (en) | Vector operation acceleration with convolution computation unit | |
| US12333415B2 (en) | Neural network accelerators | |
| US11748251B2 (en) | Storing tensors in memory based on depth | |
| CN119416850B (zh) | 一种适配硬件张量指令及内存的神经网络推理优化方法 | |
| CN114662647A (zh) | 处理用于神经网络的层的数据 | |
| CN116171437A (zh) | 通过智能内存池进行嵌入的方法和系统 | |
| US20240126617A1 (en) | Deep fusion of kernel execution | |
| CN119850874A (zh) | 三维网格的编码方法、三维网格的解码方法和相关装置 | |
| Díaz | A design methodology for automatic generation of processor arrays based on the polytope model. |
Legal Events
| Date | Code | Title | Description |
|---|---|---|---|
| A521 | Request for written amendment filed |
Free format text: JAPANESE INTERMEDIATE CODE: A523 Effective date: 20260212 |
|
| A621 | Written request for application examination |
Free format text: JAPANESE INTERMEDIATE CODE: A621 Effective date: 20260212 |
|
| A871 | Explanation of circumstances concerning accelerated examination |
Free format text: JAPANESE INTERMEDIATE CODE: A871 Effective date: 20260212 |