JP2025505291A5 - - Google Patents

Info

Publication number
JP2025505291A5
JP2025505291A5 JP2024548397A JP2024548397A JP2025505291A5 JP 2025505291 A5 JP2025505291 A5 JP 2025505291A5 JP 2024548397 A JP2024548397 A JP 2024548397A JP 2024548397 A JP2024548397 A JP 2024548397A JP 2025505291 A5 JP2025505291 A5 JP 2025505291A5
Authority
JP
Japan
Prior art keywords
filters
ifm
array
convolution
dimension
Prior art date
Legal status (The legal status is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the status listed.)
Pending
Application number
JP2024548397A
Other languages
English (en)
Japanese (ja)
Other versions
JP2025505291A (ja
Filing date
Publication date
Priority claimed from US17/673,490 external-priority patent/US20230259758A1/en
Application filed filed Critical
Publication of JP2025505291A publication Critical patent/JP2025505291A/ja
Publication of JP2025505291A5 publication Critical patent/JP2025505291A5/ja
Pending legal-status Critical Current

Links

JP2024548397A 2022-02-16 2023-02-13 スパースニューラルネットワークのための適応テンソル畳み込みカーネル Pending JP2025505291A (ja)

Applications Claiming Priority (3)

Application Number Priority Date Filing Date Title
US17/673,490 2022-02-16
US17/673,490 US20230259758A1 (en) 2022-02-16 2022-02-16 Adaptive tensor compute kernel for sparse neural network
PCT/CN2023/075661 WO2023155748A1 (en) 2022-02-16 2023-02-13 Adaptive tensor compute kernel for sparse neural network

Publications (2)

Publication Number Publication Date
JP2025505291A JP2025505291A (ja) 2025-02-21
JP2025505291A5 true JP2025505291A5 (https=) 2026-02-20

Family

ID=87558678

Family Applications (1)

Application Number Title Priority Date Filing Date
JP2024548397A Pending JP2025505291A (ja) 2022-02-16 2023-02-13 スパースニューラルネットワークのための適応テンソル畳み込みカーネル

Country Status (7)

Country Link
US (1) US20230259758A1 (https=)
EP (1) EP4479887A4 (https=)
JP (1) JP2025505291A (https=)
KR (1) KR20240149907A (https=)
CN (1) CN118715527A (https=)
TW (1) TWI857493B (https=)
WO (1) WO2023155748A1 (https=)

Families Citing this family (10)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN116261736B (zh) * 2020-06-12 2024-08-16 墨芯国际有限公司 用于双稀疏卷积处理和并行化的方法和系统
CN112925644B (zh) * 2021-02-26 2024-08-13 北京小米松果电子有限公司 深度学习算子优化方法、装置、设备及存储介质
CN116662330A (zh) * 2022-02-21 2023-08-29 中兴通讯股份有限公司 数据处理方法、转发芯片、存储介质及程序产品
US12567122B1 (en) 2022-04-19 2026-03-03 Nvidia Corporation Application programming interface to modify tensor dimensions
US20230140173A1 (en) * 2022-08-19 2023-05-04 Arnab Raha Deep neural network (dnn) accelerators with heterogeneous tiling
TWI873681B (zh) * 2023-06-14 2025-02-21 緯創資通股份有限公司 物件檢測方法、機器學習方法及電子裝置
CN117707791B (zh) * 2024-02-02 2024-05-14 北京壁仞科技开发有限公司 用于进行注意力运算的方法、设备和存储介质
CN118152713B (zh) * 2024-05-10 2024-08-06 北京壁仞科技开发有限公司 数据处理方法、装置、电子设备和计算机可读存储介质
TWI884041B (zh) * 2024-07-19 2025-05-11 國立清華大學 基於混合精度演算法和記憶體內運算加速器之軟硬體協同運作方法及其系統及非暫態電腦可讀儲存媒體
CN121233886B (zh) * 2025-12-01 2026-03-20 上海壁仞科技股份有限公司 卷积计算方法、电子设备、存储介质及程序产品

Family Cites Families (15)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
US10169298B1 (en) * 2017-05-11 2019-01-01 NovuMind Limited Native tensor processor, using outer product unit
US11475351B2 (en) * 2017-11-15 2022-10-18 Uatc, Llc Systems and methods for object detection, tracking, and motion prediction
US11443176B2 (en) * 2018-05-17 2022-09-13 International Business Machines Corporation Acceleration of convolutional neural networks on analog arrays
AU2019277698A1 (en) * 2018-06-01 2020-11-19 Grail, Llc Convolutional neural network systems and methods for data classification
US11429850B2 (en) * 2018-07-19 2022-08-30 Xilinx, Inc. Performing consecutive mac operations on a set of data using different kernels in a MAC circuit
US11481934B2 (en) * 2018-10-10 2022-10-25 New York University System, method, and computer-accessible medium for generating magnetic resonance imaging-based anatomically guided positron emission tomography reconstruction images with a convolutional neural network
EP3654247B1 (en) * 2018-11-15 2025-01-01 IMEC vzw Convolution engine for neural networks
US10878173B2 (en) * 2018-11-29 2020-12-29 Adobe Inc. Object recognition and tagging based on fusion deep learning models
US11604958B2 (en) * 2019-03-13 2023-03-14 Samsung Electronics Co., Ltd. Method and apparatus for processing computation of zero value in processing of layers in neural network
US12461789B2 (en) * 2019-10-07 2025-11-04 Google Llc Redistributing tensor elements between machine learning computing units
US12554962B2 (en) * 2019-12-24 2026-02-17 Intel Corporation Configurable processor element arrays for implementing convolutional neural networks
CN115456160A (zh) * 2020-03-27 2022-12-09 华为技术有限公司 一种数据处理方法和数据处理设备
KR102914873B1 (ko) * 2020-12-14 2026-01-16 삼성전자 주식회사 채널 수에 기초하여 컨볼루션 연산을 수행하는 npu 장치 및 이의 동작 방법
KR102602584B1 (ko) * 2021-04-14 2023-11-16 한국전자통신연구원 인공 지능 반도체 프로세서 및 인공 지능 반도체 프로세서의 동작 방법
US20230195419A1 (en) * 2021-12-17 2023-06-22 Arm Limited System and Method for Accelerating Neural Networks

Similar Documents

Publication Publication Date Title
JP2025505291A5 (https=)
US11645529B2 (en) Sparsifying neural network models
CN110119809B (zh) 对神经网络中非对称量化数据执行mac运算的装置和方法
TWI765168B (zh) 用於硬體中之轉置神經網路矩陣之方法、系統及電腦儲存媒體
CN110520834B (zh) 替选循环限制
CN109886400B (zh) 基于卷积核拆分的卷积神经网络硬件加速器系统及其计算方法
CN110263925B (zh) 一种基于fpga的卷积神经网络前向预测的硬件加速实现装置
JP3639323B2 (ja) メモリ分散型並列計算機による連立1次方程式計算処理方法および計算機
CN110543934B (zh) 一种用于卷积神经网络的脉动阵列计算结构及方法
CN107633297B (zh) 一种基于并行快速fir滤波器算法的卷积神经网络硬件加速器
US20230259758A1 (en) Adaptive tensor compute kernel for sparse neural network
CN113326916A (zh) 将卷积映射到分区通道卷积引擎
CN105549078B (zh) 不规则地震数据的五维插值处理方法及装置
CN113344172A (zh) 将卷积映射到通道卷积引擎
CN108802726B (zh) 基于图形处理器gpu的合成孔径雷达成像方法
CN109446478B (zh) 一种基于迭代和可重构方式的复协方差矩阵计算系统
CN113435569A (zh) 使用每通道卷积运算的流水线逐点卷积
CN112434786B (zh) 一种基于winograd动态卷积块的图像处理方法
JP7700142B2 (ja) 機械学習アクセラレータの電力削減
CN110110844A (zh) 基于OpenCL的卷积神经网络并行处理方法
CN116348882A (zh) 一种卷积神经网络数据处理方法及其相关设备
CN114758209B (zh) 卷积结果获取方法、装置、计算机设备及存储介质
CN107516131A (zh) 卷积计算的加速方法和装置、电子设备和存储介质
CN109598335B (zh) 一种二维卷积脉动阵列结构及实现方法
Niu et al. Spec2: Spectral sparse cnn accelerator on fpgas