JPWO2023155748A5 - - Google Patents

Info

Publication number
JPWO2023155748A5
JPWO2023155748A5 JP2024548397A JP2024548397A JPWO2023155748A5 JP WO2023155748 A5 JPWO2023155748 A5 JP WO2023155748A5 JP 2024548397 A JP2024548397 A JP 2024548397A JP 2024548397 A JP2024548397 A JP 2024548397A JP WO2023155748 A5 JPWO2023155748 A5 JP WO2023155748A5
Authority
JP
Japan
Prior art keywords
filters
array
ifm
convolution
dimension
Prior art date
Legal status (The legal status is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the status listed.)
Pending
Application number
JP2024548397A
Other languages
English (en)
Japanese (ja)
Other versions
JP2025505291A5 (https=
JP2025505291A (ja
Publication date
Priority claimed from US17/673,490 external-priority patent/US20230259758A1/en
Application filed filed Critical
Publication of JP2025505291A publication Critical patent/JP2025505291A/ja
Publication of JP2025505291A5 publication Critical patent/JP2025505291A5/ja
Publication of JPWO2023155748A5 publication Critical patent/JPWO2023155748A5/ja
Pending legal-status Critical Current

Links

JP2024548397A 2022-02-16 2023-02-13 スパースニューラルネットワークのための適応テンソル畳み込みカーネル Pending JP2025505291A (ja)

Applications Claiming Priority (3)

Application Number Priority Date Filing Date Title
US17/673,490 US20230259758A1 (en) 2022-02-16 2022-02-16 Adaptive tensor compute kernel for sparse neural network
US17/673,490 2022-02-16
PCT/CN2023/075661 WO2023155748A1 (en) 2022-02-16 2023-02-13 Adaptive tensor compute kernel for sparse neural network

Publications (3)

Publication Number Publication Date
JP2025505291A JP2025505291A (ja) 2025-02-21
JP2025505291A5 JP2025505291A5 (https=) 2026-02-20
JPWO2023155748A5 true JPWO2023155748A5 (https=) 2026-02-20

Family

ID=87558678

Family Applications (1)

Application Number Title Priority Date Filing Date
JP2024548397A Pending JP2025505291A (ja) 2022-02-16 2023-02-13 スパースニューラルネットワークのための適応テンソル畳み込みカーネル

Country Status (7)

Country Link
US (1) US20230259758A1 (https=)
EP (1) EP4479887A4 (https=)
JP (1) JP2025505291A (https=)
KR (1) KR20240149907A (https=)
CN (1) CN118715527A (https=)
TW (1) TWI857493B (https=)
WO (1) WO2023155748A1 (https=)

Families Citing this family (10)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN116261736B (zh) * 2020-06-12 2024-08-16 墨芯国际有限公司 用于双稀疏卷积处理和并行化的方法和系统
CN112925644B (zh) * 2021-02-26 2024-08-13 北京小米松果电子有限公司 深度学习算子优化方法、装置、设备及存储介质
CN116662330A (zh) * 2022-02-21 2023-08-29 中兴通讯股份有限公司 数据处理方法、转发芯片、存储介质及程序产品
US12567122B1 (en) 2022-04-19 2026-03-03 Nvidia Corporation Application programming interface to modify tensor dimensions
US20230140173A1 (en) * 2022-08-19 2023-05-04 Arnab Raha Deep neural network (dnn) accelerators with heterogeneous tiling
TWI873681B (zh) * 2023-06-14 2025-02-21 緯創資通股份有限公司 物件檢測方法、機器學習方法及電子裝置
CN117707791B (zh) * 2024-02-02 2024-05-14 北京壁仞科技开发有限公司 用于进行注意力运算的方法、设备和存储介质
CN118152713B (zh) * 2024-05-10 2024-08-06 北京壁仞科技开发有限公司 数据处理方法、装置、电子设备和计算机可读存储介质
TWI884041B (zh) * 2024-07-19 2025-05-11 國立清華大學 基於混合精度演算法和記憶體內運算加速器之軟硬體協同運作方法及其系統及非暫態電腦可讀儲存媒體
CN121233886B (zh) * 2025-12-01 2026-03-20 上海壁仞科技股份有限公司 卷积计算方法、电子设备、存储介质及程序产品

Family Cites Families (15)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
US10169298B1 (en) * 2017-05-11 2019-01-01 NovuMind Limited Native tensor processor, using outer product unit
US11475351B2 (en) * 2017-11-15 2022-10-18 Uatc, Llc Systems and methods for object detection, tracking, and motion prediction
US11443176B2 (en) * 2018-05-17 2022-09-13 International Business Machines Corporation Acceleration of convolutional neural networks on analog arrays
CN112888459B (zh) * 2018-06-01 2023-05-23 格里尔公司 卷积神经网络系统及数据分类方法
US11429850B2 (en) * 2018-07-19 2022-08-30 Xilinx, Inc. Performing consecutive mac operations on a set of data using different kernels in a MAC circuit
US11481934B2 (en) * 2018-10-10 2022-10-25 New York University System, method, and computer-accessible medium for generating magnetic resonance imaging-based anatomically guided positron emission tomography reconstruction images with a convolutional neural network
EP3654247B1 (en) * 2018-11-15 2025-01-01 IMEC vzw Convolution engine for neural networks
US10878173B2 (en) * 2018-11-29 2020-12-29 Adobe Inc. Object recognition and tagging based on fusion deep learning models
US11604958B2 (en) * 2019-03-13 2023-03-14 Samsung Electronics Co., Ltd. Method and apparatus for processing computation of zero value in processing of layers in neural network
WO2021071930A1 (en) * 2019-10-07 2021-04-15 Google Llc Redistributing tensor elements between machine learning computing units
US12554962B2 (en) * 2019-12-24 2026-02-17 Intel Corporation Configurable processor element arrays for implementing convolutional neural networks
CN115456161A (zh) * 2020-03-27 2022-12-09 华为技术有限公司 一种数据处理方法和数据处理系统
KR102914873B1 (ko) * 2020-12-14 2026-01-16 삼성전자 주식회사 채널 수에 기초하여 컨볼루션 연산을 수행하는 npu 장치 및 이의 동작 방법
KR102602584B1 (ko) * 2021-04-14 2023-11-16 한국전자통신연구원 인공 지능 반도체 프로세서 및 인공 지능 반도체 프로세서의 동작 방법
US20230195419A1 (en) * 2021-12-17 2023-06-22 Arm Limited System and Method for Accelerating Neural Networks

Similar Documents

Publication Publication Date Title
US20250094530A1 (en) Expanded kernel generation
JP7394104B2 (ja) ハードウェアにおけるカーネルストライドの実行
JP7433356B2 (ja) 加算器を使用した多次元テンソルにおけるデータへのアクセス
CN110520834B (zh) 替选循环限制
US20190340510A1 (en) Sparsifying neural network models
CN110263925B (zh) 一种基于fpga的卷积神经网络前向预测的硬件加速实现装置
CN112989267A (zh) 用于执行卷积运算的方法和系统
WO2019119301A1 (zh) 在卷积神经网络模型中确定特征图像的方法和装置
JPWO2023155748A5 (https=)
CN113326916A (zh) 将卷积映射到分区通道卷积引擎
CN112434786B (zh) 一种基于winograd动态卷积块的图像处理方法
CN113344172A (zh) 将卷积映射到通道卷积引擎
JP7700142B2 (ja) 機械学習アクセラレータの電力削減
CN113496279A (zh) 使用点对点连接的通道卷积引擎的分组卷积
WO2023065983A1 (zh) 计算装置、神经网络处理设备、芯片及处理数据的方法
WO2019136750A1 (zh) 人工智能计算辅助处理装置、方法、存储介质、及终端
CN116348882A (zh) 一种卷积神经网络数据处理方法及其相关设备
CN113536216A (zh) 用分布流水线可分离卷积运算将卷积映射到相连处理元件
CN109598335B (zh) 一种二维卷积脉动阵列结构及实现方法
CN114281755B (zh) 一种面向向量处理器的半精度向量化卷积方法及系统
KR102833795B1 (ko) 컨볼루션 뉴럴 네트워크를 처리하는 방법 및 장치
CN113657587B (zh) 基于fpga的可变形卷积加速方法及装置
CN118587082A (zh) 用于在gpu上执行标准卷积的方法和系统
JP4052181B2 (ja) 通信隠蔽型の並列高速フーリエ変換方法
CN117787365A (zh) 一种卷积数据流的调度方法、装置、介质及设备