CN118715527A - 用于稀疏神经网络的自适应张量计算核 - Google Patents

用于稀疏神经网络的自适应张量计算核 Download PDF

Info

Publication number
CN118715527A
CN118715527A CN202380021998.0A CN202380021998A CN118715527A CN 118715527 A CN118715527 A CN 118715527A CN 202380021998 A CN202380021998 A CN 202380021998A CN 118715527 A CN118715527 A CN 118715527A
Authority
CN
China
Prior art keywords
filters
array
ifm
tensor
dimension
Prior art date
Legal status (The legal status is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the status listed.)
Pending
Application number
CN202380021998.0A
Other languages
English (en)
Chinese (zh)
Inventor
张晓谦
严恩勖
肖志斌
Current Assignee (The listed assignees may be inaccurate. Google has not performed a legal analysis and makes no representation or warranty as to the accuracy of the list.)
Mozi International Co ltd
Original Assignee
Mozi International Co ltd
Priority date (The priority date is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the date listed.)
Filing date
Publication date
Application filed by Mozi International Co ltd filed Critical Mozi International Co ltd
Publication of CN118715527A publication Critical patent/CN118715527A/zh
Pending legal-status Critical Current

Links

Classifications

    • GPHYSICS
    • G06COMPUTING OR CALCULATING; COUNTING
    • G06NCOMPUTING ARRANGEMENTS BASED ON SPECIFIC COMPUTATIONAL MODELS
    • G06N3/00Computing arrangements based on biological models
    • G06N3/02Neural networks
    • G06N3/04Architecture, e.g. interconnection topology
    • G06N3/0464Convolutional networks [CNN, ConvNet]
    • GPHYSICS
    • G06COMPUTING OR CALCULATING; COUNTING
    • G06NCOMPUTING ARRANGEMENTS BASED ON SPECIFIC COMPUTATIONAL MODELS
    • G06N3/00Computing arrangements based on biological models
    • G06N3/02Neural networks
    • G06N3/04Architecture, e.g. interconnection topology
    • G06N3/0495Quantised networks; Sparse networks; Compressed networks
    • GPHYSICS
    • G06COMPUTING OR CALCULATING; COUNTING
    • G06NCOMPUTING ARRANGEMENTS BASED ON SPECIFIC COMPUTATIONAL MODELS
    • G06N3/00Computing arrangements based on biological models
    • G06N3/02Neural networks
    • G06N3/08Learning methods
    • G06N3/082Learning methods modifying the architecture, e.g. adding, deleting or silencing nodes or connections
    • GPHYSICS
    • G06COMPUTING OR CALCULATING; COUNTING
    • G06NCOMPUTING ARRANGEMENTS BASED ON SPECIFIC COMPUTATIONAL MODELS
    • G06N3/00Computing arrangements based on biological models
    • G06N3/02Neural networks
    • G06N3/08Learning methods

Landscapes

  • Engineering & Computer Science (AREA)
  • Theoretical Computer Science (AREA)
  • Physics & Mathematics (AREA)
  • General Health & Medical Sciences (AREA)
  • Computing Systems (AREA)
  • Biomedical Technology (AREA)
  • Biophysics (AREA)
  • Computational Linguistics (AREA)
  • Data Mining & Analysis (AREA)
  • Evolutionary Computation (AREA)
  • Life Sciences & Earth Sciences (AREA)
  • Molecular Biology (AREA)
  • Artificial Intelligence (AREA)
  • General Engineering & Computer Science (AREA)
  • General Physics & Mathematics (AREA)
  • Mathematical Physics (AREA)
  • Software Systems (AREA)
  • Health & Medical Sciences (AREA)
  • Complex Calculations (AREA)
  • Image Processing (AREA)
CN202380021998.0A 2022-02-16 2023-02-13 用于稀疏神经网络的自适应张量计算核 Pending CN118715527A (zh)

Applications Claiming Priority (3)

Application Number Priority Date Filing Date Title
US17/673,490 US20230259758A1 (en) 2022-02-16 2022-02-16 Adaptive tensor compute kernel for sparse neural network
US17/673,490 2022-02-16
PCT/CN2023/075661 WO2023155748A1 (en) 2022-02-16 2023-02-13 Adaptive tensor compute kernel for sparse neural network

Publications (1)

Publication Number Publication Date
CN118715527A true CN118715527A (zh) 2024-09-27

Family

ID=87558678

Family Applications (1)

Application Number Title Priority Date Filing Date
CN202380021998.0A Pending CN118715527A (zh) 2022-02-16 2023-02-13 用于稀疏神经网络的自适应张量计算核

Country Status (7)

Country Link
US (1) US20230259758A1 (https=)
EP (1) EP4479887A4 (https=)
JP (1) JP2025505291A (https=)
KR (1) KR20240149907A (https=)
CN (1) CN118715527A (https=)
TW (1) TWI857493B (https=)
WO (1) WO2023155748A1 (https=)

Cited By (1)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN121233886A (zh) * 2025-12-01 2025-12-30 上海壁仞科技股份有限公司 卷积计算方法、电子设备、存储介质及程序产品

Families Citing this family (9)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN116261736B (zh) * 2020-06-12 2024-08-16 墨芯国际有限公司 用于双稀疏卷积处理和并行化的方法和系统
CN112925644B (zh) * 2021-02-26 2024-08-13 北京小米松果电子有限公司 深度学习算子优化方法、装置、设备及存储介质
CN116662330A (zh) * 2022-02-21 2023-08-29 中兴通讯股份有限公司 数据处理方法、转发芯片、存储介质及程序产品
US12567122B1 (en) 2022-04-19 2026-03-03 Nvidia Corporation Application programming interface to modify tensor dimensions
US20230140173A1 (en) * 2022-08-19 2023-05-04 Arnab Raha Deep neural network (dnn) accelerators with heterogeneous tiling
TWI873681B (zh) * 2023-06-14 2025-02-21 緯創資通股份有限公司 物件檢測方法、機器學習方法及電子裝置
CN117707791B (zh) * 2024-02-02 2024-05-14 北京壁仞科技开发有限公司 用于进行注意力运算的方法、设备和存储介质
CN118152713B (zh) * 2024-05-10 2024-08-06 北京壁仞科技开发有限公司 数据处理方法、装置、电子设备和计算机可读存储介质
TWI884041B (zh) * 2024-07-19 2025-05-11 國立清華大學 基於混合精度演算法和記憶體內運算加速器之軟硬體協同運作方法及其系統及非暫態電腦可讀儲存媒體

Family Cites Families (15)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
US10169298B1 (en) * 2017-05-11 2019-01-01 NovuMind Limited Native tensor processor, using outer product unit
US11475351B2 (en) * 2017-11-15 2022-10-18 Uatc, Llc Systems and methods for object detection, tracking, and motion prediction
US11443176B2 (en) * 2018-05-17 2022-09-13 International Business Machines Corporation Acceleration of convolutional neural networks on analog arrays
CN112888459B (zh) * 2018-06-01 2023-05-23 格里尔公司 卷积神经网络系统及数据分类方法
US11429850B2 (en) * 2018-07-19 2022-08-30 Xilinx, Inc. Performing consecutive mac operations on a set of data using different kernels in a MAC circuit
US11481934B2 (en) * 2018-10-10 2022-10-25 New York University System, method, and computer-accessible medium for generating magnetic resonance imaging-based anatomically guided positron emission tomography reconstruction images with a convolutional neural network
EP3654247B1 (en) * 2018-11-15 2025-01-01 IMEC vzw Convolution engine for neural networks
US10878173B2 (en) * 2018-11-29 2020-12-29 Adobe Inc. Object recognition and tagging based on fusion deep learning models
US11604958B2 (en) * 2019-03-13 2023-03-14 Samsung Electronics Co., Ltd. Method and apparatus for processing computation of zero value in processing of layers in neural network
WO2021071930A1 (en) * 2019-10-07 2021-04-15 Google Llc Redistributing tensor elements between machine learning computing units
US12554962B2 (en) * 2019-12-24 2026-02-17 Intel Corporation Configurable processor element arrays for implementing convolutional neural networks
CN115456161A (zh) * 2020-03-27 2022-12-09 华为技术有限公司 一种数据处理方法和数据处理系统
KR102914873B1 (ko) * 2020-12-14 2026-01-16 삼성전자 주식회사 채널 수에 기초하여 컨볼루션 연산을 수행하는 npu 장치 및 이의 동작 방법
KR102602584B1 (ko) * 2021-04-14 2023-11-16 한국전자통신연구원 인공 지능 반도체 프로세서 및 인공 지능 반도체 프로세서의 동작 방법
US20230195419A1 (en) * 2021-12-17 2023-06-22 Arm Limited System and Method for Accelerating Neural Networks

Cited By (1)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN121233886A (zh) * 2025-12-01 2025-12-30 上海壁仞科技股份有限公司 卷积计算方法、电子设备、存储介质及程序产品

Also Published As

Publication number Publication date
EP4479887A4 (en) 2026-01-07
EP4479887A1 (en) 2024-12-25
TW202343310A (zh) 2023-11-01
WO2023155748A1 (en) 2023-08-24
KR20240149907A (ko) 2024-10-15
TWI857493B (zh) 2024-10-01
US20230259758A1 (en) 2023-08-17
JP2025505291A (ja) 2025-02-21

Similar Documents

Publication Publication Date Title
CN118715527A (zh) 用于稀疏神经网络的自适应张量计算核
CN115485695B (zh) 用于分层权重稀疏卷积处理的方法和系统
CN117273101B (zh) 用于均衡权重稀疏卷积处理的方法及系统
JP2024502225A (ja) ワークロードが平準化された活性化スパース性を用いた畳込みのための方法およびシステム
WO2022223051A1 (zh) 加速器、计算机系统、方法和存储介质
US20240273163A1 (en) Accelerator for sparse matrix multiplication in neural networks
CN116261736B (zh) 用于双稀疏卷积处理和并行化的方法和系统
CN113592702A (zh) 基于深度卷积神经网络的图像算法加速器及系统和方法
CN117391160A (zh) 加速方法、加速器和存储介质
CN119416850B (zh) 一种适配硬件张量指令及内存的神经网络推理优化方法
WO2026051867A1 (zh) 图卷积网络训练方法、图数据处理方法、装置及设备
HK40051237B (zh) 用於卷积神经网络的超像素方法

Legal Events

Date Code Title Description
PB01 Publication
PB01 Publication
SE01 Entry into force of request for substantive examination
SE01 Entry into force of request for substantive examination