CN112771546A - 运算加速器和压缩方法 - Google Patents

运算加速器和压缩方法 Download PDF

Info

Publication number
CN112771546A
CN112771546A CN201880098124.4A CN201880098124A CN112771546A CN 112771546 A CN112771546 A CN 112771546A CN 201880098124 A CN201880098124 A CN 201880098124A CN 112771546 A CN112771546 A CN 112771546A
Authority
CN
China
Prior art keywords
calculation result
compression
accelerator
data
control instruction
Prior art date
Legal status (The legal status is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the status listed.)
Pending
Application number
CN201880098124.4A
Other languages
English (en)
Inventor
刘保庆
刘虎
陈清龙
Current Assignee (The listed assignees may be inaccurate. Google has not performed a legal analysis and makes no representation or warranty as to the accuracy of the list.)
Huawei Technologies Co Ltd
Original Assignee
Huawei Technologies Co Ltd
Priority date (The priority date is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the date listed.)
Filing date
Publication date
Application filed by Huawei Technologies Co Ltd filed Critical Huawei Technologies Co Ltd
Publication of CN112771546A publication Critical patent/CN112771546A/zh
Pending legal-status Critical Current

Links

Images

Classifications

    • GPHYSICS
    • G06COMPUTING; CALCULATING OR COUNTING
    • G06FELECTRIC DIGITAL DATA PROCESSING
    • G06F13/00Interconnection of, or transfer of information or other signals between, memories, input/output devices or central processing units
    • G06F13/14Handling requests for interconnection or transfer
    • G06F13/20Handling requests for interconnection or transfer for access to input/output bus
    • G06F13/28Handling requests for interconnection or transfer for access to input/output bus using burst mode transfer, e.g. direct memory access DMA, cycle steal
    • HELECTRICITY
    • H03ELECTRONIC CIRCUITRY
    • H03MCODING; DECODING; CODE CONVERSION IN GENERAL
    • H03M7/00Conversion of a code where information is represented by a given sequence or number of digits to a code where the same, similar or subset of information is represented by a different sequence or number of digits
    • H03M7/30Compression; Expansion; Suppression of unnecessary data, e.g. redundancy reduction
    • H03M7/60General implementation details not specific to a particular type of compression
    • H03M7/6047Power optimization with respect to the encoder, decoder, storage or transmission
    • GPHYSICS
    • G06COMPUTING; CALCULATING OR COUNTING
    • G06FELECTRIC DIGITAL DATA PROCESSING
    • G06F17/00Digital computing or data processing equipment or methods, specially adapted for specific functions
    • G06F17/10Complex mathematical operations
    • G06F17/16Matrix or vector computation, e.g. matrix-matrix or matrix-vector multiplication, matrix factorization
    • GPHYSICS
    • G06COMPUTING; CALCULATING OR COUNTING
    • G06NCOMPUTING ARRANGEMENTS BASED ON SPECIFIC COMPUTATIONAL MODELS
    • G06N3/00Computing arrangements based on biological models
    • G06N3/02Neural networks
    • G06N3/06Physical realisation, i.e. hardware implementation of neural networks, neurons or parts of neurons
    • G06N3/063Physical realisation, i.e. hardware implementation of neural networks, neurons or parts of neurons using electronic means
    • HELECTRICITY
    • H03ELECTRONIC CIRCUITRY
    • H03MCODING; DECODING; CODE CONVERSION IN GENERAL
    • H03M7/00Conversion of a code where information is represented by a given sequence or number of digits to a code where the same, similar or subset of information is represented by a different sequence or number of digits
    • H03M7/30Compression; Expansion; Suppression of unnecessary data, e.g. redundancy reduction
    • H03M7/3059Digital compression and data reduction techniques where the original information is represented by a subset or similar information, e.g. lossy compression
    • HELECTRICITY
    • H03ELECTRONIC CIRCUITRY
    • H03MCODING; DECODING; CODE CONVERSION IN GENERAL
    • H03M7/00Conversion of a code where information is represented by a given sequence or number of digits to a code where the same, similar or subset of information is represented by a different sequence or number of digits
    • H03M7/30Compression; Expansion; Suppression of unnecessary data, e.g. redundancy reduction
    • H03M7/60General implementation details not specific to a particular type of compression
    • H03M7/6017Methods or arrangements to increase the throughput
    • GPHYSICS
    • G06COMPUTING; CALCULATING OR COUNTING
    • G06NCOMPUTING ARRANGEMENTS BASED ON SPECIFIC COMPUTATIONAL MODELS
    • G06N3/00Computing arrangements based on biological models
    • G06N3/02Neural networks
    • G06N3/04Architecture, e.g. interconnection topology
    • G06N3/045Combinations of networks
    • GPHYSICS
    • G06COMPUTING; CALCULATING OR COUNTING
    • G06NCOMPUTING ARRANGEMENTS BASED ON SPECIFIC COMPUTATIONAL MODELS
    • G06N3/00Computing arrangements based on biological models
    • G06N3/02Neural networks
    • G06N3/04Architecture, e.g. interconnection topology
    • G06N3/048Activation functions
    • HELECTRICITY
    • H03ELECTRONIC CIRCUITRY
    • H03MCODING; DECODING; CODE CONVERSION IN GENERAL
    • H03M7/00Conversion of a code where information is represented by a given sequence or number of digits to a code where the same, similar or subset of information is represented by a different sequence or number of digits
    • H03M7/30Compression; Expansion; Suppression of unnecessary data, e.g. redundancy reduction
    • HELECTRICITY
    • H03ELECTRONIC CIRCUITRY
    • H03MCODING; DECODING; CODE CONVERSION IN GENERAL
    • H03M7/00Conversion of a code where information is represented by a given sequence or number of digits to a code where the same, similar or subset of information is represented by a different sequence or number of digits
    • H03M7/30Compression; Expansion; Suppression of unnecessary data, e.g. redundancy reduction
    • H03M7/70Type of the data to be coded, other than image and sound

Landscapes

  • Engineering & Computer Science (AREA)
  • Theoretical Computer Science (AREA)
  • Physics & Mathematics (AREA)
  • General Physics & Mathematics (AREA)
  • Mathematical Physics (AREA)
  • Data Mining & Analysis (AREA)
  • General Engineering & Computer Science (AREA)
  • Mathematical Analysis (AREA)
  • Mathematical Optimization (AREA)
  • Computational Mathematics (AREA)
  • Pure & Applied Mathematics (AREA)
  • Software Systems (AREA)
  • Computing Systems (AREA)
  • Biophysics (AREA)
  • Biomedical Technology (AREA)
  • Life Sciences & Earth Sciences (AREA)
  • Health & Medical Sciences (AREA)
  • Algebra (AREA)
  • Databases & Information Systems (AREA)
  • Molecular Biology (AREA)
  • General Health & Medical Sciences (AREA)
  • Evolutionary Computation (AREA)
  • Computational Linguistics (AREA)
  • Artificial Intelligence (AREA)
  • Neurology (AREA)
  • Advance Control (AREA)

Abstract

本申请公开了一种运算加速器,该运算加速器包括:第一缓存,用于存储第一输入数据;第二缓存,用于存储权重数据;与该输入缓存和该权重缓存连接的运算电路,用于对第一输入数据和该权重数据进行矩阵乘运算以得到计算结果;压缩模块,用于对该计算结果进行压缩以得到压缩数据;与该压缩模块连接的直接存储器访问控制器DMAC,用于将该压缩数据存入到该运算加速器之外的存储器。由于在该运算加速器中增加了压缩模块,降低了从该运算加速器中搬运计算结果到存储器的数据量,节省该运算加速器的I/O带宽,提升该运算加速器的计算性能。

Description

PCT国内申请,说明书已公开。

Claims (24)

  1. PCT国内申请,权利要求书已公开。
CN201880098124.4A 2018-09-30 2018-09-30 运算加速器和压缩方法 Pending CN112771546A (zh)

Applications Claiming Priority (1)

Application Number Priority Date Filing Date Title
PCT/CN2018/109117 WO2020062252A1 (zh) 2018-09-30 2018-09-30 运算加速器和压缩方法

Publications (1)

Publication Number Publication Date
CN112771546A true CN112771546A (zh) 2021-05-07

Family

ID=69949799

Family Applications (1)

Application Number Title Priority Date Filing Date
CN201880098124.4A Pending CN112771546A (zh) 2018-09-30 2018-09-30 运算加速器和压缩方法

Country Status (4)

Country Link
US (1) US11960421B2 (zh)
EP (1) EP3852015A4 (zh)
CN (1) CN112771546A (zh)
WO (1) WO2020062252A1 (zh)

Families Citing this family (3)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
US11507831B2 (en) * 2020-02-24 2022-11-22 Stmicroelectronics International N.V. Pooling unit for deep learning acceleration
KR102383962B1 (ko) * 2020-11-19 2022-04-07 한국전자기술연구원 가변 데이터 압축/복원기를 포함하는 딥러닝 가속 장치
CN112559043A (zh) * 2020-12-23 2021-03-26 苏州易行电子科技有限公司 一种轻量级人工智能加速模块

Family Cites Families (11)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
DE102004004796B4 (de) * 2004-01-30 2007-11-29 Infineon Technologies Ag Vorrichtung zur Datenübertragung zwischen Speichern
JP2007226615A (ja) * 2006-02-24 2007-09-06 Matsushita Electric Ind Co Ltd 情報処理装置、圧縮プログラム生成方法及び情報処理システム
US9153230B2 (en) * 2012-10-23 2015-10-06 Google Inc. Mobile speech recognition hardware accelerator
US10540588B2 (en) * 2015-06-29 2020-01-21 Microsoft Technology Licensing, Llc Deep neural network processing on hardware accelerators with stacked memory
CN106954002B (zh) * 2016-01-07 2019-03-26 深圳市汇顶科技股份有限公司 一种指纹数据的压缩方法及装置
US10733505B2 (en) * 2016-11-10 2020-08-04 Google Llc Performing kernel striding in hardware
US10096134B2 (en) * 2017-02-01 2018-10-09 Nvidia Corporation Data compaction and memory bandwidth reduction for sparse neural networks
CN107341544B (zh) * 2017-06-30 2020-04-10 清华大学 一种基于可分割阵列的可重构加速器及其实现方法
US20190228037A1 (en) * 2017-08-19 2019-07-25 Wave Computing, Inc. Checkpointing data flow graph computation for machine learning
CN108416434B (zh) * 2018-02-07 2021-06-04 复旦大学 针对神经网络的卷积层与全连接层进行加速的电路结构
US10846363B2 (en) * 2018-11-19 2020-11-24 Microsoft Technology Licensing, Llc Compression-encoding scheduled inputs for matrix computations

Also Published As

Publication number Publication date
WO2020062252A1 (zh) 2020-04-02
US20210216483A1 (en) 2021-07-15
US11960421B2 (en) 2024-04-16
EP3852015A4 (en) 2021-09-01
EP3852015A1 (en) 2021-07-21

Similar Documents

Publication Publication Date Title
US11960421B2 (en) Operation accelerator and compression method
US20230026006A1 (en) Convolution computation engine, artificial intelligence chip, and data processing method
CN111338695B (zh) 基于流水线技术的数据处理方法及相关产品
CN114723033B (zh) 数据处理方法、装置、ai芯片、电子设备及存储介质
US11570477B2 (en) Data preprocessing and data augmentation in frequency domain
CN113570033B (zh) 神经网络处理单元、神经网络的处理方法及其装置
US20220253222A1 (en) Data reduction method, apparatus, computing device, and storage medium
CN110874627A (zh) 数据处理方法、数据处理装置及计算机可读介质
CN111083933A (zh) 数据存储及获取方法和装置
CN111813721B (zh) 神经网络数据处理方法、装置、设备及存储介质
US20200242467A1 (en) Calculation method and calculation device for sparse neural network, electronic device, computer readable storage medium, and computer program product
CN115130672B (zh) 一种软硬件协同优化卷积神经网络计算的方法及装置
CN115292033A (zh) 模型运行方法、装置、存储介质及电子设备
WO2022213073A1 (en) Sparse machine learning acceleration
CN113495669B (zh) 一种解压装置、加速器、和用于解压装置的方法
CN114118394A (zh) 神经网络模型的加速方法和装置
CN111382853B (zh) 数据处理装置、方法、芯片及电子设备
CN113641674A (zh) 一种自适应全局序号发生方法和装置
US20220101100A1 (en) Load distribution for a distributed neural network
CN113570034B (zh) 处理装置、神经网络的处理方法及其装置
CN115482147B (zh) 基于压缩数据直接计算的高效并行图处理方法和系统
CN113095211B (zh) 一种图像处理方法、系统及电子设备
US20220414457A1 (en) Selective data structure encoding for deep neural network training
Bok et al. Real-Time Inference Platform for Object Detection on Edge Device
CN116709419A (zh) 张量数据发送处理方法及张量数据接收处理方法

Legal Events

Date Code Title Description
PB01 Publication
PB01 Publication
SE01 Entry into force of request for substantive examination
SE01 Entry into force of request for substantive examination