CN110163793A - Convolution calculation acceleration method and device - Google Patents
- Publication number
- CN110163793A (application number CN201910446542.4A)
- Authority
- CN
- China
- Prior art keywords
- convolution
- convolutional calculation
- pixel
- row
- calculation unit
- Prior art date
- Legal status (The legal status is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the status listed.)
- Granted
Classifications
-
- G—PHYSICS
- G06—COMPUTING; CALCULATING OR COUNTING
- G06N—COMPUTING ARRANGEMENTS BASED ON SPECIFIC COMPUTATIONAL MODELS
- G06N3/00—Computing arrangements based on biological models
- G06N3/02—Neural networks
- G06N3/06—Physical realisation, i.e. hardware implementation of neural networks, neurons or parts of neurons
- G06N3/063—Physical realisation, i.e. hardware implementation of neural networks, neurons or parts of neurons using electronic means
-
- G—PHYSICS
- G06—COMPUTING; CALCULATING OR COUNTING
- G06T—IMAGE DATA PROCESSING OR GENERATION, IN GENERAL
- G06T1/00—General purpose image data processing
- G06T1/20—Processor architectures; Processor configuration, e.g. pipelining
-
- Y—GENERAL TAGGING OF NEW TECHNOLOGICAL DEVELOPMENTS; GENERAL TAGGING OF CROSS-SECTIONAL TECHNOLOGIES SPANNING OVER SEVERAL SECTIONS OF THE IPC; TECHNICAL SUBJECTS COVERED BY FORMER USPC CROSS-REFERENCE ART COLLECTIONS [XRACs] AND DIGESTS
- Y02—TECHNOLOGIES OR APPLICATIONS FOR MITIGATION OR ADAPTATION AGAINST CLIMATE CHANGE
- Y02D—CLIMATE CHANGE MITIGATION TECHNOLOGIES IN INFORMATION AND COMMUNICATION TECHNOLOGIES [ICT], I.E. INFORMATION AND COMMUNICATION TECHNOLOGIES AIMING AT THE REDUCTION OF THEIR OWN ENERGY USE
- Y02D10/00—Energy efficient computing, e.g. low power processors, power management or thermal management
Landscapes
- Engineering & Computer Science (AREA)
- Physics & Mathematics (AREA)
- Theoretical Computer Science (AREA)
- General Physics & Mathematics (AREA)
- Health & Medical Sciences (AREA)
- Life Sciences & Earth Sciences (AREA)
- Biomedical Technology (AREA)
- Biophysics (AREA)
- Computational Linguistics (AREA)
- Artificial Intelligence (AREA)
- Neurology (AREA)
- Data Mining & Analysis (AREA)
- Evolutionary Computation (AREA)
- General Health & Medical Sciences (AREA)
- Molecular Biology (AREA)
- Computing Systems (AREA)
- General Engineering & Computer Science (AREA)
- Mathematical Physics (AREA)
- Software Systems (AREA)
- Image Analysis (AREA)
Claims (10)
Priority Applications (1)
Application Number | Priority Date | Filing Date | Title |
---|---|---|---|
CN201910446542.4A (CN110163793B, zh) | 2019-05-27 | 2019-05-27 | Convolution calculation acceleration method and device |
Applications Claiming Priority (1)
Application Number | Priority Date | Filing Date | Title |
---|---|---|---|
CN201910446542.4A (CN110163793B, zh) | 2019-05-27 | 2019-05-27 | Convolution calculation acceleration method and device |
Publications (2)
Publication Number | Publication Date |
---|---|
CN110163793A (zh) | 2019-08-23 |
CN110163793B (zh) | 2023-05-23 |
Family
ID=67629292
Family Applications (1)
Application Number | Title | Priority Date | Filing Date |
---|---|---|---|
CN201910446542.4A (CN110163793B, zh; Active) | Convolution calculation acceleration method and device | 2019-05-27 | 2019-05-27 |
Country Status (1)
Country | Link |
---|---|
CN (1) | CN110163793B (zh) |
Cited By (2)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
CN111898743A (zh) * | 2020-06-02 | 2020-11-06 | 深圳市九天睿芯科技有限公司 | CNN acceleration method and accelerator |
CN112183732A (zh) * | 2020-10-22 | 2021-01-05 | 中国人民解放军国防科技大学 | Convolutional neural network acceleration method, apparatus and computer device |
Citations (6)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
CN106846235A (zh) * | 2016-12-26 | 2017-06-13 | 中国科学院计算技术研究所 | Convolution optimization method and system accelerated using NVIDIA Kepler GPU assembly instructions |
CN107578055A (zh) * | 2017-06-20 | 2018-01-12 | 北京陌上花科技有限公司 | Image prediction method and device |
CN108257114A (zh) * | 2017-12-29 | 2018-07-06 | 天津市万贸科技有限公司 | Deep-learning-based automatic defect identification method for power transmission equipment |
CN108681984A (zh) * | 2018-07-26 | 2018-10-19 | 珠海市微半导体有限公司 | Acceleration circuit for a 3*3 convolution algorithm |
US20190035047A1 (en) * | 2017-07-28 | 2019-01-31 | Google Inc. | Image Capture Devices Featuring Intelligent Use of Lightweight Hardware-Generated Statistics |
CN109782603A (zh) * | 2019-02-03 | 2019-05-21 | 中国石油大学(华东) | Detection method and monitoring system for coupled faults of rotating machinery |
- 2019-05-27: CN application CN201910446542.4A (patent CN110163793B, zh), status Active
Also Published As
Publication number | Publication date |
---|---|
CN110163793B (zh) | 2023-05-23 |
Similar Documents
Publication | Publication Date | Title |
---|---|---|
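CN109978161B (zh) | | General-purpose convolution kernel system for synchronous convolution-pooling processing |
CN103049241B (zh) | | Method for improving the computing performance of a CPU+GPU heterogeneous device |
CN108108809B (zh) | | Hardware architecture for accelerating inference of convolutional neural networks, and its working method |
CN107862374A (zh) | | Pipeline-based neural network processing system and processing method |
CN108090565A (zh) | | Parallelized training acceleration method for convolutional neural networks |
CN107862378A (zh) | | Multi-core-based convolutional neural network acceleration method and system, storage medium and terminal |
CN112200300B (zh) | | Convolutional neural network operation method and device |
CN107085562B (zh) | | Neural network processor based on an efficiently multiplexed data stream, and design method |
CN102521854A (zh) | | Parallel streamline placement method for two-dimensional flow fields |
CN107341761A (zh) | | Computation execution method and system for deep neural networks |
CN114995782B (zh) | | Data processing method, apparatus, device and readable storage medium |
CN110163793A (zh) | | Convolution calculation acceleration method and device |
CN112596701B (zh) | | FPGA-accelerated implementation method based on one-sided Jacobi singular value decomposition |
CN109146065A (zh) | | Convolution operation method and device for two-dimensional data |
WO2023160050A1 (zh) | | Data processing method, apparatus, device and storage medium |
CN108197075B (zh) | | Multi-core implementation method for the Inception structure |
CN116128019A (zh) | | Parallel training method and device for Transformer models |
WO2020103883A1 (zh) | | Method, circuit and SoC for performing matrix multiplication |
CN110490308A (zh) | | Acceleration library design method, terminal device and storage medium |
CN109447239B (zh) | | ARM-based embedded convolutional neural network acceleration method |
CN112560356A (zh) | | Many-core optimization method for sparse matrix-vector multiplication on many-core architectures |
CN106484532A (zh) | | GPGPU parallel computing method for SPH fluid simulation |
CN109032667A (zh) | | Method and system for fast construction of neighbor lists in molecular dynamics simulation |
US11874898B2 (en) | | Streaming-based artificial intelligence convolution processing method and apparatus, readable storage medium and terminal |
CN107256203A (zh) | | Implementation method and device for matrix-vector multiplication |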
Legal Events
Date | Code | Title | Description |
---|---|---|---|
| | PB01 | Publication | |
| | SE01 | Entry into force of request for substantive examination | |
| | CB03 | Change of inventor or designer information | Inventor after: Su Fang, Tian Hui, Wu Tongda, Li Jinyang, Ma Jun. Inventor before: Su Fang, Liu Yongpan, Tian Hui, Wu Tongda, Li Jinyang, Ma Jun |
| | CB03 | Change of inventor or designer information | Inventor after: Su Fang, Wu Tongda, Li Jinyang, Ma Jun. Inventor before: Su Fang, Liu Yongpan, Tian Hui, Wu Tongda, Li Jinyang, Ma Jun |
| | CI02 | Correction of invention patent application | Correction item: Inventor. Correct: Su Fang, Liu Yongpan, Tian Hui, Wu Tongda, Ma Jun, Li Jinyang. False: Su Fang, Tian Hui, Wu Tongda, Li Jinyang, Ma Jun. Number: 24-01. Volume: 36 |
| | GR01 | Patent grant | |