CN111178518A - Software and hardware cooperative acceleration method based on FPGA - Google Patents
Software and hardware cooperative acceleration method based on FPGA Download PDFInfo
- Publication number
- CN111178518A CN111178518A CN201911350336.XA CN201911350336A CN111178518A CN 111178518 A CN111178518 A CN 111178518A CN 201911350336 A CN201911350336 A CN 201911350336A CN 111178518 A CN111178518 A CN 111178518A
- Authority
- CN
- China
- Prior art keywords
- data
- convolution
- module
- neural network
- result
- Prior art date
- Legal status (The legal status is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the status listed.)
- Pending
Links
Images
Classifications
-
- G—PHYSICS
- G06—COMPUTING; CALCULATING OR COUNTING
- G06N—COMPUTING ARRANGEMENTS BASED ON SPECIFIC COMPUTATIONAL MODELS
- G06N3/00—Computing arrangements based on biological models
- G06N3/02—Neural networks
- G06N3/06—Physical realisation, i.e. hardware implementation of neural networks, neurons or parts of neurons
- G06N3/063—Physical realisation, i.e. hardware implementation of neural networks, neurons or parts of neurons using electronic means
-
- G—PHYSICS
- G06—COMPUTING; CALCULATING OR COUNTING
- G06N—COMPUTING ARRANGEMENTS BASED ON SPECIFIC COMPUTATIONAL MODELS
- G06N3/00—Computing arrangements based on biological models
- G06N3/02—Neural networks
- G06N3/04—Architecture, e.g. interconnection topology
- G06N3/045—Combinations of networks
Landscapes
- Engineering & Computer Science (AREA)
- Physics & Mathematics (AREA)
- Theoretical Computer Science (AREA)
- Health & Medical Sciences (AREA)
- Life Sciences & Earth Sciences (AREA)
- Biomedical Technology (AREA)
- Biophysics (AREA)
- General Health & Medical Sciences (AREA)
- General Physics & Mathematics (AREA)
- Evolutionary Computation (AREA)
- Computational Linguistics (AREA)
- Molecular Biology (AREA)
- Computing Systems (AREA)
- General Engineering & Computer Science (AREA)
- Data Mining & Analysis (AREA)
- Mathematical Physics (AREA)
- Software Systems (AREA)
- Artificial Intelligence (AREA)
- Neurology (AREA)
- Advance Control (AREA)
- Complex Calculations (AREA)
Abstract
Description
Claims (1)
Priority Applications (1)
Application Number | Priority Date | Filing Date | Title |
---|---|---|---|
CN201911350336.XA CN111178518A (en) | 2019-12-24 | 2019-12-24 | Software and hardware cooperative acceleration method based on FPGA |
Applications Claiming Priority (1)
Application Number | Priority Date | Filing Date | Title |
---|---|---|---|
CN201911350336.XA CN111178518A (en) | 2019-12-24 | 2019-12-24 | Software and hardware cooperative acceleration method based on FPGA |
Publications (1)
Publication Number | Publication Date |
---|---|
CN111178518A true CN111178518A (en) | 2020-05-19 |
Family
ID=70646347
Family Applications (1)
Application Number | Title | Priority Date | Filing Date |
---|---|---|---|
CN201911350336.XA Pending CN111178518A (en) | 2019-12-24 | 2019-12-24 | Software and hardware cooperative acceleration method based on FPGA |
Country Status (1)
Country | Link |
---|---|
CN (1) | CN111178518A (en) |
Cited By (21)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
CN111814972A (en) * | 2020-07-08 | 2020-10-23 | 上海雪湖科技有限公司 | Neural network convolution operation acceleration method based on FPGA |
CN111882051A (en) * | 2020-07-29 | 2020-11-03 | 复旦大学 | Global broadcast data input circuit for neural network processing |
CN112001492A (en) * | 2020-08-07 | 2020-11-27 | 中山大学 | Mixed flow type acceleration framework and acceleration method for binary weight Densenet model |
CN112003792A (en) * | 2020-07-23 | 2020-11-27 | 烽火通信科技股份有限公司 | Software and hardware cooperative message acceleration method and device |
CN112329545A (en) * | 2020-10-13 | 2021-02-05 | 江苏大学 | ZCU104 platform-based convolutional neural network implementation and processing method for application of convolutional neural network implementation in fruit identification |
CN112508184A (en) * | 2020-12-16 | 2021-03-16 | 重庆邮电大学 | Design method of fast image recognition accelerator based on convolutional neural network |
CN112734011A (en) * | 2021-01-04 | 2021-04-30 | 北京大学 | Deep neural network accelerator collaborative design method based on incremental synthesis |
CN112766478A (en) * | 2021-01-21 | 2021-05-07 | 中国电子科技集团公司信息科学研究院 | FPGA pipeline structure for convolutional neural network |
CN112862080A (en) * | 2021-03-10 | 2021-05-28 | 中山大学 | Hardware calculation method of attention mechanism of EfficientNet |
CN113033087A (en) * | 2021-03-17 | 2021-06-25 | 电子科技大学 | High-speed data transmission method for optical neural network based on FPGA |
CN113094118A (en) * | 2021-04-26 | 2021-07-09 | 深圳思谋信息科技有限公司 | Data processing system, method, apparatus, computer device and storage medium |
CN113238988A (en) * | 2021-06-08 | 2021-08-10 | 中科寒武纪科技股份有限公司 | Processing system, integrated circuit and board card for optimizing parameters of deep neural network |
CN113238987A (en) * | 2021-06-08 | 2021-08-10 | 中科寒武纪科技股份有限公司 | Statistic quantizer, storage device, processing device and board card for quantized data |
CN113362292A (en) * | 2021-05-27 | 2021-09-07 | 重庆邮电大学 | Bone age assessment method and system based on programmable logic gate array |
CN113392963A (en) * | 2021-05-08 | 2021-09-14 | 北京化工大学 | CNN hardware acceleration system design method based on FPGA |
CN113792621A (en) * | 2021-08-27 | 2021-12-14 | 杭州电子科技大学 | Target detection accelerator design method based on FPGA |
CN113902099A (en) * | 2021-10-08 | 2022-01-07 | 电子科技大学 | Neural network design and optimization method based on software and hardware joint learning |
CN114911628A (en) * | 2022-06-15 | 2022-08-16 | 福州大学 | MobileNet hardware acceleration system based on FPGA |
CN115130672A (en) * | 2022-06-08 | 2022-09-30 | 武汉大学 | Method and device for calculating convolution neural network by software and hardware collaborative optimization |
CN115658323A (en) * | 2022-11-15 | 2023-01-31 | 国网上海能源互联网研究院有限公司 | FPGA load flow calculation acceleration architecture and method based on software and hardware cooperation |
US11775720B2 (en) | 2021-07-02 | 2023-10-03 | International Business Machines Corporation | Integrated circuit development using machine learning-based prediction of power, performance, and area |
Citations (6)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
US20180046894A1 (en) * | 2016-08-12 | 2018-02-15 | DeePhi Technology Co., Ltd. | Method for optimizing an artificial neural network (ann) |
CN108280514A (en) * | 2018-01-05 | 2018-07-13 | 中国科学技术大学 | Sparse neural network acceleration system based on FPGA and design method |
CN109871949A (en) * | 2017-12-22 | 2019-06-11 | 泓图睿语(北京)科技有限公司 | Convolutional neural networks accelerator and accelerated method |
US20190190538A1 (en) * | 2017-12-18 | 2019-06-20 | Facebook, Inc. | Accelerator hardware for compression and decompression |
CN109934339A (en) * | 2019-03-06 | 2019-06-25 | 东南大学 | A kind of general convolutional neural networks accelerator based on a dimension systolic array |
WO2019137060A1 (en) * | 2018-01-15 | 2019-07-18 | 合肥工业大学 | Convolutional neural network hardware accelerator based on multicast network-on-chip, and operation mode thereof |
-
2019
- 2019-12-24 CN CN201911350336.XA patent/CN111178518A/en active Pending
Patent Citations (6)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
US20180046894A1 (en) * | 2016-08-12 | 2018-02-15 | DeePhi Technology Co., Ltd. | Method for optimizing an artificial neural network (ann) |
US20190190538A1 (en) * | 2017-12-18 | 2019-06-20 | Facebook, Inc. | Accelerator hardware for compression and decompression |
CN109871949A (en) * | 2017-12-22 | 2019-06-11 | 泓图睿语(北京)科技有限公司 | Convolutional neural networks accelerator and accelerated method |
CN108280514A (en) * | 2018-01-05 | 2018-07-13 | 中国科学技术大学 | Sparse neural network acceleration system based on FPGA and design method |
WO2019137060A1 (en) * | 2018-01-15 | 2019-07-18 | 合肥工业大学 | Convolutional neural network hardware accelerator based on multicast network-on-chip, and operation mode thereof |
CN109934339A (en) * | 2019-03-06 | 2019-06-25 | 东南大学 | A kind of general convolutional neural networks accelerator based on a dimension systolic array |
Non-Patent Citations (2)
Title |
---|
LEANDRO D. MEDUS等: "A Novel Systolic Parallel Hardware Architecture for the FPGA Acceleration of Feedforward Neural Networks" * |
张榜 等: "一种基于FPGA的卷积神经网络加速器的设计与实现" * |
Cited By (34)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
CN111814972B (en) * | 2020-07-08 | 2024-02-02 | 上海雪湖科技有限公司 | Neural network convolution operation acceleration method based on FPGA |
CN111814972A (en) * | 2020-07-08 | 2020-10-23 | 上海雪湖科技有限公司 | Neural network convolution operation acceleration method based on FPGA |
CN112003792A (en) * | 2020-07-23 | 2020-11-27 | 烽火通信科技股份有限公司 | Software and hardware cooperative message acceleration method and device |
CN112003792B (en) * | 2020-07-23 | 2022-04-15 | 烽火通信科技股份有限公司 | Software and hardware cooperative message acceleration method and device |
CN111882051A (en) * | 2020-07-29 | 2020-11-03 | 复旦大学 | Global broadcast data input circuit for neural network processing |
CN111882051B (en) * | 2020-07-29 | 2022-05-20 | 复旦大学 | Global broadcast data input circuit for neural network processing |
CN112001492A (en) * | 2020-08-07 | 2020-11-27 | 中山大学 | Mixed flow type acceleration framework and acceleration method for binary weight Densenet model |
CN112001492B (en) * | 2020-08-07 | 2023-06-23 | 中山大学 | Mixed running water type acceleration architecture and acceleration method for binary weight DenseNet model |
CN112329545B (en) * | 2020-10-13 | 2024-05-14 | 江苏大学 | ZCU104 platform-based convolutional neural network implementation and processing method of application of same in fruit identification |
CN112329545A (en) * | 2020-10-13 | 2021-02-05 | 江苏大学 | ZCU104 platform-based convolutional neural network implementation and processing method for application of convolutional neural network implementation in fruit identification |
CN112508184A (en) * | 2020-12-16 | 2021-03-16 | 重庆邮电大学 | Design method of fast image recognition accelerator based on convolutional neural network |
CN112508184B (en) * | 2020-12-16 | 2022-04-29 | 重庆邮电大学 | Design method of fast image recognition accelerator based on convolutional neural network |
CN112734011A (en) * | 2021-01-04 | 2021-04-30 | 北京大学 | Deep neural network accelerator collaborative design method based on incremental synthesis |
CN112766478A (en) * | 2021-01-21 | 2021-05-07 | 中国电子科技集团公司信息科学研究院 | FPGA pipeline structure for convolutional neural network |
CN112766478B (en) * | 2021-01-21 | 2024-04-12 | 中国电子科技集团公司信息科学研究院 | FPGA (field programmable Gate array) pipeline structure oriented to convolutional neural network |
CN112862080A (en) * | 2021-03-10 | 2021-05-28 | 中山大学 | Hardware calculation method of attention mechanism of EfficientNet |
CN112862080B (en) * | 2021-03-10 | 2023-08-15 | 中山大学 | Hardware computing method of attention mechanism of Efficient Net |
CN113033087B (en) * | 2021-03-17 | 2022-06-07 | 电子科技大学 | High-speed data transmission method for optical neural network based on FPGA |
CN113033087A (en) * | 2021-03-17 | 2021-06-25 | 电子科技大学 | High-speed data transmission method for optical neural network based on FPGA |
CN113094118A (en) * | 2021-04-26 | 2021-07-09 | 深圳思谋信息科技有限公司 | Data processing system, method, apparatus, computer device and storage medium |
CN113392963A (en) * | 2021-05-08 | 2021-09-14 | 北京化工大学 | CNN hardware acceleration system design method based on FPGA |
CN113392963B (en) * | 2021-05-08 | 2023-12-19 | 北京化工大学 | FPGA-based CNN hardware acceleration system design method |
CN113362292A (en) * | 2021-05-27 | 2021-09-07 | 重庆邮电大学 | Bone age assessment method and system based on programmable logic gate array |
CN113238987A (en) * | 2021-06-08 | 2021-08-10 | 中科寒武纪科技股份有限公司 | Statistic quantizer, storage device, processing device and board card for quantized data |
CN113238988A (en) * | 2021-06-08 | 2021-08-10 | 中科寒武纪科技股份有限公司 | Processing system, integrated circuit and board card for optimizing parameters of deep neural network |
US11775720B2 (en) | 2021-07-02 | 2023-10-03 | International Business Machines Corporation | Integrated circuit development using machine learning-based prediction of power, performance, and area |
CN113792621B (en) * | 2021-08-27 | 2024-04-05 | 杭州电子科技大学 | FPGA-based target detection accelerator design method |
CN113792621A (en) * | 2021-08-27 | 2021-12-14 | 杭州电子科技大学 | Target detection accelerator design method based on FPGA |
CN113902099A (en) * | 2021-10-08 | 2022-01-07 | 电子科技大学 | Neural network design and optimization method based on software and hardware joint learning |
CN113902099B (en) * | 2021-10-08 | 2023-06-02 | 电子科技大学 | Neural network design and optimization method based on software and hardware joint learning |
CN115130672B (en) * | 2022-06-08 | 2024-03-08 | 武汉大学 | Software and hardware collaborative optimization convolutional neural network calculation method and device |
CN115130672A (en) * | 2022-06-08 | 2022-09-30 | 武汉大学 | Method and device for calculating convolution neural network by software and hardware collaborative optimization |
CN114911628A (en) * | 2022-06-15 | 2022-08-16 | 福州大学 | MobileNet hardware acceleration system based on FPGA |
CN115658323A (en) * | 2022-11-15 | 2023-01-31 | 国网上海能源互联网研究院有限公司 | FPGA load flow calculation acceleration architecture and method based on software and hardware cooperation |
Similar Documents
Publication | Publication Date | Title |
---|---|---|
CN111178518A (en) | Software and hardware cooperative acceleration method based on FPGA | |
CN111459877B (en) | Winograd YOLOv2 target detection model method based on FPGA acceleration | |
US20220012593A1 (en) | Neural network accelerator and neural network acceleration method based on structured pruning and low-bit quantization | |
CN109934339B (en) | General convolutional neural network accelerator based on one-dimensional pulse array | |
CN110390385B (en) | BNRP-based configurable parallel general convolutional neural network accelerator | |
CN108280514B (en) | FPGA-based sparse neural network acceleration system and design method | |
CN110516801B (en) | High-throughput-rate dynamic reconfigurable convolutional neural network accelerator | |
CN111242289B (en) | Convolutional neural network acceleration system and method with expandable scale | |
CN107480789B (en) | Efficient conversion method and device of deep learning model | |
CN109447241B (en) | Dynamic reconfigurable convolutional neural network accelerator architecture for field of Internet of things | |
CN111967468A (en) | FPGA-based lightweight target detection neural network implementation method | |
US11763156B2 (en) | Neural network compression based on bank-balanced sparsity | |
CN109146067B (en) | Policy convolution neural network accelerator based on FPGA | |
CN108764466A (en) | Convolutional neural networks hardware based on field programmable gate array and its accelerated method | |
CN113051216B (en) | MobileNet-SSD target detection device and method based on FPGA acceleration | |
CN109284824B (en) | Reconfigurable technology-based device for accelerating convolution and pooling operation | |
CN113792621B (en) | FPGA-based target detection accelerator design method | |
CN113392973B (en) | AI chip neural network acceleration method based on FPGA | |
CN112950656A (en) | Block convolution method for pre-reading data according to channel based on FPGA platform | |
Shahshahani et al. | Memory optimization techniques for fpga based cnn implementations | |
CN109472734B (en) | Target detection network based on FPGA and implementation method thereof | |
CN109740619B (en) | Neural network terminal operation method and device for target recognition | |
Zong-ling et al. | The design of lightweight and multi parallel CNN accelerator based on FPGA | |
CN116822600A (en) | Neural network search chip based on RISC-V architecture | |
CN116011534A (en) | FPGA-based general convolutional neural network accelerator implementation method |
Legal Events
Date | Code | Title | Description |
---|---|---|---|
PB01 | Publication | ||
PB01 | Publication | ||
SE01 | Entry into force of request for substantive examination | ||
SE01 | Entry into force of request for substantive examination | ||
CB03 | Change of inventor or designer information | ||
CB03 | Change of inventor or designer information |
Inventor after: Yan Chenggang Inventor after: Li Yang Inventor after: Liu Bingtao Inventor after: Shi Zhiguo Inventor after: Sun Yaoqi Inventor after: Zhang Jiyong Inventor after: Zhang Yongdong Inventor after: Shen Tao Inventor before: Yan Chenggang Inventor before: Li Yang Inventor before: Liu Bingtao Inventor before: Sun Yaoqi Inventor before: Zhang Jiyong Inventor before: Zhang Yongdong Inventor before: Shen Tao |