CN113705795A - 卷积处理方法、装置、卷积神经网络加速器和存储介质 - Google Patents
卷积处理方法、装置、卷积神经网络加速器和存储介质 Download PDFInfo
- Publication number
- CN113705795A CN113705795A CN202111086222.6A CN202111086222A CN113705795A CN 113705795 A CN113705795 A CN 113705795A CN 202111086222 A CN202111086222 A CN 202111086222A CN 113705795 A CN113705795 A CN 113705795A
- Authority
- CN
- China
- Prior art keywords
- data
- multiply
- pipeline
- add operation
- weight
- Prior art date
- Legal status (The legal status is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the status listed.)
- Pending
Links
- 238000003672 processing method Methods 0.000 title claims abstract description 14
- 238000013528 artificial neural network Methods 0.000 title abstract description 8
- 238000000034 method Methods 0.000 claims abstract description 24
- 238000004590 computer program Methods 0.000 claims description 18
- 230000001960 triggered effect Effects 0.000 claims description 17
- 238000013527 convolutional neural network Methods 0.000 claims description 6
- 238000013473 artificial intelligence Methods 0.000 abstract description 3
- 238000010586 diagram Methods 0.000 description 5
- 238000003708 edge detection Methods 0.000 description 4
- 238000010009 beating Methods 0.000 description 3
- 238000003706 image smoothing Methods 0.000 description 2
- 238000004519 manufacturing process Methods 0.000 description 2
- 230000000694 effects Effects 0.000 description 1
- 238000012986 modification Methods 0.000 description 1
- 230000004048 modification Effects 0.000 description 1
- 230000003287 optical effect Effects 0.000 description 1
- 230000003068 static effect Effects 0.000 description 1
Images
Classifications
-
- G—PHYSICS
- G06—COMPUTING; CALCULATING OR COUNTING
- G06N—COMPUTING ARRANGEMENTS BASED ON SPECIFIC COMPUTATIONAL MODELS
- G06N3/00—Computing arrangements based on biological models
- G06N3/02—Neural networks
- G06N3/04—Architecture, e.g. interconnection topology
- G06N3/045—Combinations of networks
-
- G—PHYSICS
- G06—COMPUTING; CALCULATING OR COUNTING
- G06F—ELECTRIC DIGITAL DATA PROCESSING
- G06F7/00—Methods or arrangements for processing data by operating upon the order or content of the data handled
- G06F7/38—Methods or arrangements for performing computations using exclusively denominational number representation, e.g. using binary, ternary, decimal representation
- G06F7/48—Methods or arrangements for performing computations using exclusively denominational number representation, e.g. using binary, ternary, decimal representation using non-contact-making devices, e.g. tube, solid state device; using unspecified devices
- G06F7/52—Multiplying; Dividing
- G06F7/523—Multiplying only
-
- G—PHYSICS
- G06—COMPUTING; CALCULATING OR COUNTING
- G06F—ELECTRIC DIGITAL DATA PROCESSING
- G06F9/00—Arrangements for program control, e.g. control units
- G06F9/06—Arrangements for program control, e.g. control units using stored programs, i.e. using an internal store of processing equipment to receive or retain programs
- G06F9/30—Arrangements for executing machine instructions, e.g. instruction decode
- G06F9/38—Concurrent instruction execution, e.g. pipeline or look ahead
- G06F9/3867—Concurrent instruction execution, e.g. pipeline or look ahead using instruction pipelines
-
- G—PHYSICS
- G06—COMPUTING; CALCULATING OR COUNTING
- G06N—COMPUTING ARRANGEMENTS BASED ON SPECIFIC COMPUTATIONAL MODELS
- G06N3/00—Computing arrangements based on biological models
- G06N3/02—Neural networks
- G06N3/06—Physical realisation, i.e. hardware implementation of neural networks, neurons or parts of neurons
Landscapes
- Engineering & Computer Science (AREA)
- Physics & Mathematics (AREA)
- Theoretical Computer Science (AREA)
- General Physics & Mathematics (AREA)
- Software Systems (AREA)
- General Engineering & Computer Science (AREA)
- Computing Systems (AREA)
- Biophysics (AREA)
- Health & Medical Sciences (AREA)
- Life Sciences & Earth Sciences (AREA)
- Biomedical Technology (AREA)
- Mathematical Physics (AREA)
- Molecular Biology (AREA)
- General Health & Medical Sciences (AREA)
- Artificial Intelligence (AREA)
- Computational Linguistics (AREA)
- Evolutionary Computation (AREA)
- Data Mining & Analysis (AREA)
- Computational Mathematics (AREA)
- Mathematical Analysis (AREA)
- Mathematical Optimization (AREA)
- Pure & Applied Mathematics (AREA)
- Neurology (AREA)
- Complex Calculations (AREA)
Abstract
Description
Claims (10)
Priority Applications (1)
Application Number | Priority Date | Filing Date | Title |
---|---|---|---|
CN202111086222.6A CN113705795A (zh) | 2021-09-16 | 2021-09-16 | 卷积处理方法、装置、卷积神经网络加速器和存储介质 |
Applications Claiming Priority (1)
Application Number | Priority Date | Filing Date | Title |
---|---|---|---|
CN202111086222.6A CN113705795A (zh) | 2021-09-16 | 2021-09-16 | 卷积处理方法、装置、卷积神经网络加速器和存储介质 |
Publications (1)
Publication Number | Publication Date |
---|---|
CN113705795A true CN113705795A (zh) | 2021-11-26 |
Family
ID=78661126
Family Applications (1)
Application Number | Title | Priority Date | Filing Date |
---|---|---|---|
CN202111086222.6A Pending CN113705795A (zh) | 2021-09-16 | 2021-09-16 | 卷积处理方法、装置、卷积神经网络加速器和存储介质 |
Country Status (1)
Country | Link |
---|---|
CN (1) | CN113705795A (zh) |
Cited By (1)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
CN114528526A (zh) * | 2022-04-24 | 2022-05-24 | 深圳思谋信息科技有限公司 | 卷积数据处理方法、装置、卷积运算加速器和存储介质 |
Citations (9)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
US6988183B1 (en) * | 1998-06-26 | 2006-01-17 | Derek Chi-Lan Wong | Methods for increasing instruction-level parallelism in microprocessors and digital system |
US20180373677A1 (en) * | 2017-05-16 | 2018-12-27 | Jaber Technology Holdings Us Inc. | Apparatus and Methods of Providing Efficient Data Parallelization for Multi-Dimensional FFTs |
CN109313723A (zh) * | 2018-01-15 | 2019-02-05 | 深圳鲲云信息科技有限公司 | 人工智能卷积处理方法、装置、可读存储介质、及终端 |
CN109416755A (zh) * | 2018-01-15 | 2019-03-01 | 深圳鲲云信息科技有限公司 | 人工智能并行处理方法、装置、可读存储介质、及终端 |
CN110598844A (zh) * | 2019-08-06 | 2019-12-20 | 天津大学 | 一种基于fpga的并行卷积神经网络加速器及加速方法 |
CN110647975A (zh) * | 2018-06-27 | 2020-01-03 | 龙芯中科技术有限公司 | 一种数据处理方法、装置、设备以及介质 |
CN111416743A (zh) * | 2020-03-19 | 2020-07-14 | 华中科技大学 | 一种卷积网络加速器、配置方法及计算机可读存储介质 |
WO2020173183A1 (en) * | 2019-02-27 | 2020-09-03 | Huawei Technologies Co., Ltd. | Parallel processing pipeline considerations for video data with portions designated for special treatment |
US20200349433A1 (en) * | 2018-01-15 | 2020-11-05 | Shenzhen Corerain Technologies Co., Ltd. | Streaming-based artificial intelligence convolution processing method and apparatus, readable storage medium and terminal |
-
2021
- 2021-09-16 CN CN202111086222.6A patent/CN113705795A/zh active Pending
Patent Citations (9)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
US6988183B1 (en) * | 1998-06-26 | 2006-01-17 | Derek Chi-Lan Wong | Methods for increasing instruction-level parallelism in microprocessors and digital system |
US20180373677A1 (en) * | 2017-05-16 | 2018-12-27 | Jaber Technology Holdings Us Inc. | Apparatus and Methods of Providing Efficient Data Parallelization for Multi-Dimensional FFTs |
CN109313723A (zh) * | 2018-01-15 | 2019-02-05 | 深圳鲲云信息科技有限公司 | 人工智能卷积处理方法、装置、可读存储介质、及终端 |
CN109416755A (zh) * | 2018-01-15 | 2019-03-01 | 深圳鲲云信息科技有限公司 | 人工智能并行处理方法、装置、可读存储介质、及终端 |
US20200349433A1 (en) * | 2018-01-15 | 2020-11-05 | Shenzhen Corerain Technologies Co., Ltd. | Streaming-based artificial intelligence convolution processing method and apparatus, readable storage medium and terminal |
CN110647975A (zh) * | 2018-06-27 | 2020-01-03 | 龙芯中科技术有限公司 | 一种数据处理方法、装置、设备以及介质 |
WO2020173183A1 (en) * | 2019-02-27 | 2020-09-03 | Huawei Technologies Co., Ltd. | Parallel processing pipeline considerations for video data with portions designated for special treatment |
CN110598844A (zh) * | 2019-08-06 | 2019-12-20 | 天津大学 | 一种基于fpga的并行卷积神经网络加速器及加速方法 |
CN111416743A (zh) * | 2020-03-19 | 2020-07-14 | 华中科技大学 | 一种卷积网络加速器、配置方法及计算机可读存储介质 |
Non-Patent Citations (2)
Title |
---|
徐欣;刘强;王少军;: "一种高度并行的卷积神经网络加速器设计方法", 哈尔滨工业大学学报, no. 04 * |
陈磊;叶焱;: "多方向自适应阈值边缘检测算法及FPGA并行实现", 无线通信技术, no. 04 * |
Cited By (2)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
CN114528526A (zh) * | 2022-04-24 | 2022-05-24 | 深圳思谋信息科技有限公司 | 卷积数据处理方法、装置、卷积运算加速器和存储介质 |
CN114528526B (zh) * | 2022-04-24 | 2022-08-02 | 深圳思谋信息科技有限公司 | 卷积数据处理方法、装置、卷积运算加速器和存储介质 |
Similar Documents
Publication | Publication Date | Title |
---|---|---|
JP6507271B2 (ja) | Cnn処理方法およびデバイス | |
CN115344237B (zh) | 结合Karatsuba和蒙哥马利模乘的数据处理方法 | |
US20190146756A1 (en) | Segment divider, segment division operation method, and electronic device | |
EP3769208B1 (en) | Stochastic rounding logic | |
US20180088908A1 (en) | Circuit for Performing a Multiply-and-Accumulate Operation | |
US20140351566A1 (en) | Moving average processing in processor and processor | |
CN113705795A (zh) | 卷积处理方法、装置、卷积神经网络加速器和存储介质 | |
JP7387017B2 (ja) | アドレス生成方法及びユニット、深層学習処理器、チップ、電子機器並びにコンピュータプログラム | |
CN111445016B (zh) | 加速非线性数学计算的系统及方法 | |
US20070198811A1 (en) | Data-driven information processor performing operations between data sets included in data packet | |
CN110659014B (zh) | 乘法器及神经网络计算平台 | |
CN110716751B (zh) | 高并行度计算平台、系统及计算实现方法 | |
CN113033813A (zh) | 数据处理方法、装置、计算机设备和存储介质 | |
CN111179175B (zh) | 基于卷积神经网络的图像处理方法、装置及存储介质 | |
CN112668709B (zh) | 计算装置以及用于数据重用的方法 | |
CN111008697B (zh) | 一种卷积神经网络加速器实现架构 | |
CN114385112A (zh) | 处理模数乘法的装置及方法 | |
CN116157807A (zh) | 用于可变卷积运算的弹性瓶颈架构 | |
Fischer et al. | BinArray: A scalable hardware accelerator for binary approximated CNNs | |
CN111124358A (zh) | 一种序列累加器的运算方法和设备 | |
Zadiraka et al. | Calculating the Sum of Multidigit Values in a Parallel Computational Model | |
CN112862109B (zh) | 深度学习模型的执行方法、装置、电子设备及存储介质 | |
CN111340215B (zh) | 一种网络模型推理加速方法、装置、存储介质和智能设备 | |
CN111694543B (zh) | 近似乘法器设计方法、近似乘法器和图像锐化电路 | |
CN116301903B (zh) | 一种编译器、ai网络编译方法、处理方法、执行系统 |
Legal Events
Date | Code | Title | Description |
---|---|---|---|
PB01 | Publication | ||
PB01 | Publication | ||
SE01 | Entry into force of request for substantive examination | ||
SE01 | Entry into force of request for substantive examination | ||
CB03 | Change of inventor or designer information |
Inventor after: Hu Feng Inventor after: Zhang Bin Inventor after: Liang Youqiang Inventor after: Liu Zhaohan Inventor after: Shen Xiaoyong Inventor after: Lv Jiangbo Inventor before: Hu Feng Inventor before: Zhang Bin Inventor before: Liang Youqiang Inventor before: Liu Zhaohan Inventor before: Yu Bei Inventor before: Shen Xiaoyong Inventor before: Lv Jiangbo Inventor before: Jia Jiaya |
|
CB03 | Change of inventor or designer information |