CN109558170A - 一种支持数据级并行和多指令融合的二维数据通路架构 - Google Patents
一种支持数据级并行和多指令融合的二维数据通路架构 Download PDFInfo
- Publication number
- CN109558170A CN109558170A CN201811314543.5A CN201811314543A CN109558170A CN 109558170 A CN109558170 A CN 109558170A CN 201811314543 A CN201811314543 A CN 201811314543A CN 109558170 A CN109558170 A CN 109558170A
- Authority
- CN
- China
- Prior art keywords
- unit
- parallel
- arithmetical
- data
- arithmetic logic
- Prior art date
- Legal status (The legal status is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the status listed.)
- Granted
Links
- 230000004927 fusion Effects 0.000 title description 23
- 238000012805 post-processing Methods 0.000 claims abstract description 56
- 238000012545 processing Methods 0.000 claims abstract description 35
- 238000006073 displacement reaction Methods 0.000 claims description 7
- 238000000034 method Methods 0.000 abstract description 19
- 230000008569 process Effects 0.000 abstract description 8
- 238000010586 diagram Methods 0.000 description 22
- 238000001914 filtration Methods 0.000 description 5
- 241001269238 Data Species 0.000 description 4
- 238000009825 accumulation Methods 0.000 description 4
- 238000004364 calculation method Methods 0.000 description 3
- 239000011159 matrix material Substances 0.000 description 3
- 230000009466 transformation Effects 0.000 description 3
- 230000001186 cumulative effect Effects 0.000 description 2
- 235000013399 edible fruits Nutrition 0.000 description 2
- 230000004044 response Effects 0.000 description 2
- 229920006395 saturated elastomer Polymers 0.000 description 2
- 238000011156 evaluation Methods 0.000 description 1
- 238000004519 manufacturing process Methods 0.000 description 1
- 238000000926 separation method Methods 0.000 description 1
- XLYOFNOQVPJJNP-UHFFFAOYSA-N water Substances O XLYOFNOQVPJJNP-UHFFFAOYSA-N 0.000 description 1
Classifications
-
- G—PHYSICS
- G06—COMPUTING; CALCULATING OR COUNTING
- G06F—ELECTRIC DIGITAL DATA PROCESSING
- G06F9/00—Arrangements for program control, e.g. control units
- G06F9/06—Arrangements for program control, e.g. control units using stored programs, i.e. using an internal store of processing equipment to receive or retain programs
- G06F9/30—Arrangements for executing machine instructions, e.g. instruction decode
- G06F9/38—Concurrent instruction execution, e.g. pipeline or look ahead
- G06F9/3836—Instruction issuing, e.g. dynamic instruction scheduling or out of order instruction execution
- G06F9/3853—Instruction issuing, e.g. dynamic instruction scheduling or out of order instruction execution of compound instructions
-
- G—PHYSICS
- G06—COMPUTING; CALCULATING OR COUNTING
- G06F—ELECTRIC DIGITAL DATA PROCESSING
- G06F9/00—Arrangements for program control, e.g. control units
- G06F9/06—Arrangements for program control, e.g. control units using stored programs, i.e. using an internal store of processing equipment to receive or retain programs
- G06F9/30—Arrangements for executing machine instructions, e.g. instruction decode
- G06F9/38—Concurrent instruction execution, e.g. pipeline or look ahead
- G06F9/3885—Concurrent instruction execution, e.g. pipeline or look ahead using a plurality of independent parallel functional units
- G06F9/3887—Concurrent instruction execution, e.g. pipeline or look ahead using a plurality of independent parallel functional units controlled by a single instruction for multiple data lanes [SIMD]
Landscapes
- Engineering & Computer Science (AREA)
- Software Systems (AREA)
- Theoretical Computer Science (AREA)
- Physics & Mathematics (AREA)
- General Engineering & Computer Science (AREA)
- General Physics & Mathematics (AREA)
- Advance Control (AREA)
- Complex Calculations (AREA)
Abstract
Description
Claims (10)
Priority Applications (1)
Application Number | Priority Date | Filing Date | Title |
---|---|---|---|
CN201811314543.5A CN109558170B (zh) | 2018-11-06 | 2018-11-06 | 一种支持数据级并行和多指令融合的二维数据通路架构 |
Applications Claiming Priority (1)
Application Number | Priority Date | Filing Date | Title |
---|---|---|---|
CN201811314543.5A CN109558170B (zh) | 2018-11-06 | 2018-11-06 | 一种支持数据级并行和多指令融合的二维数据通路架构 |
Publications (2)
Publication Number | Publication Date |
---|---|
CN109558170A true CN109558170A (zh) | 2019-04-02 |
CN109558170B CN109558170B (zh) | 2021-05-04 |
Family
ID=65865994
Family Applications (1)
Application Number | Title | Priority Date | Filing Date |
---|---|---|---|
CN201811314543.5A Active CN109558170B (zh) | 2018-11-06 | 2018-11-06 | 一种支持数据级并行和多指令融合的二维数据通路架构 |
Country Status (1)
Country | Link |
---|---|
CN (1) | CN109558170B (zh) |
Citations (9)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
US5237667A (en) * | 1987-06-05 | 1993-08-17 | Mitsubishi Denki Kabushiki Kaisha | Digital signal processor system having host processor for writing instructions into internal processor memory |
WO2001009717A1 (en) * | 1999-08-02 | 2001-02-08 | Morton Steven G | Video digital signal processor chip |
CN101174200B (zh) * | 2007-05-18 | 2010-09-08 | 清华大学 | 一种具有五级流水线结构的浮点乘加融合单元 |
CN102508643A (zh) * | 2011-11-16 | 2012-06-20 | 刘大可 | 一种多核并行数字信号处理器及并行指令集的运行方法 |
CN102707931A (zh) * | 2012-05-09 | 2012-10-03 | 刘大可 | 一种基于并行数据通道的数字信号处理器 |
US8725990B1 (en) * | 2004-11-15 | 2014-05-13 | Nvidia Corporation | Configurable SIMD engine with high, low and mixed precision modes |
CN105468335A (zh) * | 2015-11-24 | 2016-04-06 | 中国科学院计算技术研究所 | 流水级运算装置、数据处理方法及片上网络芯片 |
CN103019656B (zh) * | 2012-12-04 | 2016-04-27 | 中国科学院半导体研究所 | 可动态重构的多级并行单指令多数据阵列处理系统 |
US9712185B2 (en) * | 2012-05-19 | 2017-07-18 | Olsen Ip Reserve, Llc | System and method for improved fractional binary to fractional residue converter and multipler |
-
2018
- 2018-11-06 CN CN201811314543.5A patent/CN109558170B/zh active Active
Patent Citations (9)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
US5237667A (en) * | 1987-06-05 | 1993-08-17 | Mitsubishi Denki Kabushiki Kaisha | Digital signal processor system having host processor for writing instructions into internal processor memory |
WO2001009717A1 (en) * | 1999-08-02 | 2001-02-08 | Morton Steven G | Video digital signal processor chip |
US8725990B1 (en) * | 2004-11-15 | 2014-05-13 | Nvidia Corporation | Configurable SIMD engine with high, low and mixed precision modes |
CN101174200B (zh) * | 2007-05-18 | 2010-09-08 | 清华大学 | 一种具有五级流水线结构的浮点乘加融合单元 |
CN102508643A (zh) * | 2011-11-16 | 2012-06-20 | 刘大可 | 一种多核并行数字信号处理器及并行指令集的运行方法 |
CN102707931A (zh) * | 2012-05-09 | 2012-10-03 | 刘大可 | 一种基于并行数据通道的数字信号处理器 |
US9712185B2 (en) * | 2012-05-19 | 2017-07-18 | Olsen Ip Reserve, Llc | System and method for improved fractional binary to fractional residue converter and multipler |
CN103019656B (zh) * | 2012-12-04 | 2016-04-27 | 中国科学院半导体研究所 | 可动态重构的多级并行单指令多数据阵列处理系统 |
CN105468335A (zh) * | 2015-11-24 | 2016-04-06 | 中国科学院计算技术研究所 | 流水级运算装置、数据处理方法及片上网络芯片 |
Non-Patent Citations (2)
Title |
---|
ANDREAS EHLIAR: "Area efficient floating-point adder and multiplier with IEEE-754 compatible semantics", 《2014 INTERNATIONAL CONFERENCE ON FIELD-PROGRAMMABLE TECHNOLOGY (FPT)》 * |
PER KARLSTROM 等: "High Performance, Low Latency FPGA based Floating Point Adder and Multiplier Units in a Virtex 4", 《2006 NORCHIP》 * |
Also Published As
Publication number | Publication date |
---|---|
CN109558170B (zh) | 2021-05-04 |
Similar Documents
Publication | Publication Date | Title |
---|---|---|
Liu et al. | Automatic code generation of convolutional neural networks in FPGA implementation | |
CN100530168C (zh) | 用于执行计算操作的系统、方法及设备 | |
CN103064650B (zh) | 控制移位分组数据的位校正的装置 | |
CN107665126A (zh) | 用于外积累加操作的处理器和方法 | |
CN100405361C (zh) | 用于执行计算操作的方法、系统以及设备 | |
CN107341541A (zh) | 一种用于执行全连接层神经网络训练的装置和方法 | |
CN110705703B (zh) | 基于脉动阵列的稀疏神经网络处理器 | |
CN102707931A (zh) | 一种基于并行数据通道的数字信号处理器 | |
CN102053817A (zh) | 用于执行乘法累加运算的设备和方法 | |
CN110851779B (zh) | 用于稀疏矩阵运算的脉动阵列架构 | |
JP7495194B2 (ja) | 積和演算用のプロセッサ・ユニット | |
CN107667345A (zh) | 打包数据对齐加计算指令、处理器、方法和系统 | |
CN110007896A (zh) | 具有时钟选通的用于执行矩阵乘法的硬件单元 | |
CN113052304A (zh) | 用于具有部分读取/写入的脉动阵列的系统和方法 | |
Prasad et al. | Design of low power and high speed modified carry select adder for 16 bit Vedic Multiplier | |
CN106951394A (zh) | 一种可重构定浮点通用fft处理器 | |
EP3841461B1 (en) | Digital circuit with compressed carry | |
CN109558170A (zh) | 一种支持数据级并行和多指令融合的二维数据通路架构 | |
EP3480710A1 (en) | Computer architectures and instructions for multiplication | |
Patle et al. | Implementation of Baugh-Wooley Multiplier Based on Soft-Core Processor | |
CN103677735B (zh) | 一种数据处理装置及数字信号处理器 | |
Jana et al. | OTIS-MOT: an efficient interconnection network for parallel processing | |
Samanth et al. | A novel approach to develop low power MACs for 2D image filtering | |
WO2001088691A2 (en) | Method and apparatus of dsp resource allocation and use | |
Pechanek et al. | An introduction to an array memory processor for application specific acceleration |
Legal Events
Date | Code | Title | Description |
---|---|---|---|
PB01 | Publication | ||
PB01 | Publication | ||
SE01 | Entry into force of request for substantive examination | ||
SE01 | Entry into force of request for substantive examination | ||
TA01 | Transfer of patent application right | ||
TA01 | Transfer of patent application right |
Effective date of registration: 20210126 Address after: Room 908, block C, Kechuang headquarters building, No. 320, pubin Road, Jiangpu street, Nanjing area, Jiangsu Free Trade Zone, Nanjing City, Jiangsu Province, 211800 Applicant after: Jixin communication technology (Nanjing) Co.,Ltd. Address before: 570228 Hainan University, 58 Renmin Avenue, Meilan District, Haikou City, Hainan Province Applicant before: HAINAN University |
|
GR01 | Patent grant | ||
GR01 | Patent grant | ||
TR01 | Transfer of patent right | ||
TR01 | Transfer of patent right |
Effective date of registration: 20230703 Address after: Room 908, block C, Kechuang headquarters building, No. 320, pubin Road, Jiangpu street, Nanjing area, Nanjing Free Trade Zone, 211800 Jiangsu Province Patentee after: Jixin communication technology (Nanjing) Co.,Ltd. Patentee after: Polar core communication technology (Xi'an) Co.,Ltd. Address before: Room 908, block C, Kechuang headquarters building, No. 320, pubin Road, Jiangpu street, Nanjing area, Jiangsu Free Trade Zone, Nanjing City, Jiangsu Province, 211800 Patentee before: Jixin communication technology (Nanjing) Co.,Ltd. |