CN105630735A - Coprocessor based on reconfigurable computational array - Google Patents
Coprocessor based on reconfigurable computational array Download PDFInfo
- Publication number
- CN105630735A CN105630735A CN201510998467.4A CN201510998467A CN105630735A CN 105630735 A CN105630735 A CN 105630735A CN 201510998467 A CN201510998467 A CN 201510998467A CN 105630735 A CN105630735 A CN 105630735A
- Authority
- CN
- China
- Prior art keywords
- controller
- reconfigurable
- computing
- coprocessor
- algorithm
- Prior art date
- Legal status (The legal status is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the status listed.)
- Pending
Links
- 230000005540 biological transmission Effects 0.000 claims abstract description 12
- 238000003860 storage Methods 0.000 claims abstract description 6
- 238000000034 method Methods 0.000 claims description 21
- 238000012545 processing Methods 0.000 claims description 15
- 238000012546 transfer Methods 0.000 claims description 9
- 239000011159 matrix material Substances 0.000 claims description 6
- 230000017105 transposition Effects 0.000 claims description 4
- 230000021615 conjugation Effects 0.000 claims description 3
- 238000013461 design Methods 0.000 description 13
- 230000006870 function Effects 0.000 description 5
- 238000010586 diagram Methods 0.000 description 4
- 238000005516 engineering process Methods 0.000 description 3
- 101100517196 Arabidopsis thaliana NRPE1 gene Proteins 0.000 description 2
- 101100190825 Bos taurus PMEL gene Proteins 0.000 description 2
- 101100073341 Oryza sativa subsp. japonica KAO gene Proteins 0.000 description 2
- 238000004364 calculation method Methods 0.000 description 2
- 238000005457 optimization Methods 0.000 description 2
- 101150005492 rpe1 gene Proteins 0.000 description 2
- 241000024287 Areas Species 0.000 description 1
- 230000002159 abnormal effect Effects 0.000 description 1
- 230000001133 acceleration Effects 0.000 description 1
- 238000003491 array Methods 0.000 description 1
- 238000012512 characterization method Methods 0.000 description 1
- 238000006243 chemical reaction Methods 0.000 description 1
- 238000011161 development Methods 0.000 description 1
- 230000018109 developmental process Effects 0.000 description 1
- 238000001914 filtration Methods 0.000 description 1
- 238000004519 manufacturing process Methods 0.000 description 1
- 244000045947 parasite Species 0.000 description 1
- 230000011218 segmentation Effects 0.000 description 1
- 239000004065 semiconductor Substances 0.000 description 1
Classifications
-
- G—PHYSICS
- G06—COMPUTING; CALCULATING OR COUNTING
- G06F—ELECTRIC DIGITAL DATA PROCESSING
- G06F15/00—Digital computers in general; Data processing equipment in general
- G06F15/76—Architectures of general purpose stored program computers
- G06F15/78—Architectures of general purpose stored program computers comprising a single central processing unit
- G06F15/7867—Architectures of general purpose stored program computers comprising a single central processing unit with reconfigurable architecture
- G06F15/7871—Reconfiguration support, e.g. configuration loading, configuration switching, or hardware OS
Landscapes
- Engineering & Computer Science (AREA)
- Computer Hardware Design (AREA)
- Theoretical Computer Science (AREA)
- Physics & Mathematics (AREA)
- General Engineering & Computer Science (AREA)
- General Physics & Mathematics (AREA)
- Advance Control (AREA)
Abstract
The invention relates to a coprocessor based on a reconfigurable computational array. The coprocessor comprises a main controller which receives control information sent by an external universal processor, then analyzes the control information and sends out a corresponding configuration instruction, a reconfiguration controller which sends out the configuration information according to an algorithm parameter in the configuration instruction, an operation core unit which receives the configuration information and completes basic computation such as complex multiplication, complex addition and real multiplication, and a DMA unit which receives a transmission parameter of the configuration instruction and carries out data among an external DDR, an internal storage module and the main controller, wherein the configuration instruction comprises the transmission parameter and the algorithm parameter, and the configuration information comprises an execution signal used for selecting and organizing a logic algorithm in the operation core unit and an internal network gating signal. The coprocessor has the advantages of being good in computing performance and small in area consumption.
Description
Technical field
The present invention relates to the coprocessor of reconfigureable computing array.
Background technology
Along with the progress of science and technology, people are more and more higher to the requirement of calculated performance, and high performance signal process is widely used in the fields such as image procossing, scientific algorithm and Industry Control. In addition, these high performance signal process Application Areass comprising advanced signal process technology and intensive calculations work are also improving year by year for the real-time of system and the requirement of versatility, and the calculation requirement of computing system is also more and more higher.
Existing general procedure device, comprise CPU(CentralProcessingUnit, central processing unit) and DSP (DigitalSignalProcessing, digital signal processing), the realization of high performance signal Processing Algorithm can also be completed, but still there is following problem: general procedure device is in order to realize versatility, structure is comparatively complicated, needing to pay bigger power consumption and area cost for floating-point matrix operation, general procedure device makes it realize consuming the long time at intensive algorithm based on the feature that instruction stream is executed the task in addition.
Along with the development of semiconductor fabrication, fpga logic capacity greatly promotes and also makes large-scale digital circuitry can be mapped on the FPGA chip of monolithic, but FPGA is based on searching table (LookUpTable, LUT) chip area that implementation determines it is much bigger compared to application specific integrated circuit (ApplicationSpecificIntegratedCircuit, ASIC).
It is for the special accelerating hardware module of specific algorithm design that another kind improves the mentality of designing of calculated performance, such as special FFT module, matrix inversion module and filtering operation module etc., but integrated too much accelerating module can strengthen hardware resource and area overhead undoubtedly in a system.
Summary of the invention
It is an object of the invention to provide a kind of coprocessor based on reconfigurable hardware, it may also be useful to fixing hardware computation resource, the acceleration of algorithms of different can be realized by configuring the change of information. For achieving the above object, the coprocessor of the present invention comprises
Principal controller, receives the control information that outside general procedure device sends, then resolves described control information, and send corresponding configuration-direct, and described configuration-direct comprises transmission parameter and algorithm parameter;
Reconfigurable controller, according to the algorithm parameter in described configuration-direct, sends configuration information, and described configuration information is for selecting and organizes the executive signal of the logic algorithm in arithmetic core unit and inner network gating signal;
Arithmetic core unit, receives described configuration information, completes the fundamental operation taken advantage of again, be added with, take advantage of in fact according to configuration information;
DMA unit, receives the transmission parameter of described configuration-direct, carries out outside DDR and the carrying of the data between storage inside module, principal controller.
The further design of described reconfigurable coprocessor is, described transmission parameter comprises transfer and counts, transfer start address, data transfer direction and data mode of transport; Described algorithm parameter comprises computing type and computing is counted.
The further design of described reconfigurable coprocessor is, described arithmetic core unit comprises reconfigureable computing array.
The further design of described reconfigurable coprocessor is, described reconfigureable computing array is made up of the computing array of coarsness, being integrated with six reconfigurable processing units, reconfigureable computing array can realize reconstruct between the reconstruct of reconfigurable processing unit inside and reconfigurable processing unit.
The further design of described reconfigurable coprocessor is, described arithmetic core unit also comprises some multi-path choice devices and input-output register, and described multi-path choice device, input-output register communicate to connect with reconfigureable computing array respectively.
The further design of described reconfigurable coprocessor is, configuration information comprises computing type and computing data.
The further design of described reconfigurable coprocessor is, also comprises AXI interface, and described AXI interface is used for being connected principal controller with outside general procedure device;
The further design of described reconfigurable coprocessor is, also comprises memory module, for receive arithmetic core unit, the data of principal controller and store.
The further design of described reconfigurable coprocessor is, principal controller comprises
Device configuration register, for storing the mode of operation of reconfigurable processing core, has holotype, from pattern, debugging mode, interrupts pattern and query pattern;
Computing configuration register, for storing described algorithm parameter, comprises computing type, and computing is counted, digital data transmission start address;
Status register, for storing the state of reconfigurable coprocessor.
The further design of described reconfigurable coprocessor is, reconfigurable controller comprises reconstituted state machine unit, heavy register unit and algorithm sub-controller, and algorithm sub-controller comprises FFT controller, FIR controller, relevant control device, addition controller, multiplication controller, dot product controller, conjugation controller, transposition controller, Covariance Controller, division controller, plural number ask mould controller, surely floating change over controller and matrix inversion controller.
The advantage of the present invention is as follows:
1) calculated performance is good; Compared to the system structure of DSP based on instruction stream, this coprocessor can reach higher efficiency based on the structure of configuration flow, after principal controller receives exterior arrangement, can again organize the interconnection mode between reconfigureable computing array according to configuration information. The scale being directed to computing array, have also been made optimization process to each subalgorithm supported in this system, reaches optimum performance.
2) area consumption is little; Completing same function, realize area based on the FPGA searching table and realize much bigger than what application specific integrated circuit adopted hardwire, in the chip design field that area requirements is very harsh, the existing way of coprocessor has very big area advantage.
Accompanying drawing explanation
Fig. 1 is the coprocessor overall architecture schematic diagram of the reconfigureable computing array of the present invention.
Fig. 2 is the workflow schematic diagram of the coprocessor of reconfigureable computing array.
Fig. 3 is the module diagram of the coprocessor of reconfigureable computing array.
Fig. 4 is the schematic diagram of reconfigureable computing array.
Embodiment
Below in conjunction with the accompanying drawing in the embodiment of the present invention, the coprocessor based on reconfigureable computing array of the present invention is described in detail.
Such as Fig. 1, coprocessor is primarily of principal controller, reconfigurable controller, arithmetic core unit and DMA unit composition. Principal controller, receives the control information that outside general procedure device sends, then resolves control information, and send corresponding configuration-direct. Reconfigurable controller, according to the described configuration-direct that principal controller sends, sends configuration information, and described configuration information is for selecting and organize the logic algorithm in arithmetic core unit and changes the internet gated fashion of arithmetic core unit. Arithmetic core unit, receives described configuration information, completes according to configuration information to take advantage of again, is added with, real the fundamental operation such as takes advantage of. DMA unit, receives described configuration information, carries out outside DDR and the carrying of the data between storage inside module, principal controller.
Should start based on the coprocessor of reconfigureable computing array and need outside general procedure device configuration. Control information is imported into by AXI interface, is stored in the register of principal controller. Principal controller comprises device configuration register, computing configuration register, status register and abnormal interruption register, and above-mentioned Parasites Fauna is for receiving configuration information and the information of characterization system state.
Configuration information is resolved by principal controller, contains the mode of operation of reconfigurable processing core, have holotype in device configuration register, from pattern, debugging mode, interrupts pattern and query pattern.
Further, completing each parameter of specific algorithm needed for computing configuration register contains, this parameter comprises computing type, and computing is counted, digital data transmission start address etc. Whether whether status register opening can be read, and indicates which kind of pattern this processor core is operated in, accept effectively to configure information, busy, whether the states such as interruption occurs. Outside general procedure device can read the parameter of coprocessor state register, to determine next step control information exporting to principal controller.
As shown in Figure 2, after coprocessor initialize, general procedure device is sent control information to coprocessor by AXI interface, after principal controller receives and confirms control information effectively, control information is resolved and is passed to reconfigurable controller and DMA respectively.
Reconfigurable controller is the main modular of execution algorithm control and operation, and principal controller is sent to the algorithm parameter in the configuration-direct of reconfigurable controller, and algorithm parameter comprises computing type and computing is counted. Reconfigurable controller just can start corresponding algorithm sub-controller according to these information. Meanwhile, DMA also receives the transmission parameter in the configuration-direct of principal controller, comprises transfer and counts, transfer start address, data transfer direction and data mode of transport. Computing source data is moved into from DDR and the output of result data is all completed by DMA, and after write data complete, DMA can notify principal controller, and for distributing next step task, result data have also been carried end with DMA by the completing an of algorithm. After reconfigurable controller completes to complete with DMA carrying source data, reconfigurable controller just can start computing.
Reconfigurable controller comprises reconstituted state machine unit, heavy register unit and algorithm sub-controller. Algorithm sub-controller comprises FFT controller, FIR controller, relevant control device, addition controller, multiplication controller, dot product controller, conjugation controller, transposition controller, Covariance Controller, division controller, plural number ask mould controller, surely floating change over controller and matrix inversion controller, see Fig. 3.
Reconfigurable controller receives configuration-direct from principal controller and resolves, the algorithm executive signal sent. This executive signal also comprises the execution parameter of corresponding algorithm. Reconfigurable controller, according to the final election device of executive signal gating required memory, processing unit, starts the sub-controller of this algorithm after completing data stream gating. Computing array received is to the algorithm executive signal sent of reconfigurable controller, and starts according to execution parameter to perform computing operation. By operation result and complete signal and return to reconfigurable controller after computing completes.
The core that each algorithm sub-controller completes computing is reconfigureable computing array, and as shown in Figure 4, also namely all algorithm sub-controllers are all connected with the most basic computing array its structure according to respective algorithm requirements. The different computing arrays selected by algorithm sub-controller is also different, and such as FIR sub-controller needs to use the RPE1 ~ RPE4 comprising complex multiplication and complex addition; Fixed floating conversion sub-controller only needs to use RPE5; Comparatively complicated matrix inversion sub-controller then to be used RPE1 ~ RPE4 and comprise the RPE6 of floating-point division.
Algorithm sub-controller can feed back to principal controller by completing signal by reconfigurable controller after terminating computing, and result data also have corresponding feedback signal after having been carried by DMA. Principal controller receive these two complete feedback signal after namely current computing terminate, this restructural system can receive and configure next time afterwards, performs next task.
From describing above, the combination of independent subalgorithm can be got up by configuration information by the scheduling of this restructural system, completes combinatorial operation. Its process is as follows: be first the division of subtask, and the application that complete is divided into some the subtasks successively performed; Then generating corresponding configuration information according to each subtask leaves in instruction district, due to the source data of subtask (such as the N+1 time) rearward may come from before the result of task (the N time), intermediate result can be left in storage inside device by treating processes, just need to, from external input sources data, only need to the source address of the result storage address of N task and the N+1 time task be set to identical before the N+1 time computing like this; Finally, after all subtasks complete, result is exported in outside DDR by DMA.
Based on above-mentioned hardware implementing, the main function of this coprocessor is as follows:
1. computing function is configurable, and process counts/and parameter is configurable.
2. support master and slave mode operation regimes.
3. support the batch processing function of algorithm combination.
4. support table tennis stream treatment.
5. small point data are stored in on-chip SRAM and directly process; A little bigger logarithmic data is stored in DDR, supports segmentation reading process.
6. built-in DMA, number removed by support order, transposition, supports chain type and two dimension DMA.
Reconfigurable coprocessor of the present invention, in Radar Signal Processing, realizes there is following advantage relative to the DSP of current main flow realization and FPGA:
1) calculated performance is good; Compared to the system structure of DSP based on instruction stream, this design can reach higher efficiency based on the structure of configuration flow, after principal controller receives exterior arrangement, can again organize the interconnection mode between reconfigureable computing array according to configuration information. The scale being directed to computing array, have also been made optimization process to each subalgorithm supported in this system, reaches optimum performance.
2) area consumption is little; Completing same function, realize area based on the FPGA searching table and realize much bigger than what application specific integrated circuit adopted hardwire, in the chip design field that area requirements is very harsh, the existing way of coprocessor has very big area advantage.
Claims (10)
1. the coprocessor based on reconfigureable computing array, it is characterised in that comprising:
Principal controller, receives the control information that outside general procedure device sends, then resolves described control information, and send corresponding configuration-direct, and described configuration-direct comprises transmission parameter and algorithm parameter;
Reconfigurable controller, according to the algorithm parameter in described configuration-direct, sends configuration information, and described configuration information is for selecting and organizes the executive signal of the logic algorithm in arithmetic core unit and inner network gating signal;
Arithmetic core unit, receives described configuration information, completes according to configuration information to take advantage of again, is added with, real the fundamental operation such as takes advantage of;
DMA unit, receives the transmission parameter of described configuration-direct, carries out outside DDR and the carrying of the data between storage inside module, principal controller.
2. reconfigurable coprocessor according to claim 1, it is characterised in that described transmission parameter comprises transfer and counts, transfer start address, data transfer direction and data mode of transport; Described algorithm parameter comprises computing type and computing is counted.
3. reconfigurable coprocessor according to claim 1, it is characterised in that described arithmetic core unit comprises reconfigureable computing array.
4. reconfigurable coprocessor according to claim 3, it is characterized in that described reconfigureable computing array is made up of the computing array of coarsness, being integrated with six reconfigurable processing units, reconfigureable computing array can realize reconstruct between the reconstruct of reconfigurable processing unit inside and reconfigurable processing unit.
5. reconfigurable coprocessor according to claim 3, it is characterized in that described arithmetic core unit also comprises some multi-path choice devices and input-output register, described multi-path choice device, input-output register communicate to connect with reconfigureable computing array respectively.
6. reconfigurable coprocessor according to claim 1, it is characterised in that configuration information comprises computing type and computing data.
7. reconfigurable coprocessor according to claim 1, it is characterised in that also comprise AXI interface, described AXI interface is used for being connected principal controller with outside general procedure device.
8. reconfigurable coprocessor according to claim 1, it is characterised in that also comprise memory module, for receive arithmetic core unit, the data of principal controller and store.
9. reconfigurable coprocessor according to claim 1, it is characterised in that principal controller comprises
Device configuration register, for storing the mode of operation of reconfigurable processing core, has holotype, from pattern, debugging mode, interrupts pattern and query pattern;
Computing configuration register, for storing described algorithm parameter, comprises computing type, and computing is counted, digital data transmission start address;
Status register, for representing the state of reconfigurable coprocessor.
10. reconfigurable coprocessor according to claim 1, it is characterized in that reconfigurable controller comprises reconstituted state machine unit, heavy register unit and algorithm sub-controller, algorithm sub-controller comprises FFT controller, FIR controller, relevant control device, addition controller, multiplication controller, dot product controller, conjugation controller, transposition controller, Covariance Controller, division controller, plural number ask mould controller, surely floating change over controller and matrix inversion controller.
Priority Applications (1)
Application Number | Priority Date | Filing Date | Title |
---|---|---|---|
CN201510998467.4A CN105630735A (en) | 2015-12-25 | 2015-12-25 | Coprocessor based on reconfigurable computational array |
Applications Claiming Priority (1)
Application Number | Priority Date | Filing Date | Title |
---|---|---|---|
CN201510998467.4A CN105630735A (en) | 2015-12-25 | 2015-12-25 | Coprocessor based on reconfigurable computational array |
Publications (1)
Publication Number | Publication Date |
---|---|
CN105630735A true CN105630735A (en) | 2016-06-01 |
Family
ID=56045698
Family Applications (1)
Application Number | Title | Priority Date | Filing Date |
---|---|---|---|
CN201510998467.4A Pending CN105630735A (en) | 2015-12-25 | 2015-12-25 | Coprocessor based on reconfigurable computational array |
Country Status (1)
Country | Link |
---|---|
CN (1) | CN105630735A (en) |
Cited By (22)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
CN106484657A (en) * | 2016-11-18 | 2017-03-08 | 成都嘉纳海威科技有限责任公司 | A kind of reconfigurable signal processor ASIC framework and its reconstructing method |
CN107657316A (en) * | 2016-08-12 | 2018-02-02 | 北京深鉴科技有限公司 | The cooperative system of general processor and neural network processor designs |
CN107688469A (en) * | 2016-12-23 | 2018-02-13 | 北京国睿中数科技股份有限公司 | Take into account the Reconfigurable Computation device of universal command and special instruction |
CN108228529A (en) * | 2017-09-28 | 2018-06-29 | 清华大学无锡应用技术研究院 | For the method for reconfigureable computing array and its system to be configured |
CN108628693A (en) * | 2018-04-17 | 2018-10-09 | 清华大学无锡应用技术研究院 | Processor debugging method and system |
CN109144889A (en) * | 2018-06-25 | 2019-01-04 | 中国科学院声学研究所 | Ultrasound detection data memory module and FPGA circuitry based on FPGA |
CN109446478A (en) * | 2018-10-30 | 2019-03-08 | 南京大学 | A kind of complex covariance matrix computing system based on iteration and restructural mode |
CN110398921A (en) * | 2019-07-24 | 2019-11-01 | 理工雷科电子(西安)有限公司 | A kind of multitask dynamic restructuring processing system and method based on FPGA control |
CN110502278A (en) * | 2019-07-24 | 2019-11-26 | 福州瑞芯微电子股份有限公司 | Neural network coprocessor and its association's processing method based on RiscV extended instruction |
CN111045965A (en) * | 2019-10-25 | 2020-04-21 | 南京大学 | Hardware implementation method for multi-channel conflict-free splitting, computer equipment and readable storage medium for operating method |
CN111090248A (en) * | 2018-10-24 | 2020-05-01 | 意法半导体(格勒诺布尔2)公司 | Microcontroller for performing configurable processing operations |
CN112381220A (en) * | 2020-12-08 | 2021-02-19 | 厦门壹普智慧科技有限公司 | Neural network tensor processor |
CN112580792A (en) * | 2020-12-08 | 2021-03-30 | 厦门壹普智慧科技有限公司 | Neural network multi-core tensor processor |
WO2021073137A1 (en) * | 2019-10-17 | 2021-04-22 | 辰芯科技有限公司 | Reconfigurable processor and reconfigurable processor system |
CN112732639A (en) * | 2021-04-01 | 2021-04-30 | 南京大学 | Coarse-grained dynamic reconfigurable processor and data processing method thereof |
CN113064854A (en) * | 2021-04-15 | 2021-07-02 | 天津芯海创科技有限公司 | Hardware calculation reconstruction method and device, computer equipment and storage medium |
CN113253664A (en) * | 2021-07-02 | 2021-08-13 | 峰岹科技(深圳)股份有限公司 | Coprocessor, coprocessor control method, terminal and storage medium |
CN113743599A (en) * | 2021-08-08 | 2021-12-03 | 苏州浪潮智能科技有限公司 | Operation device and server of convolutional neural network |
CN114116598A (en) * | 2021-11-22 | 2022-03-01 | 珠海泰为电子有限公司 | Chip with built-in reconfigurable coprocessor and electronic equipment |
CN114328311A (en) * | 2021-12-15 | 2022-04-12 | 珠海一微半导体股份有限公司 | Storage controller architecture, data processing circuit and data processing method |
CN114548390A (en) * | 2022-02-25 | 2022-05-27 | 电子科技大学 | RISC-V and nerve morphology calculation-based heterogeneous architecture processing system |
CN117573607A (en) * | 2023-11-28 | 2024-02-20 | 北京智芯微电子科技有限公司 | Reconfigurable coprocessor, chip, multi-core signal processing system and computing method |
Citations (3)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
JP2006119777A (en) * | 2004-10-20 | 2006-05-11 | Renesas Technology Corp | Semiconductor device |
CN103927219A (en) * | 2014-05-04 | 2014-07-16 | 南京大学 | Accurate-period simulation model for reconfigurable special processor core and hardware architecture thereof |
CN103970720A (en) * | 2014-05-30 | 2014-08-06 | 东南大学 | Embedded reconfigurable system based on large-scale coarse granularity and processing method of system |
-
2015
- 2015-12-25 CN CN201510998467.4A patent/CN105630735A/en active Pending
Patent Citations (3)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
JP2006119777A (en) * | 2004-10-20 | 2006-05-11 | Renesas Technology Corp | Semiconductor device |
CN103927219A (en) * | 2014-05-04 | 2014-07-16 | 南京大学 | Accurate-period simulation model for reconfigurable special processor core and hardware architecture thereof |
CN103970720A (en) * | 2014-05-30 | 2014-08-06 | 东南大学 | Embedded reconfigurable system based on large-scale coarse granularity and processing method of system |
Cited By (32)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
CN107657316A (en) * | 2016-08-12 | 2018-02-02 | 北京深鉴科技有限公司 | The cooperative system of general processor and neural network processor designs |
CN107657316B (en) * | 2016-08-12 | 2020-04-07 | 北京深鉴智能科技有限公司 | Design of cooperative system of general processor and neural network processor |
CN106484657A (en) * | 2016-11-18 | 2017-03-08 | 成都嘉纳海威科技有限责任公司 | A kind of reconfigurable signal processor ASIC framework and its reconstructing method |
CN107688469A (en) * | 2016-12-23 | 2018-02-13 | 北京国睿中数科技股份有限公司 | Take into account the Reconfigurable Computation device of universal command and special instruction |
CN107688469B (en) * | 2016-12-23 | 2020-02-11 | 北京国睿中数科技股份有限公司 | Reconfigurable computing device with both general instructions and special instructions |
CN108228529B (en) * | 2017-09-28 | 2019-07-26 | 清华大学无锡应用技术研究院 | For configuring the method and its system of reconfigureable computing array |
CN108228529A (en) * | 2017-09-28 | 2018-06-29 | 清华大学无锡应用技术研究院 | For the method for reconfigureable computing array and its system to be configured |
CN108628693A (en) * | 2018-04-17 | 2018-10-09 | 清华大学无锡应用技术研究院 | Processor debugging method and system |
CN108628693B (en) * | 2018-04-17 | 2019-10-25 | 清华大学 | Processor debugging method and system |
CN109144889A (en) * | 2018-06-25 | 2019-01-04 | 中国科学院声学研究所 | Ultrasound detection data memory module and FPGA circuitry based on FPGA |
CN109144889B (en) * | 2018-06-25 | 2022-11-25 | 中国科学院声学研究所 | Ultrasonic testing data storage module based on FPGA and FPGA circuit |
CN111090248A (en) * | 2018-10-24 | 2020-05-01 | 意法半导体(格勒诺布尔2)公司 | Microcontroller for performing configurable processing operations |
CN109446478A (en) * | 2018-10-30 | 2019-03-08 | 南京大学 | A kind of complex covariance matrix computing system based on iteration and restructural mode |
CN109446478B (en) * | 2018-10-30 | 2021-09-28 | 南京大学 | Complex covariance matrix calculation system based on iteration and reconfigurable mode |
CN110398921A (en) * | 2019-07-24 | 2019-11-01 | 理工雷科电子(西安)有限公司 | A kind of multitask dynamic restructuring processing system and method based on FPGA control |
CN110502278A (en) * | 2019-07-24 | 2019-11-26 | 福州瑞芯微电子股份有限公司 | Neural network coprocessor and its association's processing method based on RiscV extended instruction |
US12001381B2 (en) | 2019-10-17 | 2024-06-04 | Morningcore Technology Co., China | Reconfigurable processor and reconfigurable processor system |
WO2021073137A1 (en) * | 2019-10-17 | 2021-04-22 | 辰芯科技有限公司 | Reconfigurable processor and reconfigurable processor system |
CN111045965A (en) * | 2019-10-25 | 2020-04-21 | 南京大学 | Hardware implementation method for multi-channel conflict-free splitting, computer equipment and readable storage medium for operating method |
CN112580792A (en) * | 2020-12-08 | 2021-03-30 | 厦门壹普智慧科技有限公司 | Neural network multi-core tensor processor |
CN112580792B (en) * | 2020-12-08 | 2023-07-25 | 厦门壹普智慧科技有限公司 | Neural network multi-core tensor processor |
CN112381220B (en) * | 2020-12-08 | 2024-05-24 | 厦门壹普智慧科技有限公司 | Neural network tensor processor |
CN112381220A (en) * | 2020-12-08 | 2021-02-19 | 厦门壹普智慧科技有限公司 | Neural network tensor processor |
CN112732639A (en) * | 2021-04-01 | 2021-04-30 | 南京大学 | Coarse-grained dynamic reconfigurable processor and data processing method thereof |
CN113064854A (en) * | 2021-04-15 | 2021-07-02 | 天津芯海创科技有限公司 | Hardware calculation reconstruction method and device, computer equipment and storage medium |
CN113253664A (en) * | 2021-07-02 | 2021-08-13 | 峰岹科技(深圳)股份有限公司 | Coprocessor, coprocessor control method, terminal and storage medium |
CN113743599A (en) * | 2021-08-08 | 2021-12-03 | 苏州浪潮智能科技有限公司 | Operation device and server of convolutional neural network |
CN113743599B (en) * | 2021-08-08 | 2023-08-25 | 苏州浪潮智能科技有限公司 | Computing device and server of convolutional neural network |
CN114116598A (en) * | 2021-11-22 | 2022-03-01 | 珠海泰为电子有限公司 | Chip with built-in reconfigurable coprocessor and electronic equipment |
CN114328311A (en) * | 2021-12-15 | 2022-04-12 | 珠海一微半导体股份有限公司 | Storage controller architecture, data processing circuit and data processing method |
CN114548390A (en) * | 2022-02-25 | 2022-05-27 | 电子科技大学 | RISC-V and nerve morphology calculation-based heterogeneous architecture processing system |
CN117573607A (en) * | 2023-11-28 | 2024-02-20 | 北京智芯微电子科技有限公司 | Reconfigurable coprocessor, chip, multi-core signal processing system and computing method |
Similar Documents
Publication | Publication Date | Title |
---|---|---|
CN105630735A (en) | Coprocessor based on reconfigurable computational array | |
CN108416436B (en) | Method and system for neural network partitioning using multi-core processing module | |
US7325123B2 (en) | Hierarchical interconnect for configuring separate interconnects for each group of fixed and diverse computational elements | |
CN107301456B (en) | Deep neural network multi-core acceleration implementation method based on vector processor | |
US11080593B2 (en) | Electronic circuit, in particular capable of implementing a neural network, and neural system | |
CN112463719A (en) | In-memory computing method realized based on coarse-grained reconfigurable array | |
Nurvitadhi et al. | In-package domain-specific asics for intel® stratix® 10 fpgas: A case study of accelerating deep learning using tensortile asic | |
Bu et al. | A design methodology for fixed-size systolic arrays | |
CN102073481A (en) | Multi-kernel DSP reconfigurable special integrated circuit system | |
CN101089840A (en) | Matrix multiplication parallel computing system based on multi-FPGA | |
US10693466B2 (en) | Self-adaptive chip and configuration method | |
TW201807622A (en) | Multi-layer artificial neural network | |
CN109753319B (en) | Device for releasing dynamic link library and related product | |
CN114356836A (en) | RISC-V based three-dimensional interconnected many-core processor architecture and working method thereof | |
Viet Huynh | FPGA-based acceleration for convolutional neural networks on PYNQ-Z2 | |
Delaye et al. | Deep learning challenges and solutions with xilinx fpgas | |
Chalamalasetti et al. | MORA-an architecture and programming model for a resource efficient coarse grained reconfigurable processor | |
Li et al. | Optimized data reuse via reordering for sparse matrix-vector multiplication on fpgas | |
Nair et al. | Fpga acceleration of gcn in light of the symmetry of graph adjacency matrix | |
Angizi et al. | Processing-in-memory acceleration of mac-based applications using residue number system: A comparative study | |
Morcel et al. | Fpga-based accelerator for deep convolutional neural networks for the spark environment | |
Liang et al. | Design of 16-bit fixed-point CNN coprocessor based on FPGA | |
Sriraman et al. | Customized FPGA Design and Analysis of Soft-Core Processor for DNN | |
Jayaraman et al. | Hypersort: High-performance Parallel Sorting on HBM-enabled FPGA | |
CN117222991A (en) | Network-on-chip processing system |
Legal Events
Date | Code | Title | Description |
---|---|---|---|
C06 | Publication | ||
PB01 | Publication | ||
C10 | Entry into substantive examination | ||
SE01 | Entry into force of request for substantive examination | ||
RJ01 | Rejection of invention patent application after publication | ||
RJ01 | Rejection of invention patent application after publication |
Application publication date: 20160601 |