CN105677595A - FPGA method achieving computation speedup and PCIESSD storage simultaneously - Google Patents
FPGA method achieving computation speedup and PCIESSD storage simultaneously Download PDFInfo
- Publication number
- CN105677595A CN105677595A CN201610039292.9A CN201610039292A CN105677595A CN 105677595 A CN105677595 A CN 105677595A CN 201610039292 A CN201610039292 A CN 201610039292A CN 105677595 A CN105677595 A CN 105677595A
- Authority
- CN
- China
- Prior art keywords
- fpga
- pciessd
- controller
- ssd
- ssd controller
- Prior art date
- Legal status (The legal status is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the status listed.)
- Pending
Links
Classifications
-
- G—PHYSICS
- G06—COMPUTING; CALCULATING OR COUNTING
- G06F—ELECTRIC DIGITAL DATA PROCESSING
- G06F13/00—Interconnection of, or transfer of information or other signals between, memories, input/output devices or central processing units
- G06F13/14—Handling requests for interconnection or transfer
- G06F13/16—Handling requests for interconnection or transfer for access to memory bus
- G06F13/1668—Details of memory controller
-
- G—PHYSICS
- G06—COMPUTING; CALCULATING OR COUNTING
- G06F—ELECTRIC DIGITAL DATA PROCESSING
- G06F13/00—Interconnection of, or transfer of information or other signals between, memories, input/output devices or central processing units
- G06F13/14—Handling requests for interconnection or transfer
- G06F13/20—Handling requests for interconnection or transfer for access to input/output bus
- G06F13/28—Handling requests for interconnection or transfer for access to input/output bus using burst mode transfer, e.g. direct memory access DMA, cycle steal
- G06F13/282—Cycle stealing DMA
-
- G—PHYSICS
- G06—COMPUTING; CALCULATING OR COUNTING
- G06F—ELECTRIC DIGITAL DATA PROCESSING
- G06F15/00—Digital computers in general; Data processing equipment in general
- G06F15/76—Architectures of general purpose stored program computers
- G06F15/78—Architectures of general purpose stored program computers comprising a single central processing unit
-
- Y—GENERAL TAGGING OF NEW TECHNOLOGICAL DEVELOPMENTS; GENERAL TAGGING OF CROSS-SECTIONAL TECHNOLOGIES SPANNING OVER SEVERAL SECTIONS OF THE IPC; TECHNICAL SUBJECTS COVERED BY FORMER USPC CROSS-REFERENCE ART COLLECTIONS [XRACs] AND DIGESTS
- Y02—TECHNOLOGIES OR APPLICATIONS FOR MITIGATION OR ADAPTATION AGAINST CLIMATE CHANGE
- Y02D—CLIMATE CHANGE MITIGATION TECHNOLOGIES IN INFORMATION AND COMMUNICATION TECHNOLOGIES [ICT], I.E. INFORMATION AND COMMUNICATION TECHNOLOGIES AIMING AT THE REDUCTION OF THEIR OWN ENERGY USE
- Y02D10/00—Energy efficient computing, e.g. low power processors, power management or thermal management
Landscapes
- Engineering & Computer Science (AREA)
- Theoretical Computer Science (AREA)
- Physics & Mathematics (AREA)
- General Engineering & Computer Science (AREA)
- General Physics & Mathematics (AREA)
- Computer Hardware Design (AREA)
- Advance Control (AREA)
Abstract
The invention discloses an FPGA method achieving computation speedup and PCIE SSD storage simultaneously. An FPGA is used, an SSD controller and an algorithm accelerator are integrated in the FPGA, the FPGA is further internally provided with a DDR controller and a direct memory read module DMA, and the direct memory read module DMA is connected with the SSD controller, the DDR controller and the algorithm accelerator respectively. According to the FPGA method achieving computation speedup and PCIE SSD storage simultaneously, the two functions of computation speedup and SSD storage are achieved on PCIE equipment, the layout difficult is reduced, overall power consumption of server nodes is reduced, and the cost of an enterprise is reduced.
Description
Technical field
The present invention relates to a kind of FPGA method, specifically a kind of simultaneously realization calculates the FPGA method accelerated with PCIESSD stores.
Background technology
Along with the fast development of informationization, high-density computation requirement gets more and more, and computing power and storage IO (I/O) Capability Requirement to Single-Server node are more and more higher. At present, enterprise market extensively adopts PCIESSD to improve the IO ability stored; And for computing power, generally adopt GPU(GraphicsProcessingUnit) accelerate card or FPGA acceleration card. Although it is strong that enterprise-level GPU accelerates card computing power, but price is expensive, power consumption height, adds enterprise cost. And FPGA acceleration card relative GPU acceleration card price is lower, low in energy consumption, but use PCIESSD and FPGA to accelerate to block simultaneously, not only to be taken multiple server slot, and increase more power consumption, add the cost of enterprise.
Summary of the invention
It is an object of the invention to provide a kind of simultaneously realization and calculate the FPGA method accelerated with PCIESSD stores, to solve in above-mentioned background technology the problem proposed.
For achieving the above object, the present invention provides following technical scheme:
A kind of simultaneously realization calculates the FPGA method accelerated with PCIESSD stores, employ a slice FPGA, SSD controller and algorithm accelerator it is integrated with in FPGA, SSD controller is further comprises in FPGA, described SSD controller is by logic realization in FPGA, the flash array of management and control SSD, SSD controller is by PCIE interface and server node communication.
As the scheme that the present invention is further: described algorithm accelerator is realized by fpga logic, by PCIE interface and server node communication.
As the scheme that the present invention is further: further comprises DDR controller in described FPGA.
As the present invention's further scheme: further comprises direct internal memory read module DMA in described FPGA, direct internal memory read module DMA connects SSD controller, DDR controller and algorithm accelerator respectively.
Compared with prior art, the invention has the beneficial effects as follows: the present invention achieves algorithm in a PCIE device and accelerates to store two functions with SSD, decreases layout difficulty, reduces the overall power of server node, reduces the cost of enterprise.
Accompanying drawing explanation
Fig. 1 realizes calculating the structural representation accelerating the FPGA method with PCIESSD storage simultaneously; Fig. 2 realizes calculating the schema accelerating the FPGA method with PCIESSD storage simultaneously.
Embodiment
Below in conjunction with the accompanying drawing in the embodiment of the present invention, the technical scheme in the embodiment of the present invention is clearly and completely described, it is clear that described embodiment is only the present invention's part embodiment, instead of whole embodiments.Based on the embodiment in the present invention, those of ordinary skill in the art, not making other embodiments all obtained under creative work prerequisite, belong to the scope of protection of the invention.
Refer to Fig. 1, in the embodiment of the present invention, a kind of simultaneously realization calculates the FPGA method accelerated with PCIESSD stores, employ a slice FPGA, being integrated with SSD controller and algorithm accelerator in FPGA, further comprises SSD controller in FPGA, described SSD controller is by logic realization in FPGA, the flash array of management and control SSD, SSD controller is by PCIE interface and server node communication; Described algorithm accelerator is realized by fpga logic, by PCIE interface and server node communication; DDR controller is further comprises in described FPGA; Further comprises direct internal memory read module DMA in described FPGA, direct internal memory read module DMA connects SSD controller, DDR controller and algorithm accelerator respectively.
The principle of work of the present invention is: refer to Fig. 1, and server node is communicated with FPGA by PCIE interface; The inner directly internal memory read module DMA of FPGA realizes PCIETLP(TransactionLayerPackage) agreement layer encapsulation and resolving, obtain order, distribution and converge data, reporting interruption; SSD controller receives order and the data that direct internal memory read module DMA distributes, according to order control NANDFLASH array; Algorithm accelerator receives order and the data that direct internal memory read module DMA distributes, and does the calculating of respective algorithms according to order; DDR controller is that direct internal memory read module DMA and algorithm accelerator are data cached.
The performing step of the present invention is as follows: 1, according to FPGA, NANDFLASH, DDRSDRAM of actual computation density demand and SSD Capacity Selection appropriate resources, according to, shown in Fig. 1, designing hardware.
2, FPGA workflow diagram according to Fig. 2 writes the RTL(RegisterTransferLevel of the direct internal memory read module DMA of key modules) code.
3, according to Fig. 1 definition FPGA internal structure in FPGA top layer exampleization, connect each function module, write the RTL top code of FPGA.
4, FPGA workflow diagram according to Fig. 2 writes emulation platform, completes system emulation.
5, adding temporal constraint in FPGA compiling instrument, compiling RTL code, obtains recordable paper.
6, schema according to Fig. 2 writes corresponding software-driven.
To those skilled in the art, it is clear that the invention is not restricted to the details of above-mentioned one exemplary embodiment, and when not deviating from spirit or the essential characteristic of the present invention, it is possible to realize the present invention in other specific forms. Therefore, no matter from which point, embodiment all should be regarded as exemplary, and right and wrong are restrictive, the scope of the present invention is limited by claims instead of above-mentioned explanation, it is intended that all changes in the implication of the equivalent important document dropping on claim and scope included in the present invention. Any Reference numeral in claim should be considered as the claim involved by limiting.
In addition, it is to be understood that, although this specification sheets is described according to enforcement mode, but not each enforcement mode only comprises an independent technical scheme, this kind of narrating mode of specification sheets is only for clarity sake, those skilled in the art should by specification sheets integrally, and the technical scheme in each embodiment through appropriately combined, can also form other enforcement modes that it will be appreciated by those skilled in the art that.
Claims (4)
1. one kind realizes calculating the FPGA method accelerated with PCIESSD stores simultaneously, it is characterized in that, employ a slice FPGA, SSD controller and algorithm accelerator it is integrated with in FPGA, SSD controller is further comprises in FPGA, described SSD controller is by logic realization in FPGA, and the flash array of management and control SSD, SSD controller is by PCIE interface and server node communication.
2. according to claim 1 simultaneously realization calculates the FPGA method accelerated with PCIESSD stores, it is characterised in that, described algorithm accelerator is realized by fpga logic, by PCIE interface and server node communication.
3. according to claim 1 simultaneously realization calculates the FPGA method accelerated with PCIESSD stores, it is characterised in that, further comprises DDR controller in described FPGA.
4. according to claim 1 simultaneously realization calculates the FPGA method accelerated with PCIESSD stores, it is characterized in that, further comprises direct internal memory read module DMA in described FPGA, direct internal memory read module DMA connects SSD controller, DDR controller and algorithm accelerator respectively.
Priority Applications (1)
Application Number | Priority Date | Filing Date | Title |
---|---|---|---|
CN201610039292.9A CN105677595A (en) | 2016-01-21 | 2016-01-21 | FPGA method achieving computation speedup and PCIESSD storage simultaneously |
Applications Claiming Priority (1)
Application Number | Priority Date | Filing Date | Title |
---|---|---|---|
CN201610039292.9A CN105677595A (en) | 2016-01-21 | 2016-01-21 | FPGA method achieving computation speedup and PCIESSD storage simultaneously |
Publications (1)
Publication Number | Publication Date |
---|---|
CN105677595A true CN105677595A (en) | 2016-06-15 |
Family
ID=56301794
Family Applications (1)
Application Number | Title | Priority Date | Filing Date |
---|---|---|---|
CN201610039292.9A Pending CN105677595A (en) | 2016-01-21 | 2016-01-21 | FPGA method achieving computation speedup and PCIESSD storage simultaneously |
Country Status (1)
Country | Link |
---|---|
CN (1) | CN105677595A (en) |
Cited By (12)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
CN106528492A (en) * | 2016-10-27 | 2017-03-22 | 济南浪潮高新科技投资发展有限公司 | High-speed large-capacity recording board card realized based on FPGA |
CN107193764A (en) * | 2017-05-23 | 2017-09-22 | 济南浪潮高新科技投资发展有限公司 | A kind of SRIO interface solid hard disk design methods based on PowerPC |
CN108897706A (en) * | 2018-05-10 | 2018-11-27 | 北京微密科技发展有限公司 | Accelerator interface |
CN109491934A (en) * | 2018-09-28 | 2019-03-19 | 方信息科技(上海)有限公司 | A kind of storage management system control method of integrated computing function |
CN109902043A (en) * | 2019-01-30 | 2019-06-18 | 中国科学院声学研究所 | A kind of national secret algorithm acceleration processing system based on FPGA |
CN110188066A (en) * | 2019-05-07 | 2019-08-30 | 方一信息科技(上海)有限公司 | A kind of FPGA for Large Volume Data and the FPGA algorithm based on opencl |
CN110622145A (en) * | 2017-05-15 | 2019-12-27 | 莫列斯有限公司 | Reconfigurable server and server rack with reconfigurable server |
CN110618963A (en) * | 2018-06-19 | 2019-12-27 | 上海威固信息技术股份有限公司 | Heterogeneous computing hardware acceleration system and method integrating computing and storage |
CN110968537A (en) * | 2018-09-28 | 2020-04-07 | 方一信息科技(上海)有限公司 | PCIE SSD-based FPGA searching and matching method |
US10915469B2 (en) | 2018-03-09 | 2021-02-09 | Samsung Electronics Co., Ltd. | Method and apparatus for supporting a field programmable gate array (FPGA) based add-in-card (AIC) solid state drive (SSD) |
TWI772611B (en) * | 2018-03-05 | 2022-08-01 | 南韓商三星電子股份有限公司 | Host system and method thereof and acceleration module |
CN115857805A (en) * | 2022-11-30 | 2023-03-28 | 合肥腾芯微电子有限公司 | Artificial intelligence computable storage system |
Citations (4)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
US20120236656A1 (en) * | 2010-07-09 | 2012-09-20 | Stec, Inc. | Apparatus and method for determining a read level of a memory cell based on cycle information |
CN103038751A (en) * | 2010-05-28 | 2013-04-10 | 爱德万测试公司 | Flexible storage interface tester with variable parallelism and firmware upgradeability |
CN103999060A (en) * | 2011-12-23 | 2014-08-20 | 国际商业机器公司 | Solid-state storage management |
CN104952037A (en) * | 2014-03-27 | 2015-09-30 | 联科集团(中国)有限公司 | Method and system for rescaling image files |
-
2016
- 2016-01-21 CN CN201610039292.9A patent/CN105677595A/en active Pending
Patent Citations (4)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
CN103038751A (en) * | 2010-05-28 | 2013-04-10 | 爱德万测试公司 | Flexible storage interface tester with variable parallelism and firmware upgradeability |
US20120236656A1 (en) * | 2010-07-09 | 2012-09-20 | Stec, Inc. | Apparatus and method for determining a read level of a memory cell based on cycle information |
CN103999060A (en) * | 2011-12-23 | 2014-08-20 | 国际商业机器公司 | Solid-state storage management |
CN104952037A (en) * | 2014-03-27 | 2015-09-30 | 联科集团(中国)有限公司 | Method and system for rescaling image files |
Cited By (19)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
CN106528492A (en) * | 2016-10-27 | 2017-03-22 | 济南浪潮高新科技投资发展有限公司 | High-speed large-capacity recording board card realized based on FPGA |
US11907152B2 (en) | 2017-05-15 | 2024-02-20 | Molex, Llc | Reconfigurable server and server rack with same |
CN110622145A (en) * | 2017-05-15 | 2019-12-27 | 莫列斯有限公司 | Reconfigurable server and server rack with reconfigurable server |
CN107193764A (en) * | 2017-05-23 | 2017-09-22 | 济南浪潮高新科技投资发展有限公司 | A kind of SRIO interface solid hard disk design methods based on PowerPC |
CN107193764B (en) * | 2017-05-23 | 2019-12-13 | 浪潮集团有限公司 | SRIO interface solid state disk design method based on PowerPC |
US11892957B2 (en) | 2018-03-05 | 2024-02-06 | Samsung Electronics Co., Ltd. | SSD architecture for FPGA based acceleration |
TWI772611B (en) * | 2018-03-05 | 2022-08-01 | 南韓商三星電子股份有限公司 | Host system and method thereof and acceleration module |
US11520714B2 (en) | 2018-03-09 | 2022-12-06 | Samsung Electronics Co., Ltd. | Method and apparatus for supporting a field programmable gate array (FPGA) based add-in-card (AIC) solid state drive (SSD) |
US10915469B2 (en) | 2018-03-09 | 2021-02-09 | Samsung Electronics Co., Ltd. | Method and apparatus for supporting a field programmable gate array (FPGA) based add-in-card (AIC) solid state drive (SSD) |
CN108897706A (en) * | 2018-05-10 | 2018-11-27 | 北京微密科技发展有限公司 | Accelerator interface |
CN108897706B (en) * | 2018-05-10 | 2021-07-23 | 北京融芯微科技有限公司 | Accelerator interface |
CN110618963A (en) * | 2018-06-19 | 2019-12-27 | 上海威固信息技术股份有限公司 | Heterogeneous computing hardware acceleration system and method integrating computing and storage |
CN109491934B (en) * | 2018-09-28 | 2021-03-02 | 方一信息科技(上海)有限公司 | Storage management system control method integrating computing function |
CN110968537B (en) * | 2018-09-28 | 2021-02-02 | 方一信息科技(上海)有限公司 | PCIE SSD-based FPGA searching and matching method |
CN110968537A (en) * | 2018-09-28 | 2020-04-07 | 方一信息科技(上海)有限公司 | PCIE SSD-based FPGA searching and matching method |
CN109491934A (en) * | 2018-09-28 | 2019-03-19 | 方信息科技(上海)有限公司 | A kind of storage management system control method of integrated computing function |
CN109902043A (en) * | 2019-01-30 | 2019-06-18 | 中国科学院声学研究所 | A kind of national secret algorithm acceleration processing system based on FPGA |
CN110188066A (en) * | 2019-05-07 | 2019-08-30 | 方一信息科技(上海)有限公司 | A kind of FPGA for Large Volume Data and the FPGA algorithm based on opencl |
CN115857805A (en) * | 2022-11-30 | 2023-03-28 | 合肥腾芯微电子有限公司 | Artificial intelligence computable storage system |
Similar Documents
Publication | Publication Date | Title |
---|---|---|
CN105677595A (en) | FPGA method achieving computation speedup and PCIESSD storage simultaneously | |
KR102284972B1 (en) | Data processing method and device | |
CN102629258B (en) | Repeating data deleting method and device | |
CN103631538A (en) | Cold and hot data identification threshold value calculation method, device and system | |
WO2017156968A1 (en) | Neural network computing method, system and device therefor | |
CN108563808A (en) | The design method of heterogeneous reconfigurable figure computation accelerator system based on FPGA | |
US10700714B2 (en) | Method of operating decoder for reducing computational complexity and method of operating data storage device including the decoder | |
CN111831254A (en) | Image processing acceleration method, image processing model storage method and corresponding device | |
CN103593477A (en) | Collocation method and device of Hash database | |
US10725494B2 (en) | Optimizing neurosynaptic networks | |
CN103631894A (en) | Dynamic copy management method based on HDFS | |
CN105808169A (en) | Data deduplication method, apparatus and system | |
CN109542713A (en) | A kind of verification method and verifying device | |
CN103793483B (en) | Clock tree generating method and system based on clock node clusters | |
CN104699946A (en) | Game scene management method and device | |
CN109460406A (en) | A kind of data processing method and device | |
CN105022968A (en) | Integrity checking method of memory data | |
US20180239791A1 (en) | Bitmap-based storage space management system and methods thereof | |
CN101800768A (en) | Gridding data transcription generation method based on storage alliance subset partition | |
CN105095515A (en) | Bucket dividing method, device and equipment supporting fast query of Map-Reduce output result | |
CN103150435B (en) | Clock node knot group's method and clock network structure | |
CN106940712A (en) | Sequence generating method and equipment | |
CN103294407B (en) | Storage device and data read-write method | |
CN107563080B (en) | GPU-based two-phase medium random model parallel generation method and electronic equipment | |
CN102799750B (en) | Method for quickly generating common side and non-common sides of geometry surface triangle |
Legal Events
Date | Code | Title | Description |
---|---|---|---|
C06 | Publication | ||
PB01 | Publication | ||
C10 | Entry into substantive examination | ||
SE01 | Entry into force of request for substantive examination | ||
WD01 | Invention patent application deemed withdrawn after publication |
Application publication date: 20160615 |
|
WD01 | Invention patent application deemed withdrawn after publication |