CN105677595A - FPGA method achieving computation speedup and PCIESSD storage simultaneously - Google Patents

FPGA method achieving computation speedup and PCIESSD storage simultaneously Download PDF

Info

Publication number
CN105677595A
CN105677595A CN201610039292.9A CN201610039292A CN105677595A CN 105677595 A CN105677595 A CN 105677595A CN 201610039292 A CN201610039292 A CN 201610039292A CN 105677595 A CN105677595 A CN 105677595A
Authority
CN
China
Prior art keywords
fpga
pciessd
controller
ssd
ssd controller
Prior art date
Legal status (The legal status is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the status listed.)
Pending
Application number
CN201610039292.9A
Other languages
Chinese (zh)
Inventor
肖飞
Current Assignee (The listed assignees may be inaccurate. Google has not performed a legal analysis and makes no representation or warranty as to the accuracy of the list.)
One Way Information Technology (shanghai) Co Ltd
Original Assignee
One Way Information Technology (shanghai) Co Ltd
Priority date (The priority date is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the date listed.)
Filing date
Publication date
Application filed by One Way Information Technology (shanghai) Co Ltd filed Critical One Way Information Technology (shanghai) Co Ltd
Priority to CN201610039292.9A priority Critical patent/CN105677595A/en
Publication of CN105677595A publication Critical patent/CN105677595A/en
Pending legal-status Critical Current

Links

Classifications

    • GPHYSICS
    • G06COMPUTING; CALCULATING OR COUNTING
    • G06FELECTRIC DIGITAL DATA PROCESSING
    • G06F13/00Interconnection of, or transfer of information or other signals between, memories, input/output devices or central processing units
    • G06F13/14Handling requests for interconnection or transfer
    • G06F13/16Handling requests for interconnection or transfer for access to memory bus
    • G06F13/1668Details of memory controller
    • GPHYSICS
    • G06COMPUTING; CALCULATING OR COUNTING
    • G06FELECTRIC DIGITAL DATA PROCESSING
    • G06F13/00Interconnection of, or transfer of information or other signals between, memories, input/output devices or central processing units
    • G06F13/14Handling requests for interconnection or transfer
    • G06F13/20Handling requests for interconnection or transfer for access to input/output bus
    • G06F13/28Handling requests for interconnection or transfer for access to input/output bus using burst mode transfer, e.g. direct memory access DMA, cycle steal
    • G06F13/282Cycle stealing DMA
    • GPHYSICS
    • G06COMPUTING; CALCULATING OR COUNTING
    • G06FELECTRIC DIGITAL DATA PROCESSING
    • G06F15/00Digital computers in general; Data processing equipment in general
    • G06F15/76Architectures of general purpose stored program computers
    • G06F15/78Architectures of general purpose stored program computers comprising a single central processing unit
    • YGENERAL TAGGING OF NEW TECHNOLOGICAL DEVELOPMENTS; GENERAL TAGGING OF CROSS-SECTIONAL TECHNOLOGIES SPANNING OVER SEVERAL SECTIONS OF THE IPC; TECHNICAL SUBJECTS COVERED BY FORMER USPC CROSS-REFERENCE ART COLLECTIONS [XRACs] AND DIGESTS
    • Y02TECHNOLOGIES OR APPLICATIONS FOR MITIGATION OR ADAPTATION AGAINST CLIMATE CHANGE
    • Y02DCLIMATE CHANGE MITIGATION TECHNOLOGIES IN INFORMATION AND COMMUNICATION TECHNOLOGIES [ICT], I.E. INFORMATION AND COMMUNICATION TECHNOLOGIES AIMING AT THE REDUCTION OF THEIR OWN ENERGY USE
    • Y02D10/00Energy efficient computing, e.g. low power processors, power management or thermal management

Landscapes

  • Engineering & Computer Science (AREA)
  • Theoretical Computer Science (AREA)
  • Physics & Mathematics (AREA)
  • General Engineering & Computer Science (AREA)
  • General Physics & Mathematics (AREA)
  • Computer Hardware Design (AREA)
  • Advance Control (AREA)

Abstract

The invention discloses an FPGA method achieving computation speedup and PCIE SSD storage simultaneously. An FPGA is used, an SSD controller and an algorithm accelerator are integrated in the FPGA, the FPGA is further internally provided with a DDR controller and a direct memory read module DMA, and the direct memory read module DMA is connected with the SSD controller, the DDR controller and the algorithm accelerator respectively. According to the FPGA method achieving computation speedup and PCIE SSD storage simultaneously, the two functions of computation speedup and SSD storage are achieved on PCIE equipment, the layout difficult is reduced, overall power consumption of server nodes is reduced, and the cost of an enterprise is reduced.

Description

A kind of simultaneously realization calculates the FPGA method accelerated with PCIESSD stores
Technical field
The present invention relates to a kind of FPGA method, specifically a kind of simultaneously realization calculates the FPGA method accelerated with PCIESSD stores.
Background technology
Along with the fast development of informationization, high-density computation requirement gets more and more, and computing power and storage IO (I/O) Capability Requirement to Single-Server node are more and more higher. At present, enterprise market extensively adopts PCIESSD to improve the IO ability stored; And for computing power, generally adopt GPU(GraphicsProcessingUnit) accelerate card or FPGA acceleration card. Although it is strong that enterprise-level GPU accelerates card computing power, but price is expensive, power consumption height, adds enterprise cost. And FPGA acceleration card relative GPU acceleration card price is lower, low in energy consumption, but use PCIESSD and FPGA to accelerate to block simultaneously, not only to be taken multiple server slot, and increase more power consumption, add the cost of enterprise.
Summary of the invention
It is an object of the invention to provide a kind of simultaneously realization and calculate the FPGA method accelerated with PCIESSD stores, to solve in above-mentioned background technology the problem proposed.
For achieving the above object, the present invention provides following technical scheme:
A kind of simultaneously realization calculates the FPGA method accelerated with PCIESSD stores, employ a slice FPGA, SSD controller and algorithm accelerator it is integrated with in FPGA, SSD controller is further comprises in FPGA, described SSD controller is by logic realization in FPGA, the flash array of management and control SSD, SSD controller is by PCIE interface and server node communication.
As the scheme that the present invention is further: described algorithm accelerator is realized by fpga logic, by PCIE interface and server node communication.
As the scheme that the present invention is further: further comprises DDR controller in described FPGA.
As the present invention's further scheme: further comprises direct internal memory read module DMA in described FPGA, direct internal memory read module DMA connects SSD controller, DDR controller and algorithm accelerator respectively.
Compared with prior art, the invention has the beneficial effects as follows: the present invention achieves algorithm in a PCIE device and accelerates to store two functions with SSD, decreases layout difficulty, reduces the overall power of server node, reduces the cost of enterprise.
Accompanying drawing explanation
Fig. 1 realizes calculating the structural representation accelerating the FPGA method with PCIESSD storage simultaneously; Fig. 2 realizes calculating the schema accelerating the FPGA method with PCIESSD storage simultaneously.
Embodiment
Below in conjunction with the accompanying drawing in the embodiment of the present invention, the technical scheme in the embodiment of the present invention is clearly and completely described, it is clear that described embodiment is only the present invention's part embodiment, instead of whole embodiments.Based on the embodiment in the present invention, those of ordinary skill in the art, not making other embodiments all obtained under creative work prerequisite, belong to the scope of protection of the invention.
Refer to Fig. 1, in the embodiment of the present invention, a kind of simultaneously realization calculates the FPGA method accelerated with PCIESSD stores, employ a slice FPGA, being integrated with SSD controller and algorithm accelerator in FPGA, further comprises SSD controller in FPGA, described SSD controller is by logic realization in FPGA, the flash array of management and control SSD, SSD controller is by PCIE interface and server node communication; Described algorithm accelerator is realized by fpga logic, by PCIE interface and server node communication; DDR controller is further comprises in described FPGA; Further comprises direct internal memory read module DMA in described FPGA, direct internal memory read module DMA connects SSD controller, DDR controller and algorithm accelerator respectively.
The principle of work of the present invention is: refer to Fig. 1, and server node is communicated with FPGA by PCIE interface; The inner directly internal memory read module DMA of FPGA realizes PCIETLP(TransactionLayerPackage) agreement layer encapsulation and resolving, obtain order, distribution and converge data, reporting interruption; SSD controller receives order and the data that direct internal memory read module DMA distributes, according to order control NANDFLASH array; Algorithm accelerator receives order and the data that direct internal memory read module DMA distributes, and does the calculating of respective algorithms according to order; DDR controller is that direct internal memory read module DMA and algorithm accelerator are data cached.
The performing step of the present invention is as follows: 1, according to FPGA, NANDFLASH, DDRSDRAM of actual computation density demand and SSD Capacity Selection appropriate resources, according to, shown in Fig. 1, designing hardware.
2, FPGA workflow diagram according to Fig. 2 writes the RTL(RegisterTransferLevel of the direct internal memory read module DMA of key modules) code.
3, according to Fig. 1 definition FPGA internal structure in FPGA top layer exampleization, connect each function module, write the RTL top code of FPGA.
4, FPGA workflow diagram according to Fig. 2 writes emulation platform, completes system emulation.
5, adding temporal constraint in FPGA compiling instrument, compiling RTL code, obtains recordable paper.
6, schema according to Fig. 2 writes corresponding software-driven.
To those skilled in the art, it is clear that the invention is not restricted to the details of above-mentioned one exemplary embodiment, and when not deviating from spirit or the essential characteristic of the present invention, it is possible to realize the present invention in other specific forms. Therefore, no matter from which point, embodiment all should be regarded as exemplary, and right and wrong are restrictive, the scope of the present invention is limited by claims instead of above-mentioned explanation, it is intended that all changes in the implication of the equivalent important document dropping on claim and scope included in the present invention. Any Reference numeral in claim should be considered as the claim involved by limiting.
In addition, it is to be understood that, although this specification sheets is described according to enforcement mode, but not each enforcement mode only comprises an independent technical scheme, this kind of narrating mode of specification sheets is only for clarity sake, those skilled in the art should by specification sheets integrally, and the technical scheme in each embodiment through appropriately combined, can also form other enforcement modes that it will be appreciated by those skilled in the art that.

Claims (4)

1. one kind realizes calculating the FPGA method accelerated with PCIESSD stores simultaneously, it is characterized in that, employ a slice FPGA, SSD controller and algorithm accelerator it is integrated with in FPGA, SSD controller is further comprises in FPGA, described SSD controller is by logic realization in FPGA, and the flash array of management and control SSD, SSD controller is by PCIE interface and server node communication.
2. according to claim 1 simultaneously realization calculates the FPGA method accelerated with PCIESSD stores, it is characterised in that, described algorithm accelerator is realized by fpga logic, by PCIE interface and server node communication.
3. according to claim 1 simultaneously realization calculates the FPGA method accelerated with PCIESSD stores, it is characterised in that, further comprises DDR controller in described FPGA.
4. according to claim 1 simultaneously realization calculates the FPGA method accelerated with PCIESSD stores, it is characterized in that, further comprises direct internal memory read module DMA in described FPGA, direct internal memory read module DMA connects SSD controller, DDR controller and algorithm accelerator respectively.
CN201610039292.9A 2016-01-21 2016-01-21 FPGA method achieving computation speedup and PCIESSD storage simultaneously Pending CN105677595A (en)

Priority Applications (1)

Application Number Priority Date Filing Date Title
CN201610039292.9A CN105677595A (en) 2016-01-21 2016-01-21 FPGA method achieving computation speedup and PCIESSD storage simultaneously

Applications Claiming Priority (1)

Application Number Priority Date Filing Date Title
CN201610039292.9A CN105677595A (en) 2016-01-21 2016-01-21 FPGA method achieving computation speedup and PCIESSD storage simultaneously

Publications (1)

Publication Number Publication Date
CN105677595A true CN105677595A (en) 2016-06-15

Family

ID=56301794

Family Applications (1)

Application Number Title Priority Date Filing Date
CN201610039292.9A Pending CN105677595A (en) 2016-01-21 2016-01-21 FPGA method achieving computation speedup and PCIESSD storage simultaneously

Country Status (1)

Country Link
CN (1) CN105677595A (en)

Cited By (12)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN106528492A (en) * 2016-10-27 2017-03-22 济南浪潮高新科技投资发展有限公司 High-speed large-capacity recording board card realized based on FPGA
CN107193764A (en) * 2017-05-23 2017-09-22 济南浪潮高新科技投资发展有限公司 A kind of SRIO interface solid hard disk design methods based on PowerPC
CN108897706A (en) * 2018-05-10 2018-11-27 北京微密科技发展有限公司 Accelerator interface
CN109491934A (en) * 2018-09-28 2019-03-19 方信息科技(上海)有限公司 A kind of storage management system control method of integrated computing function
CN109902043A (en) * 2019-01-30 2019-06-18 中国科学院声学研究所 A kind of national secret algorithm acceleration processing system based on FPGA
CN110188066A (en) * 2019-05-07 2019-08-30 方一信息科技(上海)有限公司 A kind of FPGA for Large Volume Data and the FPGA algorithm based on opencl
CN110622145A (en) * 2017-05-15 2019-12-27 莫列斯有限公司 Reconfigurable server and server rack with reconfigurable server
CN110618963A (en) * 2018-06-19 2019-12-27 上海威固信息技术股份有限公司 Heterogeneous computing hardware acceleration system and method integrating computing and storage
CN110968537A (en) * 2018-09-28 2020-04-07 方一信息科技(上海)有限公司 PCIE SSD-based FPGA searching and matching method
US10915469B2 (en) 2018-03-09 2021-02-09 Samsung Electronics Co., Ltd. Method and apparatus for supporting a field programmable gate array (FPGA) based add-in-card (AIC) solid state drive (SSD)
TWI772611B (en) * 2018-03-05 2022-08-01 南韓商三星電子股份有限公司 Host system and method thereof and acceleration module
CN115857805A (en) * 2022-11-30 2023-03-28 合肥腾芯微电子有限公司 Artificial intelligence computable storage system

Citations (4)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
US20120236656A1 (en) * 2010-07-09 2012-09-20 Stec, Inc. Apparatus and method for determining a read level of a memory cell based on cycle information
CN103038751A (en) * 2010-05-28 2013-04-10 爱德万测试公司 Flexible storage interface tester with variable parallelism and firmware upgradeability
CN103999060A (en) * 2011-12-23 2014-08-20 国际商业机器公司 Solid-state storage management
CN104952037A (en) * 2014-03-27 2015-09-30 联科集团(中国)有限公司 Method and system for rescaling image files

Patent Citations (4)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN103038751A (en) * 2010-05-28 2013-04-10 爱德万测试公司 Flexible storage interface tester with variable parallelism and firmware upgradeability
US20120236656A1 (en) * 2010-07-09 2012-09-20 Stec, Inc. Apparatus and method for determining a read level of a memory cell based on cycle information
CN103999060A (en) * 2011-12-23 2014-08-20 国际商业机器公司 Solid-state storage management
CN104952037A (en) * 2014-03-27 2015-09-30 联科集团(中国)有限公司 Method and system for rescaling image files

Cited By (19)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN106528492A (en) * 2016-10-27 2017-03-22 济南浪潮高新科技投资发展有限公司 High-speed large-capacity recording board card realized based on FPGA
US11907152B2 (en) 2017-05-15 2024-02-20 Molex, Llc Reconfigurable server and server rack with same
CN110622145A (en) * 2017-05-15 2019-12-27 莫列斯有限公司 Reconfigurable server and server rack with reconfigurable server
CN107193764A (en) * 2017-05-23 2017-09-22 济南浪潮高新科技投资发展有限公司 A kind of SRIO interface solid hard disk design methods based on PowerPC
CN107193764B (en) * 2017-05-23 2019-12-13 浪潮集团有限公司 SRIO interface solid state disk design method based on PowerPC
US11892957B2 (en) 2018-03-05 2024-02-06 Samsung Electronics Co., Ltd. SSD architecture for FPGA based acceleration
TWI772611B (en) * 2018-03-05 2022-08-01 南韓商三星電子股份有限公司 Host system and method thereof and acceleration module
US11520714B2 (en) 2018-03-09 2022-12-06 Samsung Electronics Co., Ltd. Method and apparatus for supporting a field programmable gate array (FPGA) based add-in-card (AIC) solid state drive (SSD)
US10915469B2 (en) 2018-03-09 2021-02-09 Samsung Electronics Co., Ltd. Method and apparatus for supporting a field programmable gate array (FPGA) based add-in-card (AIC) solid state drive (SSD)
CN108897706A (en) * 2018-05-10 2018-11-27 北京微密科技发展有限公司 Accelerator interface
CN108897706B (en) * 2018-05-10 2021-07-23 北京融芯微科技有限公司 Accelerator interface
CN110618963A (en) * 2018-06-19 2019-12-27 上海威固信息技术股份有限公司 Heterogeneous computing hardware acceleration system and method integrating computing and storage
CN109491934B (en) * 2018-09-28 2021-03-02 方一信息科技(上海)有限公司 Storage management system control method integrating computing function
CN110968537B (en) * 2018-09-28 2021-02-02 方一信息科技(上海)有限公司 PCIE SSD-based FPGA searching and matching method
CN110968537A (en) * 2018-09-28 2020-04-07 方一信息科技(上海)有限公司 PCIE SSD-based FPGA searching and matching method
CN109491934A (en) * 2018-09-28 2019-03-19 方信息科技(上海)有限公司 A kind of storage management system control method of integrated computing function
CN109902043A (en) * 2019-01-30 2019-06-18 中国科学院声学研究所 A kind of national secret algorithm acceleration processing system based on FPGA
CN110188066A (en) * 2019-05-07 2019-08-30 方一信息科技(上海)有限公司 A kind of FPGA for Large Volume Data and the FPGA algorithm based on opencl
CN115857805A (en) * 2022-11-30 2023-03-28 合肥腾芯微电子有限公司 Artificial intelligence computable storage system

Similar Documents

Publication Publication Date Title
CN105677595A (en) FPGA method achieving computation speedup and PCIESSD storage simultaneously
KR102284972B1 (en) Data processing method and device
CN102629258B (en) Repeating data deleting method and device
CN103631538A (en) Cold and hot data identification threshold value calculation method, device and system
WO2017156968A1 (en) Neural network computing method, system and device therefor
CN108563808A (en) The design method of heterogeneous reconfigurable figure computation accelerator system based on FPGA
US10700714B2 (en) Method of operating decoder for reducing computational complexity and method of operating data storage device including the decoder
CN111831254A (en) Image processing acceleration method, image processing model storage method and corresponding device
CN103593477A (en) Collocation method and device of Hash database
US10725494B2 (en) Optimizing neurosynaptic networks
CN103631894A (en) Dynamic copy management method based on HDFS
CN105808169A (en) Data deduplication method, apparatus and system
CN109542713A (en) A kind of verification method and verifying device
CN103793483B (en) Clock tree generating method and system based on clock node clusters
CN104699946A (en) Game scene management method and device
CN109460406A (en) A kind of data processing method and device
CN105022968A (en) Integrity checking method of memory data
US20180239791A1 (en) Bitmap-based storage space management system and methods thereof
CN101800768A (en) Gridding data transcription generation method based on storage alliance subset partition
CN105095515A (en) Bucket dividing method, device and equipment supporting fast query of Map-Reduce output result
CN103150435B (en) Clock node knot group's method and clock network structure
CN106940712A (en) Sequence generating method and equipment
CN103294407B (en) Storage device and data read-write method
CN107563080B (en) GPU-based two-phase medium random model parallel generation method and electronic equipment
CN102799750B (en) Method for quickly generating common side and non-common sides of geometry surface triangle

Legal Events

Date Code Title Description
C06 Publication
PB01 Publication
C10 Entry into substantive examination
SE01 Entry into force of request for substantive examination
WD01 Invention patent application deemed withdrawn after publication

Application publication date: 20160615

WD01 Invention patent application deemed withdrawn after publication