CN107247681A - A kind of big data engine prototype based on multinuclear isomery CPU GPU FPGA - Google Patents
A kind of big data engine prototype based on multinuclear isomery CPU GPU FPGA Download PDFInfo
- Publication number
- CN107247681A CN107247681A CN201710387597.3A CN201710387597A CN107247681A CN 107247681 A CN107247681 A CN 107247681A CN 201710387597 A CN201710387597 A CN 201710387597A CN 107247681 A CN107247681 A CN 107247681A
- Authority
- CN
- China
- Prior art keywords
- gpu
- fpga
- cpu
- big data
- data engine
- Prior art date
- Legal status (The legal status is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the status listed.)
- Pending
Links
Classifications
-
- G—PHYSICS
- G06—COMPUTING; CALCULATING OR COUNTING
- G06F—ELECTRIC DIGITAL DATA PROCESSING
- G06F15/00—Digital computers in general; Data processing equipment in general
- G06F15/16—Combinations of two or more digital computers each having at least an arithmetic unit, a program unit and a register, e.g. for a simultaneous processing of several programs
- G06F15/163—Interprocessor communication
- G06F15/167—Interprocessor communication using a common memory, e.g. mailbox
-
- G—PHYSICS
- G06—COMPUTING; CALCULATING OR COUNTING
- G06F—ELECTRIC DIGITAL DATA PROCESSING
- G06F15/00—Digital computers in general; Data processing equipment in general
- G06F15/16—Combinations of two or more digital computers each having at least an arithmetic unit, a program unit and a register, e.g. for a simultaneous processing of several programs
- G06F15/163—Interprocessor communication
- G06F15/17—Interprocessor communication using an input/output type connection, e.g. channel, I/O port
Landscapes
- Engineering & Computer Science (AREA)
- Computer Hardware Design (AREA)
- Theoretical Computer Science (AREA)
- Software Systems (AREA)
- Physics & Mathematics (AREA)
- General Engineering & Computer Science (AREA)
- General Physics & Mathematics (AREA)
- Multi Processors (AREA)
Abstract
The present invention discloses a kind of big data engine prototype based on multinuclear isomery CPU GPU FPGA.Framework is characterized by independent parallel C PU and parallel GPU, there is respective magnetic random storage subsystem, may have access to the magnetic RAM of other side, and outside FPGA structure;GPU is connected to chipset by I/O buses, is then connected again by I/O bridges with CPU;CPU is cached by ALU, register file and intelligent flash and EBI is constituted.System transplantation is carried out, to support multinuclear isomery CPU GPU FPGA big data exchange architectures, a kind of big data engine prototype is constituted.
Description
Technical field
The present invention relates to a kind of big data engine prototype based on multinuclear isomery CPU-GPU-FPGA
Background technology
With cloud computing and virtualization, the features such as " extensive ", " high density ", " high energy consumption ", " complication " is showed, is built
If with development New Generation of IDC, lifting data center infrastructure management will become increasingly important, the basis of data center
The new trend that framework fusion management will develop with intelligence as data center.
Ultra-large type data center provide from infrastructure to data analysis below, screening, application whole application take
Business.It is not only data analysis, in addition to the special service different from the generalization service that public cloud is provided is in the cloud of intelligence manufacture
Calculate, and super computing, this just proposes requirements at the higher level to the disposal ability of big data.
Heterogeneous Computing refers to the calculation that system is constituted using the computing unit of different type instruction set and architectural framework.
Common computing unit classification includes:CUP (central processing unit), GPU (graphics processor), FPGA (field programmable gate array)
Deng.It has been trend of the times that CPU is merged with GPU height, but this is more than the change of hardware view, is more calculating theory
Change.How different calculating tasks is automatically assigned to be most appropriate to the chip of the processing task, highest energy is realized whereby
Effect ratio and highest transistor utilization rate, as the new programming mode of exploration or the computation schema significant problem to be faced.
After increasing special stone is integrated into FPGA, FPGA design method needs occur essence
Change.The SoC design method progressively accepted in IC design fields is likewise introduced into FPGA design field, this side
The core of method is to surround CPU core Method of Spreading Design, and the system bus drawn using CPU is trunk, and other modules all hang over this
In bus, such as, the system based on CPU is developed on FPGA, after electricity on FPGA, hardware logic passes through chip configuration successful
Afterwards, read software document and go in SDRAM (synchronous DRAM), software is run in SDRAM.
It is overall with more interior check figures and computation capability although the dedicated computing unit working frequency such as GPU is relatively low
The ratio and performance/power dissipation ratio of performance/chip area are all very high, are but far from being fully used.Particularly GPU general meter
Calculation is imported into high parallel computation field, to fusion multi-sensor information of the processing including vision sensor, plays new core
Effect.
Intelligent flash caching is read buffer.When unmodified data block because the pressure in space is removed out caching
Area's cache, these data blocks are just moved into flash cache;If needing these data again, database will be again this
A little data blocks are moved back to from flash cache.Flash cache utilizes the I/O speed of flash memory device, higher than the storage performance based on disk
Much;With enough CPU, flash cache can be used.
Magnetic RAM (Magnetic Random Access Memory, abbreviation MRAM) possesses static random and deposited
The high speed of reservoir (SRAM) reads write capability, and dynamic RAM (DRAM) high integration, and substantially may be used
To be repeatedly written infinitely.
The invention provides a kind of big data engine prototype based on multinuclear isomery CPU-GPU-FPGA.The feature of framework is
With independent parallel C PU and parallel GPU, there is respective magnetic random storage subsystem, may have access to the magnetic random of other side
Memory, and outside FPGA structure;GPU is connected to chipset by I/O buses, then passes through I/O bridges and CPU phases again
Even;CPU is cached by ALU, register file and intelligent flash and EBI is constituted.System transplantation is carried out, to support multinuclear
Isomery CPU-GPU-FPGA big data exchange architectures, constitute a kind of big data engine prototype.
The content of the invention
It is an object of the invention to provide a kind of big data exchange architecture based on multinuclear isomery CPU-GPU-FPGA.This hair
It is bright including following characteristics:
Inventive technique scheme
1. a kind of big data exchange architecture based on multinuclear isomery CPU-GPU-FPGA, the feature of framework:
1) there is independent parallel C PU and parallel GPU, there is respective magnetic random storage subsystem, other side is may have access to
Magnetic RAM, and outside the FPGA structure;
2) GPU is connected to chipset by I/O buses, is then connected again by I/O bridges with CPU;
3) CPU is cached by ALU, register file and intelligent flash and EBI is constituted.
2. the hardware structure based on claim 1, carries out system transplantation, to support the big numbers of multinuclear isomery CPU-GPU-FPGA
According to exchange architecture, a kind of big data engine prototype is constituted.
Brief description of the drawings
Accompanying drawing 1 is the big data engine prototype figure based on multinuclear isomery CPU-GPU-FPGA.
Embodiment
This big data engine prototype based on multinuclear isomery CPU-GPU-FPGA, comprises the following steps feature:
1) there is independent parallel C PU and parallel GPU, there is respective magnetic random storage subsystem, other side is may have access to
Magnetic RAM, and outside the FPGA structure;
2) GPU is connected to chipset by I/O buses, is then connected again by I/O bridges with CPU;
3) CPU is cached by ALU, register file and intelligent flash and EBI is constituted;
4) system transplantation is carried out, to support multinuclear isomery CPU-GPU-FPGA big data exchange architectures, a kind of big number is constituted
According to engine prototype.
Claims (2)
1. a kind of big data exchange architecture based on multinuclear isomery CPU-GPU-FPGA, the feature of framework:
1) there is independent parallel C PU and parallel GPU, have respective magnetic random storage subsystem, may have access to the magnetic of other side
Property random access memory, and outside the FPGA structure;
2) GPU is connected to chipset by I/O buses, is then connected again by I/O bridges with CPU;
3) CPU is cached by ALU, register file and intelligent flash and EBI is constituted.
2. the hardware structure based on claim 1, carries out system transplantation, to support multinuclear isomery CPU-GPU-FPGA big datas to draw
Framework is held up, a kind of big data engine prototype is constituted.
Priority Applications (1)
Application Number | Priority Date | Filing Date | Title |
---|---|---|---|
CN201710387597.3A CN107247681A (en) | 2017-05-27 | 2017-05-27 | A kind of big data engine prototype based on multinuclear isomery CPU GPU FPGA |
Applications Claiming Priority (1)
Application Number | Priority Date | Filing Date | Title |
---|---|---|---|
CN201710387597.3A CN107247681A (en) | 2017-05-27 | 2017-05-27 | A kind of big data engine prototype based on multinuclear isomery CPU GPU FPGA |
Publications (1)
Publication Number | Publication Date |
---|---|
CN107247681A true CN107247681A (en) | 2017-10-13 |
Family
ID=60017426
Family Applications (1)
Application Number | Title | Priority Date | Filing Date |
---|---|---|---|
CN201710387597.3A Pending CN107247681A (en) | 2017-05-27 | 2017-05-27 | A kind of big data engine prototype based on multinuclear isomery CPU GPU FPGA |
Country Status (1)
Country | Link |
---|---|
CN (1) | CN107247681A (en) |
Cited By (1)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
CN108038874A (en) * | 2017-12-01 | 2018-05-15 | 中国科学院自动化研究所 | Towards the real-time registration apparatus of scanning electron microscope image and method of sequence section |
-
2017
- 2017-05-27 CN CN201710387597.3A patent/CN107247681A/en active Pending
Cited By (1)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
CN108038874A (en) * | 2017-12-01 | 2018-05-15 | 中国科学院自动化研究所 | Towards the real-time registration apparatus of scanning electron microscope image and method of sequence section |
Similar Documents
Publication | Publication Date | Title |
---|---|---|
CN106683158B (en) | Modeling system of GPU texture mapping non-blocking storage Cache | |
Shan et al. | FPMR: MapReduce framework on FPGA | |
Hong-Tao et al. | K-means on commodity GPUs with CUDA | |
Gao et al. | Practical near-data processing for in-memory analytics frameworks | |
US8364739B2 (en) | Sparse matrix-vector multiplication on graphics processor units | |
Cheng et al. | Using mobile GPU for general-purpose computing–a case study of face recognition on smartphones | |
CN106383695B (en) | The acceleration system and its design method of clustering algorithm based on FPGA | |
CN102880509B (en) | Compute unified device architecture (CUDA) based grid digital elevation model (DEM) neighborhood analysis system and method | |
Chen et al. | Unified non-volatile memory and NAND flash memory architecture in smartphones | |
Islam et al. | Improving node-level mapreduce performance using processing-in-memory technologies | |
Xiao et al. | Image Sobel edge extraction algorithm accelerated by OpenCL | |
CN105550978A (en) | GPU3D engine on-chip memory hierarchy structure facing unified dyeing architecture | |
CN107247681A (en) | A kind of big data engine prototype based on multinuclear isomery CPU GPU FPGA | |
Li et al. | A Novel Memory‐Scheduling Strategy for Large Convolutional Neural Network on Memory‐Limited Devices | |
CN107256201A (en) | A kind of big data exchange architecture based on multinuclear isomery CPU GPU FPGA | |
Rakvic et al. | Energy efficient iris recognition with graphics processing units | |
Wang et al. | GPU-rrtmg_Sw: Accelerating a shortwave radiative transfer scheme on GPU | |
CN115756605A (en) | Shallow cloud convection parameterization scheme heterogeneous computing method based on multiple GPUs | |
CN102981798A (en) | Floating point independent source analytical algorithm circuit structure based on FPGA (Field Programmable Gate Array) | |
Zhang et al. | High-performance spatial join processing on gpgpus with applications to large-scale taxi trip data | |
Jun et al. | Asymmetric DRAM synthesis for heterogeneous chip multiprocessors in 3D-stacked architecture | |
Cheng et al. | Performance optimization of vision apps on mobile application processor | |
Li et al. | Heterogeneous acceleration algorithms for shallow cumulus convection scheme over GPU clusters | |
Solano-Quinde et al. | Multi-GPU implementation of the Horizontal Diffusion method of the Weather Research and Forecast Model | |
Qiu et al. | Implementation of Remote‐Sensing Data Processing Platform Based on Computable Storage |
Legal Events
Date | Code | Title | Description |
---|---|---|---|
PB01 | Publication | ||
PB01 | Publication | ||
WD01 | Invention patent application deemed withdrawn after publication |
Application publication date: 20171013 |
|
WD01 | Invention patent application deemed withdrawn after publication |