CN105892931A - heterogeneous CPU-GPU system configuration based on intelligent flash cache - Google Patents

heterogeneous CPU-GPU system configuration based on intelligent flash cache Download PDF

Info

Publication number
CN105892931A
CN105892931A CN201410206761.2A CN201410206761A CN105892931A CN 105892931 A CN105892931 A CN 105892931A CN 201410206761 A CN201410206761 A CN 201410206761A CN 105892931 A CN105892931 A CN 105892931A
Authority
CN
China
Prior art keywords
cpu
gpu
intelligent flash
flash cache
intelligent
Prior art date
Legal status (The legal status is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the status listed.)
Pending
Application number
CN201410206761.2A
Other languages
Chinese (zh)
Inventor
张军
宋惟忠
Current Assignee (The listed assignees may be inaccurate. Google has not performed a legal analysis and makes no representation or warranty as to the accuracy of the list.)
Shanghai DC Science Co Ltd
Original Assignee
Shanghai Jingzhi Information Technology Co Ltd
Priority date (The priority date is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the date listed.)
Filing date
Publication date
Application filed by Shanghai Jingzhi Information Technology Co Ltd filed Critical Shanghai Jingzhi Information Technology Co Ltd
Priority to CN201410206761.2A priority Critical patent/CN105892931A/en
Publication of CN105892931A publication Critical patent/CN105892931A/en
Pending legal-status Critical Current

Links

Abstract

The invention discloses an operating system prototype of a <{EN0}>heterogeneous CPU-GPU system configuration based on an intelligent flash cache; the configuration is characterized by comprising independent parallel CPU and parallel GPU, both have a storage subsystem, and can access the memory of both parties; the CPU is connected with a chipset through a I/O bus, and connected with the CPU through a I/O bridge; the CPU comprises an ALU, a dwell vessel file, an intelligent flash cache and a bus interface; Linux system transplant is carried out so as to support the heterogeneous CPU-GPU system configuration based on the intelligent flash cache, thus forming a self made operating system prototype.

Description

A kind of operating system prototype supporting isomery CPU-GPU system architecture based on intelligent flash caching
Technical field
The present invention relates to a kind of operating system prototype supporting isomery CPU-GPU system architecture based on intelligent flash caching
Background technology
CPU i.e. central processing unit is one piece of ultra-large integrated circuit, is arithmetic core and the control core of a computer.Mainly include arithmetical unit (ALU) and the big parts of controller (CU) two.Additionally, also include several depositors and cache memory and realize the bus of data, control and the state of contact between them.It is collectively referred to as the big core component of electronic computer three with internal storage and input-output apparatus.GPU i.e. graphic process unit is a kind of special at personal computer, work station, game machine and the microprocessor of some mobile device epigraph operation.
Heterogeneous Computing is primarily referred to as using the calculation of the computing unit composition system of dissimilar instruction set and architectural framework.Common computing unit classification includes the processors such as CPU, GPU.Heterogeneous Computing obtains more concern in recent years, and the traditional approach being primarily due to improve computing capability by promoting cpu clock frequency and number of cores encounters heat radiation and energy consumption bottleneck.Although and the dedicated computing unit operating frequencies such as GPU are relatively low, having more interior check figure and computation capability, ratio and the performance/power dissipation ratio of overall performance/chip area are the highest, are far from being fully used.
The design of CPU allows it compare and is good at process irregular data structure and uncertain access mode, and recursive algorithm, branch's intensity code and single-threading program.This kind of program task has the steps such as instruction scheduling, circulation, branch, logical judgment and the execution of complexity.And GPU is good at processing rules data structure and measurable access mode.Gather both strong points, reach the optimization of overall performance by Heterogeneous Computing.
Intelligent flash caching is a read-only caching.Being removed out buffer area cache when unmodified data block due to the pressure in space, these data blocks are just moved in flash cache;If needing again these data, data base will retract these data blocks from flash cache again.Flash cache utilizes the I/O speed of flash memory device, more much higher than storage performance based on disk;There is enough CPU, flash cache can be used.
The invention provides a kind of operating system prototype supporting isomery CPU-GPU system architecture based on intelligent flash caching.Framework is characterized by independent parallel C PU and parallel GPU, has respective storage subsystem, all may have access to the memorizer of the other side;GPU is connected to chipset by I/O bus, is connected with CPU by I/O bridge the most again;CPU is cached by ALU, register file and intelligent flash and EBI forms.Carry out linux system transplanting, to support isomery CPU-GPU system architecture based on intelligent flash caching, constitute a kind of self-making operational system prototype.
Summary of the invention
It is an object of the invention to provide a kind of operating system prototype supporting isomery CPU-GPU system architecture based on intelligent flash caching.The present invention includes following characteristics:
Inventive technique scheme
1. an isomery CPU-GPU system architecture based on intelligent flash caching, the feature of framework:
1) there is independent parallel C PU and parallel GPU, have respective storage subsystem, all may have access to the memorizer of the other side;
2) GPU is connected to chipset by I/O bus, is connected with CPU by I/O bridge the most again;
3) CPU is cached by ALU, register file and intelligent flash and EBI forms.
2. hardware structure based on claim 1, carries out linux system transplanting, to support isomery CPU-GPU system architecture based on intelligent flash caching, constitutes a kind of self-making operational system prototype.
Accompanying drawing explanation
Accompanying drawing 1 is isomery CPU-GPU system architecture based on intelligent flash caching.
Detailed description of the invention
1. an isomery CPU-GPU system architecture based on intelligent flash caching, the feature of framework:
1) there is independent parallel C PU and parallel GPU, have respective storage subsystem, all may have access to the memorizer of the other side;
2) GPU is connected to chipset by I/O bus, is connected with CPU by I/O bridge the most again;
3) CPU is cached by ALU, register file and intelligent flash and EBI forms.
2. hardware structure based on claim 1, carries out linux system transplanting, to support isomery CPU-GPU system architecture based on intelligent flash caching, constitutes a kind of self-making operational system prototype.

Claims (2)

1. an isomery CPU-GPU system architecture based on intelligent flash caching, the feature of framework:
1) there is independent parallel C PU and parallel GPU, have respective storage subsystem, all may have access to the memorizer of the other side;
2) GPU is connected to chipset by I/O bus, is connected with CPU by I/O bridge the most again;
3) CPU is cached by ALU, register file and intelligent flash and EBI forms.
2. hardware structure based on claim 1, carries out linux system transplanting, to support isomery CPU-GPU system architecture based on intelligent flash caching, constitutes a kind of self-making operational system prototype.
CN201410206761.2A 2014-05-16 2014-05-16 heterogeneous CPU-GPU system configuration based on intelligent flash cache Pending CN105892931A (en)

Priority Applications (1)

Application Number Priority Date Filing Date Title
CN201410206761.2A CN105892931A (en) 2014-05-16 2014-05-16 heterogeneous CPU-GPU system configuration based on intelligent flash cache

Applications Claiming Priority (1)

Application Number Priority Date Filing Date Title
CN201410206761.2A CN105892931A (en) 2014-05-16 2014-05-16 heterogeneous CPU-GPU system configuration based on intelligent flash cache

Publications (1)

Publication Number Publication Date
CN105892931A true CN105892931A (en) 2016-08-24

Family

ID=56999939

Family Applications (1)

Application Number Title Priority Date Filing Date
CN201410206761.2A Pending CN105892931A (en) 2014-05-16 2014-05-16 heterogeneous CPU-GPU system configuration based on intelligent flash cache

Country Status (1)

Country Link
CN (1) CN105892931A (en)

Cited By (4)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN105320630A (en) * 2014-08-01 2016-02-10 张军 Heterogeneous multi-core CPU-GPU (Central Processing Unit-Graphics Processing Unit) system architecture based on intelligent flash cache
CN106502956A (en) * 2016-10-28 2017-03-15 张军 A kind of operating system prototype of multinuclear isomery CPU GPU system frameworks
CN106708777A (en) * 2017-01-23 2017-05-24 张军 Multi-core heterogeneous CPU - CPU - FPGA architecture
CN106843045A (en) * 2017-01-23 2017-06-13 张军 A kind of embedded OS prototype based on multinuclear isomery CPU GPU FPGA system frameworks

Citations (7)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
US20050122339A1 (en) * 2003-06-30 2005-06-09 Microsoft Corporation System and method for parallel execution of data generation tasks
CN101526934A (en) * 2009-04-21 2009-09-09 浪潮电子信息产业股份有限公司 Construction method of GPU and CPU combined processor
CN101963918A (en) * 2010-10-26 2011-02-02 上海交通大学 Method for realizing virtual execution environment of central processing unit (CPU)/graphics processing unit (GPU) heterogeneous platform
US20110157195A1 (en) * 2009-12-31 2011-06-30 Eric Sprangle Sharing resources between a CPU and GPU
CN103345382A (en) * 2013-07-15 2013-10-09 郑州师范学院 CPU+GPU group nuclear supercomputer system and SIFT feature matching parallel computing method
CN103399730A (en) * 2013-07-29 2013-11-20 中国科学院地理科学与资源研究所 Parallel processing method for judgment of segment intersection for solid geometrical bodies on the basis of CPU-GPU architecture
CN103559017A (en) * 2013-10-23 2014-02-05 东软集团股份有限公司 Character string matching method and system based on graphic processing unit (GPU) heterogeneous computing platform

Patent Citations (7)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
US20050122339A1 (en) * 2003-06-30 2005-06-09 Microsoft Corporation System and method for parallel execution of data generation tasks
CN101526934A (en) * 2009-04-21 2009-09-09 浪潮电子信息产业股份有限公司 Construction method of GPU and CPU combined processor
US20110157195A1 (en) * 2009-12-31 2011-06-30 Eric Sprangle Sharing resources between a CPU and GPU
CN101963918A (en) * 2010-10-26 2011-02-02 上海交通大学 Method for realizing virtual execution environment of central processing unit (CPU)/graphics processing unit (GPU) heterogeneous platform
CN103345382A (en) * 2013-07-15 2013-10-09 郑州师范学院 CPU+GPU group nuclear supercomputer system and SIFT feature matching parallel computing method
CN103399730A (en) * 2013-07-29 2013-11-20 中国科学院地理科学与资源研究所 Parallel processing method for judgment of segment intersection for solid geometrical bodies on the basis of CPU-GPU architecture
CN103559017A (en) * 2013-10-23 2014-02-05 东软集团股份有限公司 Character string matching method and system based on graphic processing unit (GPU) heterogeneous computing platform

Cited By (4)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN105320630A (en) * 2014-08-01 2016-02-10 张军 Heterogeneous multi-core CPU-GPU (Central Processing Unit-Graphics Processing Unit) system architecture based on intelligent flash cache
CN106502956A (en) * 2016-10-28 2017-03-15 张军 A kind of operating system prototype of multinuclear isomery CPU GPU system frameworks
CN106708777A (en) * 2017-01-23 2017-05-24 张军 Multi-core heterogeneous CPU - CPU - FPGA architecture
CN106843045A (en) * 2017-01-23 2017-06-13 张军 A kind of embedded OS prototype based on multinuclear isomery CPU GPU FPGA system frameworks

Similar Documents

Publication Publication Date Title
Jouppi et al. Ten lessons from three generations shaped google’s tpuv4i: Industrial product
Jouppi et al. Motivation for and evaluation of the first tensor processing unit
US10002108B2 (en) Processing device for performing convolution operations
Singh et al. Evolution of processor architecture in mobile phones
CN106663061B (en) Virtualization of memory for programmable logic
CN103218338B (en) The real-time many DSP debug system of a kind of signal processor system
CN108885586B (en) Processor, method, system, and instruction for fetching data to an indicated cache level with guaranteed completion
CN105892931A (en) heterogeneous CPU-GPU system configuration based on intelligent flash cache
CN105446920A (en) Loongson-based FPGA embedded computer and configuration method thereof
CN111052039A (en) Multi-standard power management scheme for pooled accelerator architectures
US20140025930A1 (en) Multi-core processor sharing li cache and method of operating same
JP2006522385A (en) Apparatus and method for providing multi-threaded computer processing
CN111209247A (en) Integrated circuit computing device and computing processing system
CN105320630A (en) Heterogeneous multi-core CPU-GPU (Central Processing Unit-Graphics Processing Unit) system architecture based on intelligent flash cache
Liang et al. Ins-dla: An in-ssd deep learning accelerator for near-data processing
US8996833B2 (en) Multi latency configurable cache
WO2014105133A1 (en) Table driven multiple passive trip platform passive thermal management
US20160055005A1 (en) System and Method for Page-Conscious GPU Instruction
CN105988945A (en) Heterogeneous multiprocessor system and driving control method thereof
Momose SX-ACE, brand-new vector supercomputer for higher sustained performance I
Kondo et al. SMYLEref: A reference architecture for manycore-processor SoCs
Lee et al. Fast shared on-chip memory architecture for efficient hybrid computing with CGRAs
CN209746539U (en) Acceleration card device for self-adaptive programmable storage calculation
KR101682985B1 (en) Priority based intelligent platform passive thermal management
CN201336033Y (en) Embedded card and embedded computer system

Legal Events

Date Code Title Description
C06 Publication
PB01 Publication
C10 Entry into substantive examination
SE01 Entry into force of request for substantive examination
CB03 Change of inventor or designer information
CB03 Change of inventor or designer information

Inventor after: Zhang Jun

Inventor before: Zhang Jun

Inventor before: Song Weizhong

TA01 Transfer of patent application right
TA01 Transfer of patent application right

Effective date of registration: 20171222

Address after: 200120 room 4A06, No. 277, Ruichang Road, Pudong New Area, Shanghai, 6 rooms

Applicant after: Shanghai Deheng Data Technology Co. Ltd.

Address before: 200061 room No. 100, Lane 40, Tam Tam Road, Shanghai, Putuo District, China 1103

Applicant before: SHANGHAI JINGZHI INFORMATION TECHNOLOGY CO., LTD.

RJ01 Rejection of invention patent application after publication
RJ01 Rejection of invention patent application after publication

Application publication date: 20160824