CN104866457B - A kind of chip multi-core processor static framework based on shared buffer memory - Google Patents
A kind of chip multi-core processor static framework based on shared buffer memory Download PDFInfo
- Publication number
- CN104866457B CN104866457B CN201510302580.4A CN201510302580A CN104866457B CN 104866457 B CN104866457 B CN 104866457B CN 201510302580 A CN201510302580 A CN 201510302580A CN 104866457 B CN104866457 B CN 104866457B
- Authority
- CN
- China
- Prior art keywords
- node
- buffer memory
- shared buffer
- core
- classes
- Prior art date
- Legal status (The legal status is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the status listed.)
- Expired - Fee Related
Links
Landscapes
- Memory System Of A Hierarchy Structure (AREA)
Abstract
The invention discloses a kind of chip multi-core processor static frameworks based on shared buffer memory, it includes n node, and n is the even number more than 0, and the node includes n/2 A classes node and n/2 B class node;A classes node includes processing core core, local private cache L1 and router R;B classes node includes processing core core, local private cache L1, router R and shared buffer memory l2bank;It is communicated between node by router;A classes node and the into intermittent distribution of B classes node;The capacity of the shared buffer memory l2bank is twice of the shared buffer memory in generic structure.The present invention provides a kind of chip multi-core processor static frameworks based on shared buffer memory, on the basis of the access for not increasing entire network-on-chip is delayed and does not increase the Congestion Level SPCC of entire network-on-chip, the hardware spending and area of afterbody cache are saved, saves the quiescent dissipation of cache.
Description
Technical field
The present invention relates to a kind of chip multi-core processor static frameworks based on shared buffer memory.
Background technology
As shown in Figure 1, a kind of common network-on-chip multi-core processor framework, we are with common 16 core, and two level is at a high speed
For caching, this common structure is made of 16 nodes, and each node includes the router R for communication, process cores
The shared buffer memory l2bank of heart core and local private cache L1 and a larger area, since the structure is based on shared
The mechanism of caching carries out data interaction and communication, and shared buffer memory l2bank is in the area right and wrong shared by entire network-on-chip
It is often big, therefore its power consumption for bringing influences also very big, the especially ratio shared by quiescent dissipation.
It will be seen that each processor core core is connected there are one shared buffer memory l2bank from Fig. 1, due to
During routine access, read-write data may be read and write shared slow where the processor core core of relative distance farther out
The data stored in l2bank are deposited, so when doing Static Design, average number of hops(Core accesses the average departure of shared buffer memory l2bank
From)And during data interaction, the congestion problems of whole network(It guarantees fairness)All it is factor needed to be considered, number
Increasing according to the requirement of storage, the area of shared buffer memory l2bank is also increasing, thus its quiescent dissipation accounts for entire on piece
The ratio of network uses this generic structure since shared buffer memory l2bank areas are excessive also in the trend increased increasingly
Caused power problems also become one it is very important the problem of.
Invention content
It is an object of the invention to overcome the deficiencies of the prior art and provide a kind of chip multi-core processing based on shared buffer memory
Device static framework, on the access delay for not increasing entire network-on-chip, the basis for the Congestion Level SPCC for not increasing entire network-on-chip
On, the hardware spending and area of afterbody cache are saved, saves the quiescent dissipation of cache.
The purpose of the present invention is achieved through the following technical solutions:A kind of chip multi-core processing based on shared buffer memory
Device static framework, it includes n node, and n is the even number more than 0, and the node includes n/2 A classes node and n/2 B class
Node;The A classes node includes processing core core, local private cache L1 and router R;The B class nodes
Including processing core core, local private cache L1, router R and shared buffer memory l2bank;Lead between the node
Router is crossed to communicate;The A classes node and the into intermittent distribution of B classes node;The appearance of the shared buffer memory l2bank
Measure twice for the shared buffer memory in generic structure.
The node number is 16, including the B class nodes described in the A classes node described in 8 and 8.
The local private cache L1 includes instruction buffer and data buffer storage.
The beneficial effects of the invention are as follows:(1)It can ensure the average distance and generic structure of access shared buffer memory l2bank
Average distance it is the same, so as to will not additionally increase entire network-on-chip access delay.
(2)The present invention is in design process of hardware, and each shared buffer memory l2bank structures and traditional identical are only deposited
Storage space increases one times, and the number of shared buffer memory l2bank is reduced to original half, i.e., entire afterbody is slow at a high speed
The shared buffer memory capacity deposited and generic structure it is identical, for meeting the needs of data storage;But reduce the l2bank of half,
Also reduce the peripheral hardware expense of half, for example, the number of amplifier and decoder we reduce half, therefore it is hard
Part design area can also be reduced, and the quiescent dissipation of cache can also reduce.
(3)The into intermittent distribution of node comprising shared buffer memory l2bank and the node not comprising shared buffer memory l2bank,
During data interaction, the problem of can guaranteeing fairness, network congestion will not be brought.
Description of the drawings
Fig. 1 is common network-on-chip multi-core processor configuration diagram;
Fig. 2 is the structure diagram of the present invention.
Specific embodiment
Technical scheme of the present invention is described in further detail below in conjunction with the accompanying drawings, but protection scope of the present invention is not limited to
It is as described below.
As shown in Fig. 2, a kind of chip multi-core processor static framework based on shared buffer memory, it includes n node, and n is
Even number more than 0, the node include n/2 A classes node and n/2 B class node;The A classes node includes process cores
Heart core, local private cache L1 and router R;The B classes node includes processing core core, local privately owned high speed
Cache L1, router R and shared buffer memory l2bank;It is communicated between the node by router;The A class nodes
With the into intermittent distribution of B classes node;The capacity of the shared buffer memory l2bank is twice of the shared buffer memory in generic structure.
The node number is 16, including the B class nodes described in the A classes node described in 8 and 8.
The local private cache L1 includes instruction buffer and data buffer storage.
From figure 2 it can be seen that A classes node not comprising shared buffer memory l2bank and including shared buffer memory l2bankB classes
The distribution situation of node, shown in the figure 16 Node distributions are arranged for four rows four, no matter from laterally or longitudinal, each two
A B class node is all included between A class nodes, an A class node is all included between each two B class nodes;Here it is hereinbefore carry
The intermittent distribution arrived.
Claims (1)
1. a kind of chip multi-core processor based on shared buffer memory, which is characterized in that its static framework is:It includes n node,
N is the even number more than 0, and the node includes n/2 A classes node and n/2 B class node;The A classes node includes processing
Core core, local private cache L1 and router R;The B classes node includes processing core core, local privately owned height
Speed caching L1, router R and shared buffer memory l2bank;It is communicated between the node by router;The A class sections
Point and the into intermittent distribution of B classes node;The capacity of the shared buffer memory l2bank is two of the shared buffer memory in generic structure
Times;The node number is 16, including the B class nodes described in the A classes node described in 8 and 8;The local is privately owned
Cache L1 includes instruction buffer and data buffer storage.
Priority Applications (1)
Application Number | Priority Date | Filing Date | Title |
---|---|---|---|
CN201510302580.4A CN104866457B (en) | 2015-06-04 | 2015-06-04 | A kind of chip multi-core processor static framework based on shared buffer memory |
Applications Claiming Priority (1)
Application Number | Priority Date | Filing Date | Title |
---|---|---|---|
CN201510302580.4A CN104866457B (en) | 2015-06-04 | 2015-06-04 | A kind of chip multi-core processor static framework based on shared buffer memory |
Publications (2)
Publication Number | Publication Date |
---|---|
CN104866457A CN104866457A (en) | 2015-08-26 |
CN104866457B true CN104866457B (en) | 2018-06-15 |
Family
ID=53912297
Family Applications (1)
Application Number | Title | Priority Date | Filing Date |
---|---|---|---|
CN201510302580.4A Expired - Fee Related CN104866457B (en) | 2015-06-04 | 2015-06-04 | A kind of chip multi-core processor static framework based on shared buffer memory |
Country Status (1)
Country | Link |
---|---|
CN (1) | CN104866457B (en) |
Families Citing this family (1)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
CN107870871B (en) * | 2016-09-23 | 2021-08-20 | 华为技术有限公司 | Method and device for allocating cache |
Family Cites Families (5)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
CN101510191B (en) * | 2009-03-26 | 2010-10-06 | 浙江大学 | Implementing method of multi-core system structure with buffer window |
CN101706755A (en) * | 2009-11-24 | 2010-05-12 | 中国科学技术大学苏州研究院 | Caching collaboration system of on-chip multi-core processor and cooperative processing method thereof |
CN102103568B (en) * | 2011-01-30 | 2012-10-10 | 中国科学院计算技术研究所 | Method for realizing cache coherence protocol of chip multiprocessor (CMP) system |
CN102270180B (en) * | 2011-08-09 | 2014-04-02 | 清华大学 | Multicore processor cache and management method thereof |
US9201837B2 (en) * | 2013-03-13 | 2015-12-01 | Futurewei Technologies, Inc. | Disaggregated server architecture for data centers |
-
2015
- 2015-06-04 CN CN201510302580.4A patent/CN104866457B/en not_active Expired - Fee Related
Non-Patent Citations (2)
Title |
---|
Fast Hierarchical Cache Directory:A Scalable Cache Organization for Large-scale CMP;Chongmin Li等;《2010 Fifth IEEE International Conference on Networking,Architecture,and Storage》;20101231;第367-376页 * |
共享高速缓存多核处理器的关键技术研究;杜建军;《中国博士学位论文全文数据库(信息科技辑)》;20111215;摘要、正文第25-28页 * |
Also Published As
Publication number | Publication date |
---|---|
CN104866457A (en) | 2015-08-26 |
Similar Documents
Publication | Publication Date | Title |
---|---|---|
Alian et al. | Application-transparent near-memory processing architecture with memory channel network | |
US11748535B2 (en) | System and method to generate a network-on-chip (NoC) description using incremental topology synthesis | |
US10445287B2 (en) | Circuit switch pre-reservation in an on-chip network | |
US20150347019A1 (en) | Systems and methods for segmenting data structures in a memory system | |
JP6522663B2 (en) | Interconnection system and method using hybrid memory cube link | |
US10725957B1 (en) | Uniform memory access architecture | |
KR20160074390A (en) | Low power entry in a shared memory link | |
US20100274975A1 (en) | Forming Multiprocessor Systems Using Dual Processors | |
KR20150091158A (en) | Method and relevant device for determining mode of managing shared virtual memory page | |
EP3217616B1 (en) | Memory access method and multi-processor system | |
CN106951390B (en) | NUMA system construction method capable of reducing cross-node memory access delay | |
Alian et al. | Netdimm: Low-latency near-memory network interface architecture | |
US9632958B2 (en) | System for migrating stash transactions | |
US9812186B2 (en) | Reducing latency in an expanded memory system | |
CN106250348A (en) | A kind of heterogeneous polynuclear framework buffer memory management method based on GPU memory access characteristic | |
US20150134868A1 (en) | Socket interposer having a multi-modal i/o interface | |
CN104102549A (en) | Method, device and chip for realizing mutual exclusion operation of multiple threads | |
EP2652636B1 (en) | Split traffic routing in a distributed shared memory multiprocessor | |
US20160134036A1 (en) | Signal integrity in mutli-junction topologies | |
US7649845B2 (en) | Handling hot spots in interconnection networks | |
CN109194721A (en) | A kind of asynchronous RDMA communication dynamic memory management method and system | |
US8671232B1 (en) | System and method for dynamically migrating stash transactions | |
US20140211662A1 (en) | Network Topologies for Energy Efficient Networks | |
CN104866457B (en) | A kind of chip multi-core processor static framework based on shared buffer memory | |
Li et al. | A compact low-power eDRAM-based NoC buffer |
Legal Events
Date | Code | Title | Description |
---|---|---|---|
C06 | Publication | ||
PB01 | Publication | ||
EXSB | Decision made by sipo to initiate substantive examination | ||
SE01 | Entry into force of request for substantive examination | ||
GR01 | Patent grant | ||
CF01 | Termination of patent right due to non-payment of annual fee | ||
CF01 | Termination of patent right due to non-payment of annual fee |
Granted publication date: 20180615 |