CN112130848A - Bandwidth-aware loop tiling optimization technique for scratchpad memory - Google Patents
- Publication number
- CN112130848A (application CN202011013688.9A)
- Authority
- CN
- China
- Prior art keywords
- data
- access
- memory
- dma
- mode
- Prior art date
- Legal status (The legal status is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the status listed.)
- Granted
Classifications
-
- G—PHYSICS
- G06—COMPUTING; CALCULATING OR COUNTING
- G06F—ELECTRIC DIGITAL DATA PROCESSING
- G06F8/00—Arrangements for software engineering
- G06F8/40—Transformation of program code
- G06F8/41—Compilation
-
- G—PHYSICS
- G06—COMPUTING; CALCULATING OR COUNTING
- G06F—ELECTRIC DIGITAL DATA PROCESSING
- G06F8/00—Arrangements for software engineering
- G06F8/30—Creation or generation of source code
- G06F8/35—Creation or generation of source code model driven
Landscapes
- Engineering & Computer Science (AREA)
- General Engineering & Computer Science (AREA)
- Theoretical Computer Science (AREA)
- Software Systems (AREA)
- Physics & Mathematics (AREA)
- General Physics & Mathematics (AREA)
- Memory System Of A Hierarchy Structure (AREA)
- Devices For Executing Special Programs (AREA)
Abstract
Description
Claims (10)
Priority Applications (1)
Application Number | Priority Date | Filing Date | Title |
---|---|---|---|
CN202011013688.9A CN112130848B (zh) | 2020-09-24 | 2020-09-24 | Bandwidth-aware loop tiling optimization method for scratchpad memory, compilation system, device, and storage medium |
Applications Claiming Priority (1)
Application Number | Priority Date | Filing Date | Title |
---|---|---|---|
CN202011013688.9A CN112130848B (zh) | 2020-09-24 | 2020-09-24 | Bandwidth-aware loop tiling optimization method for scratchpad memory, compilation system, device, and storage medium |
Publications (2)
Publication Number | Publication Date |
---|---|
CN112130848A (zh) | 2020-12-25 |
CN112130848B (zh) | 2022-06-14 |
Family
ID=73839587
Family Applications (1)
Application Number | Title | Priority Date | Filing Date |
---|---|---|---|
CN202011013688.9A (Active) CN112130848B (zh) | Bandwidth-aware loop tiling optimization method for scratchpad memory, compilation system, device, and storage medium | 2020-09-24 | 2020-09-24 |
Country Status (1)
Country | Link |
---|---|
CN (1) | CN112130848B (zh) |
Cited By (2)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
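CN112965810A (zh) * | 2021-01-27 | 2021-06-15 | 合肥大多数信息科技有限公司 | Multi-kernel browser data integration method based on a shared network channel |
CN117312330A (zh) * | 2023-11-29 | 2023-12-29 | 中国人民解放军国防科技大学 | Vector data gather method, apparatus, and computer device based on scratchpad memory |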
Citations (6)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
US20100262973A1 (en) * | 2009-04-09 | 2010-10-14 | Rolf Ernst | Method For Operating a Multiprocessor Computer System |
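CN101937343A (zh) * | 2010-09-17 | 2011-01-05 | 上海交通大学 | Method for implementing a back-end translation framework for a heterogeneous multi-core virtual execution environment |
CN102929580A (zh) * | 2012-11-06 | 2013-02-13 | 无锡江南计算技术研究所 | Tiling method and apparatus for multi-reference array accesses |
CN103226487A (zh) * | 2013-04-25 | 2013-07-31 | 中国人民解放军信息工程大学 | Data distribution and locality optimization method for heterogeneous many-core multi-level memory structures |
CN105138335A (zh) * | 2015-08-28 | 2015-12-09 | 牟永敏 | Method and apparatus for extracting function call paths based on a control flow graph |
CN110187988A (zh) * | 2019-06-06 | 2019-08-30 | 中国科学技术大学 | Static function call graph construction method suitable for virtual functions and function pointers |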
Cited By (4)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
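CN112965810A (zh) * | 2021-01-27 | 2021-06-15 | 合肥大多数信息科技有限公司 | Multi-kernel browser data integration method based on a shared network channel |
CN112965810B (zh) * | 2021-01-27 | 2022-06-24 | 合肥大多数信息科技有限公司 | Multi-kernel browser data integration method based on a shared network channel |
CN117312330A (zh) * | 2023-11-29 | 2023-12-29 | 中国人民解放军国防科技大学 | Vector data gather method, apparatus, and computer device based on scratchpad memory |
CN117312330B (zh) * | 2023-11-29 | 2024-02-09 | 中国人民解放军国防科技大学 | Vector data gather method, apparatus, and computer device based on scratchpad memory |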
Also Published As
Publication number | Publication date |
---|---|
CN112130848B (zh) | 2022-06-14 |
Similar Documents
Publication | Publication Date | Title |
---|---|---|
US9798528B2 (en) | | Software solution for cooperative memory-side and processor-side data prefetching |
Rotem et al. | | Glow: Graph lowering compiler techniques for neural networks |
Talati et al. | | Prodigy: Improving the memory latency of data-indirect irregular workloads using hardware-software co-design |
Wahib et al. | | Scalable kernel fusion for memory-bound GPU applications |
US8180964B1 (en) | | Optimization of cache configuration for application design |
KR101559090B1 (ko) | | Automatic kernel migration for heterogeneous cores |
KR101573586B1 (ko) | | Systems and methods for compiler-based vectorization of non-leaf code |
US8949532B1 (en) | | Automatic generation of cache-optimized code |
Prasad et al. | | Automatic compilation of MATLAB programs for synergistic execution on heterogeneous processors |
White et al. | | Timing analysis for data and wrap-around fill caches |
Piccoli et al. | | Compiler support for selective page migration in NUMA architectures |
Soliman et al. | | WCET-driven dynamic data scratchpad management with compiler-directed prefetching |
CN112130848B (zh) | | Bandwidth-aware loop tiling optimization method for scratchpad memory, compilation system, device, and storage medium |
Chen et al. | | Locality analysis through static parallel sampling |
Stawinoga et al. | | Predictable thread coarsening |
Das et al. | | Index array flattening through program transformation |
Neves et al. | | Compiler-assisted data streaming for regular code structures |
Wu et al. | | Bandwidth-aware loop tiling for DMA-supported scratchpad memory |
Madsen et al. | | Towards a streaming model for nested data parallelism |
Calvert | | Parallelisation of Java for graphics processors |
Chakraborty et al. | | Integrating software caches with scratch pad memory |
Soliman | | Automated compilation framework for scratchpad-based real-time systems |
Li et al. | | FreshBreeze: A data flow approach for meeting DDDAS challenges |
Malhotra et al. | | Library-based prefetching for pointer-intensive applications |
Yu et al. | | Hierarchical Read/Write Analysis for Pointer-Based OpenCL Programs on RRAM |
Legal Events
Code | Title | Description |
---|---|---|
PB01 | Publication | |
SE01 | Entry into force of request for substantive examination | |
CB03 | Change of inventor or designer information | Inventor after: Wu Mingchuan, Liu Ying, Cui Huimin, Wei Qingfu, Li Quanfeng, Li Limin, Lv Fang, Feng Xiaobing; Inventor before: Wu Mingchuan, Liu Ying, Cui Huimin, Wei Qingfu, Li Quanfeng, Li Limin, Lv Fang, Feng Xiaobing |
GR01 | Patent grant | |
TR01 | Transfer of patent right | Effective date of registration: 2023-12-21; Address after: Room 1305, 13th Floor, No.1 Zhongguancun Street, Haidian District, Beijing, 100086; Patentee after: Zhongke Jiahe (Beijing) Technology Co., Ltd.; Address before: 100190 No. 6 South Road, Zhongguancun Academy of Sciences, Beijing, Haidian District; Patentee before: Institute of Computing Technology, Chinese Academy of Sciences |