JP4857274B2 - Optimizing layout of an application on a massively parallel supercomputer - Google Patents
Optimizing layout of an application on a massively parallel supercomputer
- Publication number
- JP4857274B2
- Authority
- JP
- Japan
- Prior art keywords
- software
- communication
- node
- software application
- nodes
- Prior art date
- Legal status (The legal status is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the status listed.)
- Expired - Fee Related
Classifications
- G—PHYSICS
- G06—COMPUTING OR CALCULATING; COUNTING
- G06F—ELECTRIC DIGITAL DATA PROCESSING
- G06F9/00—Arrangements for program control, e.g. control units
- G06F9/06—Arrangements for program control, e.g. control units using stored programs, i.e. using an internal store of processing equipment to receive or retain programs
- G06F9/46—Multiprogramming arrangements
- G06F9/50—Allocation of resources, e.g. of the central processing unit [CPU]
- G06F9/5061—Partitioning or combining of resources
- G06F9/5066—Algorithms for mapping a plurality of inter-dependent sub-tasks onto a plurality of physical CPUs
Landscapes
- Engineering & Computer Science (AREA)
- Software Systems (AREA)
- Theoretical Computer Science (AREA)
- Physics & Mathematics (AREA)
- General Engineering & Computer Science (AREA)
- General Physics & Mathematics (AREA)
- Multi Processors (AREA)
Applications Claiming Priority (3)
| Application Number | Priority Date | Filing Date | Title |
|---|---|---|---|
| US10/963,101 US8117288B2 (en) | 2004-10-12 | 2004-10-12 | Optimizing layout of an application on a massively parallel supercomputer |
| US10/963,101 | 2004-10-12 | ||
| PCT/US2005/036196 WO2006044258A1 (en) | 2004-10-12 | 2005-10-06 | Optimizing layout of an application on a massively parallel supercomputer |
Publications (3)
| Publication Number | Publication Date |
|---|---|
| JP2008516346A (ja) | 2008-05-15 |
| JP2008516346A5 | 2008-08-21 |
| JP4857274B2 (ja) | 2012-01-18 |
Family
ID=35677701
Family Applications (1)
| Application Number | Priority Date | Filing Date | Title |
|---|---|---|---|
| JP2007535843A Expired - Fee Related JP4857274B2 (ja) | 2004-10-12 | 2005-10-06 | Optimizing layout of an application on a massively parallel supercomputer |
Country Status (5)
| Country | Link |
|---|---|
| US (1) | US8117288B2 |
| EP (1) | EP1807758A1 |
| JP (1) | JP4857274B2 |
| CN (1) | CN100568183C |
| WO (1) | WO2006044258A1 |
Families Citing this family (82)
| Publication number | Priority date | Publication date | Assignee | Title |
|---|---|---|---|---|
| US7334044B1 (en) * | 1998-11-17 | 2008-02-19 | Burst.Com | Method for connection acceptance control and optimal multi-media content delivery over networks |
| US6850965B2 (en) * | 1998-11-17 | 2005-02-01 | Arthur Douglas Allen | Method for connection acceptance and rapid determination of optimal multi-media content delivery over network |
| US20060241928A1 (en) * | 2005-04-25 | 2006-10-26 | International Business Machines Corporation | Load balancing by spatial partitioning of interaction centers |
| US7840914B1 (en) | 2005-05-13 | 2010-11-23 | Massachusetts Institute Of Technology | Distributing computations in a parallel processing environment |
| US8756044B2 (en) * | 2005-05-31 | 2014-06-17 | The Mathworks, Inc. | Graphical partitioning for parallel execution of executable block diagram models |
| US8447580B2 (en) * | 2005-05-31 | 2013-05-21 | The Mathworks, Inc. | Modeling of a multiprocessor system |
| DE102005057697A1 (de) | 2005-12-02 | 2007-06-06 | Fraunhofer-Gesellschaft zur Förderung der angewandten Forschung e.V. | Method for computer-aided simulation of technical processes |
| US8516444B2 (en) | 2006-02-23 | 2013-08-20 | International Business Machines Corporation | Debugging a high performance computing program |
| US7644142B2 (en) * | 2006-05-04 | 2010-01-05 | Intel Corporation | Methods and apparatus to perform process placement for distributed applications |
| US8082289B2 (en) | 2006-06-13 | 2011-12-20 | Advanced Cluster Systems, Inc. | Cluster computing support for application programs |
| US8082546B2 (en) * | 2006-09-29 | 2011-12-20 | International Business Machines Corporation | Job scheduling to maximize use of reusable resources and minimize resource deallocation |
| US8181168B1 (en) | 2007-02-07 | 2012-05-15 | Tilera Corporation | Memory access assignment for parallel processing architectures |
| US9330230B2 (en) * | 2007-04-19 | 2016-05-03 | International Business Machines Corporation | Validating a cabling topology in a distributed computing system |
| US20080288957A1 (en) * | 2007-05-18 | 2008-11-20 | International Business Machines Corporation | Method and system for optimizing communication in mpi programs for an execution environment |
| US8959516B2 (en) * | 2007-07-30 | 2015-02-17 | International Business Machines Corporation | Methods and systems for coordinated financial transactions in distributed and parallel environments |
| US20090064166A1 (en) * | 2007-08-28 | 2009-03-05 | Arimilli Lakshminarayana B | System and Method for Hardware Based Dynamic Load Balancing of Message Passing Interface Tasks |
| US8108876B2 (en) * | 2007-08-28 | 2012-01-31 | International Business Machines Corporation | Modifying an operation of one or more processors executing message passing interface tasks |
| US8127300B2 (en) * | 2007-08-28 | 2012-02-28 | International Business Machines Corporation | Hardware based dynamic load balancing of message passing interface tasks |
| US8312464B2 (en) * | 2007-08-28 | 2012-11-13 | International Business Machines Corporation | Hardware based dynamic load balancing of message passing interface tasks by modifying tasks |
| US8234652B2 (en) * | 2007-08-28 | 2012-07-31 | International Business Machines Corporation | Performing setup operations for receiving different amounts of data while processors are performing message passing interface tasks |
| US20090083614A1 (en) * | 2007-09-26 | 2009-03-26 | Xerox Corporation | System and method for optimizing information display in spreadsheets and tables |
| US8874722B2 (en) * | 2007-09-28 | 2014-10-28 | International Business Machines Corporation | Interactive tool for visualizing performance data in real-time to enable adaptive performance optimization and feedback |
| US8443287B2 (en) * | 2007-09-28 | 2013-05-14 | International Business Machines Corporation | Interactive tool for visualizing performance data in real-time to enable adaptive performance optimization and feedback |
| US20090158276A1 (en) * | 2007-12-12 | 2009-06-18 | Eric Lawrence Barsness | Dynamic distribution of nodes on a multi-node computer system |
| US8589490B2 (en) * | 2008-01-16 | 2013-11-19 | Janos Tapolcai | System, method, and computer program for solving mixed integer programs with peer-to-peer applications |
| US8296120B2 (en) * | 2008-06-20 | 2012-10-23 | Utah State University | FPGA simulated annealing accelerator |
| US8560277B2 (en) * | 2008-10-03 | 2013-10-15 | International Business Machines Corporation | Creating a load balanced spatial partitioning of a structured, diffusing system of particles |
| US8549092B2 (en) | 2009-02-19 | 2013-10-01 | Micron Technology, Inc. | Memory network methods, apparatus, and systems |
| US8874694B2 (en) * | 2009-08-18 | 2014-10-28 | Facebook, Inc. | Adaptive packaging of network resources |
| JP5577745B2 (ja) * | 2010-02-25 | 2014-08-27 | 日本電気株式会社 | Cluster system, process placement method, and program |
| US8386403B2 (en) * | 2010-03-02 | 2013-02-26 | Empire Technology Development Llc | Distributed-type Markov chain Monte Carlo |
| FR2960369B1 (fr) * | 2010-05-20 | 2013-03-01 | Bull Sas | Method for optimizing routing in a cluster comprising static communication links, and computer program implementing the method |
| US10635062B2 (en) | 2010-06-29 | 2020-04-28 | International Business Machines Corporation | Systems and methods for highly parallel processing of parameterized simulations |
| US8839214B2 (en) | 2010-06-30 | 2014-09-16 | Microsoft Corporation | Indexable type transformations |
| JP5429382B2 (ja) * | 2010-08-10 | 2014-02-26 | 富士通株式会社 | Job management device and job management method |
| US9569398B2 (en) * | 2010-09-28 | 2017-02-14 | International Business Machines Corporation | Routing data communications packets in a parallel computer |
| US8909716B2 (en) | 2010-09-28 | 2014-12-09 | International Business Machines Corporation | Administering truncated receive functions in a parallel messaging interface |
| CN102667710B (zh) * | 2010-10-21 | 2014-09-03 | 北京华金瑞清生物医药技术有限公司 | Method and tool for identifying modular structures in complex networks |
| US8769542B2 (en) * | 2010-10-26 | 2014-07-01 | Palo Alto Research Center Incorporated | System for adaptive lot sizing in cellular manufacturing for balancing workloads across multiple cells using split-then-merge operations and earliest completion route algorithm |
| US9052974B2 (en) | 2010-11-05 | 2015-06-09 | International Business Machines Corporation | Fencing data transfers in a parallel active messaging interface of a parallel computer |
| US9069631B2 (en) | 2010-11-05 | 2015-06-30 | International Business Machines Corporation | Fencing data transfers in a parallel active messaging interface of a parallel computer |
| US9075759B2 (en) | 2010-11-05 | 2015-07-07 | International Business Machines Corporation | Fencing network direct memory access data transfers in a parallel active messaging interface of a parallel computer |
| US8527672B2 (en) | 2010-11-05 | 2013-09-03 | International Business Machines Corporation | Fencing direct memory access data transfers in a parallel active messaging interface of a parallel computer |
| US8490112B2 (en) | 2010-12-03 | 2013-07-16 | International Business Machines Corporation | Data communications for a collective operation in a parallel active messaging interface of a parallel computer |
| US8484658B2 (en) | 2010-12-03 | 2013-07-09 | International Business Machines Corporation | Data communications in a parallel active messaging interface of a parallel computer |
| US8572629B2 (en) | 2010-12-09 | 2013-10-29 | International Business Machines Corporation | Data communications in a parallel active messaging interface of a parallel computer |
| US8650262B2 (en) | 2010-12-09 | 2014-02-11 | International Business Machines Corporation | Endpoint-based parallel data processing in a parallel active messaging interface of a parallel computer |
| US8775531B2 (en) | 2011-01-06 | 2014-07-08 | International Business Machines Corporation | Completion processing for data communications instructions |
| US8732229B2 (en) | 2011-01-06 | 2014-05-20 | International Business Machines Corporation | Completion processing for data communications instructions |
| US8892850B2 (en) | 2011-01-17 | 2014-11-18 | International Business Machines Corporation | Endpoint-based parallel data processing with non-blocking collective instructions in a parallel active messaging interface of a parallel computer |
| US8584141B2 (en) | 2011-01-17 | 2013-11-12 | International Business Machines Corporation | Data communications in a parallel active messaging interface of a parallel computer |
| US8762536B2 (en) | 2011-01-31 | 2014-06-24 | Cray Inc. | Compact node ordered application placement in a multiprocessor computer |
| US8825983B2 (en) | 2011-02-15 | 2014-09-02 | International Business Machines Corporation | Data communications in a parallel active messaging interface of a parallel computer |
| US8904398B2 (en) * | 2011-03-31 | 2014-12-02 | International Business Machines Corporation | Hierarchical task mapping |
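| JP2012243224A (ja) * | 2011-05-23 | 2012-12-10 | Fujitsu Ltd | Process placement device, process placement method, and process placement program |
| JP5724626B2 (ja) * | 2011-05-23 | 2015-05-27 | 富士通株式会社 | Process placement device, process placement method, and process placement program |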
| US20120331153A1 (en) * | 2011-06-22 | 2012-12-27 | International Business Machines Corporation | Establishing A Data Communications Connection Between A Lightweight Kernel In A Compute Node Of A Parallel Computer And An Input-Output ('I/O') Node Of The Parallel Computer |
| US9262201B2 (en) | 2011-07-13 | 2016-02-16 | International Business Machines Corporation | Performing collective operations in a distributed processing system |
| US8528004B2 (en) | 2011-11-07 | 2013-09-03 | International Business Machines Corporation | Internode data communications in a parallel computer |
| US8495654B2 (en) | 2011-11-07 | 2013-07-23 | International Business Machines Corporation | Intranode data communications in a parallel computer |
| US8732725B2 (en) | 2011-11-09 | 2014-05-20 | International Business Machines Corporation | Managing internode data communications for an uninitialized process in a parallel computer |
| US8819653B2 (en) * | 2012-01-30 | 2014-08-26 | Cisco Technology, Inc. | Automated improvement of executable applications based on evaluating independent execution heuristics |
| JP5853794B2 (ja) * | 2012-03-19 | 2016-02-09 | 富士通株式会社 | Transposition device, transposition method, and transposition program |
| US9215138B2 (en) * | 2012-12-06 | 2015-12-15 | International Business Machines Corporation | Determining a system configuration for performing a collective operation on a parallel computer |
| US9063916B2 (en) * | 2013-02-27 | 2015-06-23 | Oracle International Corporation | Compact encoding of node locations |
| JP6499388B2 (ja) | 2013-08-14 | 2019-04-10 | 富士通株式会社 | Parallel computer system, control program for management device, and control method for parallel computer system |
| US9348651B2 (en) * | 2013-12-05 | 2016-05-24 | International Business Machines Corporation | Constructing a logical tree topology in a parallel computer |
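| JP6413634B2 (ja) | 2014-10-30 | 2018-10-31 | 富士通株式会社 | Job management program, job management method, and job management device |
| JP6492977B2 (ja) | 2015-06-01 | 2019-04-03 | 富士通株式会社 | Parallel computing device, parallel computing system, node allocation program, and node allocation method |
| CN105938427B (zh) * | 2016-04-13 | 2018-06-29 | 中国科学院重庆绿色智能技术研究院 | Method for improving WRF parallel computing efficiency |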
| US10996989B2 (en) * | 2016-06-13 | 2021-05-04 | International Business Machines Corporation | Flexible optimized data handling in systems with multiple memories |
| US10318668B2 (en) * | 2016-06-15 | 2019-06-11 | International Business Machines Corporation | Automatic decomposition of simulation model |
| US10372507B2 (en) * | 2016-12-31 | 2019-08-06 | Intel Corporation | Compute engine architecture to support data-parallel loops with reduction operations |
| US10560351B1 (en) * | 2017-12-28 | 2020-02-11 | Architecture Technology Corporation | Network monitoring tool for supercomputers |
| US11651232B2 (en) | 2018-08-01 | 2023-05-16 | International Business Machines Corporation | Monte Carlo Markov chain based quantum program optimization |
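| CN109635238B (zh) * | 2018-12-07 | 2023-08-29 | 北京字节跳动网络技术有限公司 | Matrix operation method, apparatus, device, and readable medium |
| CN112433853B (zh) * | 2020-11-30 | 2023-04-28 | 西安交通大学 | Heterogeneity-aware data partitioning method for data-parallel applications on supercomputers |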
| GB2606684A (en) * | 2021-02-03 | 2022-11-23 | Xonai Ltd | Allocating computational tasks to computer hardware |
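| CN113517920A (zh) * | 2021-04-20 | 2021-10-19 | 东方红卫星移动通信有限公司 | Computation offloading method and system for simulated Internet-of-Things loads in an ultra-dense low-Earth-orbit constellation |
| CN116820772B (zh) * | 2023-07-03 | 2024-05-28 | 中山大学 | Parallel mesh reading method, apparatus, terminal device, and readable storage medium |
| CN119292891B (zh) * | 2024-12-12 | 2025-04-22 | 中国石油大学(华东) | Interpretable DeePMD suite performance system for emerging supercomputers |
| CN119996197A (zh) * | 2025-04-10 | 2025-05-13 | 西南科技大学 | Method for constructing a parallel communication mapping based on dual labeling |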
Family Cites Families (14)
| Publication number | Priority date | Publication date | Assignee | Title |
|---|---|---|---|---|
| US4983962A (en) * | 1987-09-14 | 1991-01-08 | Hammerstrom Daniel W | Neural-model, computational architecture employing broadcast hierarchy and hypergrid, point-to-point communication |
| US5202671A (en) * | 1989-10-24 | 1993-04-13 | International Business Machines Corporation | Pick function implementation in a parallel processing system |
| US5349682A (en) * | 1992-01-31 | 1994-09-20 | Parallel Pcs, Inc. | Dynamic fault-tolerant parallel processing system for performing an application function with increased efficiency using heterogeneous processors |
| US6021457A (en) * | 1995-09-28 | 2000-02-01 | Intel Corporation | Method and an apparatus for minimizing perturbation while monitoring parallel applications |
| JP3163237B2 (ja) * | 1995-09-28 | 2001-05-08 | 株式会社日立製作所 | Management device for parallel computer system |
| US6925642B1 (en) * | 1999-04-29 | 2005-08-02 | Hewlett-Packard Development Company, L.P. | Distributed computer network which spawns inter-node parallel processes based on resource availability |
| US6374403B1 (en) * | 1999-08-20 | 2002-04-16 | Hewlett-Packard Company | Programmatic method for reducing cost of control in parallel processes |
| US6438747B1 (en) * | 1999-08-20 | 2002-08-20 | Hewlett-Packard Company | Programmatic iteration scheduling for parallel processors |
| EP1381959A4 (en) * | 2001-02-24 | 2008-10-29 | Ibm | GLOBAL ARBORESCENT NETWORK FOR CALCULATION STRUCTURES |
| JP3980488B2 (ja) * | 2001-02-24 | 2007-09-26 | インターナショナル・ビジネス・マシーンズ・コーポレーション | Massively parallel computer system |
| US7299466B2 (en) * | 2001-12-20 | 2007-11-20 | Cadence Design Systems, Inc. | Mechanism for managing execution environments for aggregated processes |
| US7103628B2 (en) * | 2002-06-20 | 2006-09-05 | Jp Morgan Chase & Co. | System and method for dividing computations |
| US7376693B2 (en) * | 2002-02-08 | 2008-05-20 | Jp Morgan Chase & Company | System architecture for distributed computing and method of using the system |
| US7644142B2 (en) * | 2006-05-04 | 2010-01-05 | Intel Corporation | Methods and apparatus to perform process placement for distributed applications |
2004
- 2004-10-12 US US10/963,101 patent/US8117288B2/en not_active Expired - Fee Related
2005
- 2005-10-06 EP EP05809830A patent/EP1807758A1/en not_active Ceased
- 2005-10-06 CN CNB2005800347177A patent/CN100568183C/zh not_active Expired - Fee Related
- 2005-10-06 JP JP2007535843A patent/JP4857274B2/ja not_active Expired - Fee Related
- 2005-10-06 WO PCT/US2005/036196 patent/WO2006044258A1/en not_active Ceased
Also Published As
| Publication number | Publication date |
|---|---|
| EP1807758A1 (en) | 2007-07-18 |
| US8117288B2 (en) | 2012-02-14 |
| WO2006044258A1 (en) | 2006-04-27 |
| CN101048736A (zh) | 2007-10-03 |
| US20060101104A1 (en) | 2006-05-11 |
| CN100568183C (zh) | 2009-12-09 |
| JP2008516346A (ja) | 2008-05-15 |
Similar Documents
| Publication | Title | Publication Date |
|---|---|---|
| JP4857274B2 (ja) | Optimizing layout of an application on a massively parallel supercomputer | |
| US5583990A (en) | System for allocating messages between virtual channels to avoid deadlock and to optimize the amount of message traffic on each type of virtual channel | |
| Bhanot et al. | Optimizing task layout on the Blue Gene/L supercomputer | |
| Bhatele et al. | Mapping applications with collectives over sub-communicators on torus networks | |
| Kamil et al. | Communication requirements and interconnect optimization for high-end scientific applications | |
| Prisacari et al. | Bandwidth-optimal all-to-all exchanges in fat tree networks | |
| CN114880272B (zh) | 全局高度数顶点集合通信的优化方法及应用 | |
| Ashby et al. | The impact of global communication latency at extreme scales on Krylov methods | |
| US8031614B2 (en) | Method and apparatus for routing data in an inter-nodal communications lattice of a massively parallel computer system by dynamic global mapping of contended links | |
| Azimi et al. | FLEXIBLE AND ADAPTIVE ON-CHIP INTERCONNECT FOR TERA-SCALE ARCHITECTURES. | |
| Gupta et al. | Performance analysis of a synchronous, circuit-switched interconnection cached network | |
| US20040158663A1 (en) | Interconnect topology for a scalable distributed computer system | |
| Afsharpour et al. | Performance/energy aware task migration algorithm for many‐core chips | |
| Mackenzie et al. | Comparative modeling of network topologies and routing strategies in multicomputers | |
| Agarwal et al. | An algorithmic approach to datacenter cabling | |
| Modarressi et al. | A Reconfigurable On‐Chip Interconnection Network for Large Multicore Systems | |
| Rahman et al. | Routing performance enhancement in hierarchical torus network by link-selection algorithm | |
| Yin | High-Performance, Energy-Efficient, and Scalable Accelerator Design for Emerging Machine Learning Applications | |
| Rajamanickam et al. | Exploiting Geometric Partitioning in Task Mapping for Parallel Computers. | |
| Newaz | Optimizing Performance of Distributed Memory Workloads for High Performance Computing (HPC) and Data Center Networks | |
| Dubinski et al. | High performance commodity networking in a 512-cpu teraflop beowulf cluster for computational astrophysics | |
| Rahman et al. | A deadlock-free routing algorithm using minimum number of virtual channels and application mappings for Hierarchical Torus Network | |
| Oral et al. | Multicast performance analysis for high-speed torus networks | |
| Yu et al. | Virtual topologies for scalable resource management and contention attenuation in a global address space model on the cray xt5 | |
| Feer et al. | Task Mapping for Noncontiguous Allocations. |
Legal Events
| Date | Code | Title | Description |
|---|---|---|---|
| | A521 | Request for written amendment filed | Free format text: JAPANESE INTERMEDIATE CODE: A523; Effective date: 20080703 |
| | A621 | Written request for application examination | Free format text: JAPANESE INTERMEDIATE CODE: A621; Effective date: 20080703 |
| | A131 | Notification of reasons for refusal | Free format text: JAPANESE INTERMEDIATE CODE: A131; Effective date: 20110628 |
| | A521 | Request for written amendment filed | Free format text: JAPANESE INTERMEDIATE CODE: A523; Effective date: 20110927 |
| | TRDD | Decision of grant or rejection written | |
| | A01 | Written decision to grant a patent or to grant a registration (utility model) | Free format text: JAPANESE INTERMEDIATE CODE: A01; Effective date: 20111025 |
| | A01 | Written decision to grant a patent or to grant a registration (utility model) | Free format text: JAPANESE INTERMEDIATE CODE: A01 |
| | A61 | First payment of annual fees (during grant procedure) | Free format text: JAPANESE INTERMEDIATE CODE: A61; Effective date: 20111031 |
| | FPAY | Renewal fee payment (event date is renewal date of database) | Free format text: PAYMENT UNTIL: 20141104; Year of fee payment: 3 |
| | R150 | Certificate of patent or registration of utility model | Free format text: JAPANESE INTERMEDIATE CODE: R150 |
| | LAPS | Cancellation because of no payment of annual fees | |